GAAGCWCGATCAGCAGGCAGCTnTCCAGGGGAG
TGGAAAAGGAAAAGTGAGAATTACATTAGTAACAAAGAATGArc 
GAGACGGCTACTATGAAGCAGAATTTGGACAAGAACGCAGACCTTTGTTT^ 
GAAGTAAAAGAAGCTATTATTACAAGAATAAAGGCAGGAATCAATCCATTCAATGTCCCTGAA 
TTGTG ACCTCAATGTGGTGAG ACTGTGTTTT CAAGTTTTTCTC CTCTT CCTC CTGTTG

TCTCGAACCCAATTTATGACAACCGTGCTCCAAATACTGCAGAATTAAGGATTTC 
GGAGGAGATGAAATATTTCTACTTTGTGACAAAGTTCAGAAAGATGACATA 
AGGCATCTTTTCACAAGCTGATGTACACCGTCAAGTAGCCATTGTTT^ 

T AACAGTAAA^TG CAGTTG CGGAGACCTTCTGACCAGGAAGTT AG TGAATCT ATGG ATTTT AGAT ATCTG CCAGATGAAAAAGAT 
ACTTACGG CAATAAAGCAAAGAAACAAAAGACAACTCTG CTTTTC C CAGGATCACG T AG AAACAGGGTTTCGCCA

TGTTGACCAGGATGGTCTTGAACTCCTGACATCAGGTGATCCACCCACCTTG^ 
CTGAGAGACCAAGACCTGGTCTCCTCGGTTCAATTGGAGAAGGAAGATACTTCAAA 
GTTGTGAGAGAAATGCCTACAGGGGTTTCAAGTCAAGCAGAATCCTACT 

TCATGCCTCAATGGCACCTCTGCCTTCTTCAAGCTGGT CAATACAAACCCACTGA 
GTAGTTTTTCAACAAGGACACITCCTrCTAATTCGCAAGGT^

T CT AATGCTTGCATTT ACAACAATGCCGATG ACAT AGT CGGAATGGAAGCGTCAT CCATG C CAT CAG CAG ATTT AT ATGGT ATTTC 

TGATCCCAACATGCTGTCTAATTCTTCTGTGAATATGATGACAACCAGC^ 

TGAGCATGAATCrTXSAAAACCCCTCATGTAATTCAGTC 

ATGTCAGCAGGCGCCAATTCCAATACTACTGTTTTTGTTTCAC^^ 

CAGCATGATAAATGAGTCGG5GACCATCAAACAGTACT

GCAGTATGCAAAATGAGCAATTGAGTGACTCCTTTCCATA^ 
TTGATACCACCTATATAGATGCAGCATTTTGTATTTGTCTAACTGGGGATATAA 
TGAGAATATAATACTGTATTTGAGAATATAAAAAACITTrTTCAGGGAAGAA 
GAAGCTGTCATAAAAAGACAACTCAGAGGCCAGGCGCAGGNGCTC^ 
ATCACTTGAGACCAG

HUMAN SEQUENCE - CODING 

ATGGCCTC03GTGCGTATAACCCGTATATAGAGATAATTGAACAA 

G CG ATCAGCAGG CAGCATT CCAGGGGAG CAC AG C ACAGACAACAACCGAACAT AC C CTT CT ATC CAG ATT ATGAACT ATT ATGG AA 

AAGG AAAAGTG AGAATTACATT AGT AACAAAG AATG ACC CAT AT AAACCT CATC CT CATG ATTT AGTTGGAAAAGACTGCAGAG AC

GGCT ACT ATG AAG CAGAATTTGGACAAGAAC G CAG AC CTTTGTTTTT CCAAAATTTGGG T ATTCG ATGTGTGAAG AAAAAAG AAG T 
AAAAG AAG CT ATT ATT ACAAG AAT AAAGGCAGG AATCAATC CATT CAATG T C CCTG AAAAACAG CTG AATG AT ATTG AAGATTG TG 
ACCTCAATGTGGTGAGACTGTGTTTTCAACnTTTTCTCCCTGATGAACATGGT^ 
AACCCAATTTATGACAACCGTGCTCCAAATACTGCAGAATTAA 

AGATGAAATATTTCTACTTTGTGACAAAGTTCAGAAAGATGACATAGAAG

TCTTTTCACAAGCTGATGTACACC<3TCAAGTAGCCATTGTTTTCA 

G T AAAAATG CAGTTG CGG AG ACCTT CTG ACCAGG AAG TT AG TG AATCTATGG ATTTT AG AT ATCTGC CAC^TG AAAAAC^T ACTT A 

CGGCAATAAAGCAAAGAAACAAAAGACAACTCTGCTTTTCCAGAAACTGTGC 

ACCAGGATGGTCTTGAACTCCTGACATCAGGTGATCCACCCACCTTGGCCTCCC^ 

AGACCAAGACCTGGTCTCCTCGGTTCAATrGGAGAAGGAAGATAC^

GAGAGAAATGCCTACAGGGGTTTCAAGTCAAGCAGAATCCTACTA 

C CTCAATGGCAC CT CTGCCTTCTTCAAGCTGGTCAT CAGTGGCCC ACCCCACCCCACGCTCAGGCAATACAAACCCACTGAGTAGT 
TTTTCAACAACXACACTTCCTTCTAATTCGCAAGGTATCCCACCATTCCT 

TGCTTGCATTTACAACAATGCCGATGACATAGTCGC^AATGGAAGCGTCATCCATGCCATCAGCAGATT 
CCAACATGCTGTCTAATTGTTCTGTGAATATGATGACAACCAGCACT

ATGAATCTTGAAAACCCCTCATGTAATTCAGTGTTAGACCCAAGAGACTTGAGACAGCTCCATCAGATGTC 
AGCAGGCGCCAATTCCAATACTACTGTTTTTGTTTCACAATC^ 
TGATAAATCAGTCGGGACCATCAAACAGTACTAATCCAAACAGTCATGGT 
ATGCAAAATGAGCAATTGAGTGACTCCTTTCCATATGAATTTTTTCAAG 
Table 28

MOUSE NOMENCLATURE
ICSGNM ICSbp
Celera mCG21199

HUMAN NOMENCLATURE
HGNC ICSBP1
Celera hCG17046

MOUSE SEQUENCE - GENOMIC

GCAGGAGGATGOAAAGTTGGAGGGTAGCCTGAC^ 

TACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAAACAAA 

TGCCTTTAATCTCAGCACTCAGGAGGTAGAGGCTGG AGAGT^ 

AO^CCAGGGGTACCTAGAGAGACCCTCACCAAAACAAAACAAAACAAA^ 
Otfl&CC(^CGCCACCCCCAAATCAACATCAAACTCTCT

AGCGGACAGCACrGGAGCCTTCTGAAGCACTAACCTCTTTTAAA 

AAAGGGGCAAGCGCTCCTTGTGGGACACAGTCACCACITGTTGATG 

GTCACTTGCCACCAAGCCTGACACCTGATCTGCATCCCAGGAGCCACAG 

GCACCATGGCATGGGCTCCCTGCTAAACACGTATGCGTGCGCACACGCACATGC^ 
ATAGCGCACACATACIACAAAATTATTAATTAAAATGTAATTTAGATTCGGAGT

TCACCAC^TCTATGGCTGGGGTTCTGAGCTCAGTGTTTCAGAGGCCTCCAG^ 

CTCATTGGTACAGAATGGATTCGTCACACATrT ACCAGTTCAA CCCGAT AAAGTGTTTCGG CCTAG ACCTGAAG ATCATGA 

AAAGTACAGCGGGGACCCCCAGAGGCCTCATAGACTGTACATTGTCATAAGAATAGGAAGTACAAAGAGGC^ 

GATGGCTCAGCGGGTAAGAGCATTGACTGCTCTTCCGAAGGTCCTGAGTTCAAAT^ 

TGTAATGAGAGCTGACTCCCTCCTCTGGAGTGTCTGAAGACAGCTACA

ATAAAAGAGGCGTCCATACTGGGAAAAAGATACGATCAAGATGCTCG^ACT 
TCCCACCGGTGGATGCAAAGCTGAAAGTGGTTAAACCCTAGATAGGCATCCAGCGCCT 
AAGACCATGTCCAGCGCGTGCCTCAGGGGCACCGG^AGTTGGTCATGTGGTGGCATCT 
CTGACGCCACACAGCTTCAGCTCACAGACACACTTACTGAGGAAGCCGTTCTCATTAAAGTGTATG 

AGTAACITAATTTAAAAAAGGATTTCTTGTAAGCGATGTCAGTAATCTGG

TGTTGTACTAAGTTTTTCCTGTGCCTAAGCCGTCTTCCTATTAGAGATGTAGACITGCT 
GGGATACATCGTTCTGAATTCATTGAAATTTCTGAGTGTGTGGCTCCCCTGGCCCTGCCCGAAATA 
GGGTTTGTGTGTGTCATAAGCACACTGGTATGAGTCATGTGTGGTTTATATTAAACACACAAACGTGTATTCTTTACAG 
TGTAGCCCAGGCTGGCCTCTAACTCATTTAGTTCAGGATAACCTGAATCCCAGAGC^ 

ACAGGTGTGTG CCAACAAACC CAGGGAAATCTACGTTTTCAGT AAATTGATATCTCTTGGCAAT AGTT CCC^

CATGTTTTATGTAGACTTAGAATCCATTTACTTGGAAGAGTTGTTCTTT^ 

CAAGCGACCTCCAGCCCCTGCAGGCAGGAGTTGCTGAATGCTTCCTGCGTTTCCCCCTTATCTTAACTCT 
ATCTTGCAGGCOTGCTCTGAACATGACAGCTAGCACACrCTGGGTATTTAA 

AGGTGAAAG CAGTTT ATTT AAG CT ACAAAT AAACACAGGTGGGT CTC AG ATTTGGCTTG TTTC TT CT CTGAAG CAATGGATGG GGT 

ATTGTTATCACCTAGTCAACTGTTGCCCGAGTCCACCTTGAGCAA

AGAACAGGGC AAAGGCCACCT CT ATAGCACIAG ATAAGTTTCGCCACAGTC CTGAATGGCTTGCTAAGTCC CAATTAGCTAGCTTG A 
AGTAATCTCACTTTTATTGGTAGGAGGTGCTTCCTGCATGAAAGAAAAATCCACTTGTGCATGAGT^ 
CACTCACACACGTGCACATACCACACACATGTACTCGCACACAACATGCACACCCGCGCACACACACATAGATGCAC^ 
ACACACACACACACACACAGAGACACACAGGAGGACTCACATGCACACACTCAGATGGATAGAC 

TGCACACGCACACAGACTCATGACTGTATGCATATGGTCCTGAGCATCCT

ATTGATGACTCTTCCCTTATCTTCTGGGATCGACTATAGCCTTTGACATTCT 

TGCACCCGCAGCCTGGGCTCAAGCCTCCACTCTGGCCCTAGGAGTATTCTGAGCCTGGCTGAATG 

TGTGTGGCTTTGTGTCCTGTCAATTTACGTGTCCAGCAGGATGGCTTTCCCTTGGA 

GAGCTCTCCTGAGCATCCCAGCAGCTAGCCTGACTGCTTTTC^GTGCACACCCACGAGACTCTGAGGCCTCCGTCCGCTTTGTTC
CCT CAGTCTGGTGTCAG AGGTCCATCCTGTGCTGTGGCTGGATG CAG AATAGCCCGTTCTGATTACTG CAGTGCCCTCCACAGTGG 
GGTTGCTAATGCTTGTGTGTGTCCTAGTCACCATTCTATTGCTGTGAAGAGACACTGTGGCCACCACAACTCTTATGAGAGTTTAA 
CTGGGGCTGGCTTGCAGTTCAGAGGTTTAGCCGATTTTCATGATGGCGGGAAG 

GACCTACATCCTGACAGTGGGCACAGGGAGAGAGAGAGACAGAGAGACAGAGACAGAGAGAGACAAGAC 

CAGAGAGGAGACACACACACACACACACACACACACACAGATGTGGAGTCTGG

CTAACAAGGCCATACCTCCCAATCCTTCCGATCCTTTCyUUlTACTCACCCCCITGCTACTAA^ 

GGGCCATCCTTATTCAAACCACCACAGTGGTTGTGCTATTTGAGAAATTTGTGATTTTTTTCCC 

TTTTTTTTCTTTTTTCTTGAGACAAGGTCTTATCCAGCCCAGCTTG 

TCTGATCTCCCTGCCCGTCTCCTGTGTGCTAGGATTACAGGCATGTACCTGCATGCCTGGCTTAG^ 

GGGCTTTGTCTCTAGCCGAGAGTTTTACAGCTACAG

TGTGGACGTCCTGCCTTCACrTCTCTGGGGTGTGCTTAGCAGCTGGCCTGCCTGACTTAGA 
GCTTTAAAGGAATGGCAGTCTCATCCITCACX5TGGTTCC 
AACATCACCHTATAGTCAGCTCCCCTGGGAGATTTGGACCATTT^ 
rrCCTTTCCTTTCCTTTCCTTTCCTTCCCTTCCCTTCCCTTC 

CCCTTCCCTTCTTTTO TTTT^^

TCCTGGAACTCACTCTGTAGACCAGGCTGG CCTTGAACT CAGAAATTCG CCTGC CTCTGCCTCCCAAATGCTGGGATTAAAGG CGT 

GCACCACCAAGCCCGGCAGAT7TGGACCATTITCATCAGGACTCTAGGCTGTTCTGAGTG 

CCCCTCCATTTirCCTTCCCTCCCTCCTCTTTCCCCTCCTCCCTCTTTCCTTT 

TTGCTTGCTGGAGACAAGTCTGTX5CACTCAGGGCTTGGTAAGTTAAG

AGTTGG CATTTG1TTTGTGTCCTTTTCCT CACAAAGATATTTTG AGAT CG TT AATGACC CCC AT AGTTGGTCAG CTTT GTG CAAAT 
AAATAATTTCXXSGGGTGGGGCAGAAA^CCTGAACACTTG CTTGGC CT AGAGGGAT AA CC CACG GATGCACAG AGGCTG CGCTAG AG 
ACAGACAGTGGCTATGAAGTCTTCAGACTCTGGCCTTTC 

GAGCCACAG CTGTCT CTG CATCGTG CT CCGAGG AGGT AGTG TT CAAACACAT ACGAAGTGGCTTCTGGG ATTTG ACCAG ACAG AAG 
TGATTCTTCTTGGG ACGGGAGGAACCAG CACGCGCTGG'rrrri'C'ri' CTTT CACGGTGCCGGCTTCGG AACTCAGCGCCCCTGCTTG

GGTTAAACATGGCCTCTACCGCGGAGCTGTACTACAGCTCAGTT^^ 

GTGTTCTGAGTAGAAGTCCrTTCTTTTTCTTTTTGA^ 

GATCACAGGCTTGCCCCGCCACCTTGGCCTAGCCAATCCGGGTTCrGGAATCCCA 
ATCCTAGTGCTGGGATCCCAGGTGTTGCCCCGCCCACCTCTGGTTTAGCTGCTCTTTCT
GCTCCTCCCACTACTCAGAATACCAATTCAiC^ 

GT CTTCAG ACACTCCJVGAAG AGGGCG TCAG AT CTTGTT ACAG ATGGTTGTC CATGTGGTTG CTGGGATTTGAACT CCGGA 

CCTTCGGAAGAGOVGTCGGGTGCTCTTACCCACTGAGCCATCTCA 

CACAACCAATTCACACTTTTATCAGTTTTATTTATT 
GTGTGTATATGTGCXnt3TGCTTCIX5TGCATGTGTGCTGTGCATATGCGGGTATC

TGTGCTGGGAACCCCATCAGACTGCAGGTGGTAGACCTGCATAGATGAGAAAGGC^^ 

GGCACACATAGAGGGTAACATGCTCACTGGGGAGCAACCTCCCTACACAGGAAAGGGGT^ 

GGGGACTTTGATC^CGAACAAAGAGGAGTAAGTTGGACTACACACTC^ 

CATATACACACATACACACATGCACGCACACATATACATACATATACACACACACATGC^ 
ATT AAAGCATG AAATG T AATTTTT AAAAGAT AT GAGTGTGGCTTGGAATCTGGGTTrrrr AAAATT AAGGT CCATGAAGGCCGG CAT

AGGAAAGAGGTCAGAAGCTGGATCTTGAGGAGAAGGAGGTGGTAGGAGAACAATGGCTCCCCAAAG 

AGAACCAAGAATGATGTCAGGGACTGATACTCCTCATCTGACGTTGACTAGAGCAACGACTCT 

CAGCCACGAAACCCTTAGTGCIAGTGGCCCTCAACCT 

ACCATGACATTATTTTGCTGCTACCTCATAACTCTAATTTTGCT 
CTTAGGCAACTCTGTGAAAGGGGTCCCGACCTACAGGTTGAGAACCGCT

GGACATGAATGCACATTTGCTGGCTTTGGAGATGAACGAGCTCACAAGCCAAGGAT^ 

TTCTCCTCAGAAGGACCTGCTGAGACAAGTGGGTGAGGAGTGCCT 

CAGTTTTTAGGTGACAGGAAGGTGTGTGTGTGTGTGTGTGTGTGAGAGAGA 

AGGGGACAGGGACAGAGGGACAGGGACAGAGAGAGTGAGACAGGGACAGAG 

AAAAGCCTGGGTCACAGCAACAGCAGCCTGGTCTCTCTGGGTGGGGGTGTGCTCT

CACAAAACATCATGGCCAAGAAGCAAGTTGGGGAGGAAAGGGTTTATTC^ 

AGTCAGGATTGGAACTCAAGCAGGTCAGG AAGCAGGAG CTGATGCCGAGGCCACGGGGGG ATG CT ACTTACTGGCTTGCTTCCCCT 

GGCTTGCTCAGCCTGCTITTGATATAGAACCCAGGAGTACCAGCC 

TGATCACTAATTGAGAAAATGCCTTACAGCTGGATCTCATGGAGGCATTTCCT 

GTTGTGTCAGGTTGTCCCAGAGAACCAGTCAGTACACTGGGTCrcGAGGACATGTTTAATTGCCT

GGTGATGGACTCTCTAGAACACTTCGGGGCCTCTTCTC 

TAGCTCCACCCCCGACATATGACTTATGCCAGGAGATCCTACACTGAATCTCTCCTGAGTGG 

ACACGTGGCTCAGATGAGACACCACCTTCATAGCTTCACCCTCAA 

GGTGGCTTGGGTTCTCTTAGGTCCTGAGTGGAGCTATTTCAGACTTCTAG^ 

CCGATGTATATGGCTTCAGGACAGTTGTCAGGCAGGCTGGGCTGGTC

TTTGATATTTGTAAGTAGGAAAATCTGCATACGCTTATAAGGTGCCTATTTGGGAAATGA 

TGTTGTAGTAATCTGAGCGTGTCTGACTTCTGGACTCTGCCCTTCACAGCAAACACTTATTAAAACACT 

GAATTTGAGCTGGACTATGGTGGC!ACACACCTTTAATCCCAGCCCTTGGGAGGCAGAGGCTAGCAGATCTCTG 

AGCCTGGTCTAGT^CGGGGACAGCCAGGGCTACAGGGAAACCCTGTCTTGGAAA^

AACAAATGAGCrTTTGAAAAGTAGTTTTCATTTGAAATTCCTTO 

TATCATTGAACTAATACCCGGTAATGTCTTAAGTGCCTTTCAGGTATACCACAATGCTCATCAGGATAAACTAGA 
CTGTTTGG AAAAT C CTTATTTG AATT CXJT ATTT ATTTG AAACCT AGC CG T AT ACTT CTTGT ATTTGTGG AC CAAT CAAAAATT CTT 
GTCGCGGCTCCAACGTTTTCCTAAGTATTTTAAAAGCCAGACGC^TGAOA^ 
GCCTGCAACGAAAGTCCCTCTCX3ACCOTCACCTAAGGCTCCCCGCTC



TTTCCCAGATTTCCTGCGCAATTTCAGGCGGTCATCCTCCATCCAGGAGGGCGC^ 
TGGGACTGGTCTTTGCTCTGAAACTCCAGCCTGAG 

AAACCTGAACGACACCCCGAGGATGATCCGTGCATCACCAGCCTCCTTGACC^ 
GGGGCAGCCCCCTCCCCCX3CCGCCCCCGGAGTAAAGAGAGAAAAGGACT

AAGAGAGCGCTTCAGAGAAGGCGGATTTGGCAGGCTGCGCTGATTGGGCCGCGCCAGC 
CGACCGTCGCTCTGCGCGCGGGACCC^CCTTCTCCCCCGCCCCATCTAT^ 

CGGGCGGTCCCGGAGGCGCGGGCAGCGTGGGAACCGGCGGCAGGTAGGACGCGATGGGGTGGGCGCGGGGGCCGGGGCTGTGGCTC 
TCCCACCTTCGCCCAAACGGAGTGCACAGCACGTTAGCAGGTGCTTGCTCTGGCTG 

CAAGAACAACAACAAAACCCAACCAAATGAACAAAC^

TAGATCTGCCAATCACCGCGCGGCAGCCCGGCTTCCATGCATGAACTATCCAAAGCTCGCGACTGGGCT 
GCTTTGATTCTGACTAGTGGCAATCTGATTGCCCAGTGGCGCTGGACTTCTCrrGATTGTCCAGTGGCCC^ 

GGTGAGAAAGATTACTTACCCACAGCCGGGAAGGGCGTTCAGGGGGTAGA^ 

TAGATCACTGTTTTGTITGTTTTTGTTTTGTTTCTGATACAGGGTATCACT

ATCCXXCTACCTCTTCTGAGTGTTAGGATTTCAGGC 

AGTCAAACGATAAGAGCAGTTTCGTGACAGTTCTAGGTTTGTCCCTTTAAGTTATGG 
TTCAGTGTCCCTGAGGAAGCTTGGGGCC^GAAAGAGATGGGTCTGTGATGGATTTCCGTTCAGTGGTG 

G CTG AATGAAAGGG ATT ACGGTGTG GTGTGAGC C CCGC CAAACATGG CTGGTG C AG ATGTTG ACAG CTG TGTT AGG CG AG CCTCTG 

TCC C CTGCCTTGTGAG AAG CCTG ACCT AAGACACGTTG CT CT AG ATGT AAGTTAG CTG TAGGCAGGATTGGG C C CACAGGG AATG C

TGCCTCAAAATCAAGAAGAGGAAGCTAGAATCATGTGAGGGTGTTTATAAACCATTAAT^ 

ATGCCTGTCTAGGACCn'CAGATTGGCCrCCAAGGCAATAGAGGAGATTGTCCAGGTG 

TCATCTGCAGAATGAATGAAAGAAAGCATCCCTTCTGTGTGTACrGTCACTG^ 

GGGGATGGTGGTCCCTCCGAGTCTCTCCCCCTTCCGAGTGTAGAAGAGGCTGGGTTAGAGAATTTTAACCTCCT 

CrGCCATTCACCTTTCCTTTTAAACACCCCTTTrCT

CCCAGGATCCCTGAGAAGAAAGTGCCATTCCTGTTCTGGAAAAAACTCCAGC 
TCAGGGACCATGGGGGTATGG^TCACAAATCTATCTCCAG^AGCTGAGTT 
CCCCTGACCCC^GCTCCCACCCCACCCrCATC 
AGAAC^GCTGTTGGAGCCAGGACTCTACTTGGTTTTAGGGGT 
CTTGTCACTTATGAACTCAGTGGTCTTAGAATACTGGGTTTC
ACTTAGGAGCCAATGGCTTCCCAAGGTGAAGAGCCCCTGCTGAGGTCAGGGTTTC
CCTCAG^TGG(!ACCGCACTTGCCTAGCAGGCACAAG CTCT 
AGGCGTCCAAGGCCATCTTCATCAAACTAGGGAGTTGGAGATCA^ 
TGACTGGAGTCTGACTCCGCTTTAGCCCTCGCTCCCTTCCAAGCCCAC^ 
CTGGAGAGACCTCATCrCCCTCCTCTCAGGCTGCTGCCGTCTTTCCTGGCCC
CCTCAGGCCTCTTTCCAGGAATCCTGACTTTGTGGGACATGCTGGCTCTG^ 

ACTGATGATAA CrrCl ' l"! i'l 1 1 1 n ^ "j' TT ' l TCCCTACCCCAAGAGGTCATTTTAGTTGGAAGACTAGAAGTGCrGTGTTCTGGTTT 
TCAGGGTGGGCGTCTGTCCAACTGCTTTGGGGATITCCAGGCTGTrCTAGTA 
TCTCTCCCCACGCTGTAGGAAAAGCAGACCCGAGAGTGTCCAAGTGTGCCATTAGGTGTCTGGGGA 
TCCCTGAGCTGGCTTTGTACTGTCCCTTTGTTCTCACTGAGGGCTGA

AGGATACAGGCTCTGATTGCAGGTACTGTACACACAGCTCCCAGACCGAATCCTTAACACTCT 

CCCAGCC^CACAGGACTCCAGCTCTAACTCCATCCCAGTGGGAACAGGACCCTATACTGT 

TGGCAAAGGGCAACCTGCAAGCAGCCTTGAGGGTGTCCTCTTTCTCGGGTGG 

CTTGGGTATCAGT AGTCACGCTGGTTCTTAGAGAGGCGAG CAG CTTCCTAG 
CAGGTGCCCTCAGTGTCTGGGACAGGGGGTCAGGGAAGAGCTGGACCCAGA
CTTTCGGTTTTCCTTTGGGTTTAGGGCCACCGGGCACTGCTGATTTCTCAT 

TTG CGTGG AC CTCTGAT CTT C CGG CAGTCAGTTCCGGGACTGCT CTG ACT CACAGAG ATGT ATTTG TT ATCCTG AAAG ATTTG CTT 
TCTTTCCAAGGATGTGTGACCGGAACGGCGGGCGGCGGCTGCGG CAGTGG CTGATCGAACAG AT CGACAG CAGCATGT ACCCGGGG 
CTGATCTGGGAAAATGATGAGAAGACCATGTTCCGTATCCCCTGGAA 
TTCCATCTTCAAGGTAAAGCCTCCTGCGTACCCACrCAGTGGTTTTTGGT^
TGACTAAACAGCCCACTATTTCAAGAGAAACAA 

TGCCCATTCCGCTCTTCTTGCTGCGGTGTACTGACATGCTCTGCTTGGTGTACTAAGGAA^ 
CTTGGCATGGftGGGATCAGCTATAAAGGGCGACAGGCT^ 

TTTAATGTTTCCCCTCTGAGAGGATTGAGTGCGATCACCGAGGGCAAGTCACAC^ 

CTC^CCCTGGGTGGCAGGTGATGCTCCTGTTTGCACTCTGTACTGCCACC^AGAGAAGCCACT^

TGTGATGGAGCCTGTGTTTGGGACTGGGTGGTCCAAGGTCTTAACC 
CC!AGCCCAGCCCAGTTCTGCAGGCAGGCAGGGTCAGACIATCCCCTC 
GGCACCTACATTCGTCTCTGGGAGTTTCAGAGTCTTAGGTC^ 
GGTTTGACCCAGAGTCCAGGTTGTAAACCACTGCCTTTAGCTCA^ 

TCAGAGGACCAGAAGCTGACTCAAGACCGTTTTCAAGCAGGAA^

GGTCCCTCCTGGGGCACAGTCTTCCCTTTCC^GTGAGAACTCT 
CACAGTGTGCTTGGCTGAAAGGCCATCACGAATCTTCC^GAAAAAAGGGCAGAGT 
ACAGCAGGCCTCGCTTCATCCAGTTCCCTGCTrCAGCAAAGCAGGA 
GTTCnTCACGGCCAGCAAGCCGTTTCAACAGCCCritSA^ 

GTCTGGCAATCTGAAATGAGACATCAATCTCAGCAGGAATGGCGGGAGATCAACGGGCGGGC^

ATTCAGGGCAGAGAGGGTCTTCTCTGAGCTGTCCTGAGTGGGGCGGGTACCTCCGCATCTTC^ 

AGGAACCGTAGGATGAGGCCTTGGGCAGATAGAGCCATCTTAGGGGCCGTCTGCTC 

AAAACCTTTAAGAAATGTAGGTGTTAAGGAAAAGTTATATGCTTGCTTCT 

GCTGTCTCCGGGCAGCGTCTTGCGGCTTGCTTCCTGTGTAGACCGCAGGGGGGAGAATCAAAAAGATCCT^ 

GGCAAGGGCTGAGATTCCTTGGG CCTCGCTCG AG ATAGGC CTGACCGTTT ACCACTTACTCTGTGTGCAGAGGGTCTTG AAGGAC^

GAATAGGGGTTAATTTTCACTTTACTTTATTTTGAGTTTGGAACTTAGGTAGTTCAGGTTGATCT 
TATAACCTTGACCTTCTGATCTTCCTGCCTCTATCTCCTGCGTGTTCGGATTACTGGTGTGGGCCACCA 
GCTGAGGATTGAACCCGAGGCTCTTG(ZATGCCGGACAAGTACTCCTrrATCAACTGAGCTACACT 
ACAGTCCAACAC^TGTCAGCAGAGAAGAGCTrCAAGCAGrrGGGCGCCC 

CCTGAGAGACCATGCAGCTAAGATAGCCCTGCAGAGATGGTTTCTGGCC^

CCAGTGCTGTCTCACGATGCTCCCTGACCATGGAGTCTGATTTGATTGGTACCTGGTATCT 
TGAGACCIAGAATGGAGGGATAAACTGGGCAGTACTGCTGAGATCCCACCGTTCTGG 
GTATCTGTGCTTTCTCGGGTGCCATTGGAGAGTGGGTTCACCCAATGACTTAAGG 
CTGGG CTGCTACATC CAGGGAGAGGT ACTGCTGGGGAATGCCTT^ 

GAGGTTTCCATGTAGCCCAGCCCCAAAGTCACTGTTCCAGTCTGCTTTGGAG

AG TGC CATTT ACAAT AAAG G C CAGAATTGG AACTGTCT ACTT AAAAAAATTTTTTCT AAAAG 
GAAGTGACCACTCAGGCTCGGGGGTAGGAATGTTCCTCCAGCTTCTTGCTGAGCATAGATACAGAACT^ 

CACAAAAAACGATTTCACTGAACCTGGAGCTCAGGGATTAGGCTAGGTTGACTGCCAACGAACC 

CCCAGGGCTGGGGTCACAGGCAC^CATACCCATCTTTTTCTGTGGGCA

AG CACTGT ATTAAGTTGGCCC CTG CTTTGAATTTCTGG CCTTTG CTT AAAG CACAAGCTTTGGATGGGTC^GAAAAAAG AAATT AA 
AT AAAAAG AAAAAAAAAAAAG AGG CACTT AAC!AGGG ACAACAGTTGGCT GG CT AG AATC ATG CAAAAAAAAAAAA 

AAAAAGCCCAAACAGATGTTCTATTAAGAGGAGAGACCGGTGGATACACATGTGTTCC^ 

AGCAGAAAGTCTTTG ACGGTGCTGGTGAAGCCATC CACATTTTCCTCTTTGG ACTTCATG AC CTAAGACACCCCTCTTGCCTCCTC 

CCCAGGTCCTTCAAGCCAGAGGACTTTGCCAAAGCTGCATCCTAGGCTGTGGGG^C

TTTTGGGGGACCCTTACCCAGCTCTTGTCTGTTGAGATACAGTGGTCrrTCCTGAAGTCCAATT 
TGATGGCCAGGAAAGGCTCTGGGGGAAGGGGGCGGGCGGTATGCAGCCGGCCTTT 
GTTTACTGAGTTTCTCTCTCGTTCTTCAGGCCTGGGOVGTTTT^ 
TGGAAGAOSAGGTTACGCTGTGCTCTGAACAAGAGCCCAGATTTTGAAGAAGTG 

ATATAAAGTTTACCGAATTGTCCCCGAGGAAGAACIAAAAATGTAACTATCTGTTGGGACCAGGAAGCCC

CCCCCCCCCCCCCCACGCTTAGCTCCTGAAGCCCT 
GGrrTTAGCTCACCAGAGTTGTTCTCrrCTTGTGTAA^ 

TG GCAG CTT CCT GCAG AGG GAGTGG CCC AGTGG AGG CAGGG AAGGGTGGTGG ACT CTGGG TCATGG CAGCCAGAAG CTGG AGGG AG 
CGAGTCCCCGGAGCTCAGCATAAACTCCTCCAAATGGGTTTGAGGCT 

GGCAGTCTACAGAGGAGGCTGGGGAGGAGTGGGCAAGGCAGGGACGGGCTGGTAAGGTCCAGGGA

CGGCAGTGGGGACATCACTGATGGCTTCGGTGTGTCACT^ 

CCT AAGAGCATCTGTAAAGGAATGAACGCACACTGGCCTTCCCCAG CAGAGCCTCGTGGCTCCCAGGGCCTGGTCCGTTCTGTACT 
GCTTCTGGAAGGCCGGCTCCTTTGTCTTCGCAG"TGA 
CCAGGCTTGATCTGGTTAATGACCCTGACTTGTTTATAGTGTGG^ 
AGAAGGACATAGCATTTGCATGGCATTTGTCTO3GTATACTGTACTG
TGGGTGGGTGGCXJCTCTCAACCrCTGCTTGCCCAGC

GATACAGGTTCCTGGGGGTGGGGAGCCTGGCGGNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNN^ 

TATACTGGCCGATGAAAGCATTGGTCCCCTGGGCAGGCAGGCAGCAGGCT 
GTATACACACACACACACACACACACACACACACACACACACACACACACACAC^
AAAAGGGCAAAAAGATTTTCTGTTACCTGGCACTCCCTACCTTCCAGGAA^ 
CTTACCTTCAGTATATGGGCGATAAATGCCTGGTATAAAATGCA 
CTTGTCCTTCACAAGCCAGTGTTTACAACCCCTC 

TGAGAAGCAAGGGCTGAGCCTTTCCCAAACCTTCCAGCAGCTGCTGGTCAAAA 
ATTCAGCATTTTAAGAAAAGGTTGCAGTGTGCTCGGCTGCCCTGGGTC^

GGCACCTGCAGGCTGCATGAGCGAAGTTCCTGAGATGGAGTGTGGCCGCTCAGAGATTGA 

GGGGTCTCAGGCGTCTT CCAGGGJ^GATGGGACATTGCrTCTAACCCTCAGAGGCACTGAGAG CCTCAGGCTCTGCAGGGAAGTGA 
GGGACACCCCAGCTACCTCCCCAGGCCAGCCTCCCTTT ACTGGTCAGGCCCTG C CCATCCAAGCATAGTTAAACCCATTG AAT AGC 
AGCTTTGAAGGTCTCCATGGAGTCTr AG CCTTGTTTCT CTTCTGGTCATCCCGTCG 
GCCCTCATGGTAACTAGATGGCTTGCTTTTCAAGACACAG^

GG AT C C ACTAAAGGTGGTGGGG AC CAGC CTGTGG CT CAGCGCTCACCT AT CGTGTACGACAGTTCCCAG CAGCAT ACACAAAACAC 
ACATCTTAGAATAATCCCCCATCACATGCATGGTTGGCAGCCTGGAACTGAAGGCTC^ 

C 1TT T TTGTGTGTCTG TTTTT AG C CTTTATATTGTGTCTGT AC AAGGTGGGG TTTCCATAG CTGAAACAACCCTTT AAG CTT AT AT 
AAGCAAGATGAGGTCTCTTTAGACGGCAGCKX3CCCAGACCCTCT 
CCTGCTTGAGCTCCAGAATGAATTCAAGGTAGAAAACTTCACGAGACCC
CATAGCTCCXjTGGTAGAGCGCTCGCCTCGCGTGCACAAAGTCCCG^ 
AGAOTGTTGTTAGCAACGACACATCAGCTGTGTGGTCTGCAGCATTCrrCT 
TGCCCAGCrCTGCGGCACATGGCCGTTGTGATGTTCTGGGTGGTCTTGGTTCTGTC 
GGTATAAGCCTTTACTGCCAACTTGCCTGAGCCAGTAGATGGTGATGATG 

GTCTCCrAATTGGACCC^TCCACCACTGAAGGCCCGAGAG

GGTGTCACCCCAGAGGGTTGCCCCAGAACTGGGCTATGTCTCCCAG 
CIAGGATGCCAGCCCAGTGGGTTTGTAGCTTAGGGTCTGATGAATTGCAAAGGGAAGCT 
TGGATTTAGTTGTGTGTCTGATATTGGGTTTTAC7AACATGGGGTCAGGT 
TTTCTGTCACCTGGGTCTACAGCCTTCTGTGGATGAGTACIATGGGTATGACCA 

GCCAG ATCCTCCCTGACTGGTGGGTCCAGCAGCCCAGTG CAGGTGAGTGTGGCTGTCCGGCACTAGCAACCACTAGG CTCTTTATG

TTAGGGTAGGAGGTGCTTGCCACTGCTAGGATCCATGTCCTGTGGAAGGGTCATG 
GCAGCCTCTCAGGCAGGCACAAAGGTCTTC 

AG AG CCAGGG CGTT AGG AG CGGG AGGGG AG CT C AGTCAGG AAT AGG CG TT AG TTTGGAAC CACTITGGGCAGTTTTCATG TCTTG G 

TGGTGCCTCTTGTTTGCTCATCCATAAAATGGGCCACCIAGGACCTGCCT

TGATAGTATCTGGGAGGTACACCCATGCTGTAGCTACTTAGAGGGTCA^ 
ATGGCTCAC!AGACATCTTGGTTTCCTCTTTTGCCTCrCTCCT 

CTGTGAGGTTTCCAGGCTGTGGGTGTTAGTGTCCTGCAAGGGGCCTTGCGCTGC^GGGGTGTC 
TCCAGCTTCCTTGTGAGACAAGTCCCGAACCATTGTGTATGGTCCTTGGGGTAGCAAG^ 

AGTG CAG AGG AAG T CC CCT CCATCATGG T AGGTTTCTGTGT CT CACG T AG AGTG AGGTCTG CC CACT AATGTTG ACACCAGCAGT G

TCAGCCTCTCATCT AT AGCTCAGGGATACTTGGGTCCTCCCTGCACCAGCC CAGCAT CTTACAG 
GCCTGCCACTGGTGAC CGG ATATGCCGCCTATGACACACACCIATTCAGGT ACCAGGGT ACTCTGGG AGAGC ATGG CACAGGCTCTG 
CTTGAAAATAGCCCCATGATGTAGATGGGGTCATGAGGTCCCACAGACAAGTGTGGCCTAGGGG 
CTTCCTCGTTCTGTGTGGTTTGGGAGCAGTTATAGTCATCCAAC^ 

TGGACTGCCTTGGCCTCGCGTGGTGGGATCTGACCTCTTGGGGCGTC^GGTT

GGTCTAGTAGGGACACAGCTGGCCAGTGTCTCCCATGTAGGGTCTGTCACCTCCTCTCACTTCCAA 
AGATGGTTGTACTGATATACTGATGGTGCTGAGGCTGGTGCAGATGTTGGTGGTGCTGATGGTGCT 
TGCAGATGTTGATGGTGCTGAATGGCTATGATGCTGCAGATGCTGCTGATGCT 
TCAGTCCTGTCTATAGAAGTTGGACCTGGTTCCTCCTCCCAGCCCCCATGCT 

TGAAGCTAGGCTCTTCTCTGACTAGATCATAACCTAAAATAACCCATTTATTTTAATC^

TGCTCAGGG ACCATGTGTCCCTCTCATCACAACTTCCCGGG AGAATCTC CCGCACTTCG CC CTATCCCAGAATTTTCTCTCCCTCC 
GGGATGTCTCTCTTCCTATTTCCTGCCTAAGCCATAGGCCGTCAGATTGACTGTTTC 

GTTCCCTCTACAGCTG CCTTAATGAGTGAAATGATTCTTTCTCCTGG AG CACTGTAGATGAAGAG CGAGCCTTCATCTGAATTCTC 
CCCTCCCTCCCGCTCCCTCCCCACCTGCGGTCTTCTGGCTGTCrCCTCTGCGGTCTGTGT^ 

TGATGTCTGTCTGTCCCTGACAAAGTGCTTTGCTCTGTATTTATGGAACAGCA.CAATAGGTA

TATCTTTTTATAGAGATCTGGAGTTTTTTTTCAAGTTAAGACGT^ 

G CAG ACATGCTATTGATT^ATATCCTCGTGTGCCTTCCAACT CTT AGGGG AT ATTTCT 

G CTGTTTT CAG CG AACGATTAC CATAGTG TGT AC AATGT ATG CATG CAC AC AC ACACACACACACAC!ACACACACACACT ACT CTG 
GTCATAGGCACGGAAGGGTATATAATTTGAGATTTAAGCTATGOVTTTAGCTTGTGTT 

TGAGGCGACAT AAGGTTGGTTGCATGGGTCTTTCAAAGG CAGGTT AAACGGGCCG AGAACACAGTATGG AGGCAGAG CAAGGACGT

AATTTTAAATGGTCTCAAGGCCAATGGTTAACCTGX3TTGCAGAGCCT 

AAAA71AGCTATTTCTATAGGTGACGGCGCTCCTGATGGTGGTGATGGTGGTGGTGGTGCTGCTGCTC 
TGATGATGGTGGTGGTGATGTTGAGGATCCTGATGGTGGTGGCGATCATGGTACT 

GTG CTG AT AGTGATGGCTGGTG TTT GT AG TGTG ATTTTGGGTGCCAG AG T CAAGGG TGAC ATTCTTGTGGG CTCTTG T CT CTTT C C 

CATGCT ATT AACCTCCT CTAGGTTATGAAGACAGGTCGTTTCCCATTTCAAGGCCCTGGG

GGGATCTGCCATCCTCACTGCTGCTCGCTGTGAGGCAGGTGGCATCTCTGTGCCGTCCTGCGAGCCACra 
GACAGGCTCGGTGGGGCTCAGGGAGAAGCCCTGGGAGCTGTCACC^AAGGTAGAG^ 
AGACGTTCGCCCCTCTCTCTAGTGCTGGCCTGTCCATGTGTTGGGGTCCTTTTAGATCT 
CAGCTTCOVCAAGC^TITGTCrTCrCGACTGCCCTC^ 
GG CCTTGT CAG C C CCGGG CTGTGG C CCTGAG AAT CTG CTGACCAATG CATTCTC>GGG CCACACTGGCCGGG CCAG ACCTGGGG CCT

TCTGAGTGAGAGGTAAAGGCAGGCTGTC^3GGAGGAAGTGCCCTTGAGCCTTGC^ 
TGCCCTCAGCCACAGCTGGGAGCTCTTCTTCCAGAACCCAGGCCACATGCCCCGTCTCT 
ACTGCAGAGAT^CCTGGGGTGAGATGX^TGnGGCAGGGG 

TCTGAGGTCTAACCCATGGTAGCTGCCTCCCTGTGGTAAGCAGTGTTATAGGTCTT^ 

T CAGAAAAACC CAG AC CTCTGG CTTTT AAATTTT ATTTT ATT CATTG ATTT ATTG ATTT ATTTTT AGAACAAT CTTGAG AGATG TG
T AGCTCGAGATGX3TCATGAAGTCACAATCCTCCTGCCTCGGCCTCG AGTGCTCTGG CATGAGGCACC CCTCTCTGTATTTGCCACC
CACGCCAAGGCTTTCTCAGAGGTGCTCTGCTC 

GGTCAAACAGCAAGG CT1"1H 1 1 GGTGTGTCAGCACTTCAGGGTGAGGCAGJVGCCAGCCTGCTGTGGAGAAAAATC 
CCAGCAGACAGCTGGAGTGGCAGGGCTTGCCCTTTCACCT 
GCCAGATGTATGGATATGCCCAACCCCCGC^AACCAGGGATTTCAGTACGTAGTCTCAGGG
TCCCGTTTCCCrGGCTCCTGTGGTCTCACAAGAGACCCCCCCACAAAGCTGGCGCT 
AGGCAOVGAGAGGTGGAGCCGCTTGCCTCAAGTCACACAGCTGTGAGGAA^ 

G CGATGCTG ATGATGCTGATAGCGATTTTAACGGTGGGCCTGGGAGTGAATGTGGGGCGAGCTG CCC 

ACCCAGGCAGTGGTGCAGTTGGTGACrGCAOVGCTTTAGGACACC^ 
AGCCTGGATTGGCCTGAGGTGGCGAGGCGAAGCITAGGATGCACAAGG

AGCTTCTCCAACAACAAGCCTCTAG CTCTGGGCTCCTO CGAGCTTCCCTCCCAGGAGTCCCGCCTCT ATGATTCCCTCCCT 

GGAGCCCCGCCTCCACACTAGCTGTTAGTCAGGGGCCGCTGGGCCCTCTGCCTTGTC 

ATCTGGAATCCGGTTGTCCTTGCCTGCAAGAGGACGTCAGTACCTTCAGTACCGCCTGCT 

AAGGAAAGTCAACGAAAGCCAAATACTGGGTTGGCACGTTTCCAGCCTC^^ 
GAT CGTTGGGATT ATT ATGGT CGGTTGGTGC CAG ACTG AAGCTTTT CT CTCrriTTAAAACTrrrrrG rri rri"GTTTTGi"i~i i iG

TTTTTCGATTGATGGGTITACGTTGTGGCACTGGGGATGAAGCCTAGGA 

ACCCCAGCTCCTGAGGTGACAGAGAGAAGGGTTGGCTAGGATGTAGCCTATGGTCACAGCTC 

GAGAAGGCTTCTAGGTCTTrCTGGAGGACAGGCTGTGTGTAGGATTGTACAGAG^ 

TGATAATGTTGATACCTTCACAGGTGGGTTT CTTCC 
TAGGCAGCTCATGAGAATTACATTGGCATGCTTGT^

TGCATGCACATGTGCACATGTATGCATGAATGCATGTT 

TTCATATGTGTGTAGGTTCATGTGTATGTGAATGCCTGTTCZACACX3TATG 

TTCTTGGGCATTGCCCGTCTTATTTTTTGAGACAGGG 

CAGCCTC03GGGTCAGCCTTTCTCTGCCTCrCATGGGCTGAG^ 

TTTAGGGGAGGCACTGGGGCrCCCCATCTTCTAGGCCTGATGGCCCTGTGCTCT^

AGCTTCTACTACGGGGGCAAGCTGGTGGGCCAGGCCACCACCACCTGCCT^ 
GCCTAAGTTGTATGGGCCGGATGGCCTGGAACCCGTGTGCTTTCCGAC^ 
CCCGGAAGCTGTTTGGGCACCrGGAACGTGGOSTGCrACTGCACAGCAA 
CGCGTGTTCTGCAGCGGCAACGCGGTGGTGTGCAAGGG<^GGCCCAAC^ 

CAACCAGTTCATCCGAGGTCAGTACCAAGTCACCT

TG CCTCCTGT ATCATGGGCCAAGTGCACTCTTG CATGG CTITGGCAGTGGTATAGCTGGGGGAACCCTTGGAAGCCTAGCCTGCCT 
GTGCTTTCTGAG03CCTGGGCAACCCTTTCCTCACCTCCCCCT 
ACGAAGCTGATC^TTGTTCCTAGCCTGC^AACGGCTTAGCAATGCAGACT 
CCTGACAGGGTCCCTAGGGGATGTGGCATGATTTAGTGGGCAGGAGGATCATGGAG^ 

GTGAGATCCTGTCTCCCCAAAACAAATCCAGACCAACAAAAAGTTCCCCAAATCGCCTAGCAGTAT^

GGGTCCTTGGTTGGCTTCTAGCTAAGCAGCCTTGTATCCTAGGC^ 

TTCTGTACC CCAG CCCTGGGCCCTGG CTACACACAGCT CAGCGGTTCACTGGTGTCTGTGTTCTTCCA 

GCCACCCAG AGCCGCCT ACCTGACAGCAGGGTGGTCCTGTCCTTCGGGGAGGAGTTTCCGGACACT C CTTGCGCTCCAAACT 
CATTCTGGTGCAGGTGAGAGCCGGCAGCAGTGTCTGCCTGTGCCCAGGG^ 

CAGAGGTCGGGGACTAGATCATACGTGGCTCTTGTCACCTCT

CTTCCCCTCTCTGCTTGCTCCTACTTTCCCrCCCTCrGTTCTCCTGTT 
TAAGATCTAAGTGACAGATCACGGTCTATTTGACTATGGTCTGAACGGTW 

CAGCCZAG CCTGGTTGGAGCAGTTCTGGG AGAAC ACTCTT AG CCATGTGCAGG AACGCTG CCCACCTGGCTCCATCTTGTCTCGGAG 
GGGATTGGAGGACCTGTGGTGGAGGATGGCCCC AGTTCCCCTC CACCTTGATTCTGACTTTG CACATGGCCCTTGCAGGTAG AGCA 

GCTGTATGCCAGGCAGCTGGTGGAGGAAGCGGGCAAGAGCTGCGGTGCTGGCT

ACCAC^CTTTCCGCATGTTTCCGGATATCTGTACCTCACACCA 
TCAGTCCGGGCACCCCACCTCGCCTGAGCTCAAGCT 

TGGGGTGGGCGGGTGTCCTCCAAGGGGCCTCCGGAAGCCCACAGAGGGATGCGCTCCTGCT 
GGCTGTGGCCGCAACCTGTGATTAAAGCATTCCTTTCCTGCGTTTCCC 

TCTTT CGACAGTT CAAAT CAT CTGG TGGCAG CAG ACTCGCCTTTGCC CTTCTGCGG C CG AG£^CGG AG ATTT ATG ACTTT CTCTG C

TTGGTTGGAGAAGAAGAATCTTTACTATTCAG LTl ' C ' Tl ' rrCTl ' l 

ATATTTGTATTCTCACAGGCTCAGCTGTCAATCACTTG 

TGCTTGCAGCCreCGCAC^AATTTTGGTTGTGGTTCATTGA 

AAGGAGG ACCAGG CACTGTGG AG AAG ACCCATTCACTGGCATCTCACCCTTC CTTGTCCAGCT CCATACCCAGTCCTAAG ACCCAG 

TGAAAAG CCACGT CCAAACTGTGCTCTGGG CTCATCAGTGCCC ACCCACGTACCAGGGAAAGG CACAC ACCCTACCCAGTGGGCAC

AGAGCGGAATGTCCCCCTACCGCACCATTTGCGCCCCCAATCTGGCTGTCCAACCTAG 

AGCCCCGCCTAAGGGACACTTCCCGGAGGAGGGAGCOSCTG 

CAAGCAGATGTGGGGGCAGGGAGAGTCATCATCTGCTTTTGTCTGAGAGAAGGA 

CTGGGTTACATGCTCCACGCAGCTAGAGAAGCCTAGGTGCTCTGCATTCCCTGGGGAACTGCAGGAAAGCCT^ 

TGCTCTGGGGAAAAGCCTGAGGGTCCAGAGCAGCTACAAGCT^

CTTCCCGGCTACICTGTGAAGGTGCTGGGTGGGGCC^ 

TTGGAGTGTGTGATACATGCAAACAGAATCCTGGGCCCCTGCTTCTCCCCCTCAGTCA 

ACAACCTCTTC^CAAGGCCCCTrGTCACATGTCACCTTCCAC CTG CCTCAAGGAGTGCTAGTGTCC^UUVT ATTT ATTTTTGTATTC 
TCTTAAGAAGTATTGATTTCATCCTTTATTAAAAAAAGTTGCrCTTTCACAAAA 

TTTGAAT ATAGTTTT CATTAGATTCCAGG ATTTTTT 1 i rCTTT 1 1 AAATTTTGATGTTAAAAAACG AG AC CAG AGGTCTGGAGAGA

TGGCTCAGTGGTTAAGAGCACTGACTGCTCTTCCAGAGGTCC^ 
TAATGGGATCTGATGCCCTCTTCTGGGATGTCTGAAGACAGCGACGGTGTAC^ 
CTTTAAAAAAAATGAGACCAGATCTGACTCTGCAGCC 
GGAGGAGAGGTGTGAGTCACTGTGAATCCTTAAAGGAATCTTT^ 

TTTTCAAGGATTTAGGGATTT'IT.1 T rTCTI CT CTT AAG ACAAT AG CAGAAAGCT CTGGAGAATAACAGACACAGGAGAG CACAG AG

GGCGGGCAATCC^CGCTGTTCTACCAGATGTAGATTAGGGTTTCAGTGAGC^ 
GGAGACTATGCTGACAGCACCCTTCTGGAAGCTTCT 
CATTCGTGCACACATACTTATGCCAGAC!ACCGGC!TCAGAATGA 
AACTGGGAACAAAGGCCCTCCCCA.CCCCCAC^CCGCAGGTGGGCACAGACT 
CTCTG CTCTCATGGGATCTaVGAGACAGGGC
G AGTGAGGC CTCCCTTGTTCCGGGAGTGCAGTT AG AGCTTCTGCAAGATCCTTGGCCCAGAGG ACAGCT CAAAGT A

AAGCAGCCATTTCTGCAGGGAGTCCCGTTGGCTTCAGAGGGCTCCAACGGGGG 
AAAGTGAATCCAGTCCTGTTTGTGTCTCTCAGAGTTCAACCTCTGGGCCAAG 
TTTGGCATCrCTGAAGGCTGGGGCCTCCTGGGACAGGACTTGGGTC 
TCCTGAAGGAAAGGCCAGCTACCTAACCACCACTCTTGGGATCATATAAA^
AGTGGCTACTTAGTTCCCTAATGTCTGCAAGGTCCACCTGAGCAGA^ 

AGGACCCTAGAGACATCGAGGGAAGAGCAGGGTGGGGTGAGGCAAGTGCC CTGGGGTGC CTGCTGTGTGAGGTATTGGCTGGGCCG 

GTGAAAGACAAGGGTTGATTGTGTGGCCAGAGGACAGGCTAGGGTTCCACT 

AGAGGGATTTGATACCACACACAAAGAGGCTTAACAGGCCCGAAG^ 
ACCGT ATTT CTGTTT ACCACTCAT CTGTAATT AAG CCAGG AGTG ATT AGAGAGCTGAACT AT AAAT ATT AAAT AG TC CTTG CTG AC

AGGCTACAACGCCAATGGTTAAGTAAGTGTGTGTGTGTGTGTGTGTG^ 

TAATAGGGTGTGGTGGTGCACTCCTGTAAGCCAAGCATTCCAGAAGCTGAC^CAGG 

ACACAGTGAAACACTGTCTCAAGAAACCAAGCTGCAACAATAGTGG 

ACTG TGCAGAGTTATATG TTTTCCCTTT ATTT ATGGACACCG ATTT AAT^ 
ATTGT AAACAGT AATGAATTT ATT AAACTTCTCT CATT AGG^ C CTTG

CTGACTCCCCXjATCCTGGCTTCCCTTCCGGACCAGAGCCTGGAGTTCTGTA^ 

TTTCTGAGAAAGAATCTTAATCTATATTCCAGGCTGACCrCAAA 

AGGCGTTGCCTGTGATCCACCCACCGGGAGCCCTGTGATGGGAAGAAGGGATCAGTGGTTGGCT 
ATCCCCTCATGAAG^GTGATCCCATGTTGCTTTAGGCAGTCCTACGGACTCGCA 

TAGAGGACATCAAAATAGTGTGCTTTTGGTTTAGCCTTTCCT

ATCTGGGCTCTAGGATCCCTTGTGGTGAGGTTTGGGGAAGGGAAT^ 
CCTGCCCCCCCATCCCGTTAGCATGTGCAAGTGGAGGACTGAGCT 
GAACTCAATATATGTTGGCATTATGGGTCATACCTTTGTGCTGAGGGCTATCCCTACT 
CTACTATCCCCTACTTTGCITTTGTCTGAAGTATATAGAAAAAAAA 
ATCC CCTTAGCAGGTCCTTAGCCAGCT r i ^ GGGAGAAAAACCAGTGTCTCAG CCACCCTGGATCCCCTTTTCCTTCCTGTCCGTG

GCTTCTGCAAGGGTCTCTCAGCCTCTAGTGCCATGAGCGGGGCATTGGTCAATGC^ 
ACTCTTGCCAAGGCCTCTGAGAACAAGTCAAACATCTGATGATGGGAGAGTCTTCTTGAT^ 
TTGAGTCAGAACTATGAAGAAGGGACACGTGGGTCAGAGCCAGAGAAGACACGTC 
ATGACTCTTGAAGCTGAGGACTGGCAGGGAAACACAGGAGGTTCTAGAAGGAA 

ACTGTCAGGGAACGGGTGTGTGCTGTTTTTAAGTCACTGCTACA




JCCGGCTGAGAAA 



GCCACTTCCTTCTGCCAGCCTGCCGTCCATACTGGCATCTGTCTTCTGTCCTGCTGGGCTT^ 

GTGGGCTTCTGTGGAAAGATTTTCCCTGTGTTCCTCCCCTGACTGTGGCAGAAGTTT^

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

AAGCTTGACCTTAAAGGACCGGTTG CCACTG AG AGATGTAAATG ACG CCATAGGACACAG ATCGCCTTAGGTGTGGGAGGAG AATG 
ACACTTTTAAAGAGGGGCTTCCCGAAGGCAAAGCTTCAGATGATGGCTTGTCT 

CAGTGAAATGGATGTGGATGTGTAAGCGAGGAGCCGCTGGGTCTGATGGGACTTGAAGGTTCTAGAACATATGGTCA 

CTAAGTGTGACCCATGTGCTGATCCCTTGAAATCATTCTCCAGGTGGATTACAGTGCCTCGAGTC

CCGGTGGGGACAGCGATCCATTCCAGCATGCTGGATTCTACCCACGATCCT 
ACCCATGAGTTGCATGCCTCAGCCACCrrCCAGGGGCAGGTCACCTCAG 
CCATCCCATCTCCCTCCACTTCAAGAACATCACTCTCACTATTCAGATAGAGAAATTA^ 
AGCCGCTTGTAGTGGTCATGAGGTGGCCCAGTGCTTAGTGGAAGATGGGAGG^ 

CAGCCCTTCTTAGCCAATGAACAGGTACGAGGACT 

CTTACCTTGCTTGGCCTCCTGTGATGTGGGACTCTCCTATATGTCATGC^ 
TCCGAGAGACCACCTAGCCGGGAGAGGTTGGGTGTCACCCAGCTCTCTTGCAGGTCTGT^ 

TCTGGAGCAGAACTAGACCAACCTCACACCTCGAGCATTCCTGGATGGGTGC^

GAAAAGAGTGGGGATTTTTTTTTTGGCCGCTGTGTTGTCTTGTCCACGA 
CTCCTTTAGCCGCTCAATAATGTCCTGAGATGGTTGTCTCTGTTGACACCA^ 

CTCAGGGCTGATTGGTGGTGTTGTCTGAAG CCAGTCTC CTT AT CCACCATTG C TG TCCT CACAGAGG TTCACTGCAGAGCCCACG A 
AGGTTACAACCTGAGGGTTACAACAAACATTGGACTC^ 

ACAGTTGACAGCTAAGTCTCCGCCTCTGTGTTTTTCTGGAAATTAGAGGCTAAAG

CACCACTCAACTCAAGCCATG^TGGACCCAGCCCGGGGTGGCCTCTGCTCCTGACACGGATAACC 

TTAGCACACTTTTGTTAATGAAATCCCACACTTTTCTGGTATTT 

ATGATCCCATATCCTAGGGTAGTCACATCTCTTTGACTGTGACrC 

TGTCCTACAGAGGCTCCCCCCAGCTGGGACTCTGTAGTCTTTTCCTCCTGTTTAACGGAGGATACTC 

ATGCCTTTAATCACAGCATTCAGGAGGCAGAGGCAGGCAAGTCTCTCAGAGTTAG

GACAGCCAGGGCTACACAGTGAGACCCTGTCTCAAAATAAATAAATTAAAAATAAAATAAAATAA 

CAGGCACACTACACAGGTGGATTTTTTATXKK3TGCTGGTCTTGGTCTCCCAG 

AGTCCAAAGGCCAATAACAATGGGACAGTTCAGGGCTCAGGAACATGTGCTCTGC 

CCCCACGGGTCCCCCCCAGTTCCTGGGGTCTC CAGCAGTTC CTACAT ACTTT ACTTTTATTTTCTATTTTTTAAGAATGCTCTT'CA 
GAAAGTCAACCTGAACAGTTTTCCCAAAGGTCCAGAGATGACACGCCTGCGACACTCTGGATCCCCAGCCTGGTTTCT
AGCTTCTCCTTCCTGCCACCTCAGCCACGACAGCTTCCTTTCXj 
GTGCACCCACCACTTCCTCTGCCTGGAATCTGCTTCCTTCCCrCGT 
CCTCCAGAATAAAATAAACAACTCCCTTTGCACTGTGGGAGCTCAGTACAAAGGGAGGCT 

ACTCCATCCATCCACCGATCAATCCACCTGTCACTTGCTCTTGAATTCACTTA^

CACCCATTCATTTATCCACTTGATAATGCATCTAT^ 

NNNNNNNNNNNNIWNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNICJNNNN^ 

NNMNNNNNNNNNIWNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNIJNNNNNNNNNNNN^ 
NNNNNNNNhfNNNNNNNNNNNNNW 
jjnjjnNNNNNNNNNNNNNNNNNNNN^
NhTNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NN^nWNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GTAGGGGGGAGTGCIXJGCCTGCAGTGTGAGGTGGGGGGAGTGCrGGCCTGCAGTGTAAGGTGTGTC 
CAGTGTAAC^TGTGTGTGGGGGAGAGTGCTGGCCTGCACT

AGTGCTGGCCTGCAGTGTAAGGTGGGGGGGGAATGCTGGCCTGCAGTGTAAGGTGGGGGGGAATC 

GGGGGAGTGCTGGCCTACAGTGTGAGGTGTGTGTGTGTGTGTGTTGCTTTCAG 

GTGTGGGGGTAGTGCTGGCCTGCAATGTGAGGTGGGGGTAGTGCTGGTCTGCAG 

TGAGATGGGGTGGGGTTTGCTGGTCTG CAG TG TG AGG TGTGTGTGT ATG TGG AGGGGGAG TTCCTGGC CTT CAGTGTGAGG TGTGT 
GTGTGGGGTGCTGGCCTGCAGTGGAAGGGAGTGGGAATGGGGCTTGGACACT

AGGATGATTGTGGTGCGATGAAATACAGGCAGAGAGCAGGGATGATGTCCAGTCGTGAAATGGCTCTGCGGGAGCCC 

CTGAGAGTGGCCAAATGTCCCAGGATGTGACTGCTGTGAGTGCTGCACTGGTTATAGGGGGCTTTAGGCT 

TGGCTGTGACCACTGCCAGGTGTCATGTGGGCGGGGCTGGGATGGAGGGAGAATGGTTTCTGTTGCTCTAGCTGT 

CTGTGCCCCATGCTCTACCTTCCTCCTCTGTAAAGCCAGGATATCCTGGGACTCCTGTTT 
G AGTGCCCACGTGCTGGTCCCCAGTGTGATTTCCC CACATTGATGAC CCTTGTCCAGGCAGCTAGGTGGATGCAAGTGACCAGAGA

AAGCCATAGGTCAAAGGAACAGGGCATCCAGCTGGCATGGGGGGGGGGTACTGTGAACCCCACAGAATCAGCT 

TCCCAGGTTCAGAGTCAACCCCCGTTAGCATCCTGGTGTATGCTGTGGGTT^ 

TTGTAGATCTACGGTTCCAGAGTGAACTTGGATGCTCTCTCATGACCATGTCTGCT 

AAGATCCAAACCTG CT AT CTT CAGT AG CAGTGTGAGG CA^ 
AGCACCTCATAGCTGGTGTCTCATCCCAACCTCGCCTGCCTCTTGCCTGGTTACTC

AAGCCTGTGCCATTAGGTTCATGAO^CTGCTGTCATGTCCTGTGGGGGCTCATTG 

AG ACATTTGG CTGTG AATCTGGTCTG CTATAGCAGCTCCTAGCTCATCTCCTCTGCTTGG 

MOUSE SEQUENCE - mRNA 

CCGGAGGCGCGGGCAGCGTGGGAACCGGCGGCAGGATGTGTGACCGGAACGGCGGGCGGCGGCTGCG^ 

ATCG ACAGCAGCATGTAC CCGGGGCTG ATCTGGGAAAATG ATG AG AAGACCATGTTCCGTATCCCCIX3GAAGCATG CCGGCAAGCA 
GGATTACAATCAGGAGGTGGATGCrrCCATCTTCAAGGCCTGGGCAGTTTTTAA^ 
CAGCCACGTGGAAGACGAGGTTACGCTGTGCTCTGAACAAGAGCCCAGATTTTG 
TCTGAGCCATATAAAGTTTACCGAATT^TCCCCGAGGAAGAACAAAAATGCAAGCT 
AGTTCCTGAGATGGAGTGTGGCCGCTCAGAGATTGAGGAGCTGATCAAGGAACCTTCT 
GGAGCCCATCCCCACCAGAGGCCTGCAGGAGCCAGATCCTCCCTCACrrGGTGGG 
ACCGGATATGCCGCCTATGACACACACCATTCAGCTTTCTCCCAGATGGTCA 
GGCCACCACCACCTGCCTTGAAGGCTGCCGTCTCrCCCrGAGCCAGCCGGGG 
CCGTGTGCTTTCCGACGGCCGACACCATCCCCAGTGAGCGGCIAGAGGGAG^ 
GTGCTACTGCACAGCAACCGCAAGGGCGTGTTCGTGAAGCGGCTGTGCCAGGGCCGCGTGT^ 
CAAGGGCAGGCCCAACAAGCTGGAGCGGGACGAGGTGGTGCAGGTCTTTGACACCAACCA 
ACGCCACCCAGAGCCGCCTACCTGACAGCAGGGTGGTCCTGTGCITCGGGGAGG 
CTCATTCTGGTGC^GGTAGAGCAGCTGTATGCCAGGCAGCTC^TGGAGGAAG 
AGCCCTGGAGGAGCCCCAGCCGGACCAGGCrTTCOSCATGTTTCCGGA 
ATGAACAGATGACCGTCTAAGCCTCAGTCCGGGCACCCCACCTCGCCTGAGCT 
GAAAGGATGTGGAGCCCTCTGACTGGGGTGGX3CGGGTCTCCTCCAAGGGGC 
GGCAGGTGTC!AGAAGCTTGCAGGGGCTGTGGCCGCAACCTGTGATTAAAGCATTCCTTTCCTC 
GCTGGCCTTTCTGTGTGCTGAGGTCTTTCGACAGTTCAAATCATCT 

CGGAGATTT ATGACTTTCTCTG CTTGGTTGGAGAAGAAGAATCT TT ACTATTCAGCITCTTTT CTTTTTGGCCAGAACTCTGAAAA 
AAAAAAAAACTCTTTTTAAGACAATATTTGTATTCTCACA 
ATTTTAAATATGGGTGTAGATACTGCTTGCAGCCTTCGCAGGAATTT^ 
TGACAGGGCTGTGTCGGGCATCAAAGGAGGACCIAGGCACTGTGGAGAAGACCCATTC^ 
CCATACCCAGTCCTAAGACCCAGTGAAAAGCCACGTCCAAACTGTGCTCTGGGCT 

CACACACCCTACCCAGTGGGCACAGAGCGGAATGTCCCCCTACCG CACCATTTGCGCCC CCAATCTGGCTGTCCAACCTAGTTTGT 
AAGTAATCTAAATCAGTGACTATAGCC CCGCCTAAGGGACACTTCCCGG AGGAGGGAGCCX3 CTGAAATGGAGTTAGTTTGAGGGTC 
AGTACACAACAGGGGCA.GAAAGCCAAGCA.GATGTGGGGGCAGGGAGAGTCATCATCTGC 
CGTTTGTTCAACTTTGTAACAAGCTC^TTACATGCTCCArcCAGCTAGAGAAGCCTAGGT 

GGAAAGCCTTACCTGCTGACTGTTGCTCTGGGG AAAAGCCTG AGGGTCCAG AGC^ CTACAGG CCATACCTTACAACCT 

GAAAAGCTAAGGACC^CGGTGACCTTCCCGGCTACTGTGTGAAGGTGCTGGGTGG 
TGTGATAGATGCAAAGAGAATCCTTGGAGTGTGTGATACATGCAAAC 
GGAGTGTCCCTTCCGAAGCCAGGACAACCTGTTCACAAGGCCCCIT^ 

T CCAAAT ATTT ATTTTTGT ATTCTCTT AAG AAGTATTG ATTT CAT CCTTTATT AAAAAAAG TTGCT CTTTCACAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING

ATGTGTGAC C GG AACGGCGGG CGG CGG CTG CGGCAGTGG CTG ATCG AACAG AT CG ACAG CAG CATGT AC CCGGGG CT G ATCTGGGA 

AAATGATGAGAAGACCATGTTCCGTATCCCCTGGAAGCATGCCGGCAAGC^ 

AGGCCTGGGCAGTTTTTAAAGGGAAGTTTAAAGAGGGAGACAAAGCrrGAA 

AACAAGAGCCCAGATTTTGAAGAAGTGACTGACCGGTCCCAGCT 
7 0 GGAAGAACAAAAATGCAAGCTGGGCGTGGCACCTGCAGGCTGCATGAGCGAAGTTCCTGAGATG^ 

AGGAGCTGATCAAGGAACCTTCTGTG^ATGAGTACATGGGTATGA 

AT^CTCCCTGACTGGTGGGTCCAGCAGCCCAGTGCAGGCCTGCCACT 

TTT CTCC C AG ATGG TCAT CAGCTTCT ACT ACGGGGG CAAG d'GGTGGGC CAGGCCACCAC CACCTG CCTTO A A GG CTG C CG TCTCT 
CCCTGAGCCAGCCGGGGCTCCCTAAGTTGTATGGGCCGGATGGCCTGGAACC 
7 5 GAGCGGCAGAGGCAGGTGACCCGGAAGCTGTTTGGGCACCTGGJ^ 
GAAGCGGCTGTGCC^GGGCCGCGTGTTCTGCAG

TGGTGCAGGTCTTTGACACCAACCJ^TTCATCCG AGAG CTTGCAGCAATTCTACGCCACCCAGAGCCGCCTACCTGACAG CAGGGTG 
GTCCrGTGCTTCGGGGAGGAGTTTCCGGACACTGTGCCCTTGCGCTCCAAACT 
GCAGCTGGTGGAGGAAGCGGGCAAGAGCTGCGGTGCTGGCTCCCTGATGCC^ 
GCATGTTTCCGGATATCTGTACCTCACACCAGAGACCCTTTTTTAG

HUMAN SEQUENCE - GENOMIC 

ACAGAGGTGTGAATTAGACACCCCTGCrrCTGGACTGCGCCT^ 

GTTTGAAGGCAAGAGGCTCCAGCCCCCCGAGCCTCAGTTrCCCCATCTGTCAAG 
GCAGGGCAAAGGAGATGATTTAAGAGCAAGTATTTTGTAA^

CAAAGCTGCCTAGCAAATTGATTCTTCTCCGTAGTTTCCTGCATTC 

GACTCCTGGGCCTGGGGAACCCACTCTGCCTa^AACCCTTCCT^ 

AACCATGCACTTACAGAGAAAGAAACAGCCAAAAGAAATAGAGAGAAGC^A^ 

CGAAAATACTTACTAAAGACCTATTAGGTGGCAATGAAGTTGACCTC^ 
1 5 ATGGATTAAGTAGTACACGCTCCTGAAATCACACAGCAACACTGCAATCACCTCTAACAGG 

CTTCCIAGGGACAGGTGGCTCCCACGTCACCAGGCACCTCATTCC^ 

ACTGAGGTAAGCATGGGGGCAAATTCTGCCCACTGACACACA^ 

CCACGCTCAAAGTTTGTGATTCTCCTTAGAGGTAAATTATTGTTGTTGTATTGA 
2 0 CAGATG CACCTGCTG CACATCTGCAAAGGTAAGCTGGAGATGAACATGGGTGAGCGTTCTGGAT^ 

ACATAGATTCATGATGCACTGAGTCAGAATCCCTGGAGGCTCCTCCTGGGAACCTACATC^ 
TTATGCTAAAATTTAAAAACACTCATCTATATAATATACAATAATATATATTT^ 

TAAGCAGGCTrGGAGTCCCATAGGTGAACCCACCTCCCAGGTTTCAG CAATTCTCATGCCTCAACCTCCTGAG CAGCTGGGATTAC 
AGGTGCCTACCACCAAGCCCAGCTAATTTTTGTATTTTTAGTAGAGAC^ 

2 5 TGACCTCAAGTGATCCACTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAG 

TTCACAAACCAGTATTTCCTGGCAATAGCC CTAAATGTTCAGGATATTTTTGATCTATGCTTAAG AAAC CTTTT ACTCTT AAGTTT 

TTTTTTCCTCCCCTGGGACTGTTTCAAGTTCTTCCGTTAGTAATAAATGTGAA 

GAATGCITAATGAAATTTTTTCCTTTAC^TTAAATGTTrA 

TACATT ACAG CT G CTT AAT AAACAG C CAAT AAATG AAC GCCAATG CATTT AAAAGT AATG AGTTTT AGGAT AT AAAAAATAG AAAT 

3 0 CTCAGGTTTTTATTTGTAATCCTTCTAGAGTAAAGAAACAGATTGC^ 

ATGAACAGAT AGAGCAAACTG ACT CAGTGAAAG AG AAATG ACTGGAAGCAAAATGGGG 

ATTTTACCCAGTAATATTTCTCTTCCCAAGTCCCCTGAGTCGGTCCCAACITACTC 

TAGCTTCTTACACGAAAACATTTACCACTTGTGGACGTATATAT^ 

TG CCTGAT AAACIGT AGTC ATTTGTTCTGTGTTGTGGAG ACTG AT AAGTG AATCTTTT C CTT ATCTG C CAGGGT AAACTG CCAACT 

3 5 TTCAAATACTCTTTC^TTTTCCCAGGTGAGTCATGCACCTTCCAGTGT 

TAGTTTTTTGACCTAGAGAAGGCATACCAAAGTCTGCTTGAG 

ATAAACCTTAAGTGG ATGG CAG^ATT ATGTTTCCACATGTGCACACCTGTTTACAGAC CACCCACATCAGGTAGGGCAAT CCCT CT 

GAGGGGCCGCAGCAGATTGTGAGGGGCTTCCTGTTTTCATTCCAACCTr 

TAAGCTTGAGCCCCCCGACCAAGTGAAGCATGAGAAAATGCAGCTTACCTTAGTCAC^ 

4 0 GGTAAATTAGCCACAGG CACCTTTGGAACCAGCAGC AGGTGCGGCAG ACCGG AC CCTC CTCGG AACGAGG AGCATGTG CAG AAGAG 

TTCGCACGGCAGGCTGGGAGTGCTGCCTCCAGGAAGGCCTGTGCTGGGGTTGAACC^ 
GGGGGTTCCTGCCACCCTAACGGATAAAAGTGCCTCACTGTGTCCAAACTATTTGTGCAAAC^ 

CTTTC CTT CTGGGAT CTGG AGTTTTTTGG AAC ACG CCAGG C AG AAGGTG C CT ATGTGG CCAG CACCC CCCAT AAAAAC C CAGGG CT 
CTGAGTCTCTAGTGAGCTTCTCTGGTTGACAGCATCTCAAATG 

4 5 GGATTCCACTGGAGAGGACTCTGGAAGCTTGGGCCTGGTT^ 

TCTGCATCTTTTTG CTGCAATCAACGGTAG CTGTG AGCGCAACTATATGGCAAGTC CTGCGGGCCCTCCTCCCGCGGAATCACTG A 

GCCTGGGGAGTTATTGGGGCCTCAACTCAGACAGGTATAAAAATTAATACACCTCCT 

GCTTTCTTTCCrTTTGCCCTCCCrGAGAT rra 

AGTGTTtX^ATAAGTAGGGGCACTGAAATAGCATAAGATTTTCCCCCT^^ 

5 0 GAGCAGGTACCAAATTATGTTATAAAGATTTCAGTATAGCTC^AGTCCAG 

GAAGGAGGACAAGATTTAAGGTGGATCTTAAGGATGATGCGGTAGGCAGATTAATGGCCCGCAGAGAGOT 
GAAACCTGGGATTGTGTTAGGCTACATGGCAAAC^GAAATTCATGTTGTAGATGGA^ 
TAGGGAGATGATCCAGGTGGACTCAATATAATTATACGGTCCTTAAACGTGGA 
GGAGGCTGTGGTGGGATGATTGTTTGAGACAGGAGTTCGAG^ 

5 5 CAGAGCAAGCCCTTGTCTCTTAAAAAAAAGAGGGGGCAGGAGAGAAGGGCAGAGTGATGTG 

CACAAAGCCATTGTGGGCTTTGAAGATGGAAGAAAAGGCCGTGAGCCAAGGCAGGCAGCTG 
C^TTCTCCGCTGCAGCCTCCAGGAGGAGCCGGCTCTGACGACACCTTCATTTTAGC 
GAAGAACrATAAGATTATAAACTTGCGGCCGGGCACGGTGGCTCTTC 
TCATGAGGTCAGGAGATCGAGACCATCCTGGCTAAAACGGTGAAACCCCGTCTCT^ 

6 0 GGTGGCGGGCGCCTCTAGTCCCAGCTACrCAGGAGGCIGAGGOVGAAGAATGG 

G AG ATTGGG CCACTGCACTTCAG CCTGGG CAACAG AG CC^G ACTCTGT CTCAAAAAG AAAAAAG AAAAAAGAAAAAAAAAG ATTG T 
AAACTTGTG CTGTTTT AAG C CATT AAGTTTG TG AT AATTTGTG ACAG CCAC C AT AGG AAGTG C ATTGAG ATTTGT AGT ATTTTG CG 
GGGTGAAGAGGAGGGTGAAAAGGAATGGACAAGCCrGTCAGGTCCCT 

AGTG AGAAT CCTGCATCCCC CATGC CCTACAGCT ACCAGGG AG ATGAGGCACTGGGAG AGTT AGTACT CGGCTGGTGX3TTTTGTGC 

6 5 CCTCTGIGGC CTAGAAAGTATGTAATGCTGCCATCCCTGTATG CG CAG AAAGATGGTTAATGCAGCCCCACTCAGGG CAGTTTGGGG 

GCTTACCTGAATTTCCCCACTGGGAGCCTCTCTCCTGACTTAACCT 
CCCCCAATGCCTGAAGACACACCTTCAGCCTCTCTCCTC^CGTTCCCCTGCCTTTCCAC^ 
TGGCTCTCTCCTGAGTGGCCCAGCTCTGGCCTATGGGACACGTC 
CCAGAGCAGTCCAGGCTCTTC<5CGCCACCACCCAAGTGCCCTGTCAATTTCCAG^ 

7 0 CTGGCTTTCTCAAGACTGAGTTGACACCTGTGCTCTCAGCAAACATG 

GGAAAGTGTGAAC-GGAGAGCTGGGGTCCGCAGCTGGAAAGACTCTCAC^ 

CTGG C CAAGGGGT AAATCTG CACAT GT AAGTCTCAT ACACTTTC^CT CTTCATGG AATTGG TCC CTGT AGAAG CCGGG CCAG AAAG 
AATCTAGCACCCTCTTXSCACTACAATT^ 

GGATTTCCACCACTCCTATTCCCATTTCGAGAAAGAGTACCCTTTC 
7 5 TTTGTGACATCATCAGAAGTGGAGAAGAGGTACCOU^TCCCATTAGCAA^ 
CCTTGCGGCATTGCCTTCCTCAGTGCTGGCTGCCAT

AGTGTTCTGGAGGCTGGAAGTCTGAGATCAGGATGCCAACATGGTCAGGTTTAGGGAGGGCTCC 

CTTTCTCACTGTGTCCTC^CATGGGCAGAGAGTCAGTGAGAGCAAGCT 

CCACGAAGGTCCCATCTCCAAATACCATCACGTTGGGGGTTAGGGCTCAA^ 

AGGCACTGTCCGGGTCAGAAGGCCTCCCAGGGATTTGG 

TGCTCCACTGTAAGAACTGGGCAGACAGATTGGCAGCTTAGGCCAAGCAAGA 

GAGATCAGGAAGATAGGAATTAAGGAAGGTGAAGAGGAAGAAAAAAGGACTCCATGAAGAGGAGAGGGTCTCTA 

ATAAGTGAGGAGGTGAGAACCAGTCTGAAGCACCCACCTGTGACCTGGTGGGG 

CAGACGGGAGGAACCCTTCCCATTCTTCATGGAGGCAGATTATCXaCTGGTCrGC^ 

TCCATTTTTCTAGACTTCTCTCATTGGTGATGATTTGACCT^ 

CTCTCACACATATACTATCTCACTCAATGCCCACCAATCCTAAGAGGCAGGAAC^ 

GTAGGTCAAAATTTAAATGTAGGTGATGTCACAGAGTTTAACCTAGTATTACCATCAATG^ 

CirCGGAAGGTTAAGTCATTTGCTCAAGCTCACACATTTGGTGAG 

TGCTGTGGTAGGCTGCCATCCATGGAAACAGGGGTTCCTGCGGGAATGTGTAAGAGA 

CAGGAGTTGACCTGTGTGGCTACCAATGGGAAGCCCCTGAAGACAJ3AAAGGTAG 

AGGAGGCCGGAGGGTAAGTAGGCCACTGCGAGATCAGGACGAAATTTACTGGGATCGTG 

CTGGGACCATGCTCTGCATCTTTTTGCTGCAATAAATGGTATTGCATGAGA 

TTITGTAATTTATATGACAAGAAAAAATGACAGGGTGATTTTTACATATGTGCATACCT 

GTGATCTGAGCTCACTAGATATGCTCAATTAGGTAATAATCAATTCAGGTCATCAAATTGATCACC 

ATG ACCTCAATT AAGG CAAAT AT AT ACAATGAAAT AT ATGCAG CTGTTAGAAAGGGCT AGATC C CAT AT ACT AAAATGGAAAGAAA 

GCCCrTTGAAACCTAAGAAAACTGTGTTrCCATTATGTGAGCA^ 

CCCTCCCCTCCCGTCCTCTGAGACGCxAGTCriTGCTCrcTTGC 

CGCCTCCTGGGTTCAAACTATTTTCCTGCCTCCTGAGTAG 

TTITTAGTAGAGACAGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGGACT 

AAAATG CTGGGATT ACAGG CATG AG CCACCGTG CCCAG CCAGAAAAATGGTCTTT CAATTGGAAATGT AATT C CAGAG AATAAAG A 

ATAGAACCACACTGTAATATGAC1AAGTAATAACACACACAAGCG 

CTGAGGGGCAGGeTCCGTGGAGCCTCCAGGCTGAGAGGCTGAGACTG 

AGGAGTGTCTCAAAGATTGAGACATTATGTTCATTCTTTGGGGCT 

TAAATACCCATTAAATAAGAGCAGAACTAAACACATCn'GGTTAGAAATTGTTCCT 

GGGGTTTCTGGAGCAGTGTTAATGCAAGCCAGTTCCTGTAATGTACATAGTGACACCT 

CCACTGTTGATTTCCTATATTCACTTACCCTTGTCTTTAAGTGGATGTTG^ 

GCATTTGTAAGTAAATCTGTGTAGGCTTAGGGCCACATTGATAAAAATATATGAGAGTA 

TTCTGATGTCCTAGATCCTCTGATTTCTGGAAGCTCCATCCCACATCGT 

CTCTTAAATAATGTGAGTGTTAAAACAATAGTTTCATTTGAAATTATTTGATC 

CCGGCAATCACGGCATTAGGGAATTCATACCAAGTGATGTCTAAGTTATAGrrTCAAAATATA^ 

ATTACAATTATGAAATTAACATGTTTTGGGGACCT 

CAGTCAATAAAATTTTCAGTCTCAAACATTTTCATAGGTCCCTGA 

CTGGGCAATAAAAGCCAATCGCGACCCTCATTTTCCCCCCATTCCAGGGCTTCCCCAGGATAGAACGCGGAAAC^ 
GTTTCCAGGG A l-lTCri'l ' CTTTTlCCAGTGTCXSrrCTCCCrCTCCCCCACTTCA 

GCGGGTGGTCCCACAGGGGACATCTTAACGCCCACCCAGGAGCGGCTGGT CCCACAG AGGACCTCTT AACG CTGACGCTTTCTGG A 
GAAAAGGATTTGTGATGAAAATTTCCCCAAGGGGACCTTCTAGCGCCT^ 

GGG CAG TGGC CT CCTTTGAAGCTG GGATT CTC CTG C C C CTCC CGGGCAG CGCAGG CCAC CGG TG ACCCGGG ACGGTCTGT AAACG A 

CCTTCGCGTCCCCGTCCCCGTGGAATAACGCGCGGCGGGCATCACTGCCCAATATCCAGCGCTCGTGAAG^ 

CAGCTAGTTTCTGGTTGTGGATTCTGATTAATGGGCCGCAGGCGCACGGTCTGCACCGTCCGGCTCGTTC 

GAGCCGCCGCACTCCC^GGGCTGTCCGCCCGAGCCGGCCTGGAAGGCAGAG 

TCCTGGGAAAGCCCCCCTGCAGCGCCGCCCCTGGACCCCAGGTGTGAGGAGCGGGGCTCGGC 

CCCCACTGCCGTGGCrcCCGGGGGCGGGAAAGTGATTTCT CGG AAAGCAGAG 

GACGCTATTGGTCGGTGTGCCCGTCGCTCTGCGCACCGCCCGTCCCCCCCACGACCCTTTTTAAGGCC 

CGCCTGG AGCGCGGCAGCAAGCGTGGGAACGCGGGCGGCG AGACGGCGGCAGGACGGC CAGGT AGGCACAGTGGG CGGGTAGGG 

CGCCCGTGTCCCGCGCGGTCCGGTCCCGCGG^TCCCraACGCCAGGCGGGGCGTGGGGGTGGAACTACCTACCGTAATGTCCCCG 

GGCGCAGCTCAGTGGCCACCCGTGTCCCTCTCTGCTTTTTGCCTATTTTC 

GATTGTCGTTTGCCTGTTTCCAAAGATTGGTAGATAGATGCCTTTTTAAAAATCTCATT^ 

ACGTTAGGAGAGCTCATATAATGAACGGCAATAGCAACCCCCTATCTTGAAA 

GGAATAATGCTTGGAGGGTCAGAGTTGTGGAACTCCCCAATAACG^ 

GCTTCTAAAATTCTGTTTAATGAC^GTCAATTAAGATTTCTGAGTCTGGCTTGAGGGC 

TTGGCAAGAGAGTCTGTATATGGGCCACAGCTCACAAAAGCATTGTTTGAAAAA^ 

TCCCAATACATAGGACAGACTTTCCTAAGGTGAGATGTGTTACrTACCCAGAGCTC 

TGAATTTTCCAGAATTTTAAGAAGTCTCCCCJUVCCAATGGCC 

AATG ACAGTTC AAAG CTTGT CCTTTTGGGTT ACGG CAG CTTCCTCTATG AAAAG ATTTGGG AGTG ATT CAGTCACTTG AGGAAGG T 

TGGTGCCAGAGAG&GAAATGGCTTTGGTGCTTTCTGGTCTAGTGATGCGGGTA 

AGGCGGGGGAGAGGGTGGGGTCTGCTGCTGCCAGCTCAGG^ 

ACGTCCCTC CTGGAAAGCCTGACTGCTGTATACATTTCT AGGG CTGCAGATAGGGCTTGGCCAAT AG ATTACCTCAAAATCAAGAA 

GAGGAAGTTATAAACATGTGATTAGACTTATTAATTTTTCCTGGCTGGGCATTTG^ 

ATTATGGATCATAACTGTAGTAGAATCTCAGGCTTTTGATGTTACITAGAATCT 

CTGCACATGTGCAGAGATCTTGTAATGTAACTATGGCTTTTCT 

GGCCTGGGTTCCAACACAGTCCGCTGGCTGCCTCGGTATTTGCCGTGCCT 

GGG ACTGGATTAG ATGATTTCTAAAGT ATTTT CCAGCTTTCAGATGCTATGACTTACT CTTCTCTTTTTCCTTTT AAAATTTCTTT 

TTCCTGTAAATCACCAAGATAGTTGTCCTGAACAGTTTAGATGTT^ 

TTTCTAGAAAAATGCGTGTACTAACAAAATTTTTGCTTT^ 

TAGCAAGGAAATCAGGGGAGTTGAAAGTCTCAGAAGCCAACCCCTATGTTGTGTTCTGGCC^ 
CTAGGAGAAATTTTTCTACCTTGCCCCAGGAAAGAGCTGCTA^ 



AGATACAGCCTCTCAACTTACAGGTGAGGAAGCCTCA^ 
CCAGAGTTCAGAGTTTTCAAACCAAGGTCTAATTTGGGTGGGCGCAGTGGCT 

GTGGGAGGATCACTTGAGG CCGAGAGTTCAAGACCAGACTGGG CAACATAGCGAGATCCTGTCTCT AAAATGTATATCTGGGTGTG 
GTGGCAT ATGTCTGTAGTTCCAGGTACTCAAGGGG CTGAGGC AGGAAGATCGCTTAAGCCCAGGAGTTCCAGG CTGCCGCGAGCTA
TGATGGCACCACTGCACTCCTGCCTGGGTGACAGAGCAAGACCCTGTCTCTAAAGA 
CCATAAACGCACAACTGAGGTTGTTGGATGGGGAATTTAAGAAATCCCACCTCCTCCT 
CATGGCCACCTGCAGGGTGGCTGAAGTTTCCAGAATGCC 
5 TGGCTCTACCATATTTTGGGATGTTATGTCTTGTTACTGCTGATTGTT^ 
AAGAACAAAAATGCTATTTTTTTTTTTTTTT 

TGGCTCTAAGTCGATAGGTCATGCTAGTTCTCTCTCTCn'TTCrCTGTG 

CCAAAGGATGTCTTTGGGTGCCCTGGGATGGTTAGGTCCCT 

GTTCGGTAGCTGC^GTTTATTTCCTCACrCTTGTCCCTATCT 
1 0 AATAATGACATCTGGCATGTAAGGTCCAGGCCCTGTCTCAGTTGCTTTCCGTCT 

AGTGTTTTGTTAGCTGCATCTGGGACTGGGTAGCTGATGCCCTGAGTT^ 

GGACTTGAATGAGATCGCGGTTCTGTCTCATTCTCAAACTGCTGCTC^ 

TGAGGGCCTCCTCTTTCCC^GACGGCACGGACGCAGTATTCTATCAGGGAATGTGGGGCC^^ 

TTCITCGGATAATGGGCTGCTCAGTGAAGGAACCTGGTGCTGTAAAAGTGATTC 
1 5 AG CTGTT CACTCTGGTTGT ATCTGC CTGAAAGG AAAG GG CAG ATGAAAGCTGGAAGGAGCACAG CCAG CACAGTGGGAGGGGCCTG 

CAGGGGGCTGCTGGTGGCTGCAGGTGGCTGCAGGTTTCCCACGGAGCnTTCAGTTTGCACT 

GCATTGCCTTCTCATGGCAGGTGTCCCGGAGTCCCTGAATCT^ 

TTCCTGTTAGCAGTTTTTGGGTTGCTGTGATGAATGAGACAATATCCGTAATATCACAGCGTGTAT^ 

GTG ACCGGAATGGTGGT CGG CGG CTT CG ACAGTGG CT G ATCG AG CAG ATTG ACAGT AG CATGT ATCCAGG ACTG ATTTGGGAG AAT 
2 0 GAGGAGAAGAGCATGTTCCGGATCCCTTGGAAACACGCTGGCAAGCAAGATTATAATCAGGAAGT^ 



TTTACTTAAAAATTAATCCAGTTTTTTAAAAGCAAACATTGGTTCAGGAGTC 

TCACATCTCTTGAATTACGGAGTCACTTTTGTGGTCGCTTGTATTCTGATGAGGAGA 

GCTTCCTTTTCCTGATGAGCCAAGTGGGAGAAAATACTTCX3AGACCT 

2 5 GATGCATATACCAGGGAAGGTTTCAGTCCTTGTTCTTTCAAAGCTGATACTTCTGCCT 

AGATGCCATCTGTCATTCCTTTGGGGGCTGGGCCXaTGT^ 
TTGAGAGTAATCGGTTCCCCATGCGTGGAGAGACGTGATGGAGGGTTTGTTC 

CACATGATCCTGTGTGGTAGATGCTGTTCCCACCCTCATTCTATAGAGGCACTGAGAGGAGAAGTGA 
TCTTGTATGACGGAGCCAGCATTCAGATGCAAGATGGCCCAACTGTGAACCCTTGGTCCTAACT 

3 0 ATTTCTTGCCATTACACCTAGATTCAGACAGAGGAACCAGGAATTA 

CTTGTGGGTCATCAGAGTCCTGTGTGGGCATCTCTAAGGCTCCAGATGTCCAGGTTTCACCT 

AAGGTCAAG CTAGACTTCCACGTTTGCCAGTGTGG ACACTACACGTAGTCTG AGTCCT AGCCCAGGTTATAAG CCATTGCCATGTG 
TGCTTATTCTCTTGCATTTGGGGGCCAGAGATGGGGAGGGGAGAGTGAGTGTGGGAGAGGGGTGAAAGG 
CCAGAATCAGACCTGAGACCACAAACAAGGTGTGGAGGGGGTCAAGGCTGTAGGAGCC 
35 ATG ACTGTC CCCTGACACCAGAGAAC AAAAATGACAAAAGATCACACCTT AACCCCAG AGT C 

TTAAGATGCATTTTGGTCCTGCACGTCTCTGCCTGACCX3ATGTTTGCAGTG^ 

ATTTG AGGTTG CTG ATCTTCTTGAGAATCTGATGCCC ATTACTG ACCACG CCAAGGAATATGGGGACAAGGGAACGCTTTG CACTC 
ACTTTCGAGGGGTCCCTTGGTGCC CCCACACGTTCC AGATT CTTTCTG AT CAAGGACC CCAGGT CAGGGAGCCTCAG CACCAAGAA 
GTTTTGTGATGCCACTGCCTGCCTCCTCACCCTTAGTGCCATTGTGGATTTCTCTCTTGTT 

4 0 T TTl M r rTTTTTTTTTGCCTTTGGCATCATGGTGAACTrCTCTTGGGAAGGACTGCAAATC 

CGGTTGGGAACTCTGCATTATAGATAACCCCAAAAG CCGGAATTTCTCTG CCTCGCCACTGTTG ATGTCTGGGTCTGGGCAATTCT 
TTGTTGTAAGGGTTCCCTATGCCTTGTGGGATGTTTAGTGGCATCTCTGACCTCTACCCA 
TACTGTCACAATCAAAAATGTCTTGGACATTGCCAAATATCTCCTGGTGGGGC^ 
AGAGCTGGCCACTGCCAGGATCAACAGACTTAACCAGAGGTGTTGGGTGCTGAAGGCACAGTTGC^ 

TCTTTAAGACATGGCAGAGGGTGCTGGACTAGGCTGTGTA

CCCATTCCCTACTTCAACTTGTGCTCGGGGGGTAATGAAATAAAGCCCCAGATTTATACATCT^ 
TTCAAC^TGTTGAGAAAAATGTGCAAGATTCCCAATGTTTCCGTTGGTTACG 

ACCATCTCAGCAGGAATGGGGCCAGGAGGGAGACGCGGGTGCCCACTGGAGCAGGGAGGCCATCGGGA 
GAGACATGGCGCCAGCAAGAGCCCATCATGTGCTTCCCTGAGCTATGCTTCTTCCCATCCATCCTCTGAG^ 

5 0 CGTTGGCCGGATAGATCTTTGGCTGTTGGATGAAGTCAACCTCGGTCAGGTAGAGCCATCTT 

CTTGTTTCTTTCAACTGTTCAAAGAAAACCTTTCAACAATT^ 

AGTGGATGGTAAAAGGGGCCACTTAT ACCAG AAAATCCCTCATTGTCTGCAGGCAT CATCCTG AGGCCGGCCT CATGGT ATAG ATC 

ACAGAAGAGAAATTGAAAAAACATTGAGCTCATTTTGTACCGACCCTGAGTACACT 

TTTAAAAAGTACTTTACCTTCTTGACATGACGACAAGGGGGGAAAATCTAGT^ 

ATCGTAACAGTCCTGAATGAGTTAAAGCTCACTTAGT^

AATTAAAAAAATGCAACATGAAACAATCATATTAAATTAGGAAGACCTCCTGGTCITAGT^ 
TGAACAGGAGAAATAAGTTATGGTGGAGGCTGAATGAGCCATAGTGACTGTTGCTGGC 
TCTCATGTCTAGTTTTTGCTCTAGCTCAAGAAGGATCTCGAGAGATCATCAAGCTAAGTTGACC 
TGGGAGAGGGCCAGGTGCCTGCTGTGTCACAGAGTTCAGGGCAGGCGGCAAGCCAGGTACCCGAT 
60 GCCGTGGGCCCTGCTGTGATGGATGATGTGGCCTGGACTTGCTCGTGGGGT^ 

GTGCCTTGCAGGCAC CG CGGTTGCGTATGTGT ACC^GCGG ATCACAACGTTGCCATGAAATCT 
ATGTGGGTTAACTCAGTGGTTCTCAGGTGGGGGAGGTATCATGATTTACCCTCCAGGG 
TTGGTTGTCACAGCTAGGGGAGGGGTGCTATTGGCATCTGGTGGGTCAAGGCTAGGGATGCCG 
ACCACCCCGCCACAAAGAAGGGTCCCAGCTGATATGTCAGGCTTGCCACGGTGGG^ 

AACTACAGTTGATTTAGTACATGGTTTGCTGGTTTA

GTGATGCAGTGAATGAAGAGAGCATTAAAGAC<XTGTGCTTTAAGA 

GAAACTACCACTT ATG CTCACT7ATGGG AATGTTCTTCCAGTCTC1TT ACCCCAGGTAGAGGTG CAACATTTTGCATCAACAAAGT 
AGGCAGCATTGGCCTTGAACAGTAAGCATTCTTCTCTTT1TT 

7 0 GGCCGGGCGCGGTGGCTCA.CGCCTGTAAGCCGAGCACrT^ 

CCTGGCCAACATGGTGAAACCTCGTCTGTACTAAAAATACAAAAATTAGCTGGGTATGGTGGTGCACACCT 
TGGGAGGCTGAGGTAGGAGGATTCGCTTGAACCTGGTAGGTGGAGGTTGCAGT 
GACAGAGTGAGACTCCGTCTCAAAAAAATAAAAAAACTAAAATAAATAAAAATAAAAAATAAAAAA 
CCTCCCTTCCCCAGTGAGL\ATCTTGAOGG^ 
GCATTAAGCCTGAGLAACACTTGCTGTGGTCGGGAAACT
CCTTGGGAAAGTTACTTACCTCrrTGGAATTGCATTTTCCTGAAGTGGGTACAGCT
AAAGTTACTTACCTCTTGGAATTGCATTTTCCTGAAGTGGGTACAGCT 
CTTACCTCTTGGAATTGCATTTTCCTGAATGGGATCGGGATGTC 
ATTTTTCCTGAAACCGAAGATAATTCTCATCTGACTGTGGTTAC 
5 ATCTGTCCATTCAC CTGTCTT ATTTTTTT AAAATCTGT ACIG CAGCTTTG AATTT CT 

GGTCCTAGTTCAGATCCTAAGGTTTGGAAAAGTCAGAAAACAGATCTATTAA 
ATTACATTTTAAACAGCACCTGGGGGGAAAAGTCTAGAATAAATGCAAAAATG 

ATGGATGACCATTTTGACTTTTTGCAAATGTTCCCTTTTCAT^ CAACAACTTGTT CTT 

TGTTTTGAACXjCTCTGGTGAAACCATATGC^ 
1 0 GGGCTTGGCTAAAAGGTGCACTCCAGGGTTGGAGGAGCCTTCAGC 
CCCCTCCCTGGCTTGGGTCTITCCACAT 

AATTGAATGAGAGTTGATGGCCAACAAAGATTCATGGGAAGGGAAGGATGTGTG^ 
TGGAGTCGGCCATGAATTTAATGTGCTTCTGTTTCTTTCrrTCAG^ 
GCTGAACCAGCCACTTGGAAGACGAGGTTACGCTGTGCTTTGAATAAGAG 
1 5 GGACATTTCCGAGCCATACAAAGTTTACCGAATTGTTCCTGAGGAAGAGCAAAAATGTAACT 
CAGAAGCTGCCCTGGCCTGTAGCCTTCACCACAGAACTTGGGTCTGGGGTAGACACAGG^ 
AAAGCAGTTCTCTCCTTCGTCTAAGCAGAAATTGAGATCT 

CCAACCTGTGCCCCACTTGOTGCCTGACCACAGACATTTGTAGGGCCTCCCTGGA 
GAGGAATCCTTTCATTTTCCAGCAGCTTGCAGAGGAGGCCAGGGGAAAATATGCT 
2 0 GAGTAAAGCTCTCAGATGAAGTTGTCAAAGATGGCACCATGGCCTTCGTTGTTGGAGATGGGATATGCCTCA^ 
CCTGATTGTTATTTGGGAACTCCGGGATGTGATCATTCCTGAGGTTTCTTC 
TTAAAGCAGGACAGCAAAAACATGCCGCACCCTGGTC 

GAAGGCTTCTCAAAGCCCTGTGCTGTGCCGCTGCCATGAATTGTAGCTGATTTATGAGAT 
GAACATGCTTTTAACACATTGATTTTCrrTAGCCACTTGATG 

2 5 GAGTAGT CCACGAATG CCTGTAAG AAACAGAAACC CCCAAAACTTCTTGGGGCTGTG C AG AAGTTTGAAGTTTGCACTCTTCCCGC 

TGTGGGGGTGTTGACTTAGAATTTTTTAAGAAGTTAAAATGGTGA 

AATCG TCATGAAGGAGGCAG CATG ATGGT AAAG ATGGTGGACAGAGAATC CG AGGTG CTGGTGAATGTGG C CCATGAACG AG AAAG 

CTGAGCCAAAGCCGATGCCCGACCCGGACAGTACAATTGCGGTTTGGGGCGTTTGTAGCCTAATCGTCT 

CTACTGAGCGTGAACCCTAGTCAATACTAAATAGTAAGGCCAACAAACTGACCTTCCC 

3 0 AACCTCTATATAATAGTCTGGGGTGACCCTGAAAGGCTAAGCAGTGATCTTTTTGCCTC^ 

ATCACCAAAAAGTAAAGTGTGATTTGAAGGAATAGAGAAGTTTCTTT^ 

CCGGCAGGTGACCCCAACTTGTTTATAGCATGGGGTGCAGCCAGCATGACCTGCTAGTGATGTTA 
AGGGGTGAAGCAATGTAGCATTTGC^TAATTGCCTACAGGGGCCCACTCGCCTCTAACCT 

CG CCCCACCTGCTCCCTACTTC CCCATCAGT AGCTTCTCAGTGAATTCTTTAATCT ATCGAAATGGAATCAAAATTCAGCCGG^ 
35 GAAACAGTGGGTAAATTCTGTAGGTCACTGGGAATAATGAATTTTTGTAAGTGTTGGTTGTGA 

AT AAGGT AAG TTGT CGACTGAGG AAG CX3GCATG AG AAGG G AAAG AG CAGAGG CTGG C CCAG CGTG CC C CTGTGGACACTGTG CACC 
TTTAGGCGGTTTGTGATTACGGCTGGTGCCCACTGGGCATCAGTGAGGGTGCTGCGGTGTGTGTGTGTGCATGGCTC 
ATGCACATGTGTATATTTGGAAAGGCCAAAAAGAGATTTCCTGCCAACTGGCAATACA 
TGCCAGGAAATCGGTGAGTGTTTTCAAATGCCCTGGCTTAT 

4 0 CTGATACAGACTCATTTTGGATCTCAGAAAATAGAAGAAAGAGACCAAAGTGCTGTCTTTATCCT 

ACCACCACCACCACCACCACCAACTTCTCTCTTTTGACTTTGCT 

CCAAACCAATGAAGACACTCACTAACTTAACGTTTCCTTAAACTCAGCATTTACAGAGT^ 

TGCCTCCGTGCCATGTGTCATGGTGTTTGTCTGGTTTTCTGTAGGCAAACTAGGCGTGGCAACTGCTGGCTGCGTG 
CAGAGATGGAGTGCGGTCGCTCTGAAATCGACGAGCTGATCAAGGAGGTAAGCAGAGGCAGCATTTCAGGGGTCTGGCCCTGCCAG 

ATCTCCACCTGTACAGATCTGGAACGGAAGGACTGGCTGGGCGCCCCCTCCCCA 
GCCAGATCCCTCCTrCATCCCTAGGGCTGGGCCTGAGCTGCCGGTTATGTGTCAGTGAAGTT 
CACTTTGGACAGCAGTCTTTTAAATGGCATCGTGAATTCCCTTGCCTTCCTAGCAGGCTA 
GCTGCCTGCGCTTGTCATGATGGCGGGTGTTGTTGTACCAGCTCTCATGGCTATTTGAT^ 

5 0 TTCCTTAACGGATTGG AAAG CTGGATAACCAAGCATGTTCACATTG CACATAAATT CCTAACAGGCTACAAGGG AG AGCAT AATCG 

GGGCCTCCTTAGGCAAGAATAGATGGCTTTTGTATAC^ 
TATGCTTTTCATGGGTATTACAAGGTGGACATTTGCATGACTAAACCTCGT 

T AAATTCTTCTTCTTAATAACATAG C AATAATGGAATCCTTAAG AGTAGCTTGGCTGTC AGAGAATG A C 
TACCCGAACGGCCTTGCTCTTCTGCACAGGCAGCTGCCAA^ 

GTGGCCCTCCACCAATGTCCTGTTTTCTGTAGAAGGCTTTT

GGGTCACAAGAAAAGCAATTTCAAAACATTTCATAATTGGATT^ 

ATGTGGCTCGTTGAGCGAGATTCTTGAAAGCATGGCAGCCTTTGGCCA^ 

GTCCAGGGCGTTGAGGG(ZAATTGGATAACCTGGAACACTGGGTCTGCAACAGGC^ 

GGCCCCCAAAGAGCCTAGTAATTTGGCC!ACATGGTTTGAATGC!T AAGCCTGGTAGATTTAAGGCCT CCTTT CAGGTTG CAAA 

6 0 GTGAAACTTGACTTTTGTCrCCTGAGATAAAGGTGACAATATGGT^ 

CCCCAGGTGGGAGATCAGTGACrGAGCTGCCCTCCTCTCCCrCCCCTGACTGTGCAGCCT^ 
CAAAAGGAGCCCTTCCCCGCCGGAGGCCTGTCGGAGTCAGCTCCTTCCAGACTG 
GGTGGCCTAGAATTGTCT^CCAGGCATGGCCTGAAGGGTTTACGGCATTCCA 
CATGATCCATGTCTCATTAAAGGTAGCrCAGCCCTGAGAGGTGAAGAACG^ 
65 CTGTGCCTTTGGCCCTGCTCTCCGCTGCTTCTCAGGAGGGCACA^ 

CGCTCCTGGGAAAGTCCAGGGAAGAGGGTCTTAGTGAACTTGTGTAAGGCCAGAGGGAAGATG 
CCrCCCTGGGTACTCCCITAGGTCGGCCAGTCTGTTACGACATTAAGGACC 
GAGAGGAGAACTCAGACCAGGAACAGGAAGTTTCTTCAGAGCCCT 
CAGTTTATTCATCCGTAAAACAGAGAAACCAGGACTTGCGCCATGG 

GCTTCCTCCCTCGGCGTCTTCCCCCT 

TGCCAGGCCTTGGGTGTCGAGTAAGAGGTCCCTGGAAAGGCACCrCT 

ACCCCGAGGCTCTCCCCCAGAGATAAGCCCTGGGTGGTGGGTATGCAGTCCA^ 

GGAAATGTGGCCCCACAGTGTGA.C!ATTGACACATGGTCAGAAATCCCACA^ 

7 5 TTCTGCATTTGTGAAACAATGGCrCTCTGCTCnX^ 
CACG C CATGTG CA.G CCTTTT AAGGGACTTTTGG AGAAGGAG CG ATTG GGGTT ACT CC CTG T ACAC CACAC CTGGGTGGCT CTGAG C

TTCCnTTTTCTGTTTCTCCTGCAGG CGTGCCGCTGGTGACGGGGTACACCACCTACGACGCG CACCATTCAGGTACGGGGTGGTCCG 

GGCTGAXSAGGGGCGTGGGCACTGTTTGAAGACZAAAGCCCAGA^ 

GTGTGGAAGTCTAGCCCTGGTTCCAAGGATGAGC^GGACCTGG 
5 GTGGAAGTCTAGGCCTGXSTTCTGAJGGATGAGCAGGACCTG^ 

GGAAG TCT AGG C CCGGTTCTGAGG ATGAGCAGGAC CTGGTGTGGAAG TCGAGG C CCCAATTT CTCCTC CTCAC CATTTGGT CAGGA 

GCTGGCXIAGTGGCCrCTTTTGAGGCCACTTTTCTTGGAC^ 

GAGAGACAGTGAGGGACAGGGCAGCGTCTGTCCCAGCGCTGCCTCTCTT^ 

CTCATCTCAGGCTCCCCTGCCCACCACCTCTAGGTGGATG 
1 0 TCCAGACTGGGGGCCTCCCCGCGGTGGGCAGAGGTGGCCTTTGTGTGTAGTCT 

GCTGCCCCTGGAAGTGACGCTCAGGCGTGGTGACGGCATCTTCTCAAC^ 

TGGGAGGCGAGCAGCTGGTCTCATCAGATGGGCCACAGAACGTTAGGGCTGGGCCTCT 

GAAGCGTGCTGGCTGAGGACTGCATGGGACAGTTTTlTrGGGTAGCAGAGGTCGGTGCT 

CAATGCCCTGTCCATGAAGCTCTTGTCTATTTCCTGGCAGCCCTTTGAGCT 
1 5 AGACTGAGGCTGGGAGGTTCAGGATCTCATCCAGGTCGCGCAGGCCTCTGGCTGGAG 

AGCCGCCATGCTCACAGGGTCTCAGTTCTGGGAGGCCTGGGTCATGAGGCT 

TGGGAGTCnKrTGACCAGTGCTTTCCGGGGGCrGTGGAGTAGCC^ 

TTTGGGG AGAGCATGT C CTTG AATT AGT C CAGCAGGGT CAT AGC CACTG C CT CATGCACAGAACAT CCTTCT AGAAG CTGTGAG CT 
TTGGTACCTCTCTCCAGCCCCGCCCCCACCTCTATGTAAATCCATGCTTCAGTT^ 
2 0 TACCITACAGGGTTAAATGTGCTGGTAGGTCCCAAGCC^ 
ATAACrTTCAGGAGTCTTGTCTGTGTAGATAAGCAT 
ATACAAGCTCCGATAATCCTCACAGCAGCCCAGTGAGGTGG^ 
AGTGGGGAAGCGGCTTGCCCCAGGTCATCCAG<TTGGGC^CA^ 

TGCTGCXKKjTGTCTTTGTTGCTG CCTCTGTGGTCAGGCCGTTCCCCAGCAAGG CTCTTCTT<5TGGGTGAGTGGCTCTATAAGGGCA 

2 5 GGGCTGGGTCGCAGCAGGCCCTTACTGTTGC^AAGGCTCACCT 

ACKSTACAAGCCIXlACAGCrGGAACCTTCT 
CTGAAGCACTTAGCTGTTCCGGGTGAAGCTCCCAGT^ 
GGCAAAGGGCCAAAGGGGTGACGGAGTTGCCTGAAGCGTTTCATTCATT^ 
GGGCAGTGTGTGGAGTCCGCAGGCAGTGCAGAAGTTGCAGACTTGGCGTGGGG^ 

3 0 GGGGAAGATGGTGAGCAGGGAAGAGCGAGACCAGTGCTGCCATTTCCT^ 

TCAAATCAATGAGGTGGACCAGAGCTTCTGCAGCTCTAGCTTCCTACCCrGC^ 

GCAAATGGAGCCCGTCAATGGTGGGTTGTCAAATAATAAGGCAGGGAGCAGAGCCCAA 

CAGGAGTGTCAGCAGGGAATGCAGATGGACTCAGGGCCT 

TAGGCACCTCACAACTTGTGTGGCTCCACAAGGCAGCCTCGAACCtSGTCTGGGGTACTT 

ATTGCTAGCGTATGTAATTCTTTTGGTTTAAGGCAA

TAGATTAGTGTTTTCCAAACTTTGGTCACTTGAGGTCCAGTGTCACAATTGGT^^ 
TTTCCTCAAATCTTAATCTTAAAATAAAACITAAGTTTATTT^ 
CACCTGGTATAAGTAGGTAATACTGTCTGGCTAGAATTGATTAAGATTGTATTTC 
ATGCCCTGAATTTGGAGCCCTAAGTTCMTCCCATTATC 

AAACTGAGGCTTGCTTTTTGAGGTAATCAGAGGTTGGAAG

CCCATTATTCTGTGTGTGCCACTTTTGGGGAACCCCTGCTTTAGTTTG 
GAAGGGTTTTGCAGAGATTCTTAATCTGAAATCCAGGAAATACCAAGGGCGCT 
TTTCTTTAACTTCAGGTTTAAATTGAGCATTTCCTTTTGTTATGAATGTA 
GACGGAGTATTGTCATAAGAAATX3AGTTGTCGAAACATCGTTTGCCCTCAGCTTGACGT 

TCTAGACCTTGTATTTAAAGAGTCAGTGAAGAAGTACAGAGA

TTCTt3TGTATTTATTCATGCATATAAAAACATTACTTCTACA^ 
CACAAAGAGTTACCCGTGTAGGTGTGAGAGAGACTC 
TGTCCCTCTTGTCCACAGCATTCTCCCAGATGGTGAT^ 
CCCGAGGGCTGCCGCCTGTCCCTGAGCCAGCCTGGGCTG 
50 CCCGCCGGCCGACTSCCATCCCCAGCGAGCGACAGAGGCACXSTGACGCGGAAGCT^ 
•AGAGGAGCCGGCAGGGCGTGTTCGTCAAGCGGCTGTC 
CCCAACAAGCTGGAGCGTGATGAGGTGGTCCAGGTCTTCGACACGA^ 
CCCCACATCTTAGAGATCACGCCTCCTATCTGTGTGCC^ 
GGAGTGAGGGGCATGGTGTGGCAAGGAGGTGCTCCTTTGAATGGCACTCTG 

TGCCTCAACTTCCCCATGTGTACAGTGGGGGTGCA^

CATGTACCTTGGAGTATGTTCCAAGGATGGTCCCTK5AGCGCTGGCTAGAGACACAGAGTCTGAGGCCCT 
CGAACCTGCATCn-AGCAAGGTCCCTGGGGATCrGTGTACAGGTTTTGAGACA 
GCAGCrcCACACACTAAGTGCTGCCTGTTAGCTTTCGTTACGCTGTAGGGAGGATGTGT 
GGGTTAGGATGGCGTCTCCCrTCTCCACGTCCCCATGCTCTCTGCrCCTCrC 

6 0 TCAGCTGCCATTGGCCTGAGAACTGCCAGAGGAGGGGCCTGGA^ 

TG AGG AATC CCCCT AGTGAGGAGGTGTG AGGC CGCGGTT ATCACCCCTG AGC^ 
ACATTCCCGGGCCTACCTCACATCCTCTGATTCC^^ 
AGGCGACTGCTGCCGGTCCACCGGCCCACTTTGAGACCCTTTCAGT^ 
GGTGTCCTCAGGTCGGGGTTGCCAACTCAGTTGCCTCCAAGGG^^ 

6 5 GCACTGGGCAGCTGTGCCCATGGAGAGGCGTTCAAACGAGGCAT^ 

CTTGCTl'GGAATTTGGCCCAAGGGCCACCAGTTTGAGATCC 

CCCCAGCCCTGCTGCTGCGGGAGTTGGAGGTCIATCTC 

GTTCTATAACAGCCAGGGCCGGCTTCCTGACC^CAGGGTGGT^ 

CCAAACTCATTCTCGTGCAGGTAAGTATGGGCAG CXT r rr iTT rrrr TTT 1 1 ' 11 11 T T TGAGATGGAGTCTTGCTCTGTTGCCCAG 

7 0 GCrGXaTGTCC^UVTGGCGTGACCTTGGCTCACTGCAACCTCCG CCTCCCGGGTTCAAGTGATTCTCCTGCCTCAG CCTCCCG AGTTG 

CTGGGACT ACAGG CCCGTGCCAC CACG CTCAGCTAATTTTTTGT ATTTTT AG T AGAG ATGGGGTTTCAC CATGTTGG CCAGGATGG 

TCTTGATCTTGTCATCCTCCCAGCTTGGCCT 

TTGAGCCCTGCATCCAGGTCTCCTGGCCCrcTCATC 

CTGTGTAGGAGGCAGGCCTGGGCAAGACCITCAATATGCGGGAAGCGGATTCT 
7 5 CGCXSCACAGTCTGTGTGTGTGTG' l , rri , C , l ''X"r'l''l TC I i f fl i 'GGTTTITAAATCTAAGCCTCTrCTCCAGAAAGATTCTACCTGTTC 
AGTTGCCCGAAGGTCTAAGTGACAGATTAGCX3CATTGCCA

CCTGGGTGCTGGCTGTCCCAGCTGGGACAGTACTGGGGAGTATGGAACAGTTTTGGCAG^ 
CTGGCTCATTCACTTGGGACTCACGTGGCACrGGGGTCATCAGAGGACCTGGCT 
CCTCCGCCTCTGCCTCTGACTTTCTGCACCTCCCATCTAGATTGAGCAGCTGTATC 
5 GCTGTGGAGCCGGCTCTGTGATGCAGGCCCCCGAGGAGCCGCCGCCAGACCAGGTOT 
CACCAGAGATCATTTTTCAGAGAAAACCAACAGATCACCGTCTAAGTC 

TAATAAGGGCATTCTCGGAGGAGTAGACGTTTAATACGAAGTGGCGGCATAGCCCT 

TAACCACAACCTGTGGCTAAAAATTTTATTTTCTATCCTTTAC^ 
1 0 TTAACTATCATTTCCAAAGACTTGTCATTCAGTAATATTAGCAGATAGCTG 

TGTGAAAACAAGGTTGTTTTTGTCTITATCTTTTGTTAGA 

TTTTACTTTTATGCATTTTAATAAGATTTAAAAATATTTAGAT^ 

AGAAGAAAGTATATTCTTTCTAGAATTTCGTGCAGGAATA 

GAAACCTTCTCCTCTGGGGCTTTGTTGCCAGGAAGATTAGAACTAAATTT 
CCTCTTTTCTTCTTTCCAAATGGTTTTCACAT

CTTTGTGCCCAGCCTGAGTCAGC^CCATCCCGCATTCAGA^ 

TTAGGGTTTGGGAGAGAGCTCATCCAATCrCCC^ 

GCCAGATATGCCTGTTTCCTTTTCCCAGCACCATGCCTGTGGAGGGGACAGTC^ 
CACAGGCGAATAGAGGAGAGGACCAGGGGACGTGGCTTGTCCCTTTTGTC 
2 0 TTTGCTGAAGTGTTCATAAGATAACAATAGGCTTGAATCTCCAATTCAAAT^ 

AATATTTAl^rTl'TGTATCCATTAAAACAGTATATTGATLrrtri Tl TATTCTTTATTAAAATAAAATGCTCTTTTTTAAAGCTGCTGT 

GTACTGTATTTTTTTAGTCACTGTCAAATATGATGTAATTCTCACCAGATTCT 

TGCAGTGTTAATTCAGTGCACT<X^CTGAAACAATA 

TCTGTCACCCAGGCTGAAGGGCAGTAGTACGATCTCGGCTCCCTGCAACTTCCACCT 

2 5 CCTCCCAAGTAGCTGGGATTACAGGCATACGCTACCACATCTGCCTAATTT^ 

GCCCAGGCTGGTATCGAACTTTTGGCCTCATGTGCTCTGCCCCAACCTCTCCCCCAGC 
AGCCACCTCACCCAACTCCTGTTTTAAGGATTTTTCITTT^ 

TGATCCTATGGAATGAAAACATTGGAGAAT GTl ' 1 'lT A TTCCGAATGGGGGGCTGTTTG 
CGGAAGGGTGTGGAGGAAGTGGGGTCTTGTGCTGATTTCATCCT^ 

3 0 GGGTCAGCCACACGTGGCTGGAGTGTGTGTCCTATTTACGCACATCTCGCTGTGCTAG 

GACCCAGGTTGGCAGTCACAGAGATGACTTCCAGCCTAGTAGGACATTCCCATCCATCAAC^ 

TACGAGCCGA.GTGAGACTGGGCCTGTTCTTCATTTCTAGCTTACA 
GAGCCATCTATGTCCCAGGAGGACAGCTTCGCCTGAGAGGCTTCA 

3 5 AACACCCCACGCAAG AGGTCAGCAG CTGGGTGCGTCGCTCACTGTCITAGGC ATCTTTGCCATGAGTCACACTAATGTC CGCACGC 

TGCAAGTAGGAATTTTGTTTACAAGC^GCTCTrCTCCATCCTGTGGATTCAAGGC 
CTCCCCAGAGTTCAACGCTCTCGTGCAGACGGTCGACAAATCCTTCCTACTCAACT 
AGGGGAGTGTTTTCATITGCAGAGGAGGAAATTTGGGAACAGGATG 

CCCTGGATGTG AACTG ACTTTTG AGATGCTGACC CCACACTGGC CA. GATGGG C 

4 0 TGACGGGACATCCTGAAAAACAAGCCATCTCCACTCTTGGTAGTTTAAAAATGTTT^ 

TCTGAGTGATTGCCACTTAGTT CATTAACTTTTGCAAAATCC CACTTTGGGAGGCCAAGACAGGTGGGTCACTTG AGGT CTGGAGT 
TCGAAACCAG CCTGGCCAACATG ATGAAG CTCTGTCTCTACT AAAAACAAAAACAAAAACAAAACACAAAAATTAGCCAGGTC 
TGGCAGGCACCTGTAATCCCAGCTACTTGGGAGGTTGAGACAAGAATCACITG 
ATCACACCACTGCACTCCAGCCTGGGTGACAGAGTGAGACTCTCTCTCAGAAA 

TGGGAAAATCTTCCTGAAAGGGCTTCACCGGGCAGCCACCCATGCCTGCC^ 
GGAATGAGGAAGAAATGACTCCTGTGCATGATTCAGACTCTCCATAA 

AATTCT AAAGAAT AAT CT AT AATT AATGACTGT AATTTT AACAACCCAGTGGG CTGAG AG TGGGG TGTGG CAGTTTCTCAGTGGAA 
AATCTGGCTGCTTTTACTACAAGTAGGAGGAGAATGACAAATATCCTGGCCCCAGGGCACACCT 

CACACCGTTCATTAATCTGTTAAAAGGCCTAGCTCATG

GTTTTGCCTTGCAGCGTGTCTGTGGATGAGAGGTGCCCACAAGCGTTCAGTCTTTTATC^ 
TGAGAAGAGTGTGTACATTATTTCTTTCCTATTTTCTTTGGGCTAC 
TCAGTCTCCTCTTTATATGTAACTAACTATTCCTAAACTTAGTGGTATAAAAAACGCTT^ 
GTACAGTGGTGTTGGCTTTTCTCTGCTCTGCAATGTCTGGAATCTTAGCTGGGAA 

ATATTGACTTTGGCCTCTCTATGGAGTTTGGGCrTCCTCACAATA 
ATGGCTGTTTGTGACCCAGCCATCACTCAACTGGACAAACAAGCCCTCCTAGGTTC^ 
AGGAAGAAGTGTCCAGGAATTTGCAGCCATGCTTTCAACCTGCCACG 
CTTTTATCCCCCTGACCTCTCCITGGGCATCCTGATTCCTTTCTGACCCTATT 

6 0 CTTTG T CTTGGG AGT ATGG CAGTGGACTCTG CCT ACTT CAT AA^ T C CATTTTG ATG AC CTCT CAG AG CAGG 

GCTCTGACACGTTACTGGGTTACAAAGACCCCTCCCTGCC^GGGCCAGCGTGCAGCACA^ 
GTGCTCTGGCTGGAGCCTAGTGCGAAGTTTCCCCTTCrrCCGCCCATTCCTCATCCCTCAG 
TCTCGTCCCCACCTCTAAGGCTTGGCCAAAGTTTCTTCATAAAAGCAAGAAAGATAACGAAAG 
TGTGGGGCCAGGAATGCCCAGTTTGAACCTCCAGTTCCGCCACCTC^ 
6 5 CCTCAGTTTCCCCACTTGTTGTGAGGCTTGAGTGTGCTAACACATCAAGGCAAC 
ATCACTGTGAACTGTOSTTATTGGAAAGTGGGTGGAGTCGGG 

GTAAGCGGCCTCrTATGGGAGGTGATCTCAGAAGTGCTGGGAGGGGAAGGGGACCTGT^ 



TATGTGCAGGGGTGGGCTCAGGCGACTGAAGTCTCAC^CACTGCTGCAGGTC 
7 0 TGGGG C^GGTGGGGTGGGGCGGGTAAGGGGTTGTGTTTG 

CCGGAACCACAGATGACCCAGGGTTTGTGTTCCAGCCCAGGGATCATTAGCT 

CTTCATCTGTAAAAAAGGGG CAATAATGTT ACCTATTGGG AGGATT AAATG ACTTAATAAG T ATG AGG GG CTT AGAACAGTG CCTG 
GCACGCAGTACATACTAAGAAAGTGGATTTCCACATATCATT 
ATAGTTATAAGTATAAATAATTACAATAATTTATGTCAACGAACTATAGTG 
7 5 TAATTATAAAATAATTGATACTAAATTAACTATAGAATTATATAGTGCCAGAGTTGTTT^ 






WO 03/008583 



PCT/US01/51291 



TTTAATTTCACAAATAGTTATATGAATTATGGAAGTGAAGGTACGATGGTA 
TATACATATTTATCATGCAGTTATTAAATAATTACAA^ 

CGTCCCAGCTCTATCCAGCCCCTGGCCCAGTGCCTGGGGTTCGGTGGCG CTCAGTTCACGCTTGCAGAGCG AGCGG CGGTGAGTGG 
ATGCGGTGGGGTCGGGCTGTGGTGGTAAAGGTGGCCGTGAACAGGAGGGGCCACTAGAGCACACAGAC^ 
CAGGAGGGGCAGGAAGAGCTACAGAAACAGCATCCCCAAAGCT

CATCCTCTTCTCCTGATAAACAACTGAAAACATAAGAGCGAGATCTTGGGGCTTCT 

CAGCGGAAATCACTGAJGAATCTCCCCTCCCCTGTCTTCCAAGGAGA^ 

CTTCCTGCTGGGGAGAGGCTAAGTTGAACTCTAGGATCA^ 

AAGGTGCTCGCATGGAG03TCCGGGCAGTACCAGGCTGGCCCGAAGGAGGTGCGGATC 

GCT ATGCTTTTTCAAGAGGCCATTCCTCTGAGACTGAGCCCCATGGCT CTCCTT 

TTTTCT AAAT AGT AAGGGGCAGAGAGAGTT AGAAAT AAAAAGG AAAGCTC CT CTCAG AAGAGACTCTGG CTGTTT 

TITO w rrri'rrriT AATTTTTAAGTTTTTATTAATGCTCAT 

AAAAATCACAGTGCATTTGAGTAGCTCCAGATGTGATCTCCCCAT^ 
TGGCTTCTCTGCTCTCCGTGCCTGAAAGGAGACATTGA

TGCTTTTGCOTCCTTGCACGTTTGCCCCC^CTCT 

ACATTTCAGAACATTTCAACTGCAAGGCTGGCTGGGGCTATTGC^ 

CTGTGGGTGGAACTGCTGGGTGG CAGG ACGGGGTGGGGGAGCTGA 

TGGGGAACXSTGCCTGAATTCCTACCCAGGTCCAGTGTGGGTTGATGCAAATGGTGC^ 
2 0 TTC CG ATC CCGACTCCTT ACTGATG AGTTGTGT CCTTGGAAAATG ACTT CTTCT T C CTTG ACT AT AT ACAT 

GATGGATTCTTCCCACATCGCATGGATGTTGTGGGGATTCATTGTCCTGACCTCCTG 

GATTTCGTTTGACATGCTTGGACAAGAGCCCCACACACTAGAAACCCTGGGT^ 

GATGATGGGAGAACTCGGCATAGCCCTGC<XK3AGGAGTGTTGGT^ 

CCTCTTTGTAGAGGCTCCTCGCTCCTGGCrrCAGGGCTGCrr 

2 5 TCAGGCGTGGTGAAAGCCAGCTGGCCCTCTGGACCCTGGCCTTGGTCCTCAGCCCT 

GGCCCACXSGGGAGGGGCTCTCrrcCCTAAGGATGGTCTCCCCAGCTCAGTGACAGA 
CAACTGCCAGCATACCTGAGTGTTTGAGTCAGCCTCTCTCCCTCT 
GGTGGGGAGTCGATCAGACGGAGCGAAGAGATTTCTCXIAAGGC^ 
TGAGCAGACCACGCTCTAAGCTTCTTTTCCTGAAC 

3 0 TCCAGGGACCATAAGCAGCTTCCCCAAOUX5TGCCCCAGCGGGAGACCAGAGGGAGGGA 

TGGCCCCCCTGGTCC CCTG CCTTTAGTGACAGCTTATCCTCGTTCCTGCTCTGTCTC CTTTGGTTCC CTTAGACCAGGTGTCAG CA 
AATCTGGCTTGCAGACCG AATCTGGCCACAG AGCTG CAACG ACAGAATTCAGCG ACTGCAGCAGAGG CCGTGCGGCCCCCCAGGCC 
GAAAATATTCACTATCTGCCTCTTTCCAGCTTTCCAGAGAAAGTCT 

3 5 T CC CTTTG CAAAT C C CTTC CTTG AATTAAC CTG TGTTG AGTGTG C CAATGG CTGTCTGGG ACCT ACC CGGCACG CATTCT ATT ATC 

CTGACGTCCTCTTAG CCTGGCAACATTCT CGTTCTCTCCATTTTGCAGATGGGAAA 
ATGAGATTAGGCACACAATCAGTAGGGGGAGGTCAGTTCCAGAGTCAAAGGC^ 
TGAGCTGCTCTTACTCGCrCTCGGGACGGTCATGCCAAGGAAGTCCCCT 
GCAGGGCCAGTACTCGAATCAGTGTCCCTACTTTGCTTCTGTGCCCGGGTCCCCCTTGAT 

4 0 GTCTCTGAAGGGCCTTTGTGCGCCTCTTCTGTGCTGCACGAGGCGCTCvA 

ATGAATTTCTCAATTTTTGTGGCAAGGAGGGTCCCCAAAGCCTGGACATGTCAGCAACACT 
CATOTCCACCAGGAAGCGGGGCAGCAGGCTGAAGCTGGGACC^ 

AGATCCCAGGGCACT ACCGGGCCCTTCCCTCATCTGT AAAGGAG CTCCTTCCTCC CAGGGCCAGAGGAGGCAGCGTGTGGTAAGTG 

TTGAGCAGGTGTAATTATACATTTAATTACTGCCCTCACTAAGGAAA 
4 5 GGGTGGTTTGGCCTTTC^XIJACATCTGGCCATGTCT 

GGGTCAAGGCCAGGGACGCCGCTCGACACCCCATAGTGTAAAGGTTGGCCTTCACCTGCCCAGCTAGAAGGAT 

TCAGAGGCGCTGGGTTAGAGGAAGGCCGGAGTGGCCGTCTGTCTTTGTGGAGGGAGGGCCTG 

CTCCAACACGTAGTGCGGGTTTCACAAATGGTTCTTGACACATGGGCG 

CACTTCTAAGGOlGGCAGTGACCTGTGCrGGACCTGAC^ 
GGGGGGTCAGCTCGTCCCTGAT



HUMAN SEQUENCE - mRNA 

ATGGATGGGGGAACCGGGCGGCGAGACGGCGGCAC^ACGGCGGCAGGATGTGTGACCGGAA 
CTGATCGAGCAGATTGACAGTAGCATGTATCCAGGACTGATTTGGGAGA^ 

5 5 CGCTGGCAAGCAAGATTATAATCAGGAAGTGGATGCCTCCATTTTTAAGGCCTGGGCAGT^ 

AC!AAAGCTG AACCAGCCACITGGAAGACG AGGTTACGCTGTGCTTTGAATAAGAG CC 

CAACTGGACATTTCCGAGCCATACAAAGTTTACCGAATTGTTC CTG AGG AAGAGCAAAAATG CAAACTAGG CGTGG CAACTGCTGG 

CTGCGTGAATGAAGTTAGAGAGATGGAGTGCGGTCGCTCTGAAAT^ 

GGATGATCAAAAGGAGCCCITCCCCGCCGGAGGCCrcTCGGAGTCAGCTCCriTC 

6 0 GTGCCG CTGGTG ACGGGGTACACCACCTACGACGCGCAC CATTCAG CATTCTCCCAGATGGTG ATCAGCTTCTACT ATGGGGGCAA 

GCTGGTGGGCCAGGCCACCACOlCCTGCCCCGAGGGCTGCCGCCTGTC^ 

GGCCCGAGGGCCTGGAGCTGGTGCGCTTCCCGCO^CCGACGCC^TCCCCAGCGAGCGACAGAGGCAGCT 
GGGCACCTGGAGCGCGGGGTGCTCCTGCACAGCAGCCGGCAGGGCGTGTTC^ 

CGGCAACGCCGTGGTGTCCAAAGGCAGG CCCAACAAG CTGGAGCGTGATG AGGTGGTCCA.GGTCTTCGACACCAGCCAGTTCTTCC 

6 5 GAGAGCTGCAGCAGTTCTATAACAGCCAGGGCCGGCTTCCTGACGGCAGGGTGGTGCTC 

GCCCCCTTGCX3CTCCAAACTCATTCrCX3TGCAGATTGAGCAG^ 

AG CCGG CTCTGTGATG CAGGCCCCCG AGG AGCCG CCGCCAG ACCAGGTCnTCCGGATGTTrCCAGATATTTGTGCCTCACACCAGA 
GATCATTTTTCAG AGAAAACCAACAG ATCACCGTCTAAGTG CGTCGCTTGGGCGCCCCACCCCGT CTGCG T CCTGCATCCATCTCC 
CTGTTACAGTGGCCCXSCATCATGATTAAAGAATGTGGATCCCTCTGTCTG^ 

GGCATTCTCGGAGGAGTAGACCTTTAATACGAAGTGGCGCATA

HUMAN SEQUENCE - CODING 

ATGTGTGACCGGAATGGTGGTCGGCGGCTTCGACAGTGGCTGATCGAGCAGAT^ 
GAATGAGGAGAAGAGCATGTTCCGGATCCCTTGGAAACACGCTGGCAAGCAAGA 
7 5 AGGCCTGGGCAGTTTTTAAAGGGAAGTTTAAAGAAGGGGACAAAGCTC^ 
AATAAGAGCCCAGATTTTGAGGAAGTGACGGACCGGTCCCAA

GGAAGAGCAAAAATGCAAACTAGGCGTGGCAACTGCTGGCTGCGTGAATGAAGTTACA^ 
ACGAGCTGATCAAGGAGCCTTCTGTGGACGATTACATGGGGATGA 
CTCCTTCC^GACTGGTGGGCGCAGCAGCCCAGCACAGGCGTGCCGCTGGTGACGGGG^ 
ATTCTCCCAGATGGTGATCAGCTTCTACTATGGGGGCAAGCTGGTGGGC

CCCTGAGCCAGCCTGGGCTGCCCGGCACCAAGCTGTATGGGCCCGAGGGCCTGGAGCTGGTC CCGGCCG ACG CCATC 

CCCAGCGAGCGACAGAGGCAGGTGACGCGGAAGCTGTTrcGGCACCTGGAGCGCGG 
GTTCGTCAAGCGGCTGTGCCAGGGCCGCGTGTTCTGCAGCGGCAACGCCGTGGTC 
ATGAGGTGGTCCAGGTCTITCGACACCAGCCAGTTCTTCCGAGAGCTGCAGCAGTTCT 

TGTCCGGCAACTGGCAGAAGAGGCTGGGAAGAGCTC 
TCTTCCGGATGTTTCCAGATATTTGTGCCTCACACCAGAGA^ 
Table 29

MOUSE NOMENCLATURE 
ICSGNM Bmi1
Celera 


HUMAN NOMENCLATURE 
HGNC BMI1
Celera hCG23091

MOUSE SEQUENCE - GENOMIC

TAAACCAACCTCGTATTAAAAAAAAAAAAAAAW 

T CTTTGAAGTTATGAGT CG CT AAAG AAGTAATTCATTTATACT AAAG AT CTTCCCATTC CT CTCG 
T ATTTATTT ACGAAG ACT CT ATTGAGT AATT AAAGCC ACGGACG T AGTG AGTTT AGG AG CACAG AATTT AT ATGACAG AG TGCTTT 
CGAAAGGTATTTATTCGC^GTAGCATATAAAATTTTTTAATAATC^ 
1 5 GTT AACT ACCGGGCG TG CACACACCCGC CTGTG T CCTG CACAT CACG CAG AAATCT ACTG CCATGTTCCTG AAGTG ACAAAG ACAA 
GGTAAAACTGGTATATTTAAGATGTCCTCAGCCrCACAAAAGGG(^GCGC 
TCTCCCGTTGTCCCGAAGCCGAGCTTCTGGCCACACTC^ 
GAGCTGGTGCGGTCCCAGGTGAGCGCCAAG03GGACCAATGGGCGTCCGCGCT 

G CGAGAGGCCGGCGCGTCCCGCCCG AGG CCCXSCAGGGCGGCTGAGACGAGAGGCCCAGCCGCCG CCCGAGCTTCACCCGAACCCTC 
20 CCGAACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GAAAGTGGAAGCTGTTCTTAAAGCCCCCCCCCCCCCCCAATTAACCACTACCTGGTTTATCT 
AGAGAGAGAGAGAGCGAGGAGAGGTGGGGGGACTTGAAGATATTAATGGGATTTTGATATTTCT 
CTCCTATCAAG C 'l TCrrf GATGTATTTATGTTCATGGAGTCAATTGCTCTAAAATGTCTT 
3 0 CTGTGTAAACTACCTTCAGTTTCTCCTTTGCTTTCAGTCCATCCAGC^ 

AATACTAAGACGAAATAAAAGAAGGTAATTGAAAAGAGAGTGACAGCAGATTATTGATC 

CTTGTTCATCTCATCCATTTACAATTGAATTTTTGTTAT ATTTTCT CAAAAC CAAATATTCCTCTTAGG A 

ACAGGGGCCTAAAGGATTTGTAATCATTAGCAATTAATTTATTTTAGATCACCCATACT 
AGCATTGTCATGCAGCAGCTGCAACTTGCATCCTGGAGGCAGG AAAGCC 

3 5 CTAATGCCCGCTTACITTGAAGTGGTATCTTTGTCATTGTAATTTCAGTT^ 

CCCCTCCTGCCCCCTTCAGCACTTATTTAGAAGACTGTAAATTTGACCGAGAGCGAATAGAA 
ACTTTACTTTAAACTTTTAAATTAACAATTAGTACCCACCCCCT^ 
CAGAACAATAAGAATTCTCTAGAAACCCTACTGGGAAGGTAGGTACTGTGTTAGGTC 
AAGGAATGGAGTGTGCTGATGGGAAACAAAACTGAGCCAAAAAAATTGGGATCCTC 

4 0 AAT AATTG AAGTT AAG CT AATGTTTTTTTCAAG AT AG AG CCTG ACATAAT CTTAGT AGG TGTG CITTCACTCTTTTG AACTTG AAG 

ATGAGGTTCAACAGTCTTTAAGTTCAGAGTGGAATGAAACCAGTTAAGAAGAGAAA 
CTGTAGATACTACAAAATTATCATTGTCTTTCAGTATAGGCAGATCTCT^ 

CAGATAAAGGTAAAGATAACAAG C TTTT CTGTAGAGGG AGTTAG AGGAAC1" I'rinT'lT AAAAAGGTTAGAGTTAAAGACTGATGTT 
TTGTGTTGTTTTAGACCAACCAACTTCATAAGATGTACCGACCrGGATATTTGGTGACCTTAAA 

TGTCTAAAAATTATGTGTAATAAAAATTTTCATTTCCTTT

CTACCCAGAGATTAATTTTAGTTGCAACATGGAACAGTTACAGGTACAGTAT^ 
GAGAACTGTTATGGAGACTTTATAATATTCCCAGCATCAGGTAATGA^ 

G CAG AAATTTAACATAGATTTCACTTTTAAAAGTAACATAGTAAC!AT CTAATTGAAAA 
ATAACTTTAAAGTTTGAGTTCAGCACACTTTCTTTCATTCCATCATGC^ 

5 0 GAATGCCTTTCAOVAATTAAACTTCCAAAGTTTTAAGCAGGAG 

AACTGTTAATATTTTCAGTGTAAAACTGAAGAAATTAAAGTCTTCGTTGATTGATGGATA 
TAAAATGAAGATCATTCCTGAAAATTTGAATGAAATTAAATGTTCCAAAGACATCTG 
CTTTAAACAGAAAATGTGTGTGAATCATAACCTTCCTTAGAGAAGTTGAGAGATAATT^ 
TTAGAAATTCCCTTTTAACTAGAACTTGAATACCTGATTGGAGGACCAGATCCAGCAGAA^ 
GCGAGCTGTTGAGCTCCCGTGGACTCCTTAGGTGTTCTTCCTT

CTTGTATGCCCAAGTTGTGACATGATTGTACATCTTTTAGGACTTGGTGGGGAAACTT 

CGCGCGAGCTGGATGTCATTTTCAGTGGAAAAAGAGCrrAGCACCCTTTTGAAGTGATGCCAA 
CACATGTGAAATGGTTAAGAGGGAGACTCTTAAGCATCTGGATTTTCAGTATTACT 

TTG ACGACTGAGTGTGCCAAGCACATCTG CC CTG CGTGGGCTCGGACG CCTGCCTTG CCCCAGGTTAAGTGTTTNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
g5 NNNNNNNNNNNNNNNNNNNNNNNNNNNIWNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
75 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNN1TONNNNNNNNNN1WN
NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNHNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 nnnnnnnnnnNNNNNNNNNNNNNNNIWNNNNCAC^ 

GTAGACTTTGCTGAAACGGGGACGGGATCCCGAGATCGGGCGCTGCCTGGGTCCAGTGGCTTCC 

CTGGAGGT CTTTGAACCTCCAGGG AACG CCC CCATT CGATCC CG ACG AGGTGGGTGTT 

AGGCTGGGGGGAGCCCCX5ACAAAAATGTTATTAACTTAAGTTGGAGGGACACTTG^ 
TCCCAGTTGTTTCCAAGGTCTTGTGCATTTGTGTTGGAA^ 
1 0 TTCCAT ACG TACTTGGGG CX3AG CTCT CAAGG AGGTGGGCATTT AAAAT^ 
TAGCCTTGCTGAGOSCCCTTTCTCTCTCCACTCTCACCCCTCCTTTTT^ 
GCTGTGGGATGAGCTGAGACTTAGGTCCAT^ 

GCITCGGAGAAGGAGCACTAACGCGTTGATTGCTTATGTTTACGTA^ 

CAGAGCTTTACAGTTAATAATTGGAAATAAACAAGGAGAGAGACTTTGATCATTCCCAATTG 
1 5 GTGGTCACX5GTTCAATATCTGAAAGAGAATTGAGCATTAACCC 

TGCAGATGTGTGGTGGCAGTTCAAGCTATTTTTATAGGAACTCCCCCAACC^ 

ATACTAGAGATTGTAAATTGTTGACCACATGAGTAAAACCAAGTTCTGAGACTATC 

TAAAGGAGATTTAATCAAGAGGTTGATTTTTCTGTAAAATTATTATTATATGATC 

TCCATGTCTTCTATGAAAGGGAAATAATTTACAATAAT 
2 0 AAAGATTGTTTATGCATGTGTAGACACrCAAATATAATCA 

TCCAGTCTACCACTAAGCTCATGAGGGTATGTTAAGTGTTCAAGTAGGCAAAGTTCTCCCCATCAG 

CCCACGCTTTCCTTTGAGCAGTGAGCCCCAAGTGGGATTA 

TAAGCTGTAGCTTGACTATTGAGTATTTTCAGTGTTTT^^ 

2 5 GGTTGTTTCTTAAGTCTGACCAAGCATTCTAGAAAGTCTCCATGAGGTTGTCAG^ 

G AT ATTTG T ATCAAT AACAAAGT AAG AATTG ATT AG CATT ATG AATGGTTGT AGGAAT AGGGC CAGATT ATTTTT ATTTGAGGGAG 

GACAGTTGTGTGAATTCCAATTTAAAGTGAATTTGAGAGACTTTC 

GTGGTCACCTTCTATTGAGGTTTGACTGCTCTTTTGG 

AAGATAGAGAGAGCCCTTTGGTGGGAACTTGATGATGTGTGAGTATTGGATATG^ 

3 0 CTTTITAGGAAAGAGCCTGTGGTTTCACAAGAGAGTTGCCTTTAATCACTAA 

AG CTCTG ATTG AAGT CACT AG TTTCCC CTTCAGTCACTTTCAG AG AGG CTTGGTTTGGGTTTGG ACCATCTTT AGT AC CCATT CCT 

GTTAATTGCAGCCATTGGGGCATGCTGGTTGGTAGTTCCCTCTTGTGTATTAAATTT^ 

TATTGCGAATTATGCAAATTCCTAGATTTTGCTTACAGCAACTTTGG^ 

3 5 TAGATTCTGTGAGGAATAAAAGG<iACATCCCACAGCTACATCACCTGAGGTACTTTCTX3G 

GTGAGTTTCTAAAATGTATCTGCTGGGATTATCCTTTCATCTTGTGATACTGTATGGGAGT^ 

CCCCCCATCTTTACTCTGCATTTAGTTTTGGGCATGGGCTGGT^ 

AGTTTTTCTTGCCTCTAGCATAGTTTATTCAAATA 

ACTGGAGGACTTATTAGAGAAGGGAGCCTTAGAAGTTCAGTTCAATAAACGTCrcrc 

4 0 GTGTACAGTTGGCAATTCTCAAAAGATCTCTAAGTAAAATTTCTGACATCGCGATTGATAACAG 

GAGAAAGTAATTCTTCCCTCCCAATGGTCATGAAAGAGATCCGGAGAGTGCCTGGAATGG^ 
CTCAGGTGGCGGTCACGCX3AGGATCGGCACCCTGCT 

GCACGCCCGCAG AGGCTGGGCTCGGCCAGCTCAGCG CAG AGTTG CAG AGGTGG CCGGATGCCAAGTGTAAGTGT AAGTTGCTATGG 
AAACCCCAACGGAGGCGAGTT CCGAATCCGGAACAAGACCT AGCCCGGGGCACTT CGGGATCCGCCCCTCACAT CCCCGCTCGTGT 

4 5 CCGCGGTGCCCAGGCCTCAGCCGAGGCCCACGAGGTGCTTCTITTCCn'CCCT 

ACTGAGCGCTTTCCCTTTGTGGGGCCTGAGCCCTGCCAGGTTCT 

T AG AGTG CAGG AAATT CCACAGT AAATTCGG AG ACTT AG AT CAAT AG TGG CT ACTGAAT AT CTGTGGCGGCGG C CTCG TGG TAG C C 

GCAGCAGAGCTGCGTGGTGTCTGCAGAGCTGGAGGACAGCTGAAGAGAGAGAGAGCAGAATCTACTC^ 

CAGCCCTTTTTATCTCGCTCCAGGAATGACCCGAAAGGCCAGAAGTAAATTTAGTTGG 

TTATTCTGATCTCATTAGTAAATTCCTTTTGTGGAA

T CAGCATT CAC^GT ATACAGTT ACITGG AGTTGTGG AG AT AACT GA^ 

TGCAGG ATCTTTTATCAAGCAGAAATG CATCG AACAACCAGAAT CAAGATCACTGAGCTAAATCC CCACTT AATGTGTGTCCTGTG 
TGGAGGGT ACTTCATTGATGC CACT AC CAT AAT AG AATGT CT ACATTC CTGT AAG T ACACAGTGTTCG CCTTCTTGGG T CTGTGCA 
TTT CACAGCT C AT C CATT AAAACC AACAC CATTT C CACAT TC CTTGAAAATGTT AGTT CAT AACTG TG AGGTTT ACTTTCTTTT AT 

5 5 ATGTAGTGATTTGTTACAGTTAGGCAGTATGTAGTTTTCAAGTTATTAATCG^ 

TT CC CAT CCTT G AATTTT CAGTTCTG AT ACAATT AACCTTTGAACCTTT ACTTTCT AAACT CACAAAACCTTTT T ATGTTC CT ATT 
TCTAGTCTGTAAAACATGTATTGTACGTTACTTGGAGACCAGCAAGTATTGTCCTATTTGTGATC 

CACTCCTGAACAT AAGGTAGG AAGTGCTGTTGTCCCTGGTGGGAAGTGCTGCT CTTTG CAGG ACGTTTAT 

TTCTGATATGTGTGCTTTATAGGGGGTAGAACTTTAA'jn'TCTTGTTATTTTAATT 

6 0 AGTATTTAAAAGGTGTCTGTCTTTTAAAACTGAGTTAGTTTCCCTTTTTTGTGCA 

AGATTTTTTTCCCGAACATTTCCTTATTCTTTATTCTTGCTAGTG^ 

CTTGGATGTGAAGAGTTAATTACTGCAC^GCATCTGCAGGGTGGAGAACGAGGCAGAGCTCT 

ACACTCCACCACAACACCTCATCACTGTGCCTCAACACCAGCATGCCT 

CK5TTTCTTACTTTTTCAGGTCAGATAAAA 

6 5 GATTCAGTGAAGTTTITAAAGTGTTGGTTTTACCTATTATAGCAA^ 

AGAAGAAGAGATTTTTATGCAGCTCACCCGTC^CTGATGGTAAA 

TTTAGT AATTT AAATT ATTTTG TTT AAATT CTCC CTT CT ATT CTCTT ATAGGT AAAAT ATTCAGTTTCT AAAATTG ACTTT CAAAC 

TGTCTTTATAGCTGCCAATGGCTCCAATGAAGACCGAGGAGAAGTTGCAGATGAGGA 

TAAGCTTGTCTATTGAGTTCTTTGATCAGAGC^GGTGAGTGCTCCTAGAAAGCrrACTGTGTGCT 

7 0 TATCACTTTGTATTTGTATTTTCTCTCTCTTAGATTGGATCGGAAAGTAAATAAAGAGA^ 

GGGAGTTTAAGTTGTCCTTTAGAATATTAAGGAGTTATATGATATTATTAAAT^ 
ATTTTT AATAGTAAGTTCAAAATGTAACT AG TTGAGAGTTGTTGCATATTGGTC 

CATTCAG ACTTTTTTGTTTTAC CTGAAAGTT AGGAATTGGAAGAAAATT ACTTTTG T AAATC TGAATG AT AAAAGG 
T ACTT ACGATG C CCAG CAG CAATG ACTGTG ATG CAC"1"TG AGAAAGTTT CT CAGGAC T AAAATGGAC AT ACC CAAT ACTTTCCAGG T 
7 5 ACCTACCTCATGCATTTTGTGCAGATTTTCAAATTTAAATGTT^ 
G CAAAGTTT AAG AT ACCTCTT AAAATGGTTGGTGAAG ATGCTfT 1" l'AATCGTTTTGGGTTAGATCTTGACGACAGAAACATACATA

TTTATCTTTCTTCTTTTGCCACTGAATACTTGAAATAT^ 

CTACACGCTAATGGACATTGCCTACATTTATACCra 

T AAAG AATTGAGTGG TTTTT CTTTTT CTTTTTAATGTTG T AGAATGGC CCACT ACCTTTG AAAT ACAGAGTTCGG C CAACTTG CAA 
5 AAGAATGAAGATGAGTCACCAGAGGGATGGACTGACGAATGCTGGAGAGCTGGAAAGTX^ 
CAGCCGGAGGTGTTCCCTCCACCTCTTCCTGTTTGCCCAGTCCCAGC^ 
TCCAGTACCATGAATGGAACCAGCAACAGCCCCAGTGCTAACCACCAATCT^ 
TGGGT CAT CAGCAACTTCATCTGGTT AGGACTGTT AAGG AAAAG AT^ 
TTTATAGATGCTTAATACATGTGACTGTOrrCCAGTTTGCT^ 
1 0 TGTCATACHTCATATGGATGTTAAGTGGAAAGATTGATT 

ATGTGAAATGATTTCTGTGACCACTGTTTGGATCTGGAAATGTTCT 
TAATGCCTACATTACATTCTCTTTGATTGTTCTTGTTATTATG 

ATTCAGCAAATGGAAAGAATAAGCATAGAATACTGCATTCTGTGCAGCCACGTCACTGTGA^ 

ATTTTAATTCCTGTTTGATTTTTACTTCTCTC 
1 5 GCGGGTACTACCGTTTATTTTACTGACTTGTTGAAATGATTC^ 

ATATTGGTATATGACATAACAGGAAACAGTATTGTATX^TATATTTATAAATACTATGAAAATAT^ 

CAGTTGTAAACTTCTCCAAATGGGTTCGACCTTTGCGGATATCACAGTC 

TTTTAATGTTGATAACCAGTTTCAAATTCTTTTTCACTTAGA^ 

CTTTCAAAATTAATATTGCAGTAATGAAAAATTrTAACCAAGGCA 
2 0 TGTTTTT AAATT CTXHTT ATCAGTGG AATTCT AAAATAG ATTTT AAAGT CAAC T AAAG CATTTT AT CT AC 

AATTTTATTTTAGTGTATGTGGCACAAGTGTAATTTCAAAT^ 

GTCTGGAAACAGAAACAATTTGTTATGATTTCTACAGGGATTT^ 

CC!AAGTCTCAGTTTTGCTTTGGTCX3AACTTA 

TGTTTCJATGTTTGTATGGGAAACTGTAGCTAAGCATTTCATTGTCn'GCAGTCT 

2 5 CTTGT AGCCATTAAATCATTACTTTTACACACATTCCTGTTAACTT 

CCACAAGATACATATATTTTTATTTTGACCTAAATTTG 

AATTTCATTTTTAATTGGAAAATTCAATAAAAAGATATT 

CACAGTGTGCAGAAGTTTTGGGAACCCTGTAGTGGATTGTAAGAGCTA^ 

TGTT AGT AAATG CTTTGTTGT AT ATTGCTCG AAAG C CCAAT GAAAG AAG T ATCTG CTT ATTTACAAT CTTTGAAGCAAAAGTTACC 

3 0 AATGTTTGCCAATTTTGGTGAATTTAACrCATTTTTCrA 

(^CCCTCTATGATGGACTTACTTCTGAGAGTGCGTTTC 
TCCATTTATTGTATCTGGGATGAGGCTTTTTACCCTTACTCAATTTGA 

CCTTTCCCAACTTTTTAAAATCITCACCTrAAGAAAGC 

3 5 TTGGCTTTAAAGTTTGACTTTCACTCCTAAAGGTATTCT 

CCTAAATTGATCTCACAGATACTAGTTTAAAAGTAGAAAGAGAAATGGGGCCCACTG 
TTACACATGCATAATACTTTAC1AAGTACTTGCACCTCCTAAATTGC 
GTTGACTTAACTAGITAAGCACATTGCCTCATTTCCATATGCXiCATGTTCAG 
GCAAAGCITTCCTTCATCTTTTCCTGTTT^ 

4 0 GTGGGCCCATCAAGAACAAATTCTCTTCrGAAACCTTTAA^ 

, CACCGTCTTGTCTGTGTGGTTTATGACACAGGTTrAG 

C C CAAGAAG TTGTTTTCG CTTTGTGTT CAAAAGATTT AAGAC ACTTT T C CT C CCCTTG CT CCAGGAATG ACAGAATGACAG AAGTT 

CTTATAGTGCTTACATGAGAGAGACTGAAACAACAAAAGGCCGTAGTCAAATTGTACTTAAACT 

CTTCTCAAAAAGTGCATATCCCTTAATACACCCAATGAAGTGGAACTTGGACCCCCTGCT 

GCACCCCCACCCCCAATCCCTTGTTTCCAAGTGTGACCAAGGATTT 

CAGT AAGCCTGGTGAGAAACAGTCCGGAGGAGTCCTTG AACCAGAGAAT CACACAGCAAATCG CAAGATCACC CATCTTTGATTAT 

GAAAGTCTCTGAAAAATGTTTACACAGTCGAGTGTTTCCTGCTTCCGCAGT^ 

AAACGGATTTCCTTTATTTGCATCAGTTTCTIT 

5 0 CTGCCTrCACCAAGACAGAAATCATCTATCACTCCAGGATCC 

CATCTCATTTAGCCTCTTGAATAAATAGTAAAACACCCAATAAAAACTTATGAATTAAATTCCCT^ 

ACAAAACAAAACAAAACAAAACAAACAAACAAAAAAACAA C CAACAAT CT ACTT CCAG AG ATT ATCAG AAG AACTTTTTT AAAAAG 

CTGCTCTTCCCTTGGTTACCTACATGTCAATAAAAAACATCCCTG^ 

GCTCCAACTCTGTTCTAAATCAGGTAGAAGCCAAGCT^ 

5 5 CACGTCGAAAATTGGGACTAGCAACAGAGATTGCTGCAGTGTTTCTTCCGAAGGA 

GAAAGTAGTAGCGGAAAATAAAGCCAGGGACCAGAATGATTGTCACACCTGTCTCCTC^ 
AAGTCTCCAGTTAAAGGATGGAGGGGAGCGGGATGACTGTGGAAGCAGCTTCGCTTG^ 

GGGAGGCCATT^TGCGCGTGCGCCGCTCATCTCGCACCGCCCCCAGCTCGGCGGACGC 

6 0 GTCAGGCCGGGCCGGGCTCCCGAGGCCGACGGACCCTGCGGGCTTGCGGAACCGGAGCTCCGGGGG^ 

GGCTGCAGCCrCGAAACGCGACCTTCTCTCTGAAGCCGTCCACGTCCAAGGG 

CGTGGACCCCTTTACCCCX5AGGCTGCAAGACAGTCACCCCTTTGTCCCCTCCTCTCT 

TTCACCAAGGTGGTGTCTTTGGAGCTrGGAGTCCCGTTGTTGTTGG^ 

CAGGTCCCCCGCAGGTGACCAGGACCAGGAGTATGGCGTGCT 
6 5 AGCGGGCGAGGACGACIACAGGCGGGGCACCTCGGGCTCTGGGA 

GAGCCGGCGAG AGGGG CGCACGGATGCGAGGGCG CACGGGCG CAAG CGCTTCTCTGATCCGGGAGAAG CG AAAGTGGCCTGCCGGC 

AGGCAAGGCAAACAAACCGATCCGCTAATCAGTGCAGCAGGCTC 

ACGCTGTGTGGGAAGTOITAACTGACTCGCCCAAGAACA^ 

CCCTGGCITTCCTCTTCTGGGGCCTGGGAGGGACCTTTCCT 
70 GCTGTCCTCT^CCCACTACTGAAGGTCrTATTTCACTGT^ 

CTCCCACCTCACACNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NnNnnnnNNNNNNKNNNNN^ 

NNNNNNNNNNNNNfiNNNK^ 
75 NJWNNNN NNNNNNN>WNNNW 
NNNNNNNNNNNNNNNNNNNNNN^

NNNNTOJNNNNNNNNNNNNN^ 

NNNNNHNNNKNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNhWNNNNN^^ 

NNNNNNNNNNNIWNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^^ 



NHNNNNNNNNNNNNNmJNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNIWINNNNNNNNNNNNNNNN^ 
HNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNN>WNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
15 NNNNNIJNNIWNNNNNNNN^^ 

NNNNNNNNNNNNIWNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 



20 NITONNNNNNNNNNNNNNNNNN^^ 

MNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNIWNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNIJNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNIWNNNNNNNNNNKNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNh^^ 

NNNNNWJNNNNNNNNmiNNNNNNNNNNNW 
30 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNHNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNMNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNIWNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNIWN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNN^ 

MOUSE SEQUENCE - mRNA 

CCCCCACGCGAGTCTGAAATCATAATCGTAGTATTAAAGGCAGAGATCGGGAGAGACAATGGGGAGGTTGG 
0 CAGGCTTAGCAACACCTCCCGGCCCCGCAGAATAATACGGCTCGGCCCCTCTGCGCGCCCTCCCCCGAGCG 

GGCTGCGCCGAGG AGGAGGAGAGG CCCCGAGG AGGCGGCGGCGGAGG CCGCGCGGAGGCGG AGGAGGCCGCGCGGGAGG AGG AGGC 
CGCAGCGCGCGAGCAGGAGGCCGGCGGGGGAGCGGCATGAGCGGAGCGCGGCGGCCGTGGCTGCTCTCGGCCG 
TTGACAGCGG CGGCGG C GG CTC GCTCCAAG ATGG C CG CTTGG CTCG CATT CATTTT ATG CTG AACGACTTTT AACTTTCATTGT CT 
TTTCCG CCCGCTCAGATCGCCT CGCGCCGGCCGCTCTI^CCGGGATCTTTT ATCAAGCAG AAA 
ATCACTGAGCTAAATCCCCACTTAATGTGTGTCCTGTGTGGAG

CTTCTGTAAAACATGTATTGTACGTTACTTGGAGACCAGCAAGTATTGTCCTATTT 
TCCTGAACATAAGGTCAGATAAAACTCTTCAAGATATTGTATACAAATTAGTCCCAGGGC^ 

AGAGATTTTTATGCAGCTCACCCGTCAGCTGATGCTGCCAATGGCTCCAATGAAGACCGAGGAGAAGTTGCAGATG 
GATTATAACTGATGATGAGATAATAAGGTTGTCTATTGAGTTCTTTGATCAGAGCAGATTGGATCGGA 

0 CTAAGGAAGAGGTGAATGATAAAAGGTACTTACGATGCCCAGCAGCAATGACT 

ATGGACATACCCAATACTTTCCAGATTGACGTCATGTATGAAGAGGAACCTTTAAAGGATTACTACA 
CATTTATACCTGGAGAAGAAATGGCCCACTACCTTTGAAATACAGAGTTCGGCCAACTTGCAAA 
GGGATGGACTGACGAATGCTGGAGAGCTGGAAAGTGACTCTGGGAGTGACAAGGCCAACAGCCCAGCCGGAGGTC 
TCTTCCTGTTTGCCCAGTCCCAGCACTCCAGTGCAGTCTCCTCACCCACAGTTCC 

1 5 CAACAG C CC CAG TG CT AAC CAC CAATCTTC CTTTGC CAGT AG ACCT CGAAAAT CATCACT AAATGGGTCATCAG CAACTTCAT CTG 

GITAGGACTGTTAAGGAAAAGATTTTrCAACCCCCTGATTTAGTTACCIT 
ACTGTCGTCC^GTTTGCrrCCTITTGTAGTGACTT^ 

AGTGGAAAGATTGATTCTTTCTCTAAAGAATTGGATTCTGAGAAGGATTCTGTGTTAGG 

CTGTTTGGATCTGGAAATGTTCTACAGTGGGTAGACATTGGGCCATAGTTTGTTAATCT 
' 0 TGATTGTTCTTGTTATTATGCTGTTCTGTGAACCTGTAGAAAACAAGTGCTTTTTATC^ 

CATAGAATACTGC^TTCTGTGCAGCCACGTCACTGTGAATAACAATTTCTTGCATATTT 

ACTTCTCrcTTGCTACACAAAATGATCAAAGGAAACTTGTTTTACAATCTGTA 

TGACTTGTTGAAATGATTCACTTTTGTAAGAATCAGATGGCATT^^ 

AAACAGTATTGTATGATATATTTATAAATACTATGAAAATATTC 
1 5 GTTCGACCTTTG CGG ATATCACAGTGCACCCAATG CTGAGGCITACCACCAGCACAGAATTTGTTTTAATGTTGATAACCAGTTTC 
AAATTCTTTTCCACTTAGAAGCAAATGGCAGA

AAGAAAAATTTTAACCAAGGCAAATATAGTGTTTAATTTTCCATTGG 
GGAATTCTAAAATAGATTTTAAAGTCAACCAGCTAATTGCTGAAGTAAA 
CAAGTGTAATTTCAAATTTAAGAGTTGTTTTCATAGTTAAAGGAATCT 
5 ATGATTTCTACAGGGATTTTTAATAGAGCAAACATGTTGAATTTAAAATATGAAT 
GAACTTAGCGTGTGTTTATCAGCCATCAGTTATTTGTGAGGATGTTAATTCCGT^ 
GTAGCTAAGCATTTCATTGTCTGCAGTCTGCAGAAGAAGCACAATTGrAT^ 
TTACACACAAAAAAAAAAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING

ATGCATCGAACAACCAGAATCAAGATCACTGAGCT 

TACCATAATAGAATGTCTACATTCCTTCTGTAAAACATGTATTGTACGTTACTTGGA^ 

TCCAGGTTCACAAAAC CAGACCACTCCTGAACATAAGGTCAGATAAAACTCTTCAAGATATTGTATACA CTT 
TTCAAAAATGAGATGAAGAGAAG AAGAGATTTTTATG CAGCTCACC CGT CAGCTG ATG CTG CCAATGG CTC CAATG AAG AC CG AGG 
1 5 AGAAG TTG C^GATG AGGAGAAGAGG ATT ATAACTGATG ATG AG AT AAT AAG CTTG T CT ATTGAGTTCTTTGATCAG AG CAG ATTGG 

ATCGGAAAGTAAATAAAGAGAAGCCTAAGGAAGAGGTGAATGATAAAAGGTACTTACGATC 
TTGAGAAAGTTTCTCAGGAGTAAAATGGACATACCCAATCATTTCCAC^ 
CTACACGCTAATGGACATTGCCTACATTTATACCTGGAGAAGAAATGGCCCACT 
AAAGAATG AAGATGAAGTCACCAGAGGGATGGACTG ACG AATG CTGGAGAGCT 


HUMAN SEQUENCE - GENOMIC 
CAAAAGCAAAATCCAGTTGCCTC^GATCGGAGCTT^^ 

ACCANNNNNNNNNNNNNNNNNNNNNNNNNNW 

2 5 CGCCGC03CCMCCGCCCCGAGCCGCGTGCGCGa3CC^CC^ 

GXjTCACGGCGCXSCTCACAATGGAGCTCTCGGAGTCTGTGCAGAAAGGCT^ 
CGCCTTCACGCTTCTCCTCCGGGCGGCATTCCAGAGTCTCCTGGA 
CGGCTCGGAGCTGGATCCGCCX3GGGTXXIAGGGGCGCTCGGAG 
AAGTCTCCC^GCTTTCCCCTTCGCTCCGGACGGAGTGTTGGGGACG 

3 0 TGGCCCCACTTGGCACCTGCATAGAGCCAACTCCACGTTCCTGGCCCGGTTTCCCTCC 

TGTTCAAGTCTTGTTTGTGGCCTAAACATCTAAGGTGTTrAGCC^ 
GGGAGCTCCCTGCAATTCAAGCTTCCAGGGCTAGGCTAGCAAAGGTT^ 
GTTGACATTTGCC!AACITAGAGCATACCTCAAGTTG 
CTAGAACCTTAAACTATGATGCACAATACTTTCTTCCGCCCCAGT 

3 5 TGCGGAGGTGCAGTGTCAGTATGGATTGAAGACATATT1TAAGACCGATCGCTTTTAGA 

CAATTCAAATTTTTCTTTTTTTTCT 

TAGTAACCACCACCTG^GGTTAATCTCAAAAAAAAAATGACTGTCAAA 

TATTCTGAACTCATG CCCTACCTTCT ACCCCCTTGATGTATTTGTTCATG^AGCCGGTT CATCATTTTT AGGATGG 

AAGTTTATTCTAAAATTGACAGAATGTAAAATATTTATCTTGT^ 

4 0 GTTATATTTTCTTTGGGGGCrATAATAT T TTlTAATAGTAAAAATATTAAGGCAAAAT 

TTCTTGTTGTGTTCITTTTATTGTTTTTCACTTGATCITG 
TTTTAATCTTGTTAAAGGGTTTTTAAACCCAA^ 

ACTGG AAGTT AATTT ATTTT AG AT CATCCAG ACTTG AAACAT AT CGAC C CAGTGGTTTT AAAACATTGT CATG CAG CAG CTG CAAC 
TTACATACTAGAGGCAG^AAAjGCACCGAGCTGACAAGTCAACTCTAAG^ 

4 5 CTACCTTAATGATTTCAATTTC?AGTTTTAA 

CTGTAAATTTGACAGAGAGCGAATAGAACTGTTTTGCACGGAATATCA 
TTTCTTTTACTTTGTTTGTAAAGATGT m T^^^ 
AATCCTACTGGGAAGGTAGGTACTGTATAAGGTGTCAAGCT 
AACAAGATCTTCC^ATCTTC^CCCAAAGAAAAC-GAGCCAAAGGGCATGTCAA 

5 0 GTTTATTGAGATGATTTTGAAATGCTCCTTTACCATTCATAATTTAAAAATAAAGTTTAACAATC 

AAATGGAGTTGGGGGTGGAAAGTAGAGCCATTCTTAGTAAATATAAATAACTGAAAAGTTCT 
ATCATTGCATCTTTCAGTATAGGCAGATCTCTCCCTCATATAACGGATGTTTC^ 
TAAGAAACTTCTCTAGCGGGGGATTTAGGGAACTTCTTAAAACCTAGA 
CAATCAACTTCATAGGATGTACAGACCTGCATATTTGGTGACCTTAAGTGTACAGGT 

5 5 AATTT AT AATTTCATTGATTTAAATAAACAGCGTTTCTTGTTGCGTGTTTTTTTTCC 

CCAGAGATTAGTTTTAGTTGCAGCATGCrAACAATTACAGGTACAGTATTAGC^TC 
ACTTG AGAGTT AT ACTCrTGGGAAGTTCAAAAGAAAATTAATCTAACCATTTGTC^ 
TGGGAAAC^GGCATCAAAAGTAGTATTTTAAGATAGGCTTCACTTGAAATT 
TGTTCTTGCTGATTGAAATAAACTTCTGTAGTGTTTAGTGTAA^ 

6 0 AATT AAGAG CATG G CTTTG AAAAT GTCTTTG CTTT CAG AAATTG AATT AAACTGTCAAAATTGG ATTTTGAAACTT^ 

TTTATAAAGGGGGAAGTATACATAACATTCTAAAACTGTTTATATTGCCC^ 

G TGGAT ATG ACAACG AATTT ATTT CATAGTTTTT AAAAAATG AAGCTGATT ACTC^ AAATTT 

aVTCTTCTGCAGAGTCGTTTTAATGTAACTATGAATCAAACm 

AACTTGAGAAAATTCAGATGTATCTGGGCGGTTAAG^ 
6 5 CATGAGGTATTTCCGAGGAAACGCTTTTTTrGCCC 

AAGG TGTGTC CTGATTT1TTTTCTTG CATCTTT CTTG AG CTTGGAT ACTTCTTT C CATG 1TTTATC CT AAAAACGTGGT ATTGT AT 

ACACGTCTTTTAGGANNNNNNNNNNNNNNNNNNN^ 

NNNNNNN1WNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNIWNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
75 NNNNNNNNIJNNNNNNNNNNIJNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNIWNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
* 5 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNKWNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNHNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

AGTTCCCCGCTCCCCCGGGCGCCCCTCCGCACCCTCGCGACGCACATC^C^ 
CTCrrCCCGGCCGCAGCGCCCTTTGTTGAGAGGTAGATTTGGATGAAACGGGGACGGGTTC 
GTGGCTTCCCTCCTCTCCCGAAGAAAGGGCTCTGGAGGGGCTCCAGCAGCCAGAGAACGCCCCCATT^ 
1 5 GGGGGGCGGCCATCCCGCCAAGGTGGGTGTTAGGCTGGAGAGAGCCCCGACGAAAATGTTATT 
TCTGAGATTGGATGGCTGGTAAGATGGCGGTTCCCAGTTGTTTCCAAGGTCTC 

GGTGG TG TGTGTTGGG AG TCTTGG AT AAGTGCTCTGAACAT CGCTTGGGAGG TG CT CC CTGGG AAGTGGG CATTTCAAATTTGG AG 
CTTTTTGTGGAGTCAAGATGGTGACTGGACGTTAGCTTTGC 

TCC CCTTTTGGAGACTGCAACTTTT CT CT CGACTTCT ACT AAAGGATG CACTCAGCAT CATGACTCCAGT CCACT AT CTTTT ACAG 
2 0 GTTTTGCTAAATGTCAGCATAGGTCATTTGGGAAAGGGATGCT^ 
TCACTTGGGCTTGACAGTGATATGGCATGATCITTTTCCCAG 
GATCATTACCACTTGCAATAAAAGGATTTAGTGGGCTTAGTGGTCACGGT^ 
CCTTAATGTGGGCAAAATTCCGAGCATATTTATCCAAATGCAAGTC 
ATATTTTATAGGAACCTTAACrCCCATGTTCTGGAAACACATTATAAGCT 

2 5 GACAGTATATAACAAGTTATGAGACTCTATATGTTAACATTTGAGTTTTAAGT 

AAACAAAATTATAAGTTTTTAGATTTGAGTTAA.TTCCATACATCT 
TGTGAGTCTTATGCCTAGTAAAATATCTTTAAAGCCATATACAAGTCACTTTGTA 

AATT AAGCATTTAATTGT ACATGAT AAATC CATT AAT ACTCT AT CACT AACTT CTTGGATGT ATG TT AAG TGTG CAGAT AGG CTTT 
CTCCCCAC^AGTCAAATTAACCCCAACTTTACCCATG CITATO 

3 0 TTCTTAGATTGCAGTAGGCAAGCTGATGGTTAACATAAGAACTTGCTTCTGTTTTCCT^ 

CCACCTAAAAACTCAC^GCCCCAACTGAGTGTGTTACAATGGCAAGTTT^ 

ATTTTTGTTTG(TTTAACTGGCAGTTTCrGGGTTGGAACCTGGAAGGTT 

ATCAAGCACGTCAAACACTTTGGAAGTCTTCIATTAGGTTGCCACT 

TAACAGTG GT ATTT ATT AG CATT AT AAAT GGT CACAGG ATTG TGGC CAAATT CTT CTGAACTGAGGT AG AGGAATTGT GTG 

3 5 AGACTTAAGATGAATTTGAAAGACITTTGGTCAAGTACATGTGACTTAAATGCTCAAA 

GTTTGATTGTTCTCCTTGITAATACTTGTCTTTGTGTGGTGTTGGTTTTCT 
TTGGAGCCCCTTCATGAACTTCGTGATCGGTGGTTTT^ 

TTCTAGGAAAGATCCTGTGGTTTCACAAGGAGAATTTTCTTTAAGCACCAAGAAGGCTGTTC^ 
TTTAAATATTGCTACTCTTTAGGGTCATTGCTCTACCTACAACT 

4 0 TTCTCCAAAACAAGTAATCCAGAGATAACTGAGGAGTCAGTCA 

CACTTTCATAAGAGAGGCTTTGTTTTGTTTTTTACCATTT^ 

TGTTTCCCTTTACTGCATTAATACACCTTAGGAATTAAAA 

TTCAAGGAAGATTTGAGAATGTACTTCAGTGAGGTCTGGAAGCT^ 

GAAGTTTAiTi I i CAAGAATACTACACGAAGTATTTGACT 
4 5 GACTTTCCTGTAATAGAGGTATCTGTTGGGAATATCATTTCATCGTATGGG 

TTGGGTTG ACAAG CTGTTG ATTT AG CGTTGG AAATGGGCTGGTCACATGGTT CTG T AC AT ATTGTCACT ATTGTAC CG ATT AAAG T 

ACTTTTTCTTGCCCCCAGCATAGTTTGTCCAAATAAAAC^CTGGGCTTTGCC 

AGCCTTTCCAAAACTTTGTATCGCTCATTTAATCTACAGGAGAGCGTCAC^ 

AAACATCTCTGCAATTTGAGCCCTGACGGTTTCGGGCTAAGTGCACAGTTGGCAATTCT 
50 CCGAGTGATTGCAGACAGG ACAGACOT AGAGAGTCG AGATCTGAAACOIGCTCTG CG CACCAAGTCGACGTGAAAAG AGGCCCCCG 

GAGTGTCCGAAATGGCCGGGAGTGCGGTTTGGATTTTCCT^ 

GGGTCCCCGGTCCCCGATCCCCXSCrrCCCCXSCTCCCCAGCGCTGTGGCCTCGTGGGGCTGCCCCGCCCCCCCGGC 
ACCCCGCCCCCGCGCCGGCCCCGCCCCCGOXSCGCGCCGCCCCrCCCCCGCCCGCCCGCCCGNNNNNNNNNNN^ 

55 GCCGGATGCCAAGTGTAAGTGTAAGTTGCTATGGAAACCCCGACCGAGGCGAGT^ 
CCGCCGGJttCCGCCCCTCGCATCCCGGCCCCCGGGOSTCCGCCGCGCT 

CGGCCCCAGCACCCGGCTCCGGAAATGCCAGGGATGCAGGGAGGGCAGAATTGTTTTCCCTGC 
TT CTG CTTGCTGGAACCGTCTAAAACAACCGGAACCCCGGATTTT CACT 

ACCTGGAACAAGCCCTAGTGGTTGGTAAATATCCGCGGGGATTGTGTGGCGTCTGCAGCAGCCTTGGGGCT 
6 0 AAATGGAAGAAAGAGACCCTAATACGCTCAGCTCCCA.GCCCCCACCCCAGACTTTCTTCTACCT 
GAAGTAATGTTAGTTGGATCCTTTTGCATCAGTTTGGTAGAAC^ 
C^CAGCCCAGCTGTACAGTGTTAAATGAGTTTTATAAATTCAGTGTC 

TGTGAATTATGGCCATTATTTCTGTGTCTTGCAGGATTTTTTATCAAGCAGAAATGCATCGAA 
CTAAATCCCCACCTGATGTGTGTGCTTTGTGGAGGGTACTTCATTGATGCCACAACCATAATAGAATGTCT 

6 5 CCGAGCTTTAGCTCTCTTTTGTATCATGCGTATTTCACA^ 

AAATGTTGGT AC AAAG TGGTG AAGG CATTTTCTTCT CTTG CATCT AATG ACTTTTTG TTAAT ATGT ATTG ATTGT ATTACAAG CAT 
GCAATATATGTGTTCTCAGGATATGAATTAGCTTATTTAG 

T AACACCAATGATTTATCCACTCATTTCTAAACAT AAAAAAACTT CACATGTT CT ACTT CT ACT CTG T AAAACGT GT ATTG TTCGT 
TACCTGGAGACCAGCAAGT ATTGT CCTATTTGTG ATGT CCAAGTTCACAAG A 

7 0 TGAAATTCCTT GTTTGT AATT ATT ATTGGAGTTGT AT AATTT ACTGAAG GCAACCCT CTTT A'iTT Cn"CACAGAAAATTTACTCTT 

GAATACATAACTAATATGTGTGTGCTTTGTGGAGGGTAGCCGTTTAATTTCT'I , G 
TTAGTAGTATTCAAGTGATTTTAAGATTTTTTTTTCCTAACT 

TGG ACTT CATGAC^GGCAGGAAT AAAT ATTT AAAAGGG CTT CCATTCCT CTTT ATT CC CT ATTCAT ATT AG CT AG TTTAGT AACTT 
GAAAGGCACACTTCTTTTGAGAGAGCAAATGTGAAGAATT^^ 

7 5 atacagagaacttttattctcatagtttttatcttatcaata^ 
AAAGAAGACT AT AG AAACTT ATG AACATTTTT AG TTCTTG CC^

TAAAACTCTCCAAGATATTGTATACAAATTAGTTCCAGGGC^ 

TATTCATTA GTT CT TT GTGTTATGC 

AGCTCATCCTTCTGCTGATGGTAAACCTTTTAGGGGAGGGAAGACA 
5 TTAAAATTGACTTTACCACTTCCATCCTCTTATATATAAAATTTATTAGG 

TATAGCTGCCAATGGCTCTAATGAAGATAGAGGAGAGGTTGCAGATGAAGATA^ 

T ATC CATTG AATTCTTTG AC CAG AACAGGTAAAATCTTT AGG CAATTT ATTTT ATG CT AATGTTTT ATT AGATT GTTT CACT AATA 

AATTTTCTCTTTATTAGATTGGATCGGAAAGTAAACAAAGACAA^ 

TATAGAAGAATlCATT GT TTGGAATCCAGATTTTATCJtfSGG 
1 0 GTCATGTTTAATGTTTAAGAAAGGTCTACATGTGATATTTAATAATTAGAATTT 

TTTTAAGTAGTTCACAGTTAATACTITGGTATGCTrGAAGATTATATTAGG 

TCAGAGTGTAATTAGTTGAGAGGTTGGTCACCTCCAATTTTGTTTGATACTAGTAT 

TTCTTCATATCTTAAAGTAACTGAAAAAGTTACTTTrCTAAATGTAC^ 

GCAATGACTGTGATGCACTTAAGAAAGTTTCTCAGAAGTAAAATGGACATACCT 
1 5 CTTGCATTTTACATAC^TTTTACAATTTTAATTA 

AGTTAGTGCTAAAGTTGAAATTTATCITAAAACATTTGGTATACATTCT 

AGAAATAATTTTTTCTCATTTGAAGTTTCAGGAGTCT^ 

T AT ATTT AAATGACAAAAAATG T ACATTTACCTTTGTT CI" ITT ATT" ACTT AAT AAAAAT ATTTTT CACT AG ATTGATGTCATGT AT 
GAGGAGGAACCTTTAAAGGATTATTATACACrAATGGATATTGCCTACATTTAT 
2 0 TAGATTATGATCGTAAACTATATTTAAAGAATTAAG^ 

ACAG AGTT CG AC CT ACTTGT AAAAG AATG AAGAT CAGT CACC AG AGAGATGG ACTG ACAAATGCTGGAG AACTGGAAAGTGACT CT 
GGGAGTGACAAGGGCAACAGCCCAGCyVGGAGGTATTCCCTCCACCTCTT 
TCATCC^CAGTTTCCTCACATTTCCAGTACTATGAATGGAACCAGCAACAGC 
GACCTCGAAAATCATCAGTAAATGGGTCATCAGCAACITCTTCTGGTTGA 

2 5 TGATTT AT AT AGAT AT CTT CATG C CATTACAGCTTTCT AG ATGCT AAT ACATGTGACTAT CGTC CAATTTG CTTTC*rrrrG T AGTG 

ACATTAAATTTGGCTATAAAAGATGGACTACATGTGATACTCCTATC^ACGTT 
GGTTTCTTGGAAAGCAGGCAAGACTTTTTCTCTGTG 

ACTCTGCAGTGGACATAAGCATTGGGCCATAGTTTGTTAATCTCAACTAACGCCTACR.TT 
TACGCrGTTTTGTGAACCTGTAGAAAACAAGTGCTTTTO 

3 0 CTATGTAGCCATGTCACTGTGAATAACGATTTCTTGCATATC 

GCAAAACCGATCAAAGAAAAGTGAACTTCAGTTTTACAATCTGTATGCCTAAAAGCOT 

TAAATGATTCGCTTTTGTAAGAATCAGATGGCATTATGCTTGTTGTACAATGCC 

TGTATGATATATTTATAAATCCTATAAAGAAATATTC 

CCTTTGC AGATACC CAT AACCT ATGTTG AG CCTTGCTTACCAGCAAAG AATATTTTTAATGTGGAT ATCT AATTCT 

3 5 CCATTAGAAGCAATTGGCA.CATCTTTCTATACTTTATA 

GAAAAACCTTTAACrGAGAAATATGGAAACCGTCTTAATTTTCCATTGGCT 
TTGATCACTATAATTCTAAAACAATTTTTTAAATAAACCAGCAGG 
TGGTATAGCAGTAATTTTAAATTTAAGAGTTGCTTTTACAGTTAAC^TGGAAT 
TATTTATTATGAGCTTCTACAGGTATTTTTAAATAGAGCAAGCATC 

4 0 GTTTATTTTTTGCTTTC^TCGAACTTGGTGTGTGTTCATCACCCATCAGTTATT^ 

T CATGTTTGTATG<3GAAAATTGTAGCTAAACATTTCATTGTCCCCAGTCTG CAAAAGAAGCACAATTCT ATTG CTTTGTCTTGCTT 
ATAGTCATTAAATCATTACTTTTACATATATTGCTCTTACTTCTGCTT^ 

AAGAT AGATATATTTTTATTTTGACCTAAATTTGTACAGTCCCATTGTAAGTGTTGTTTCTAATTATA 
ATTTGTAATTGGAAAAAATCCAATAAAAAGGATATTCATTTAGAAAATAGCT 

4 5 AATGTGCAGAAGTTATGGAAAACCTATAGAGGATTACAACAGGTAAACGTTAAAGAGAATACATTGCT 

T AAGAAGT ACATG CTTTGTTGTAAAATTG CTTGAAAGCCCATTGAAAG ATGTATCTGTTT ATTTACA 
CCAATGTTTGCCAATTTTG^TGAATTTAACrCATTTTCTAG 

G TGTCAAG TGATATTTTT ACCTTC^ ATG ATCATTTGT AT AAAG AATG AAAAAAT ATTTTT AAAAGG CT AAAAACTTT AAATG CT AG 
TGGACTGTCATTTGACTTACATAAAACCAAGTTATCAACATC^AATTCAAATTT^ 

5 0 TTGAAATTGCTTATTAATCAGATACTGCAGTTATGGCTTACTTAGCACTAACCAT 

G ATTTTTTTTCCT CCT CAATT AG AAAATGG CAT AAG CTT AAAAGCAT ATTT CAG CCAAAATTTTTT CTGTT AATGAAAGATT AT AA 
TGTTTTGTATTCTTAAATAAGCAGCTATTACAGTTTGACAATTGAGATTAAAAACCTTTG 
ACCTTATGAAAACCCATATCAAAAGCTGCTAACTTrGTAACTTTTAGTAGTCAAAGCAGGGTA 
GACTTTTGCTTCTGAAAATATTCTAGAACACAAAGTTAGTGCT 

5 5 TTTTAGCIT AAATG CAAAGAGAAAAATGAGGCTGGTTGCTGTTGTGTAAATAGTGATCT 

TACAAGTACTTTTTG CTTTTAAAATTGCCGGCCGTTAATTGCACTGCATTGAA CAGTTGTCTCAACCTTCC 
AACCATATTGCTATTTCCATATCCATACGTGATAATATTTGGACCTCTGGAG<5GAAA 
CTTTCCTTCATCCTTTCCrCTTTTCCGAATATTTTGAAGGTGCAGTTAATT^ 
TAAGGTTTTGTTATGAAAAC^TAGGAACTTAAATACCCATGTGAAATGCGCCITGCATT 

6 0 TATTTTAAAATAGTTTATTTTATGGTATCTTTTGAGGGGATAGGATTTACT^ 

AAAATCCTGGGGTGAAGGAATTGTTAATTTTTCCATTTCTCTTTGTA 
TAATACAGGAAGAATGATAGTTGGAAATTCTGATTTAGTTTAGTACT 
TGGCTGTTTTCCCCAGACAGTCTTCATTGCATTTTATCCAA 

ATTTTAGATCTGTGCTGCATCTAACAGTATACTAATTAAAACCTTTGAAAAACCGTCTATGCCT 

6 5 AGATCACTGAACCCAGCTCCAACTTAAAGCTCrTGAGATACATGATT 

ACACAGGTGATAATTTCTAAGGTTGCTCTAAAAAAATTGACTCCT 
AAGCGAGGAGCCGAGTTCTTTACAAC^GAATAGAAGGAGATATCCATATATTCCAGCm 
CCGCGAAGACCTTGAGCCAGCCGATCTCCCTGCAAATCTCAAGATCGCCCAGTTTTGATTA 
AGTTGAGTGTTTCCTGTTTCCGCAGTTAATrrrriTU'AACGTTTTTTTCTTC 

7 0 CCAGTTTCTTTTTCTTAGTGTGCAATTTCTAGAGGTCTCACATG^ 

TCAGAAGTC^TCTTATATTTATCACTG<3AAAGACTCCGCTGTAGCA^ 

T CATT CGG AAGT CG CT TT AGCCCTC CAAAT AAAT ACT AAG CX2 CT GAATAAAAACTTCCAACAAAGG AG TT AG AACT ACT CTCATT C 
TTC^GCCCCTAGCCTITAAAAATGTCCCTTCCGGGACTAT 

T AAAATG CT ACTT CTTG CTCTGA r rrG T CAGGT CAT CATGG AATG CAAACTT C ACC^TGG ACGTG ATTTCAAC G TT GGTG CGGATTT 
7 5 GGAGATGTCCAACTCTGTCCTCAATTAGCGAGAACTCAAGTTGAGATAAGAGAC 
CGCGTCGG AAAGTGGGATGCCAGGAATGGGATGGTGG CCGAGACCGACACCGAGGTT AGTG CAGGTGG AG AAGTAGGTCCAAAAGC
CO\AGTCACAGGTCCGAAGTCCTCAGGAGCGGAAGATCTTCCAACTAAGCCGGAGAGCGGGGrc 
CGCC(XXX3CGAGGGTGATTGGGGTGGGAAACTCCACTGCTTtCO^AA 
ACAGTACACCGGCCTGGGGGAAATCACCTGACTTCTCCTTTGGTAAAGGCCCCOT 
5 TTT ACCAAAAG ACACCTT AAAG AAG AAACAG CCT CT CT C CTTTT C CTTATTTTCT AATT AG CATCTT ACAG AGGAGTGGAAACAG C 
TACAGCCCAGTCCCCTGCTCAAAACTGCGCCACCCCAGTTCGGCCCTGCTGGGCGTO 
CATTTTGCGCtSTGCGCTCCTCGCCTCGCGCCGCCCTCGGCTCT 

CCTGAGGGGCTTGAGGGGCGGGGGAACGACGCCGCTCTCCAAAGTTGGACCCCGTGGCG^ 

GCCTCCCGAGGTGCTCCCTTTTCCCGCCGAAGCCCTCCA 
1 0 GCGCGGGCGGGAAAACGACCTGCACCCGGGGAGGCAGCGGCTTCGCGGGCAGAGCCCA 

CGGATAATCGCCrTTCTTTGTCCTCCTCCCTCTTCGAGTCCAATCAA 

GAGGTTTTGTTGGATGATTTTAAATAAAATTATTAAGTTATAAAGTGGCCACCCT 

ACTGGAAAGGCCTAGAGGAGAGGGTCCTCCCGCTGGGCTCGTTTAATAGAACGCGCTCGAATCCCC 

GTGACAGGGCTGAGGAGGGGTGGCTGCGCGGCGGGAATCTGAAG 
1 5 AAAATGGGTCCCCC^ACAAGGCGACCCTGGGAGTGCCGGCGCCCCCGG 

CCTCCGAAAGCTGCCGGGACGCGGCGGCTTCCTGCAGAGCCTGCGCCTGCCGGATCCCC^ 

AGCTCCCCGTCCGCCCTAAAACCAGGAGAGGAAGGGACGACTTGGGAAAAGGGACT 

AAAATGGG CG ACGGCGGG CGAG T C CTCTTT AT CAGTG CAGCAGG CTG C CGGAG CCGCCATTTGGTGGCGG ATCTCGG T AGTT CAG T 
AGCACGTTGTGCTGAACGTCACAACTGGCTTGTCTACGTGGCATCGTCATT^ 







TGGAATCCAGCGCAGACCTAAAGCACGCTTGACACCCCGAT 



CCTCACCCCTCTGCGAGGTGAGGAGGGGAAACGGCGAGCrTAGGCCT^ 

2 5 AGGGTCGGTTGGAATCCCGCCCTAGCGCCTCCTGCTCTGCCCGGTCCCCACCGGG^ 

CTAGCAGGGCCGGTGTCACCACCTCGGGTGGCCGAGGCTTCGAGGTTTTCATGAAAAGCCC 

GG AGAACAAAGGG CCGAGGG C CGAAGG CG AGGCG AGGCAGCGCG CGCG G CTCCCTTGG CTCGAC CT AG CTGGG AG TCGGGGG CG CG 

GGCAGGGCTCACTCCCGGCCTAGAAACTGGAGCCCGCCACCCCCGCCCCGCAG 

CTGCCATGTTGCCTTTCCGG AGGCCGCCAGTC ACTTGACGCTTCCG A 

3 0 TTGCCGCACAGCTGCAGTCAAGGCCCGGAGGGACTGCGGGACGCGGGCGGGAGCGAGAGCCCTGTGGG CTGCCAAGCCGGCGCGGC 

" CGCGCCGCGGCAGCCGCTTCCCTTGCCACCTTCGTTCCAGGGGCT^ 
ACCACACGACGGCGATTTGCAGCCAGGGCCGCCGCCGCAGCCGCTGGTACCTCTC 
AAACCCTGGGGGTTGCCTTCGGTGACATCTCCCGCCCCCACCTCCAGTCCTCAGTCTCCAAAATCCTC^ 
GCGCCCCCGGCTGGGCCCTGCCCCCACCGCAGACAATAG 

3 5 CCTGAGCCCCCAAATCCGGGCGCGTCGCTGAGTCTCAGCCCCAGGTGCCCTCITCGCGGAGTTG 

CCAATT AG AAAG CTTTT CTTGACATCAG AAAT AAGGG TGGTG C CAAAG AAAG AAG TCAAACG CCCACT CCCTTTCT AAT AACATCG 
ACACAAAGAAGAGCTCATGGTTGCX3TCTGCGCTTGGTAACAGTGGCTTTCCT 
TATCTTTCTTCTTCGCCTACCATGAATATTTTGCATGAAATCGA.GAATGTAACC^ 
CTAGTTATTrATCAGTGGATGGAACACTTTTTTTTCCATTTCATTG 

4 0 TTGAAACATAAAG AAAACGACTGGT ATTAACC CTTGAGG ATCT ACTTTTT CTGTGGCTCrAGAACACCAATTTAACrTTTTTTAAC 

AAGAGAACTCTTGTTAATGTTTTAAGCAGGTTTTATTGAAGAAAAGTCCTTAG 
GATAACTCGGGAAATATGTGCATTGCCAAATTTCTTCXBCTCTTAATAATTGTCCCGGCTT^ 
AAAATTTCTTGTAAATCACIAAATGTGCTGAAATTTCCAAGTTTAAAGCT^ 
GCGTCAGAGTCCTTTGAACATAACAAATATTTGTCGGACTTCCAGGGA^ 

4 5 TAACTTACTt3ATAACTAGC!AGAAAGCTTGGATATCTTTAATGAACTCAATGAA 

AGATATTCTCATTCATGAATATTTAACTTAATTATATTTCAA 

ATTCTAAGG AGAATGACAAATTGTACATAAGAACTTG AAACTG CTAATTTCT CT AGTGTCTAAAAATTTGGTT AAAT AGCCAC 

AAGTAAGGTGCAACTTTAAAGTGCTTTCATACTTTCTGATAGCACAGACATGTC 

TAAGGATCTAGCGTTGGATATTGCAAATCTACTAAGACAAGGCACCTTGAAATATC^ 

5 0 CTGAAAATAGTATGTCTCAATCACAAGTGATGGTAAACTCAAATTAATGCCT 

GTTCACCTTTAAGCTTCACAAAATATCAAGTTCATTAAATAAGACAGTTTACCTAA 
GAGCATAACGACCTGTAGCTTGCCAAACCACCCCCATCAGCACCTTAAAAATAGGA^ 
ACGGTCTTTCAGGAAAGAAAAGCTCCAACAACTTGCTTTGGACCAAGTTAAAGAGATTATCTCC 
CAATGAATTGAAGGATAAGATCAAAGTGCATTCATATTCATGTGAATTATCGCTATT 

5 5 TATTT AAACTGAAACCAGGAACG CTGCTGCTTTTG AGG CT ACAC CCCT AAGGCTG T AGAATTTCTTCTCTGAGAACTCAAAAGC CA 

AGCAATTAGGTCTGACAACCACGTCATTTCCATAATGTACTTCCGTACAGAGGACIT'1'1 
ATCTTATTACCACTCAATTTTTGACTTCTTTCCAAC 

GAGTAAATTGTACTGAGAAGATACAGTGATGTTTCAGAAC^TTTATTAATAG 
TAATTACCTCAGGAGAGTGTTTTTGTTTTGAACATGATACATTACAGTTTTTCTAGGGTAACTGAAG 

6 0 TGGGAAAATGTGTTTACCAGCTGTTTCTTT CTAAAT ATGCCTCATG ACAATTTT CTT AAG AC ATTTT CATTGGGG rrirC CT C AAA 

CTGTCCCATCATTTTCTAAGGAGATGGAAGTTACT AGATTCAGTGG AACT 
TAATTCTTATTACAATGCTTATCAGTTTGGAGAACACTTTTTAGGAAATTCATTACAA 

AG CAACCTTTAAAATAGACTTAGGAAACTG CATITTCCTGAACTTTAATATACACAATTTATAT AGCCTCTCTAAACCTAAAAAGA 
GACTCTAGTGCAACAAAAGTCTATTACATTATCATGGTCTGCAGTGGGT^ 

6 5 GTTACTTGGTTTCTCTCTTAAACTTCTGCCATAC 

ATTCAGTTGGGCTACAAGAGATCTCTATTTTCATTTACCCAT^ 
TTAGTAAATTGTTTCTCTTTGGGATATGACCATATATATATTGAAATTATTTGC^ 
CTGTTACCCTTGACTTATAAGTAAAATGATAAGCAATTGCATATTATTTCACTAG 
AGCATTTCTTTATAATTCAATTCCTTAAGACCAGATTTGTCTTACTGATGGG^ 

7 0 ATCTTTTATATOSTTCATATAGGAACTTTTATTTTTTCACCGTAAAAAA^ 

CTTTAGCAATATTrAGATGTTTTCTCTGAAATGTCAAAAAGCAAG 
ATGTAGTGCCACCGTGTCCTCGAATTTAGCCTACGTTTTAGCGGAGCTC^ 
AGATAGAGTGGTTAAACAGGAAACACTATTTTTTTTTTTCAAACG 
AATTTCCAGGAGTCAGAAC^TACAGTCCAriAT^C 
7 5 AATGACGTCAAAAAATTAAAAAAAAAAACGAAGCTCGCACATGCGCAATGAGGT^ 
CTATTGGCTCCTATCCCCGCCTCTGGGATCCGGCCTG^

GCACCCTACTCCAGCAAGAAAGAGCCTGAAGCCCCTCCCCACCTTTCCCGGCCAGGGCGCTCTTCT 

CGGGGCGAAGTAGGCCGACCGGCGCGGCCGCAGCGAGTTTTCTGGCAG03CTAGCG 

TCCGCCGGCTCCGGGCTCGCCCCCGCGAGTTGGCCGCACCXSTTCCCCCGCCCGCGGGGCAGCCGCTCCTC 

GGACCTTCGCCCCGGCCCCCGAGCGGCAGTGCGGCTCCAGCTGGAGGCCTGGCCCGGGAAGCAAAGTGAAA 

TCCTCGCCAGCCGCCCGCCGCGCCTTTCCGAGCTCAGGCCGGC 

TCCACCGCCCGCATCTCGGCCGCCGCACCCGGGTCCGCCCCGGGAGGCCCCGCGGGAGGGAACCCCCGGCCCGCTGGGCGCTTCCG 



CGCCGCCTCCCCCGTTAGAAACrrCGCAGACACCGCGAAGCTGCrCTTTC 
AGAGCCGACCCGGTGGACA 



HUMAN SEQUENCE - mRNA 

G AG AGGCAGAGATCGGGGCGAGACAATGGGG ATGTGGGCGCGGGAG CCCCGTTCCGGCTTAGCAG CACCTCCCAGCCCCGCAGAAT 
1 5 AAAACCGATCGCGCCCCCTCCGCGCGCGCCCTCCCCCGAGTGCGGAG CGGGAGG AGGCGGCGG CGGCCG AGGAGG AGGAGGAGGAG 

GCCCCGG AGG AGGAGG CGTTGGAGGTCGAGGCGGAGGCGGAGGAGGAGG AGGCCGAGGCGCCGGAGGAGGCCGAGG CGCCGG AGCA 

GGAGGAGGCCGG CCGGAGGCGGCATGAGACGAG CGTGGCGGCCGCGGCTGCTCGGGGCCG CX3CTGGTTG CCCATTGACAGCGGCGT 

CTGCAGCTCGCTTCAAGATGGCCGCTTGGCTCGCATTCATTrrCTGCT 

CGATCGCCTCGCGCCGGCTGCTCTTTCCGGGATTTTTTATCAAGCAGAAATGCATC 
2 0 ATCCCCACCTGATGTGTGTGCTTTGTGGAGGGTACTTCATTGATGCCACAACCATAATAGAATGTCT 

TGTATTGTTCGTTACCTGGAGACCAGCAAGTATTGTCCTATTTGTGATGTCCAAGTTCACAAGACCAGAC 

GT CAG AT AAAACT CTC CAAG AT ATTG T AT ACAAATT AG TT CCAGGG CTTTTCAAAAATGAAATGAAG AG AAGAAGGGATTTTT ATG 

CAGCTCATCCTTCTGCTGATGCTGCCAATGGCTCTAATG^ 

GATGAGATAATAAGCTTATCCATTGAATTCTTTGACCAGAACAGATTGG 

2 5 GGAGGTGAATGATAAAAGATACTTACGATGCCCAGCAGCAATGACTGTGATGCACTTAAGAAAGTTTCT 

TACCTAATACTTTCCAGATTGATGTCATGTATGAGGAGGAACCTTTAAAGGATTATTATACACTAAT 
ACCTGGAGAAGGAATGGTCCACTTCCATTGAAATACAGAGTTCGACCTACTTGTAAAAGAATGA 
ACTGACAAATGCTGGAGAACTCGAAAGTGACTCTGGGAGTGACAAGGCCAACAGC 
GTTTGCCTAGCCCCAGTACTCCAGTGCAGTCTCCTCATCCACAGTTTCCTCACATT^ 

CCTCAGACTGTTAAGGAAAAAAATTTTAAACCCCTGATTTATATAGATA 
GACTATCGTCCAATTTGCTTTCTTTTGTAGTGACATTAAATTTGGCTATAAAAGATGGACT 
GTTCAAAAGAAAGATTGTTGTTATAAAGAATTGGTTTCTTGGA 
GGTTTCTGTAACCATTGTTTGGATTTGGAAGTACTCTGCAGTGGACATAAGCATTG 

3 5 TACATTACATTCTCCTTGATCGTTCTTGTTATTACGCTGTTT^ 

CCAACGGAAAG AATATG CATAGAATAATG CATTCTATGATGCCATGTCACTGTG AAT AACGATTTCTTGCAGCTATTTAGCCAT^ 
TGATTGCTGTTTGATTTATACITCTCTGTTGCTACGCAAAACCGATCAAAGAAAAGTC 
AAAGCGGGTACTACCGTTTATTTTACrcACTTGTTTAAATGATTCGC 
GCCATATTGGTATATGACATAACAGGAAACAGTATTGTATGATATATCTA 

4 0 AGAAATGATTGTTAAAATTCrCCCAACTGGTTCGACCTTTGCAGATACCCATAACCTATGTTGAGCCT 

ATTTTTAATGTGGATATCTAATTCTAAAGTCTGTTCCATTAG 

TAATACATGTTTACTTTAAAAATTGTT'GCAGTGAAG AAAAACCTTTAACTG AGAAATATGG AA^ CTT AATTTTC CATTGGCT 
ATG ATGG AATT AAT ATTGT ATTTT AAAAATG CAT ATTGATCACT ATAATT CT AAAACAATTTT T T AAAT AAAC CAG CAGG TTG CT A 
AAAGAAGGCATTTTATCTAAAGTTATTTTAATAGGTGGTATAGCAGTAATTTTAAA 

4 5 AATATGCCTTCTCTGCTATGTCTGAAAATAGAAGCTATTTATTATGAGCT^ 

TTTAAAATATGAATAACCCCACCCAACAATTTTCAGTTTA 
AGGGTGTTTATTCTATATGAATATTGTTTCATGTTTGTAGGGAAATTGTAGCTA 
CACAATTCTATTGCTTTGTCTTGCTTATAGTCATTAAATCATTACT^ 
TAGTAAAGGATGTTTTATGAAGTCACAAGATACATATATTTTTATTT^ 

5 0 TAATTATAGATGTAAAATGAAATTTCATTTGTAATTGGAAAAAATCCAATAAAAAGGATATT 

TAATAAAAATTTGATATGAAA 



HUMAN SEQUENCE - CODING 

ATGCATCGAACAACGAGAATCAAGATCACTGAGCTAAATCCCCACCTGATGTGTGTGCTTTGTG 
55 AACCATAATAGAATGTCTACATTCCTTCTGTAAAACTTGTATTGTTCGTTACCn'GGAGACCAGCAA^ 
TCCAAGTTCACAAGACCAGACCACTACTXjAATATAAGGTCAGATAAAACT 

TT CAAAAATGAAATGAAGAG AAGAAGGGATTTTT ATGCAG CT CATC CTT CTG CTG ATG CTG CCAATGG CTCTAATGAAGAT AG AGG 
AGAGGTTGCAGATGAAGATAAGAGAATTATAACTGATGATGAGATAATAAGCTO 
ATCGGAAAGTAAACAAAGACAAAGAGAAATCTAAGGAGGAGGTGAATGATAA 
6 0 ATGCACTTAAGAAAGTTTCTCAGAAGTAAAATGGACATACCTAATACTTTCCAGATTGATG 
GGATTATTATACACTAATGGATATTGCCTACATTTATACCTGGAGAAGGAATGGTCCACT 
CTTGTAAAAGAATGAAGATCAGTCACCAGAGAGATGGACTGACAAATGCTGGAGAACT 

AACAGCCCAGCAGGAGGAGTTCCCTCCACCTCTTCTTGTTTG CCTAG CCCCIAGT ACTCCAG^ CAGTCTCCTCATCCACAGTTTCC 
TCACATTTCCAGTACTATGAATGGAACCIAGCAACAGCCCCAGCGGTAACCACCAA^ 
6 5 CAGTAAATGGGTCATCAGCAACTTCTTCTGGTTGA 
Table 30

MOUSE NOMENCLATURE
ICSGNM Runx1
Celera mCG13559

HUMAN NOMENCLATURE
HGNC RUNX1
Celera hCG401175

MOUSE SEQUENCE - GENOMIC

NNNNNNNN1WNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNlfNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNONNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNONNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NCTCTCTCTCTCTTATACACCGACTTCTGTCTTCTTC^^ 

2 5 CCATTTTTAACCGTCTCCTTTGGCTACATTGTA^ 

AACTACTGTAAAAAGCCCTCCGTCTGTTAACTGTTCTACAACCAGCTCATATGGGTTCCT 

ACAAT CTTAAGAGT CT ATTTT AGGATTCTTC CT CAT ATT CTTG CTT CTG CTTCCTAAGTTGCTGG CAGAGTAATTGAATCTCT CCT 
GGTTGAGGATAAATGGGGGTGTGGAAAGGGGCCAAGGATTGCATACTGAGG^ 

AGGTGGCCTCTT ACCAAC CTCACCCTCCATCATTCCAATG CTGAG ACACCAAGTATAGTTCT^AAGGTCATCTGGGAG ACATGGAGA 

3 0 CCATGATGAAGTATTAGGATAATCCATGAGAGGGAATCTAACCTGAGCAAGAAAGC^ 

AGGGTG CTCCATAGTCCTGCAG CACAGAGCTGAAG AG CTGGCCCTGGGTTCTGGGT ATGTAGAGATG CAAGGTT AGCAGAAGCCTG 
CTAGAAGG ACTGGTGTAG AAAAGCTGTGGGAATCC CTTGTCAGG<;TAT ATTCGTAAGAAAT CTGACAAG AGGTCAGAGAATTAGCC 
CTGGAGTGGGAGCAATTGGGATGGGCATGGGAAGAGGAGAGAAATCATAACAACC^ 
GCAACAAAAGGAAGGGAGACACAAGTAAGTCCATGG^ 

CTCTCTCTCTCTCrCTCCCCTCCAGTCACCTATACACTTTTGGTGGACTA 

TGGATGTAGTGGAACCCATTGTCITCATATTGAGGTTCAGTCCTCTCAGCTTATGTGAGATACCA 

CTCTACTGGCAATGGCACAGTGACCTGAATTACAAGTGCCrAGCTTTGCTCTCT 

CTCACCTCAGCCrCTCTAGCCCCTTCTCTGGGATTCAGAAGGTAGAAGCT 

4 0 TTAGCACCAAAACCTCTTCTAAGCATCCCCAGGTGCTGCTGCCCCGTCCT 

CACATGCTTGCCAGTAGCCATGATACCCCATTTGCTGTCTGTTATTTGTCCTCTG 
GGACCATTCTAGGAAGGCTGGGAGAAAGAAAATCAAATATTTCATGCT 

GGTCAGAAAAGACTAAAAGCCTATTATAGATTTTAAAATCTCTCAACCAAATAGCGATTGCCTG 
AAATAACTACAGGATAGAAATTCTACATATCTGAGATTCCCACATGAAGAATTTTTGAGArrTTT 

GAATTACAAGAAATGAGGAGGAAAGAGGGGAAAGAGGACATACT 

GAAACATAGTTGGTCCATGCCTGTGGCTGTGCATGGGCTTGTGTATGTGTGTGAGTGTGTATACCAGTGCCT 

GCATGTGTACATGTGNNNNNNNNNNNNNNNNNNN^ 

NNltfNNNNNNNNNNNNNNNNNNNNN^ 

5 0 NNIONNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GGGGTGAATGTGCATGTCTGTGCATACTTGTGCCTGTGTGTGCACACTAAATCTGCATGT 
AGGCAAGTTTCCTAAGCTCCCTAAAGGGGAGGCrrCCTGGCCTATAAAATGGGAGTGACTATAGGAT^ 
ACAATTCAATCACTGCTGGTTCTATGATGGCATACAGTGGGCAA 
GTAGCCTCTCGTCTCTCTCCAAGCCCCAGGGTGGCATCTrCCCCAGCTCCTGGAGGTC 

5 5 cCTTTATCTATAAGTGTCrGTTCACTGAGTTCAGCC^^ 

ACAGT AACACACAGTC CT CTC CAAG CACAGT CG CITTTG CCACGTTGG CTTTTCTCT AAGAATTGTCGAAGT ATT AA 
TAGAGATCTGCGCTCCATTCTGCTTTTCCAACCACTCAGCCTGAGATG^ 
GGTTGGCTCTACCAGGTCrCTGGAATAAATTTACCTGTCCTAGTCAGTCCT 
TAGAACCAAGCTTGCCTTAACTTCrrGAGTCCATTTGTTATTTCTTACC 

6 0 CTACTGTGTTCAAAAAAAGGAAACTTTATGGAGCTTTTTTATGCC 

CCTTGGAGGCCAGACATTTAAGTCCCTCAGTCTGTCATTAGA^ 

GCTTTTGCAAAAAATATATGACTGTGGAGCTGCAAACCTTGACCTCCTGCCTATTTAAACAGCX3T 

CTCTCTTCTTCTTCCTClTCTl'CTTCTTCCTCTTCCTCTTCCrCTTCCT 1C1 rCTTC 

TTCTTCTTCTrC TT CT T C TTCTT CTTC TTCT rCTTCTTC 1 1CT 1 CTTCrrL'XTL'l'l'CrTCTTCACAACATCGTTGATAAGCCGTCT 

6 5 TGTCCTGCTATGGAATGTGACTACCCAAATCCTAAGGCATTGAGACCGACCATGTGCCC^ 

CAGTGTACAAAGCTAAGGAAGGGTAATTTCTAACTCACATTATCTGT^ 

CCCTGTACCTT AGAAG CAG CAGACTG CAGGG AGGAAAG TGTCTTC CTGGG AT CTT AG CAT ATGCTG ATGATTTCCAACAAT ATTT A 
AGGCTCTGACATTTCAAGAACGAGTGGTGTCAATGTAAACA 

AGGAGGGAACCAACCAAGGGGGAGGTGTTTCTGAAATGATCTACCACTAGAGGAAAGCAATTGGAAC^ 

7 0 TCTTTTAGAAAATGTATTCTGAATTGTTTATTTGGATAAATACTGGCT 

TTTTCTCCTTGTG CTTAGCCTCTGCCATGTTACTGATTTT AAA^ 
tn^NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNIINNNNNNNNNNNNNNNNNNNNNN^ 
75 NNNNjijiNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NiTONNNNNNNNNlWNNNNNNNNin^ 

NNNNNNNNNNNNNNNNNNNNNW 
5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNKNNNNNN^ 

NNNNNNNNNNNNKNNNNNNIW^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNIW^ 
10 nnnNNNNNNNHNNNNNNN^ 

NNNNNNNNNNNNNNmJNNNNNN^ 

NNNNNNNNNNNNKbWNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNIWNNN^ 
1 5 NNNOT ATT AT AT ATTTTGGAGATGATTT CAT AGAAT AAG C^CACAAAACTTTCAGTTTT CTG AG AC CACTT ACAGTT AG CCAGT CA 

CGTAAGGATGTAGAAGAATAAGTCAATAAAACCGCTCCGCCCTCCATTC 

TTTAAAATTCATGGTAGAAAGTCCAAATCCTGTTGTGCTCACT^ 

AGCAGCTTAAACAAATCTTTCCCTCGCAG'IT' ^ 

TGGAAGTGAAAGTTGTAAACCTTTCCTTTAGCTCTTGGTGAGGCA^ 
2 0 GGTGGGAAGGAATTCCTCATTTCCCTCTCCCGCTGCAGGA 

GCCCCTCGGTAGAGTCCCAGCTGCGCG CTTG CTGATTG CTATTTTG CAGCAAAG ACGGGG CG ATG ATI i K3TTTGT CTG TG TTTTT 

CIAAGTGCGTGTCAAGTGGCTCAGTCACAACCCACACAGGTCCAGGGATGC^ 

CCTGATCGAGCTTCGAACTAACTGAGTTACTAGCAGTAA 

TGCCTGTGTTCATGTAACTGTCTTTCCTAGGCTTCTGCCAGACAGACAA 

2 5 CGCGATTCCTITAATCTTTTGAAGGGCCAGAGAGACACCCTTTCC 

GATGATGCCCTGTCTOXSTTGGTCTGTGTGGATGTGCTTGGCTGTGGCT 
TGCATTTGAGGGGAGTCTAGCTAGCTATTTGAGGGAGTCTCGCT^ 
CCTGGCTGCACTAGGAAGCTCTGGCCCTGGCAGGGGAGCGC^GGCAGCGGGA 
TTTTAAAAGAAAAGCAAGGAGGAAAAAATGTCTTCCCIAAA 

3 0 CTGTCACGATCTTTGTGCTGAAGGAGGAAAGTCGGGCCCACTCGC^ 

TCGCTGGACACCTGAGGAACTC^TGGTCCTGAACTAGCATAAGGC^ 

ATTTGGGTGGATCACGACAGTAGAGACCCACCAGCCGGGGGATTCCCATGCAAGCTGGACCGGCT^ 

CGGGGGCGCTCAGGTGGGTTCTGGTGGCCACTTCCTAATGTCTCTCCCCAAACTCACAAGAT 

ACTGGTCCTCGCGCCX3TCGGGGCX3TGCCCAGTGCCCTGCTCCGCTTCCCAGGACGCCTAACGAGGGCCGCTC 

3 5 CCGGTCTGAGCCCCGACATGACCGTGAAAGGCCTGCCTGATCCTCCATGCCGCCAGCTCCTACCCG 

CCCIAAACTTCAGACAAGGCCTCCGTGCCGGGACCrCCACAGCTr 
AGGAAGACTACACGCGGGGACCAGCCAGGGACCCGAGCACAGATGAAAGTGGAGGGG 

CIATGG AGTGGGTAGTAAGAAGTTCCGGAGGAAGCAAGG CCC AAG AGAAGTAAGGGACCCCGGAG C ACAGG CCAGCTGCTCCGGCTG 
GGGACAGT AG CCGGCAGCGAGCCAAG CAGGAAGGAG ACCCTT AAAG CT CTTCGGCGAAAATGCC CTCGGCGCCG CGGGGACGAACA 

4 0 AGGTGGGGGAGGTGCCCCTACATTGCTTAAGCGGAGTGGGTTCGTTGCCTGAAGGAGAGCTCCGGACT 

TGGATTCGAGCTCATGAGTTCGGGGTGGCCTCAGTCCCCCATGCTTTCAACTTTTTT^ 
TAGAAAACTATCTCCTCTCC^GCCGCTGCGCACACTCAGAACGCGGCGCGCTGCGTAATCAAGCCGCAGCC 
GACACTAAATAAAGCAACTCTGCGAAGGAGAGGATGCCTAAAACCCAAAA 
GACTCGGTGCCACTTCTCCTCAGCTGCTAGTCTGGCCCAGGGAGGCGCTCA^ 

4 5 TCTTGCGGCCGAATCCCACCCTGTCCTCCCAGGACCCCAAGCCTTGGCTCATGGACT 

GCCCAGGGCTTCTCCGGACGCAGCCCTCGGTCCCCAAGCCCGACO^ 
ACCAGTGGCGCCGCTGCGCC03GCAAACCCACCCAGGG 

CCTGGCTTCGGGCAAACATGGCAGCGTGTCCCTGTGGGTGCGGGGTCGGCGTGGGT 

5 0 CGAGTC CCGGGCCGGGCAGCGT ACAGT AGCGCGCGGGGGCGCGCGGGCCAATTCC^ CGCC CAGGCTG 

ATACCC^AAAGGCCTGTGATTGGCCGTCGGTCTCCTCTATGCAAACGAGCTG^ 
AAAGCACGTTCATTTCCAGGCACTCTCATTCATAGAGCCAGCGGGCGCGGGC^ 
CCACGCTGCCCXX3CCCCGCGGCTGGC^CITCCATCCTGAGCT
TCTGCGTTGGACCTTCTTGTTTTCCTTTCCTTTCCT 

5 5 TG AAC CGG CTTCT AGGCTTTGG AT GTTTCAACTTCT AGG CTG TG AACTTTT CT CCG ATTGTTT CT I i rGACAACT GCGGG AG AAAG 

TGTTCCGGAGTTCCAGGCGAGCXjGTCAAGTTGCGTGTGCGTGG 

CTACGGTG CCTCCTCCCAGG CCCGCTCCACCTGCTGCGGTTACACGGTGCTCGCTCGTGGGTATTTGTG CGCCCC AGCAGAAAACC 
AGCCACCGCCGGX3CGACGTTGGAGGAAGATCATCTTGCCAGACTCTGGGCTGACT 

6 0 GTTCCCAAACCAGAGTCCTCAGCTGTGGGAAGCTCTACCCTGAGTC 

ATCTTTGTAAAGCACTGACCATTTATTTGGCCAGAATTTATTGAGTAATGACAAGT^ 
CTGATTTTC!AAGGCTACTTAAAAAAAAAAAAACTGCAGCAAACATT^ 

TGTAAATATTTTATGAAGTAGATCATATGTATATATTTATATATACGAGCACATACATTAGTAGC^ 
TCTTGCTTTTGAGAAACGAAGTGAGTTTTTCATGGTAAGAGGGGCGCTCT 

6 5 GACTCTAAACAAAACTGACCCGCAAGAAAGAAAAACAAACTGACAGGCAACTAACT 

TCATTTTTTGGTTGTTG TTGTTG TT CTG CG CATAAATATTTTAGGACGCGTATGGGAATTTTGCCTCCGGGAC CGTTTGT AAT AG C 
CAAAGACTGAACTTCAACTCTCAAAGCGAGGCTCTGTTGGGCATTTGAC^ 
GGCCAGTTGCTAACCCTGCTTGGGTGTGAGGCCGATCCGTGTAACTAGATCT 
CCCACAACCCTCCGGTAGTAATAAAGGCTTCTGAAC1TGTATGTTGGTCTCC 

7 0 GCCGTCTAGTAGGAGCTGTTTTCAGGGTCCTTACTCAATCGGCTTGTT 

CTTCACGCCGCCTTCCACCGCGCTGAGCCCCGGCAAGATGAGCGAGGCGCTCCCGCT 
CCAGCAAGCTGAGGAGeGGCGACCGCAGCATGGTGGAGGTACT^^ 
TTCCTCTGCTCCGTGCTACCCACTCACTGGCGCTGCAACAAGACCCTGCCCATC^ 
CGGGAGGAGCAGGTGGAGGTCAGCCGATGGCCAGAGTGA0CC"IAGGGGa3CAGAw 
7 5 gTAGTAGAAGTCTCTGGTCCTTCTCACTGAGTTTGGGATGGGGGGGGGGATGCTCCG 
CTCftGAGCTATCCAAGAGGGAG TTTTTTTA

CGiUUVTAGAAAAAGTTAGAACCTTGGATCGGGCCAGTCCATCTTCCTTAAGG 
TTGTATCTCTCAAATGAAAGCATCAGGAAATAAAATAAGAGATGCTTTAGT^ 
CCAAGGGAAACAGCCAGGGATCACACACACACAC^^ 
5 TTTTTTTTTTTTTAAATCGCGCTATAAGATTCAGCGTGTCCAAAG 

AGTTTTTCCGAGCCnn"AGCCTG CTCCTGCCAGAAG CCGCAAGTATTG AATCCTGTTGCTTTTTCTGGAACCTCCAGAATATTATC 

GCTGGCTCACCTAAATTAAGCCCCAGGCAAGAAAG^ 

GTTTGAGCAAGGGGCCTTGTAGACCCGGCAACTTCAGGTGGGCAGCT 

GCTTTGTGTTG CGTCT AAAACTTGG CTTTCAGAAG CT AT AT AT AAACAACCT CGGG CGGTTT AAAAAAAAAAAGAATAAGAAGAAG 
1 0 AAGAAGAAAGAAAGAACCACCCTACAACTCTCGCAGGGAGCTGGAGGAATTTA^ 

TTT AAAAGGT ACAG CTGAGAG GAAAAAGACTCTGTCAAGTTG TGTTTTT AAAAGG CT AAG AAGG CGGGACCCG AG CTGAAAAGGGA 

AGCTGCTCrCCrTTlTCriTTTTC X TTTTTTCAATTTAGCTGAATA^ 

TTCGTAGTTCCAGGCCTCAGCTGGATCTTGATCTAATTACAATGAATAGCGCT 

TCATGCT AAAAAG ACCAT C CAGATT AAAAAAAAAAAT CAT AGAAGTAGAATTTGCTGCTG ACTCTGT CT GT AG AATTGCCACCTTT 
1 5 ACCACTGTTGTGAAAGCAACTTTTTAAAGGGGTAGAGGTGGGGTTGAGAGTCTGAAGG^ 
TTCCAAGTATTGGGGAATCCGTCCCAATTCCTTTAGAAAAACAAATTTAGCT 

T CTGGG CTCAG CC CTAAGGTCACTTT'CAGAGAACAG AGAGAAAT CAAACTT ATTGTAGT CAAGG AAAGGGCCAGTTAAGGT AG AT C 
TCCCTGCCAGCCTAGCTAAAGGGACTXjTGGCTTCCTT^ 
GTTGTTATTGTTGTGGTTGTGGTTGTTCTTCTTCGTCITAGTCT 
2 0 TTAAAATGGTCTTCAGGAGTTGACXSTGGTCCCGTCTTC 
CTTACACATTGCCrrTTACAAGCCACGAAAAATAATTl'l^ 
TGATTTTGGAAAGACCTCAGTGTGTTTCCCTGAGTGACAGCC^ 
QAGACCTCTGGATTAATATCTGTCACCACCTATGCTGTCCCACA^ 
ATTTGCGGAGGGAAAGGAGATTTAAAGGCTGGGTTTAAGAGGAAGCTG^ 

2 5 TGGGCCCTGGCCCCTCTGTGTGCCAGCTCCTACCAAACACACAGAAGC^ 

AGATTCCGGAATCTCTTTAGGGGTTAACCCrGCAGGACTTATCCTATTCCCTC 

ATGTGAAGCCCTGTTCAGATCACTTGTAGAAGAGCATCTCCCGGAACCITCCX5GAAT 

AATTATOVGGAGACTTACAATCATGAAGTCCCTTGAA 

CCACCTATAGACTGTTTATCTCAGCATAGAAGAAACACTTAATTTCACTGCCGAGAAACTGGGA 

3 0 ATTTGCIAGATACCGTCTACIAAAGTTGTCTAGGTGACTTGCAG 

TACACCAGAGCACAAAAATATAAACATACCTTTATCAGGGAAGTTCAAAGACC^ 

AAATAAATAAATAAATAAATAAATAAATAAATAAATAAAATTCTGGCCGGAAGACCATGATCCCTAGCTATGTGTTAAAT 

TCAGATAAATTCTACGTTCCATTCCACAGCCTCrTTGAAACTCT 

TTCAACCTGTAAAATTGTTGGAAAATGCCAGTTTCAAACTAAGAATAGGCTAGCC^ 

3 5 GAAGACAGCTGACATTTGGCCCAACTTCCTGATACTTGAAGAACAG 

TCCCTTCTTGTCTCTTTGGTCTGTACTAAATGTAAATGTTTCTCTTG 
TTTGTATTTCTGTTTGAAGTTTGATTTCTTTTCATTATTTCT 
TCTTAGCTTTAAATAAAACAAGAACAGTGCTAGTAACCTAGCCCT^ 
CTXK1ATTTAATTTTGTCTCCTTCCCITCTCCATCAAAACTCTTTGGAGATAAAT 

4 0 GGATCCCCCCTTCCTGCTTCCITTAGAAATGGGGGTGATGCA 

AGCTTTTCTTGGCCACCATGGTCCTCTTGTGCTGTAGACGAGGGGGC 
ACAACCATGGTAAACTGAGGCACGGTGGGGCAC!AGTGGGAATTCATGACrTG^ 
CATTTTGGCCTCTTAGGAGATATGGGCCCCAACAGTCAGGCATTCAGACT 
AGTGGGTGTTGAGATGAACTGGAAAGGCCTAGGTCTCCACAGGGAGATGAAGGCAAGG^ 

4 5 CATC CTTGTGAGCAAACTGTG CAGAGGGAAATGAAAACATTGATGAGAGAGATGGGGGGAAAAATAGACAGAAAACAGAATTTACA 

GGAGTAACATTTTGTAGGAATAGAAAGAAAAGATAATAATGATCAATTAACCGAGCAGGGGA^ 
AACAAAAGAGTTGT(rrTATTACCACTCCTAATTAAAGGCATCAAA(2ATTTGTGGTGGGA 
GAGAATAAGCX3TTTTTAAAATCrrCCTTTGTAGAAAACACGTCCAA 
AGGGTCCTTGTTTCACAAAGCAAGTGAGTGAAAACAGATAAAAATTAA^ 

5 0 GTGTCCTAGTGATGTCCTAGGGGTCCCCTCTGTCCATTTCTCAGGGTAGC^ 

GGAGATAGAGCAGCCAGAATGAATAAAATCGCCCAGATAGCCATCTGC^ 
CCAGTAAGAATGGCGAGCAGAGAGACOVGACCnTGATACCAGATTAGCAGATCTGCGGCGGC^ 
ATCCCTCTGATAGATTGTGAAGCAGCCAGTGTACAAAAACTGAGAAAAACAAAGTGGCAGT 
TCCCGGGAAAACCTGCCCGAGGATGACCACACAACTGAGATAAATAACAGACATCTCACCGA^ 

5 5 GGTTGGAACAAACTGAACTCTCGGGGCATGGCGATTTGAAATCCTGTGGTOSCT 

TCAAGTCAAAACrGGAAACTTTAGCrcAAGTCAAAGAACGGCA 

GAGGACTGGGAGTTCAAGGCCATTCTAGGCTCCATCGTGAAACCATAGCTCTCTAAAAC^^ 

AGCAGTGGGTAATGTTTTGTGAAGCACTTTCAAGGCTTCAGAGTGTTTTAATCTG 

AAAGTAAGCTCATCTACAAGTTGTCTTTCCAAATATACT 

6 0 TTCATACAGATGCCAGACAGCAGTCnXKSTTCTACCTTTGA 

TCCCGCTTGTAATTAATAATAAAGAAGTGGCCCGTGTCTAGTTTTTCTCCTTGGGTGCAAGAGT 
AGCGGCAGGAAGAGGTGCCAGAGATTAGCTGCAGACAGACAGTTTAAGTATTTCCGC^ 
CACACIACACCTCTAACCCCAGCACTAGCAGAGAAGAGAGGAAGA 
CGGGGGCCAGCCTGGGCTATCTGAGACCCTGCCTCAAACAACAACAAC^CAA 

6 5 ATACITTCATCCAGGTTTCCTTAGGTGCIACCAAGAAGGGGC 

AACATGATTCATGATACAGAGAATGGGTCTGATTCCTAGAACCTATC 

TAATTTGCTCCrCTGGCCTTCATGGCCTGCTCTCAAGAAA^ 

TTTTGAAGTGGATTAGATCACAAATCTACAAACA 

ATGATGACGGACGGCTCTTCTATA 

ACAGCCAGGTGCCTGTGACAGCACTGATGTCTGCATTTGTCCC^^ 
GCACTCTGGTCACCGTCATGGCAGGCAACGATGAAAACTACTCGG 
GCGAGATTCAACGACCTCAGGTTTGTCGGGC^AGU^TAGAGGTGCG 

7 5 TTCCAAATAGTAGCCGTTTCTTATTATTGATGCAGCATAATGATATTTATTTAA^ 
ATTATCACGTTTTGATGACTAAATTGGCTTACAACTGACTTGGAACGCACAGCGTGTGATGGACTTGGAA
CGTGAATTGTGAAGTTCAATAAAACATTCCACCXSTTTTCCTAAACTC 

TGTATAATGGGTGGTCACTATGGGATAATCAAAGCCATGAGCCTTTATCATATGGTACTTACGCT 
TTCAGTTGGTCACATCAATAATTTTGGGGGAACCCTTATTGATATAGTTrTAAGTAA^ 
5 ACTCTT ACTGGT ACAT CTGAG AG CAGTGTGACCTTCACT ATG C ATTCTTTGTGT ATGG TTTGGTGGG AGAAGCCAGG CTCACTT AA 
CATTTAGATACTGCTCATGTGAGAG ACCAAATTATTTCCTT AGCCCAGCTG^ CCT CACCTTTAACCAGTT 

ATTAAACTATTGGAGTAGAAAGAGGACCTGCTCTCACTGTAGGCTTC 

TTTGTAATGTAGAATTGAACGATCTTAGCCGTAAATGATATCTCTAGC^ 
1 0 TCAGTTCTCAGAGAGGGGAAGACACAGGGAGGGGCTAGTCGTGTCCATCGCT 
TTTGATCCCCAGCACCAAGCAAGCAAAACAAAGCAGGTAGA 

ATTCTGAAGGCGTTCAGAACAGTGGAAATTCCGTGTTCATAGTGGGAAAGAGGAGA^ 
TTTGCTCCCCTACCCCTGCCCCCAGATGCCC^GGGCATTTTCAGAATGATGTCACA^ 
TGTTATTGAAAATATGTGTTCGCGCTACTGTGCTAATAATAGGGTGATTGAAAGAAGAGTCTACCC^ 
1 5 AGCCTGCGCCCTTAAGTGGATCCACTTGGAATATTGAAAGTCTGCCTTTCAGCAAATGGAA 
GTTTAGAAAATATCAGGGAGGTTTCAGATGAGTCGCTGGCCGAGCGG 
CCCACGCCCACCTGCCTAGCAATAAGCTTAGTTCGTAGGATCTAGCTGGGTTGCC^ 
AAGTGGGTCTTTTTGTCATGTTTGGCTATGGAAGTTTCGAAGACTATTTAA 

GCATGGTGGCTTTGTGACGACCGTGGCAG CCAGTTGATGATGATCAAGAGTGGGGTTCTC CAGCGTGCCTGCCT ACCTTGGT 
2 0 GGTCCTATGTGCTGGTCGCCTATCCCrCTTTGGCCATTTC 

CCCCCCCCAGGAAAACAGCAGCACCCAAGGCGAGGCTGGTTTCCCATACCCTGGAATG 

GATTATATTTGACTTGAGCAGGCTCTCCTCTTCATGAGGGGTAACGAGTAATTAAA^ 

AGTTATGGTTTCTATATATAATGTTGGTCTTGAGTTTTTATGGTTTAAATTTT^ 

2 5 GTAATAAACATGGTTTTATCTCATAATTTCACTTTGTAG 

TGGATATTCCCCAGCCCAGCCCA.CCCCACCCTCTCTCACrrCTCTTTCrCTCCTTCT 
AAGAAATGACTCTCACTAATTCTCACAATTAGCCCTAATCTCCCTGGCTGAACCTTATCCTTTA 
AACGATCTTAAGACCCGACTATTTCATTTTTCATTTTTTTAAATGTGCCT 
AGGTCCTGGCCTTCCAACACATCCACATTCCGTAAGGTCAGTTTGGG^ 

3 0 CCGTGTGGGAT AAAGGGT AG AG AT G ACATG AATGG ACAC CCT CCTTTCTTG TT ACCAG ATG CCAG ATTTTGAGGATTGTGACC CT C 

AGAGAAACTGCCAGTTAACCAAGGAAGGATTTTTCT 

CCTCAGGTTCAGC CTGTGACTGGG AAAACCrGCCTGTCTGGCTTCT AAG CCCTGGCTGCCTGCTGG CC CCTGTGAAAACCTGGTG A 

AGGCAGAGGAAGCTCCTGCCTGCTTCCTCCATGACCATGGCTTCCGCCTGCT 

GGAATCTTTCTCCCTTTGATTCCCAATCTGCATGTTGATTATCrGAGGGG 

3 5 GGAGTTTAGGGTGGGGTGAAAATCAGAGGGGATCTTGGGAGACCCTCTCTGAAGTCCAGAGAT^ 

CCAG AACAATGCATG CTTTAGTCACAGGGCCTGAGATTATCAGTCATCTCCG AGTTCAGGG CAAAAAG AGATCTCTTTCATGAGG 
GAGGTCAATTAGCTGAGCCTTTTTCTCGGTCTGTAACITGCTCCC^^ 
GCAGTGACATCTGCAAACACCCATTTCCTGCTGGGGTGTTTCACC^GCTGCCGAAC^ 
CJVTGCCTGTCACAGGGAAACAGATCACGCAACTACCAAACATGACAAC 

4 0 TGCTCACTTGTTCCAAGTTGGTGATTCTTTAAATCTCTGGAAG CTGTCATAG AT AATTCTGAAAAT CCCCCCAAAG ATTTCCT AAG 

GTCCAAAAAAAAAAAATGCACACGGGCCCAGATTTCTATAGTGCAGCIT 

TGAT ACT AAAACG TG CG AAT CTGG AACGTG TCTTGG ACTG CT G AGGGTCAG TTCAATG C CAAGT CTT AAACACAG CCC AT ACT GAG 

AAGATAAATATACATGAGGACCCACAAGGTTGACATTTTCCTG 

GACCTGGTTGGTAGAGC^TAGGAAGC^ATAGCAATTTAAAAGGCGAGGGCAGGCATC 

CACATCCCCAGAAGACCAGTAGTCTGCTGCITGCCAGGAACAGGTCrCCIT 

GCAGCTCAGAACTTGCCCAGTTTGTGCTTTTCTGAAGTGA 

TTTATCCCGTCACAGTCTGGGCTCCTTCATATTTGTGTAGTGTCG^ 

GTACAGTGTGGGATGTTGTCTCTCTCGGCTGAGCGGAAGGCTGTGGTGGAGACGCGTTGCG^ 

5 0 CAAAAAGGGAGGGCCGCCTITGGTGTGCGGCTTTTCCGGGAGAGTTGGTCTTAGTC 

TCTCTATTTGATCACAAATTCATAGAGGAAGAAGAAATCCAACTGCCACCCCTAGGGAT^ 

ACTAGAAACTGATTGTTGGGTCTCCTAGGGAGAACAAGAAGAAGGAAAAGGTGTGCGCT 

TCCATTTTCTTTCTGTTTTTTTTTTGGTTCACGGTTTGT^ 

AAACAAAAATAAAATC^TACAAAACAAAAATAAAAACTAAGCAAAAGCCAACAAAACTAA^ 

5 5 AGCCCCGGCCGGTCCTAGCAAACACTTCCTTTCCACAGCAC^ 

GCCTGTAAGGATCAGGTTTCCCCTTCTCTTTATTCCACAATAAGAGG^ 
GCTTAAAGCGTGGGGTGGCCGGAGCCTCAGCAGGGACCAGGAGCT 

CTCTACCCCAGCGGGCTTTACAAAGTGCCAGGCATGGCGGCAGGTAGCTTTTTCTAGCTGA 
CCCATGTGGGGCAGAAGCAGCCCTCCAGCCCTCCTATGGAGCCCCCTCCATCCCrrTCCCCAGC^ 

6 0 GCCCTTTTTGGTCAGGACTAAAGACTATCAATCAGAAAGTACCTGTGAGAGGAATAATGGTTGA^ 

TGAGAGCATCATAGACATAAACAGTGATCCAGCTAGCCACTCAGTGAGAATAGTTCTAACAAA 
TTCTTCTTCrTCITCTTCTTCTTCTTCTTCTTCTTCT^ 

CCTCCTCCTCCTCCTTCTCCrrCTTCCrCCCCCTTCTCTTCCTCCTCTTCTTCC^ 
TCCTCCTCTTCCTCrrCCrCTTCCTCTTCTTCACCTTCTTTTTGGTACTGGAA 

6 5 AGGATTTTTTTTTTAAAGAAAAATTACTTCTTCIAAG 

TAAAACAAGCTGAGAACAGTAGGC^GTGTTTAAGAATGACTGCGGAGTCACGC^ 
CTCCATTCAATTCAGACATCTGCAGCTGCrCCAGCCACTTATACA 
AACATAATTTAC^TCACACCTAGATTGCITATGAATCTAGCTAACAGAA 
AGGGAATG^TAATAAAAAAAAATAAAAATAAAAGAAGTCCACATATATGTTCA^ 

GCAGAGGAGCCTCTACTTTCATTGAGAGAGTTCTTGGACTAAAATC 
TTAACAAGAATGGGTAAGCAAGCTGTCTGTGAAAATGTCCCAGCGTGC^ 
AGCTGATCACTTCATGCCTGTGTTTGTAAACATGCACTAGTGCG 
CGAATCTCACATCCACACATGTTTGGGGCTCTGTGGM 

7 5 TCCAGCTTACAGCCCAGCCTCTATTAGCCTATATTGGACAGTCTATC 






WO 03/008583 



PCT/US01/51291



AGCCAGCGAGCCAGGCTACATGGGTCCCCACGAGTC 

GCCCCAGCACGTTGCCTCTGGCTTCCTAAGCCAAGGAGGAAACACTCAG<1R 
AGTGAATTGGAGGCAAATTATCTGACTCATAGTACTCCATGC^^ 
GTACCACCTCCCCAACACCTGATCCCAGGAGGCTCAGGTCCAGAGAATC^^ 
5 TCTAATCCAGTCACTTTATGCAAAGGAAGGAGTTGTATGGAATTCATCaSCAT 
GAGGGACTGTCK3GAGGTAGAAAGGACTCTTGTTTGATTACA^ 
ATGTCTGTATGGAATATATTCAGCCTCTATCAGTTTCTGCCTCTTATTAATGAG 
CTTTGTCTTTTTAAACGATGTCTGCAGAACCTC 

GTCCCCTTCCCTCTTCITTCCTAGGTCAGTGATTGATCAGTTTTGATGAGCCT 
1 0 GACCCCTAAGGGACATTTAATGAGGTCTGCAGATGTTTT 

ATAAGGCCCAAGATCTGCTAAACACCCTATTATCTGCA 

GTGCCAGGACTGGAGAAAGCCGGACTTGACCCAACACIT^ 

CTTTTCCTTCCTTGAAAAAAAAATCCGCGAAAATTTTAATTCATGGTATTCAC^ 

GAAGTTCAAGATCACCCTTGGCTTTGTGTrGAjGTTCCAGGCCAGCCT^ 
1 5 CAAACAAAAAGAAATAGTAAATGAATAAGAACGTGTCC^GAAGCCCAGACGAATGAGCTTTCGTGATATCTCCCCCCCC 

AG CTGC C CAGTGGT ACCCT CAGTGG ACAATTC CTG CCTGG C AGGT AGG CACT CACTCACT C CTT AGATCC CTGCTT AAATGTT CAA 

CTCTAGGCGCTTAGCACTGTTTAACGGCTCTAGGCTGTAGACCTTACCATCTTC^ 

ACACTTAAGAGCACCCATTTCCTTCAGGATGCCTITCCAT 

GACAGCAACCCCCACCTTCCTCATGGTGACAAT 
2 0 AGAACAACTTCCCTCTGTGCCTGTCGGCTCTCTGCCGCTTTTGCCC^ 

ATAAACGCTGGATACTGGTTAGGTTATGCCTTCCCTGGGTCAGAACAAGCTGGCTCAAGTCTC 

AAAGCCTCCTAGCCTACCCTTCCTGCCCAACCAGCCTCCTGGAGCTCTGAGGGCCCGAAAGGGTTATA^ 
TTTAGATTAGCAATGGGAGAGAGGAAATTAAATATGGACTGTAATATGAAG 

2 5 TAGCCAGTCCAACTCTTTTGTCCAACCGTTAATAGCATGCAAATATTTATGAATCATACATGCCCCTTC^ 

TT CACAAT AAACAGTG CGTTTCG ACAATGGGG CAGAAT AATCC CTTTG T C CCGT AGCAG AG CTGTG CAGG CTGG CAC CCGG CAG CA 
TCGCATTCTrCCCGAGTGGTAACTGCTGTCTCCACCCAAGCTTAGGGGAGGTTT 
ACGGTCCTCCAGGCAGTGCTCCAGGCCACCCCGTGTTACTAGAAGCrATTCTTATCT 
CTCACACTTGGGTCTTCTCTGGTGGACACAGAGAGTTGATGTGCACTGAGGTTTGC^ 

3 0 TTCTGGCCAAATTCCTGGACTCCGGGTCACACTAAAGCCACCTGATTCCTCTC 

TCAGAATGATTCGGCX^TCTTTGGAAGGCGGTGTGAGGGAGGCTGAGAAGTGCTCAGGACGC 

TTTCCTCCTGTGTAGTGGGCATGGTGGTGCCITTCACTCAGGCCTCACAGGGTTGTTGT 

TAATAATAATAATAATAATAGCAAGATGGTGTACTTATGAGGAGACTTAGGAGACGAGGCATC^ 

GG ATTGTT GG AAG AG AAGGG AGG AG ATGACAGGAAG TT AGGG AAGG AGGT G CCC CT C CT ACTGC CT CTCAT CCACG ACAGGGG ACT 

3 5 TTTATCCCTTTGTTCTGCTAAGGACCTGATAAAACGGAGGCGTCACAATTACTCATAAATCAATGCATGCATAACAGTAAT 

ACTAATCATTTTCAGGGCCCAGTCTGGGCCCTGTGTGTTGTGCATATGCAA 
GTCTCCTAAAGGCATTTAGGGAGAGCATTTCATTCATTCATTCATTC^ 
AAACAGAGTGAACACTGTTCAGCCCAGGCTGGCCTGGAATTCACAGTGATCCTCC^ 
TTTCTTGATTGATCATCGCACATCTGCTGTGCACCTATTGTGTCGGCTAG 

4 0 CAAGGTCAAGGGTT CATGAGGGGCGACAGTGAAGCAGGC CTTCACTGTTCrGTGTAGACGTCT CTGTGACAAAGGACAGGTGCAAC 

GCAACAGAGTGTGCATTTGGCGGGGGTGTGGGGGCGCGACTTGTCTGTCGAGGAACCAAGAGATACAA 
GCCTTGGCATACAGGGGGCTGAGGATGCTTTAAGGGCCTTGACATCCATCCT 

AGCCAACAGATCCTTGGCAGAGACGTACATCGACACACAGGTGTGGTGCACAAACTTAGACTGCAGTGTGCAG^ 

4 5 TTTGGGGGTTCTGGGGTGCTCAGAGACCAGTCTTACTAGTGTCTCCTCTCAGGCAGCCT 

TAGCCCCATTTAGCATTAGCGCCCACCTCAGGCTGGTGTCCAGCTTTCCTCTC^ 

CITTCAGAGCAGGAAAGGTTCAAGAAGATGCTGAGCAGGAACTGCAGACCCCAA 

ACCCAGACTCCCTGCTGCTCCTGCCCAGGTTGCTGTGGGCATGATGCTCT 

CTTGGG AAGCCACCTTAGG CTTTTTCCCTGCATGG CCCCTACACAAGGCAG ATAGACCAGCACAGCCTGGGGAATCTGGGCATCTA 

5 0 TCAACG AAAGAAGT CG CTGAGTGGAAGG ACAG AAG AG CACG CGG CCTT AATCT AAAG AG ACT CTTGTG CAAGT CTGGGCT AAAAAC 

AGGCTAGTCATCTGAGATAATGTGGGGCTAGGATTCCAAGTGAGTAAGACAGGAGAGAACATGCA.GCGTGAAGAAAGA 
TCCCTGGGACrCTCTT^TGCTTTGGTGGGGTGGGCAAGCGATATAAGTG 

AATCT ATAGACATCTCCTTACCTTATTAAAATGGG CACAGGT ATAGCATTTCTGTGCAATC CCCTCTCTCTCCTCTT 
GTTTGTCTCATCTCAGAATX3ACTTT ACCCTACCTGTCGCACAGCAG ATGC CCTGTATCTTTG AGCGGTGGTGGG C 

5 5 CGGACCCTCAGAGACAGAGACAGGACCCACTGCGTTCATTTTTTCCCT 

CTT CC CAGG CAGGGGCAGC CAT CX5CTGAAAGTCGG CAG AACTCGTTGATTCACTT^ 
TGTGCAGGTGTGTATGTGGAAGCAGCAGCTGCTGTCCACACTCTTTCATCACT 
GACTGAACCTGGAGTTCACCATTTCAGTTGGCAGCCCCCCTCCCCCAAATCCAT^ 
CCrACTCCCCACTGGCATTTCACACAAGTGCTGGGCXS^ 

6 0 CATCTCCCAGCCCATCAGCCCTCCTTTTTTTGTTTTGTTTTTGTTGTTGTTGT^ 

TTTATTTTGTTTGGGTTTGGTTITTGGAGCriTCTCTGTCT 

AAACTCACAGAGATCTGCTTGCCTCTGCCTCCCTAGTGCTGGGATTAAAGTGGGGTATAAGCGC 

ACATTAGAACAAAAGACACAAATAGGGTCCAAGCCCAGCCCCTGAGCTCTACTGAGTCACT 

ACATGGCrTTCCCATTCTCCAAGTGGATGGCTCACrCTCAAACTG 

6 5 GAACTGAGTTCrrt3TGCTTCAAGTTTTCAGTTAGGCT(TT^ 

CTACATGGACACAGACAGGACAGGGTC!AGGCCAGCTGGACAC^ 

AAGTCCAATGGGTTGTGTCCTGCCTATAAGTCCCTAAGTAAATTATGCAACTCTACCGTCAACAGATCATC 
GGAAGGGAAATGGAACGGACACAC<3TGTAAGTGCCGGCTGTTAGGAAGCACCGGTGGGAA 
GCTGGGAATATCCTGCAAACACAGAAGTTATCCCTGGAATCTGGGGGAGAGCA 
70 GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGGTTAGGCTTGCACATGTGTACACATGTC 
TGTGGGAACCAGTCTCCCCTTTTTCAGTATAGGTGCCAGGGACTt^ 
AGCCCTGTCGCTC^CCTGC^TGCGTGCATTTCCATGCrCAGAGTTA^ 
TnTAACCTTGGAGAATCTTCGGGGATTTCCTGAGTTTGTGAAGAGCAGAC 
AAAGGGAACATTTTAAAAAGAAAGTTTCCTTCCAC'l^ 

7 5 ACACAAAGAGAGGGATGTTACCGGTACTGCTTAAGGAAAATGGCTACCCCACATCCCCCGGGCA^ 
CTTCCTTTTCGTGACAAAGGGCAAGTTT

CAAGGAGAATGAGAGTTACGGAGCCACTGAGAACTGGGGGTGGGGGGTGGGGA 
CGGGCTTCCAGCAAGAGACTGGAGGCCAACGGGCTTGGAGGGCGG 
GCAAGCAGTGATCTTCAGCCACCrATCTTCTTGCTTACTTTTT 
5 TATAGTATCTTOATCCTGCAGTTTAGTCATTTGCTACA^ 

GCTGATTCTGTATTAGACCCCAGGGCTGTCAGTGTCCAC CACTGAGTCCT CCCCTG AGGCCCTGGGACTGCCTGGCCTTGCAAAGC 
ATGCCXICACAGCCCCAGCAAGTGTCCAGGAAGGCTCACCCCGTGACClT L ' l ' ri 'CCAACTAA^ 
CCCTTTGGATGGGCAGACTGGCGTCGAGGAGGACGCCACATGTTG 
CATGCCTTAAAGGGTGGCCAGCTGGTGATGCAGCCATGAACCGCCAAGTTCAAG 
1 0 TGTTGAGCGTTCCTCCTCTTCCCCAATTCCTCCCCCATCAGGTGTCCTTGGGCAGCT 
CAGAAGAGAAAGCTAGGGCTGTGAGACAGTAACAAAGTTGCCCTTGAGTATGACTC 
TATACTGGGTTGCTGAGCTCTCACACTTAGCCCTCTCCCTATGTACTTCCAGCT 

TGCTTGGCTGAGTCCAGACTTCCCCCTTCTTCCCCTGGTCT'lTCTTl' 1 1'IGATGCTCCTTCCTCTTGCCTTCAGCCCTCTTGTCAC 

TCCTGTCAAATATCAGTCCTCTGGGATCACAGCCATCCACrGAA^ 
1 5 ATTTCGAGGCTTCCCCCTCTCTGTCGGCCCTCTCACT 

CAAAAGCCTCTGCCTCCAGCCAATGCCCCTGATCAGCACCAAAGTGATCT 

TCTCCCATCAGG CAACAGACTAAAACATGAAGTCTTGG ACAGCCCAGTCTCT 

GCGAGTGTTATTTTGCTGACCGGTACTCCTCTAGGTCGGTCCCTTGTGAGTCTCT 

AAAAACACTTGAGAGTGTTTCACTCTCCACGCCCATGTACTTAAAATACCAAGCCCA 
2 0 ACAACACATGTGCGCTGGAGGCCCTAGCTCCCACGTCTC 

TGGGTGGAGAGCAGAGCTTGCAGCTTCTGTAATTCTGAGCACCGAGATGTGCCTCT 

CCTAAACCCAGCTGGAAAAATACAAACAAGAATGGAGGTCAGGTTAGAACCAGGGTGAC^ 

TAAGCAGATGTCCTTGTACTCAAAGCTCTTACCAGGACTGGGGAGGTGGCTCAGT^ 

CCCGAGTTCTATCCCTAGAGCCCAGGTTTTAAATTTTTAAATTTGTAATG 

2 5 GACACATCCCTGGGGCTCACTGCTCAGTGGCCTAGCTGAATCCATGAGTTCCAG^ 

TGGGCAGAGCTTGAGGAATGATGTCTAAGGCTGACCTGTGGCCTCCACATAAAACTGCACATC 
ACACACATGCACACACGTCAGCTGCAACCCGAGCCCATCGTGGGGATTGCCACCAGTTTA 
GACATGCAGTCTCCCTAATCTCCCTCCCTGGTGGGAATCACTAAGTGGAATTCCTCGTATTGAC^ 
TCATAGGAACCATGGTGCTGAGCAGTGGCCATACCAGTTGGGGGTCACAGCTGTATT^ 

3 0 AGCCGACACTCTCACAGTCTTACACCTTCTGGAAGCGCCAGTCAGCT 

GCAGCTGCCTAGAACAGCTTGCACATTAGAGTCACAGGGGCAATTTTCACTT^ 

GGGTGAGTGGAGGCTCCTGAGTTTAGATTTTGCACAGGTGGCCATTTATACGCCAATGGTGGAAGCAC 

GTTTAGGGTCATTCCt: i L"iGTGGTTAAGTGAGGTGGTTAATAri r rATGTCTGTCTGTCTGTCTGTCTG'rATTATGTATTTGTGTb 

* 35 AAACTTGGGTTG TC C CG TTTGGC AGT AAG TCC CTCTG C C CACTG AG C CAT CTTG C CAACCTGAG CTG AGTGATTTTTGAGG AT GCA 

AAGT(^CCTGTTTAAGTCTTTGCCTTA(^CTCTTTTAAGAA 

CCAAATGAAGTAACTTCCAT CGGAATGTGTCGGAAGCATTGG AAGGTCTG C^T ACACCTAAAC ATGTCTACATTCTG AAAACG^ 
TTCCTCATTGTAGGCAGAGTTATGAAAGGAGCCCTGGTGAGAATTGAGCT 

AGGATGCTCTGTATTTGTGCCAGGATGAAGAATGTGCCTGGATGGTCTCCACCCACCCAGGATCTCTGAACA 
40 AAATTCGCCGATGCTGTGAGGTTAAGGCCATGCATGAAGACCTTCATGTGGAGACCATATGTGTTAGCCTTCATGTGGAGCTCACC 

TATGATGGCCTTCATGTGGAGCTCATGCATGACAGCTTCTAAGGATGGGGAGATT 

CAAAGGTAGAAGCAGTAATTGGGGCACTGAGACCACAGCGCCTCCAGAAGGTCAATACTGATGAG 

GGAGAAGGAAGGAGCCAACCACCCCCATTGCCACCTAAGGTGTCTTACAGGCACCTTCATCTT^ 

AGACTGAGTCTGTGTGCATTAAGCCATAGCACTGTATTACATCAGCCCC^UVGATTGCT^G 
45 CTTCCGAGGCTGAAAGCAACAGATGACATGATTCTGTGTGTGGTGACAGTGTGCTAGATAGTCACAG^ 

AGACCTTCATGACGCATTTCAGGCAAGCCATGGCTGAAGCCAGCATTGGA^ 

TGTTACCGAGCTACAAGCTCCCATGTAGGCATCTTACTGC 

CAAGTAGCTACTCTTGACCTGGGACCCAGATTGGCAAAAATGTGGGAGAATCCAACT 
TCTTATTGTCTGTTCACTCTCCTCACACTCGCGCATGTACAC^ 
5 0 ACACACACACACACACCAAGGGCCAGCTCAGGACCAAGCTATGTACCTATCAGTCACTGGTC 
CCCrTTTGGCTTTACAATCTTCCATTGCCTGAGAGCCrrCTTCCTGTTCTCC^ 
TCGGCCCTGCCCCCACCCTCACCCCCACCCCCAGGCAGATTAAGCTTGGACCATCAC 
ATCAAGGTGAAAAGTGATTTATCGGGCCAATCGGAACTAGTCACAAGCAGACTC^ 
CAGTGGGAGACTCACTGGGCCTTGAAGTTGGGAAGACTTGAGAGAGTGAATCTGGGTGGCTGGA 

5 5 TC CCAG AG TCTGGAGG CACT ACTT AAAGAG CTGAG CTCATGAGCTG CAG AAAG AGGGACGGGTT CTAGGGAGAAGATTCT GG AAAC 

TGGCTCTAGGATTCCATGTTGATGATACCAAGGTCTCGCCCGTCCATCATGGGTGCTTGGTGAGA 

GTGCAGGGTGCTTACCGTACAGCAGCAGGCTCGGGACT 

GTTACACATCAGCACACCATTCTTAACATGCCCACCCTCAGGCTCT 

AACCATATGGAATATCCCCTGATGGTGTATCCGCAGGTGTCTGATGCAAGTGCCAGAGCCTGGCAGCCTTCTT 

6 0 GGGAAGGATTCAGGTACAAAGTCTCTTGGGCCGTAGAACCCCAGTCTCCAGAAACCACTCCT 

ATGCACAAGAGG CCTCCTCCTGAACCTTG CCTGACACT ATTGTTCT CTTTGGCAG AG ACTAGCAGACAGACTTCAGAGGCTCTGGG 
GTCTTTGGGAAACTGTACTCTTGATAGGGAGTG CTGCTTCCT CTGGTTATAGGTTGTATTCTTGGGAAATGTCTCAGTGG CCTTCC 
ACTTTCAGCTGGAAGTGGTATGGTTTTTCTAAGGAGGTTAGGC 
TTTGTTCCCATTTCCCTTGAGTTTCATTGTGTACTACGTAAGAGGAGAGATTGTACT 

6 5 CTTAATATTGCACCCGGCCAAATACGGGCATTCCATACAGAGCTCTGAGACGGGGAGAGGTGC 

CCCCCATGGGCAGACACCATTACAATGCTTTACCTCATTTATCTTr 

T ACCAATTTTCAGG CAAG AACACCG AGCCTGAACAAGAACACAGAGAAGTCT AGATCT AGTGGCTTC CATCTGGAAT 
TTAAACCAGAATCCACCCCCTGAAGCCnxSCTCTGTCTGTCCTGTCT^ 

CATCT AGT CATTGGGTATTT AT AAAT ACT ATTT AG CATCAAG CAT CAATTT CT AATAGGTACAG AG AAT AACAT CT AATTT AG CAT 
70 GTGCTCTGTTAATAAAAAGATAGAGAATAAGAATATGTTCTATGGAGGTGTGG 

GAAGTACCAGCCACATGACGGTATAACATAGAATAGAGTTTATTCAGGCCATGG^ 
ACGCAGAGAGAGGGAGAGAGTAGAGAAGTAGAGGCTGACCATGACCACGTGGAAAGAGGG^ 
AG<^AAGAGCCCAAGAGGGCAAAAGAGAAGCAAGGGAGCATGAGAACAAGA 
GGTCAGGCCTACCTGGCTATTGTCAGGTAACTGTGGGGTGGA^ 

7 5 AGTGTTTGTTGATTTGGTTGGCCTGCTCATTTGTTTTGAGACAGACT 
TGAGGGTGACTTTGAACTTTTGATCCTCCrGCCTC^

CAATGCTGGCATAGATCCTTGGGCTTCATGCTTGCTACACCACCACCCT 

TCTCTAATCCTOSGAAGTACAGAGTAAAGAGAGAAATC 

AGGGAAGACAGAGXKjAGGTTAGCAGTAGATAGGTATGAGTCCCAGCTGTCAATTC 
5 CTCCTGGATCTTATTGGTAATTAATGATTGACTTATAGCTATGTGACTATGC^ 
TACAGATCAGAAATAACTGGTGAACCCACTGTGTGCATTACATGGTGAACCC^ 
GTGTACCAGGAGAACAAGTCCTCCTGCCCCAGCTCTGTTGCCTTTCAAG 
GAGGCTCAGACCACCCCTC^TATGTTCTGTTCTGTTTCCTGTCTCCACAGG 
AATCCGCCACAAGTTGCCACCTACCATAGAGCCATCAAAATCACAGT^ 

AGCTTTGGCTTAC^CGTCAAAGGGAAATTAGGCCAACT 

GAGT AG ACCCCCACAT AT ACAG CAATG TTCAG TGTG CT CAGACACAAG CAGTGGAT AG ACAATTGG C CAAG T ACAGGGTGTGAC AA 

TGAAGACCTGAAGGTCACGGTGGACAAGAAGAGATGGCTTACCATC 
TGAAAATCATCCTCTCTTTTTCCCTTGAAGGGATCTTATTCrAGGATTC

GGAAACCAGTAAATATAGAAAGATAGCCTCCAAAATACCCCGAAACTCCTGTGAGGAGAGTGTTTAAA 

GTCTGAAGGAAAGATGTCTTCAGAGTGACTTAAGGACTATCTTGGAGTAAACACAGGAGTGACACT 

CCCAGGAGCTGCAGCTGGGCCACAGTCCAATTCTGTAGGCAGATGAGCCCGGAACGAAGAGCAGCT 

GGCCCCTGGGAAGTGATACCCTCAGAGGCTGGGATGCGAAAACATGGGGTCCCTCCTGCA 
ATTTGATAAGAATGAAGGTACTGAGAACAGGGGCCCTAGAACAGTCTCAGAGCCT

TGT AAAGGTGACAGCCGCATTGTGAAACTTAACT AGATGAAG CCAGGTGCCCGGTG CAGGGCATGCGCTTACAG AATX3TCAGGAAT 

CTCCXsTGGAGGTTCCTTGCCTCGACCCCTCGTTTCAGTGTGGTGGTTCCATTC 

CTGTCCACACAGGCATCTCATCTGGGCTCTGAGGTAGTAGCCCTTGGGAATTTGGCTCT 

TATGGCAGCGTCTCCTGCAATCATGGTCATGACTTCTGGTATCTGGACCACCCTTGGAA 

CAATTTTCCCTGAAGCAGTAATACACAGAGAACAGCTGCAGTCACCAGC^

CCCTCCCCTCCCTGGGCTGGGCAGCTCCCTCCACTCTGCCCACCCTGTTCCCATTTTTATTCC 
CCTTGTTAACGTCCTTGAGATTTCCTAGCTGTGGTCAGGTTCAGGGAACTCTGGGGC 
AGAGTGTGAACAGCAGACCAAGGTAGCACACAGTCCTCACACAGCCCAGGAAACCTCTTGGCACTCCCT 
AGTCATGACAGAACACCCGAGGGCCATGGCCCAGTGCAGGGGACTGGAGAGGAAGGAAGCCGAGG 

GGAGAGGGGGCGAGGTGTAGGAAGACTGATGCTGAGAGTGACCGTGCTCATAGG

GCAGCTGCTGCCTCCAGGACAGATTATACAATGGCTGTGCACGAGGAAACAGCTGAAAATGGGGACCCATGW 
TCCTTATTCTTCCACACCTAAGAGAGACAGGCTTTCACCTTGTGTGTATTTTTTTT 
GCTGAGCCTTAGAAAAAGCAAACGGCAGGGGAAAGGGAAACGGCCCCCATATTGGGGCCCCTGCTTC 
GTACTCAGATAAAGCACrrcCCGCCTCTGCTATTTTATAACTTGTAAGAACAGTGGA^ 

CCTATTCTTTTTT ATTT ATTTTTGTGCTTTGAGTTG AGAGACAGGCTCTCACT CT ATGTA

AGCCAAGCTGACTTTGAACTCCTGCCTCCCCAGACCCTGAGTGCTGGGGTTACAGCGTGTGCCACCACATCTCATACCGGTTCT 
TAGCACACGGATCATTTCCACTGGCACTCTGCCCATCACCACTTTTGGATGCAGT 

TGGCCAGACTGGCACAGCCGTCGAGATTTAGGAGCGCTGTCCCACCCCTCATTAAGGAAGGATCATAGATT 
CCAAGAACCCTAGTGTTTGCCACACTGAGCAACATCGCTCAGT^ 

TCTGATCCTTCCAAGAAAAGGAAAATCAGCCCTAGAGAAGGGAAGCAACCTTAGGTTTAAAG

GGGGGAGTTTTTCACTGTCTGACTTTGGTGTCTG ACTTTGGAACACACATCTGTC CATCTGT CCATCCATAC CATGCTGTCTGCAC 
TT CACATCCT ATGAGTTCTCTCTCCTG CCTTTGG AATTTTTGTGTTTAATTGTGG ACTGGCTTTCATGATTTCTCATGTGGG ACCC 
AG CTCTGAGGCTGCCCCATAGGGCACAAGGTAACCTCGACG CTGACATGGTTGTCAGG ACTG AAGCTGTATCGTAGTAGGTTGTGT 
G CTCCAGGGACTCCCCAGCCTT1TGCAGTTCAGATGGTTACTGGCCGGTC CCAGAAACCCTCTGCCC CCG AGTAAGTCAAAACTTG 

CTTTTCACACAGTCCATGGGATCCTTCTACATCTCCAGCCAATACAGAAATCTGTGAGCT

AAGAGTGCCACOSGAAGGTTTTTGTTTTCAGGCAGTAAATAGACCATTTGATGACTTGTTAACATACTGG 

AGATATACTAAAAGGGTGGTGGTATATGTGTGTGAGGGGTACATATGGGTGTGTGTGTAGTATAAGTGTGTGTGTTTAT 

TGTGGAGGTCAGGGGTCAGAGGTCAGCATCAACCATCATCGTCTTGAACTCTACTTGTTCT^ 

TGT AGCTCATCTG CTGAG CGATGCTAACT AGCCAGTAAG CTGCAAGG ACCTGTATGTGTCTATTCTCCAACACTGGGGTT AGTGCA 

GACATACCAAATCACCCAGCCAATTACACTCGAGGTAGGGATGGGGACATGAGTCC^

TGAGCAATCTCCTTAGCACCAGCCCCTGCTCCATAAAAGGGTTTTCTCTTCACAAACAGAATT^ 

TGAT ATTTCTAGAAACATGTAAAACTCCATCTCTCAACC CACTCGTG ATGAGTCTTGGGAAGTCTGG TACCTGTG ACTTGGGGAAT 
AG ACTT CATTAAAATT CAAAAAGGTCAAG AAACCAATGCTGG CTGTGTCTGCGACAGCTTCTGC CCT CTGGT AAAG CTGT C CTTGG 
CAGTCACTCTTCAAGCTCTGAGATTCACAGTGACGAGAACAAGATAGAGCT 

TCAGTGT CTT AGTTGGGGTTT ACTG CTGTGAACAGACACCATGACCIAAAAGG ACATTTAAT AGGGGCTGGCTTACAG CTTAGGTCC

AAGCCTCAAATCCTTTGCATGACCCITTCAGTCCTGGGCTGTCAACTGCAACTGAGGCT 
CTCTCACAGTGCCAAACCTCTGCTGCTCTTCATGCCCCCTTCATG^ 
GAAGTACCACTGCAGCATGAGGCACAACCTTGGCTA^ 
TTCACCTCAGTGATGCTGGTCrCTTCTTAATCACrGCTAATTTTGT^ 

CTATTCTTGACTCTAAAGCITGAGCCCAAACTGCTGAGTTCTGCTGCTTGCT

T ATCAGTGCT G GGATT AAAGGTGTGT ACC AC CACAC CTGG ACTT AAG CTTTTCTT CACCAAG AG CTTGCT CTGGT CC CAGGCTGG C 
CTTGAACTCT GAG ATCTTCTTGGCTTT AT CT C CAAGGAT AG CCC CATGCCTGGAT CT AAATTTAG CTGAG TAGGATCTTG CC C CAA 
AGTCCTACTCCCTTAATCCGATATCTCCTAGAACATGGGAGTTGGCTCCATC 

AT ATTTT AC ATTTTTC CTTTCT AAG CTTGCT ATG CTTG CTT AAAATGCTCTTCATGAG ACT CAAC CAG AG AACAAAGTCT CTGCTG 

GGTGTTT CTGX3ACTTCCrTTGCCATTG CAATTAATCTGAGTCTCTT CTCCTTAGCCTCAGGCAGACTCTTCTGACAAGGG CT AAAG

CCACCCATATTCTTCATCAAAATACCATAAAAACAGCCTCCAGGCCATGTACT 
GTCTGCACAGTTCAAATCACTCTCAGCAAAGAAGTCTTCCATATT^ 
CTGCTTTCCAAATCCAAAGTCCCAAAATCTACATTTTTTCCCTAACAAAA 
TC^TACCAACTTTTATCTTAGGCTTTTACTGCTC 

GCACTTTCGAGGCAGAGGCAGGCAGATTTCTGAGTTCGAGGCC

AGAGAAACCCTGTCTGGAAAAAAACAACAAAAGAGGAAAACA 

ATTTAATTGX3AACTGGCTT ATAGGTTCAGAGGCAGTC CATTATCATCAAGCCAGGAACATGGCAG CATCCAGGCAGACATGGTGCA 
GGAGGAGCrrGAGTTCTACATCTTCTTCTGAAGGCTATTAA 
CACCCACAGTGACACACCTACTCTGACIW3CCACACCTTCTAATAG 
CACCAACTGATCGTTTCTAGATGAAAGTATCCTTAGAATGGATCAACTCAA
TTGAAGTCGTCCTGGGAGTAAATGCACX3TGCTTCTGTATGTC
TTTATGCAGGTGTGCCTACAGTGTGTTCCCACGTGGAAGCTAGAGGTTGA 

TTTCGTTTGTCTG AGTCAGGGTCTCT CACTGAACCTGGAACCTACTGAG CTTGAAATTCTGG CTAGATTGGCTGATAGTGAGTCTC 
CAGGATTCTTGTATGTTGACCTCACCAGTGCTAGGGTTATAGGTATGCGCTGCCAC^ 
CTAAATTCAGTTCCTCATACTrGTGCAACAAACACCTO

TTTGAAGGGACATAGCAGACAATAAAGAAACAAACAAACA^ 

TG T AT ATT AAACAGAACCTTT AT ATTTGATGAAT AATTT CT AATGAAAAAGCAAAATG CATGAAAGACAT CTGT AAACATT CGGTG 

TTTTTCAAGATATTCTTTTCTCTGGATAAGAAGAGT^ 
CTTTCAAAGTGCTGCAGGCATATTAATTTATAAAATCTTGGGAGAA^

ACCCAAGGTCACCCAGCTAATCAGCCAGAATGGAAGCTGTGGAATCCAGATGTTGC^ 

ATTTGGGCCTCGCTGCATTGTGATGCTGTATCTTTGATAAG^ 

CTTTGAAGCTGCCCCAGACGTTTGACTGGAGTCAGTCT^ 

CCAATGCTGCCTGCTCCCACGTGTGCTAACGATTGGCTCTGAGGGTCACTC 
AACCAGGGATAATGCAAAAACCTATATGTTTCCCAAAGAGAA

TAAACGGGGATCCCCTTTAGCAAAGGGTCCCAGGT AAAG CTTGG ACACT CAGTGTGTGCTCTTGGGGTTGCCATGG CTAGTCATGG 

TGCTCACAGGGAACCGTGCCACGTGGCTCCCAGGTCCTTGGAGTGTTCAGATATCAGGCGTGTC 

GGTGCATAAAGGAAAATGGAGAGrTTTTCCTTCCrTCCrrC 

GTATGCCTGCTCTTCCCCTCTGAAGGCATTTCAGGTTAAGCAGAAGATTC^ 
GCTAATCTCTCTAACTACCT AGAGACTTAGGGAAG CA U 1 ' 1 ' CTl ' A AAAGCATTAATAATAACTGTGCCTAAGTGTTTCTAAGGAATG

TTAGAATCTCTTCCTGTTTATTATTTTAGTTATTGGACACT 

GCTAAGGCCTAGAAGTTTCTCTGGGGCAACTGAGGATGTGGGAG 

ATGACACTACCCATGAACAGCAGT CCAT CATGACACTGCCTGTGAACAAGAGTCTATCATGACACTACCCATG AACAGCAGTCCA 
CATG ACACTGCCT ATGAACAAG AG TCT ATCATGACACTAC C CATG AACAG CAGTCCAT CATGACACTGC CTGTG AACAAG AGTCT A 

TCATGACACTACCCATGAACAGCAGTCCATCATGACACTGCCTGTGAACAAGAGTCT

AT CATGACACTGC CTGTGAACAAGAGT CT ATCATGACACTAC CCATGAACAG CAGT C CAT CATG ACACTGC CTG TGAACAAG AGTC 
TATCATGACACTACCCATGAACAGCAGTCCATCATGACACTGCCT^ 
CCATCATGACACTACCCATGAACAGCAGTCCATC^TGACACTACCCCATC^GCAACAGTT 
GCCTACCTTGATACTACCCCAGTGAGTTTTTAGTCAGTGCCCCCCT 

CTGAAGCAGGTTCTTCCCAGGTACCAGGAAGACTATACAGGCATAAGCAAGAC^

GAAACAT AT AATGAATGGAAG CT AAAC C ATrTCOTG AATG T ATTT CTITG AGTT AGAGCACTG CAAAATCAT ATTGCT ACTTAG A 
ATACCATCCCAGTGAAACTGGGGTCTACGCTGATACACTGTCCTTGTTAGGAATGATTGTACG 
CCCAACAATGTAATACTGTCTCTGGTGAGGCTTCCGTTCTTAATCT 
CCTAGGCTTTCACCCnGGCGCCATCGGGCCrGTGGGTGTGAACTTGGGCAATTAT 

TGGGTCAAACTAGGCAATGAGACGAGATAATCTCTAAATTAGTTCCCACTCGAGCTGGGGTAACAA

GGTATAGCTTCTTCACAAAC^GGGCCCCTTTACAGGCCACAACCCrGAGCTGAGGTGA 
CCACAGTGGTTTTCAGATTCTCCCTCTCCCTGTCCCTCTCTCTTCCCCTCCCTCCTCCTTCCCCTCT 
TTCCCTTCCCCTACCCTCTCTCTGTTCTTGAATGGGCTTTGTACTTCATCTTCAGTTTCTCrrAGCT 
CCGAACAGCCCTGATCTGATGCCTCTGCCACTCAGCTTGGGATGCT 

CCTGGCTCTCTGTCCAACAGAGACAATGGCCATCCATATCAGTGTCATC

TTAGAAGACGTCGX3CTATTGTTTGTATTTACCTTGGAGTCTTC 

TGTGAATTCCTAAATTTAAAAATTCATGAAGTGGCCTTAATGTTGCGTTTTTTTGA 

TTTTTTATTTCAGATTTTrTTTTATTTAGGCCCAGTAA^ 

TCCTAAG TC C CTC ACGGTTCCAAAAG ACTTGT CC CCAAGTGT C G T ACTTTG TGAAC AACTTG AAG AAAACTT ATGTCCTGG T AACA 

AAGAAGCTCACACCTGACACTCGGGCCTCAGTGTATGCACGAGACAGCTGATGGAGAGTCTTC^

ATCCCCCCGCCCCCCATCTGCCCTTGCTGACCTTAGGGCATTTGTTCCAGGAACCAACACCTCAG^ 
CTGATATCAAATGGCAGATTATTTGTAGATAACTTGTGAGCATCTTCCTGTAGACTGTCAGT 
GAGAGAAGGCTGGATAAATGAATACCTGTGAGACTGTGTTGTTAGGGAAGTCC 
AGTATTTTCAGCCTGGATTTGGTTCAATCCGGTTGTATCA^ 

ATCATCACTCCAGGGTGCCTGTAGACTCTAGCAAGACCTGTACGCGGTGCAC^

AGGG CTCCCT C CCATTGTGTG T ATGAGAGAG AGAGGAG GGGG CTCCTG CT ACCTGTGG ATGTTG CTCTGGG AG ACTCAG CTG TT 
GTGCATCTTCCGTGGGGACAAATGACCTCCCAGCTAGCTtlAAGG 
CTACCTCTCCCCTTTCCCTTTCCGAGGCAATTTCAATTCTTTCAAGGCCT 
AGTTTTCCCGAGGGTTTAGGTTTTCCTGTAAAGCrGTGTGCTGGG 

ACCCC^CCACTC^CACACACAC^CACACACACACACACACACACACACACACACACAC^

ATCATATAAGTAAACTTCATGCTCCTCATTTTAGATCTGCGTCTAAGTTCACTGTAGCTAATTTGAATCTGC 
TCTTGCGTCGGAAAACCTTTTGTAAGGTGTCCCGCCTTGGGGCTCT 
CTCCAATCTACCTGTGAGTGTTTCTGCTGTCCTCAGACCCGGGAAAGCACCGGCT 
CTCAGTTTTCGTCAAGGTCAATCTTAAGGTTTTCCCTGGGAGAACTTGCAAGTGTATCAAACTT^ 

AAGCCAAGGTTAGAAAAAATATTTAAGCACCCAAAGGGAAGTCCTTTTCCTGTGGGT

GCCCCCACGCCACCCTTAAAGCAACACAGTGGAATTAATCTGCAACTCCCCAGGCCCTT 

ATTTTGTGCTCCTAGGAATTCCATATGCCCAGGTTCCATTAAAGCAAGTATTGACTAAAACCCTCTCTG 

CAGCGTACTTGCCTGCCCTGAAATGCCGAACAGTGGGCATTTTATTTGCTTGCTGG 

AAGTCATATTCTAGTCATTTGAGG CAAAAATGT AAACATTTG AAAT AAGT CACITCGTAAGTTCAGGATGACTT AATGGTCCCTTT 

TGGAATTCTGACCCGAGTATCTCAGCCTTACTTGCCTGTGGGGTGGCTTC^

AGCAAACGTACAGGAGGGACATTTTTACTCCTGATCTGACCTTC 
GTTCGACAGGGAGTAAGAGAGGGAATTGGAGTCAGACCTCTGATACATACT 

TTCCAGATGCCCACCTTCCAAAGGGGAAGAT AGACCAGGTGGGCGCCATTGCTCT CAGCTCCCGAAC 

TGCACATAGCACCCTGTAACATTTAAAGCACAAACCAGGAACACAGGC!AAG^

AGGAATCACATGGAATTCTGCCGAAGTGTTCACCAAATATTTACCAATACCAAGTTATTTAAA 
TGGTTCCACAGGACTCAGCACTTGGAGCCTTTT 

ACCCAAGCCTT ACCAAGGCACCAC CTTGTGAAACCTGCATCGCGCCTG CCTCITCAGAGAACAGGCAGGTGGGTCCAGATAG AGGC 
TTTATCTGTCATATGTAGCTCTGTTTCCCAGCAGAACATTGGGCATCCATGTTGeTG 
AATTCTACTGGGTCAGCTATGAGTACCATGGGAGGGGAGTCTCTCTACTGTATAGGGTT^
AGTGTGGCTTATCAACTAAGTAAGAGAAATAGCCTATCCACTAAGTAAT

AAG CAACCTG AAAACCG ACTGCT AATGTTTT ACAGGT ATTGGG CAAG AT C CTT AGTGAT CTT AAGG AAAAT AATGATTGTT CT CTT 
AAAGATAATTTTGGTTTTGTTAGACTTGCTTGl-rriATA'I'r^GTG CATG AGTGTTTTG C CTG CTTGT ATGTTTG TG TACCACGTG 
CATGCCTGGTGCCCTGGAGGCCAGAAGAGGTCCTTCCATCCCITCAAACTGGAAT^ 
5 TGGGAACCAAACTCTGGTCTTCTAGAAGAGCAACACGTG<rrCTTAACTGCAGAACT 
CGTCTCGTCAGATTGGTGAGAATGCAGGAGAAGTCCGCATTTAAATTAGCCACCCCT 
TCCTCGTTTGCCTCCCTCCAGAAACTTTTTAAGGCACTTGGAAACA 

ATTCAl"ITTCrr IT CCAGGAC CAAATAG 'ITIT'I' CTT CTG CAT ACGGCT CT CATGGCGG CAAAACCAT AGG AGAGAACAG CGT AG C C 

TGCTGCTTCTGAGTCTATATTTCCTGTGTGTGTGTGTGTGTATGTGTGTGTGTGTGTGTGTGTGTGT 
GTATGTATGTAAGTATCTGTGTGTCTATGTATATCTGTGAGGGTGTATGGGTGAGAGAGAGTGTG^

GTCTCTGTATGTGTATTCATGTCTATGTGTGTCTGTGTATGTTTATGTGTGTCTGTC 

ATGTGTGTCTGTGTGTGTCTGTGTGTGCCTGTTTGTATGTGTATGAATGTGTG 

GTCTGTGTGTGAGTGTGTGTCTGTGTGTGTCTGTGAGTGTGTGTCTATGTGAGAGTGTC 

GTATGTGTATTTATGTCTGTGTGTGTACGTCTGTGAGTGTGTGT^^ 
CTGTTTGTGTATGTCTGTGTGTATCTGTGAGTGTATATGTCTGTTTGTGTATGTCTGTGTGTGTCT

ATGTGTATTTATGTCTGTGTGTGTGTTTGTGTATGTCTGTGAGTGTGTATGTCT 

GTTTGTCTGTGTCTGTGTGCCTGTTK3TATATGTGTGAGTATGTGTATAAATG 

AAACCCTTCAGCTACTTGATGCTGAGGCTCTGGAGGCC^ 
GAG AAT CTTTT CTG CTCTG AG AGTGATT CAG CCATT CTCAAAAT AAATAAGT AGCAG AGTG CAG AG AGGAAGAAG
GAAATCTGAGTCTGGAGGGrcGAGTTCTTCTCAGCAGGGCTCCrC^GGG^ 
GGCTCTGTGCAATTCrCAGCTTCTGCTGGAACTTCAGTTT^ 
TGATGAACTCAACGCTTGATTCAATGTTTCCAGGAGCAGTTGAACTT 
CTGAGGAGTGTGTCCAAAGGGGGATGGAGAAGAGATGGCCACTGGGGGTGTGGGGGG^ 

GGCCAGAGGGAGGATTOACCCATCCAGCCCAAGCTGCTGCAAAGGGACAGGAATAAATGCAGAGTG

ATCATCAAAACTATATTTCCCATGACTTTGGCAGCCTTAGAAAAAAAAAAAGAAGAAAGGGGAAAAAA^ 
TGCTCAATTGTCOGAGCCTAATGGCCAACTTCCCAAAGCT^ 

AAAAGATGGTGTGAGTTCTGTTGACTTAATTGCCAGAGCGTTGCTGAGTGAGGCTGCACCCCCCC 

TGTTTG ACT CATTCGGTGCTC CTTAACCCGAAGCCAAAGAAAACAAGG CAGAG AAT AGCTGGCTTGTATGGAGTCTGG AAAGCCCT

GCTCGGCCCTGCTTTCAAAGTCCTATACAGTAGCCTTACrATACCTCAGCTGCAG 




TTCTGGGCTGAACAGATACACTAAAAAGAAAGACTCAAAAACCCATTGGAAAAAG 

GGAGTTGGATCTCACCTGACCAGCTTCCCATAGGACAGTCCCTAG^

ACTCAG AGG CAG AAGGTTGTCTTGGGTTGAT AGGGAGGTAGCCACACTGCTCCCTACTC CTTG CCTGAG ATG AGGTCACCAGGGGG 

GCAAATGGGGGTGAGGGGTGGGGTGTGCTTCCAAATTTGG^ 

AGGTACATGGCCGCTTGCTGATGGGTACTTGTGACTGGTAGAQVAGATTCTC 

CCTCCTCCTCCCCAGTCACACACCCCAGTGG CTGAGAGACAGAGT CACCCCCAAAO^CTTG AAAGTAACCTTTTCTTGAGTGTGG 

CACTCAGCCAGTGCTTTGCTAAAACACAACTGTGAACTCACAGGTG

TTGAGGCCAGCCTAGAGCTCCAATTGGCTCITTTTTTCCTCTrCAGGTTTAGG 

TAAGGA.CGGAGGGAGAAAGATCGTTGTTGATGCTGCGAGCCGAGAACCCCAGAGAAGGACATCCAATC 

TTTCTCACTCCTCCGTCCAGCCAGGTTTTACAGGGACATC^CrrCAGCCrCCT 

ACATACTCAGCACCACTGCCAATCAC^TGTAGCATGGCAGGAA 

TGCTCAGACCCAGCGCTGAGCATGGAGATGGAGGTGGGGGCCTCTGAGTGT^

TTG C CCTGCTG AAGG CCCGAGTCCAGGTGCATTT CAGAG ACT AG AGAGCTCT 
GCTGTCTTCCCTGAAGTCTATCCGTGTCAGAGCTGCTCCGGTAGATGGATAAGAACAGTTAC^ 
TTACAGAGGGGGGGTCCTGAGACCGTGCTTTCAGGAGAGTTAGATAACAGTTGAGAT^ 
GTGTGAGTTCCCTGCAGTTCTCCCACAGGCAGCC!ACCCCCACGGAGGACCACTGCGGGTCAGGAGCAC^ 

CTGCTCTAAGCTACAATTACCTCCTTAGTCACCZAGACAAAGCAGGAGGTGAG

TCTCAGGACACC!AGGGTCCCATGGGAGCCACCACACCrrGAAGTCTGTCACrC^^ 
CTGCTTTTTTTTTCTCCCTGAATCCrTGGGGCTCAAAGAGGGAGAGAGGAGTC 
TCAGCCTGTTGTCTGTGACTGTGAGGCTTGGTCTGGATTTGGCCTCCGCT 
TGTGTCTGATTGAGAAATGTGAATGTGTATGTGTGCATGGGTGGATCACACACAGGTACACACA 
ATACATATATGTGTGACTTATATGTGTGTGTGATAGATATAATACATATATAG

TGTATGTACATATAACIATATATATGTGCTATCGTGTATATGTGTGTAGTAGACATGTGCTACATACATGTGTGTAGTATATATAAA 

CACATTACATATATATTACATATATGTATGATATGATAGAAGTATGTACTGTATACCAGCATGGTGAAGATATG 

TATATAGAGATGCACATGTGTGAGTATGTGGTACGTGTGTATGT^ 

TGTGTGGCATCAGCATGTGTGTGAACACACACACX3TGCATCAGAGAAAAGCAGAATCCACTCT 

CTGTGCTTGGCTCTGGACAAGGCTCATGCCAGTGTGCTGTGCTTGGCTCT^

AAGAATGGTTTGGTGACAGATTTGACrCTGACAGGACAGTTCTTCAGGGAG 
TATCTTCACAGCAGTCATACTATGTCCACACAGCAGTCA 

AITCTACTTTCCTCTAGTCTTGGGAAGGAAGGGTGGATGGTGCTGATCAAGAATATCTG 
TCAGCATTTAGCCTTAGCCGGATGTGTGCCCGGAGCCTTAGGTACTAAGGAGA 

CAGCAGGCACAGAC1ACACCACAGTGACACAATGGTGCTCTGGAAAA

CTTAGAATGCATAAAGCCTGGAGTTTAATCCCTAGCAATGTGTAA^ 
AGATCAGGAGTTCAAGGTCATCITACATAGTGGTTTCAAGACGAGCCT 
ACACACACACACACACACACAC^CACACACACACAGAGAGAGAGAGAGAGAGAGAGAGAGAG 
AGAGCTCTAAAGACTGTCTTAATGGTTAGGTTGGGGACAAGCTCTAA 

AGCCAAGGCATCCCATGAAGTTGTTAAGAAGCAGCAAGCTTAGAGCCATCCAGTCAGCCCCAGCCGAG

CGGCAGACGTAATTGCCATGGGTGCCCTTGGGAGGAATCCACCITAGTCCTTTAAT^ 
GAGTCTGCAGGAAGCACAGACTTAGAACAAGGGGTGCCAGATGGGTAGTGGTGT 

7CAGAGGCTG CATTG ACACCCTGGCCCTCAGGGGCACTCTG AG CTCTG ACCTGCT ACATGTCCTAAATAGAACAATAAGCTCTGT A 
TGAGCACCATGGTCCCTGCTGTGCTCACACTCCCTCTGACAGTA 
AACTCCATGCTACTCAACCTATGGGTTGCAATCCCGCTGGCAACCGTCTAGCTCC^
GTAAAAAGGTAGGAATGAAAAACAATGTTGCAGTTGGGGTTCACC^
AGCTGAGAACCACTGTTGTCACCAAGGCCATCTAGCCACAGCAGGAGATAT 
AACAACGGTTTAAAAAGAGACCAAAGGATCGCCAGTCTTCA^ 
CCAGTCTTCAGTAAATATC^CCCTGCACGTGCCAATTCATTCA^ 
AACACATGTCTATTAAGTGGGAAGACTTTTCTTCTTGTTGTTCCCCAAAAGATACAATGT
GGAAAGAATTCAGGAAGACATGCXSTGGGTTGTAGGTACAAATATGGGGCCGTT^ 
GTATACAGAAATCAAGCAGTCACCCACATATACTGAGGTAGGGATTGTAACT 
GTCTGTAGGAGAAGGGAAGTCATGGTGCTCATGGGAAATTCACGGGGCAGGAGGGGG^ 
CCTGGGACCAGCAATAGCGGCAGCACACAGATCrrTCCTAGAAACTGAAGT^ 
TTCCAAGTGCTCCGGGTATAGAJGATCCCTCAGAGGGCTC

GGACAGAAGTGGGACAAAGAGGAGTGCATGACCTGTGATCTAAACTCAATTTGGTTGTGTCTCTGTA 

AGGGGGTCACCAAAATCAAAACAACTCATTCTGGGTGCTCTGGGTGTCGGATGG 

CGTATTCTTCCTCGATCTGGCTGCGTGAGATGGGCAAAACGTTAGAACCTCT 

TATGGCTGATTTCT CTGGCCCTTTT CT AGTTCTT CGGTGT AACAG CTGTGTCCTG TTTT CTTTGCATG ATGGCACGAG AGG AAATG 
AAGAGACGAGATGAAACCATCT AAG TTCACAT AA CATCAGACAATCTGT AAAAACATACAT AT AT AAAATG T ATT AGGGT AT AT AT

TAGCATACATCATATAATATCATTGCATGTTTTAGGTATAGTACGTGGATTAATTT^ 

TAAACTGAATTTCTCTCTCTCTCTCTCTCTTCCCCCTTTTACGATGCTAGCACTGAGACTTG^ 

GTTTCTCACTGAGCTACATGCCCAGTGCTTTGTAATATGTGAAT^ 

GGGAGATAGCCGGTTTGCTGGTTGGAAGGCTTGTCTTACTGCAAGACAGTTCAAGGAA^ 
CCACTGACTX3GGACACAAAGAAGGGGGAGGGGTTCTCTACACTGCCCTTCCATAGACAT

NNNNNNNNNNNNNNNNNNTOINNNl^^ 

NNNNNNNNbTONNNNNNNNN^ 

NNNNKNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

TCTATCCAAGAAAACCAGAAAAACTGCGGTCAGAAATAGGAGCTGACAGGTTGGCTAATCAAAGGTT^

TCCACCTCATTGCCTTGATAGAGAAGCAGATGCTCTATTTGAGTCATTTACAGC^ 
CTGAGGTGGTAGTGAGTTGTGCATGCTATTCTGCTCTGTT^ 

TACAATAACGCTTAGGGGTGCTCTGATGGGCAGCCACCTCAGGGLAAATAGAGAGACCTGTCT 
GCATGAGTTTACTTTATTATATCTTTCTCTTCCCATTTTTTGTTGTTTTCTAGGCTGTG 

TTTCTCGAAATATGTAAAAATAATAATAGAATATCTGGGACCTCCTTTCAGCT

TTG TGGTCACAG CTTG AAG CCCCACTCTGG AAAC CTT CACG ATGGACT CGCAG TGAACTTGAAAG CCCAGG TT ACCAG AATGTGTG 
TGTTTAGTCTATGTTACAGTCCTCCGGTGGTTTGATCCAGTGTTGAATTCCCTGTTACCCTCCAG 

GAAGGAGG GATTTCAGAGTGGG CACTTTC CAT ACT CCTCGATTT AAAATG CAAATTGAT ATTTT AATAG CAT ACAGATC 
CAGGAAACTAAACACATAAAGCATTCATTTTCCCCCAGCCTGAG 

GTTTGGCACACAAAGCCCAATGGTTTCAGAAGAGAATTAGGTGCT

GATGCCCTGCGTTCCCTGGGTGTCCrcGTGTTGCAACTCTCTAACACCCTGTGGATC 
CTCTCCT AAAGACTCCCAGAATGCACACGGAGAGAGCATGCJ^TGA^ 
TTCTAGACTCATGGGGCTTAGGAAGCAGCTCTGAGAAGCCAGGCCCAGGC^ 
GTAAATGACTACC^TTCTGGCAC^GTCAGCTGCTACGAAAACAGATCACATATTAC^AGTCCT 

ArcAGGGAAGAGAAATAAAAATAATATGTCACGTGTGTTGGCCACATAGGGGCTTGTCCACTTG

TGAGTTCTCAGTGATCAAGTCCCCAAATTCACAGAGCCAGATACTCAGATAGCC^ 
TAACTGATGCTCTGACTTTAGAATGATCATGCACACAAGTGAGGGG^ 
ACACACACACACACACACACACACACACACGCATTCCCTTCCT 
TGGGTGCCTCAGGAATGCGAGGCTGCXSACACAGACAAAGCCTGCTGC^ 

GCTATCCAGCGTCATGTTTCCCACCITCCTCTTCCITGACT

TTTAAATGGCTTTAGAGCTCAGATTGTTTTTCTGTGAGAAAGCCTGAGGTTTCTCAGGTGGGAT 
CGCAAAGAAAAGAAGAGGGAGGATCCAGAGAGGGCTGCAC!ACCGAACTACATATACT 
AAAGAAGTTTTTGTACTTCTGCTGGTAAAAGGCTCGAGGGGGTGGCT 
TCCGGCTACAAGAGGCCTCTCCCATGCGCCCCGGCCGAGCCTGCGTGACTCA 

TGAATGACAGTCCAGATTTGCCGTGGCGATACGTGGAAAGAGCGAGCGCTGCATGTGAACTACAC

TTTTTTTCCTCCATTTGGAGCCAAACAGGTCTTGTCAGTGATAAAAATTGCTGCAATATAGTGGG 

TG AGACCGTGT CGGCAGTGC CAAGG AGG C C CAT AGCAGG AAG AAAT AG C AAG C CGCGGGG ATTTGAAAC CAT AACACAACTTCTAC 

GACATAATTTTATAATTGTAGCAATCGGGAACTAGGTGGTGGGCGGGGTAGCCTTGTAACC^ 

GGGTCCTCCGTGTAGCTCTGGGCTGGGCTGGGGTAG(2AGCAACAGC^ 

GTGAGGCTGTGTACAGGAGGATGATTGTGGCTCAGGCTC

TCAAACAAACCTTGAGAACC^GGAGGAAACAACAAAGTCGGCCTCTTAAAAGGAGGAATGGATCTCTAC^ 

CTGGGCTCCTCTACAATCTCT AGTCTGCAT AAT ACTTTG AAGGGGATGG A CGGGAGG CC CAG CCCTC 

CTGACTGGCCC^CGTGG ATGC CCCTACTTGCTGTTTCCTGGT AACAG AGACACCTAC CACCCTTACAAG ACAG CAAGCATCCCACC 
TTGCTTTTCATTCCGAGGGAAAATCAGCAGACCAGCTGGCCACGCCCT 

ACAGACCCACX3CTGCACCTTTCTGCCTACTTCCTCCTAGGGATCC

GAGCCTGTACGAAGGAGGAAGGGGCCTGGGTGTGTGAGGGCAGAGAGGGATGCCCTGTGCCT 
CrCCTCCGCTGTGGGTTTGCAGCTGCACCCAATTTCCTC 
AGACCTGCTGCTCGGGTTTTCAGGAAGGCAGGGCATTCTTGACCC CGGGAGG 
CCTGAGGCCCACAGCCAACAAGCAACGGGCTTCTTACCCGGGGCCACCATCAG 
6 5 GGATGTGCGGAG AGCTGTGGG CACCGAGTCCCAGACTGAGGCAACAG COiGG<^CCAGTGGGCTGAAGGAACCTCAGCGGGCT ATC 



GAAAGATAAGAAAAGTCCCCTCCACAAC^a3TTTTCTGATCTCTTCCCTCCCTCCTTACCCT 

CTTCCCTACCCGGTCCCTACACTAGGACACC^CAGAAACTAGATGATCAG 

NNNNNNNNNNNNNNNNNimNNNNNNNNNNN^ 
CTAAGGGAACTGTTGAAACCCCGCAGCATCAGCTGGGAGCTTAATGAGGAGAA

GAAGTGTGGCTGCTGAGCATGGGAGGGAATCGTGACTGTGGAAGCCATATGTGTACATG 

CCTTGCAGTTATCGGTTCCAGGTATCAAACTCCACAGGAGTTCAATATG 

ACACTCTAATCCGTTCTGCGGCTAGCATCTAACACGCTAT^ 

GACGAAAATACATTTCAATATGTGTACATTTAAAAAGCT^^ 
GAT ACAAAT AGCCTC CTGT AT AG AAG CTG ACGAG AGGTTTTTG T AAGACTTG TTTTCTACT ATGTTGT^ CT ACCCATT A
CTCTTGAAGCCACGATACACACAGTTGCTTGCATGTAT
CATACAAATCCAGCAAGGGGTTTTTAAGTTCTTGTTGCCTT^ 
ATGAAAGAGGCATGCGTGTGTATGTATGTGGTGTGGGCTGTATCAC^ 
CTGTGGGCCAGTCACCAGTGGCAAACACGCrrGTAAGATGTAATAACTG 
ATTTCCCCAGTCGGACGGCTGACGTCGGAATCTGTTTGTTCAGTGATATACCATTATA^
AGGTGGGACACAGCTATTAATATGGCATGCITACAGCAATACTCCTGCAGAGATGCAG^ 
GAGCCCATACAAAGAAAGGGGCAGAGGATGGGCAGCAGGAGTGG 
CTTCACAGGAGAAAAGGGGGCATCTACTTGATAAGGTTTTTATCTGAGCCGA 

ACAAAT AGGCCGACATTTTC CAGAATGT AAT AT AG AAAG CCT AT AC CCCCAGG CATGGT ATT ACAG TCAAGTT ATCAT AG T ACTTG 
CAGGATGGAGACAGCAGAATCAAGAGTTCAAGGCCAGCCATGGCTACGTAGCCAGCT^
GTTAGCACAGCACAGCGCCTCTCITTTATATGAACAGACACAAAAATTAGGCT 
CAAAATCCTTAGGTGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT^ 
ACCCACACTAATTTTCCAAGGTCCCACAGTGCAGCTCGGTGGTGAATTGAGGCAAATGAC^ 

TGGG AGAAGG AGATAT C CAAGGACATTTTT CTGCCTT CTGTTT ACT CT C ACTT AAGTT CACACACGTTTTCTCCGGGATCCCATGT 
CTAAAACCCAAAGAAAAGGCTGTTCTCTTGCCCTCTCCrrGGCTGCCCCGAG

TACAGTCAAACCAAAACTG^TAAGCGTCTTTAAAGCCAGTGCCAAAC 

CATGAACCCGC^CTGCTTCAGTTTGCCGTCTT^ 

GCCTTTGGAACCACCCACCTGTGCACAGAGGCATC^TCATGGACAG 

TCTTAGAGTGTCCTAGGGGCTTGGTTGGTACrCTGAGAAGATGTGTG^ 
TTTGATAAGCAGGGATTTACTGGCCrCCAGGCAGACACG

TCTTATAAGAAAGTGCTATTTTATACAAATCTAGTTTTGAGTGAACT 
GGATAGTGTAACAGGCAAGTTCTTCACATTGTTCCTGGCGTTGCGGGCAGT^ 

ACTTAAGGTGGG AATTT AAG CGATG CAC CATGGG ACTG TGG CTCT CTG AAATGG CAGTCACAGTTTTAAG AAAACT AT C CCAG AAG 

TAGCCAGGGCGGTGTCCCCTGGACACCACAGAGACACAGGAATGATTGTCAGCT

GTGGJU3CAGCAGAAGACTGCAGGCTGGAGCCACCCCTTGGCrCTGCCAGCAG 
CTTACCAAATAGACACCTCGAGTGATCTGCACTGGGGTTCAGATG^ 
TTTGTTGAGGTTTTAACTTTAAAACAGCCGAGGTTAATTTC^ 
TGTCATGGCTGCCTGTTTACGCAAGTCAGGACACCATTGTTAAG^^ 

GGAGTAGAGCAGATTACAGAGTTCTGGGGTAGCCGGGCCTGGTCCCTCACACCTCCCTTT^

GGGTCATTTTAGGTCTTGGGCTCTGATTTCCCTGGGAACC^ 

GATTCGTTGTGATGAAGCTGAGCTTAGACITCGAGGTCTACACTGTTCTTTTTGCT 
AACACAGTCTCCCCAACTTCTTCCAGCTTTGTTACGGGGTGGATATGAAGTCAAATCC^ 

GTGGTGTGGAAGGACACACTTGTGCTCCTGGCTCAGGGACTGGTGTC

ACTTGTGTAGAAGATGTGGACTGAATGTGTTTGGTTGTAATTGGAGGTTGATGCTGCT 

GTGCTGG ATGTCTTG CT AAGTCATGTTTCGGGGCATTTTATTTTTAAAGATGGGTAGAAAAG AAAG C CAATCAGAATACTGAAATT 

CTGAGCATGCCCTAAGGGGGATC^TGGGGTAAGGGGACTTCCAGCAACTCTTTCTTGG^ 

TCTCCTGTGGCCTTTTATAACTAAAGCCTTCTTCCCAAGTA 

CCACGTGACCG AGGAAGGCCACGTG CTCTCAGGTGGCCCCCAGATTT CCAGTTTTCTAATGGATTGTCTT AGGCTCCTTCCAAGTC

CCTTTG71AAAAGTTATATTCGCTACTGTCTGTGAGGTTCTTAGAGGTAGGGTTACCATGTCAG 
TGTACIGAAGGGAAGGAAGCATTTCCACCTTCCAGAGTAAGGAGCAAACT 
AAAGGCTTGTCCTGCCCACCACCAACTCACATAGCIATAACTAGCA 
CTGGGGTAAAGCATGGGACTGCTCAGCTTCGCrcCCrrGCTTTTCTCT 

CTTTGTC^TATGCCTGTCTCCACCCCTAGTCACATACATATTTAAAGCCAGCGTGGGATG 
CTTTGAAATGGCCATTTCATGCACACAGCGTAAAGTACTAAA^ 

TCAAAAT AACTC CTGTCTGTGGCAGGCACTAAACCCG AAGAG CTGCAGG AC CGG AGTTGTACACGCACAACACGCCAG AATCAGAT 
GTTCACACAATAAACAGGGCAGTGGGGAGGGGAAGGGAGAAAGAAGAAGTGGATAAGAGAA 

CTACAGAGCCCCTGCTACATGGTGTGTGCTTCCCTGCCCCTGTCCTGTGGTATGTG

GCCCCGCCCCTGCCACGTGGTGTGTGCTCCCCTCITCTCACCCCACAGTGAGA^ 
GAAGCGCTTCTAGGCAGGAACTGAGGATGCCCTGTCATGTGCATATATAT^ 

AAGT ATG AATGTG CGGG CT AAAGAACTG T ACCAT AG AC ATTCAGG AG AAG AG AAAGATGGGTGT CAGGG AGG AAG CG C CAGCAGG C 
CTTGGTGTTCrCCCTGTTTGGGCCCCTCATCTATTGAGCGTGAGGTACTAACA 

AGCACCTTGGGCCCTAAAACTGTTCAGTGTCACAACCCTGGTGTATTATGTATTAACCT^

AAAGAATCACITACCACTTAAACTGACACTACCTTTGTAACCCAAAACCACAGT^ 

ACTGTTGCTCAAAGAGGTCTCCCCATCTTCTCAGGATCCTGTTAGTCCCTCGT 

TTACTATGGGACTTGGTCTCTATGATCIAAATCGTCCITCTGACGCITCGAACT 

TCAGGGC CTTT CAGTGGT CTTTT G TG TATTTG CTG AATGG TG ATGTT AAAT ATT CG AGG CTT C AAAAG G AAC CAG T C CTT AG CTG C 

6 0 TAAGGCGC<3AAGAGTACAATCACAAACTGGCCCAAACTGGCTACGGCATATACT 

AAGCCTCTGTCTTCCCATCTACAAAATGGGAATGGTAATTGTTCTTGTCGG^ 

CG ACAACAC CAACTTGGCTAG TTTTC CTGTGG CTG ACACAACAT A CAC AG ATTT AGTG ACTT AAAACAATGAGAGTTT ATT 
CTCTCAAAATTGGAGGCTACAAGTCCCAACATTAA^ 

CCTCCTCTAACCCTTGCCAGCTCCAGGCACrrCATTGGTCCATGGCTGCATCCTCCCCACCCCCT 

TTCCATCTCTCCTTCTTCTTCTTTAAAC^CTCATGTTACTGGATCTAGACCT

AGCTGTGTGTGTGAGy^ ACCAGACCCTTTGTCAGTGAGATCCT CGTCACAATCTAATGGTCAG AATATGGCCCTGGCCTTT CTGGT 
GCCACCGTTCAGTGCCAGAGATTAAAACCTCTTACATTGATGTCATCGGAACATTAGCC^ 
CCTTCCCATCTGTCCCTCAGAGTAGACCTACTGTCCAGATAAATGTGATGTC^TGCCTGTGTGACCCT 
CAGCATCTCAGCCCTTCATTTGCCAG AG CCATGTCCTGAT^ 

GGGCAAGATCAGCTATTTCCGAGTGTTTCTAGAGCACCCATACAGATGAAGCCAAAG^

CATCCCCCTTACCCCAATACCTAGGGTGTAATGTCAGAACACAGGAGGCTTGGAGCCT 
CGAGCAGAGAAACAAAACTTGCCCTGATGCCCAAAGCCAGCGA^ 
CAGAGGCAGGCATGTTGTTGTTTTGTAGGACCAAGCAGCAGGCGT^ 
CCATGGTTTGCTTrGGGCCCCGGTCATGCCAAGCCACAGAT 
TTTAACGGCAGCTGTGAAGCTTTATCAAAGTTTACCAAAAGTTTGGCCCGAG^
TTCTCCCAGTGCTCK3GCTTTGTAAGTGCAG

ATTCTAACTCTAGGGGACCATGAACTAAGGCACTTGGTGTCCCTTATCTTCCA^ 
GTTACCACACTTCTTGGCCCCACACAGACTC^ 

GATT AAAGGT AGGGATCGCGGAAAG CCTGCAGACG CACTGTTT ACTG ACAAG C CTC C CATCTCT C CACAGAGTTT CCT CCCAT AAT 

CCCTTTCIXXSCAGGACATTTTATGTGTCAATGCCGGGCGAT^ 

AATAGAGTAGAACTCACCCTGAAAACAAACACAGATCAGTCATGTGCCTCTCTG 

GGACAGGGACTGTGCTGTTTCCTTGTTCTGTTGATCTCTGTAACCAG^ 

GCTCCTTTGAGCAGTGCTTACCACTTTAAAAGTAGCATT^ 

GCCACIt3GTAGTAGAG<3CTGGCCAGAAGGGACCCTTACTTCTGACTC^ 

CAGTGT CT CC CTCGGG CT CATG CAAGG CTT AG AAC CTT C CAG ACAT ATTTG CAAG ATTTG AAC CATCACTGATGGTTCAAATTCTG 
ATCrCCATGGATCATAACCTTATCCTTTTTAGA 

GCAAGTCTTATAAAGAACAGCATTTAATTGGGGCTGGCTTACAGCTTCAGA 
AGAGTGCAGACAGGCATGGTGCAG<jAGGAGCTGAGAGTTCTACATGAAG 

AAT AGTGACTTCCAGGTAG CTAGGGTGAGTGAGGGTCTTAAACCCATG CCCACAGTGACATGCCTACTCTGACAGGGCCACACTTT 

CTAATAGTGCCACTCTCTGAGCCMGCATATACAAACTATCACAGGTC 

TAAGAGGGGATTATGTGGGTGGAATCTCTGCTGTATCAGATXK^A^ 

CCAGGATGTAGCACAGTTAAACTCATCAAGGCrrTGGGCACCGTGGTGACGAGTCAG^ 

ACCATGCTGCTCTGGAATGAAGAGACACCACTGATAGATTAAAGTAGAACTTCCCCAG 

CGTGGGTGGGATGGTTCACCCATTAGGACTGTCTTCCAGCTTCATGGCAGGTCCT 

GGCTCAGAGCTAAATCTTCACTATTTCCTCTCCACACACITTGAAGGG 

AAGGGGCAGAGGACTAATGAGACTGAGGACTGCrCGCCTTAGAGTTGCT 

GTCTCCTGGGACGACGGGGCTGCCTCGGCCACTCTCCTTCCATTAGC^CATCTC 

ATAGGGTTATTAAAATATCTGTAAAAGTCAGCCCAGGCTCAAGGGCTTGACTTGGCCCTCCTC 

CTCCCAAGGTCAGCTCGCCAGCATCTGTATTTTGCGGTA 

CGCTTCCGTTGGTTTTCCAGTAAGGCACTGG CACTCTTAAGAGTGAGCCCAGAGCTCTTCAGACCTAATCCTACTGTCGCC CCCCA 

CCCAATTTTAAAACTTCAGGGTAGGAGCGTGACCGTGTCTGCAGCCAGATTATCACCA 

AATGATATTGTTGGGTTCACACATTTACCAAAGGCAATTATATA 

CTGGTGAGGTCACAGGCAGAGGAAATACCCTCTAGAAGGTTCAGGTGGGCCACTC C I T C T TTTTCA^ 
CCCCCCATACATCTTTCTCCAGGGGGCGGGGGGGGGGAGGTGGGGGGGATGCATATGGGTGAGCACA 
AAGTCGCAGGACAGCCTCAGGTGTGGGAGGACGGTCTCAAATGTCAGCCTCCAATAACT 

TATTAGCCTTTTGCTTTGCTCAGTGGTTAGG CTAG AAGG CCAG CCAACTCCAGAAGTCTGCCTGTCTGTATCTCC CATCTTGCTGT 

AATCCTTGAGGTAACCGCCATATGCTGCCACACCCTGTCCTTGOGATGAAACTGAGGG^ 

TGACTGAATCTCCCCCTGCCTGGCACTGCTCTCTACAGTTTCCTAGGCCrTGCT 

CATTCTGTAATTGTGGGCACAGTATCTTGAAGGGATTAGAGTCACCCCTAACACCCTCAGGGAAACCA 

CCCCACCCCACCCTGCTGCTTCTGATTGGCAGACTTTTGTGGTGGTGGTGGTG^ 

CAAGCTTAGAGAGGCTAAGAGCCACCATATGTTGCCAAATGAGAGAAGCCACTGCCCAGACATGGATCT 

TTTCCATAACTGACAAAGCAGGGGTTTGAAATTGAATTTGTGTAAAATTA^ 

TGCrATTCATGTTCCOSTGGCTTTTCCCGGTTTCCCCATTTGAGTCCTTC^ 

CGG AGGGTAGCTCTTG CAGTCAGACTTTCTCCCTGGCCAGTCATGGCTTC CrCTGAGGAGG ACTCCAG AACCTGGAGATGTTGGGG 
CGAAAGGACAAATCCTCTGT CCATCCATCCATCCATCCATCC ATC CCT CCCATCATCCAGCAC AG AACTCATCTTCACACCTGGTC 
ATCAGAGGGGCTCCAGGATAAACTTGGGGAGAGGGGGTACCTAGAAAAGCCTAATTCATTA 
GCTTTGGGGCTGAGACTCCGACTTTGTCCACTTTCTAGTTGTAG 
TTGCCATCTGTAAAATGGGCA.CGCTCATAATGCCTGACTCATGAGATGGTCIT 

TGTAGAGCACAGCGCCCAGCGTGTCTAAAGTTCTCGAGACTGCTCCCGC^ 
CTTGCTGCCGGCTCTCCXX3CTGACATGAGGCCTGTCCTGCGAGGCTCACAGCCACT 
TTTGAGTTTGG<K5TCCTTGGGTAAACrrcaVGCGGGACCATCCATAGT 
ACCACCCCACCCCACCCC^CC^CACCCTGAAGGTCCAGGCGCCCTCCATAA 

GTCGGTCATATG<^X3AACGGCTCATTTCCCCAGCGCTGG CTTAGCCIT' ACTCAGCCTTTTCCCTGT AATGGGATTTTGG CTCTGGG 

AGAGGGCAGGATTTAGTTTCTTATGGTCCCTTTTCTTTTCTTCGCTTAATGTTGAATTT^ 

AATAATTAAAATAATATATTTTAAAATGGGGrTGTTTTTTTGTTTrA 

TCACAGCAAGCACTCCCAG ACTGCTAACAGCTGT AGTGAGCT AACAGGTO^CHTC CC CTGACTGGGCAGCGTT 

CAGGGGGTGGTCCCrCCCCCrCTCCGTrGCTrTCTTGAATTAGACTCACCATA 

TTACCGTCTACTAAAGGGTTTCTTTGAATGAGGTCTCATCT 

CTCTAAGCCTGCTCTCAGCTGGAGCCCAGGCTGTCAGACC^ 

TGGGAAGGACAGCCACTCAGGGCAGGCCCTAGGCTGGGACAGGGGGGAGGTGCCGAGTGACAGTCCCCT 

TCCTGCCAGACHTATGTCATTACTCCCAGAAGACTGCATGCATG^ 

AAGTCTTTGCCATGAATTCAGAGTCAGGGCCTTGAATGGGTAGATCAGAGAC^ 

TTCCCCCCCCCCCCCCGCCCCCCrCCCAAGCTACCAGGGAACAGGATCCTGCATAGCTGG 

AGAGACAGAGGACTCCAGCATCCTGGCTCITAAGTGATATTTACrTAAT^ 

CCCTTCACTAATTAGCCCTGCCGGCTGGCTTGTCATCTCCACCTTCACCACC 

CTCCCAGGGGACTTGCACTCrcGGCTTGCCTCCTCC^CTGTCT 

GGGCATTCTGGGAAAACGTAAJ^GGATCCCGCCCCCCCTCCCCAGATCCCAGAGGAGCAATGGCAGCCrrGGA 



ACAGGGAG CCTTCCTG AGGAGTCCCTTCCAAACACCATCCAGG CGAACGAGGGGCTGTTGGGACAAGATAGGGCAGACCCAGGTGT 

AAATAACCCAAACAGGCAGGGTACTGATACTGCAGACAGTCT 
NNNNNNNNNNNNNNNNNNNNNNN^ 

CCCACT CCCATCCC C CCCC CAATTT AAT AAATTT AT ATTTAG AAA CT AAAT AT AGTTTCT AAC CAACAGCAGCC CTGTT ATG AAG A 
ACAT CTTGCAG CAT AAGG AG AG AGATCTCTT AG AAAATCAATTTC CATCTGT AG AGACGTGGGAT AAG GAAGGG AATGT AAC CTTT 

TaACaTGAGAAAGGGAATCACTCACTGGTGAGGTACCAGACATCA 
TACrTGAGCTGGCTATGGGCAGCCCCACCTGTAAAeT^ 
CAAAACAAGGGTTCTGCAAACTATCCCAGGCATTCTGT^
CTCTTCCCCTCAACCCCCCACCATGTTGCC^CCATGCCTCA 

GTGCCTGTGGCCATCTTT CTCAG AGCTACAAAGAG CTCCCTT ACACACCAAGCTCATCTCTGAGCACTACAGGGAAGGCAAACACT 
GTTAGCTCATGGGCAACTGAGGACTGAGGAGACTTGAAGTTCCITGAA 
GCCATCTTGCGTCAGCCATCTTGCATCAGCCATCTTGCGTCGGCCATCTTGTC

TTTCAGACACGGCAG AATGGCCACAAAGTACAGGAATAATGACG CCTCTG ACCTCAGTGTGCCTTCTGCTG CTATG ACTCTTGAGT 
GGCAATGTAAGTGAGX3ATCCTGCAGCTTCCCCTTGGCTCCTTTGCCrTCAGG 

CACCTGGTCTGATCATGTAGCCTTCAAAGG CCATGCCCACGG CCGAAGAAGCTATTAATGCCTTTCAAGCCAACTG AAGGCCAAAG 

ACTTGGCGCACAGAGCTCTTTTCGCTGCCCATCAAATTCCCTTCTCGAGC^ 
GGCrCCCTAGTCCATCCAAGCTCTGAAATCACTCTCAGCCAATCACAGGA^

GTGGATGTGAGGTTAGCTTACCTGTCTTCCTCTCTGGCTAGTCCTGTTCCCT 

CAATCCTCACATGTCXCTTCCTACAAGGACCAAACTGAGTCCTCrCCCMTTC 

ATCTCCCCATGTCTGCTCTCTGGTTATACTTGAGAGGTGGTGGGGCCAAGTC 

CTGCCCAGTTAACAACCGCAGCCTGCTAACCX3GAGGACTGAATCTATGTGCT 
GACCCACAGAAGGGTCAGCAGAGTCCCAAGTACCCTTGAACACTCCTTCACTr

CAGAAACCAGTCAGTCCCTGTAGAAGCATTTTTTCTTCTGATCTTAGG 

CATTTCAAAGTTCCAGGCAGGAAACAATGCAGTGACTGGGAGGTCCCT^ 

GGATACCCGCCCCTCCCCACCATGCTGTTTCCAGACCCACACCAAATACTCCTCCCATGACACCCTC 
AGCTCAGCCTCACCTGAGGCAGACTTCCACATATTTGAACCTGCTTTC 
CTTAGTTACCCACAACTTCCTAGGGATCCCAGCCCCrACCACAGCACAATACT

T CAGGGGTGTGTGTGT GTCTGCTGCTT C CTTCTCAGAC C CACATTTCTAGATTCTTTTT C AAATC CAT CCTTCX3 AAACT CTTT ATT 

TG CCCTGGGG C CAGGG CTGG AG CTCAG CTGGGAGAGTGCTTT C CT AGGGCTGTG AAC CCT AAATT CT ATC C CCAGCACTTGGCAC C 
CACCGACATCTGTACCACCTGGATTCCAGCATTCAGGAGGTGGAGGCAG^ 

TATATAAGTTCAAG-GCTAGTCTGACCACAAGAAATCTTTTCTTTAA

CCGTCCCCAGCTTCTGGGGACTTTACCTAGTATGACTCCTACCACGCAC^AA^ 
GGTCACAGGATGGTGCTCGGACAGCCAGCCAGACX5CTCTCTCTCATCCACCTGTCT 
AGATTGTGGCCCCAAACAGGAAGCCCTCTGGCCCrCTGGCCCGCrAATCCCCTTTGGGA^ 
GTCTTCTGTTC^AATCAAGGGGAAAGTGGTTTGAATGAGCrGTGTTTT^ 

AAGTTCCTGTTTTITAGAAAGGCTGTTCATTTTCAAAGAAC

TCCCCCAATGTGGTACACAATACATGCGAAAAAGAAAGCGGGGAGCCCCACCAATGAGAGATAGTCAGTAACA 
TATAATCGGCTTTTGTTGGTGCTTGCGTCTTCTTGTTGAAAACCAGAAAG 
TATCC^CCCCCAGAAATCCTGCCTGGAGTCCAGTGGCAGGCAGGATCCGAAGCT 
CAAACGGTGCTTTCAGTCTCTGGTAAACAGCCTGTGCCCCAGGGCTC^^ 

GCTGTCTGGCCTGCGCTCGTCAGTCTAATCTGGAGGTTCAGACAGCC^GGCC^

AATCrAATCCACTTGAAAGACAGACTAATACAAGTGCAGCTAACTCATTATTTTAAG^ 

CT AT AAAAGGGTTTCAAGTCAGTGCAGTGCAGTGTTTGG CAG CIT AAAGCCATT AGAGGAACTGCAAAGGGTGACAGTGTG ACTCC 
CCGTCGCCGTCCTCATTTTATGAAGCCTGGTTCCTCCCTGTGACCACCCTAAAATGG 

GTT ATTTGTGAATT ATT AT AG T AAAAG ACTCCAACTTTTGTCTTTG AT CCAG G CG TGGGT ACGAAAGG AGACACAAATGT AAAG AC 

AGAATTCTGAGGGACCCCTCGGAGCTCTGAATACCCCCAGATCTGAAAATAGGAAG^

AG CAGTTAG AG TTTT TG AAGG TT AAG CCTT C CCATTTTG CCGGTATGATCATT AGGT AGGAGG AGGGGTGCTTT ATGG AAGT AAAC 
CTTCATCCTACCACCAACCTGTCTG AGGAACAGAGACCGTCCCTTCATCTTG CACTCCAGCTTT CTCTGAGTTGGAGCTGGCACAG 
GTAGCCTCTGGGGCTGCITCCTGCCCCATCATTCCATGAGTCTTTGGAACAAGACCT 

ATCTAGTTACATGTGACTCATGCCCAGAAAGCCCAGGAGAAGCAAGTGCTGGCCTGTCCCTTCCAAGGTAGCAAA 
ACTCAGGGGCAAGCTTAT^GCC(^TTT ACACCTGTGGCTCCCCCGGGG CTGG ATT CAGCCAGTGCCAAGTCTGCGTGTCAG ACTGA

GACTGTGAGAAAGCAGTTTCTGCAAACACCGATGTGGAGACGAGGTTCTCCT^AG^ 
GCCCCTC^GTTCATGTGTCTTCCACCCAGGCCACCGTGCTAGATTACATCTCTCGAAATCAGAGCA 
CTGCTTCCATGTTGAAGTTGGCTGAACTACCCTATTGGTGTGACGTTCC^ 

ACTG TGGG AGCTG AAC CATTCTTTT ACGGGAACTTCTT ATGCACTTAACAGCT AC CAG ACAG CAC C C CTGG TCT CT ACC CACT CAA 

TGTCTCTGGCTCTCCCGACTCTAATCAAGAGTAGTCCTGTTTCCTGACATTACTAAAA^

TTAGACCAGCCTGGGAAAGGCAGAAGTGTTTAGTCC^TGCX5TATAAGATGTCAATG 
GCAGATCACTTCGGCCTCTGGGTCTCTGGCACCTCCTTCCTCCACTGGCCAGTCCC^ 
TCCCACAATCTCAACCTGTAOVTAGCTTTTTTTCTTCCATCTGCGTCCAAGTCACTGAA 
CCTTGACCTTGTAAATCAAAGGCGTTCTCTGGAGGGAGCTCTCCCACTGCTGGGCT 

5 5 TCATCT ATTCAAGTGTGAAG<3TTGTATTTGAAATGTCATTTACT C C ATGG ATCTTTAG ATCATCA 

GAGGCCTGCTTATTACCGAGAGTCTCTACGCTGTATGGGTATGTGTCTTTTGAGATCT 

ACTTCCGTTTGCCTCTCTGAATCAGAGCACGGGGTGGAATTCCTCTGTGAGTGGAGGGTGCCCCCAAATGAGTCTC 

TTTGTGTCCTTTGTTCAGAGCACCAGGGATTACTTTATGAAAGCAC^^ 

CTCTACCTCACCAACCTGTGTTACTAATCCATTGCC^TCCCAATTAGTTCAGATC^ 

CGCAAATGTAAGTGAGTGAGTCCCCCTCCTCTGGTTCTCCCACGAAAATGAG

GTCTGGTGCCTCAGTGTTTGAGAGTGAAGTGAAGACCTCAGGGAACCCTAAA 

GCTGCGGGGAAGTTTCTTGGCCCCGTCTCCC CACCCT ATTGCTGATGGCAAG CATCTGAGAAATTTAGG AGCCTCGG ACACAGAGT 

TGGGTCTGAGAACATTTGCTGGGTCCACGTGGAGGAGGAACCTTGTGTTTATGGC^ 

ATCT ATGG AAAAGAGG CTGG AGTTTGGTAGGTCCATAC^ATGGCTCATC^TAATT 

T CAT CAG CAGAGG GATGGT AAGG T ATGTG AGGGTT AGATCTTT CT AGG TGG ACAACAAGAC CTATG AG AATACACTTCTCTGATCT

ACCATTGTTTCTAGTACTGTGGGGAGGCCATGGAAGATCAAATGGAAGTGCAGTGATTAATGTCCAT^ 
GACAGTTCTTGTGAAAGATGTGTTAGAACAACCCAGATAGGAACAGCCAGGAATC^ 
AGGAAAACCCTAGACTTATTTATTAGATTCTTCTGATGCAGAGAGAGACCTGTC 
CCC CGCCCCCACCCCACAGCTCTCTGAGGTGCTTCTCTTGGGCTTTGCrrrTGGGG 

AGGAAAGGAAAACACAGCACTCTCCCAGGTTCAGGCCAGAGAGAACAGACATT^

CACACACACACACACACACACACACACACACACACAACGC^ 
AAACATTTACCAAAGGGGGCTAGGCATTTCACCTCTGCGGAATTTTTCTATT^ 
AGTGTGGAATGAAGGCTCTTAAAGAAAGAACAACCCTTTC^CATGACT 
GCAATTCAGCTTTCAGACCCTGTGTCCTGGCAGGCA 
CTCTGGAGCTGTGGTGTTCAGGGAAGAGACGCTCCCTCATGCTGCGTCTCCG
GGTTGTGGGAACCACGGTTTGCACCCTGGAACTCTGGCTGG CTGGGACAAGTCCT ATTCCTGGTAGCCTTG CGGTGTCTCTGCTAT
GTGGAGCACCGGCAGTGAGGATAAAGAGTCTGTGCTGGACAAGCCCAAGCAGTCCTC^ 
GGGTAGCAAGGGGAGACTCTCTTGCCTTTTCATACTACAACTGAACTGCT 
ACCTCACAGGCTGATGCAATGACCCCTCCTCACAGGAGGC^AGTATC^ 
TTTGCAACCTCTCAGACTGGAGAGTCATGCCCTTGCTCrCTTGTTGC
GTACCCAGATACTAGAGATGCCTGGAGCCAGTACAAGCCA 
GATATTGCATTGGATCTTCTTTTCTGTCCCCATGAATATGAATTTCTCTTA^ 
TTAATACCCTGGTCAGAGGAACACAGGGGACAGGGCCTGAGTTCTGTCCATTAAGCCA 
TTTAACAAAGTGAGTAGTACAATGCTAGACTACAAGGTTGAGCACGCACAGGCCAT 
1 0 CCACCAGGGTCCCGAGTGTAGGCTCAGTGACAGCCCTTGGCACAGAGATCTGGGA 
ACAAAGCCACCTAGTGGCTTGATAGAGCCTGGATTAGGTCGGACTTCAATCT 
ATAGATCCATCATCACCACCCTCTGCAATTCCAAAGAAGAATAGCCACTCCTrr 

AGGACCCTCATAAAGTCTCAG CTGTTCACTTAATGTG ACCGCCCAAGGG CCAGCTCTTACTGCATCCAGTTTTCAAAAGACAGTTG 
GTGCTCCCCTGAAGACCCAGCAGCAGCCCAGAGCAAGCTGGACAGAACAAGCCA 
1 5 ACATCCCCTGACCTGAGCCACACGAATAGAGCCTGCAGTGTGT 

GAGGTCCTCCCGCTCGCTCGCAGCTTCCCACAGGAGGTGTCAACAGCT^ 

CTG AGC C CAAGCTC CCAAACT AAACT ACCTGGGG CCTCXKaACC CCTGACACTGAATTT CT C CT CA CCTTATTCAAGGGAGG AG TAA 
GGACGAATATGAGGCTGGAGAAGAACCACTGGGGGCTGGGGAG^GGAAGTCA 
ACJAAAGCAATGGAGGAGAGGATGATTCAGCAGGCGCTCCTAGGATGGCACCT 
2 0 AAAACATAAGTGGACGTCAGAGGTCCAGGCAGCTCGTGCAATGC^ 
AATAAGTTATCTGCAGGAAAGGACACTGCAGAGACCAGACGG 

GGCCACTCTGTCTGGAGGAGAGACGGTGCCAACAGAGAATGAGAGGTGTTATGAAAACA^ 

TCCCTGAACCCTTGCACAGGGAGGACT CTGC CTAAAT ACAGTGCCCTTTCCAG ACCTGTCTCAGACTG ATCTGGCTG AGAGGAGCG 
GCCTCCCTCGTGGCGAGGGCTGAGCTATTTCCCCTGGGAACTGTCTGGCAGCTCTAATGA 

2 5 GCCTTTCATCTGTGGATCTCAAAGCTGTACCACAAACATTAATTAACT 

ACCCCATTTTTAAAAAATGAGGAAACAGAGATTTGGGAGGAAAAGGATTCTGGGGCGAG 
TTGGGAATAAGCGTTTCAGATTTGTAGCATCCTGACCACAAAGTCAGGAGGCCTCCCCGGG 

ACAGGAGATGAAGCCACACAATTCCTT CTTCCCTTTCTCAGGTGGCCTGTCACCTC CTGGGTCTATAGATGTCATTT AGAATGGGT 
CCCTCAGACCCCGAAGGCCTTTGAAACAGCCTTACCAGGCAGGTCTCCATCCTGGCT 
30 GTTGGAACAGTGACCGCAGCCGAGACGCACATGACTAACCAAAATGTCTGCAGCCAGTGGGAGACAA^ 
GCTCTTTCACAACCCTCrTTTCTTCCAGGATCCAGAACTGGTGGGAACTGACTCCT 

GCGATGCTAATCCAGTGCTGG CTACCTGGGG AGTGGGGGTTGGGGGAAGGAGCT CTGGGGTGACTCCCGTCTGTTCTGACTTCTGG 

AGCTGTTGTGGGTTTACTGCCCACCGCAGAAGCCAAAGGAAACACTGCTCAAAACC!ACCAGAG 

ACCTGCCCAACACC^GGCTGTGAAAGCTGACATTGCAGAGT 

3 5 CACATCCTGTTGTGTTCTCTGAGGCATCCGAGAGGGCATTACACCAGCTACCCATGACTGAGCTGCACTGGGAG 

ATGACTCTGCCTTCTGCCTTGCAGAAAGAGGGGGAGTCTTCTAAGGAAAAACCCAGAAC^ 
AGCAGATTTTGCGCGTTAAGAAGCAAGTGGCCCTGCATTGCATGGCCCACTGTTTATTTTCA^ 
TGTTTTAGAGATGCCCGCCAAATACTCAAGGAACAGAGCCCCGACTACAGCAGAGCCCCAACGTCCTGACCC^ 
GTGAGGTCAGGTGCCAGGGCCACCAAAGGCTCAAGTGAACTAGGCTGATGTCCCCGC^ 

4 0 CACAAGGCTGAATCGGGATGCTTTGAGTTTTCGAAACyuSGAACATTT 

TAGATACGGCTACTTCTCTTTGCCTCGGGTGTGGGGCATTCT 
TTTCCTTTCAAATATGTTTTCCGAACCAGAAATTTCCCTTCCAAAGCTGCCTCT 

TCATTAAGTGACTGGCAGTCG ACAT ATCTTTTCAGATCTCTT ATTTGGTGGCTATGGCATCTC CAGATAATCT CCAATTTTTGAAG 
ATTTCTAAGTAATAGTTCTGTTACCGGTTTTTGTTGTTGTTGTTTTGTTTTGTTTTTCGG 

4 5 TAGTTTAAGTCCTTGTGGCCTAAGGAAATACTTTGTATGATTATGCCT^ 

TGCCTT AGCGAGTG AT CT GTGTG C CT GAG AAG AAAGG AG CGGCCTAC C G GTGTTCT GT AGG CCAAACTG ATACTGT CCAGGTCAG C 

CGAGTCTGAATTTCTGCTTGCATGGTCTATCGGTGACTGGAAGAGGGTGCTAAAGTCTCCTGCCGTAAGGATG 

TTCCTTGT ATTG CAGTT AGTGAGAC CCTTTGTTCACGGGACACCTGTGAAGGAAAATGT ATATTT 

GT AACTTATCTCTTATCCCTGAAAAGCTC CATCTTCAATCGGCTTTGT AGAAAACTTGATACAGCAACTCAGG CTTTCCTTGGATT 

5 0 GGTGTTAGCACAGCTTCTCCACCCTTTTATGCTTGCCCTATCTGAATCTACATTTAAA 

AGTTTTATTTGTAGCTTGTTCTGACAACCTCTGTGGTAGATTT^ 

TGGTCTAACTTTTATGCCTGTTGCACCTGGTTCTTAGTCCCACCCCCGGCTTTCCTGCCTCCTCT 
G1GACTTCATTTTAGCITGTTTATTGCCTAT 

GTCAGACATTAAAGCACTGAATGCTCATCCAGTGGACCTGGGTTTGATTCCTAGTAACCACTAGTGGCT 

5 5 TCCAATCCCAGGGAAATCCAACACACGATTAAGGTCTCCCTAGGCACTACAAGCA 

ACACTCATATGCATTAAACAATAAATAAATAAATAAATAAATAAATAAATATTTTAAAATTGCTTC^ 

AAAACTAAG CCCAC AAT CAAAT AATGTT ATT C CTT AATTGTT AT ATG TT ACTT AT ATTT AAACTTT TGTTTACATTTTT AAAT^ 
GTGTCTCTCTCTGTGTATATGTGTGTGCCTCTCCCTCrCCTCTTCCTCCTCCCTCCCTCCCTCCCTC 
TGTGTGTATGTGTGTTTGTGCGTGTATGTGTCTGTTTCTCTCTCT^ 
60 ATGTGTCTGTCTCTCTGTGTGTACATGTCTGTCTGTCTGTCTCTCTTTTATGTATGTGTCTGCT 
ATATGATTCTGTCTCTCTCTATGTATATGTCTGTlTGTCrr 

GTATGTGTATGTGCCTGTCTCTGTGTGTGTGCTGTCTGTCTGTCTATCTCTCTCTCTCTGTCT 

GTGGTTTCCTTCCACTATGTGGGTCCTGAGGATCAAGCTCGTGTATTGGACAAGCG 

6 5 TAT CTGTCATT ATTG C CAATT ATTTT CTTGAT CTTT ATG CTG T AACTG TCCAACAAATTGTGG CTGTT ATCACTTCT AACAG TT AA 

CTTTCAG AT CCGTT ATG AAT AG AAAATGGG TTTT ATTTT ATTTG ATTCACT1' CTTCTGGG ATGAC CTCCT1TT CACC CAAGAT CT A 

TGTACTGCTTTTCCCTTTGCATTGGAAGAATAGTTTTGCTGGGTATAA 

TGGTGTGAGTTTTCTGTGATCGTCCAGTTGGGTTCAATGATTACAAAGCTATAGTTGACAAAGGC^ 

7 0 TGTTACITCCTCCCTTGTTGTCTTTGAGTTTCC 

AAGATGGGAAACCATGAGAACTCAGTTGAGTTCCrGGGCCACAAACATGTTTCT 
TTCTCATTATCAGTCTATATTCAGCCTCCAGCAACTTGTG 
ACCTTGGGTTGGTTTTACATGCrTGCCTGCCTCCCAG^ 
GAAAGTCACAGATTTTCTGTTCTACTrTTTTCCTTC 
7 5 AAAAGGAAGAAGCAGGCCACTCTGGTATGAATTATTTGGCAGCGTCATGGT^ 
GACATTCTTTTCCTGAGTGAAATATGTCATGGTGCTTTG
GAAGGGAATGGGGTTCAGGGATGTAAAAGGAGCCCCTCTTATCT^ 

TTGTCCTGAAAACAAAAACAAC CAG CAT ACAGAAGGCTCAACTTTT CTGGTGCTTTT AAAAAC AG AAG CATGG ATTGATT AAAACA 
CCGAATTCCTGCTTTCTCTGTGTCACGAAATTCCAGAGTCCTTACT 
5 TACTCAGCTCCAAGCACTCCAGTAGTACCTCAGTCCCA 

CTGAATTTCCTTTAGCAATGCAATCATGGAAATCACTCAGATTAA 
CCTGTTGCTTGCTTGTTTCTTGATGTTACTCTTGCTAGTCAGAA 
AGTCAACCCCAAATCTGGTTGACTTACATCCTAATGCTTTCCTGCCCTGCT 
TCACTGCTGCCCTTGCCTCTACCTTGGAAGCAGACATTTG 
1 0 CTTTACATCATCCCGTGTCTGTAAGTCTTACAAAGTAAAGACCA^ 

GAAGCAAAACTATTTGTGGGAACATGAATGAGGGAACAAGTAAATAATTTCCCCAGCCTA 

AATGGGTGAATG AATG CATG C ATGAATGAATG CATGAATGAATGAATG AATGAATG AATG AAG CAACT CTT AGTGTGCTG ACATG T 
GCATG AGGAAGG AAC CTTT C CAAG CATCACAAAGTT AGAAG CCTTTC C CTGATG AG CCAAGG CAGGGG CCATGTGG CT CCT AG T CA 
CCACT AT CACTGGC CACATTTCCTGTGCCTGG CACACGGT AT ATGTTT AAC CAACTCTTG TT CACTTGG ATGAATTGG CTT AC CAG 
1 5 AGGCAACTCCTTCAAGACTAGTACACCTGCACTTTGCCGAATC 

CCCAGTTTAATCCCTCCrrTGTGGCATTTCCCAGAAGGCACACAGT^ 
CGACTCTTCCCTGACTCTAGCAACATATAGCAAAGATGAGTTGGACT 

TTT AG AG AT CAT CT CTCTGC CGAGAG AGAAG TATG GAGGTTTGGTCT AGGG CAACTGG CC CT CT CAG CAAACCG ATGGGCCTAGAG 
CTCCTTTTTCTGATAGGCAATCAACAATATGGTTGT^ 
2 0 ACAACX5ATAGAACCTCTCCCACAAACX3CAGAATGCACTGTATTTCCCT 
AAAAGCTCAAAACAACCCAAATTACAAGAAGAGGCTCACCAC 

CTGCTGCTGGCGGAAATGATGGTT'GATGGGCCGTGGCGTCAGTGACCCACTCAATACT 
TATAACCrGTGTTTAAAAAAAAAAAAAAGAAGAAGGOSCAGATTCrGAGCCT 

TG AAAT ATT ACG CAACGTTCGCACGGTTT AATT ACG AGTTTAG AAAGG CCT AG C CTGAGTTTTTCACT ACAGT AAAACATGACAGA 

2 5 GCATGAACCCrGGCACCACACTTTATGCTTAGTGTGGTTTT 

ATTACAATACCGCAATTCACACCCCGGGGGCCTTCATGACAATTACAG 
GAATAGTATATGTCCATTTGCTAAGTGTGTATATTGAAAGCACTGGCAT^ 
TCCCaSCACACACACAATGGAGCAAGATCCTTACAGTGATGTGGGATGCAGGACGGTC 
AACAGTCGGCTGTAAAACOGCACAAGGCTCCACCTGCATCGCAGCACnTGTCT 

3 0 TGCCGTCCCTTCAGACAAACTCTCCACCATTGGGAGGATGTCA.GGAGATCT 

GTGAGAGAAAGGAACAAGGCTTATTGAGCCCCGAAGTCAAGTGCACAAGAGACATTTC 

TTCTTGGTGACAATCTATAAAATACATTCTCAAAGTTTTGCCCTTCAA 

GGTCnXSTTGTGAAGCAGCCTCATTAGTGTCCCTGTGTTTT 

3 5 CTCTCTCTAGTCTCTCTCTCTCTAGTCTCTCTCTCTCTAGTCTCTCTCTCTCCCTCTCTCTCTCT 

AGAGTTATCITTTTACrATCGCTCTCGCTATTGTTTCTATGG CATTTGAGG AATGG CATTTG CTGAAGTAGAAAACAT ACACATGC 
CCGTGGACATTTCCTCAGGGTCCCCCACGCTCCAGACCCCCTTACATCCTTTGTGCTTCCTGAACAAGG 
ACAATGTTTCTGGTGATGATAACTGTAATTCCTGGCCAGGATGAATGTGTTATCCT 
CCAGAGTCCATTCCCAACAGTATTAGTGTGAAAAAAAAAATGTATA 

4 0 GATAGATAGATAGATAGATAGATAGATAGATGGAAGGAAGAGTTAGCAAATAAAACTCCCCCT 

ATCTTTAATGGGCCTTCATACAACAGAGTCATCAAATACTTX3GGCT 
CACATGTATAATCATTTAAATAATGGTTAATAGCTATCATGATGAGAACTTTTCGTCT 
CCAACATCCTTGTCTCCCTGCAGATGCCAGGCAGATCCAGCCA 
CCATCACCTCTTCCTCTGTCCACCCAGCGACACCCTOTTCACCCGC^ 

4 5 AGTCGACTCTCAAGTA^GCCCTGTGAAAACACACCTGCAGCGG 

ACACAGAAAGACACAGG AAAGG CGGG AC ATGG CT AATGG ACCAG AAAAGGGTT CAAAG AC CAAATTTC CTG C CT CTGT GCGTGTC C 
AGTGCCTGAG CACTGATCTTTCTTAACATGCT AAG AGATG CCAGATGAGGAGGCAGCAATGCCTTAGTCCATCCCTTTGGG ATTTG 
GGGTTTGCrrCATATCTGGTGAGAAACGTT^TGGCCAATGGTTACTATTCTG 
TGGCCAAGAAATTAACATCCCAACTGAATTCTAGACCGAGGCTATTCTTTCCAGAAGTGGC 

5 0 CACAGTGGGGTACATACTGGAATAGGACIACAAATACGCCCCAAGATAAACAT^ 

ACCAGGTAGAAAGCTCAGGTGTCATGGTTAGCATAATTGTTGTTAATA 
TTTGGGCTTGGCACACTCIACCTTCGCTCCreGGCAGGGAAATCAGAGGTACTC 
AGAAACTTACTGTTTGGGGCTTTGAAATTC^AGCCGAAGCAA^ 
ATCCACrcCCATGTGTGCCGGGGGAGGGGCAGGCrTTCCTT^ 

5 5 GGAGTTGGGGCTGCTGCTTTAAATGTTGTGAAAGACTACATTTCCAG 

CAGGGGACTGGCCTATGTCCCAAACAATAGATAG ATGACCCTTGACCTC C AGATGT CAGTAGTAAATGGCCCTGTCTGTCCAT CAG 

AAATATCTCCAAGGCTGGGGACCAGCTCAGTGGGCAAAGCCCTTATCTTGCAAACATC 

GGCACAGGCTGCTAGTACCATCACTAGGACATGGAGACATGTGAGTATCCAGACTAGT^ 

6 0 GACCCTGCCTCAAAAAGCCAAGTTAGATGGCTGCTGAGGAACAC 

ATGAACACTCAGAATGCTCCCACTCACCCCTTACAAACACAAACCCACAGAA 
GGGCATTGAGAACTTACTAGCTTCAACTAC^CTTGGTGCTGGACTGAAATGCCA 
GAGAGAGAGAGAGAGAGAGAGAGAGAGTTTGCCAGGACCCAAAAGTGACCCTCTt5CTGCCCAG 
ACAACTGGCATAGAGTGAGCCTCAACTTGGCAAAACrrCCCTCTAGAAACCAAGC 

6 5 TTCTCATAAACCTGGGCCAGAGAGGGTAGGTAGOTTCCCTGAAAG^ 

CCCTCTGTGGCTGATGTGCCTTCTAATCCCACCAGTGAC^CTTCAAGTTCTGG^ 

GAAAGGAGCTGCCACGTGAAGGTTTATCTTTCCTCCTCATTTCTCTTCT 
TAATGTCTAGCTGCCAACCTC^GCC^CAGACAGCGCATACTAATCTGAGTCACGCAG 

7 0 TCAGGTTGTTTAATCCTTTGGGGGCAGTGGGAGGGGCCTGG^ 

GCGCTCTGTCAAC^TAGAGACTTAGCACCATAGAGGGAGAGACACTGCCAGTGAAAGTGGCTGAA 
TACTTTrAAAGTCTCCCCATGAATGGGAGAGGGGGCAGTTGTCACCTGAATTGAGTG 
AAAGAATAGAGGAGCCGTAGTTGGGTGATGTATAAATTCTCTGTGACTGCTGATGTCT 
ATGAAAACACGGGGAGCTGGTGAGGTAGC^CAGTGGTT^ 
7 5 GCTG CCCATAACT CCAACTCG AGGG CAT CTCTCTCCTTCCTCCAGTGTCT 
ACACACACACACACACATG CG CGCGAGCAC CT AACT AAAAAT AAAAT AG CTGT AAATG AGAACACGT ATTCGAAT AGAG CACTTG A

ATTTGAACATATCTAAAAGGCCCATGTCTTTTTTATTACAGAGCGTATACAAAGCGAGACAGG^ 

GAGGGAGAGAGAACCTGACCTCTCGTAGAAAGACGCCACACTATCTCAGATCTTC 

AACATT ACT CCTG CGCTT ATG CCCATGACAAATCACGGACT CACTCGGG ATTCCGGT AACGTGGT AC CAT^ 
5 AAGAACTGGGACAGACTGGCAGAGCATGTTCAGACACGGCTCTGAA^ 
CTTTTTTGTCTCAGGACAAATGAAGCAAGAGGGATAATGGGC^ 

GG CCG ATTG TG TGTGTG CCT AATG CT CT C CCT CCTG CCT CGGCTT CTCC CTAGACTG T AT AAATCT AACTGGAAAAAAAAAATGGA 

GTAGAG<TTTCCTGTTAACTCTCAAGCATGTC^TGGCTCTCT 

TGAGGGCGCAGAGATGAAC 

MOUSE SEQUENCE - mRNA 

CTCATTCATAGAGCCAGCGGGCGCGGGCAGGACXGGCGCCCCGrcGCCC^ 

TGGCACTTCCATCCTGAGCTCCCCGAGACGCAAGAAAGTCACCCTTTGCGGCT 

CTTTCCTTTCCTTITTTITTlTrTTT 
1 5 GTTTCAACITCTAGGCrrGTGAACTTTTCTCCGATTGTTTC^ 

TGAAGTTGCGTCTGCGTGGCAGGGTGCGTGGGAGTGAGCGTGTGTAACTCGAGC 

CTCCACCrGCTGCGGTTACACGGTGCTCGCTCGTGGGTATTTGTGCGCCCCAGC^ 

GAAGIATCATCTTGCCAGACTCTGGGCTGACTCACTCAACCAATGTAGAC^ 

GTGTGACCCAACTGCGGCTCAACTCCCACCftAACCACAGGATCAGCCACAAA 
2 0 GTGGGAAGCTCTACCCTCAGTGTCATCGAGTTTGACTTTTTTT^^ 

ATTTGGCCAGAATTTATTG AG TAATGACAAGTGATC CTATCGA.GCCTT CGAACTACTGAAGCTGATTTTCAAGGCTACTT AAAAA^ 

AAAAAAACTGCAG CAAACATTAATGG ATTT CTGTTG TG TTTAAATT CT CT ACAGATTCT ATTGT AAAT ATTTT ATG AAGT AG AT CA 

T ATGT AT AT ATTTAT AT ATACG AG CACAT ACATT AG T AGCACAAT CTTTT AAAGTT ACGG CTCTTGC 1" ITTG AG AAACG AAGTG AG 

TTTTTCATGGTAAGAGGGGCGCTCTGTATGGAAGACACTCCTAAGTTTTCT 

2 5 GAAAGAAAAACAAACTGACAGGCAACTAACTGCTGGAACTTCCAA^ 

CTGCGCAT AAAT ATTTT AGGACGCCTATGGGAATTTTGCTCCGGGACCGTC 
CGAGGCTCTGTTGGGCATTTGACTTTGAATTCTGGTTGCC 
TGAGGCCGATCCGTGTAACTAGATCTCAGTCCGGGACTGATGGTAATTTAGTCT 
GCTTCTGAACTTGTATGTTGGTCTCCCGGGAGCAGCTTGCAGAAGATCCGAGCCCCTGTCGCCG 

3 0 GTCCTTACTCAATCGGCTTGTTGTGATGCX3TATCCCCGTAGATGCCAGCACGAGCCGCCGCT 

GCCCCGGCAAGATGAGCGAGGCGCTGCCGCTGGGCG CCCCGGATGGCGGCCCGG CCCTGGC CAGCAAGCTGAGGAG CGGCGACCGC 

AGCATGCTGGAGGTACTAGCTGACCACCCTGGCGAGCTAGTGCGCACCGACAGCCCCAACTTCCT 

CTGGCGCTGCAACAAGACCCTGCCCATCGCTTTCAAGGTGGTGGCACTGGGGG^ 

CAGGCAACG ATGAAAACT ACTCGGCAGAACTG AGAAATGCTACCGCGG CCATGAAG AACCAGGTAGCGAG ATTCAACGACCT CAGG 
35 TTTGTCGGGCGGAGCGGT AG AGG CAAGAGCTTCACTCTGACCATCACCGT CTTTACAAATCCGCCACAAGTrGCCACCTACCATAG 

AGCCATCAAAATCACAGTGGACXK3CCCCCGAGAACCCCGAAATGCCAGGCAGATCCAGCCIATCCCCACCGTGGTCCT 
CCTACCAGTACCTGGGATCCATCACCTCTTCCTCTGTCCACCCAGCGACACCCATT^ 
CTCTCTGCAGAACTTTCCAGTCGACTCTCAACGGCTCCGGACCTGACCGCCT 

CATCTCCGACCCGCGCATGCACTACCCAGGCGCCTTCACCTACTCGCCGCCCGTCACCTCGGGCATCGGCATCGGCAT^ 

4 0 TGAGCTCGGCCTCTCGCTACCACACCTACCTGCCGCCGCCCTACCCCGGCT 

TCGCCCTCCTACCATCTATACTACGGCGCCTCGGCCGGTTCCTACCAGTTCTCCATGGTGGGCGGAGAGAGATCGCCCCCGCGCAT 
CCTGCCGCCCTGCACCAACGCATCCACCGGCGCCGCGCTGCTGAACCCCAGCCT 
GCAGCCIATAGCAACTCGCCCACCAACATGCCCCCCGCGCGCCTGX^GGAGGCCGTGT^ 
CATCGAGGGACTGGGCCTGCCGTCCATGCACAGACCCCGCCAC^ 

4 5 GTGAATGCTTCTGATTTAGCAATGCTGTGAATAAAAGAAAGATTTTATACCCTTGACTTC^ 

AAGAGTGTGGAATGTTTTCGGTTCGGGGTGGGGAAGACGCAGCCCATCCTGTTTGGCATCT 

CCGCACCTTATCGATTG CAAAAATGCCTGTTTGCIATCTGGGTGGTCATTTATTTTT AAG TGTGT AT AGATTrGAGCTTGCTTTTTT 
TTCTTCCTTTGACCAACTCAAAGAAATAAAATTCCCTTCT 

TT ATTTGTGTTTGG TTTTTGTTTTTGTTTTT ATTTTT A CTTTGGGTCATTTTTT AACT ACTG T ATTT CCAC 

5 0 AAAGAAATCCCTAGATATTTATGTATCTTGATGTTTGAACATT^ 

ATTAAGAAAGATATCAAGTACTACATTTTTCTTTATAATAGCCAAAGTTAAATATTAT^ 
GATCGCrTGGTTAACTAGAAATATTGTTTACATTAAACTCCCTTTATGTTATTCAAACA^ 
TAATTGGATTGTAGACACTGAGGGTCACTCCAAGGTCAGAAGTACAAAATTTTCT 
TCCTTCCCTTCAAAAAGAGAGGCAAACTCTGTCCTGAAAGGGTTCAGAGAGGTGCCAAG^A 

5 5 GGCCTGGAGATATACTTGCCCCAAGGCCTCCTCATTCTGGCATGCTTTATCACAG^ 

TTACITACATAGTATTTACATAGACCCAAACCACTGAATGTGATTT^ 
AACCAAAACTTGAGGCTGTACCCCAAAGATCCAAATAGAAGAGTT 
GAGTGTACCCC!AAAAGTCTGAAGGAGCCXXjTTTCCTTCTCCC^ 
TGTGCAGGATGCTCCTCAGAAGCTGCCCCTTCACCAGCATCTTCTCCCACCAG^ 

6 0 GATAGATCCTGGTCTCTGCAGTC03CCATGGCTACGGTTCAGATGTGCATCGTGTCACTGTAAATG 

TGGAGGACTTGGTCAAAATCCAGTTGTTCTACAACGTATGAAGCCTAACCGCTGGTTCT 

MOUSE SEQUENCE - CODING 

ATGCGTATCCCCXSTAGATGCCAGCACGAGCCGCCGCTTCACGCCGCCTTCCACCGC^ 

6 5 GCCGCTGGGCGCCCCGGATGGCGGCCCGGCCCTGGCCAGCAAGCTGAGGAGCGGCGACCGCAGCAT^ 

ACCCTGGCGAGCTAGTGCGCACCGACAGCCCCAACTTCCTCTGCTCCGTGCT 
ATCGCTTTCAAGGTGGTGGCACTGGGGGACGTCCCGGATGGCACTCTGGTCACCGTCAT 

AG AG CTT CACTCTGACCAT CAC CGTCTTT ACAAAT C CGC C AC AAGTT GCCAC CT AC CAT AG AG CCATCAAAATCACAGTGGACGG C 
70 CCCCGAGAACCCCTGAAATGC(^GGCAGATCCAGCCATCCCCA 

CTCTTCCTCTGTCCACCCAGCGACACCCATTTCACCCGGCCGTGCCAGC^ 

TCTCAACGGCTCCGGACCTGACC^CCTTCGGCGACCCACGCC^GTTCCCTACTCTGCCGTCCATCTCCG^ 

CCAGGCGCCTTCACCTACTCGCCGCCCTGTCACGTCGGGC^T 

CTACCTGCCGCCGCCCTACCCCGGCTCATCACAGGCGCAGGCCGGCCC^ 

7 5 GCGCCTCC^CCGGTTCCTACCAGTTCTCCATGGTGGGCC^GAGAGATCGCCCCCGCGCATCCTGCCGC 
ACCGGCGCCGCGCTGCTCAACCCCAGCCTCCCCAGCCAGAGCGACGTGGTGGAGACC^
CATGCCCCCCGCGCGCCTGGAGGAGGCCGTGTGGCGGCCCTACTGA 

HUMAN SEQUENCE - GENOMIC 
5 AATCCACTGGTGCGTTTGTGTGGGAATGAGAGTGAATCTAGA 

AAAACACCCCTGCTCTCCG<5TGATCCACAATCAACAGTGAGATGACAAAGGAGTC^ 
CACTGATAGAACTTTTGGAGAGCATGGCrrrAGGTATGTGGCTTCCT 
CCCAAGCAGTGTCTCCCCAAAAGAATAAGGAGAGGGAAAGAGCATC^ 
TTGCCAAGCTTAGCACCTTAAA(IAAATTTAGCCCACAGTTCT 
1 0 AGCTTCATAAGGCCAAAAATCCAGGGGTTGGATAACCTGAGCCCTTACT 
CAGGTAGTTGGCAGAATGCAATTCCATGGGGCTGTAGCATAGT^ 

GCATCT AGAGACTG CTCTCT AGTTCTTGCAAGTGCCTGCTCTGTCTTCTAGCT CCCCAGCCTG AG 

AAAACACTGTGTTTGTAATGGTCrCCTGTGATTAGATTAGGGCTACCAGG 

TAGTAACCTTAATTACATATGCAAAAAAAAATCTCTTTTTTTCCATGTAACATAAC 
1 5 CCCAGGGATTAGGGTGGAAAGTTTGAGGGGTTCATTTTAT 

GGTAATTC^TCITTTATGGTGGAGGAGGAATGAGGGTGTAAAAAGAGGTCAAG^ 

AAACCTGAAGCCAGGGAAGGGGTGGTTCCACCACTTCCTCTCACCCGCCCCACCACCCAA 

TATATTATAATTCAAGTCACATAGGAGATGTGGAAATGATGATAAAGC^ 

GAAGGATCTGCCCCCCTCCAAGATGAGGACAGGAAGGGTAGCrCTTAATCCTGC^ 
2 0 AGATGCCCAGGAGAACACAJGCAGAACTATGGTGCAGGAGACrAATCT 

GAGATCTTTGTAAGAAATCTTCAGAGAGGTAATGAGAACTTGAACCAGAATGATGTCAATAGGAA 

GTTATCAACCAAAGAAATGAAAAAGAGATCTTGGCAACAGACT^ 

GGGAGAATGATGGTGCAGTGGGGAAAACZAAGAGGAAGACCTGGGTTGA 

ATGGOUUVCACTAATTTTCTCAGGTTGCTTCC^^ 

2 5 TCTCrCCTCCrGCTACCTCACCAGTTTCCCTCCCTCCTCCAACAAATTTCA 

CTCTGCTCCATTTCTGCTGCTGCCACTCAACTGAAGCTCCTT^ 
TATTTTCATATTTTCAGTCCITGTTTTCATGGCCCCTTTGTGGCCT 
TTCTTTTCTTTTCTTTTTATGAGACAGAGTCTCATCC^ 
CCGCCTCCTGGGTTCTAGTGATTCTCCTGCCTC!AGCCTCCTGAATAGC^ 

3 0 TTGTATTTTTAGTT^GAGACAGGGTTTTACCATGTTGGTCAGGCTGGTCTCGAACTCCT 

TCCCAAAGTGCTGGGATTACAGGCGTTAGCCACCATGCCCC<3CTGCATTTTCCT^ 

CTGGAAACAGTCTCTGAAACTAGCATAGAGTTTGATTGTACCCTAGATACAATCT 
GCTCATTATTTCTACATGAAACCTCTCCCCAGAATTCCT 

3 5 AGAGAGACATCACGATGTCTCTCTGTAAACCCGCTGTTTCTGGTCTTCTCTACTA 

CCCCCrCCCTCACCTCAATCCCCTCTTCCATCrCCATTGCTTAATCCAGATCCTCATG 
CCCACTCCTCATCTCTACTCCCTCCAACTCACCGAGCTGTTGTTGACA 

CCCTGTTTGGCTTTG ACAGATCAATTTTCCCGCATGTTGAAGTTCAC^ CTTGTG CTTCCAATGACCT ATCT AC 

GTCTATGTATTTTTGAACTGCTGCACTCCCTCCCCCACTCCCCAACCTGGTCTCAAACCTATG 
40 CACTGATGTGTTTCATAAACTCACGCCCTTATTC^TGATGCrTATTCAGCCCAGAGCACCATT^ 
TAAACCCCTCCTCTAGCTCTTTGAACCACCTAAAAATCTACCCAACCTCCA 
AGTCTTTCCTGAGCCCACCAGGGGCTGTTGGCCCCTCTTCCTCTTAT^ 

CACCAACAGCCATGAC ACTTCTTTACT CTCATCTGTTTGGTGTTTGTCATCAGTGAGCCTAG CACCTTG CCTGGGACAGAGGAAGC 
ATTCAAGGAATGCTGAAGGCAGGTGAATCAAATACCTTGTATCTAGG 

4 5 AAAAGACTAAAACGCCATTTTATATTATTATCAATATAAAAGCT 

ATGAC AGAAATG ACATTAAAATCTGACTCCT AATGATATGAGGTTTGCATGGTGAAGAATTT CCAAG CTG ATCTAG ACC 

CTGAGACGGGGGGGATAAAAGTGAAAGATGAGATCAGAGTTAATTTTITAGAACTCTAC 
TCCAAAAGGAACATAGCTTACCATGCATGTGGACCTATATGTAGATATGAATATGTTGTCT 
GTGTGTGTGTGTGTGTTTCTATGTAGGTCAAGAGTGATGTGAATTTGGGCAAGTTACCTGAACT 

5 0 CTGTAAAATGGGGGTAGCAATGATGTTCTTGTGAGGATTAAATCAGGCAAAGTATATAAAAGC^ 

AATTACATTCAATGGGGGATGTAGGGAAGAGAGAGACAGAGAGAGAGCATGCAAGCTAGC^ 
ATTGTTCACTCTCTTAAACACACGCAACCTCCCCTCTCTTGC 
TCAAACCTCCACAGGTGCTACTTCACCTGGGCAGC^GTTGCTC^CCCCCTAC 
TGTTCTCAAGGCTGCTGATACACAACAAACTAAGAAAGTTTTTAATAGTTACCAATC^ 

5 5 CTTGTTTTTGGGTTTAAAAAATCCTGGCATTTTCCTCAGAATTGCTGAACTATTA 

T CCT ACT C CAATTTCTT AG ACTCAG ACATGG AAAAG CTT CT CTGGG TCTCAAGAC^ CC AGGGTG CGTGG TTC CG CTGGGT 
GTGCTGTCTGGTCAGTCTTACCGTTGTGGATTCCACACCCCCACCCCACTGCCCCAACACCTAT^ 

CAGG ATACATTT ATTATTTCTTACCTT ATCATTCACAATT AGTG CAAATGATTAGTTCTCTGCCTTG CTCTTTAAAAAGAAAAGTC 
TTCATGGAGTCATTTT AT CTACTTGTGAACCACAATCTCGTTTTTTTTTGGATGCT CCTTTG AGACT 

6 0 AATATTTCATCATAGAGCAGTCCTTCAGAACCTCTTAT^ 

CAAATCTTGTTTTCCTGCCTATTTAAACAGCATCTAGAA^ 

AAAAAAAACAGCATACTAAATATTGTGGCATGGAGTTGAAAGCTTTAGGTCTGCAGCAT^ 
GAACAGTATTTCCTCTITGTGACTCAGTGCrGTTATCAATATACAAGG 

ATATT CAAAG GAAAG AAAAAG AGCTTCATG TG A CTTTG CTT AATGGGTTT ACCT AT AG TTGT AAATGTATACTGT AAGTG ATC CAT 

6 5 ATTTGGGAATTATCATGTAACAATTATTTCTAATAATAATTGATATAATCT 

TACCTGCCTAGCAGAGAGAAGGGAAGGAGAGTAAGGAGGGGAAAATGTT^ 

ATGTTTTTGCG TT AAGTT AGGGTTTGG TTTT AAT TTTCC CTTGAATTCTAATGTTC CT ACAC CTT C CAAAT AT ATTTT AAATCTTC 

ATTAAATAAATCACATITTAATCACACACGGAGCTTAAAAGAGGTGAT^ 

CATTGGTTGAAGGTACCTAAAACCACGTGCATAAC^jlAACAGTTr 

AACTTGCATTACTTTT ATTTT CCT ACTTT CTt3 CT AGAG ACG T CCACGGT ATG T 

TAAATTCCAAGTTATATTTTTAAATAAAGTACACACCTTCATAGATGTGTATATGATTTTAT^ 
TGTGATATAATTTAACCIAAATAATTAAGGCTGTTTCTGCTGATTC^ 
CCATATAAAATATTTCAATGAAAGCTTAAATCATACCCTC^GTTCTAC 

7 5 CTCTATACAACTCTCTAAAAACAAAACAGAGCTTGGCTTCCCTC^ 
GATTTTTATGAGAGTAAAATGTTTACTAAGCATTCCTAA^

GGTACACAATGGGTTTATGAAGTGAAAGTGAAATCATCACACATTAACCATAGCATCCCACAGAAAC^ 
GTAATGGAAGAACTAAGGGCTCTCCTCCCCCCTCTCATAGTAATTAGATTTGAAAAGTGAAGTAATCT 
AAAAAATGTTTAATTAAATTCCAATGTTCAAAACrGGATAGTGTGTATGGCAGGTGA 
5 ATTGTTTTCCATCCCTGTGACATCAATACAAACTG CAGCTATTTGGTTC CAAACCATAGCAAGATACATTCT ATTTTTTAAAATGT 

AAATGGTCATTTAAAATAGAAATAATTCTTTTAGACGTACTGCATTTTT 

T AATTTT AAGAGAATTG TTG AG ATTCT AATC CCATCTTT ATG CAT ATTTTGGGG AGG CTTTTG AAAGAG CAAAAAGGT AAAAG ATT 

TAGTTATCTTCTGAGACCACCrATAGTTCTTTACCTAAAGTTGTAGAAGA 

TTATTTTTCTTCCAACTAACCAAGCAGTACTTTTTAA^ 
1 0 TCTCCTCCCTCCCTCTCCCCGCAACAAAATTTGGAAA 

TTTTTTTAATTTTGCATTATGACAAAAGTAAATTCTAGC^ 

CTTTCTCCCrTAGGCAAAGGAAGGGGAATTCTGTATTGTCGCTAAAATGAGC 

CGCTGTGGCTGTCCTTGAAAGTGCAGGCTCCGTAAAATGCCCGGCCGACTAAGGGGTTC 

CATTGATTTTAGATCCTCAGTAAGGAAGGGCAGAGGTTGGTCTC 
1 5 CTG CGCCGCTCCGGGGAAACCCAAATGTCCCGCGATCCCATCGAGT CAAGG AGCAAAAGCTGTGATnTGCTATCAGCTTTTTGTCGG 

GCITCAAACTCGGGGTTTATCGGTTGAAACTCCT^ 

AGGCTGGGACCAACTTTCCTTACTCGATAAAGGAACTTGAC 

CGCGGGGTTTAGGGTTTTAGGCAAGCGGCCCGAGACCCAGA 

TATTTCCCCTGGAGTCAAAAGCCTACnTTTGGAGCCGCCACACCCTCGTCCT 
2 0 GTGGGGGTGGCCCCAGGAG CCCTGGGTGGGATCTCGGGGAAATG CATTGG AGACCAGGCGAAAAGGTTTCCCTTG CCCAGAACTTG 

CCrcrAGAGCGGGCCGCX3TCCGAACCCGCAGGCAAAGG03CTGCGGCCCCTGCGGGCGGGCGGG^ 

GGGACAGAGTTAGAAAGTGGACGTTTGGGCGGGGGGAGGGGGGGATTTACAACTGGAAT^ 

AAAATGTAGTAAGTTTTCTTTCCGAGACCCCAGGTCCATTAAAACAAAACTGAACT 

GGCGGAGAGGGTTGAGGCCGCGGGCCGCCCGCCTTAGCCCCCCGGAAGCCCGGGCTCCTGAA 

2 5 AGTCAAGCCCCAGGAACAGGGGCTTCGGGAAGAGACCCC^ 

CCGGCTGCGACTCTATATTGGAACCTCXXaAAACGCGCCTTGGCCGTTCC^ 
TCCTCGAACGCTAGTTCTTCCCGGGCGACCCGAAGGCCACGAGGCGGGCCGCACAGCCCCGC^ 
CCGGGGACTGCGCGCGGCGTCCAGGGAGGGCGGCTGCTGCCCCCTGGCGTCCGCCAGAGCCCGCGGCG 
CAGTCCGGGCCCGCCCGGGAAAGGCCTCCAOSCCMTCAACCCCGGCCCCCTC 
30 GTCCCTG CGCGGCAGCGGCCACAGCACCCTAGCCCTGGGGG AGGGAGGAGCGGG CGTCAGGGGCCCCGTGAGGTCGGCGAGGGACC 

CCCGCGCCGGGGAGCTTAATTGAGGGGCTCCGC^GGACCACAAAGCAGCCCGCGGTG 
TGCAGCCCCGGGGTGGATGAGAGGCCCCGCfGAGGTGCAGGGCCCGGCAGGGAGGGCCT 
TCCGCTGGGAGGGAGCCCCGCCGGGCGGGGTCGGAGTTGGCAGCGCACGCCGGACCtSGGGGCGGGGCCCT^ 
TGCCCGCAGCACCGCGGGGACCGGCGCAGGGGGAAGCGGCCGCCCGCACGCGAAGCCGGGGTGGGTTCCGC^ 

3 5 AGCGCACGGGGGCGGGGAGAGGAGCGGGGGCCGCTTCCGGGGAGCCGGAGGGGAGACGC 

GCGGGTGCCGGCGCTACAAGGCCCGGGAGCAAGCCGCGTGGCTGCTTTCAACTTTCCTTCCAGCC^ 

TGACTAGGAAACTCTTCGCTGGCCGCTGACGCACGCAGCAAGTGAGACGCGCGGAGGGATCGAGCCGCGGCCCGGGCGCAGC 

AAAAACACCCATTACAGAAACCGACCCAAGGGAAGACGCTGAGAGCAAACGGCACGCGCGGACGCCGTCCGGCCTCATGC 

TCAGCX5CCGCTTCGCCGCAGCTGCCAGCCGGACGGTGCAGGAGCGCGGAGTGGCTCCCCCGGGCCCCGCGGCCA 

4 0 GCCCTGGAAGCGGAGGCCTGGCCCTGGGCGGCCGCGCAGGATCTGGGGCCGTCCGC 

CGGCCACTGCGCCTGCCTGCAGCGGCTCGCGGGGCCGGGCCTCTCCGGGTGCGGCGC 
GCCTGCTTTCTTTTTCCAGGTCTCCGGACCGTGGCCACTCGCX3GAGCCGCACAAGCCT 

G CGGGCGGGGCCCCGGGCCAGCATGCAGGCTTGGATG CGGGGCGCGGGGGTGGCCTCCGACTCCCCG CCCGGGCTGCGTACAGT AG 
CGCGCGGCGCGGCGGCGGGGGCGCGCXKXSCCAATTCCA^ 

4 5 TGATTGGCCGCCGGCCGCCCCTATGCAAACGAGCTGAGGGAATGGAATT^ 

CA.GGCACTCTCATTCATAGAGCCAGCGGGCGCGGGCGGGACGGGCGCCCCGCGGCCGGACCCAGCCAGGGCA 

CCTGCGCCGCCAGGCACTTCTTTCCGGGGCTCCTAGGGACGCCAGAAGGAAGTCAACCTCT 

ACCTTCLTTTTTT'IGTTGTITTTTTTTGTTTTTCCCCT 

TTTCCTGGCTGCGAACTTTTCCCCAATTCTTTTCCTTTTACAACAGGGGGAGAAAGTGCT 
50 TGCGTGTGCGTGG CAGTGTG CGTGGCAGGATGTGCGTGCGTGTGTAACCCG AGCCGCCCGATCTGTTTCGATCTG CGCCGCGGAGC 

CCTCCCTCAAGGCCCGCTCCACCTGCTGCGGTTACGCGGCGCTCGTGGGTGTTCGTGCCTCGGAGCAGCTAAC 
GCGACGGTGGAGGAGTATCGTCTCXJCTGCTCCCCGAGTC^ 
GTGCGTGTTTGCATGTGTGTGACTCTGCGGCTGCTCA^ 

ACCCGAGTTCACAGATGTGGGAGAGCTGTAGAACCCTGAGTGTCATCGACTGGGCCTTCTTATGATTGTT 

5 5 AAGATCTCTGAAACGCTGAATTTTCTGCACTGAGCGTTTTGACAGAATTCATT 

TC AGC ACTGCTCCAACTACTGAAGCTG ATTTTCAAGG CTACTTAAAAAAATCTGCAG CGT ACATT AATGGATTTCTGTTGTGTTTA 
AATTCTCCACAGATTGTATTGTAAATATTTTATGAAGTAGAGCATATGTATATATTTATATATACGTGCACAT 
TACCTTTGGAAGTCTCAGCTCTTGCTTTTCGGGACTGJ^GCCAGTTTTG 
TTCTGTTGGGACTTTAGACAAAACTCACCTGCAAAAAACTGACAGGCATTAACT 

6 0 GTTTTACTCTTCGCATAAATATTTTAGGAAGTGTATGAGAATTTTC 

CTCTGCAAGCAAGATTCGTGGAAGATAGTCTCCACTTTTTAATGCACTAAGCAAT 
CCGATCCGCAGAACCAGAACGTTTTCCCCTCCTGGACTGTTAGTAACTTAGTCTCCCTCCTC 

ACC C C CCG CAGT AAT AAAGGCC C CTG AACGTGT ATGTTGG TCT C C CGGG AGCTG CTTG CTGAAG ATC CGCGC CCCTGTCG C CGTCT 
GGTAGGAGCTGTTTGCAGGGTCCTAACTCAATCGGCTTGTTGTGATGC^ 

6 5 CCG CCTTCCACCGCGCTGAGCCCAGGCAAGATGAGCG AGGCGTTG CCG CTGGGCG CCCCGGACGCCGGCGCTGCCCTGGCCGGCAA 

GCTGAGGAGCGGCGACCGCAGCATGGTGGAGGTGCTGGCCGACC&CCCGGGCG^ 
GCTCCGTGCTGCCTACGCACTGGCGCrGCAACAAGACCCTGCCCATCGCTT^ 

AGGCGGAGGCCGGCGAGGCCGGGGGAGATCCGCGAGGGCAGCGGCCAGGCGGGCGGGGCCGGATTCTG 

70 GCGGCCTCCCAGGAGGGCTGAAAACATCAGGGTGTCCCCAAAGATATCC^ 

AG CTTTG AT ACAAAGT AGGAAAAGTTATAACCCAGCCCGG CAG AG ACAGCCCGTCTGCTAACTTACTGCTGCCCCTACCGCTGGAT 
TAACACCCAAACCATTTTGTCrrCCTCAGATGAAAGGATCAGGAAATAA 

AAJ^CTAACTAGGGAAACTGTCAGGG A 'r i ' n 1 i i ' i ICCCCTTCTCTAACCGCCCTATAGGAATTAGCATGTCCAAAGTCCACAGCGGG 
CGTAGTTAAGTTTGCAAAAGTTTTCTGAAGAGTTCTrCTGAGCTTTTT 

7 5 TTCTTTTCCCGGAACCGTTCTGAATGCTATCATCAGCTCACCTAGATTAA^ 
AjGAGCCAGGGGTCCGAGCGGCCCGCTTT

CTCCCTTCACTTTTCACCCCTGATCATCCCCAAAGCTTTGAGCTGCC^^ 
CCCAGGCGTTTCTAAACAATCACCCTACTACACTAGCAGCAAGCAGTAGGAATT^ 
TGGTGGGTTTGAAAAGATGCGACTAAGTAAAATCAGTC 
5 AAGTGTAAAGATAAGCTACTAAAAAAAAAGTTTAAATAATTGTCC^ 

CCAAGCCTCAGCTGGGTCTTGATCTTATTACAACGACGAATATTCACTTT^ 
TACAGTTTCCACACTTAGATTGTGATTTCTAAAACAAAACGAAATTTC 
CACTTTTCTAAAAGCAGTTTTTTTTGTTTGTTTO 
TGCTTGACTCTTTTTTCCCTCCCTCCGTTTACTTAGATTTGGGT^ 
1 0 TAAATGGACTCAGCGATGACTCTTTAGGGCACCCGCATTG 

GAGCAGAGGCAAGACGGGCTGATAACCACCAAAGGCCATTAAAACCTG<Krrc 

TTCCTTTTGGGGGGGAAAT CAGGGTCTCTTCTGAACTCC CT CCTCTG CCA.GGGAACC CACACTCAAAT ATATTAAAAGACTAAATG 
ATTAATAAATAATCTTCAGGAATTGGAGCAATTCAGAGTCTCAAGCCAT^ 
ATTATACATATTGCTTTTATGAGCTGTCAGAATTCTTTCrrAGCA^ 
1 5 TGGCAATTCACTTAAAAACAATTAATTTGTTTTATTGAGTGATAATCTATAG^ 

AAGTTCAAGCAGATCCCCTGAGAACATTTAGGGGAAAGGAAAGGATAGCAGAGAAATC^ 
CCCAATCCTCATAAAAAGAGGCTTCTTCCTGGAGGAGACCTGCT 

GT AAACGTGAAGGGAT AAAATGTT AGGGAT AAAAG CAAG CAAAACAT CAAAACCTGC CT ACATCCCGGG CTCATTT AGTGTGTGTC 
CCTC^XTITCACTCTATCTGCCCACCCAACCACCT 
2 0 CCXSCAGGCAATCTCCCTAGGAAGTTAACCAGTAAATAATTATTATATTCCCT 
GGCCTGATGATATTGATGAAAAGATCACTG 

CrrGAACTAAAGAGAGATTTAAAATTATGAATTlTCTTTAAAATAGATTACAATAACCCT 

TCTCCCCCTACTCCrTCCCTCCCATGGAACACCTTTATTTTT^ 

AGTTTGCTGTGTGTTTTGCTGTACTTACAGAATTAGCCTAAAAAGCTG 

2 5 TTATAGCTTGTTCAGGCCCTGGTACACCAGAGTCaUUVGACGTAAACGTC 

TGGTAATATTCTTTTC^TAAAAAGGAAATAAGTAATATTTTTGTCGG 
AGCTGAATTCTGTGTTCCACTCCACAGCCTTCTTTGCAACACT 
TCAACCTAATTAGAATTGGTTTAAAATGCCAGTTTGGACCGTGAACAGG 
GAAGAGAGTTGATATTTGGCCCAACTTCCTGATACTTTAAAAACAGAGAAGTAA^ 

3 0 CCCTTCTTGCCTGTTTGGTCTGTGTTAAAATGTAAATGTTTGTGTTTCCTTACCTATTG 

TTTTTTCTGTTTGAAGTTAGATATTTTTACATTCTCTC 

AGTTTTG AATTAAAAAGAAACTTTTT AAACCTGG CTTAAGTATG CTTGTT AACAATTGGACGATATACAGTATT AATGAATTG CCC 

TGGATTTACTAGTTCCCCTTTACCCACCCGCCTCCACCAAAAAAAAACCCTCAGAGATC^ 

GGTGCCCAAATATGGTTGATTCTGCTGCATTTAAAAAAAAAGAGGAGTGGGAGTTAAACTTCT 

3 5 ACATCAACAGCCTTTTCITATTTAGCCTCCATGATAATTTTATTCT 

AATTACCAATGAGGATAGTATTGGAAAGATGAGAGAGGCACAGGCTTGGAGCACIX3 
TCAGAATACATTCCTGTAGTTATAAAAGAAGATTTATTTATTTTCACCTTTTTACAGATCO 
TGAAACCCACGTAGTGGAGGCAGTCTCTGTATACCTGCT 
GWATGAGGAAGTAAAAATATGTGTTCAAAGAGGGCATTTCAACACTGTGAG 

4 0 AAGGGGAAAATTTGTTGTGAGAGACTAAAAAGAGCAAAATAAAGCAAGCCCAT^ 

CAATAACCAAATAACTGATCAGGGAGATTTATTTCCGTGATATCTTTTTA 
TAAATGGCAACAAACACACTTATAATGGGAACTAAATAGCTAATTAGAGGGTGCT^ 
CTATATtMAAAACATGTCCAACAATTGGAATGAAAGGATTAAAAAG^ 
TAAAATAGAGGAAAATGCCTCCCTGCTTCTATGAGACTAGATTA^ 

4 5 AGTGAGTTAGTTCGTGTTCTGTGCCACAAGGACTGAAGCAGTTCTAATGAATAAAAACAATAGATAGTCATCT 

AAAGGGTGATGGCTAAGCAGATCATGAAAATGGGAATCTTGAGAAACATGACTTCGATAT 
TTTTACATATAGAAAAAGAGGCCTGAAAATATATACTACAAAAAATTATGTAG 
CAGAGGAAGGAGTGATAACACTAGATTAAGGAATGCTTTACTGTTACCAATTCAAAAA 
AATGAGTAATATAAATGTCACTACTGACTTTACCGTAAAGAATGTC^ 

5 0 TGAAGTTGTATGACTGCTTTGAAAAATAGAAACAAAGGAAATACAGCC^TAGTAAATGT 

G<3CCTAAATGATAGCCAAGAAATTGATGCTATTGTACAAAGTAT 
TrTCTTAATTACCTGAGTATAAAAACATTTATCTACCAGTTT 

AAATAAATTCCTGGTCAAGATCAGCTGGGCTAGCAGAAAACCTCAGGCATCTGTGA^ 

5 5 TTGTAATTAGTAAACGAAGGGTGCATTTCTGATTTTTTTTTCT^ 

ATGAGTAGGAGGTATCAAAGATGAACTGTAGAGGGACAGTTTAAGTTACTTAAGAATC^ 

GAAATTGGGTTTTTTTGTGTTTTTTTGTTTTGTTT^ 

GAGCAAGTAAAAGATGGGTTGGATTATAAACTTACAAACAC^ 

GTATTCAGG AG ACGG CAT AG CTACTCAAAATCCTTGATATCTTG CTATGGTTAGTCTTGTT CCAACTGTGCT ATGTG ACCT ACTAT 

6 0 GGCTTTATGAGGTAAATTTAGTATATGTGTCACTATTTGAAAATTTACATATAGTTA 

AACTGCAGTTTTTGGAGTTTVAAAATTGCAAACACTGCAATTACTTT^ 

ATAAlTTr r CTG T T UrnT GCTTAAGTTATCATTGCrAT^ 

AATAATCACTACACAAATGCCCTAAAAGTGTATGTATAACATCCCTGATGTCT 

CCCT AGGGG ATGTT CCAGATGGCACTCTGG T CACTG T G ATGG CTGGCAATG ATG AAAACT ACTCGG CTG AG CTGAGAAATG CT AC C 

6 5 GCACCCATGAAGAACCAGGTTGCAAGATTTAATGACCTCAGGTC 

TAACATTCTTATTGCTATGTC^GCTCTTAAAAATATTTATTTGAAGTTTAATAT^ 

GTTACTGAGTGGATTTAAGACTGATTTGGAACATGCAACGTAGGAATTGTC^ 

GCAAAAAATATAAAACTGACTTTAATGAAACTAGGATTATACAATTACAATGAGGATTCT 

7 0 TTAATGCHUIACTGTTGACGTATGATCTG 

TTT CT AGTG T ATG ATGTTG AAACATGGTTTCAAG CAAG CTTACCTTTTTAG AAAAAGACATG AAGAAAT AAIX3TT AGT ACCATT GA 
TGGCAGTATCrATCTTCCTGTTGTATTTCTTTTCAATGCCTTATTTAGGA^ 
AATATTAOAGATTGGATTATCCAAGAAATTGTTGC 
TCAAAGCATTAGAGCAGAAATAGGAATCTTGGCTTATTGTAAi"I''i'i'^ 
7 5 CATTGTAAGAACCAAATCATATTTTATAAATAGCAACAAGTAGTAAAAGT^ 
TTTTTGTAATATAGTACCGATTAAGCITACAAATAAGTTATT^
AATAGTTCTCGTATTGAAAAAAATGGAAGCACCAATCAAACAACATTTAAAATA^ 
AAAACAAGAAAATCTCAGTTCTCAGAGAATTTGATTAGTCAGAAAGGAGAAAGGGGTAAA 
TTTTGCTATCCTAGCTATCTGGAGCATTTTCATAATAGAATCATAGCTGGAGAAAATAA 
5 TATATTCATAGTACTATATTAATAATAGGGTAATGCAAAGAAAAT^ATATCCACTTTTTG 
AAAAGCATAAGTCATTAAATGGGTTCACTTGTTGTATTAAAAGTCCCCACCTTTCAGTA 

TGTGTGTCT AGAAAACATCTTAAAGGCTTCGGATGAGT AG CTGGCAGAGTGGGTGAGAAGCCAATAAAAAACAAAGGCCCCACAGC 
CTGGTCCATGCCCATCGGCCTAGCTAACTCTCTTAATTCAGTTCTTCTGG 
GAATTGCTTTTCTAGAGAAAGTGTTATCATCATACrATGCATCTG 
1 0 AACTATTTAGTGATCCCTGCCAGTG ACATTTTCCCAC CAATG CACAGTAAAC CAGTGAT AA 

TAAAAAAATGAGATTTTGAAAGCAGTGTG Cr ' rrC ' r ' riT CTG<SCATTG ATGGCTGATCTTG CATCCCT CATTGAC CATTTCTGTTTT 
GAGTGGGTGACTAATAGGTGCTTGTGTTTACTTGGGTAACAGTGAACTGTTTC 
GATTCGTGCCAGTATGCCCTTGAGTTTGCATAAACCC 

GGAATGAGAAAATTAATTGAAATGCCATAAACAGACAATATTTTGAACAGACrrCTCCTT 
1 5 ATACGGAAACCCTAATGGAACATTTGACCCAATTGTAATTACTGTGGATATGAATAT^ 

GTGTGTGTATATATATATATATATTTTTTTTTTTTTTT^ 

CTCTGTCATAATTTCTTTCCATAGTCATAAAATATAACTTGATTTrrC^ 

TTT CTT TC TT TCT T T T TCC TT CT T CT T C TTTTTTTTTTTTTTTC 

ATTTTAACAATTAGTCTTAATCITCCTCACTGAATCTTACCT^ 
2 0 TCTCATTTTCTGTTTTTGAAAATGTGCTTGAAAA 

GGCTGATCCAAACTTTCAGTGCGAATTTTGAATTCT 

CAAAATGAAAGGTAGAGAAAGAAGTTCTCTrrCTGTGACCAGCTGCCAGTCCA 
GTTGAACATGGAAGGGTTTTTCTCTCTTCTACAAAACGAATCTATCTGG

GGGTTCATTCCGT AACTGGGAAAACCTCCCTGTCCGG CnTTGATGGCCTCG CTGGCTACACTGGCCCTtSTAG AAGCCTGAACCTTC

2 5 TCTCTTTTCTTCCATTCCCTCCTTTTTCTC 

TTGATTCAGTATCTCCATGTAAATCGACCGCTCCTTTACAAAGGAGGAAGAGATAGCAGCTCCC 
CTGGGGACCGGGCTGAACTCCAGAGGTCATGTGAGCACAGAGGTTATTGTTCAGAATAA 
CTCCATGACACCCCCAAAAAATTGAAGTTCAGGGTCAAAAGGAATTCTCT^ 
TTACATCTGCAGTTGAGGCAAGTGCTACATCACCTGCCA 

3 0 TTAAGAAATTTTAGCAATTGGTATCCGGGCACAGTGGCTCACACCTGTAATCCCAGCAC^ 

CTGAGGTCAGGAGTTCAAGACCAGCCTGGCCIAACATGGTGACACC^ 

GGCGGGCGCCTGTAATCCCAGCTACTCAGGAGGATGAGGCAGGAGAATCGCTTGAACCCAGGAGGT 

GTTG CGCCACTGCACTCCAGTCTGGGCAACAAG AGCAAAACT CCATCTAAAAAAAG AAAAAAG AAAAAG AAATTTTAGCAATG AAA 
TCTGTGACTGCTTCATCCCAGTTGGAGTATTTAATCAGTCAGAGAATAACTGATCAA 

3 5 TCAGAACCAAAGAAGAAATCTCTCCAGCTGCTTTATCAGCTGCATTGTTGCTGCAGGAGGA 

CTAG T 'riTT l VlTAGAATATGATTTTTAATCT^ 

CAAAAAAAAAATGCACGTGTTCACTTACACACAGAATTACGTATATCATTTCAGTTTCTAGGCT 
TCTGGGGATCCACAGACCCTAGGCCAAAAATTTGTGAATTTGGAA 

ATATAGTAGACACCTGAAAATGAATAAATACATGTACAAATGAGCTAGACATTTTCTTGAAC 

4 0 G'rTTTTGTTATCGCCGGGGTCrGGTTTGTCTAATTAAGAAAGTAGATCrCCTTGAAACCACTG 

GG T ATG C CCACAT ATCAT CACGGGT AAAGGGG AGGGGG C CT ATTCTG T CATTGAAAGG ACAG TCG CTT CC CTCGT ACTGTGT C CAC 
ACCCAGCCCTGTAAGGCCAAGTGTTCTGTGGCCTGTCAGGATGGCGTT 
TGGCAGCTTTAATCTrGCCCACTTTCAAAATAGTATCCTTTCCCAACACATTTATA 
TGAATATCTTCCCAGACATAATATTTATCCTGTCACAATCTGGGTTATTGCATATTTGTGTAATGTTA 

4 5 ATTCTTAAATTATTCTACATTATTTGGGCTAAAGTGTGGGAGGTTGCCCTTCAGGTGAGTGGAAGA 

AGGTACATTTAGGGAAGGATAAACGGTACTGATATTCTAAACAAAGAGACATGGCTTGGCTTT^ 
TGATCTGAATGACTCGCAGTATTATCGTAGGATCCCAGTCCTTCTCTATTCTATTAACAGATTCATATAAT 
TCATCTGACCACCCCAGAGTTGTTT CACGGGT AATGTTCATGGTATAAACTGGC^GT 
GACATGAGTTATATTAAAATCAACrrCTCCCATTTTCCCACTTTCTTCTGT^ 

5 0 TGAAATCTGTATTCAGATCTCATTGCAAGCAAATATAGATGAATAAAGACCCAGTCT 

CAAATGCTGCTTTTCGACGGGCAGGTCAAAAGCnTGCCTCCATGATCTGATGTAGAATTC 

CGGTTTCACATTTGCCAATAAAAGGATAGCAACACTTTCCrAAAG 

TGATGGCAAGGACCAGCAGGAAAAGCTTTCTTGA 

TCTTGCTGATTTTGTGAAGTGTCAGGCATTGCTGATGGCGTCACACAGC 

5 5 ATAGTTTGATGGGCAGAAGCTGTACTTCTCTCCACCCACCCTCAACCCCTTCCT^ 

CTTTTTGGTCGGGACAGAAGACTATCAATCAGAAAGTGCCTGCAAGAGAAATAATGGTTC 
GCAACAAGCATTATAGACATAAACAGTGATCCAGCCAACGGCTGCAACCCTGCTCTGGTTCT 

T T ' rT T T T TTA AGGAAAAATTACTTGTTGACAAGGA^ 

6 0 TTCCCTCrTGTTGCCTAGGCTGGAGTGCAGTGGTGTGATCTCAGCTCACTG 

CCTCGGCCTCCCAAGTAGCTGGGGATTAGAGGCATCCACCACC^ 
CATGTTGACTGGGCTGGTCTTGAACTCCTGATCTCGGGTGGTCCACCrGCCTTGGCCT 
CCACCGTACCTGGCCGGTGTACTGTCITAAGGCTGTTTTCTTTCrTC 
TTATGAATAATCATATACTGTCAATATGGGGAGAGTGTATGCTTTTGAAGGCTT 

6 5 TGTGTGTGTGCGTGTGTGCAAGGGGGAGAGAGAGAGAGAGAGAGAGAATGAGAATATCCATA 

CAAATT ATCTTTGAGG ACT AAATT<^TGCTAATAACAT CTT CT CATCTAAAGC CCAAAT AAAG CAATT AATGG AAATT AAT AAGTG 

AGC^GTGTGGATTTAATGAGCTGCCTATGAATACTATGACATGC^ 

CAGCACTTCAACGTCTCTATTTATAAACATGCATTCCK^ 

TACATTCAACGGAGATGTATTTTGATCrCTGTTGAAGAC 

7 0 AGTACATTGCCCAGTCCCTGTTAGTCTATATTGGGCAGCATGTCCCATTCAGGA^ 

GAAGGTGTCAGGCAGTTGAGGTCCATAGAGGAATCCCAGCAACTTCCT^ 

TGCGTTT CCT CTGATTTCCTT AGTGAGG AAG CATTCATTTAGGAGCCATTTAAACTGTGTTTTTCTC^ 
TTCCTGrTTATTTCTCAACTTAGATTAATTTGGGCTTTTACCT^ 
TGCAGAACTACCCCACCTCCCCCATCACCTATCCCAAGGACGl^CCTGTC^ 
7 5 TGTATGGCCCTAATTCLAGTCACCGTATGCAAACCAAGGTAAATC^ 
CTCATCAGATCTCTGGAGGTGGAAACACTCCAGTTTGCT
TTGAAACTATTGTATAGAGGATAGTCAGTATTAATC!ATCTATCTT^ 
TCATAATAAGGCTTTGTCTGCCTCTTTAAATGGTGTGTCTCTAGTACCTAGGACAA^ 
CACTGGGC^TTTATTATCAGTATCTTGTCTCCCTGCTTATCA^ 
5 TTGACTCCCAGAAGACATTTGGCAATATCTGCAGACATTTTTGGTGGTCGTC 
GTAGAGGCCAGGGATCrTACTAAACGTCCTACCATGAACCrcACAGCCT 
AGAAACTCATCTAGCTCCAGTCATGCGTGATCTCGCC 
CATTCTGCCTCCCTACAAACTTATCAAAATGTATGAAAAACTTATTTTAA^ 

AATAAAAAATAGAGAACATCACTGAATATTAAAAG'rTi > 'l''rTT rAAAGGGAAAACAGCAGTGTTCTGTATGCATAGAATTGCACCAT 
1 0 CCCCCTGGCGCCACTGACCCTGCCTGCTATCCTGGGTCAGGGATGTGCAGT^ 

ACAGCAGGGCAGCTTCTCCGAGAAGGCTCCCCACTGTAGCCACCCCCTCATGAATGTCACACA 

AAACT AGGAAG C C CT CTGCCAG AGGCTCACTG CTTTT CC CTGAATACTT CAGCAGAG AG T CCATGG AG CCACTGATCT AGG AATGG 
GAATTGATGTCAGTGGACAGTGTCCACTCATATTAAAAAAGTGATAAAGAGCCGGGCATC 
TTTAGGAGGCCAAGGCAGGAGGATCATCTGAGGTCAGGAGTTTGAGACCAGCTTG^ 
1 5 AATACAAAAATTAGCCAGGAGTGGTGGCACATACCTGTAATCCCAGCT 

GAGGCGGAGGTTGCAGTGAGCCGAGATTATGCCACTGCGCTCCAGCCTGGGTGACAGAGCAAGACT 

AATGAAGGCACTACGTGGCAGGAGGGGAGACAGGACCAAGGTTTGTG^ 
CAGACAGATGGCTGGGTTCCTCTGGAGCTTTATCCCACCTGCCCCAGCATGTCAGTTT^ 
2 0 CAGGGCACGATGATTTGAAAACAAAAATCCGTAGGGAGACTCCCCTAGGTGTTCCCAGTGCATCAAGG^ 
GAGTTTCTGGAATAGGTCAGGCTGTGTCTGCTTTGTTTGTTTGTCCGT^ 
ATTTTCCTTCTCCGTGTATACATCTGCTTCCTTTAAGCTCCCT^ 
GTCATGTAATCTGTGACTAGTAAGCATTAACTATCCAAATGGGAGTTCGTGGACGGTC 
GAGTTGCTGTAGGTGTGCTCTAGAGAGAAGGTCACTGGGGAAAAAGCTTGAAAACAGCTGTGGC^ 

2 5 GGAGACCCAAGCACTGTGGAAAGG AAGATGTGCTCAGGAGC CCCGAGGAAGTGGG CTCACAGG AATTTTCCAAAATCAAGAGATCC 

AGCCTTTAACTAGAAAGCTCCAGGTGGTATCATGAGTGACATTAGATGATGCCTGATACATGGTCT 
GACTGTTTGTTTTTTTGTTTTTGTTTTTTTTTGAGATGGAGTCT 
CACTGCAGCCTCCGCCTCCTGTGTTCAAGCAATCCTCCTGCCTCAGCCTCCC^ 
CTGGATAATTTTTGTATTTTTCATAGAGACGGGGTTTCATCACGCTGGCCA 

3 0 GCCTCGK3CCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGTC 

GTGGTCAGCACGTGGACTTTACTGGCTTCCCCACTGTGTTTCAACTTGGGTGTTGCTG 

AC AACCAG AAATCTCTGAAGC CATTG CGTCATGTCCCCTGGGGG CAAATCGCCCCTGGTTGAGAACCACTGGCTTGGAG CTATCTG 
TCCTTGAACAAGCTACITGACCTGTCTAAGCCTGATTTTGCCCTTGTCT 

3 5 AGACAAGACATAGGTTAGGAGTTGTTACTTTTTGCAGTGACACCAACACCATGG 

AAACAACAGCAAACAGCAACAACAACCAGAGGTCTG CTTTTTTTGTTTGTCT AAATGGGTGAG CTTCCAAGGAGGAAACACAATCC 

TTATCCTTATTTCAGGGCCCCATACGCTTCTGTGGTCGTGAAAGGGTTATATGCX3GCAGGCCCTGAGATTC 

GAAACGAAATTAAATATGTACTGTAATTTGAAGCACTGCTTTTCATC 

TGTCCAAACATT AATGG CATGCAAATATTTATGATTCATATGTGCCCCTTTTAATTCCCAATCGTATTTTAACAGTA^ 
40 TTTTAATAATGAAGCAGAATAATCCCTTTGTC 

ATCATTGCTGCCTCCAC CCAAGTTGAAGGGGTTTTGTAG CAGTGC CACGGCAGCTCX3CCTTGGACT ACTTGAGGACTTCCGG ATGT 

TCCGCTGGAAGCCATTCTTAAGGTTGTGTCTCAGCATTTTCAT 

CGTAT AAACT CAGCACTGGTTCATCTGGCTGTCACTTGGAAGGGTTCTTACTACACT 

ACTCTAGATTACACCGAACCCATTGTGTCCTATTTAATATTCCITTGATTCGTAATGATGAC^ 

4 5 CGATGATTCTGTGAGTCCTAGAAAAGTAGCATAATGTAATATGAGAAGAGGGCACACCTGTAGGACTCAGATGTGCCA 

CTCCCCATCTGTGTTCAATAGGTTTAAGAGTGCCTTTCATCTATGACTTATATCATCTATGT^ 

TGT ATGTAACTAT ACTTAGGAAGAAG AG CTAGGGCCCCT CTTGATGATGTTCTTCCri^CT ACAGCGAGAATCCCTGAGCGAGGAGG 

GGAGAGAGCAAGCCAGAGGCAGCATGGGTCTTGCTGAATTTGAAAAGTTGGGAAT 

GCAAGGAATTTATATTCCTTGGTCCTGCTAAGGGCCTACTAAAATGAGAGTGC^ 

5 0 CAAGTAAAAGCCTATGTAATAATAATATAATTACAAACTTTTATTGGGCATTGACCCTGTGATGGAA^ 

TTTATTAGAACTCTCCCTCTGCCCCACCCCCCAAGGTCATTTACTAAGGGCAT 

GTGCAC CT ACTGTGTG CCAGG AACTGGGCTAG CATGAGAAAT ACG ACGG TCAG CC^UUVGAGAACCCAC AGCCTTGGGGG CTAG T AA 

GGAACATGGCAATTAAGTAGGCCATCCCTGTTGTGAGTAGATAACAC^ 

CAGAACCTGGTGAAGGTTCAGAGGGTCTCCCTGGACCACTTGTCATTC^ 

5 5 GAGG CATG AAG AAT CATGT C CCAG GAG AGAAAG CAGAGTGAGG AAGG AG C AC CTGGC CTGTTGGGAGCTGAGAGATG CTC CAGTGG 

CGGTGGTTGGGGAGTGATGGCGGGGAGCCTTACAGAGAGATAGGGCGGGGATTGCCTTAAC^ 
CCTAATGAAATGGAGAACCACTTATAACTAACTCCAGATACAAATTGTGGAA^ 

AC ATTTGGAGGGTGCGTGGGG AG CTT AG AGACCCATCCAGAGACTGAAAGCTCTTCCTTTCTCCTTT CAAG CAGCG CCAGGCATG TG 
TATATCCAGCTTGCAAGG ACCCCATTTGGTAGATGG C ACTGT ACCTAGAAGTCAGCTCCCTCCTCAAGCTTGCCTCAAG CTTCCCT 

6 0 CTCTTCCCTCTCTACCTCCACACrCACCTGCCITTCTTCCTACCr 

CCCCCCAAGCCTTCCTTCCCTGAAGAATGTCCATGCCTCAGGCCTCC 
GCACCCGGCTAGGGGCACTATCGGAATCAGGAGGAGGCCAG^ 
AAACACACAGCCTGATGGATGAGTGAAGACTGGTAAATCTGGTGGCT 
TCTCCAGGGGGCTGCGAATTCAGGGTTTTGGGGTGTTTTGGT^ 

6 5 AAAAGGAAAAGGAAG ATTAGAAAGGGAAAATG AAAT CT CAGG AAAGG AAATG CAAG ATTG CATG AG AACAT CATGGTGTTTTG TGG 

GATAGGTGATATTCTGCCAAATAAAGAGGCCCGCCCCTCCACTCCTTTTGCAAACCT 
AGCATACCTGTGCCTTGGAAAAAGCAAGCTATGCCTTCCAAGTGAAATTCG 
GCAGAGTGATTCATTGATTrTTAGAGACAGGGTCTTAOTCrGTCACCCAGGCrrGG 
CCTCCGCCTCCTGGGTTCAAGTGATTCITGTGCCTCAGCCTCCCAAGTAGCTGGGACT^ 

7 0 TTTATTTTATTTTATTTTTTATTTTTATTTTA 

GACTG^TCTTGAACTCCTGAGCTCTAGTGATCCTCCCGTCTTGGCCT 

GGCCTCATTTATCAAACTTTTCTTTATTGCCTCAGAATAGAAGATACAAAGATGAATAAAAT^ 

ATTTTCATAG AACAT CTCCAGAGAAAATG CTTTG CATGT CTTCGGTTTT CTC CAAGT AAAACAG CT ATTTTTCAAACAAT C CAT AA 
TTCCAAACAAATGCGATTTCCATGGAAAACTGAGTTCT^ 
7 5 TGCCAGTAGCTTAGGCCTCAAGTCTGCATTTTGCATGTGTGATC 
GCAC^CAAGTATATACAGTACGCCGACATCTTAATTCAATT 

TCCCGTrCCAGAAATCCCCAAATAAATTAATTATACAACTCCACTGTCAAAAATAGTGGCAAATC 

G AG C ATAAGCATG AATTTTGG CTGTT AGGTTGTT CCAGTGG AAAAG TG AG AAATCAAAATT AAAAATACTTCAT AAAAAAT ACT C C 
AAATGCAGACT ACT CCTGGAATCCG AGGGTACX3TT AATTTTT AT ACTT AG TT ATC CGCAATT AAAATT AGACCCAAAG CCATCT AT 
5 TGTAAAAGGAGAAATCTCCCTGCAACCTTTCAGGATCTTGTGGGA 
CTCTTCTTGGTITTTAAAAGCTGOGA 

GCTTTCCTGCAGACAAACACAAGAGTX5AAAAATGTTGCGATTC 
GTGTCTGCAGCCTTCCTTTCCGTGACAAAGGGCAAGTTTC^ 
CAGACCTACACTAAGGAGAGTGACAGGAACAGAGTCTCCTC 
1 0 AGGAGGCCGAGTCACAACAGGAGTCGGGGCTCGCCC^ 

CACTTTTAAATATGTTCTTCACACCTAGGCAAGAAATGACCCCGAG 

GCT CTTTGGGGAG CAGTGTG CACATTTGAAG CAATCCTGAGGTTT AG C CACTTCC CAGATG TGG TGAGGAAGTGG AGAGTGC G CTG 

GGCCATGCGGAACTGCAGGTTCCAGCCGTGGCrTTAT 

CAAGACTAATOlAGCCTAGACCrGGGGCCTCCCCGCTGACTGGCCCTG<^ 
1 5 CCCCTTGACCTTCITCCCAACTAAATAAAGCAGT^ 

ACATGTTGCAGTGAGGAGCACrGGCCTGGCTTGTTTACATAGCA 

GCGGGGCAGCAGTGAACTTGAACTTGAAGTTCAAGTTGCGTT^ 

CACACACACACACACACACACACACACACACACCCACTlAAACCCCT 

l ^ LXTlTACXXjGATCnXjTCACAGCTCTGGTCT 
2 0 C CTTTCCCTGTGCTCTGCTTGT ATATTCAG CATTCCATTTAACCTGCAGAACAATGCTACG AGATGGAATG ACTGTTAT^ 

CTTTTACAGATGACAAAGCTGAGGCCCCAGAGAGGTCAAGTA^ 

TGAACCTAAGTAGGTTGCCTTGAGAGTCCACACTGAATGAATCACC^ 

CrCTCrCACTGCTTCCAAGCCTACATCAAGACAAACCTCATCrGCCCTGTTC 

TCGTTrGGTGCTTCTAATGGTCTTCTAATGGTCCTCCT^ 

2 5 CAGTGAGCAGGAGGAAGGACTTGGTGTTTACCCCTTGTTTACCCCTTT 

TGGTG^CCCCCTCTGGAACCAAGAGCTGCCGACTTCCAGCAGCACACACAGGATCAGT^ 
CCTCTGCTGGCCTCACCAGACCAAGGCCAGTGAGTGCrrTCAGGGAGCCTTGGATC 
TCGGCAGCAGCCCCATCCTTCCACCCCTGCACTGGGTCCTGAAAAGCCCATI^^ 
CTTGCTCTCCTGTGATTTCTCACTATTCCGGCTGCAGCT 

3 0 TTAAAATACCAGGCCTATAGGTGATTTCAATGAGGGAAT^ 

ATG CTGGCCAAGAAGGGCTGTGTCCCAG CCGCCATGGGGTGGGGCCACAG AGAGGGO C 
TCCTGTCAGGTGCGCCCGTTGCTGACTGCAGCCC 

CAGTTTTCGTGCTGTGGAAAAATGTCACCTACACCTGTGAAGAAGTCATCACTGTGTCT 
CCCAAAGCTGTAAACAAGAGCCCITTGTGAGGGAGAACCACATO 

3 5 ATCTCTTGTGACTGTGCTTAAAATGAAGGGCATGATTGGGAGGCCAAGGTGGGCGG^^ 

GGCTG ACAT AG TG AAACC C CATCTCT ACTAAAAAT ACAAAAAATT AG CTGGG TGTGGTGGT ATG CAC ATG T AATCC CAGCT ACT CG 
GGAGGCTGAGGCAGGAGAAT CGCTTG AACCCGGGAGGCAGAGGTTGCAGTGAGCCG AGATCACG CCATTG CACTCCAGCCCAGGAG 
ACAGTGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAA^ 

CACAAATTATCACTTTGATTAAGTTTTACTCCTGA1TATATAGGTTAGTCTGTGGTTTAC 

4 0 TGCCAGAGGCCCAAACGCAG CTCAGT AAG AAAATG CTTTTGAGCTATAACCCAGGTTG AGTACCATTGGT ACATTAGAATCACAG A 

GGTGGAGATTTATGACCTCAGGGATAATAACCGG 

TGTGTGACTTTGGGAAAATTACTTAACCTTTCTGGGCCTTAGCITTGCTACCTATTC^ 
TTTCATGAGATTGGTGTGAGGACCAAATGAAATAGTATATGGGAAGGTGTTTAAAAAGTT 

4 5 CAGTATTATGAATGCAACCATTCTTTGTTGTCATTTGGGTAGT^ 

TGTTCCAGAGGGTAAAACACACGCGTGCCTGTAGAGCAGTTGTCACTGGTAGAGCCATGATGGGAG 

GT ACTGAGTT AAATAGTG TT CT CCAAAAAG TG TATGTCCACCT AG AACCTT AAAAT ATGG CTT AAGGTCTTT ACAGGTG TAAT CG A 
GAT AAAGTCATACTGG ACITGGGTGGGCCATAAGTC CAATG ACTGATGT C CTTCTAAG AAGAGGG AGGGGACAC CCAGT AGGAAG A 
CAGTCATGTGAAGATGGAGGCAGAGGCTAGAGTGCTGCAGCTGCAAGTCAGGAAACX3 

5 0 AGAAAGAG^CAAGGATCCTCCCTAGAGCCCTCAGGGGGAGTGTGGCCCTGCTGCCACCITG 

TGTGAAAGTCTTAATTTCTATTGTTGTAAGCCACCCAGTTTGTGATAATTTATTACAGCA 
TATGAAGTCACTGAGGTGCTAGAGTTGTAGGCAACAGAAGAATCACACGTGGA 

TOC CAGCTGC^TTCTTGG AAGATGAAATCAGAGAGTTGAAAGGGACATTG CATGCATC CCATGAACCCCCCTCATTTTGCTGATG A 
GGAAACTGAGATCTGCAGAGGTAAAATCX3TTTGTATTGCATCCCCAAACTTGTAACTCT 

5 5 CTCTCTCTCTCTCTCATACACCCACATGCACCCCTAGACCTACTCAGGACTGGGACAATGGCATTATCTATT 

GGACTTTT'TTTTTTTTAATTTTAAATACCCCCTCTCTCCT 

ACATGGGTATATAGAAAAGGGTCCCCACCTTCrGGGAGACTAGGTCTACACAGCCGCAAACTG 

ATCAAAGGTGAAGGATGATTTATGAGGCAGCCCAAACAGCCATAAAGCAAAATGTAGTC^ 

TGACTGAGTGACTGCCCGTTGCCAGATGCTGAAGGGCAGGAAGACTCGGGTGAAGGGTGCAGCAAGGAG 

6 0 CX3CTGGAGX3AAGGAAGACCTCAGCATCTCAATGC 

AGGAGAGGGGGTGAGTGAGAAATTACTTGGGGCAGGGTCTTATTAGTGAGAAGATCCTAA^ 

TGAT ACAGGTGATAACCTTCATGGGTTCTTGACAG AGAGGG AACAGG CTG ACCATGAGGCTTGTAGG ACAATCTGTT AGTC AGCT C 
TGGAACATCTGGATGTGGCTGCTCCCCGTCTGTGGCTGTCATTGCTCCTGGATCTGTTCT 

ATGGGCAGTTGCCCCCAGGACCTGCCTCCCTCTGCGGCAGCATCT^ C 
TGTCCTAAAACTGGCAAATCCCCAAAGGCCAACAAGACTCGTCT 

GGAGGCAGAGTCTCCTGGCATCCCTGATGCTCTGCTCCATGGAAGTCACTCCCCAGAAGCATCCTCAC 

CAGCTGAGACITTAGTTATAAAGGCACAATGGGCAGAGCAGCCTTCA 

ACTTTAATACAGATAGAATCCACATCTTTGGGGAGTGTTCGG^ 

7 0 GATTATATTTTCATTGAGCATTTTTCTGTGGCCT 

ATTAAGACATGTGTGACAATTGAAAATATTTTCCTTTCATCTGT^ 
GTTGATTAAAGAACTACTGTATTGATGTAGGAAGATTGAACATCAGACTTAAT 
AACAGAGAAGAGAAATAAAGCAGACAACATACACACACACAC^ 
AGATAACTGTTTTAACAGGGTTACTCACGGTAAATCATTTAATCTCCCC^ 
7 5 TCTTCCTGGTAAGGATACTGAGGAACATAGAGGTGTCrrAGTGACTTGC 
TACCAACAAGAAATCTTTTTAGATTAGTAAATCTATGC 
AAATTTTGTGCATTTTTTATTACTAGTACITATTGA^ 
TTTATTCTCACTGATCCCATGAGTACCTCTAATAACATCTC 
AAATATAGACAGAATGATTCTAATTTAGCCTGTTTTCAAACTTAGAA^ 
5 GAGAAAGATGGATATGGTTTCTTCTACTCTTATTTGCCCTGAATTATA 

AGAAGTGGAGAGCTGCCAATCTGACTCTATGGTTTCCGGAAATCCTGACTTAACAGA 
ATTGTTTATTGATGATTACATTAAAGCCATATAACTATC 
GAAGCAAAGCOVAAATTCCGGGAGTGTTGTCAAAACTGGTAACTTGTGCT 
CAACRGATATGTTCAGGCCACCAACCTCATTCTGTTTTGTTCTCT 
1 0 GTCTTCACAAACCCACCGCAAGTCGCCACCTACCACAGAGCCATCAAAATCA 

CATCCACTTGGGGCTGGTACACCCTCCAGGCTGGTACACCCTCCAGGCTGGTATACT 

CAACCTTTC CCCCAGGGGGATGTOUX3CCCCCGT AAAGCCAAAAAGTTGCATTGTCAAT ACTCT CTACTCAAAGCTTGTTTCTG 
TGCACTGAGCTCTITTTGAAGCATCCTTTTAGAGGGCAAAAGGA 
CATGAAACTAAGGACTGCTTCCTCAATATACAGTAAAGTTCAACAGGATCAGAAACA 
1 5 GAGGCAGCCCAAAG AGGTCTG AGGX3TCAGAGTGGACCACAAGGGAGTAATTACCGT CTATGCTXKX! CTTTTATTAATTCATGAGGG 

CCAG AGATGAAAACT ATGAAAAT G ACC CT TT T C TTT CT ATGG CACAG ATCAAAATCTT ATT ACAGG C ATTTTT AAG AT AG AGAGTG 
GGAGGGTAGAAGAATGTCTGGGGAAAACCAGTAAATATAGAAAGATAGTACCAAAAAATAACTCCTGTGAGGG 
AATGAGTGAGGTTGTTTAGTCTGCAGAAAAGATGTCTTC^ 

TTTCCAGAACCACACCCTTCCCCAGGATCTGAGGCTGGGTTACAGTTCCAGGTCT^ 
2 0 CAGATCTCTCCAAACCGTATCAGGTCCCTGGTGGAAAATTAATA 

CT ACTGTC CC CAAAAG C CAAG ATG CAGGG AGGT CAGG TTGT AAAATGAAG TGGTT AAGAATTTGGG CTTT ATGGGCAAG C CAGTGG 
AATTCTTAAGCCTTCAGTGAGCTACAGAGCCTTGCTGAA 

TATAAGATCAAGCCAAATGATGCTGTGTGCCTGATGCAGAGAATGAAGGCAGGAAATGTTA^ 
TTTCATCCCTGGCTACTGCTCCAGTCCCCTCTAGATGACCTTCCCTGCATGGTC 

2 5 CAGGCTATCCTGTGATCACX5GCAGCTTTTATGGCAGCCTTCCCTG 

GAACCATTCCCCAAGCTGCATCAACTTTCCCAAAGTCCTTGAGTGGATGGAGCTCTGC 
GCCACCAGGATATGGTGTTCCTCGTCTGCTTTTTCCCTTCCTTCCATTTGCTGTGCT 
TTTCCCCAGAGGGCCCTCTCAGCTTTITCTGATCGAGACTTTCTGGCCAT 
CCCTGATTCTTCTTTATGAGTGAAAAGCAGGCCATAG^ 

3 0 GGTTrCCAGAGTAGGAATGTCAACCTGGACATTCCTACGTTAATGTAACCCTGCATTA 

GTTCTCCACAGAGGACCAAAGAGGAAGCAGGCTTGAATTATGGCAAGGGGCCTGGCTGGGAGGGAGGCA 
AATCGACTTTGAGGGTGACTAAACTTAGAGAAATGCTCCTTCAGGAGGCT 
GTCTCTGGGACAGTTGAAATGCCGACCAGCCTGAAGAAGCCGAAGC 
CATGCATAAAGAAGGCTGGGTTTTGCCTCACrGCCAGCATTTCCTGTAACA 

3 5 ACAAATGGTGGCGACTCTGACTTTTAAGAATTGGCAGTCAGAGGGGAAAAA 

TACAGTTTACTCTGCCCCTGCTGTTTTACAACTTAAGAAAACAGT^ 
TGCGCCAAAGAAGCATCCATTTTTACACAGTCTTCCCTGCCAAATC 
TATAAGTATGATATATAGAGGTGCAATAGCCCCAGGTAGAAAGTCTCACATTTTACAAG 
GCAGACTCCCAGGTTGACTTGGGTAATGCTGACTTTTCTATT^ 

4 0 CTGGTTTCTTATGGTTTGAACGTCTTCATTCAGGGTGGGAACTTTCTATAGT 

AAAGAGAGGACACTGCATGTAAATCAGGATGCCCAGATGTTGCCAGATCTCTGGATGAATAAGACCTCCCGAGTCCCCT 
TATGGTGCAATGAATTCTGGGAAATCCTGGCAAAATGGCCCTGGCATTGCCAAGGTCAA^ 

TTTCAT AAAATCAAAGAC AATG AG AG CT AGG AGGGG CCTG AGAAG AG C CTTT AGG AT CATCAC C CCAT AGGT AAGGG AAAAT AGC C 
CCAGAAAAATGAAACGGTTTGCCTAGCTTTAAAAGAGATAGTTGATGATGAAGCGTTAACAGAATC 

4 5 TCTGTGCTGTCCTTCATTTATCTACCTGTCCATTCATCTGTTCATCTCTCTATCCATACCATACTGTCT 

TTCTT AT AG ACTCTTTG ACCTG G CCT CG GT ATC CT CACATTT AAAATG AGGAGTTGG CTT AT ATGATTGG CTGGCAT ATTTGG CTC 

ATCTCGAAGACTGTCTATTATAGAAGGG CAAATGGTAGTGCCTCATTG CAGXH'GTGG AGCCTTrTGACTTC AGGTATCCGTGCAAGC 

TTAACTAAATAAAGCAGTTATCTGGCATTTGGAGGGACAAACTGCCATTCAACCTGCCATTCCGCTGACTAAGT^ 

ATG AGAAG CTGTG CCACAGCTCGTCTGTTTTTCATGCCG ACGGGT AAACTTGATGCATCC CTCC CCTCITCCAGGGTTTTCAGTGG 

5 0 TGAAGCTGTAGCCAAAAGTATACGCCAAAGGGAATTTAAAGTGTTTTTCGTTTCAGA 

CTGCCCCTTAGAAAATCAAAACTTGTTTTTCTCATATTCCATAGGATT^ 

GTTGAAAAATTAACTTT ATCATGGAG CAAGAGCACCATCAAAAGGTATCCTCTG CCrGGAATGGGTTCTTACCAGCACCAAGGTGG 
GCTCTGTTTTCGGTTAGC ACGT AGAG CACTT AGTAGCTTCTTG ATAT ACTG AG AGTGTCT CCTCCTT AAGCAC^ CG 
AAAGGATTTTTCAAGAAAAGGATATTTGAGAGTTTTTTC 

5 5 GGCTGGGGTGCAGTGGCACAATCTCGGCTCACTGCAACCTCTGCCCCACTCCCAGGT^ 

AGGAGCTGGGACTACAGATGCCCACAACCACTCCCAGCTAATTTTTG^ 

CrGGTCTCGAACTCCTGACCTCAAGTGATCTG CCCACCTTGG CTTCCCAAAGTG CTGGGATTGCAGGCATGAGCTACTACGAATGG 

CTGAGAGTTGTTTTTTTAATGTATGTTTTATATTTCTAGGATCTAATGAAAAA 

GTGTCTCTATGCTAATATTGAGATCCCAAAGCATCTTCTCTACAGAGATTT 

6 0 CCATGGAGGAAGATATTTTGCCAAGGAGAGGTTGAGGAGAGTGAGCA 

GGTGGTATGGATTGGGCIAGCCTTTACCTGTCTGGTCAGGTGTTTGT^ 

GGTTAATTAACGTATTGGCCATTTCTGAGATAACTTCTGTCTGTCCCCAGGTTAAAGGAGT 

AGCTCTTGATTTTTTTGTTTGTTTGTTTAATTAAATATAGT^ 

GTGTAACCCAGCAGATGCCGGGAAGTACAAACCTGGAGGGAAGGACAGGCCTCTGCAGCrrGCCCCCCT 

6 5 CCATGTCTAGAGCTCACGGGATTCCTTTTGGGTGGAGGCAGCGTTGCCTCTGCTCT 

TTAACGTAGAGTCTTGTGATGTGGTCTGTTTAAGGGTGGGGCTTCTAGCATCCATACATCAG 
AGGGCAGCTAGGGAGTAAGGTAAAATGTGCAGCTTTCTTTTTTCT 
TGCCCATGCTC^AGTGCAGTGGTGCGATCTTGGCTCACTGCAACCTCCGCOT 
TGAGTAGTTGGGATTATAGGCATCCACCACCATGCCCGGCTAATTTTTTAATTTTTTTC 

7 0 TTACAAACTGTAGAGTGCTTCCTX1AGGCACTAGCTTTCTTTCTGT 

GATCAAAGGGCCTAATCrrCACTCAGCTACTTTTTAAAGTACTTGGTTGAGATCATGTCATATC 
GAGTGTGTATAGCAAACCTATTATATATGTGAACATGAGAAATAGGATCTTT^ 
AATGCATGAAAAGTACCTATGGGCCAGGCGCAGTGGCTCCCGCCTGTAATCCCAGCAC 
TGAGGTCAGGAGTTGGAGACCIAGCCTGGCCAACACAGGAAAACCCC^ 
7 5 TACACGCCTGTAGTCCC^GCTCCTCAGGAGGCrGAGGCAGGAGAATCGGTTGAACC 
GGTGCCACTGTACTCCAGCCTGGGTGACACyurTGAG^ 
AGGAACCAACCTGTAAACTGGTGGTGATAATGGACCTAAAAATAATC^ 
TTCTCTTAATCATAAAAATACTCGATAATAGCAATAGCTAACATC 
CATATTAATTTATATAATCTTCAAAAGAACAGGCAGGGAAATTGAAGAAAAAATA 
5 AGTAAGCCAGCATGTAAACTAAGGCGTCCAGATGCTGGCACGCACCGATGCCAGG^ 
TTGAGCTTCATGGTAATGTAATTCCTGTATTTTTTATTGGGA 
AAAGCAGAGCCCTGTTGCTTTGGCATTCATCATTTTTTTTGTACT 

TCTGTCACCAGGGTCCACTTGGGGACCTGGCATGAAGGGGGACCTG^ 
1 0 TTGGCCTTCACGAAAGATGGGTAAAGTGGAGTGACCTGCAGAGTAGTGGTCCAAGTCTGG^ 

G AGGG CCCGTGTGCACAG CACGT AG CCGC CGGGGGGATGAAAAGATGTGTCACTCTTTG CT CTTG AATGATT ATGAAAATG TCAGA 
TGGTTTTATGGATCTGTGATCTCAGGATTGTAATT ACCAATC CACAATCGCATTG CTCACTGGG AATTG CTTCAAG CGT ATTTT CC 
AGGTAGTGAATGATGAGCTACTTAATTCCTCTTCCCTTGAAT^ 

CTGATCCACACAGGAATTAGAAAGTAATTCTTCCTCCCCTCCCACCTAAAGCAGGAGCAAGGC^ 
1 5 GCTGTCGTGACTCACATCCCCTCCTCCCCACTTTGAAAGGCATTTCAGGCCAAGCT 
TTTTAATTCCTTCTGTGTTAAGCTCCTCAACTACC^ 

ATGTTTCCAGCGAATTTTAGAATGTCTTTCTGTTGAATATTTTTAGTAGTAT 
AC^GACTTCCTTGACCCTAGAGTGGAGGTTAAGGTCTGAAAGGG 

GGGCTTCTGTCCCAGAAGCGGCAGTAACCCACCTGGTGCAGCTCTGAGTGAGCTGGCGT^ 
2 0 TGTCTTATACAGAAACCACTGAGGTTTTTTGTGTGTGGTTGTTG 

TTGCTCTTTCGCCCAGGCTGGAGTGCAGTGGCGCCATCTCAGCTCACTC 
TCAGCCTCCCGAGTAGCTGGGACTACAGGTGTGTGCCACCACGCCCGG 
GTGTTAGCCAGGATGGTCrCCATCTCCTGACCTCAGCTGATCCACCCGCCTTGGCCT 
CACTGTGCTTGGCCACCACTGAGATTTTCTAAGATGGCATTTGTATTG 

2 5 ATGTGAAAACCCAAAGAGATGATCTAGGCACCTTCTCAATCCAATGTGGGAGCAG 

GAATTTCTCTTTCCGTCTGGACCCTCGCAAAACCA 

TTGCAGTCCTTGTTAGAATTCATTGTATGACATGCCATAAGAAAAAAAAAACCCTGTCTGCATTTAGGC^ 

CAAACATGTAAAAAGTTTTCTGGATCCTGAATTT^ 

GCAATTATCTTAAATACITTCGTCTTCATTTTCCTCTCCT 

3 0 GCAAAAAGGGTAACAAAATATGGTCAGCAAAATAAAGCCGTAAGTT^ 

G CCTT CCCTGGG CAGTGC CATCCT AAATCACATCAGGCAAATCCACATGCACT CAGGTGGT AC^ 
CTGTAAAGTTTTTAAAGGGCTTTCTAATTTGTTCTGAAATTCTCTGGCTGTAAGTGGCTGCC 
AATCAAGACTAGGGGTTGAGGTCTCCATTTTGGCACTTGCTAACACTATGAGAGCCGAITCCTGGGGGAG^ 
GCCTCTCCAAAACGGGGACAGTTGGTAGTACCACCGTCTVC^TCACT 

3 5 CCTGGCACACAATACATACTCAAAAGAGGACAGGTATAATTCAGATTTAGTCTTGAAATCCTTCT 

AAAAAAGATCTGTGGCTCCTAAATTTAAAAATTCATGAAGTTGCCATAATGGTACACTTT 

TAAATTTTGATTTATTTTTCTGGATTTCGTTTTGTGTTGAGACC 

CTCTGGGCACTGTTCCTAAGTCACTAACGGGTGAAATGGCCTTGCCCCCAAATGT^ 

CACGTGTGGGTGACATCAAACATCAGTGCTTAAGATCTAACACATGGGTCCTGTTCAGTAGAGGCATGACACAGC 

4 0 CCACACTGTC CT AG AAACG TG ACG TGT AATG T AG CAGGTGT CTTGGGGTCGG AATT CTTGT ACT AGG CAAATG C AGTGT GT CTC CC 

AAGCGTGCCTGGGATCTCTCCCCGCAGCCAAGCTTTGCAAGCACATAAACATGGCCG^ 
CCAGCACTGGAGACCCTTCCATGTGTGCAAGGCAGCTC 

GAGATGGTCAGGCCTTGTCATTGGGTGCACGTCTCCCATGGGGATGGATGGACCCACAGCTT^ 

4 5 TTTCATTTTGAAAAATTTCTCCTTGAGTTTTCrrCAAAGG 

TAGGAAAGGCATTAGTAAGAATTTGATACACTGGGTCTCATGGAACTCCACT 

TAATAATTCCAGTTTAGCTTAAATAAGTAAAAGCTACCCCCTTACACQATATCTGCATTTAA^ 

TCTGTGTTCTCCTGTTCATCGCATATTCTTTTCCTTTAAAGGCAAC^ 

ATTATG AAAAGTTGGCAAT AC AG AAAGGG T AAATT AGGT AT AAT CT ATT AC CGT AC ATTATGGCT AT ATT AT CAGACAT ATCACAA 

5 0 GCCAGTGACATTTAGTTGGATGTGAAATTCCTTTTTTATAGTATGATTGACA 

TATGCCGTGTCACTGGTAAAATTTGCTGTCATGAATAC^CAAATATTCTGGATC 

atatatataatatatattatatatatactatatattataatatatagtatatatataatatatatactatataatatattatatat 
agtataatatattatatatactatgtatatajgtatatataatatattatatatactatatataatatatatagtatatataatata 
tatataatatatattatatattttatatataatatatattatat^ 

5 5 TCTG TTTGCCTTTTCAAAAAGT AACACAATGG AAT CAATCTG CC AACTCT CAAGACC C CTAG AAAGTT ATTT AAAGGC AGT ACATT 

GGTTTGGTACTTTCATGATATACATATACTTAGTTTCCATTAAAGC^ 
GTGTACTTCCATATACTGAAATGTCGAACATTTGATACTTGTTTGATTTTGTATTT 
TTCTGATTATTTAAAATAAAGAGTAAACATTTGAAATAAGTC 
TACCACCTTATTTTCACTTACCTTTGAAGTGGTTGTATGATATT^ 

6 0 CCACGTCCCACGCGCCTCTCGGACACTACTCGCCAGGCATAGCCCAGTGTTTCAAAAGT 

ACGCCTGTAATCCCAGCACTTTGGAAGCCCGAGGTGGGCCGATTACCTGAGGTCAGGAGTTCGA 
AAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGCATGG^ 
GAGAATCACTTGAACTCGAGAGGCAGAGGTTGCTGTGAGCCGAGATTGCT 
TGTCTCAAAAAATAAAGTGAACCAGGGTGCTCAGGTAGCCTCGGTGAGCGAGAAAG 

6 5 TTCACAGTTTTCCCATCTGTAGAATGGACTGCATGATACCTCT 

GAGCTGGAGCAGTGGCTGATGGCGAAGCTCCAGAGCTGGCTTCC^ 
CACCCTCAGCCTCCITGGGTATAAAACCAGGAGTATGATATCCCTCTGTTG 
TTAGGGCATAGCACCGTGCTGGGCACAGAAGTGTTCCTTAAATATTAG 
CCTTGACCATGTCCCAAAGACCTTATGAGAGGAAACAAATGGAATTCTGTTCAAGT 

7 0 TTAAAAAAAAAG AAG AAAAAAAAAAGCTTGG CAGATT TGG C CCTCACAAT ACAGT ACAATAAGGGci"! 1^ TCT Cl 1 Vl~i' C CTTCCAT 

ACTACCTATTATTAATTCCAATTACTGCATCACTGAACTGGCTTCAA 
GTTGCCCTACTTCCTTGAGGGAGAGGTGATCTTTTTCCAATTAAAGGTTTTATCA^ 
CTATGCTGTTGTGTGACCTG CAAAGCTCTGAGCATCT AGAAG CTTCCCCTGGGTCATCTATAAG 
TGCCTACTGCACT.GGGTTCTTGAC^ACAC^-TTGAGATAATATATATCAATGTGCl'1 
7 5 ATATAATT AGATAATTAAAAAATAGAAAAT CTATTATTGTTTTG CATGTACTAT ATT AAAAT CTT AGGG ATTTT AAATAAAATTAA 
TTGATTTATTGAAAGTAGATATTATCTCGTCG 

TCAG AC CC CACTGTG AATCCAT ACTTTT AGJUJVTTTT AC C'l' UT IT CCATT CACAT AT CTT AATG TTTTCTCT AC CT ACAACT AGAT 
ACCCTTTTTAAATC-TTGGAAACAAAAGAAGTAACCTnGG 

AGACCAAATACAG TTTTTC 1 1 CT AAAACTCTGTT ATTCCATTTT CTTT CAATG AAATG CT CACAGAAC CAGAAGG CTTGGG AAGAG 
GCTCAGAGCACCTAAGAGTTGCTGAGTCCXrrATTCCriTTTT^ 

TTTGATTTCAGGGAGGTGAAATGTTACACCCXTGAGTCT CAGG ACG AGTCAAAAACTGCAGCTG CTTGG AGG 

TGAGAATCTTCCAAGGCTCCAGAGGCCAGCTCGCI^ 



TCTrGAGGGCTGGTCTGTTCTCGGGAGCTGCTCTGTTGAACT 
1 0 GTGGAAACSACTCCGGGCCTGCCCGTGGAGTCTATGTCTGGCCC^ 

CCT CAGATTCCTCATCT ACAGAAGGAGAAAGAAGAGGAG ATTGAATG^ CAG CTTCT AAAAGTTTGATTTG 

ATGGTTGTAGGAGCAATTGTTCTTTCAAAGCACCA^ 

ACTGGGAAGACAAAGATTGTGACrrGAGAAGGTGG 

GGAAAGAGATGAATGAAGGTCCACGCCAGAGTGACATGCTCTAA 
1 5 ACATTTGAAAAAAAAAAAAAAAAAAAAAGACTATCCTTA 

AAAGCTGGCITGGTCAGAGACCTTGOTGTTAATTCCTCAG 



tctagcascaaacagctttggttgaccacaccacitccct 
gccaaag aaaacaagg cag agaataactggcttgtgtggagtttggaaag ct 
2 0 gctttactatatctctccgctgcagcrrgctgagaaa^ 

ctgggaagggagtggggcggggtccccctccggcttgggaaaagacagggtaatct 
tgaggtctttgctitcacacacgagagtgttcacaca 
gttgaaaaggcagctctcctattgcctg cx3taacagagagg ag ctgggagg 
ctcaaggtagacaggccagggggaagttgtgtacctttcgaatcg 

2 5 gtcctgggcgtccatccitgccctccatccctgt 

caaatgtgctcaccaaggtgaataggaggcactagtgtctitg^ 
ggaccagcatcttcatcaccatgctgtgctctgtgggagtctag^ 

tccctcctcacagatgcccrrtctcagcaggtggctgcrttct catagaagtgagcctgatg cctgaatg 

agg ctgtcatgacacaaacaaccggaag cctt atttgtgagggag^ 

3 0 tgt act agtgagtactcaggagctgtgagtggccagggtaccctgt ccaggacctgctcag ccatctgtccaa 

gggctttgcaaggccagcctagaacctcagttggcgtctgtgac^ 
gaaagtccccgaggcccag<xx5gcagggcttttagaaataggag 

TCATCCrrCCACACTt3CTGGATGAGCATTCC!AGA^ 
CATTCATTTATTTGTCCAGCATTTATGGAGGCATCACCCAACCTT 

3 5 CAGTGTCTCTGCCCATTATGTATGAGATCTGCTGCTGCGGGAGGGGAACT 

GAGCCrrCAGTGCTGGTAACATGCTCACGC^GCAGTGTGTG 
CTACCCCZAGGACTAGCCCrcCTGATGCCTGAACTCCAAAGCC^TG 

CTCCATAGTG CGCCACCCGCCACCAATACTTT CTTTCTCTTCTGC AAGTCG CCTTT CTA 
CTCTGATCAACAAATCCATGTGA.CTGGTG 

4 0 AG ACCTTAACCTAGGGAGGGTTG CTAATATTTGGGATT ATAGGCTGGTAG AAGAAGAAAAACCTTGAATTTGGGA^ CAC 

CCTTCTTAAGAAACTGCCTCCACITCCCCCACAGAGG^ 
GCTCAAGTTACCTCCITAGTCACCAGACAGAAATGGACAGTGAGCCCTGGCG 
GCHTTTAGGAGAGGCCCGAGTCTCATGTGAGTGGGCGAAAAACTCCACTGCrr 
TTTGTTTGGGTTTTTGTCCCGTGAGTCTTGGGCGCATGGTGC^ 

GTCCTGAGGGCGATCACAGAACGGTGTGTGTGTGTGAGTGGATGTATCK3ATGTGTGATGG 

GTGTATGTCAGTATGTGTGTATGTTTGTATATATtTTGTCCCCATGTCTGTATATCTGCCT 

GTGATATATGTATATGTGTGTGTGCGTCTGTATATATGTGTCTGTCCTGGTTTGTGTCTGTATGTGTATGTC 

TTAATATATGTGTCTGTGTGACGTATGTACATGTGTCTGTGTGTATTTATATGTCTGTATCCCCATC 
50 TGTGTCCAGGTGTCTGTGTATGTGTGTGTCTGTATGTGTTTACATGCATGCCTGTGTTTG 

TGTGTGTTTATACATGTGTGTCCATGTGTATCTATATTTTGTTTTGTTTTG 

TGGAGTGCACTGGCATGATCTTGGCTCATTGCAACTTCTGCCT 

CTGGGACTACA.GGCACGTGCCACCATGCCCGGCTACTTTTTGCATTTTTTAGTAGA 

TCTCGAACTCCrTGACCTCATGATCTGTCCGCCTCAGCCTCCCAAAGTGCTGGGATTA 
55 CTGTATGTTTTTATGTGTATGGCTCTGTCCGTGTGTATGTTTGTCCATGCATGTCT 

TCTGTGTTTTTGTATGTCTCTCrGTGTGTGCGTGTGAGAGGGTACTCACT 

GTCAGAGTGCCACTCTCAACrrCAGCAAAAGAGCCTTTGACCTATTCTT^ 

TTGATTTAAT ITTTTTITTITT TTTT^^ 

ACTACAGCCTCTACCrrCCrTGGGTTCAAGAGATTCrrC 
6 0 TGGCT AATTTTTTTATATTTTTAGTGGAGATGGGGTT^ CAGGCTGGTCTTGAACT CCTGACCTCAAGTGACCCG 

CCTGTGTCGGCCTCCCAGAGTGCTGGGATGAGCCATCAGGCCCGGACTGATTCAACT^ 

CAGTTATGATATTGTATTTTTTCTCAACAAGAAGTTTTTCTGTAG 

TGAAATGATTTCTGCITTCCCACAACCTAGTGGAGGGTGATTGGAC^ 

CAGCCAACATTAGTGAGCCTCCTAGGCTCCAGTTGGTTTTATAGGTGCT^ 

6 5 TCTTTCTAGACTTrACAGTCCCATGGGT 

TAGGGACTGGCTGGATGTGGGATAGTCTGCAACTTAAAAGGGGATCTCCAGGAACT 

ACTTGAAGGGTGGGAGGGATCAGGTCCACAGGGATCTCAGGGAAGAGC^^ 

GGCAGTGACACAGGGGCAAGAGCAAGGTGGCCACTG 

AGGAGGCATTTGCCTTAGAAAACAGGCTTAAGAAAATGCCACCTGACCTCAGGTG 

7 0 CCACTGTAGGACCATGGTTCGGGAAGGCCCAGAGTAGGAAAGCCCCATACACAG^ 

CACCCACAAGAGTACAAAAGCTCTTTCCGACCAAAGGCTCTGCTATTCCT 
ACJAAATCXCTGAGTGGGTAAAAGGGATTCCTAGGAAGTGGAGGTATCCTAGGGGACTTGC 
TAGGGGCJVTGAGCTAGAGGAAGftGGGGCCTGCACAGG 
ACTGCCACTGATCXSAAAASCTGGC^ 
TCCCTCTGGACTCAG CCTTGGGATGGTAG AGGGGGAGGGGTGAAAGCAGAATGGAATGTCCCAGTCAGTAGGGTAG CACGAAAGGC 

TCTGTGAAGGCTTAAGGGAGAGAGGAGGGCACCTTTGAATACTGAAGTCCAAGGTC^ 

ATGAAACAGCTGGAGGCAGAGGGAGAGTGAATTCTTAGGACACCTCGTGCA 

AGGCCAGGTGGGATCGCAGGCTGGGCGACCTGGTGGGGATCAAGACAGAGCTC^ 
5 CT CCCGCAG C CAGGT GTCTGCCTGAAAAAAGACAAAAGGGAAAACTTTGAAA TCG T AACCT CATT AGAAGTTGTT CAAG AAAGAC 

CGTGCTTAGAACCAGTTGAGTAGATAATGGCAGCAGATGTAATTG 

TCTCTTAATTCCAGGCITCCAGATACGGAGCCTCAGTGAT^ 

TTTAGAATGGAGAATTCTTAACGCCTACTCTAGCATTCACX3TTTCCCTC 

GCATCTGCATTAACACCCGGTCCTGTGGGCACTCAGAGAACTTAACTGCTGCAT^ 
1 0 TTCCCTGCCATTCCCCAAAGTCrTTCTATAGATGGCACAGCAGAAATTACT 

AGACTGATTATCATGACAGTGACrrCTGACAGAATGTGCCCAGTTATTGTAGGA^ 

GGCAGAGTTTGGGCATTTTCAGCTGCTCTGTGTGTATTGAAGAQTTAAAAA^ 

AGGCCGTCAGTCATTGCTCATGAACAGAGTCTAAACACAGTTCAGGC^ 

CAGAAGCAGGCAGGAGAAGGGAAGTAGTAATGCACATTGGAAGCTTGAGTATC^ 
1 5 AGCGTGCTCCCTGTATCGGCAGCCTCAGCATCACCTGGGACCTTATTAGAACTCCA 

TACCAAACTAGAATGCCCAGGTAATTCCATTGCCCACTCAAGCAGGTGAAACACA^ 

GGGCTGTCTTCTGATTCTACACACACACTGGGAGCAGAAGTC 

CTGAGAT CTAAAACT ATG AAG CATTG TG CACTTCTC CACCCTC TTCG CITTG AGTT CACAGAAATGTGT ATTTCTAGGTG CACTGA 

2 0 TGAATTTCTAGGTGCACTGAGTGTACATATCTGAGTGTGTGTGTCCGTGG^ 

AGTGTGTGTGTCGGTGGGTGTTGAGAGTGTGTGTGTTTTCAGACCTGCCCTCTTCCTC 
ACATAACCTCTCTGAACCTCACTTCCCACAGATTAGTTAAAAACCAAGGCTC 
CAACCTTGTGGTGTTGTCTTAGCTTAACGATATGAGAAGAAATTGGGGTGGGGGAAAGT 
GACAATATATAAGCATGCATATATAATATGAATTAGAATGTAT^ 

2 5 TATATAATATACATGTGGTATGGTAAATAATATTGAAGAATATTTCCCAACATTAAATTAGCCAGCATATG 

AGGGAGATTTATTTTACCATCTGCAAGGGGCTTCAGGAAGAGTCACATTCAGTACTGTCAC 
CTAAAGGTTAGACTGTGATACCAGGAAAAGTAAGCTGTTCTCTACACACATTGATTTer 
TCAAAGGAAGAAAACCCCATATTTTTATATCCAGATTCATGTAGTGAGTATCCAATTGACTGGC 
ATCCTTTTAAAATTGACTATATAAAATTCnTATATGCTTTT^ 

3 0 CTTTTTGGTATTATGCTTTTGCATAGATTTCGCTCA^ 

TAAGAATCTATCAAGAAAACTAGTAAACCCATATCGGAAACATAGACTAATATGTTGTTGACTGCC^ 
TAACCAAAATTTTCCAAGAAACGGTGGCAATTTAACCTTATCTGGTTTCT^ 

CAGCCACAGATCACAATGGTGTCTTGAAAAAGGCAGCTCGGTTATCAACGAGATATGCATACCTGCTCAG 

G AAAAGTGAAAG CTGG TCTCACT ATCCGT CACAAAAT ATCAT AACATTT AAGG ATGG C CT AATG AG CAGC CAAC C CCG AAAACAAT 

3 5 AGGAATGCCITTGGAATGAAGTAAAATTTTGCAAA 

TT ATG ACTCTG CAT ACAT AAT AAAATTCCAAGCAAG T ACTTT CATTTTTTT C AAAAATG AG AAAATGG T ACT AGAAGAATTT AGG A 
CCITTTTC^CTCTGCTTAAGATTAAGATGTCCCAGGGTGAAAT 

CA CAGTTGATT C CAAACATG CTG AAG AAGCC AAGCTG CCAG CATG C7VGGTG TTCAG AGCTG CAT CCTGGTTCT AC C CAAGGG CTTG 
ACCCAGGGTCGAATTCCCTGTTGCCCTCTAAAGCCCGGTACTCAGTGAGAAAGAAGAAAAGTGAATTGGATAGTAGAC^ 

4 0 TACTCCTCTATTTAACATTCGGGGGGCAAATGAGCCTTTTAACT^ 

CCCCAGCTTGAAAGTTTATATCGGAAGTTCCTATATTTCATGCAAAATATTGCTATTTGG 

CCTTGGTGTCACAGCTCTCCAGCACCATTGTGAGCCTGAGGGTCIAAGACCC^ 
AACACACGTGGGGAGAGTGGAATGC<5CTCTTCCCCAATCCTCCAGCATTAAACTCAGCCCT 

CTACAATTTTC^CAC^TATCAGCTGGTATAGAAATTGCTCACAAGTT 
AATTAAAATAAAAAACAAAGAAACAAACAAACAAAAAATACCTCACGTTGTGCCTG 
TTCCTCTTGGGTGTCTCCGTGGCACAGGTCCCCACGCTTGCrGGCTAAGAG^ 
GGCTGTAGACTCTACCACGTCAAGTGCTGTGTATTCAGAATGACCTCGCTCACATCACT 

5 0 CAGTGTTCTTCAAAAATGCCACTAAGTGCCTGAGGAATGTGAAGTTTTC 

GGCATTAIT^CTGAGGCAGATAGAGCTCTCC^CAATCATTGTTC CTGCCTTCTTCTTT ACCCCAACTCGTAATGCCAGG CATT AGA 

CTGTGGTAGGAGGTTAAAGGGCTGAGTTATTATGTTTTGAATGGTTGTTCCATTG<3AATTC 

GATTTTTCAGTTTGATCATCAGGAAGCCAAGGATCTATGGC 

ACTGAGTCACACCATGGCTGACCAACACAGAGCGCTCTAGAAAAAAGACATTTTCATACTTCTGAGGGAAAGGGATGATG^ 

5 5 GCTGCAAGACCTGGCCCCATGGCCTTTCATCCACAAAGCCGGTGCTTAAAATACGCAGCC^ 

GAGCCrGCGTGACTCAGGCTGTTrcCGTTCAAGTCTCCTG 

ACATGACAAGAGGGGCCATTGCATTTGAACTACATAAATATTATGCTTAAGTC 

TAATGATAAAATCACTGCAATATAGTGGGCCCTTTTTTTCTGGTCTGGAATTTGAGA 

GCAGGAAGAAATGGCCAGCCATGG AG ATTTGC AG C CATCATG CAAG TT AT ACCATTTAATTTT AT AATTGT AG CCATTGGG AG CTG 

6 0 GACTACAGTGCIACTTCTCGTGGTCTGCTCTTACCCCAG 

CAC^CATGTGAAATAGCACTTTATGGC^CAGCGGGCAGGCTGTTTCTCAGTCTGGCTTTGGGGA 

AGGTTGTGCAATCGATCAAGGACTCTTTATTGGAACCATAAGGCTGAG^AAGCAG 

ATGGCCCAGGCCTCCTGTGGCTACCTCCCTACCTTGGAGCCTCAGCTT^ 

GAAAACCGTAGGCCCCAGTC CCCCATGAGTGGCCTCTGCTTG CrGCTTCCTGGTGACGGTTGTTTTGCTGAAGAGCATGATG AGG A 
65 CACTTGCTATCTATATAGACACITGAGGGGCCCIACC1ATCCCACCCTATATAGACA 
TGAGCCCCCTGGGGAAGCTCACCAGATAGGCTGTAGCTACCTCTGGGCT 
ACAGGCGAGGCCTTCCTGTCAACTCCTCCITAGGGGTAG^ 

TGAGGAGAGGGAGAGGGAGAGGGGCTGAGGGGCTGAGGATGCCCTGCGTGTACCT^ 

7 0 TGGGTTTGCAGTAAGGAACAGCCGTTCTGATGCCCCTGGGA^ 

ATGTGCCAACAATTAATGCGCCTCTTTCCCTCCCGGCATCACCACCACC^ 
AGTGGGTGTAGAGGGTTGGX3GAGGGTGCCAAGGCCCAGACT 
CTGAGAGGACAGTGGCCCCAAATTGAGCTGGC^TATCT 
AAAGCCCCAGTTTTAGGAAATCCACAATACTTTTTCTGATCTCTT 
7 5 CCCCACAAT AGGACATCGG CAG AAACT AGATG AT CAGACCAAG C CCG GG AGCTTGT C CTTttCCGhGCGGc 1 CAG a oAACTGGAGC 
AGCTGCGGrcCACAGCCATGAGGGTCAGCCCACACC^^ 

TTTAACCCTCAGCCrCAGAGTCAGATGCAGGGTAAGTACCAGATGGAGCCCACTGCCCGCCTCT 
CAACTGGCCCCCATCTGCACACACCTTCCCAGACCAACTGG 

ACCTGGGAGCTGTTGGAAATGCAGAGTCTTGGGCCCCACCCACCCCAGACCCACCGAGTCAG 
5 CAAGGAAAAGCACTGCTACAGAGGATACGAGGAGCTCTGGOTATGGATATC 
AGTTAGCTTTGGCTATGCTATTITTTTACCTACCCATTAGG 

AAGTGAAGATGGGATGTTTCAAGTTCTCGCTATTGCCAGATTATTTGTGGGTTTTG^ 
GTTGAAATGTATGGATGTCATCACATCAAGGGTGTTATTTGCAAATCAATAGAGAATGO^ 
GGCAATTACTAAAGCGCTGTAAGATGCAATAATTGCCTAAGGCCCACTGTC 
1 0 TAGACCAGTGACGTCAATGACTGTTTGCTCTGTGATAC 
CAGCTX3TGTGTGATATGGTTTACAGTAA 

AAAGGGCAGGTGGCCAGCAATCGATGTGGTAGCTCTTTGCCCCTCTCGG^ 
TGAGTTGTATTGTGGGATGTACAAATATCATCTGTTCCTTTGGGTTTCCA 
TCTCAGGCCGGGCGCGGTGGCTCACACGTGTAATCCCAGCACTTTCGGAGGCCAAGGC 
1 5 ACCATCCTGGCTAACATGGTGAAACCCCAACTCTACTAAAAATACAAAAAATTAGGCA 

GCTACTCAGGAGGCTGAGGCAGGAGAATGGTGTGAACCCGGGAGGTTGGAGGTTGCAGTA^ 
GCCTGGGCAACAGAGCGAGACTCCATCTCAAAAAACAAACAAACAAACAAAC 
CTCCCATXK5AGCTTGACAGCCCATGCCATTAGCTCTCACTGTTAGGTTTCT 
AATAAATCTTTCCAGAAGATACTATGCACACAGCTAAGTGGCCTGTCTGTGGAGTAACCC^ 

TACATAATAATCATTAATAATGATTTAAAATGTGTACATTGATTTTTT^ 
CAAATGGTATGTTCAGTGAGCGAGATGGGCTTGGGGCAAAACAATACTTTGCT^ 
TCATCTTGCCTTTAAGGTCATTTAAACTAATTCACATAATCTTCAGAAAA 
GGAAGACTGTTTTCrTAGTCTCTCGTTGCCTAACTGGCCATTTATACTACT 

2 5 TGGGAACAAAAAGTCTTAAGCATTTTTAAAGGCAATGGAAAATTCAGCCACA 

TCTG Tri ' C TTGG TG AAG AGTGAATCT AAT CTTG ATTTC CTTCTT CATCAG AT ATG C CT CTTT AACAACAAAAACAAAAAAATGGGGG 
GAAGTTGCACATTGAGTTCTCATCCTCTGAAGCTCCTGGGCAAAAGCATTAT^ 
GGAGACCTATATGCITAGTAGAGTATCAGGTTTAACACTAAGCATCGCCCCTTTTTTGTTCCT 
CATTTAAGAAATCATAACTCGCTTTTAGAGTTCCTACGGGCTTGGTTGCTACT 

3 0 TTTCTTTCCAAAATATGAATTTGATATTCAGTACTTCACTTGTAA 

TTTCATGTGAAACAACTTTTCCTTTTCCTTTTGTTCT 
ACCCTTGGGCTTTTTCTCTTGAAATTGGAAAGTTAACA^ 

ACAGTAGGAAAATGCTTATTTTCCAGGCTCAGCCCTGAAATTTCTATGTTATATTTACGAA^ 
TTTCTGGCTTTCTTTGAAATGGCAGTGGTAGTTTTCAGAAAAA 

3 5 GCATGAATGTTTTTCAGCTTTACAGAAATGTCAGTAACCGGGGGTTTGGCGAGC 

TACCTTGCTAGGACTTTGTCAGAAGGGTTATGCCAGCTCCTCTGACTCTGTCA^ 

CTT ACCT AGAAT CACATG TGCTGG AGGGTCACTCAAAAT AG ACAT CTC AAATTAT CTG AAG CACGGT ACAG ATTG AG AGGGGG CAA 

GTTGTCATGCC^CTAAACCACTGTTTATTTGTTGGGGTCTTAACTATTAAAGGGCTG 

GTCTTAAACTGGTAAAGCCATTTTCTCATGAAATAGAATAGAAAGCCATTTTGTCATGAGTA 

4 0 CAGTCAGGACACCATCATTAACTTCTACGATCGAGTCTTAGAATTCT'l l'T r T AATTT ATTT CAAAAG AT AG AACAGGG CAG ATT CT 

AGAGCTAGTGGTACCGATGCCTGTTCAGTCACAACTCCCACCTAGAGAT^ 

CTGGTACAGAAAACTGAAACCTAGATAAACTAGGAAGCCC^TTTTGTTGTTCTTCTGCATC 

AAG ATTT AACTCTTTAACTGGACCAGTTG TG AG CTATTTTGACG TGGG TC AGACATTTGT AC AT ATT ATTC CATTC CCTCAACAAT 

4 5 CCTGTGGTATCATTTATAGACAGAGAAATCGAAATTCAGAGGTTAAAAATTGCCCA^ 

GACGCAAACCCAGACCTTCCTTGCTACAAAGCCAGTGCAAGTG 

TTTACTACAGGGCTGACCAGTGT03AGAAGAATCCCAATCACCAAAG<3AGGGAT 

ATTCTTGTATTGAACATGTTCTATTATAATTAGATGTTGATAATAAGAAGAATTT^ 

AAACTGT CTCCTTTGG T AC ATGTG CATTTGTT^ ATAAAGTTTGGGA CTCAG AATAATG AAT CT CTAAAT AAG AG TG 

5 0 AAAGAGGGGGTATATGAGCAGGGAGAAGGCTTACCAGCCACTCTGT 

ATT GGGAC CAAGTGCTT AT CATCCAAACC CAGTCAACATCT AATC C ACTTTT ATG AGT ATG CGTG C CTCACTTTG AGTGTT C CAT A 
TGGCCAAGAGGGGTCCCTGACCTCTCAAATCATCTCTTGATT^ 

AAAATTAAAGTTACITAGGAAGGACTATATTCTGGTGCTGCTGTCTGCAAAACTCTGAGATA 
GTATAGAGTTATTTAAGTCCGTCTGATTCTCTCTTTATCATCTGGATAGACAAAAGG 

CTCTTCCITACTCTTCTAGCAATTTGAAGTT^ 

AAACACCAGAAATCCTCATATGTGAATCTATAAATGGTCATTTAATGGTTACACAAAGTAACATATTC^ 

TT AT ATAAT AGG CT AT AAAC ATG ACAGGTGAATG ATTTCATTT AAAAAT AG CTTTT AACAAGG TGT AGTAGGGT ATG AG AG C CTTT 
CCAGGTCTATAGCATTATAATCTTACCCCGACAACAAGCCTGAATCAGGTGTTC^ 

6 0 AGGGAGAGAGAGAGAAGAAATGAAAACCAGTGGGATGACTGCAGAGATTCACTGCAGGGCTTGATCT^ 

CTGTAGACTGCATATGTTCTGAACTTTCCCCTTATGCTTGTAAATATGTAGGCCAGCACA 
AAGATTAATAGTTTCAGCAAAGTTGTAGTTGGAGCATGTATTTGAT 

ATAGCCAAGAAAGATAGCAGATCATAGTGGCTAAGAAAGAATGAGGGCTCAACAGGCAGACT^ 
ATAGGACTTTTGGCAAGTTACTTAACCTTTTAAAGCCTTAGTTTTCTTA 

6 5 TAGGACCACTGTGAAACATAAAGCAGTGGACATTAGAGCCAGCCCATC^ 

CAATAATATCAAGTTGTCGTAGTCTTATATATTATAGACTGGGTAATATATAAAGAAAAGTT^ 
TGGGAAGTCCAAGAGCAAGGTGCCATCATCTGCTGAGAGTCATCCCATGGCAG^ 
GCTCACTTTTATGACAAAGCCACCTCCATGATAACTAACTTACTCCCTCAA 
TATGACTCAATCACCTCCCAAAGGCCCCACCTCTCAACATTGCC^ 

7 0 CACATTCAAACCATAGCATTCCACCTCTGACCTCCCCAAATTC^ 

TCCCCAAAGTCTTAACTCTITTCCAGCATTAACTCAAAAGTCGA^ 
GAAACATTTAAAGCAAATTCCATTAGCriTTGTGAAATCA 
ACAAGGTAGAAATTCCTACTCCAAAAGAGAAAAACAGGCAAGAA 
GTAAaCAATATTAAGTCTTAAAACTCCAGAATAATCTCCTT^ 
7 5 CCCCCAAGGCCTTG0K3CAGCCCTGCCCCTACAGTTTTC 
GCAGCATTACCACX5CTAGAGTTGCACACTGGTGG 
CACAGCCTTAGTGGGGACTCTTTGTGGCAGCTTCAACCT 

TTCCCCTGTGGTAGGTCTTGGCCTGAGCTTCTAGGCAGTCTGTGGCATCCTTCGTAATCTA 
CATGTATTCTGCATGCCTACGGAATTAGCACCACATAGAGACCATCAAGGCATACTGCTTGTGCC 
5 ATACCTAGGCCCACTGAAGCCACAGCTGGGGTGGCCAAGGTGTGCTGCAC 

GCAGTGAGCCCATGAAGGGTGCCCCAGGCCTGTCCCCTAAAATCATTCTACCCT 

ACTTGAAGATCTCTGAAATGCTTTTCGGGGT CTTTCTC CC ATTGTTTTGATG AACAG AATCTGGCTTCCT CG CATTCAT ACTAAT C 
TCTTTGGCAAATGGTTGCTTGGCCACACCCTTAGT^ 

CCAAATCTTT CCACTCTGCTTCTCTTTT AATTGCAAATT^ CTTT AAGT CATTCTTTTG CT AT CACAT CTCACTATATGT AGTT 
1 0 AAAAGTAGCCAOICAGCAGCCTCAATGCTTTGCTGCTTAAGT 

TTCCATTAAGTCCTAGGACACAGACACAATTTCACCCAGCTCTTTGCAACCT^ 

CAATACCTTGTTCCTCATTTCTATCTGCAGCCTCATCAGAATGGCCT 

AGTAATCIACTTAAGTAATCTCTACGAAGATTTAGACTTCCCCTAGTTCTAAGGTCT 

AATGGCAGAACCGTTCTTGGCAATGCAGGCrrTTTTTCTAGCCTGCTCCTCC^ 
1 5 AGCCACTTACACACTTTCAGGTATTTGTTGTAGT 

CCTGAGACTGGGTAATTTATAAAGAAAATAAATTTAATTCAGAGCTGGGC^ 

GGCCAAGGCAGGCAGATCACTTGAGGTCGGGAGTTCGAGAGCAGGCTGGCCAACATGGTGA 

AAAAAAAAAAATTAGCTGGGCATGGTGGCACACACCrcAGTCCTAGTTACCCGG^ 

GAGGTGGAGGTTGCAGTGAGCTGAGATCATGTTACTGCACTCCAGCCTGGGT 
2 0 AAAAAAAAGAAAGAAAGAAAACAAAAGAAATTTATTTCATATGGTTCTGGAGGCAGGO 

GTGGGGTCCTTGCTGCATTAACCCAAGGCAGAAAGCAAGGGGCACAAGAAAGCATGTGCAAG^ 

ACTTCACCAATAACCCACTCACATGATTATGGCATTAATCC^ 

GCCTCATCAACACCACCACAGTGGGATCTAAGTTTCCAACATGTGA^ 

AGTTTCCTGTGGCTGTTTTAACAAATAACCACAAACTTGGTGGCTTAAAACAA 

2 5 GAAGTCCAAAATCAAAGTGTTGGCAAGACCATACATTCTCrcAAGCCT 

TGGCATTCCTTGGCITGTGACTGTATCAGACTG 

TTTGTCTAAGGATACATGTTACTATATTTAGACCCACCCAGGTAATTCATGATGAACTCAA 

AAGGCCCTTTTCCAAATAAGGTCACATTCACAGTTCTGGGGGTGAGGAAATGGTTCTC 

GTACAAAAGACAGGGGACATCAGACCATTGACCAGGCACAGCAGGTGAGTCCA 

3 0 AACATGGTAGAGAGGTAAATGTGAACrGAACATCTCATGCACAAATG 

TGAACGCAACATCATAGTATTTTTATTTCCCAGCCAGAAAAATGTATGATTTTTT^ 

TCTAGAGCACCTATACCGATGGAATCAGAGGTCCTGACAGTGATAGAGA 

ATGGGGAAAC7VGGAGGCCCAGGTGCCTCTTGTCCTAGCCTCCAAAAGCT 

CAAAAGATGACATGATGACCAAAGCCAGACAGTTGGGATGGACTCAAGAATTTGACCTCCCAAAGCCA 

3 5 TGAAGCAAGCTGCCTTTTAAAAAAGTGACTTGATTGCCAGGCATTGTTGAC^ 

TCGCCTGAAGATAGTCTTTCAAATTTGGACATTTAAAAAAGAAACTTTTACTC 
TTTGTATTGAGAGAAGAACAAACAATATATGCTAATATGAAAAACAGCTCTACT 
CATAGTTCTCCAGACTGAGTTGGTTTTACTCATCTACATGATTTTT 
AGTTATTTTAGGGCTCTTTAAAATCCAGTATTTGTGATTTAAATGATGCGGAGGGACT 

4 0 CTCTGGCCCTCAGAACACCCCACCCTGACCTTTAGGGGA 

TGTTACATTCTTCCTTCCAAGCCTTAGCCTCACAGGGACCTTCTCATTATTGAACAATTGCCTTC^ 
TGTTATGGAGATTAAAGATACCGATTGO^AAACTCCTGTAAATAAAATCTTCACTGACAA 
TTCTGTAATCTCTTTCTGGCAGAACATCTCATGTTTTGATGTTAGAGATT^ 
TAGAAAAATAGTATAGAACTCACCCTAAAAACAAACATTGGCCAACCATGTTTATTTC 

4 5 ATAGGGGAAGAATGTACCACCTCTAATTCAGGTGATTTCTGATTAGCAAGCTATGGAAAGTCTTG 

ACGCTCCCCTAAATGGCATGGAATAGACTATTTTCTGTTTTAAGAAAAAATAGAAO^ATGG 

ACTAAATGTTGACTGAATCATGGATAGGAAAGATTGGGCAGAAAAGACAGCCACTGCCTCCAGA 

CACCATCATTATTCCATACAACCTTAGGGTCATTTT^ 

CCAATTCTGATCTCCGTAGATCCTGTTATGGGAATTAACCTTTTTGGAAGGGGATTCTTG 

5 0 CTTTCCTGGAGGGAATCAGT ATGGGCAGAGGGAAGAGGAGATGG CGATT CTG ACCTGTGTGTCTCATGTCACCTAACACCT ATGGG 

GTGGCATGAAACTTGAGCTTTAAAACACACCAGGGGCCAGGCACAGTGGCTCATGC^ 
TGGGTGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGCCAACATGGCAAAACCCCGTCTCTACT 

TGAGCCGAGATCAGGCCATTGTACTCTAGCCTGGGTGACAAGAGTGAAACTCCATCTGAAAAAAAAAAAAAAAAAA 

5 5 AACCCACCAGGAATGGAAATGGTGCTTAATTCCTCTACCGTCTTGGrrCTTCCACACAAT 

TCCGTAAGTCAACATCATCAGTTCCGTTTAATAAGACTTGGGCCACTATAATGTCCAGCTCAATCCTCCTGGAAGT^ 
CTGCAGGATCCTTCAGTTGATGTCATAAAAGCAAAGCATCCCTAGGGAAGATCTGTG 
GCGCTAACTTGAGTATCAGGAGCTCACCCGTCTGGGACGGTTAAGTCTCCCAGCCCTGTAAATTGCCA 
GTCCTCTTTACCTTACGGCTTTATGTGTTCAACCTGAGTCATTCTAAAGCCAGATCCAAGACCCA 

6 0 TCCTCTTTATATCACTGGAAGGACTACATATGTCCAGTTGCTTTGGGATTTTT^ 

AAAT AATGTG ACT AGGGATTG CACTTCCTGG AGTTG CAGG TC AC ATG AGGG ACACG AACATGG TGGTTTGATT AAGT CTCGCG GG A 
TT AT AACACTG TTGTGG CCACTCTCACTC CATT AGTG CAT CT CT C CG CACAT AT CAG AT AG CAAATT AT AT AAT G AT AAAGTT ATT 
AAAATATCTGTAAAAGTCAGCACAAGCTCAAGGGCTCGACTTGGCCTTTCT 

AGGCCGGTT AAT CTT C ATCT AT ACTTTGTTGT AAAT CACC CG CT ATG TGG AACAG CCATAACTGC CTGAGTTTTAAG ATGCGCTT C 

6 5 TGTTGGTCGTCCAACAAGGCTG^TATCTTAAAGATGAATGCAGAAGCTTTTGAAGCATGA 

TGATAGCCTTTGTTTGTGATTGTGTCTATACCTGGGTTATCACAGAGC^^ 
TTCACATGTTTACTGAAGGGAGTAATAAAATAATGATAGCTCGTATT^ 

TGACCCAGCACTGTTTCCTTTACCACCAAGTCCCAGTCCCTCATCTATCATGTAAGTGATTAACCA^ 

7 0 TTCTGTCATTCTGGATACTGCACTTCCC6TGAATTAATGGTCACCCCTAACTAACA 

AGCCCCTCACTCTGCTGCTTATTTGGAAGGAGCCTTTATAGTTCT 
GAGAAGCCGTTGCCCAGACATGGATCrcTTTAATTTAATAT 
CAAAATTACTGATGGCCAATTCAGTGGAAGGGAAGGGAAATC^ 
TTCTrTCACGCCTTGGGGTAGGGGAGGGGAATG 
7 5 C^CATGGCTCCCTCAGAGGAAGACACAGCACCCTGGAGATG 
GAT AG AGCT ATCTTTTCCAT CT CAT CCTCAGAAGAGACTCAGAAGAAAG ATGAGAG CCCTCAGAATGCA^ CAG

AATGTGGGTCTGTAATTCCTCCGTGTCCCTTCTCCCCCTCTGCAAACCGTCGTAACAATAATAGTTCCTAA 

TGAGGATTAAATGAGTTAGCCTGCAGAAATCACTTGATGCACAGCACATGGGAAGCATTGTGTGTAT^ 

T CTTTGAGAT AT ATTTTT ATCAAAT ATTT AGGATGGAT C CCGGT ACACTTT CAAT ACTTAATAAATGGT CAATGTT ATTCTTTTTC 
5 ACTATTATTGCTATTTATGTGCTCTTATTGGOGGCATTGTTGTATCACCAGTT 

GCTACTCTGCGATAATTCCCCCGGTGTCTCCTTATTCCTCGGGAGCCCCAGAGTGGATAA 

TCATTTGAGTTTTGTTTCCTTGGATAAACCCAAACAGGAGACC 

AAGTTGAAAGGACCTTAAACTTCTTTGCTCAAAG^ 

CCGAACTAGAGGCAGACCTAACATCCAGGTTTTCTGGCTCCTGGCCCAGTGCTCT^ 
1 0 GTAGCCTCAGAACAGTCTCTAGGGTTGGTGTGGATCAT^ 

AATACACCATCTCAGGGAGATGCTAGGTCCCATGGTTGGGTTCAGTATCATGTGCTGCT^ 

ATTCGTGCT AAGGAGG AGTGATGCTTTTTTTITTT^^ CTCACTGCAACCTCT ACCT CC 

CGGGTTCAGGGGATT CTCCTG CCT CAG CCTCCCG AGTAGCCGGG ATT ACA^ CT 

AATATATATATrrtJTATTTTTAGTACAGATGGGGTTTCAGCATGT^ 
1 5 CCGCCTCGGCCTCCCAAAGTGCTGAGATTACAGGCATGAGCCACCATGCCT 

GGAAGTCCTCCTCAAAGCCACACCTACGCTCCTCAGCCCCACTCAGCAA^ 

TCAGCCCCACCCAGCAAGGAAGGGATATCCTCCTCAAAGCCATGCCCACCCT^ 

TGTTGGGGACAACC!AGCTACAGGAGACCGACAGGTCACTTCCTGAGCA 

CCTGTCCCATTACTTCCATGAATTGGATGCATTCAACAA^ 
2 0 AGGATTTCTTTTAACCAAGTTTCACCGTCCTGTGTGATTTT^ 

TTAAGCCTGCTTGGGGGCTCTGGCTTC^CTGGCGGGCCAGAAGCAGGA 

GGAGCCACAGCTGTGGGAATACGAGGCTTTGGCAGGCCGAGGCCAGGAGGGGCGGGAGGCT^ 

CCGGTT CAGG ATGCCACTG CCAAACTT CATG C CAATTG CTC C CAACAGAGTACACGG CT ATG AGAGGG AG AG AG CTGGG ACCAG AT 
GAAGGCATCCCTGTGACGGCCACGTCCTTGTCCATGAATTCIAGGGTCAGGGTGACGA^ 

2 5 TCACAGGAATTTGCAGAGACCAAATGCCCTTCTTTCAGAGGCA 

CTCCCTT(^CTAAGACTTGGTGACTTGGTAATCCTCCCTCCTCAATGA 
ATGGCAGTAACTAGCAGTGCAGACCTGTCATCTCCATCTTCCCCTCCCATAGCGTGT 
GGGGGCTCGAGCTCTC^GCTTGCCTCAGCAGTCTGAGAACCACAAGTT^ 
TCAAAGACATC^ATCCACTCAGAAACAGCATTCAATGGTG 

3 0 TTCTTACAGTGACTITGAACTTCTGAGCATTAGAGATGAGTGAATCGCA 

AGGTT AAGAG ACTTGCCCAGCGTCACACAGATACCACCCAT ATCAG AG CG CTCAGCTTTCCTTTCCCCTCCTCCCCAGCTTGGACA 
CATTCCTAGG ATTTGG ATGT AGCAATGTGAACCGTCCTG AG CAGTCCCITCAAAACAGCACTTAGG CGAAAG AGGGGCTGTTGGAA 
CAAGACAGGACAAACACCCGTGTG C CT CTG CAC GCTG CT CC AC C CAGTTT CT CAT CTTT AT ATT ACAG AAAAAGAGTCTC AAAT AA 
CCCAAACAC^CAGGGTATGATCTGCAGACAGTCTGCAGGCATCTGCCCAAGAA 

3 5 TATGTTGTGTGTGTT AAGCCTT AAATTGTTGAGTCTCTGGCTGGCTG CAGTGGT ATT ACTGGAGGG CTGGTGCATGTTTTCGGCTC 

ACAGAGCTGG CTCCGGTACTGG AGGTGTTGGCCCC!GAATCAAGTCCAATCrCACACCTGTCT CCTTGTTAG AC 

ACATCTCCAGT CGAGG AGGTGCAGGCTG CTT ATTTTTCACATTCAGAACCAG AAGGAGGGGGGTCTG 
AGCCAAGGGGAGACGGGAAATGCTGGGGTCAGCTGCCTCTCGGGCCCTT^ 
TTTAATAAACTTCTCCTTACTrTTGGAAATTAAAAAACAAACAAACAATACTAAC 

4 0 TGCAGAACAAGGAGCTCGTGGCCTGTTGTAGGATAAATTCCCAGTCATCATGGGAGTG 

TAGGCAGGGGCGCAGCCCCCACCCCACCCCAGCACTGAAACTTCrAGCTCT 

TCTTCTGGTTTTCCCTCCACTGCTGACATTTCA.GTGACAGCCGCAG 

TGATGAAACCGCAAACrGTTCTCTCCTGGTTCIAGCCTTCCTGTCCCCTG 

TCTGCAGACACTTTGCAAATCCAGTTCTCAAGAGGCCCTGACATGA^ CTGG CA 

4 5 CCCCTCAAGGGTGATGAT CGCTCTGAAGGATCATTACTTAGTCIAAAG CC!ACCCTG AAATGACTTTCT 

TGGAGGCCCATAGTGTAGGAGTCAGGAAAAGAGCTTCCCAGAGTTATCAAAAACCACTCTTTACT^ 
CCTGTGTAGAAGAGCTAAGAAAATCTCAC!ACATGTAGTGGCCATGTCTTGGGGGCTTGAG^ 
TTGTGATCAGGTAGGCTCTTACCTGAGCCGGGAGGCATTTA^ 
TCATCCATACACAACITATTTAAGCACTAAGCCCCATTC 
50 CTGTTTATTTCATTTGAGACTGAGGACTTACCAGAGATGAGG 

CCACCCTTGTATCTCTX3TATGTACATCAGTTCACAATCCCAGAGAAACA 

GTGAATAG CAATGAGCAT AGG CAACCTGGACTTCTGGCT ACACTCCAG CCTG AGTTTGAATCCCCTCTATGTGACCTTGAGTG ACG 

TCCTGGTAGAACCTGTCTGAATATGTCACCTTTGGAGGAAAGTGGCCGAAGAAGCAGGTC 

5 5 GGGAGGCCTGATACCTATGGAGACCATGGAACCACCAGTTTCCTCTCTCCTTCT CTAGCCATGCCTCCCTGCCTTGTTCTGGGCCA 

CAAC^TGCTCCATTCTTCCGAAGCTCATTCTTGCTCTGAAATG 

AGGTGGGTGAATGCTGCAGATCrTCCTGCTTGATCTCTGAACACCCATGACCAGTCCTGCCCACAAGCT 

CCTACCCAGAAAGTTCCTTCCTCrCCrCCCCACCACATCCCCGAATCCTGCCATTCTTCAGGGCACCT 

CAGAACATCTTTGTCCTCACTGAGGCCCAGACTGACTCCCTTCCCGACTCCT^ 

6 0 CCCC ACACATTGCCTTCTGCTGTTAC CAGATAGGCTG CATGTTTGAGTGACGCTTCACCAACTGTCATGGCT 

TCTGTGCTGTAAGACAGTAAAAACTCAACCACCACCACCGAGATGCC^TCCAGGCAGGCAACC 
ACACTCTTCTCCCCAGAGCACATCIAGGGTTCrGAGATCAACAAGT^ 
CCAGGACCCCCACGAACCTAGCCCAGAGCTGCAGAGAAAGGGGTGCATGAGTC^ 
GCAAGTCAAAACTGTTTGTTTTAGGTTTTAAGTTAATTTCAGAATATCCTGCCCT 

6 5 CTCTGCTGGGGCCCTTTGGCTGTTTCCTTTCTCCATTCTTTCCT^ 

TGGCTGGCACCACCTGCCCTTTCTTGGGCCATTTCCCTTCTTGTTTGTGATGAGGAG 

TTTCTGGCCCTCATCACrTTCACAGCCrrCTGCTAACTAAGCT 

CTCTCTCATCCCAGCTTCTTTCTTTTCATCTTCCTTCCT 

CTGCTTTC CACATTT T G T AAGACATCT CGG CCC CAGAGT AAG AGTGCATGATTTAC CT ACCACTTC CTTGGATCCCAACCC CT ACT 

7 0 CACTGCTAGCTCCrC^CAC^GGCGTTCTTCCTCCTTCCTTGGTGTGCAGATGTGCCCCT 

TAGATTGTTCTTGGTTCCAGCACTTCTGAAACCCATCCTTCCAAGTTT^ 
CATGGAAGCTTCTTTCCCACACCCACCCrGTGCCCTATGGCGGG 
TGCCATCCTGCTC^CTTCACTTGGTGCTTCCTCCACAGCAACTACAA 
TTCTCCTAGCAGAATTACATTTTAATTTTCAGCTtiA^ 
7 5 ACACACAAACAAAAAAAAGTCTGGGTCAGCAGAGGGGAGACCATGGTC^ 
TTCTCTTCCCTCCACCTCTGTTCCTCTGAAACCT C I Til Ul TCAAATCflJUKSGGAAGGTAATTCAGAGGTGCTGTTGAGACTGGTT
TGCTTTTTAATTTTTTTTTTCAAAGAAGACGGGAAGTTCCT 
TCiVGCTATAGGATGATTCCTGGCTGCATTTTTTCGGGCrTAGT'AC^ 
AGTGAA(^OUU*GGGAGATGTTATAATAGGCmi\ n ^^ 
5 TGAGGTGTGGACAGAAGCCTGATGTTATCAGGCCCCCAGAAATCCTGCCTGC^ 
GACGGAGGCAACTGGTCCTGCTC^GCACAAACGGTGCTTTrAGTCTTTCGTA 
AGTTTGCCTGGGTAATTGCATTTCATTGCTGTCTGGCATGTGCT 
GCGAGACTGGAGGAGAGCGAGCCTX3TGAATTTAATCCACTTGAAAGA 
ATGCTAGTCGAAAAGGGGATGGTGCTTTCCATAAAAGCATTTAAAATC 
1 0 ATTAGAAGAAAAGGCGCACACAGGTGACAGTGTGATCCCTGTCGCCG 
CCTCAAAACACAGAAGGCAAAAGGAATGGTAACTTCCATCCCT 
CAACTTTTATTTGAGATTTGGGGTCATTTTGGCATCGTTTC^ 
CTCACCAAGCTCAGGATCTCACAGAGCTATAAGCAGGCAA 

TT CT T TT T 'l' Tl 'T GAGGlTT AGCT AGGT ClT ri T GTTTCTGT CAGT AC ATGGT AACT AG ATTG ATG GGAGGG TT ATAGTTTCT AACT A 
1 5 TCATGTAGTTAAAGC^GT AGTCT ACAAAG AGCCAGAAGGTCTGTCC CTG CTCTGTTGCT ATCTGCTGTGGAGAATAGGGGGAGTCC 

CTTCACCATTCCGCACTGCAGCTTTCTCTGAGATGGGGCTGGCATG^ 
TTCCATAAGACATTAGAATGAAACCTTAAGTAATGTTGACGGTGATTTC 

GCCCAGGAGAGGCAAGTGCTAGCCTGTACCCTCCCAG ATGG CAAAG CCG ACGGTCAG CTCAGGAGCAGTCTTGTTGGT AGTTCACA 



2 0 CATGGGGATGGAGTTTCC CCCTGGGTGCATCATGACTGCAG CC^CCCAGGATTCTCACAAGGGCTCCAGAAAGTGCTCACCACACA 

CCTCATCTCCTCATAC CCCTCAGTTCATTTGT ACT^ CACCT 
CTAGAAGTCACATCTTCCGTCTTGTCATCATGGTAGCTGAAACCATTGTGGCAGTCT 
CACTTTAGGGCACGCTTGATCTTGTGGGCAGGATCATTCrCTC 
CCTGGCCTCCACGTACTAGATGACAGTAGCACTCCCCCTTCCTCAGTTC 

2 5 CCCTGGAGGAGAAATC^CCCGTTTGATGATCGCT 

ATCTCAATGCCTGAAAAAAATGTGAGTTACTTAGCTATCTAAACTTG^ 
AGCATTCCTGTCTTCCATAGGTCATCCCTCTTGTTGAGCAAGAGTTTCATAGTTC 
TTAAAAAAAATTACTCCATTTTAGCCCAAGTATATGGCTTCTATGAGAAAGCCT^ 
GAGGGAGGTTTCCCACTGCTGGGCTGAGTATCATCTTTCTCAGGGTGACCrrCATCT 

3 0 CATTTACTTTCTCTAAGTCAATAAACGAAGCTt3TGGATCT 

CATTTTTTAAATCGAGATCTGTGTTGGAGTTTAAAAGAATTCTGGCAGATTTCC^ 
CGAGTGTAGATTGCCCCTAGATGAATCTGACTTTATTAGTTTGTGTCCTTTGTTCAG^ 

AATGGCrCTTATCTTCAAGTGTTTCCTTT ATGCTG CAAATTC C ACCTCACCAACCTGTGTT ATTAACCCACTGCAATCCCAATTAG 
TTCACATCAAGGTCAACTCTTTGGGGTGTGCAAAATGGCTGCA^ 

3 5 GTGGGGGAGAAGACTGCAGCAGGGGAACTTAGAAGAAAGAGGGAGTTTTCTGATACCATAGAAG 

GCTCACAAGCCAAAGAAATCCAATTATTCCAAATAATTCAAAAC^ 
CTTATTCACTCTGTTGCTTGGTGGCAAATATTTAAGAAATTTCAAAAAATAGA 
AGGTTTCTTGGGGGCATTTCTCGGGGX5TATAGAGGGCTGTTTCTTGGAGGCTA 
GTATAATAAACACTACTCCTCCAGAAATATGTCTGTGAAAAATAAGCTGGAGCGGGTGGGTCT 

4 0 AAGGCTGCCTGTCCTGCCATCTCTCAGGAGCTCCCATCAGAACAGACAGTAAAGACA 

GG CCC CTCCTAGTGTGAAGAAAATAT ACTTCCTGG ATCTATCATTATCTCTAG CATATCAG AGGGGGG AATAAAAACAACTGGGGT 
GWCAAGGCCCTGCTGAAAGAGAAACTCATTGTTCTGCACATTCAGAGAC^ 

AAGGGGCAGCCAGGGAATCACATAGAGTG AAAGACCTGTGTG CAGCGG CCAGGTCAGAGGGCGTGTAG AAGACGACATGAAGCATT 
CCCACAGGG CAGTTCCCTGAGGTTCTGTCCTCCAGCTCTTGCTG CAAGGAATT CTAG AAGCCTTGGCCTGATGTGTGGG AAAGGAA 

4 5 AACAAGGTGAAGAGGATGCAGGTAGGAAAAGAAGAGCTGTGACAAGGTAGTCAG 

AGAGCACAGAAGCAGCCCTTTGATTTCACTCGCGTTTATTGAAGGGGACGAGGC^ 
CAGGATATCCTTCAAATTGGGAGCATGGAATGGAAGCTCTTAAAGAAAGAACGGTTGCCATTCATTGT 

CAAAGAGCAGTGAATCCTCG ATTTGTAAAATGCAGCTTTCAGAATCTGTGTTC CAACACTGCTGTGAGG CCC 

TGACG CAGCTGTGTTCTGCAGGGTGGCTATGGTGCTCCTTGTGAAGGAGGAGG CTGCTG CTCTGCAGG AGGAAGGGCTT ATGTTGC 

5 0 ATCCTAGCGGTGCATGCT ACCCCGCTGGAATAGTG CAC CTTGGAACTCTGGCTGCCT AAGG AAGGTCTCATTTCTGGTGG CTTTGT 

G AGACCAAG CCTCTCCCTCTTCTGTTACCTGGAGCAAC AA 

AAGGGCAGGGGAGAGAAG ACrGTTTCCTGCCTTCAAGGTGCCAG CT ACTG CATTG ACAAT ATCGGGCAGAGGGGCACATTTCACCC 

TCCCAGGAATCTAAGGGACAGATGCAATAACATCTCCTTATGGGGGGAAACTGAGTCTCCAAGGCATTAAGTTAC 

CACTGTGAGTGAGGGGCCAAGCTAGGGTCTCTGTCCTTACCT 

5 5 CTTTTGGCTTTTAGAGAGACAGCTCrGGGATGAGGTTCCCAAATCCTGAAA^ 

GTTAAGCCATATTTCATGTTTATTGTTTTGAATT 

TGCCT CAGAGTTTGCTG CTT AAAACCAG AATC CCTGGCATAAAAT C CAAT AAG CCTG CCTT AG GG C CCATGT CAT ATGGAAG CAG A 

AGACAACAGAGACrrGCTTGCTGGGCTTCAAGCCACTCT 

AAGTAAAAACTAAGGCTGCAGCTAGX3AGAATGCCCACTGA 

6 0 CTGCTCCATGCCCCCTCGGCCTCTCCCACCAGAGGTCTGGCT 

TGGATGGGGTAGTCACCAGAGGAAGCCGAGGCTGCAAACTGGCATGACACACACCTGTGTGGAA 

TCAAAAGCZATCGTATTCTCCAATATCGAGAACTCATAGGT^ 

CACTTCTTTTTTTrTCTTCCGTGCTGACACCCrAA 

CCCCAGCGACAGCTCTTAACTGTATCTCTTTAGCCAGAGATATCACAGGTGCCT 
65 CAGTGGAGATTGGCTGTAAAGCACGGCCTTCCIATGCCCAGCCTC^IACT 
AAGCATAGACTATQIAGCGTGCACAAT^ 

AGTGAGTATCCTCATTCTCC^CCAAGGGTCCAGTCTCAGG 
CTCATGTTTTTCAAGGAACTAAGCGGGGCAAAGGTGCCAGAGGAGTAGA^ 

7 0 AGGACATGGAGGAGTGGAAGGAAATAAATAGGGTGGCCATAGAAGAATCTAAACATCCATGA 

CCAACCTGTCTGTAGAGCTCCTTCAGGCCTGGAAAGTTGTATACGGGGGGC^G 
ATGGAAAAGATGTCTGCAACAAATGTTTATTTAACAAA 



AACTGAGAATGAGACATGTTCTAAAAACACCCATCTT"ITCT^ 
7 5 G CCT AAAT ACGTAGCCCTTTCCCAG ACGTGTCT CAGACTCAT CTGG CTG AGGG TAG CAG CCTCGTGGTG CAGGGTGAACTGTTTCC 
CCTGGGAACCGTCTGACAGTGCTAATGAGGAATTGCATTTC
aUACATTAATTAACTCTC^CACCTGTGAGGCAAGTGAGCCTGGT^ 

AAAAGGACTC CAGGGTGAAC CAG CAGGGTTCGGGG CGGCCTGATGTG AGG AAC CAGCATT CTTGATTCACAGGG C CCTGACCACAA 
AGCCACAAAGCTTCCCCGGCGAGGTGATGAATACGGGTCCCGGAAGCAGGGAGTGGT 
5 CTTCCCGCAGGCAGCCTGCCACCTCCCAGGCTGGCAG^^ 

ATGACTAACCAAGATGTCTGCAGCCGGCCAGAGACGAACGGTTGGCAAATCCTGGA^ 
TCCAGAGCTGGTGCTGACTGACTCATGTTATCATGGGATTTCAATGATATAGCAACAGTG 

GGT ACCTGT AGT AGG AAT AGGTG ACTG CTCTTCTT ACTTGTGCTGTG CCGG AGCTTTG CTC CCTGCTGCAGAGG C CAG AGGAAACA 
1 0 CTGCTTCTCAAAGACAGTGGAACTTTCAGGGAAATGTCCCTTTCAGAGCAGAGTCA 

ACATCACACACCTCTGACAGATGTGATGCTGGGAAGCATCTGGGGTCACCCCCTTTGTCTG 

CAGGGGTCCCAGACAGTGAGATGCAAACCACCACTGGGGCTCGCCCTAACACATCACCCTGGAAACTG 

TGAGTCACATCCTGTTGTGTGTTCTGAGGTCTCCAAGAGGGCATTACACC^ 

TGGGAACGTCCCTACCTTCCTCCAGGATGAAGGAGGGAAAAAATTCCTCCAGGGAAATAA 
1 5 TTCACCAAGCTGATTTGCCTTTTTCAAAAATAGATGG 

TGACTG AAAG ACCTGG CGTTTTAAAGATGTCAGCCACACAATCAAGGAAGGGGCTT^ 

GAAGGGTGCCACCGCTCAAGGGGTGCTGTTGGAAACCCAGCTTCTGAGGGTGGGTGTT^ 

GAGGGCCGGGAGGGGTCTCTATCCATCTTCTCCCAGCGCCTGTGGGGAGCACGAGGACAAAGAGGGATO 

AACAAGATATTTTGTTCTATTGATTTTGTCTATTATTTTCCTGTACTITAATTTCACT 
2 0 CT LTT CTG CTTGCTTGAGGT CTGT ATTGCT ATTTT CT CCAGT CTCCTGAGGTGGAAG CTTCAGTGACTG AITC^r GG AT LTTT CTTT 

TCTAATATATGTATTCAATGCTATAAATTTCCCTCTAAGTACTGTTCTATCTGCATC 

TAGTTCAAAACATTTTTTAATTTCTCTTG 

TTGGAG ATTTTCCAGTT ATCTATTATTGG CTTTTT AATTTAAAT C CATGTGGTCTGAGAAAAT ACTTTGTATGCTTTCTATACTTT 
TAAGGATCTTAAAGTGTGrrri'GTGGCCCAGAATATGGCTTATCTTGGTGAATCTTCCA 

2 5 GTTGTTAGAGTATTCCATAAATGTTAATTAGATCAAGTTGACTGGTAGTGCCGTT^ 

TTGATGTATCAATTACTGAAAGAGGGTTGTTGAAGTCTCCAATTGTAATAGCGTATT^ 

TTTTGCCTCATGTTTTTTGACACTCAGTTTTTAGGTGGACATAT^ 

TCATGTAATTCCCTTCTTTATCCCTGAAAATTGTCCTT^ 

TTTCAATCAGTGTTAGCATGGTGTACCTTTTTCCATCCCTTTACTTGTAATCTC 

CCT AAGT ATTAAT AT AACTGCAG T AG T ATCAAT CAT ATTTAT AACTTTT AT ATTTGTTG CACCTGATT CTTCCC CCCACAC CT CTT 
TTTCTACCTTTTATAGGTTTAACTGAACATTTTATATG^ 

TGTTTTT AGTT ACTG CCCT AAAGTTTG AAAAGT ACATGT AT AACT AATCT AC^ CAT CAAATAACT ACCACTTCTT CCTTCC 

TGTTTATATCATTACTGCCTTTTATTTCATTTATCCACATG 

3 5 TTTCAGATCAGTTAAGAAAAATAAAACTTAATTTTACCT 

TGACATTTCTCATAGC^CAGGTCAGCTC^CAATGAATTA 
GGATAATTTTGCTGAATGCAGAATAATAGTTTGGTAGCTTTTTTGTTC 
GTTTCTGAAGAGAAAGATAATGTAATTCTTATCCTTTTTCCTCTATGGATAAG 
AGATTTTCTCTTCTCTTTC^TTTTTTGCAGTTTAAATATGATATGCCT 

4 0 GAG CTTT CTGGATCTG TGGTTTGGT ATCTGTCATT AATTTTGGAAAATTCAT AG C CATTAAT ACTTCAAGT ATTTCTT CCACTGTT 

TTCTTTTTCTTCT C CATCTC-GT ATTC CAATT ACACATT ACAT CTTTTG AAAC TTT CT CACAGTTCTTGGATGTT CTTT CTTTTTC C 

TCCTCATTTCTGTTAGGGTGCTTTTGGTTTTTAGAATT^ 
GTAATCCCAGCACTTTGGAAGGCCGAAGTGGGTGGAT(^CCT^ 

4 5 TGTCTCTACCAAAAATTACAAAAAAAAAGAAAAAATAGCT 

GGAGGAGAATCACTTGAACCCAGGAGGCG^C^TTGCAGTAAGCCAAGATC 
ACACTGTCrCAAAAAAAAAAAAAAAAAAGTTTCCATCTCCCTGCCTACATCATCCATCrrCT^ 
CTAGAGCCATTAACATGTTAATCATTGTTAAATTCCCTGTCTTGTAATTCC 
TTCCTTTATCTCTTCAGACTGTGTTTTTTTCT 

5 0 AACTTAGTAGGCCTTTAGCGTGAGTTATTATTCTCATCTACCCAAGAGTTGGG 

TTCAAATTCCTCTAGCGTCCTTGTTTTTGTCTCCCATGTCATCTGTGAGCT 
CTCTTTCAGCCATAACCCACTGTTATAATAATATATTGGATCCTGTGAGAATCTGGTGGAGT^ 
TGCACTTCCrCCTGAAACTGTAGCCCCTAGGAGTGTCTTATTCGC^CACT 
CCATTAAGTGTTCTCATCAGTTTATGAATTC^AAATTCTGCTCCCAGCAA^ 

5 5 TCTCAAAATTTGGGGTTGCAGTTTGCCCTGCAACT^ 

ATTTTTT CTT AT AAAAACAAGAG TGATG ACT CT CAAG CTCTTT AAATT TTGGAACTG AAACT AGAAG CAACAG GACATT CT G ACAT 
GAATTArrTAAAAGGGTCATTGTTGACTCTAGAGACATTTTAATATTGCT 
GTGATATATGCCGTGTACACCTCTGAGCTCACCACTAACCTCAACT 
GGGGATC^AGGAGTAGAAGGCCTGTTTTACCCATTATTGATGATTAAAA^ 

6 0 GTGGCTGCATCAGACATCATTTAGCCTGG 

ATTATTATTTTTTTTTTTCTGAGACGGAGTCTGGCTCTG 
CCACCTCCTGGGTTCATGCCATTCTCCTGCCTCAGCCTCCCGA^ 
TTTGTATTTTTAGTAGAGACAGGGTTTCACCGTGTTAGCGAGGATGGTCT 
CCCAAAGTGCnWGATTACAC^TATGAGCCACTGCGCCCGGCCCTGAATCCAGTATTTT 
S 5 ACTGAGTTCCTGCCTTCTCCATCCTAATCCCAAATTTCCAATCTTTGGAAATTTGCCC 
TCCTCTCCAGCCntlAAGAGTGTGTCTGTCAAGACTGTAGCTGTGA^ 
TGTGGTCTCACCAATGAACATCAGATAAGTCCCTTGGCATGGATTTC^ 
CTTATTGAGCATTTACTATGGCCAGAAATTGTGATACTTGC^ 
AGAATTCTAACTATGAATCAGAGAAAATAAGTGACCTGCCTAAGGTCAT^ 

7 0 ATTTGTCTGTGATTTGAGTTCGGTG CTTTCCCTAGTCTCCT CCTCTCAG AGGCAG AACATAAGCATGTAATAGATACCACT AGGGT 

TCTTGTTTCTACCTTC^GAACAGCTGTTTGCAAGTCCAAAT^ 
CCTAGTAGCTGTAACTTCCAGAAGGCAGGAACCTTCCATTCACTAGTC 
TGTTGGAAATATGAGTGAATGAAAAAGTAATTTCATGAACACATGAATGAATAAATGAATG 
TGCCACAGGAAGATGCCTTTCCAAGCIATCACA^ 
7 5 TCTTATTCATCACTCTCCTGCCAGGTAAATTAGCACATTCCTGGCAC^ 
TTGACTTATCAGAGGCAACTCTTTCAAGAACAAGTAGCCCTGTC

GCCTTTTCTGGATAACTCCCATCGTACTCrCCTCTACTTCCTCTTCCT 

CCCCACATCTGATTC 

HUMAN SEQUENCE - mRNA

GCGGGCGCGGGCGGGACGGGCGCCCCGCGGCCGGACCCAGCCAGGGCACCACGCTGCCCGGCC 
CGGGGCTCCTAGGGACGCCAGAAGGAAGTCAACCTCTGCTGOT 

AATTGTTTTCCTTTTACAACAGGGGGAGAAAGTGCT 

1 0 GCAGGATGTGCGTGCGTGTGTAACCCG AGCCGCCCGAT CTGTTTCGATCTG CGCCGCGGAGCCCTCCCTCAAGGCCCGCTCCACCT 

GCTGCGGTTACGCGGCGCTCGTGGGTGTTCGTGCCTCGGAGCAGCTAACCGGCGGGTGCTGGGCGACGGTGGAGGAGT 
GCTGCTGCCCGAGTCAGGGCTGAGTCACCCAGCTGATGTAGACAGTGGCTGCCTTCCGAAGAGTGra 
CTGCGGCTGCTCAACTCCCAACAAACCAGAGGACCAGCCACAAACTTAACCAACATCCCCAA^ 
GCTGTAGAACCCTGAGTGTCATCGACTGGGCCrrrCTTATGATTGTTGTTTTAAGATTAGCT 

1 5 CTGCACTGAGCGTTTTGACAGAATTCATTGAGAGAACAG 

ATATTTTATGAAGTAGAGCATATGTATATATTTATATATACGTGCACATACATTAGTAGCACT^ 

CTTTT CGGGACTG AAGC CAGTTTTGCATG AT AAAAG TGGCCTTGTT ACGGG AGAT AATTG TG TT CTGTTGGG ACTTT AGACAAAAC 
* TCACCTGCAAAAAACTGACAGGCATTAACTACTGGAACTTCCAAATAATGTGTTTGCT^ 
2 0 TAGGAAGTGTATGAGAATTTTCCCTTCAGGAACTTTTCT 

ATAGTCTCCACTTTTTAATGCACTAAGCAATCGGTTGCTAGGAGCCCATCCTGGGTCAGAGGCCGATCCGCA^ 

TCCCCTCCTGGACTGTTAGTAACTTAGTCTCCCTCCTCCCCTAACCACCCCCGCCCCCCCCCACCCCCCGCAGTAATAAAGGCCCC 

TGAACGTGTATGTTGGTCTCCCGGGAGCTGCTTGCTGAAGA 

AACTC^TCGGCTTGTTGTGATGCGTATCCCCGTAGATGCCAGCACGAGCCGCCGCTTCACGCCGCCTT 

2 5 GGCAAGATGAGCGAGGCGTTGCCX3CTGGGCGCCCCGGACGCCGGCGCTGCC 

GGTGGAGGTG CTGGCCGACCACCCGGGCGAGCTGGTGCG CACCGACAGCCCCAACTTCCTCTGCTCCGTGCTGCCT ACGCACTGG C 
GCTGCAACAAGACCCTGCCCATCGCTTTCAAGGTGGTGGCCCTAGGGGATGTTCC^ 
AATGATGAAAACTACTCGGCTCAGCTGAGAAATGCTACCGCAGCCATGAAGAACCAGGTTGCAA 
CGGTCGAAGTGGAAGAGGGAAAAGCTTCACTCTGACCATCACTGTCTTCACAAACCCACCGCAAGTCGC 

3 0 TCAAAATCACAGTGG ATGGGCCCCGAGAACCTCGAAGACATCGG CAGAAACT AG ATGATCAGACCAAGCCCGGGAG CTTGTCCTTT 

TCCGAGCGGCTCAGTGAACTGGAGCAGCTGCGGCGCACAGCCATGAGGGTCAGCCCACACCACC 
TGCCTCCCTGAACCACTCCACTGCCTTTAACCCTCAGCCTCAGAGTCAGATGCAGG 

GGTCCTACGATCAGTCCTACCAATACCTGGGATCCATTGCCTCTCCTTCTGTGCACCCAGCAACGCCCATTTCAC 
AGCGGCATGACAACCCTCTCTGCAGAACHTTCCAGTCGACTCT 
35 CCCCGCGCTGCCCTCC^TCTCCGACCCCCGCATGCACT^^ 

GCATCGGCATGTCGGCCATGGGCTCX^CCACGCGCTACCACACCTACCTC 

GGCCCGTTCCAAGCCAGCTCGCCCTCCTACCACCTGTACTACGGCGCCTCGGCCGGCTCCTACCAGTTCTCCATOT 
GCGCTCGCCGCCGCGCATCCTGCCGCCCTGCACCAACGCCTCCACCGGCTCCGCGCTGCTCAACCCCAGCCTCCCGAACCAGAGCG 
ACGTGGTGG AGG CCGAGGGCAGACACAGAAACTCCCCCACCAACATGGGGGGGGCCT CCTGTTCGCGACAAGCCCG CCGGGATCCC 
40 GGGCCCTGGGCCCGGACACCGTCCTGGGGCCGAGGGCGCCCGACGGACAGGATCTCGCTGTAGGTCAGGCCCGCGCAGCCTCCTGC 
GCCCAGAAGCCCACGCCGCCGCCXBTCTGCTGGCGCCCCGGCCCTCGCGGAGGTGTCCGAGGCGACGCACCTCGAGGGTGTCCGCCG 
GCCCCAGCACCCAGGGGACGCGCTGGAAAGCAAACAGGAAGATTCCCGGAGGGAAACTGTGAATGCTIT 
GAATAAAAAGAAAGATTTTATACCCTTGAGAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING

ATGCGTATCCCCGTAGATGCCAGCACteAGCCXSCCGCTTCACGCCGC 

GCCGCTGGGCGCCCCGGACGCCGGCGCTGCCCTGGCCGGCAAGCTGAGGAGCGGCGACCGCAGCATGGTGGAGGTGCTGGCCGACC 
ACCCGGGCG AGCTGGTGCGCACCGACAGCCCCAACTT CCTCTGCTCCGTG CTGCCT ACG CACTGGCGCTGCAACAAGACCCTGCCC 
ATCGCTTTCAAGGTGGTGGCCCTAGGGGATGTTCCAGATGGCACT 
5 0 TGAGCTGAGAAATGCTACCGCAGCGATGAAGAACCAGGTTGCAAGAT 

AAAGCTTCACTCTGACCATCACTGTCTTCACAAACCCACCGC^AAGTCGCCAC 

CC CCG AG AACCTCGAAGACATCGGCAGAAACTAG ATCATCAGACCAAG CCCGGGAGCTTGTCCITrTCCG AGCGGCTCAGTGAACT 

GGAGCAGCTGCGGCGCACAGCCATGAGGGTCAGCCCACACCACCCAGCCCC 

CTGCClTrAACCCTCAGCCTCAGAGTCAGATGCAGGATACIAAGGCAGATC 

5 5 CAATACCTGGGATCCATTGCCTCTCCTTCTGTGCACCCAGCAACGCCCATTTC^ 

TGCAGAACTTTC CAGTCG ACTCTCAACGGCACCCG ACCTGAC AG CGTTCAGCGACCCGCG CCAGTTCCCCGCGCTGCCCTCCATCT 
CCGACCCCCGCATGCACTATCGAGGCGCCTTC^CCTACTCC^ 

GGCTCGGCCACGCGCTACCACACCTACCTGCCGCCGCCCTACCCCGGTTCGTCGCAAGCGCAGGGAGGCCCGTTCCAAG 
GCCCTCCTACCACCTGTACTACGGCGCCrCGGCCG^CTCCTACCAGTTCTCCATGGTGGGCGGGGAGCGCrrCG 

6 0 TGCCGCCCTGCACCAACGCCTCC^CCGGCTCCGCGCTGCrC^CCCC^^ 

AGACACAGAAACTCCCCCACCAACATGGGGGGGGCCTCCTGTTCGCGACAAGCCCGCCGGGAT^ 
GTC CTGGGGC CG AGGG CGCCC G ACGGACAGG ATCTCG CTGT AG 
Table 31

MOUSE NOMENCLATURE
ICSGNM Il2ra
Celera mCG9569

HUMAN NOMENCLATURE
HGNC IL2RA
Celera hCG23482

MOUSE SEQUENCE - GENOMIC

TAATGATTAATAATAATAATAATAATAATAATAATAATAAG 

CCCCTTGTTGTGCATGGAACCCAGTAGAAGAAGTGGGAAGAATGTGAG^ 

TCTAAATCAACATGAGCAAAGCACCTATGAAGAGTGAAGCAGGACTC 

CAAAGGTCCTGAGTTCAAGTCTCAGCAACCACATGGTGGCTCACAACC^ 
1 5 A(1ACAACTACTTACACATAATCAATCAATC^ 

TGAGTT CGGTT CC CGG CAACCACATG AAGG CT CACAACTGTCTGT ACAG CT ACAGTGT ACT CACAT ACACAA 

TCTTTAGAAAAGAGAGTGAAG CAACATGCACAGGGCCTG CATGGGTCTACACCAGGTCCTTTGCATGTAT ATT ATGGCTTCCAGT^ 

TAGGGTTTTTAAAGGATTCCAAAATGTGTGAACAAGGTGGACTTGATTCT^ 

TTGTCTAAATCCAATGTGTTAGTTTTTTGTATAGTTAAGGTTATAGTTAG 
2 0 TGTTTTCTAATGAGAGAC^GAAAAGGAGTAGATCTGGGAGGGAAGGGATAGGAAGGGGACTC 

T AATCAGGG TAT ATT AT AT GATGGAAAAAT CT ATTTTCAAT AAAAGG AAAT AA(TTTT AAAG AAAGAT AATT AGGG 

TCAGGTGAATAT ATGAATGTCTGAAAATG T CATAATGAAAT ACATG ACTG T ACACT AACTTTTT AAAAAAAGGAG TGAT CTTGGG A 

CTACAGACTCTTCAAAGATAACCAAAAAAAAAAAAAAAAAAAGAGAGAGGAGGCTGTTC 

T AGGGGAAAAAGTGGGG AAG C CATGTTGAAG AGGGT AGT CT AAGG CAGATTT CTTTGGGAAG ATTTG C CTGGAATT AAT AG ATGAG 

2 5 ACTGGAGGAGGTGTGAGGGCTGGCTCAGTAGCTAGGCACTTGCTTAGCATGT 

AAAAAACATAAAGGATTGCAGAATCTTTGAGTCAATGATGTCAGTCGGA 

GTTACATAGACGTCTAGACAAAAGGTGACACAGACATGACTTCACCTTCCTCCCCAA^ 

CAATCTTKSCTTCCAGCAGTCCCTAGAGGTCACCAGTGCTGG^ 

ACCACATCAGAGGAGGTCAGGGAGCCTTTCCTGATACCTACGTGCCAAAGACACAATAG^ 

3 0 ATCAGCCAATG CAGT ACCTG CAG T AAAAGCATG AAGACCT AGCTTTG G TG C C CAAC CCAACATT AGTGGTG CCCT AT AAT CTT ATT 

ACTAAGAAG ACAGACAGGATCTG AT AG CCAAGTAGTTGAGT AAAACTTGG CAAGTCCCAGGCTCAGTGAGGGACCGTGTCTCAATC 

TCTGATGAAGAGTGATTGAGGAAGAACCTCCAACAACAACTTCCTCTGGTCCCCAGACACCT 

GCGCGCGTGCGCACACACACACACACATGCACAGGTATGCAC^ 

ACACACACATACACAAATG CAGACACACACAAAAACAGT C ATGTATACATACAAACACACATACACACATACAGGCATGTGTGTGC 

3 5 ACACACATGTACATT CATACACACAT AAACAC ACATGCACATGTG CACAC^CAGCCACACATGCA 

ACACA(^CACACATAC^CACACACACAC^C^CACACACAC^^ 
GCAAAGCTGCTGTTTGGTTTTCAGTGTAGGGACTGAGTTTGTGAATCrTGACAA 

AACTCACCAGTCAGATGGG CCCCTCTGGAGCTCAGCATTTG CACAG AGGACAGCTCCTTTGCGCATAGTGGATTCTGGTTTT CCAC 
AGGACCCTGTCTAGTAAAACCTAGTGGAAATTACAAGCTGAGAAATTCAGCCTTGTGCC^ 

4 0 CACMAATTAGCTGCTGTCTTCTCTTAACGCCAATTTGAGAGAAAGAAGC 

TATTCCCATATATCTGCAAATGATGGAAGTTATCAACCACAGTGACACTGTCTTTCTAGCGGTC^ 

CTTGCCTCTCTTTTC CACCATGCT ACC CCCTTGTTGGCTCCTTTGGG ACTT ACC CATTTGGAATGTGAT ATAGTTAAT AGGGACCT 

GTCTGGCCCTGCCTCCCOWCTCCATTTGAGAGCAGGAATCCTTTTCAAT^ 

TTCTAGATCCTTAACTAGTATTTTATGGAATTGAATTCTAGAAAAGTACATGAACTGTGTA 

4 5 AGACATCCAAACCTATGTCCACGCAATAAAGACTCCGTGCCCCATAGGCTGACA 

TATATAAATGATGAGTTCCCTGAGCTACTAGCCATCCTCCAAGAAAACAAAGGCCCCTTTCCT 
TTCCCTTCCTCACCTCCAT1TTCTTCCCCTCTCTTTGCACCTACCCTTTCCAGTCCTCT 
CTTCClT T ClTCTCTCCCCTTTCTCTCCCCTTCTCTTrCT^ 
TGCTTTTCCTCGCTCTCCCTCCACACCATGCCCCTCTCCTCCTCTCT 

5 0 GGCTCTTCAGAGAGGGAAGTGAAAAGACAGAGTGAGGGAGCTTTTATGTGGTGTCACT 

CAAGGATCCAGTGAGCAACAGCTTCCCTTTTTAAGG^ 

ACTTCTCTACTTCAGAGCCCAGTGTAAGAGCTTGT^ 

GAAACACTTTGACCTGAGTTTCAGTTCTTCCTCTCTGTAGACTT 

AATGAATAGACAGGGGTAAAGGGGAGAAGAGAAGAGAGGGGCGAGAGAGAGACAGAGACAGAGAGACAGAG 

5 5 GAGACAGAGAGACACACACAGAGAGAGATCGTGATACTCTGAACTGCACAC 

CTCAGAATTCTAACACACCCTCAAACCCGATTCAACTCCAGAATGTATAAGAAAA 
GTGTTGAAAAAAAAGAGTTACTGCCACAAAAGGAACTATAAACAATTACT 
GAGACAGGATTTCTTGGTGTAGCCTTGGCTGCCCTGAAACTCATTCT 
TCTGCCCCTGGTATTACAGGCATGCGTCACCACCAACACCCCCCCCCCCCATTC 

6 0 GATTTATTTATTGTTATATGTAAGTACACTGTAGCTGTCTCCAGACATGCCAGAAGAGGGAGTCA^ 

GAGC CACCATGTGGTTT ATGGGATTTGAACTCAGG AC(HTTGGAAG AGCAGT CAGTG CT CAG C C 

CCCTTTTTAGGATTTCTTTTACTCTCTTGATTGTGCC^^ 

TT CTTTTTCT CTTATT AACT CACTTTCT AT GAGT ATTT ATGTG TG TGG AG TG CACACATG TGT ACATGGTT G CACATG TGTGTGT A 
TCTGAGCTCCTTGGTGTGTG CACACATACACATGCAT ATTTG ACGTCGGGGTGT CTTCCTTGGTCACATTCTACTTCATCTTTTTG 

6 5 AGGCAAGTCTTTTGCTGAACCCAGAGTTCCACTGATACAGAGTCTAGCTAGCCT 

AGTGTTGCCGTCAAAAGCAGCCACTGTXXTCCGCITTC 

GTCCTTTATCCACTGAGCCATCTCCCAGTCCrTCAGCAT 

AAGCCTTACCTGCCAGGTCAAACATGTCAGAGAGGCAAAGAGACAGAGAA 

T ACACTGTCTTTG AACT ATG C C CCATC CTG AACT ATGTCTT AACAACTG AAAGT AATGGT AAAAT AATC^ 

7 0 AAAAGATAAAT ATTT ATTTTTACTT ACTT ATTT ATT ATTCT ATG TTTTG CCTGT AAGTATG T CTG TG TGAGTGTG C 

CAGAACTCCTGGAACTGAAAGTATAGATATTTGTGAGCTGCTATGTAGGTGCTGGAGATTGA^ 
TAGTACTTTTCATCACTAAGTCATTTCTCCAGACCAAT^ 

AAATGCATCGTGTGTGTGTGTGTGTGTATGTGTGTGTATGTGTGTGTGTGTATGTGTCT 
NNNNNNNNNNNNNNNmJNKNNnl^ 
75 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNIWNNIJN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNIINNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNITONNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNIINNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNN 

N^NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
2 0 AACACACTGCTCGTATGTACTGTGATACTGCACTTC^^ 

CGGGCGGTGTGGCGCATGCCTTTAATCCCAGC^ 

AAGTAAGTGCCAGGACAGCCAGGACTATACAGAGAAACCCTGTCTTGAAAAACCAAAAAAAACC 
AAAAAAAAACCAAACTTTGATTCTGGAGGTGTATC^Cn'TAGCAACGGAGCATC 

AAG AC CCT GGGCTCCAAC CCCAG C CACAAACT AAACAACAACAAAAAG CCCAACTTCTTGAAAATGTT AAAAAAAAATC CTT AG TC 

2 5 TTTGATTTTCTTTTTTAAATGAGACATTGGGCCT 

TTTAAAAACAAACAGATGACTAGTTCTACAAAATACCTITGTTAAAACAGT^ 

TAATGGAAATCTAATCTTGGGATGGTAGCCTATGCCTTTAGTCCCAGCACTTG^ 

GGTCAGCTTGGTTTGTCTACATAAGAAGTTTCAGGGCAACCAAGGCTTCATAGAG^ 

ATCAATTACTAGTTAAGAAAATGCCCTACAGC CAGATCTTAGGGAGGCAT'l' i'l' CTCAGTTGAGGTTCCCCTCTTTCAGATAACTCC 

3 0 AGCTTGTGTAAGATTGACATAAAACTAGCCAGCACATTGTTTCATCCTGTTTGAACATGTCCI^ 

AGTCTTCCTCGCCATCAATCCAACTGACATCT 

AGCACAGATTATACCTGAAGGACCTACCTAAAAAGGCTACATTCACCCTTCAAGG 

ATCCATAACGTTCTCTTTGCTAAGCGTCTCCCGCTCTTCTTCATCATGATCACTGGGCAACT 

CAAAGGCTCACCTCTAACCTAAGAGGAGGCAATCTGGGTCAGATAGTTGATGCATGATATGATGTGCAGT^ 

3 5 CAGACATTTCTGATAAGAGAGTTGAGCJ^CTTCCTGATATGTGAGACTCAGCTTTATACAGTTG 

AGGAACATTAGTGACCCTTAGACAAATGTTCCITAAATTAATCTATTGGGGTCATCT 
TTGCCAGGAAAGAAGTATTTGAAATAGACTTAGTGGAAACTA 

AAG CATTTGTCTGAAATGGCTTGAAATGTATCCTTGTTGTATATTCACATTCACAT ATCTCAATG CATCATGGTTGAACACCCAT A 
TGTGTTATGAAATCAGTGCTTATTTCAAAAATATATTGGATGTAGCCTGTCTCCAGAGCCCACAGTC 

4 0 TCCTCAGACTCTTATGAGTTCAAAGTGCCTGTGACACTCCTGTAACACTCCTGTCTATAAATATACTGATTAT 

TTTTTCCCTCAAGGGAGGAAATTAAGATTTC^CCAGATCTrATTTGCAA 

TATGTCAATAGTGGGTGAAAAGACAGCTTGGTGACACTATGAGAGAAGGCAAAGGGTTCTTTCT 
AGACTTAAAATCTTCCATTGCAGCntSTAAACACATCTGGACAATAGTCTGT^ 

TGAATCGTGCATCAGAGTTATTTAAAACCACCGTGCATGGAGTGAACCAATACCGAGGTGTTTGCTTA 

4 5 ACAGCACAGCCTTGAACTCAGTGACACTCCTAAGAGGGCTCTAGGGTCAGGCCAACTTAG 

TGCCCTTCCACCCCCGTTGCACGACCTTGCTTCTCAGTCTTTGTTG AGT CTTCTGG^ 

AAAACCCT AAGTG AGACCACTGCCAAGAAGTG CTTGCTCACCCCTCCTG CCGCGGCAGGG AATCCCCCTTTCCTTGTACAGGCAAA 

ACACAAAAAAGGACTCATAAGTGAAGCCTGATCC^CTCACCAAACACTGCCC^ 

CTGGTTTGAAAAATTACCGCAAACCATATTGTCATAAAAAAAAAAAAAAACACTTCCTATATGA 

5 0 AAGTTCCTGCTGAGCAGATCAGCCTAATGCTTAAATAGAAC 

CTGAGAGGCCTGAGCCCTTGTTCTGGCATTCTCCCAGGAAGATGCAGTAAAGGGGTTGACCCAATAT 
AGTTCCCTCCTCCATCCTGATCCCATGTGCCAGGAAGATGGAGCCACGCTTGCTGATGTTGGGGTTTCT 
CCAGTTGTCGGGCAGGTAAGAGACCAGGAACTGCCCTGGGAACCTTGGGAAGACTAACAGAATCTTGGTAAGCCT 
GCTATACAACCAGAGGGAACAATGAGATGGAAAACTTTGAAGAGGGAGTTGAATCATGGGTTCT 

5 5 GGACCTCTTTTTGTGCCTCAGTGGTCCCT AAT CTATCAACCTTTCTT AAAAT AG CAGGAAGGAATTTCTCATTCATAGTCCTTTTT 

AGGTTTCATTCCCAGGGATGGTTTGGCTTTGCTGTCTTGGAAGfTTTCTTGTAAATAT 

TAGATTTTGTCAGGTATAGTTTTAAGAAATAAAA^ 
CTAATACCATTTTTTCTTGCTAATCCAGAAAGAGAAGTGAGGT^ 
60 ATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATATGGGACTGTCT 

CATGTAAACTAGTTTGGCCTCAAACTCAT AAAG ATCTACCTG CTTCTGTGCAT AACCC CAC ACTAGGCTGAAAATAT ATTTTTTGA 
AAG CCT ACAGTTTCATATTCTT AATTTCT AGGAATTT ATTCTTAAATGACC CCAGAAAAATGTA^ 
CATATACACGAAGCTTGATTGCTGACTGGGGGCAAGTGATCACATTGTGTTGG^ 
ATGTTTATGATTTTAGGAAGCTTTTATTGTGTGATTTCT 

6 5 AAGTCAGAAOUVCCAAGTCITrACAGTTATGTGTATTCTGTCTCTGAGGAA 

TGATTTATATAAAGCTCATTTACTTGGTTTTGIZAAAACATCTGTCC^ 

TACAGGTTGGAACATGCCTGGAGCTACAGGGGTGTGGGGCTGTCAGAGAAGACTTCAT^ 

GACTGGTAAATTTCCZATAAGGAAAGATTTrAGATGAGAAGGCACATGGA 

TTCTCTGAATATGGGGCTATTGGATAAAGAACAGAAAGATAGGAATGGGGCCAAAGGCATTAAAGCTCT 

7 0 CTTTTCTTTTCCTGTGTTGTTGGGGATAGAAATCCAAGGCTTTGAGCAC^ 

CCCACAACTTAAAATACTTTAGTTGTATTTGGTAGGCAACAGAAAACTACTTG 
CTTGATAAATCAGATGTATACATTTTATTACAACTTCTAATCACTT^ 
ATCAGAATATCAGTGTTGCAAGGCTCACATCACTTTTGTATTTTCT 
AAGAGATCAGGTGCAAGGAGTTATGTCAAGAGGCA^ 
7 5 CTCAGAAACTAACCC^lGGGCAAGAGATGATCTTATTTCAACTT^ 
CTGGAGAGATGGCTCATTATTGGTTTAGAGCACT

CCATCTGTAACTCCAGTTCCAGGTGATCTGACAACTTCTTCTGGCCTCCAT 
GGTCAAATGCATATAC^CGTAAAAATAAATAAATATTAAACTTTAAGAAA^ 
GGCAAGAGGTCTTTAATGAAATAAGACAGACTGTCAACAGATTCAATT^ 
5 TTCCAAAGCCACCATTTAAAATTGTGATGTGGTACAAGGGATGTATTAGTCAACCCGGGTG^ 
TTGGGATGTGGTTAAAAGAGAATTCTCCCTCATGTTGCCAAC^CAG 
CTCTGAGCTGTCTCTTGATCATTTTTCATTCTAGGAATTCTGTAAT 
GAAACATGTTAAGGTGGTACX3CATGTCTAGTTATCCCTTCTGGTAAAGGCTACTC 
TTATGCTTCCCAGTGCCCCGTCAGTTTCrTCACAGTGTTAGAGCTATTGTC 
1 0 TGTGGTITTGCTAAGCACTGGGAATACTTAATAAAGCAA 

ATGCCTGTGCTTGATTTAGCTTGGAACATTGAAAAAACAGTAATTT 

AACAGCTTCCGAATtS CTCAACACCTCCATCACTGGGTC C CT CAGGGATTTTTGTG TT CATGT ATG CAT ATGTGT ATTT 

TTGTGGCTATGTGTCTAGATCAATGTGTGTGGAGGCCAAAGGTCAACTTAGGTATTAA 

IT'TC'T T ATT ATTTGGG1"I^'TTT ATT ATTGTTGTCT AGAGATGG AG'ri'TC'l' TG CTGGT CTGGGACT ACATGGTT AGG CT AAGCTGGT 
1 5 TGGTCAAGAATCTGCCTACTTCTGCTTTCTTAGGGC 

ACAT AT AT ATACATTAT AT AT ACAT AT ACAAT ATAT AT AGATT AAATGTAT CAGTGTTTTG C CTG CATGTTT ATATGCACACCACA 

TGCCTGCCTGGTGCCCTOVGAGGTCAAAAGAGGGCATTGGCTCACCTT^ 

TGGTGGGAAGCAAACCTGGGTCCTCTGCCAAAGCAGGTGCTCTT 

G^CCAGGGATCAGAACAGAGGTCCTCATATTTATGAGGTAM 
2 0 TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 

TTTCTTATGGAAACCATTCCTGTGGTAGCACTGAGTGCCATGAG^ 

TTACTGTCCAATGATGGCCGGGCATTTTACA.GATAGAGGTTGAATTCT 

AGAGCTAGGAGGCAAATGAAATTTTAAAGATGTATAGTATAGAACAAGGCACCTAATC 

GCAAGTCACAGCITTCAGCAAACTTCTCTCTGCAACAGG 

2 5 ATACATCAAGAATTCTGGGGGGAAAAAGACCTACTCTCTCCTCT 

TGTCTCACAAACACACACACACACACACACACACTCACTCACAAGTCT 

CCAACCAAAGATATGCCCACTACCATGTGGATTGAAAGGTCACCGGGTATCAGCAGACCCAAGCCTC 
TTTCCCCACTCTCCAGAGTGCAACGCTCTGCACTGCAGCATGACTACT 

ATTACTTCAAAAAGTTATAGATCCCAGACCTGAGATGCCTCTGTGGTTAAGAGTC 

CCATGCTGGGAG ACTT ACCCTG CCT ATTTG AATCCAATG AC CTCTTCTGTCCTGC ATGTATGGATAG ACACATACAT AZUVT AAG AC 

CTTGCTGTCCTGGAAATCAACCTGTAGATTAGAACAGCCACCACCTCAGAGCT 

CACCACTTCTGCCCAGTGATCCCAGGCCTCTTGATGACTTGATTACAT 

3 5 CTCTTTATCCATGCAGCCAGCCATTCATGCATTAGGATTATAGAGGTGGATAAAGGTACA 

AACATCTGCTGATTGATTCTGATTAGGAAGACTGCTAAGGGAAAACTTGCATCTGC 
CCCTCTGCCACATTTTCAAGGACATCACCCATTTGCAG 

CT AATG TTG CT CACAT L M 1'1"1" 11' TGGCT AGTTG ATTTT ATGGT CATTCT AG ATTTTGCCTTT ATCAACAAG ACAACG CAGG T CT CAC 
CGGGAGCTGTCTGTGGAATCATGTTTCCCAACCCCAGCITCAGTTACA^ 

4 0 GAAGTCAGACTCTCAGGGGAGAAGAACCAGACGTACCAATCACATTATrriTATGTCC 

TGAGATCCGTCTAATATTGTAGAGAAGTAATATAATGGAATAATCCTACAGCCGGGCATGGTGTACATTO 
CACTTGGGTGATTGGTGCAGGAAGATTTTGAGTTTGAGGCCAGCACAG 

AAGAGAAGGAAGAGGGGAAGGGAGAAAGGGAGGGAAGGGAAGAAGAGAAACGCAAATGGTTTTT 
TCATTTTTATTCTTTGAAAATAGAGGAGAACTTCrCAAACTT^ 

4 5 AGGGATTCGGTAGCTTGAAATTCTGTTCAAACAGGCTCCAAC^ 

GGCTATCATGTAACAACAACAACAACAAAAAAATG(^CACTATGTAACCACACAGAG^ 
AGAAGATTAAATATTGATCAAAGGTTGCACAGTCTATATAACACAGCCTATAAATATTA 
TACTGCCTGGGTAATTAAAGATGATGCTITGAGAAGGGCTTTTG 
AGCCGGGCAATGGTGGTGTAGACCTTTAATCCCAGTACTTAGGAGGGAC^ 

5 0 T ACAG AG T AAG TTC CAGG ACAGT CAGG G CT ACACAG AGAAGCCCTGT CTCAAGACAACAAAACAAC^ 

CAAAACZAACAGAAAATGTTAGAAGACAGTATTGTGTGATGATTCTGCCAAAGCT 

GTGTGTCTGAAGAAAATGATTTTCAAGTGCACAGTCTTTCT 

ATC^TGGCAGATGTTTGTAATCGCATTGCTGGGAAAACAGAGATG^ 

GAGCTCTAGGTTGTTGGGAAACCATATCCCAAAAGAAAAGATAGATGGTATACACACTACACACACACACA 

5 5 AGTAGTAGTAGTAGTAATAATAATAATAATAATAATAATAATAATAATAATAATGATAATGATAATGATAATAATAATAATGATTT 

TGCTCAGGCATAAGACTTTGCATGTTCCCC!ATCCTACT 

TGACATGATGAATTACTGAGTAGCCTTGCCAGTACCACATGACACCCTGGATATAGTCAAAGAATTT^ 

TACATCTGTGTTTTTAACTATGATGCAGTCCTGTTAACCTCATACCACAAGGTCAGAAT 

TTTTACCCTGCCAGTTCACTAGGCTTCAACCAGCAACTCT 

6 0 GACTGGACTACCAACAGTAAACC^TTTCACCCAAAAGAAGCACTGTTTT^ 

ATTGGTTTAAATTTTTTCTTTTCTTATATGTATGGGTGTATGGA 

GTGCCAAGGAGGCCAGAATAAGGTATTGGATCCCCTGAAACTGGAATTATGGAAAGTTGTGAGC 

GAACTCAGGTCCTCTGCAAGAGCAGCAAGTGTTCTTAAGTTGTGAGGCTTCT 

GATGATAGCGAGAGCTTATGCTCACTGTCTACATGCCATCTCTTAGTATTGTC 

6 5 TAAGGTAGTTGTGTTGGGCATTGTGGTCTATGCCTTTATGTTCAAC^CTrGGGAAACAAA 

TCTCCTCTACACAGAGCAGTGTTCTCAACCTAACACTGCAGCCCTC^ 
AATTATTTTTGTTGCTACCTCATAGCTGTAATTTTGCTACTTTTA 
CAACCCACAGGTTGAGAATTGCTGACACAGAGAGTTCCAAGCCAGTTAGGGCC^ 
AAGCAGTTCrTATTATCTTCCCTTTGCAGTTGACCAATGGAGTCT 
70 GTCTCTAGAATCTGAGTTTACAGAATCTCTACCCACCATCCTGAGCCAT 

GGCCACCrGTTGAATCrCTTGTCTTTACTTTTGTG CATGTGTGTGTGTGTGTATACATATG CATGTATATACACATGTGTGTGGAG 

GTCAGAGCITCATTCACACGAC^TCTTCTTTTATCC 

ATTCAGCTAGACCACCTGCTCAAAAGCTGCAGTGGTCTICT 

GGGATTTCTACCTSGAiKsGTl^^ 

7 5 TTTTTTCCCCCTGAATGTATGTAGCTCAGGGTGGCrTAGAAATTCATGA^ 
CCTTACTTACACGTCTGGAGTTCACTCrrGGTAGGACCrCCTCAGGACCCTGCTAGTCTTAGA 
TTACCTTCAAAGAGCAGCCTGGGGTGTTTTCCTTGCATCTGATAGCATTTCA 
CGGCAAAACAGAGGAAATTCTGAGGACCTTTGGCGATGAGTGAC 
TTCAGGAATTGCCATCTCTGCCCACAGAATCGCCCACTGTTTTCTGTGTGGGACCA 
5 GTGTGTGTGTGTGTGTGTGTGTTCTCTCTCTCTCTCTCTCTCTCrCT 

AAAGATGGGTATGTGCTTCCAAGGGCAGACTAAGAAGCCACTTACAGGTGAGTC 
CCTACCCTGGTGGCAGAGCTTCTCGATATTTGTATGTCTGCTG 

CAAGGTTAGAGAGGCAGAGATGAAT AAGACCCAACTCTTGG CCTCCAG^ C1TCTTG CTT ATG CGG AGAGAAT ACCCAAG CT AAG 

GG CTG AGATTT AAGTAAGAGC ACAGAGCTGGTGGACACACA CT AG 

1 0 GAACTGCTTCCCATTATATGTACATGCAGAAGAGAGAAGGGCTCACGCTGTTTTGTCAGTGTGAGTGACT 

ACTTT CCATTTTGAGTTT AGAACAATCTG CAGAGG' IT riGTGCTTGTTTGG CTCAG CAAT ACCTTTCAGGGACAT CTCAGGT AGGA 

CGTACTGGGAGTCCTTCAGTAACAGACTGTTTCCACAGGCTCAGGAGTCT 

GAGTACATTTGAAGATCTCCCAGGTCCTCAACATCTATGACCT 

GGNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNN^^ 

GCCCAGGGTTCAATTCACACTAAACAAAACACTTTACCTCTGAGCTGGGTCCCCAGTTC^ 

AAAGCAGTGCITTAAAGATGTATTAATAAACTATAAGAATAA^ 

AATGAGAGCAGCCCTAAAACCAAACAGGCCTGTGTCCCCAGGGAAAAGGTCACTTTATCA^ 
TTTCTAAAAGTTTCTTTTGAAGTCCCCGACATCTATGCTGACGTCAGCCT 
2 0 TTGGGGCCAGTGACTTGACACAGGGTTGTATGTTTGACACTTGCTGAC^ 

AAAGAACCTTTGTGAAGAGAAAACTCCTTATTTAAAGGAAAGTAAACCGAACTTTTGAG 

CTCAGGAGTCCCGCTGGTACCCCTGAGGGGAGGTGAGCCTTGAGGGCTCGCTGCTAGG 

TCCCTTTCTAACTGTCTCCTCAGAGGCCGCTTCTGGAGGTAATTTTTAGCT 

2 5 CCCTGCTTCTTAGAGAATACAACTGCCACCATTGAAAACCTAGGCTGTCTAACCAGTGAGCTCAGGGCGTCCGCC 

CCCTTTTCCTCCTGTGCTGGGATTTG CATGTG CACAGCCATAT AGAGTTTATAGAGTGCTGGGGATGTGAACTCAGGTTCTCATGC 
TTCTGTGGCAAAC CCACGAAAC CATCTC CCAT C CTT AGTTTT ATAT ATTTCACT AT ACATT CAT ATTAT AATAT ACATT AT AT ATT 
ATATAATATATATTATATAAAACATATGTAATATATATTATAATATATAATACCATGGTATTGGCAGTGCTTAG 
AGATTTTCX3TAA Crrm ' C l' ri , GTTTTCTGTTTCr<^ 
30 TGTTATTCCCCTTCCTGGTTTCCTCTCCGCAGCCCCCACAATCCCATCCCCCTCCTCCCTGCTTCT 
CACCCACC»CTCCCACCTCACrGCCCTGGTATTCCCC^ 

GATGCCAGATAAGGCCATCCTCTGCTACATATGCAGCTGGAGTCATGGGTCCCTCCATGTATGCT 

CTAAGAACTTTGGGAGGTCTGGTTGGTTGATCITGTTGTTTTT 

TAACTTTTCCCCTGGGGCCCCTGGGCTGAGTCCAATGGTTGGCTGCAAC^ 

3 5 CTTTAATTCC AI ' ITG T IT AACAATTTATTTTAACA 

ATGAGAGAAGAGGGCATCAGAAAACAGAGGGTATCTGATCCCTTTACAGGTTGTGAGCC^ 
AGGACCTCTCTGGAAGAGCAGCCAGGGCTCTTAACAGTTGGGCCGTCTCT 

TACTTATAGACAGGGTCCAGCTATGTATCCTGAACTGACTTGCAGTGTAGCTAGGCTGGCOTCAAACT 
CAGCCTTCTGAGTTCTGGGACTACACATGCAGGGGAAAT 

4 0 CTCAGAAGGCTCTGTCTTTTTTCAGAAGACTGTGGGCGGGATTTACCAATT^ 

AGAAATCCGTTTCTAAGATTTGAGGGGAAAACTGTCTTTAGTTTTC^ 

TCTCTCTTTCTTTTTTAATCTGATAACACTTTAATT 

CTGAAGCTTGTTTTCHTBAGGTGACTGGAGAGTTAGAAAAAGACAG^ 

GATGTGACAATCTAGAGGCCGCTCTGTGAGACAG AGGGCAGTTCTCAGCT CAAATT CCT 

4 5 AGTTTCTGTAGACCTACTGTTTGTTGTTTGTGTAACTACTGCCAA 

CACACATTCCTGCCITATGGAAACCCAGGCTTTGACGAAGCCTGAGAGA^ 
CTCACAGACACAAAAGGCTAACCACCXaTTCTCCAAATATTGTTA 
AATGACACACACACACACACACAC^CACACACAC^CACACAT^^ 
TCCTGGAACTCATTATAGAGACCAGGCTGGGCTAGAACTCAGAGATCCACCTGCCTCT 

5 0 GTGCCACCATGTCTGGCTTTCATGAGTATTCTTTCAATCATCTTTGGATTCAT 

CCTTTAAACAGAGTCATATCAAGAGATTTTCCACTAGTGAACCATCCATCACTGCTGAGATAAATTCT 
TTTTTGTCATACCTGGCTAGAACCAAGTA(ZAAATATTGTAATCGATATTCTGTTGTCTGTTT 
CATTTTCTCTGCAGTTCATCCCAAGTTTAATAGGAAGACTAACATAACTCTGTAA 
GGTCTGAATGAGAATAGCTGCCATAGGCTCATAGATTTGAATATTTAGTCCCCAGTGAG 

5 5 G AGGGGTGACCTTT ATTCAGGGTTTTGAAAGACTCATGTGATA 

CTCTCTCTCTCTCTCTCTCTCTCrCTCTCTCTCTTTCTCTCTCT 
TTTACTCTACCATCATGGATTCAAACTCCCTGAAACCATATGCCAAATTA 
CAATGGAAAAGTTAACTAAGATGCCCTCTGAAACAGTTTGTATTAA 
TTTTTCAAATGATTGGTTTATTAGGTATAATTCTCCTCTTTCTTCAGCTTCTTTCTCTCT 

6 0 CTCAGGCTGGGCTrGAACTCACAATCCTGCTCATTTTTGAGTT^ 

GCTCTGTTGTTCTTGCTCCTTCTAGTCCTGACTATTTTCAGAATCTAC 
TGGTTTTTTTITTTTCATCTTGGT 

CTCCAGTAAATATTAAATGGGATAATGCCTGTTCTTTGAACAaSTATTTTCCT 
TGGCTCAGTACTGCAGGGTTTGTGGTTATCATTTTTGTTGCTATTCGGTGOIG 

6 5 TGATCCATGGATTTGTTTGGAAGTTTGAGGTTTCTITTTTAAAATC 

TTAACTTTTAATTTC^TTGCGCCATGATTAGAAAATGAATCT 

CCTAGTAGGAGCTCAGTTCTCTTTAC CACTGTC CC^TGCGIX^T AAAGAGAGTGT ACATTGCCTATGT ATTGGTCACACAACTACA 

TAAATATGCAGAAATGAAGCITACTGACAACTAACAGACCCT 

TTTGTTTTGTAGTGAGTTCTGGGCTTAATTCTAGTTGGGTTACTTTTTGTGGTTT 

7 0 TCTGATTGGCTGACCTGCTTTTACTATAGGATTCCCATGAGTGT 

TATGTGAAATGTATTATCTTTGGTTATTGCCGTAGAGTCTCTGTTA 
CCCTIGTGTGGAGACAGCACAGTGTGTTACCTCTTTCrc 
ACTCCCCTGCATCTTGGGTCITTTGTCTCCTTCATTATTC^^ 
AATATGGTATAAGTTCrCTCAT^ 
7 5 ATCAACAGGAATGTCAATGGTTCAGAAGCAAATATGTTCTACAACATT^ 
TCTACAACACCAATGACCGTAAGAAAGTGAGGAGCTAAAATCAAACACACAGAAGAGATATATAATAATGAAAGT^ 

ATAATATTTCTAATTATACTTATAATAATTATAAGTATACTTATACTTAATTATACTTATGTTAA 

ATTTAATTATTTAATTATTTAACTCTACTCTCT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTGTTA 
TGGCATGTGTTTATGTTTGTACATGTAGTGTGTATTTATGTGGTGTC 

ATGTGTGGTGTATGTGGTGTATGTGATGTGTGTGGTGTGTGTGTGTGTGTCTGTGTGTTGGGGG 



CAAGGCTATATTCACACCCCTGAGCTACCCTAGCAAATCCCAGGACTCTGTGAGGT^ 
1 0 TTTTGTGATGCAGCAGGAGGGGTTGCTTTAGCAGGC 

TATAAAATGGAGTTTATTTGGGACATGGGAGTAGAGGCAGAGGAAGGAAGGGGACAGAG 
AGAGAGAAAAAGGGAAGAGGCCTITCCAGGAACATGCTGAGGGAGTGGGAGAGAGAGAGAGAGAGG^ 

GGGAG AGAGAGGAGG AG AAAGG AGGAAG AGGGAGGG AGAAGGGGGAGGAGG AGGAGG AACAGGAA GAGAG AGAGG CCAAACAAT C 

CrrTTTATAGCAAGTCAGGCTAGTACCTGGCTGTTGCTAGGTAACre 
1 5 GCCATTCTTGTCCCTCTGCCTGCAGGAGTGAAAAATGCCCATGTCGTTGGCTAATCCT 

GACCATGACCAAGCCAAAGCATTTAGTGCTGAGGTGCAAGTGAGGGGTCATCACT 

CAGCTGCCCTTCCCCTGCTTATTAATAAGCGGCATGTAGCTAGGCATAATGCT 

GCAGGCAGCAGAAAGAGATGTGTGTCACACTGGTCATAGTTTGGGCATA 

CTCCAGCAAGGCCACACCTCCTAATAGTtCCACTCC^ 
2 0 TCAGGGTTTGTATTCCCACACAAAATATCATGACCAAGAGGCAAAT^ 

TTCTTCATCTCCAAGGAAAGTCAGGACTGGAACTCAAGCAGGTCAGAAAGC^ 

TACTGGCTTGCTTCCCCTGGCTTGCTCAGCCTGCTCTCTTATAGAACCAA^ 

CCCACCCACCCTTGATCACTAATTGAGAAAATGCTTTACAGCTGGATCTCATG^ 

TGATAACTCCAGTTTGTGTCAAGTTGGCACAAAAAACCAGCCACTAC^ 

2 5 GTGACTCCTGTG TG CTTG C CT C CTGTGTGACAATT AAAAAGGACCATCACG CTAACCTACT AT ATG CAAGGGAGTGCAGTAAAGAA 

ACAGCTAACATGATTTCTCCCATGGTGTGGTTAAAGTTTTTACTGTAG 

CATGGCCAGGACAAGGGAAGCAGGAGAGAATAGTAGTGCTCGGGAGGGACCAAGAGATAGACGACAGAGCCTA 

AGTGAATAGAGAGAAAGAGCAGGAATAGTGGAGAATAGTGGGTTATAAGGAGGCTGCATAGGCAGGGAGAGGGAACAGCCT 

CTGTCACGGTGGCTTTGAAACATATAACATTGTTGAAATAATTAAGTAATAAGTTAATCCC^ 

3 0 GGTTTATGTTCCCGAACAAAAGATATACACACAGTCTTTATATTTTAATATGC 

CCTCCATGAGGTTAATCCTGAATTATTACTTACTAATTCTGTGTTCTATCTTGGCTGCT 

CCATGCTCTCAGTTTCTTCACAAGATGGCTATGCCTCTCTGl iUTCCA'lTC'n'CCCCCATCTCTCCTCTCCTCCACACTCCCCCTA 
CGTGGTGGATCATCTCTTTCTCCCTCCTCAGGGTCCCAAGCCTGGTAATC^ 

GG TCATT GG CAT CTT AATTT ACCAATCAGAAT CAACT ATATGT AACTTTAT AT AT C AGT AAACAAC AAAACAAAACAAAGCAAG AT 

3 5 TGGGAAGTAATCTAAGCAGCCATTAAGACTTGTGGAGGATTTGTAGATAATAAAGCAACAAG 

AT CTT CC CG C CCTTTCCAG CAAG AG ACCCTCCTTGTAGTCTACTG AACACAG CAATGCATTTGTG ATG ATTAATCTCCTTGTTGAC 
TTGAGGAGGGTTAGAGTCACCATGAAAGCAAATCTCAGGCATATCTGTGAGGGACTT^ 
GACCCTAAAGTGTGTGGTGTCATTTCCAGTAGCTGTAGTCCTAGTTGGCATTAAAAGGAG 
TCTCTAGCATCCATCTCTCTCTGATTCCTGATTATC 

TCTTTCITCTTTTCTTTCTTTCTTTCTTTCTTTCT^ CTTTCTTTCTTTCTTTCTTT CTTT CTTTCTTTCTTCCTTCCTTC CTT C C 

TTCCTTCCTTCCTTCCTTCGCTTTTGTTGGGTATTCTGTTACAGCAAAGAAGAAGTTGGTAGG 

AGGATGGCTTGTTTTGTTTGATCTGAACACCCGTCAAGCCAG 

CAATGGGCACCAATGTCAAAAGCCCAGAAGCACAAATCCATATGGCGAAAGGCAATGTCTTAGAGA 

4 5 CATATCGCTGATTGACGGTTCAAGGAAGCTGCCACATGCTGTTTGATTGGCAT^ 

AC CTT ACTTGCCAAG AACTT AG CAT ATGTT AGACAAAAACTTCCTGGG TCAAAAG AGAG CAAAG AGAGGGTTCCTATCCGGT GG TT 
AATCACAAAGCAGAAGGGCACATTGGGTTTCACTC 

AAGTGGTACAGAGAGAAATGGTGAGATGCTCnTGCCACAGAGGTGGACAGTTGCT 

GAGGGAGTATG AATGCCAG AGTATTGTGGCTCTTGGGAACTG AG CGGACAT CAAGAATCAGAAACATGTGGAATTGGCTCAGATGG 

5 0 TGACAGTCACTAAGAGAAGCAGGTCAGAAAGGGAACAGGGTGGTTTGTATGCTTATTT 

ATTCACACTCTGTCCTCATTCTCTTTGTTCTC 
TTTAAAAAAAGGATAAACATAAAGGCAGCCATTAGACTCCTC^ 
ACCTCTTGGGACCTCAGTCTCTAACTGTATGAAGCCAGTTGAGAATATCTT 
ACACAAAGTGGAGTTCAGAAATTAGGTCTGTTATTACATATCITAAAAATCCAAG 

5 5 TAATGATCTGAACTGAACCACTVACATCAATAAACACTGCCTGGTACCaVG 

CAGCTGTCAGATGGGGAGTGGGGGCAACCTGTCTCCTGGAGGTTACATCTCTCT 
CTCAGGGTTTTCCACAATTCACAGAAAGGGCGCAGAAAGATCA^ 
GTTAAACAATATTCATCATGTTCTACTGAATCGTCTTTCACATACCCAAACAC^ 
CACAOVCTTTCTGTAAGGCCCAACCATTGATTTC 

6 0 GATGGCTCTCAC^CAAAAGCCAGACTGTTCATCAGGTGACTCACACT 

GGGTTTCTCTGTGTAGCCCTC^CTGTCCTGGAACTCACTTTGTAGACCAGGCT 
TCCCGAGTGCTGGGATTAAAGGCGTGGGCCACCACX3CC^ 
TCAATACCTGTCCCGCAGACACCJVrGCTTGGTGAGTGTT^ 
CAGGGAGAAGAAGCCCTTACTCAGCGTTGGCTGAGCTAC^ACAAAACAT^ 

6 5 TCTTAGGGAGCCCTCCTTTGGAGTCATTTACCTAG^ 

CTACrGTCTCCCAGCCTACCTTTCCrCGGGCTACCGACTAGA 
ACXSAAGGGTCTGCTCAGTGTGCGCTGGAAGACAAGAGTAGCTGGGC^^ 
TCCCCCTCAACCTTAACTATCTGCTGTGTGACrAGTCGACAGAAAGC^ 
TCTCTTCAGGAGGTTTGAATGCCTGCCACTGGGAGCTCTCTG 

7 0 ACTTGCCTCCCGAGAACGCTTCAACn^TCAATATTGCT 

TTCATTTTGTTTCTAAAAATAAAACTGTTAAAAAAAATCAAGGGG 

ATGTAAGGAGGATGGCAGTTAGGGTAAGAAGTACGGAGCCATGTCTGTGCTCCTTGATCT 
TGGGAGCAGAAGGGTTTGGCGAGAAAAGCACAAATGACCTCATCCCACA^ 
AACAGCGCCCCATGCTGGGGACCAAG'l^l^CAAACATAC^T^ 
7 5 CCTACCTGCCCTGTGCTCCAGCCCACAGGGAGGAGGCrrTGTAATTC 
TCTTGTATTCCATTTTTCCTGGAGACACTAGATCCCCAGATAATTXS 

GATTTGGCAAGGATGGGACTACCCCTCATACAAGATGGGCAACATACTCCAT^ 

GGTCTCTGGTCTGAACCCTTGTATTGGGTTGATTAACAACTGCTGGCCGCl'r 

CATCCAAGTACCTCGGAGTCTGAGATAGGAGTCTCCCAGGAGCCCCATATCCCAAGAATTAAGGTGGAAAAAGGCT 
5 TTTGGG AGAAGTGGTTTGGGAGT ATTCAGGAGG T AAAAGT AG AG AGGGAAG ACG CCAGTCACCTG AAGT CATTTT AG ACAAT AAAT 
TAGGATTCCTTTTTTAAGTTGTGTACATGGAATTCTCCTTCATC 
TTGGCTGTTGCGCTGATGTTTCTTTTATGACACCAGAGCAA 
AATAAAATCTCrAGGGCTGCCTTTCCCATGGGCTCTTCTGCCTTGGAGTCTTT 

T CAGGG AGAGAG AAT CAT AAG CAATTCTGAGAAAGGTTCITTTAGAAATGG AAAAT CTGTGAG AGTACAAAATTCT ACAACTG TCT 
1 0 CCGTGACCCCACATGACCTCCTTGGACACTTCK3ACTAGTTCC 

ATCACCGAACTCTGAGGAATGCTTAGGGGTGCAATGTCATCCCAAAGAAAACTTATCTG^ 

GGTTTCTTAGCTTGTGGTTTGGTACCAGATTCCAAATATGATCAAGATGAATCCCT 

GAATTGCTAAATTAATTGGTTTTGTTTCAGTGAAATTAGCT 

TTTAATGTGCGTATCTGTGCCTGTGTGAGTGCATGCACACTTGTGTGCATAGAAGCACATGCCT 
1 5 ATATTCCGGTGTGCCATTCTGCCTATTCCTTTGAGGCAGAGTCTCTCCCTGAACCTGAGGCT 
GCTAGACTATTCGAGCTGTCTTCCTGTTCCTGTCTGCACTTCCCCTCCGGGCT 
TCTGCAGGGGCCCTGGAATCCAAAATCTGGTCCTTGTGATTGTGGGGCCAAGAGCAACAGC^ 
ATGCAGGCTTTATTTTACTAATCACTATTGTGGGGGTGGTCATTGGGGTGT 
GCTATATGAATAGGTCTGATGCAACTTGATGAGTTTTGACAACTGTGCGC^ 
TTAGTrTGTTTTTTTTTGGAGATTTCTCTATGTTG 

CTCTATTCTCACAATGATAGGTACATATGTTACTGGGTTGGTTTTGTTGCTTTAGAGATAAGGTCTGGW 

CAAACTCATAACTCTCCTGCTCCAGCTTCCGGAGATCTGGGATTATAA^ 

TTAGAAGTCTGTGTCAAATTCrCTCTTGGTGCACATTATTTCACTACT 

G T AT ATTT AACAGT AT CTGAAAT ATATT CITT AAATT ATTTT AG ATTT ATG ATGTGCTCATG AGT ATTTTGCC 

ATGCACCATACTCATGCCGGGTATTTGCAGAGGACAGAAGACAGCATCAGGTCCCCCTGGAAATGGAGTTAC^ 

C^CAGTGTGAGTTCTAGAAACCAAATCTTCATCCTCTACAAGAATGCCAGCTCTTAAT^ 

AAATACCITAAACGATTGGTTCAAAACCTTCCTGATGCTGCGACCCTTTAATACAGTTTCTCATACT 

AAAATTAGTTCGTTGCTACTTCATAACCATAATTTTGCTACTGTTATGAATCGGTATGTAACT 

AAATGGGTCACAACCCACAGGTTGAGAACTGCTACCATAAACAG 

AATAATTCCAGCAGCTCAGATTCCCCTTCACATTTGGTGCAGATTTCTGTCTA 

GGGCAAATTCATGGTGGTTTTCAATTTGTGGGTTTTTTTTTCCCrrTGAAAA 

CACGTGTGGAGGTCAGGGAGCAACTrrCAGGAGTGGGTTTTCTCCTTCATCC^ 

GTGGGGGTCAAGCACACCTAGCTTGCCTCTCTCTCTCTCCCCATCTCCCTCTCTCCC 

KNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNinn^ 

HNNNNNNNNNNNNNHIWNNNNNNNNN^ 

^^NNNNNNNNNNNNNNNNNNNIWNNNNN^ 



nnnnnnnnnnnnn ^^ 

GCCCTTCGGTATTCCTCAGTTGAGAATTCTTTGTTTAGCTCTGCACCC 

TTTTGTCTTAATGACAGTATCTTTTGCCTTACTGAAGCTTTC 

AAGCCATTGCTGTTCTGTTTAGGAATTTTTCCCCTGTGCCCATATCTTTGAG^ 

GTCTCrGGTTTTATGTGGAGTTTTTTGATCCACTTAGACTTGAGCTTTGTA 

ACATX3ATAACTGCCAGTTGTGAAAATGCTGTCTTTTTTTCCACTG 

TGTGTGGGTTCATTTCTGAGTCTTCAATTCTATTCCGTTGATCTACCTGTCTGTTGCTG 

ATTGCTCTGTAGTACAGTTTGAAGTCGGGCATGATGATTCCACCAGAGGTTCTTTTATTGTTGA 

TTTTTGTTGTTGTTGTTGTTATTCCAGATGAATTTGCAAATTGCCC 

GATTGCATTGAATCTGTAGATTGCTTTTGGCAGGATAGCCATTTTGACT 

TCCATCTTCTGAGATCTTCTTCAATTTCTTTCTGCAGAGACTTGAAGTTCTTATCATAGAGAT 

ACACCAAGATATTTTATATTATTTGTGAAGGGTGTTGTCTCCCTAA'i TT CTTTCTCATCCTGTTTATCCTTTGTGTAGAG AAAGGC 

CATTTGTTTTAGTTAATTTTATATGCAGCTACTGCACTGAAGCTGTTTATCAGGTTTAGG 

CACTTATATATACTATCATATCATCTGCAAATAGTGATATTTTGACTTCTTCC^ 

ACTTCAGTGAGTGTGTGTCTTCAATAATCACAATATTTTCTTTGGAAACATGACT 

TGCCCTTTAGTGTTGCTATGTAGGTTTTTGTGTATTCTGGAAACAAATAT^ 

TGCCACATCTCCTCCATTTTCTCATGGTCTTTGGATTAGCTGATACTTTTGTAA1T 

AGCAGGGCATGGGAGGGTGGACTCAGACCCTCTG ATG CTGGGGTTAC AGGTGCTTGTGAGCTGCTTGGTTCAGKSIX^ CTGGGAAGC 
GAACATTGGTCCTCTGCAAAGGTAGCAAGCATTCTCAGCAACT 

GCTCTGTCCTAAGAAATCTTCACCTATCCACAACTTCTAAAGCTGTCTGCCTTTATTTCCT 

TATGATTGCATGTGATACACACACACACACACACACACACACACACACACACACACACACGTTCTTATTAT 

TTTCTCTGTGGAAGGAAGCTCTTTTCCCTTTTCACATTTATCCA 

TGAATTGCTTTGGTAAATGGGGGGGGGGGCAGTAATTAAATGTTCCTGAGTTTCTAGAATG 

TCTCCCAGTTGCTTGCTGTCTTGATGGCTAGCTTTATAGTAACAGATGTTATGAGTCTCCTAT 

CTTAGATACTTCCAAGGAAGTTTGAAAACAATTTTCCAATTTTA 

TGTCTITTAAAAATCAGCCCTCCAATTTATAAACAGTGTATGTTTCTCTG 

CTTTTGCAATTTTCAAGTCTGTTCITGTAGGTGTGGGTGAATTTATTCTTGA^ 

CTTTAGATCTTAATTAATTCATTCAGTTTAAATATaTAACAAGT 

GTAGAATTCTCGGTACTTTCATGGAAAGAGGCATTTTTAGCTGTGTTAAGTGCCC^ 
TAGCCCATATCAGCTTTCCTATGCTTATCTCTGCATATCT 
TTTTCTTTGTGAGATAGTATGCAGTTGTTGTGGTCTGAAAGTTTCTATTT^ 
TAGTAGTGGGTTGACGTTGATGTCTGCCTTCrTGCTCTTTCITCGTC 
TGGAATGTTCTGTTTAATTTATTCTGTTATCITTTAATGTGTGCT 
5 TACTATGCATTCTTAATTTTCTATGTTCTATTTAGTTAGCATTGTACCA 
CCATCAA(XrrCAGCTTTCTTGCCCAAGTTGTCACGTTTTACATT^ 
GCTCAGTGGCTAAGAATGCATGCACACTGCTCnTCCAGAAGATCTGAGTT^ 

CCTGTTTCIGACGCCCTCTT CTGGT CT CT AGG AG AACTGCACT CACCCCCT C CCCT AACACACACAGTT AAATCAAAAAT AAAT AA 

AAGACCTACCATACAGTGCTATTTTTTITTTATTTT 
1 0 CTAATGCCCTTCATTATTCCCAGTAGATTTCTATTTTTGACrA^ 

TTC ACTTTT AATATGCACT AT AAGTG TG CTAG TGGGGTTTTGTGAACTTTCTTTTACCTGAAAATG TG CT AACC C CATCCT CATTT 

TTGAAGGGTGTCA.CGTCTAATCAGGATTGAGCTGAAAGCCTGTCT 

CTGGCTCCCTTTGCTCTGTGGTGAAGTCAGTTGTCATCCATCACTGTTCCCACT 

AGACATTCTCTTTAGATTTTTGGCAGCTTGGCATTAGATGC 
1 5 TGGATCTGTAAGTTAACATCTGTCTTCAAGAGTCAGAAATGAGCT 

CCCTCCCACATTTCATCCTTCCCAAACAGTTCCACCAAGCAGGACCAAGCAT^ 

CGAACCACTAAAO^AGGGATAGAAAAGGGGGCGGGGACTGGATGTGTGGGT^ 

CCTGTAGTCAGTCTTCTCAGGCAATGTGCTTACAGATACACCCAGAAGTATGTr^ 
2 0 GTCGACACCTCAAACTAGCCGTCATG AT ACTCACCAGTCT AATGTATAAGCAGGTGCAGTGTTGG CAT 

GCCGTTCCAGATGGAGCAGGTGGGCAGGATGGCAGCTTGATTACAACC^ 
TGCITTCCTGCCACrCCCACrCCTTACCATCACrC^CrCCC 
TCTTCAAGCACAAGAGCTTGTTGCAATGGAATGGTTTTCACTTAC^ 
TGGCTGTGGCAGTGTGCACCTGTAATCGAAAGACITGGGACrrGAAA^ 

2 5 TGAGCCCAATTGGGCCTACAGGAGAACACTCCCTCTCAAAATGATTAC^TGGGAACCTCT 

TGACCTGTCTCAGAAGCTTTGGCCCACCCAACCTCTCTCCrGTGTATGCTAGAC^ 

TTGTACTTA1TCCTTCTACAGTTTGCATTTTATACTGTAGAGGAACTTGAG 

ATCTCrrCAGTTTTATACAATGGGTTTCATTGTGACITTTCCA 

ACATACACACACATACACACATATCATCCTCTGTGCTCCCACTGATCCCGAGGAGGTAGAGTACGT 

3 0 GAAGAAGCATGTGAGCZAAGCTACTGCACAATCAAACTCATGTCTCACCAACAT^ 

CCCACCCGAGGTCCCCAATGCCACATTCAAAGCCCTCTCCTACAAGAACGGCACC^ 
GAAGACTAAAGGAATTGGTCTATATGCGTTGCTTAGGAAACTCCTGGAGCAGCAACTGCCA 
CCTTITGTAACCTGGATGG<3AAAGGATCTAGAGATAAAAGGTGACAGACrCCAGC 
GTGGATAAATCTATGAATAAC^GATTTTAAAGGCTTCTGTAATATCCACTCACCT 

3 5 GAGCTAGGCTTACTAAGTTCTTGACTAAGCCTGAGTCAATCTGATAGATGCACGACTTTGCTGTTGGC^ 

TCAGACTCCTCCTTCACAC&GCCTGTATCCAGACT 

TATTATATATAGCACTGTGTTAC^ATAAGAATATTATCATGCAGGTATAAACT 
GCCACTCTCTTCTATATTCCCCCTCCCCrrCTGTTACTAACTTTTAGATTC 

ACAT ACTT ATGTTT AGTT ACAT AGAGAAAGTCTAGGAACTG CAAATG AAAGAAAACATATTTCTCTTTCTGAAG CTGGCIT AATTT 

4 0 GCTTAATATTGTTATCTCTAGGTGCATTCATTTTCTTCTTTATGGCTGAAG 

GCACATGTCTTTAAACTCAGCACTCAGCAGCCAGAG 

AGAAAAATCCATATATTCCACATTTGCTTTAATCATTCCTCTACTGTTAGACACCT 

AG TCTCCITTATTTTTGAC CATG AGG ATGG AATGGG T CAGTG AG C CAACCT ATGGG TGG AGGG CAG AG TTT AAAG ATG ACTGCTCT 

4 5 ATTGAAGGGCITGGCTTCCrCATTTTCCAACCIA 

TGTAACCrCTCTGCCAGTTTTTCACAAGGGCCAGAGTTTTCTGATCATTGCCTTAAGAGC^ 
AACCAGGAGAGGGGAGTGGAGGAGTTAAATAAACGTTGTGGAGTTAGATTACTCCITACCre 
TGACAAGGAACAAAGCCACTTATGTTTGATCCACATGCA 

GTTGTATCCAATGCCAAT ATGT ATATG AGTATGG AGAGGTCAGTG CT AGTCTAGGTGTGCTGGCCCT ATCCAAGATG CCCTACTGA 

5 0 GCCCTGCCTGGCTGGGGTGCCTGGGCACAGATGACGTGCTGCTCAGGGGAGGCAGAAGGAAAGGCAGAATGAT 

TCTTTTCTCrGAAGGGCAAACGGCAGAGTCrGGGATAGATG 

TCACTGTGAGTC!AGGAATGTGGGTCC CCCAGTCCACACTTCCCCCTTAGGGTGCCCAT ATGC C AGGCAGGAAGGGGAAGGCAGAGC 

CGAAGTGAAACCCGGTTTGATTCAGAGTGAGTGCTGGGACAATGGAGTGGCT 

TGGCTCTTTAAGACCAGGGAGATGATCTTTGTCTCTTCATACT 

5 5 GTGAACCAGGGATTAAATCCCTTTTGCCGGAAGGAAGGAATACCCAGGAAGAGCAGCTCATTG^ 

TTAGCAAGG AG AAGAATTC CAGGTGCGTATGCTATGTGACTCAGTGG CCATGGTTCTCAGTGGAGTGTC^ 
AGCAGGTAGAGGCAAGCAGGGAATGGCTTCACTCCTGCCATTCTCAATTCTGAGA 
ATTTCTCATGCCTGTCTGGTAACAAGGTCCCTCATGCCCTACACAGGAGCGGTCTTTTGC^ 
TGTCTCTGGGCACCAGAGAGGTGTCCCAGCGTAAATGACCGCATAGACAGCTCriTTCrr 

6 0 TCATTTCTCCCATATTCTCCAGCCCATGACAAATCGAGAAAGCAAGTTACAGCTCAACTl'GAAC^ 

AACAGACATGCAGAAGCCAACACAGTCTATGCACCAAGAGAACCTTACAGGTAAGACGTC 
CTCCTGGGAACrGAGAGAGATTAGTCCTGTTCTCAACAGGCAGGTGCAACCCATTCTCCT 
ACAGCATTGCTGATTGTTGGCAGGCTCGCTG(nTGGGAGCATAAGAATTC!AACT 
CTGGGAATATCTGCCATCTAAGCACCACCCACAGAGGAAACACATCGAATAGA 

6 5 TTCTCTTAAGC^CATGCCAATGGACnTCAAG(ZAAAACCATTGATA 

CCACACACrTATTCACTACTTAATGTGGTGTG^ 
AATTAAGCCATAATGTCAAGAAGAAAGAGGCTTACTAGTGTTA 

TAGTCTTATGATAAAGTTATAACCTTACAAAGAACATTATTACATCATGTATAGTCCATCTGAACC 
AATAGGAAGATGTTCCTTGATTACCAAAGAAACTAAGACACAGAGCCTTGAAACGG 

7 0 CAAAACCTCTAGTCCCCCTTTCTTCTGAGACCITATTTTTAGCTACTTAA 

T AAAGTCCAATCT AG AAG CATT ATTG CAATGTG ATG CTTT CATTT CCCTTTCTTT CCCAG CTCTGTC CTGGAAT AGTGTTTTTCAA 
ATACATTTAATGTTTTCTCTGCTCrCCTGGTATCrcGTGCCAACC 

TGAGAAGTTCAAAGCTGTCCTGAATTGTATGATAACCTCTTCAGC 
75 ;^GNNNNNNNN*mNNNNNNNNN^ 
ACTGGGGCATATAAAGTTTGCAAGTCCAATGGGCCTCTCTC 

GAGTCAAGAGCTCCGGGGTACTGGTTAGCTCATAATGTTGTTCCACCTATAGGGTTGCAGATCCCTTTAGCT 
TCTAGCTCCTCCATTCGGAGCCCTATGATCCATCCATT^ 

CTCACAAGAGACAGCTACATCTGCGTCCTTTCAATAAAATCTTGCTAGTGTATGC^ 
5 GGTGGATCCCTGGCTATGGCAGTCTCTACATGGTCCATCCTTTCATCTCAGCrCCA 
TTTGTAATAGTTTGGGGTTTACAGAAAAGAAAAATTAAGTTCAAACAGGT^ 

TTGTTCTGAGACTAAAACT CAGTC CCAGTGTTGTTACACAAAG TGCAATG T AACAG AAAATGG CT CAGGTCAGGGCCAC CCAGCTT 
CCK3TGGGGGCAGAGAGCTTGAATGTGGTTCTTTTGGGGATT 
CTACGAGTAATTTATGATTCAGAGTCCGACAGCATATATAGTTCTGTCTGATTAA^ 
1 0 TAATTCAGAATAGGCCACGATCTCTGCGGCC 

GGATCTTTAGAAGGGTTGACATGGTCGATGTGGGTCXSTGCTTCAGCrG^ 

GTGACTCCCCTCCTCCAGACATTAGCTTTGTCCTTGGTCAGCTCCCCCAAGTCCTAG 

CTCACGTGTTCAGCAGCTGTACTGAGAGGTCCGTCCATCTGCTCCACCCAGCT 

GCATGGGTCAAATGAAAAGCACATTCCACCTGGGCTCCCGGCAAGCATCCCCTCCGCACCTGCCGCTCCCACT 
1 5 CCTCTGTCTTTCTCACCTCATCACCTATCCGACCCTTGT^ 

CTGGTCACTAGATAGAGCAATGACTGCAAACTAAAGAAATGATTCCACCCAAGTATCAT^ 

TACTCTTGGGAGCAAGGATAGTTCAAATTCAGCTGCTTCATCAGAAA 

TGCTGGAGGTCCCAGGGGGACCGGCGGACAGCCCCAACAGTTTGAGTCTCC^ 

AAG AG AAGAGGGGCGGTGAG CTG CAGG AG CAT CCTGT CATCTG TGGGGTTTGTTT ATTTT ATTTTT CTGAGTCTT AAAGAG CCG CT 
2 0 CT CCAATTCAGGAAAT AGCTGCX3(^TC CCTGGGTTCCATG C CTTCCTACAATG CCT ACATTTCTGACAG CTGG AATCT AT CAACAC 

TTGTAGGTTCAGTGAAAGCAAAACAAGTAAATAAGAACCGATGTCCA^ 

ACATGAAGATTCCAAGAGAATCTATCATTTCGTGGAAGGACAGAGTGTTCACTAC 

G AGGTCCTG CTATT AGCAT CTGCAAG ATGAAGTG TGGG AAAACGGGGTGGACT CAGCCCCAGCT CACATGTGTAGATG AAAGAG AA 

2 5 CA.CCACCGATTTCTGGGTAAGTTGTTCCTTTCTTGTGTCCTTCC 

AAGCCACGTGGCCTTCTGGGTCTCACCTTTTAAAACACAGAAA^ 
GGTTAAAGGCAGGGAAATGAGGATCTCTCCGGTGCTCCGAACCATACAGGATGCT 
TTGAAGCCAGAGCITCCACAACTTCTGCCCATCACCAAGATATCCT 
TTCTTTGTACnTATATGTCATTTGTACTGTGTTCCATGGTCATTACAG 

3 0 AAGTGTGTGGGAAGTGACTTOCCTTAGGATGGGCCTTCACATGCCCAGTCCrGCT 

GCCATCATTGTTTGAAACCTTACTACTTCCATCATCAAAAC^TGCTCATGAGTTTTTCT 

(^TTTCATG ACTTCCTCTGTGGTTCCCITGGGATGCTGTTTG CTTCTCAAAACTTCCCTCCAGAGAGTCACCC CCATGG CCAGGG C 

ATACCAATACTTCCTAGCAAGACCCIGGGTTTCTGAGTTAGGGTT^ 

GGAGGAGGAGGCTTATTTGACTTTCATCTCCACATCATCATTrATGATAGAAG 

3 5 AATTGATATCATGG CCATGGAGTCATGCTGCTTATTGGCTTG CTCCTCATGGTTTGCTCAG CCTTG CTTTCTT AT AGAACCCAGG A 

CCACCAGTCCAGAGGTGTCCCCACCAACAATGGGCTGACCCTTCCCAC^ 
TACAGCTCAATGTTATGGTAGCATTCTCAATTGAAGTTCT 
GTCTCCATTTGGTAGTGGTGGTGTTCTCTCTCrCTCT 
TGTGTTATTCTGACCCATTTCCAAGTCAGGGGTGGAATCCAG 

4 0 TAGATAGCCACTGTCAGGGTTAGAATT*CCCGTTTACATACTTTTCTATGGTCCCT 

TATCCCAGAGACATGCCCCAGAAAAGCTGCCAATGTTTTAGCGGGTATTTCTAGGAACAGGAGCCCCAGATGGTGAGTG 
GCATTAGATCCCATATCACCATCTTTGACTCCAGATACATTGCTTTCTTTCTC 
ATTCTTCTCCCGAGAGTGAGACTTCCTGCCCCATAACCACC^CAGGTATGGAG 
CAAGCAAGGCTATGCAAGCCCCTCAAGCGATGAATACAAAGAGGAAAGGAAGCT 

4 5 G AAT ATGGCCAT AGCAGCIATAATCAACCCAAGGCAGTCTGGGAAG ATG 

ACCTGAGTTTGATCCCCAGAAATCACACTCTATTTTTAAAAGCGCTAGTA^ 

CAGAGACATGGTG ATCTCTG CAGCTTGCTGGC CATCAGCTAGTGTAGCAGAACTAATG AGCTCAGGTTCTGTAAGAGATCTCCAAG 

AAATAAGATAGTCAATTAAGAAAGACATCTGATGTTGACCTCCGGCCrrCCATAGGC^ 

ATGTGTACACCCTCTCTCCACACACACACACAATCTAAGGAAGGAGAA^ 

5 0 GGCCCTCACATCGCTTCCCTCAACCTGAGTTGTAGCCGGCGGCGGCCACGAA^ 

TGGGAAGGACG<5TGAGAGAAATAATGAGCCAAGACAAAGTTCTCT 
GGAAGGAGGGAGACTC^TCCCCAACTTTGCCAGGATCTCCATTGCTTTGTCAGGT^ 
CAGGTAGCTT(^TGCAGGAAGCTGCAGATGTGGCACACAATAGACTTTACCT 
GCTGTGGCCGAAATAGGTCTGATTCAGCCGCTCAAGGCTGGGGGAAGGGCX3CACTGAGTC 

5 5 CCGAG CCTTG TGCAGTCCT ATG CTAGC CAT AAAAT AAG TATCC CCT C AT CTCTCTTCT 

AACTG CAATGACGG AGACA1TTGTGCT CACAATGGAGT ATAAGGTAGCAGGTGGGCCAGCTC^ 
GCATGCAAAGACGTGGGTAGTTTAGCTGGTTGAAACGGCCACTAATTCTTTCACT 

G CT ATCATTGCT CAAAATCCCATGGGTCAGCTGTAGTACGTTGGGAGTTGTATCAG CT ACCTTTTGTGTTGCTGTGACCAG AAGCC 
TGACAAAACAAACCGAAGGAAAGAAAGGTTTGTTCTGACTCCCAGTTCTGGGATCCAGTCCA 

6 0 AGGAGACTGAGCAGCTGGTCACAGTGCATCTGTGGTCAAGAAACAGA 

ACTTGGAAGGCAGAGGCCAG CCTGGTCTGCAT AGTG AATTCCAGTATAG CCAG AArcACATAGTG AGACCTTGTTTCAAAG AAAT A 
AAAGAATTTTTAAAAAGAGAAAGATAAATGTTTGTTCCCAGCITACTCrrC 

AG AGCAGT CTCC CT ACCCATTT AG CCTAATCTTG AT AACCT CT CAAGG ACG TG T CCAG AACGTT ATCT CTAATGTGACTCGAAAAC 
CTGTCAATCAACAT AAACCAAGAG CATCATCAGAGTT AAATCAC CCAAAACCAGTTTCTCCCACATACATTTTCAAGAACAATGCC 

6 5 TCCCAAACTTTGGTGAGAGACAAAGCAAGGATCCAGGGAGAGAAAATATAGTGTCC^ 

CAAAAGCAGGCCTTCCTGAACCACGGAATCAATGGAAGCTTCTGGAAGATTC^ 

GGCGATATCGGTGTXnTTAGGATTTCTTATCTTGTTTCAATAGTTCCTTTATGA^ 

C^CCTTCCTCAGGTTGAGCTGTTCTGTCAACTACCAGCCT 

CCTTGATCCTTCTAAGAAATACCCAGGGAGAAGAGGAAAGAAAGAAGAAACTGCAGGCCC 

7 0 CT ACAG AATTTT CTCACCTGATCT ACAGATCCAAC C CCCGAT CCATGCATTCC CAAAG CATGGTCCTGACCTG AACTCCATCTCCC 

TCTCTTGACAGTGGCCAG CTGCCTCTTCCTG CTCATCAGCAT CCTCCTC CTGAGCGGGCTCACCTGGCAACACAGATGGT AAGTCC 
AACACTGGAAGGACCCAGCACCCCACTGTCTGGTCCCTGGCCTC 

TGGGAAGATGGCTCCGACGAAAGCATGAGATGGGTGTGCAATTCCTAAAAGCTTTGTTTAAA^ 
TTTGCCTATAa.TTGTATAATTCAAGAGCTGAGGAGGCAGGCAGA 
7 5 TGG TGAACTCCAGGTTGAT CAGG AGACTCTATTT CAAAAGT AAT G TGAATAAT AT ACT AAGG AACAAG AUT CCAGG TTGAC CTCT A 
GACTCCACAGTCATTT ACTCCCTGACACA^ C AAATGT CA 

CCCCTACTCTCAGAGGTCTGTGCTGTTTAGAGATTAGGAAAACCT 

TTAGAGATCTTCAACCTTGCCATCCATCCACCAGGGCTCT 

TCCGCTGATC^TCGACTAGAAACCAGCATGGGTAGCAAGGG^ 
5 ATGGATGAAGGTTGCTAGATACTCACAAGGCTTTAAAAAAAACAT^ 

GCAAGAGCAGGGGGAGGCTCTGTATTCTGTGTCTGTGTGTCTGGCTGTGTCrGTGTC 

TAGGCCCTAAGTATCTTGGTCAGTGATATCTGCCATGGCTCATATAATGTCTTTTTATGC^ 

AGTAAAGAGTTTTGTGTGAGACAGAAACCAGAAATCAGAAATCTAGGTGAACATACAAATTGC^ 

GACACAC^CAGACACACACACAGACACACACACAC^ 
1 0 CAGACACACAGACACACACACACACACTGGGGGGTTG<5AGGA& 

GTTGGAGACAGGAAGATCATGAGTTTGAGTTTGAAAGCAGTCGGGC^ 

CTCAAAGTAAATGAGTAAATAAGATGCCAAGGGATGTAGTTCATTTGGTAGAGTGA^ 

GATCTCAGCACCATAAGAAACCAGGTATAGCGGCATAAGCCTCTAATACCAGCGCTCAO 

CAAGGT CACC CTGGG CT ACATG AGATCTT AT ATT AAAAAACTAAACACATT ATTTTTTAAAG ATT^ 
1 5 TGAGTGCACTATAGCTGTCTTCAGACACACCAGAAGCGGGAGTTGGATC^ 

GG AATTGAACTCAGG AATTCTGGAAG AGCAGT CAGTGCT CTTAACC^ CT CTCCAG CC CCAATACATT ATTT AAAGAA 

GAAGAAGAATTAGAAGAAGAAGGAGAAGGAGGAGGAGGAGGAGAAGAAGAAGAAGAAGAAGAAGAAGAAG 

AGAAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNGAGGAGGAGAAGGA 
2 0 AAGAAG AAG AAGAAGGAAGAAGAAG AAGAAG AAGAAG AAGAAG AAGAAGAAGAAGAAGAAG AAGAAG AAGAAGAAAG AGTTAAGAG 

TATAAAACCATCTTGAGAAAAGCCAGATACATCCTCATTAGAGTCAGCTGTGTGAGATGTTCT 

TGTGGGTGATATAAAATACCCCAACCAAAGCAGCTTAAGAGAGAAGGGGC^ 

TGTGTGGCAGCTAAGGTGGCCGAAACCCGAAGCCTTTACTCATATCACACC 

ACACTTATCCCAGCTCACTTCCTCTTTCTATACAGTCTAG^ 

2 5 TAAAAACAATCCCCTGCAGGCATGCTCACAGATGACCTAATCTAGGTAATCCACCATTGAGAT^ 

TGTTAAATTGGCAATTAACACrrAAATCATCACACGCCTGACGTCAAGGGTGTGCCCCTCCT 

GAT ACTTTAAAT CTTTAATAACAGAGTCATAGGT ATGTGGCTGAGAATCCAGTTTCTT CCATTCTTTTGTTGTTCATCTGG CCTCC 

TTCTCATTTTCCTCAAGTGGAGCAAGAAGCAGAAGGCATTGAGGTCCACATGAGGGTACCCCGAT 

TCATGGCCCTGACACTCTACCCATTTCCTTAAGATGTTCTTTTAGAG 

3 0 CTAGGCCCCTCAGTCCTTGTGAGACCTTCTTCCTGCTCAGGACGAGCATCTACC^ 

GACTCAGAACCAGAGGACAGGGCTCTTCATGAGCAACAATAGCT^ 

GATTCCCCTAGACCTCCCAGTAGCGGGGCCATTAGGGAGACTGAAATAGAATCTCAATATGACCCCTTTGGTC 

AAGAAAACAAAACAAAACACTTCTGAAATCCCAGCCACAAGGAGTCGAGCATAACAAG^ 

GCTCACGTTTGTACATATATATATATTTACATGTGTGTATAT^ 

3 5 TT CAT CT AATGCTGC C CT CTTGTGGC CAATTGGAAAT CTT ACCTCTGAATT CTG AAAATTTCAGT CTTTGC CACTTTGG AACACTT 

ACCAGGAGCTAGGAGGGCACTGGCTG^XrrGACATTAGTTAC 
TTGTTTGTTTGTTTGTTTGTTTGTTTGTTTGTTTGTTTG 

CTCACTTTGTAG ACCAGGCTG^CCTCGAACTCAG AAATCCACCTCCCTCTGCCTCC CG AGTGCTGGAATTAAAGG CGTGCACCACC 
ATGCCCGGCCTCATTTTTTGTTTTAAATGTTCGCCCTGK3GAATTTAA 

4 0 CT ATGATCCAGC CCTTTGAACTG TTT TGTT AAT AC1TTT CTCT CTTTTTCTG TTT CGT AATGTTG ATTTITGG GTGGG ATTGG AGC 

ACGAGCATCCGGAAGTTAGAGGACAAGTTTTGAGAGTTGGTTTTCTCCTTC^TTCACT 
TGAGACATCTTTCTCCCCCAGTCGGATTGTTAGGTTTTGTGTG 

GTGACCCTTTAATATAGTTCTGTATGTTTAATGGCTAATCTTACTCCAACTCAGCAGTTA 
AATCCACAACTGCCAAGTGTCAGACATTGCACCGCAGK3C 

4 5 AAGAAAAAAAAAACHTGAAATATTTTTGTGGTGCTGAGGATACAGGTCAATAGTAGAATGTTTTCT 

AGTATAGAAGAAAAAGAAGAAAGGGAGATATGGGTGGAAGGAAATACGGAGGGAGGAAGGGAATGAGGGAGGG 
GATTTGTTGAAAGTCTAAGATGGAGATATGTAAGGAGAAGGAGATTTAATCT 
ATACTCTGAAGTGTCAGATAAAACGAGCCAGGAAACATCTCCTGGAAGAGGAAGC^ 
CTGATGTCTGAGAGAAGGTTGAGTTGTAGGACATAAAATGAATGGGCACCATGGCCACAATC 

5 0 AAAT ACTGT CATG GAAG ATG AGGCCG CATTGAGG AG CTGGG C C AAG ATG CTGATGGTGGTGGCACCT CAG ATCACCTGAC CT AGG A 

GAGAAACCGTTATCTTGGCCTTCTGGGTGATCrGCTGCGTAGAGGCTTACCGTGGGCAATGTGTT^ 

TGGAAT AAATCCACATAG AAG TTGGT G ACTGTG CTTTTGTGGG TCT GTGG AT CTG AG TGAATGGAGG AAACAAACCAAGG GG TATA 
GAAAAGAATTGAGCAAATTAGTATGACTGGTAAGCTGGATAAAAAGATGGAGTATGGAA^ 
GTTTCTCAGACAGGGAGCCTCTGTAGCCACTACATCCTAC^^ 
55 TAGCCACAAGCATAG CCCAGTAGGACATCTTG CTCAG ACCCACTCAGCTCTAT AACCGGC CTAGAGTT AGTTGTGTTGTG AG ATGA 

GTGTTGGGCCCCTGGGTGGTTGCTGGAGCTTGAGGCTGTTTTTCGTCAGGTGGGAGG 

TTAGACTCTGAGCTAGGGGGCrCAAGGGGACTGTTCAGATGGAATGGGTTT 

ATGOVTGGCTGTGACAGGGTTATTCTTCTTCCAGGAGGAAGAGCAGAAGAACCATCTAGC^ 

6 0 AGCGGATGGGAATCACAAAGCTCAAGCCAAATCTGAGACGCCAAG 

AGAACATTCTGAAGTCACAGGACATAACAGCAACTCTATCACTAAACTGGA 
CACAGCAGTTCTAAAGCTTTAATGGGAGAGAGGGCCCAACAGTGCTCTGTGTG 
GAGATGGTGTGGTCACTTTTCATGTAACATATAGTATAGAAAAAGTAGCTTTAGGTTGA 
GCCCAAGTAAAACTCAGCACTAATGTAAATAATTCCrrCCrCCTCCTCCT 

6 5 CTCCTCCTTTTCCACCTCCTCTGTCCCT^^ 

CTACACGTGCATGGCTGGCTCCTTTTTCAACCTCTGTTTGCCT 
ACTCTGAGCTGGCAAAGATGACTCTATCTATGTTCTTGGCTCAGTCCCAGA^ 
ATCCTGAAGAACAGAAACTGTGGCAGAACAATCTTCAGATAACATC^ 
TATTCAGAAGCCTGTGTCTAGGGTCTGCGCCAGGAGCAGAAGGCTGAAGGAAGTCCCA 

7 0 GGCCACTCTAAGTOCTGGTTTCTTTGGGACAG 

ACTCfUVGGCACAGGTATGGACITGGTACATACCGTGAAAA 

AAAAAGGAAGCAGAGGTGGTGATTATACAAAAGATTAGCTGTAGACTGGATATCCCAGGCATC 

C n XCCTGATCCAf>CTCACCAAAGCCTTGTGAGATCAGACTG CAG AG CCAGTCTGTCTCTGAGTCAGT AAATGT AGAATTTGGATTT 
CTCACAAGTT CCTGG CGG TGTCTTTTTTTTTTTT AAT ATTTTTT ATT AG G T A1~I"X"TCCTChTTTACATTT C CAATG CT ATCCCAAA 
75 AGTCCCCCATACCCTCCCCCACACrCCCCTACCCACCCACrCCCACTTTTTGGCCCTGGTG 
TTTGCAAGTCCTGGTGGTGTCTTTATGCTG 
ACAGATACCAAAAGCACCTGGGTTTGATGCTCAGACTTCTC 
AGTTGAGTCAGATTCTCATGTTTAACAGTGTTTTAGGGGATTC^ 
AAATTTTATGAAAAACTTTCAAAGGTCAGACAGAGCCATTTTGAGTC^ 
TGGTCTCCCCTTTCCAAAATATATCTTTTGTGGGGACACGGCAAGGAGGAAAGT^ 

AG AAAAAAAT ATT AAACTCTT ACTAAAG AG CTGTGGGT AGTGGT ACACAC CTGT AAG CC CAG CT CTCAGGAG ACTG AGGCAGGAGG 
ATTGCAGTGAGTCAGAGATCAGCTTC^CCTACAAGCAAAACCCTGCCTTACCCCTCAAC 

T AAAATT AATTTTT AAT AC AT ATTTGTG CACG ATGTGTGTG CCTGGTG CC CAGGG AAGCCAAAAAAAT ATGTT CAATGTCCCTGG A 
ATAGGTGGTTATGAGCTGCCATGGAGGTGATGGGATTTAAACCCCTGTTCTCTGA 
1 0 ATCTCTACAGCCCCATTAAATTGAATTTTATTGTCATTACTCA^ 
CTAAGATGTTTAGCTATTTTACrCTCCTTCAC^ 
TTAACACATAATGGATTTATCTTGTTAATAAAAGAATTAGT 

AAT AT CT AT AGAT ACTAGTGT ACATTAAG AAAG C CTTTGGG ATCCTC AATCATTTTG CATGGTTTTAGTAATTTTTT AAT AACAT A 
AAGAAGGTCTGACAGATTATGCTAAAGAGCTATTGTGGTATGGATTAGAAATGGCCCCC^ 
1 5 AGCAGTGCTGCTTTGGGTTTTGGAACCriTAAGAG^ 
CCAGTCTCCCTCTCTGCTGTCCCAGGAAGCCA^ 

GCACGTACTGGCCACTGTGTAGTGTACCCrCTAAAACrGAGCTAGAGCTCCCCT 
TCACGGTGATGAAGAACATAAGTAAGGCAAAGACAAGACATAGTTTGGAGACTCAC^ 
TGCGATGTGGGAATTATCAAGCATGAGATGCAAAGCAATGGAAAATGAATCGTTATGACAG 
2 0 TCCCATTAGTAATAATAGCCCGTGTTTTAGAAGAACACATCTTTT^ 

AAGAGAAAT CTGGGATG AGGGAACCTCAGTGAAAGATTCTCCTTGATCAGATTGTCC CCCAAC 
CCACTGTGAAAGG CACCACC CATTGG CACX3 ATGACTGGGGTTAT AGAAG CAAACAGG CTGAG CATG AG CCAGTGAG CAAG CCAG T A 
AGCAGTGGCTTCTCTTTGGAAGAAGC^TGGCTTTCTTCCG 
ATAAGCTAAAATAAACCATTTCrrACCCA™ 

2 5 TTTCTCCCAAGCTAATC^TGTTTGCCAGCCTCCTTCAGCTACCT 

AGCAGTGTGTGACGTCTGGGCTGTGCCTTGAAGAGAACATAAGTGGCATTTTC^ 
CTTGTCTTGGGAGCTAGAAATCCCAATGGCAAGAACCCCATTT^ 
TCTGATGCCAGGGAGGCATTGCAACATGGCTGATTGCATTGG^ 
TACCTTGTGAGGACAAAATTAACGATCACTAACIACCGGTACrrGTTACCAGOT 

3 0 TTACGGTAATACTGGCATCAAAATATTTTGAGAGAGAGAGAGAGAGAGAGA 

GTTGAATGACTCTGCCTGAGAGTTCCCCTCCAGCAGGCAAGTTCCTGAGATGAA^ 
CTCGGGGCTTCTGGGTAAGTCATGAATCGAAAATAGAAAGCTTCTGGGGTCCCTAAAGCCCTTCCT 
CAAGCGTGAAGGATAAAATACATTCCCTGTTCTGCTGTAATATTAAGGATTTGGATTA 
AAAACCTTATAGTAAGGGGCCAGTGAGATGGCCAGTGGACAAAGGTACCTGCGGCCAA 

3 5 CCOVGATGTTTAGAGACCTGGTGCCATTAACTCCACATGTC 

ACACACACACACACACACACACACACACGAAGCCCTTATT^ 

CACACACACACACACACACATCAGTTAATTTTTGACTAGCTCAAAG^ 

CTTCTGGTATTCCTGAGACTCCTGAACCTTCCCCTGTC^ 

TACATGGCTGGTTGGCTTTCTCTCCTGCAGTGCTTCCGTTCTATCCTTCTCCCCTGAGTC 

4 0 CCAGTTCCTAGCCTGCGCAAATCTAAAGGTCCCACCITAGTCTACCTGTCCAGCCATT 

AAACCAATTGGGGACAAGGACCTTCAGCGTTAGGACATGCAGATTCCC^ 

ATCCCC^CACTTATATATTAGAGGCAGAACTTGGGGATACAGTCTCCTTGAAGCTAA^ 

GTGACATAGAAGAGATAAAAACCCTACAAACrTGGTGTTTTTGTTATTGAGACAGCGTCT 

GTTT AC AG TC CT ATTT CAGCT CTCTGGTATTT CAGGT ATG CAT CAACTT AC CAAGCTAT AAACTTGG L"l"l"l" CTTGAGC CTTGAACT 

CTC CA.TTT ACACAATG CCGG AG AT AG ATCACAGGG CCT AGTG CAG ATT AAGCAAG T ACTCT GGCAAAC CAG AC CAAAAAAAAAAAA 
AC CAAAAACC!AAAAACCAAAAACCAAAACCAAACCAAACCAAAC CAAAACAAAACAAAAC CTGAG ACAG ACAC CAG T AT ATT AGG A 
TACTCTTGAATCCTCCC1TTTTCGTCCCCTCCAAAATTGCAACTAGAGTTGTAAGTTCTGCCC 

T AAACTGAACT CT AGTGG T AAATG T CAATGAGGGGG ACCTCTG AT CCACAT ATGT CAAG ATGG AGGTTTGG AAACACT AC^ CAG 

5 0 AAACTCCATAG CTG AAGTCACAGG ATGGCGGGGTGACACTCTGTATTCCAGCT AACAAGCT AGGG AACTCG ACTGTGTGCACGTCC 

CTTGTAACCACCTACACATGAACTGGX1AAACCTAAGATCCCTGGATGTTAATCTG 
ACCAGAGAAGTGGGGGAGCAGGAAACAACAGAACTGGTTTCCTTCCTACCCACTCACAGC 
ACAGAGTACTTAAGGCCAAAGCTTGAGACTTCTTGAAGCTTTGTG 
AAGCTGAGGTACCTGAGAATTTAGAATGGTCTAGCTGTGTTTTGAGACCAACC^ 

5 5 AAAAACAAGGGAAAATTAGAGACGTCCTCTGAATTTTCCTTTAATAAATATGGCTCAAAC^ 

ACTTGTAAGGCATGGTTAGCTGAGGACACAAACAGGCATACCTCTCAGGGG 

AAGTGAGTCTCCAG ATGGG CACTGTCCTGGG AATTGAATGGACACCCACAGACATGCAGGCTGCTAC CTGCTACACTCCCTCTCCA 
TT AT ACAAC AT CTTTG ACC CCAAGGT CAAAATTT AAACCAAATT CG AATT G AGTT AAT AATT AATCT AGG AAATT C CTG CAGTGGA 
TAAAGGAAATAAACTTGGGATGAAAAATATTCAAGCGAGGACACTGAGGGCTTAGAATCAGATATTAACCACT 

6 0 ATTGTTCTT AC CCTAGGTTTGG AG G CCAAT CTGTT CCCTTGTCTT AGG AAAAAT AC AT AAT ATTTTG CT C CACAAAGG ACACG TTT 

GGGAGTCATGGCCATTTCCTTTCCTCTCCAACAATT^ 

GTGTGTGTGTGTGTGTGTGTG TGTGTGTGTGC^TCAAACACTCAACAGG AAG CAACTT A^ 
GTTCT AGGG AATGTGGCT CAT CTTGGCAGAG AAAG T ATG ACAAG AG ATGTAG AAG AAG C^ 

GGG ATAGTGATGGTCAGTTTG CCCATGAAACGGT ACCAGACATTTCAGATGX3TTCTTCCCT ATTCGCAAGC^ CTT AATAGACCCA 

6 5 CACAG AGG TGCGTTT C CATGGTG ATTCT AAATTCAAT CAAGTTG AAAAT CGTCATT AACAATT ACAAATTCAAAG AAAT C CT AAAA 

AAATACTCTTCACCACACTCGTATTTTTAACTGAAAAATACGATCG^ 
AACTTGACACAGCTC^AGTTATCACAGAGAAAGGAGCTTTAGTTGAGGA^ 

7 0 AG CAAG CCAGGAGAAGCAAGCCAGTAAGT AACAT CCCTCCATG^CCT CTG CATCA^ 

TCCTGACTTCCTTGGTGATGAACAGCAGTATGGAAATGTAAGCTGAATAAACCCTTTCCTCC 
TTTTGTCCAGGAATAGAAACCCTGACTAAGACAATTCCCCAAGCTGAAACTTAAAA 

TG CTTTCCTTG CCTGG T AAGTTT AAAGAT ACGTTGTTTTTT CTAT CAGTGTTTCAGGGTG T AAAG ACCAATT AGAAT AT ACT CAGT 
CAATTTTTCATACAATGTAGCTCAAATACAAAACACAT^ 
7 5 CCTGGCTGGTTGTGATGGATATCCAGACTTCCGGACCCACATGGAAA 
CTACGTAGGCTGAGGGAGAATTCTTTATAACAGTCTTACTC^ 
ATGCTCACTGGTTCAAGAGCTTCATGATTGATGAGCAACTACTTTCT 
TACAGTAACCTTGATCAAAAGTCAGAGTCCAGGCAGGCCATAGGTACCACTAG 
NNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNITONN^ 
NNNNNNNjlNNNNNNNNNNNNN^ 

NNNNNNNNNNTJNNNNNNNNNNNNNN^ 
1 0 frJNNNKNNNNNNNNNNNNNNNNNN 

TTGGTAGACCCTATCCAGGTTGTGAGCATGTAATCACAGCTTCTGAGTGC^ 

CACACACACACACACACTCTGTGGAAATAGAAAGAGAACTATCTGTAATGATAGAAGAGATC^^ 

GAGCATGAATATGATGCAAGTGC^TCCTATGTACATGTTTATCTGTGAGAAAAGAGA 

TAATGACCCAAATGCATTATATATACACGTATTAAAATGTCATAATGAAGTCTATCATTA 
1 5 ATATGTATCAACTTTAATTTTTTCAAAACAAGAAAT^ 

GCAGGAAAGCAAAAAGTCTTGTCTAGTTTAAACAATTCTAATA^ 

CAGCAAAATGGCTCAGCAGGGAAAGCACTTGCCACGAAACTTGACAACT 

GAAACAATTCCC^CACGTTGCCCTTTGACCCGCTTCTTCCAAGCAATGTGGAGAC 

CACAGTTCTGTAAAGAGTCAGAGTCTACAGAGGTGACCGTCCACAAAGGCAGT^ 
20 GCTCACAGTTCATTGTGAAGGGGGAGTCGGGGCCAAGAAGAGCT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNW 

TAGCTATCTTCAGACACACCAAAAGAGGGCATCAGATCCCATTA 
CCAGAACTCTGGAAGAGCAGTCAGTGTTCTTAAGGGCTGAGCCACCTCTGCAGCCCT 

2 5 GATATTGCCCACAGAGAGCTCAATTGTCCCTCATCGATCATC^TCAAGAAATGCTTTAGAGACTTGCA 

MOUSE SEQUENCE - mRNA 

AAAAGCCAAGATGACAGACTGAGAGCCTGAGCCCTTGTCTTGCATTCT 
GCAGAGAATTTCATCCAGTTCCCTCCTCCATCCn'GATCCCATGTGCCAGGAAGATGGA 

3 0 TCATTAACCATAGTACCCAGTTGTCGGGCAGAACTC3TGTCT 

CTACAAGAACGGCACCATCCTAAACTGTGAATGCAAGAGAGGTTTCCGAAGACT 

ACTCCTGG AGCAGCAACTGCCAG TGCACCAGCAACT CCC ATG ACAAAT CG AG AAAG C AAGTT ACAG CT CAACTTGAACAC CAG AAA 

GAGCAACAAACCACAACAGACATGCAGAAGCCAACACAGTCTATGCACCAAGAGAACCTTGCAGGTCACTGCAGGGAGCCACCT 

TTGGAAACATGAAGATTCCAAGAGAATCTATCATTTCGTGGTTGGACAGAGTGTTCACTAC^^ 

3 5 TACAGAGAGGTCCTGCTATT AG CATCTG CAAG ATG AAGTGTGGGAAAACGGGGTGGACTCAGCCCCAG CTCACATGTGT AG ATG AA 

AGAGAACACCACCGATTTCTGGCTAGTGAGGAATCTCAAGGAAGCAGAAATTCTTCTCCCGA 

CACCACAGATCTCCCACAACCCACAGAAACAACTGCAATGACGGTGAC1ATTTGTGCTCACTATGGAGTATAA 

GCTGCCTCTTCCTGCTCATCAGCATCCTCCTCCTGAGCGGGCTCACCTGGCA^ 

CAAGCTTAGAAAAGTCAGAGCCCAGGCAAGCGGATGAGAGCCCAGGCAAGCGGATGGGAATCACAAAGCTCAAGC 

4 0 GTCCAAGCATTCACCTAATCGGCTGTTTCCTTCTGATCCCT 

ATCACTAAACTGGACTTTGCCATTGAAGAATAGGATCTAACCACrrTCAGCACAGCAGTT 

AACAGTGCTCTGTGTGTTTTGrrTTTTGTGTATATCTGTTGA 

TAGAAAAAGTAGCTTTAGGTTGACTTCATTGTTACAACCGAGT^ 

MOUSE SEQUENCE - CODING

ATGTGCCAGGAAGATGCAGCCACGTTGCTGATGTTGGGGTTTCTCTCATTAACCATAG 

GTATG ACC CACCCG AGGTCC C C AATG C CACATT CAAAG C CCTCT C CT AC AAG AACGG CACCAT CCT AAACTGTG AATG CAAG AG AG 
GTTTCCGAAGACTAAAGGAATTGGTCTATATGCGtTGCTTAGGAAACTCCTGG 

GACAAATCGAGAAAGCAAGTTACAGCTCAACTTGAACAC CAG AAAGAGCAACAAACCACAACAGACATG CAGAAGCCAACACAGTC 

5 0 TATGCACCAAGAGAACCTTACAGGTCACTGCAGGGAGCCA.CCTCCTTGGAAACATGAAGATT 

AAGGACAGAGTGTTCACTACGAGTGTATTCCGGGATACAAGGCTCTACAGAGAGGTCCTGCTATTAGCATCT 

GGGAAAACX3GGGTGGACTCAGCCCCAGCTCACATGTGTAGATGAAAGAGAACACCA 

AAGCAGAAATTCTTCTCCCGAGAGTGAGACTTCCrGCCCCATAACCACCACAGACT 

5 5 CT CAC CTGG CAACACAG ATGGAGG AAGAG CAGAAG AACCAT CT AG 

HUMAN SEQUENCE - GENOMIC 

CTGTGCCTAGGCTGGAGTGCAGTGGCACAATCGGTAG CT CACTGCAGCCTC AACCTT CCAGGCTCAAGG ATCCTGCC ACCTCAGCC 
TGCTGAGTAGCCGGGACTACTGGCACGCACCAACACACCCAGCTAATTTTTAATT^ 

6 0 CCTAGGCTGGTCTCAAATTCCTGGTCCCAAGCAATCCTCCTGCCTTAGCTTC 

ACCCAGCCCACTTCCGCTCTTAGGGAGAGGCGAGTGCATTTCAGAGAGCACT 
CGTCCCCTCAGCCCTTCACACAGCCTCCCCTTGTCTTCCTCTCCTACTCAGCACTGTGGCT 
AGATCCTCATAAAGAGGACAAGAAGTTCATCCAGTACCAGTCAGGCCAATATCCCAAAATT 
CATCAACAACTCCTCTTGACAACAGGGTTGGTAGAAGAATCAAGGGCCTAATTTTGATGCCT 

6 5 GTACATGCTGCCCAGCC^CTTCCAGAAGCTCCTGGTCCACCCCGTCAAGAAGCCG^ 

GTGCTGAGACTGCCCTCAATGTTGCCCCCAAGAACTGCAAAGCCATCCTGGAAACAGTGG 
CCCAGTGCCAAGCTTGTGCAGCAAAGAAAAGGATTAGACTGCTCATGTGCACGTl" 
GAAAGAAGGAGGAAGTGGAGGAAGAGGAAGGAGGAGGGAGGAGGAGGAGGAATTTACATCAGC^ 
GAATTCATGGTTGAATGTACTGACCATTCTCCCTACGGACACTTGAAGGTT^ 

7 0 GGACTCTAGACTTGGGCCACAACTGAGTGA'ri' ri' 1 1 1 TTTCTCAAACCCAATTCCACAAAGTAAAAGCCTGATGAAATACAGTGAA 

GGCTGATGCCTTATTAACTGTGTTTCCCAAATCCAGAGCTCT 

AGCATTAATTCCATGCTAAGTACTGAACTCAGCACTAGGA 

AGCCTCATTGGAATGATCAGCCGTGTCTCCAGAGAGCTACIAAGGCAGTT^ 

G CATGTC T AAGC^TT AG ACAG ACCTGGG AC CT AG ACATG ACAC CACT CCTGACG AATT ATGTGAGT GTGGGTGTTT CACAACCACA 
7 5 ATGAGATGCAATGCCTGCACTTGTAACATGGAAATAGTGATGGCATGCCGGCCCCGCCAG 
ACATGCAACATTCTCAGCACAGTGCTTGCCATGTAGTAAGGGCCT
ATAAGGGAACAGCCCAGAGGAGGGGGAGCACTTCCAGAGGGAGGGATGCGGTGAGA 
GTGTTAAGGAGTGAGTGAGCTTTGGGCTTATGGATTTAA(mGAGGAAACCAAGAA^ 
AGTTGATGATTTTTATTTTGTTCTCTGGTCTGCTTGGGAACAT^ 
5 TGAAGAGAT AGGCCAGGG CAGGTTATGAAGGATTTTGAATACCGGG CTAAGAAATGTGGGCTTAATT^ 

TCCTAAAATGTTACGTTGTAGATAAGGGAAAAGTATTCTTC 
CCCAGCACTTTGGGAGGCGAAGTGGGCAGATCACCTGAGGTCAGGAGTTTGAGAT 

CCTGGGAGG CAGAGGTTG CAGTG AGCCAAGATTGCAC CACTG CATTCCAGTGGGCG ACAGAGCAAGACTCAGTCAAAAAAAAAAAA 
1 0 AGATTAAACTGGAAGCAATGGGTAAGAATGGTGGAGTGGAGAGACACATTTGAGTGAGG 
TCTAGACAAAATGTGACACAAGCATGCCTTCACCTCCCTGCAAA 

TCCTGGGCCGCCATTGGTAACTCATCAGTGCTGGCTTGAGAGATAAACTCTATTATCCATCCCTGAACT 

GGCCAGGGAGCTTTGCTGCTATCCCCCCAGGAAACACGTCCTCCACTCAA^ 

CAACAGCACTGGTCACCACAAGCCACAAAATGTTAAC^ 
1 5 ACAAACCAACTGTTCTCTCCCAGTGTTTTCTGGGTGGCACAACCCACAA^ 

TTGGGTT ATTTGATT CAT AGTGG ATTTTGGTTTT CCACX3GGAC C CCTGTG C C CTTGTCT AGT AG AAT CTGGTGGAAATT ACAAACT 

GCAGAAATTCAACTCAGTGCCGCAATAACAGGATGCACCTGTAGATTTCGTAGAATTA^ 

AGAGAAATAACCCTGTTTGGATAGTGCCAACTGGGGCA 

CAGCAACATCTATCCAACCGCCATCCACTTAACAAGCAAAGATtKl^ 
2 0 CTTTTGATGGGATTATCACTTTGACAGATGTGCTTGCAATTAACTGGGG^ 

ACTC^CTCTTGGGGAGCAAAAACCCTTCTTTGTTTGTT^ 

GTTGATATGCACTGAATTGAATTATACTAAAGGGTGCACCCAAGAGTCCATGAGCT 

GGACATTTCAGGTCCATGTGTACAAGATAAAGGAGAGAAAATCCTGGGGGACAC 

CTAACAAGAACATCCAGTACATAAACAGCAGGCTCTCTGCAACCGCT^ 

2 5 TATCTCCTGTTCGGATTTTX^CTCTGAGACCACAGCTCAGCACATCA 

GCTGCTTTCATGTGGTGCCAGCGTATAGTGCAGGGCAGGCCTGATTGCTC 
TTTGAGGGCCCATTTCTTTTCCTCTTTGAGGAGACTGTGAGC^ 

AGG TTCT AGCAT AATTCTTTGATT AATTG CAGTCAACGTTT ATTGATGGATT ATG TG ACAG AGG TG AAATT CTTTGAT CTGGG TTT 
CAGCTTTTCCTTTCCCCCACTCCCTCCTT X TT Cl^ ' X ' T ' ri ' lTl" 

3 0 GAGTGCAGTGGTGCAATCATAGCTCATTGCAGCCTTGACCTCCTGGGGTCAAGCGAACCTCAACOT 

GGACTATAGGCATGCGCCACCACGCTCAGCTTCACCTTTTCCTTTCCTTGTGCTCCC 

AGTCAAGAGGACAGCAACAGGAGGAAAATGGGAGAAACCCCATTATTCGTGGACTAAACAGGCrrTCCCACTAGT 

AGGAAGTCATGCTCCATGTTTCAAATGTCCTCAAAGCTCGTTCAACTCCAAAATGCATGAAGACAAAATGG^ 

TTTTTGTGGGCTGTTGGGGAGAAGAGTTAACATTGCAAAGCATATTGCAAACTCTT 

3 5 AGCTCTTTATTATTTCTCCAGAGCTGCCCTGCTCTTCCCACTCTGGGTCCTTTCT^ 

ATCGGCCAGCATTT CTTTT ATTCTATAGAGCAAGTTTTCTTGCCCTGGG CACCT CTCATTTGCATGTTATACCACCACACCAAGTG 

AGCCTCAGAGCACCAGTGTTTTAATCCCTGAAAACAGTCCTTACGGGATTCAGCACAGAGGTCCTCTTGGCAGCCTC 

GAGTGACACGTCCTCAACAGTCAAAGTCAGCAAGCTATGGTGCCCATCACCAAGTG 

TCTCCTACCTTCTTCCTTATCCCATCCTTCCTCCTTCLTrr'l''i''i'l i' i CTGCATTCAATTATTCATGGTTATTCATTTCATACGGGC 

4 0 ATCTGTGCCACTTGGTTCTACAGGCATTTTTTTTTTTTGAGAC^ 

TCTTGGCTCACTGCAATCTCCACCrCCTAGGTTCAAGCAAGTCTCCTGCCTTAGCCTCCC 

CACCAAG CCCGGCAAATTTTTGT ATTTTTAGTAG AGACAGGGTT^ CTGGTCT CAAACTCCTGACCTC AGG 

GACCGTTAACAAGCCCCAAGGGTAAGTAAAGGAAAATGAAAGGTGATTGA 

4 5 TGCCCCTGCTCCCCAATCTAGACTCTAATAAACAGAAATGATTTTTGTTGCAAGCTGAAAACT 

GGGTGGCAAGTCAAGCTTTAGAGTTGTCTGGATAATAAATGCTGTCTTC 

CAAGTGAAC^CTAACTCCTGTTCACAAATGCTACTGCCTCTCTGCCATCCCCTAACCTCAGCCTCA 

TCCATTTGATGAGCTCAAATCAGTGAATTTTCTCCTCGCCAAAGCTCACAAATT 

TTACCAACTGCATTTTCCCTTTGCCTGATATTCTGGCATCAAGCCTCCTTGCGAT^ 

5 0 TG ATTTC CTTCCTG AAGAGT ATTTCAGCAT CTCTATGCCT CAATCTTCCCTGCACACCACACCAGCTCGGCCTC ATCATCACATCA 

CTTGGCTATGTCCCGCCTGGGTTAGACAGCTTCAGTGGCTGAAGTCCATAGATCTTAT^ 
AATCCAGCTGATATTTCTGCCCTTAGCrrCTACCCCTCTCTACrTCT 

CT ACCAAGGCCGTATCCATCCTTCAAGGACAATACGTGGGCCTTT CCTG AT CACATCAGCTCAACAACTTTTCCCTCCTACATTTC 
AATTGCTCTTCTTACCATAATCATTAGTATTCACCCCACTGTACGTCTAGAAAGAAA 

5 5 AGGTCAGAAATTTGTTGTCCGCTGTTCTGAGCAGTTTCTTCTAGGAAGTACC 

GATGAAGTCAGACTCAGCTTGCACTGTTGACCGGCTGTCCCrcGATGAACCTAGTT 
TGTTCCTTTCTTGAACTTAATCTACCAATGTTATCTAGATAACTTTCCTCAAAAAAAAAA 
AAGAAATCTTTGAACCGGACTTATTGGAAATTACCTCCTTGCAGCAGGTTTGAAACA^ 
GTCTGAAACTGCTTTAGTATATGCTAGTTATATTTGTATCCACATGTGGCTTCATACATAGTC 
60 CACTTTGTTAGGTGACGTAAGTTCACAGATAAACTGGACATAGGGCTATCCTCTAGAGGCCCACAGTTTGACACCTTAACCTG 
TCCTTGGACCTTTGAAGTTCAGAGTGCCCACAACCGCTGGGTGTGGTGGCrCACACTTGTAATCCCAGT^ 
TGGGAGGATCGCTTGAGTCCAGGAGTCTGAGACCAGCCTGGGCAACATGACAAAACCCCATCTCTACAAAAATTA 
AGCCAGGCCTGGTGGTGCATGCCTGTGGTCCCAGTTACTTGAGAGGCTGAAGTGGGAGGATCACT^ 
GCAGTGAGTTGAGATGGCAACACTGCACTCCAGAAACCCCATCTCAAACAAAAC^ 

6 5 CCTGAAACTTCCTACGGATTTATCCAGAGTATGTGCTGATGTGCATTTTTTCCAAGGAGGGAAACCAAAG^ 

AAAAGAGTTTTTACCATTGTGGAAGACAGTATGGCGATTCCT 

TACTG'GGTATATACCCAAAGGAATAGAAATCATTCTACTATAAAGT 

GCAAAATCGAACCAACTGAAATGCCCATCAATGAGACTAGATAAAGAAAATCTG 

TG AAAAGGAATG AG ATCATGTC CTTGG CAGGGACATGGATGAAGCTGGAAG CCATTATCCTCAGCAAACT AACACAGGAAC AGAAA 

7 0 ACCAAATGCCGCATGTTCTCACTCATAACTGGGAGTTGAACAATC 

CCTGCTG CAGGGTGGGGGGTGGGGTGTGAGGGGAGGG AACTT AGAAGACAGGTGAGT AGGTGCAG CAG AC CACCATGGCGCACTTA 
TAC CT ATG T AAC CAAC CTG CACG TT CT ACACATGTAT C CTTGGAACTT AAATT AAAAACAAAAAAG AGTTTTTAATC CAAAA^ 
T AACAG CAA CT AG AAGAAAGAAAG AGAT AT AAACAGTG AATTGCAATGCAG CACGG CAGGTGTT ATGATGGAG T AGAAAGG ACTGG 
AAAAGGCCTCCTGGAGGAAGGGA(^CTCAAGTGTCITTCCATTT CACCT 
7 5 AGTATGAGATCCTCC^CAATCCGTGACAAAATGGGTCT 
GCATCAACTGGAAACCTCAGCATCAGCAAACC^CGACAGAGCGTTCM

TTAACCATGGTC CATCTCAGAACCAAGAGTTGGG CCTCTT ATTTACCAGAAAAATTGTGGGGG CTTTGTGATATGGCTTTAAAAAA 
ATCTTGTAATTGCCAC^CGTGGTGGCTCACACCTGTAATCCCAG 
GTTCGAGACCAGCCTGACCAACATGGTGAAACTCCGTCTCTACTA 
5 ATCCTAGCTACTCAGGAGGCTGACGCAGGAGAATCACTTGAACCTGG^ 

CTCCAAAAAAAAAAAAAAAATACATTAACATAAATTTAAATATTTTATAATGACAAT 

TTTCCAGGAGAGGCAGCAAGTGCATTCTACTCCCATGCCCAAGAAGAAAGG 

TGGAGCACTTGCCCGCAGAGTGAGAAACGTTCCTAGAGAGGAAGTTATACCTC 

ACAAGTTTTTTG AGATGGAAGTCTCACTCTGTCG CCCAG<KrTGGAGTGCAGTGGCG CAATCTCAG CTCACTGCAGCCTG CACCTCC 
1 0 TCGGTTCCAGCTATTCTCTTGTCTCAGCCTC 

TTAGTAGAAATGGGGTTTTACCATGTTGGCCAGACTGGTCTCAAACTCCC 

TGCTGGAATTACAGGCGTGTGCCACTGCGCCTGGCTAATTTTTTT^ 

TCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACCCACCTTGG 

GCCCAGTCAACGTGATGTGTTTTGGAACCCTGAATTCCTTGGCTTGCCCGGAGGGTTTTCTTTT^ 
1 5 CT AGT ATTT AAAAAATTGTGTTTTG CTCT AACT ATGCAATGGCTTT AAGT CTT AG ACAAATTT CCAGGGAGCAAAACACACT CAAC 

CATTT CAT AAT AATCAG AAG AGAGCTCTGAT CAAT AAAT AAGCAAG ACTG AATTTTACAAAAT AAT CC^AAG TTTAAAACCAAAG C 

CCACTTTTTGCATGATCCnTTAAGAGAAAGAAAT 

ACTGTGGTGAAATGATGATGGCTAGTACAGGTTATAAGCCT 

CCTTCTCCATTATTTGAATATTGGAGGCTGCCTGACCAGAATCTTGTCAG^ 
2 0 TCCTGAGGACGTTACAGCCCTGAGGGGAGGACTCAGCTTATGAAGTGCTGGGT^ 

C CTTCAACGGCAGGGGAATCTC CCT CT C CTTTT ATGGGCGT AG CTGAAGAAAGGATTCATAAATG AAGTTCAATCCT 

CCCCAGCCCACACCTCCAGCAATTGAACTTGAAAAAAAAAACCT 

AAAAAAAAAAAAACACTTCCTATATTTGAGATGAGAGAAGAG^ 

AAGAGAGCAACTCCTGACTCCGATAGAGACTGGATGGACCCACAAGGGTC 

2 5 TCCGGCGCTGATGCC^AAAAGAGGCTGACGGCAACTG^ 

GGTCAGGAAGATGGATTCATACCTGCTGATGTGGGGACTGCTGACGTTCA 

GGGTGCCCCCXX5AATTCCGGGAAGGCTGATGGGCATCCCTCTTCCCAGCCACAGAACCAGAGGGAGT 

AGAAGGGAGTTGAATCTTGGGTTCCACCTCnTGCCTGTGACCCACGGGGA 

GAGAGCCTCAAATAGCATTAGGTTCTCCTGTCCTTCrCAGTCCTTGACAAT^ 

3 0 GATATTTGGATGTTAACAGGGTCCCGATGAGCAGGTTTCTCAAGAAATTATCnTTTAT^ 

CACTCC CAC C CCCCAACT AAG AAG CT ATCAGG AAGCT CTTT AAGTTCAGG TT AATTTG CTGG AGACATG AGTACT CAT AT AT CAAA 
CACTGGGAGAATTTAAGATCTCAGAAAAGTTTAGCTTCAAGG^ 
rrCCTTACAAGTGCAACTTTACAGAAATTTTTAAAAATATGCATACT 
ATACATGATATGTATATGTATACGTGTTTATATATGCAAATTTTAAATTTCATTCC^ 

3 5 TTTTTTTTGACACTGTCTCTGTCAGCCAGACTGGAGTGCAGTGGCGCGATATTGG 

GATCCTCCCACCTCAGCCTCCTGAGTAGCTGGGACTACAGGCATACXJCC^ 

GTrTTTGCCATGTTGCCCAAGCTGGTCTTG AACT CCTGGGCTCAAGTG ATCCACTCACC CTGG CCTCCCATACTGCTGGT ACT ACA 
GGCATGAACCACCGTGCCTGGCCAATATTTTTTTTTAA 

AGTGTGAATTTTCTAATAATGTCTTCCCTTTGCTTACTCAGGAAGACAGATAAGAAAGTTTATAGTGT^ 

4 0 GGCAACAAGTAGAATGCTAACTGGCrGTAGAAAGAATTTATTAAGAAATGAAACATAATGATGCATTTC^ 

TTTAGGGGAAGTGCCCAGTCATGCTGGGCTGTGTACTGGCCCAGATGTAGGAGT^ 
CATTTTTATTTATTTTTTTTTTACTTTTTGAGACAGAGTCTCA 

CTG CAACCTCCACCTCATGGGCTCAAGT AATTCTCGTGCCTC AGACTC CTGAGTAGCT CACCACCACGA 
CCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCA^ 

4 5 GCCATGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCATCGCGCCTGACCTATGTGTTTTAT^ 

GGTGGTGTTTGGTTACATGAATAAGTTCTTTAGTGGTGATTTGTAAGAT^ 
CAATGCGTTGTAAGATTTTGGTGC&CCCATCACCTGAG 

CACCCTTTCCCCTGAGTCCTCAGAGTCCATCGTGCCGTTCTTATAAAGAGGGCATACTTTTAGGAAG^ 

ACAT CCACAGGGT CTTTG CT AACT CAGT CT AAG C CAAG AACAGG AAAATCTTGACAATT AAT CATG CT CTGC CTCTGGAAGACACA 

5 0 TTTCAAGTAAACTCGTGTTATTTCTAGCTATTGGTGACTT 

CCATAAGGACTCCTTCACGCCATTCTCAACATCTATCCATACCCCATCTGCCCCT 

ATATAGTCCTGACACTTAAGCAACTTATAACCTGGTGAGGAGGAGAAAAACATAGATATAAATATAGTAATACCAGTCAGAGTGTG 
GTAAOlATCTACGGAGCGTTGGGCTATCAGAGAAGGCrTCATGA 

TQAATGAGATTCCAGC^AGAGAGACCCCAATGAGCCAAAACACAGGAGTGCCTTAGGG 

5 5 CCTGGACAGGGAGG<3TGK3GGATATGGGGAATTCAAACrGGAAAGTTAGGATG 

TTAGTCTGTATATTGCTGGGCCCTGGGATGTGGTCAGAAAGAGAATTCTCCTAGTGATAACTTCT^ 
AAAGAACTACAAGAAAATTTAAAACTACCATTTATAAAGTAACTTCTAT^ 

CTTTGCAGT ATGG AT AT CATG AAC CC CT AG ATGAAG CAAG AACC CAAAACTCAGACTCG CACAGG GAG TGAGAGTG CCAGGGTTTG 
AACCTGGTCTCCCTGGATTGTTCTGTTTCTGTTCATTGCAAACAGGGTC^ 

6 0 GAGATCAGGTGCACAGAGACAGGTCATGAGGCTCTTAAAATAGTCCAATCACAA 

ACAGGGGTCAAGTTT CTGG AAT CAACAGGAATTTCTAGGCATCACTC 

GAGGAACCTAGTCTTGAAACAAATCAGACTTTTTATTCCTCAGGGTAAGAGTTCTCTAATGA 

TTCTAGCTGATTCATAAGGGCAGATTTTCATTCTGAAAATTTCCAAAGCC^ 

AGTCAACTTGAGTGTTTTTAGTCTCTATATTACTGGGCTCTGfGGATG^ 

6 5 ACGTTATTGAAGGCACCTTTCTCAGAACTGTGGAAATTCTGAGAGCIT^ 

CTGTGATCAAAT CTTT ATC CTCT AGACTCAAAAATAATT CTTT AAAAAA^ 

ATTATCCATTCTTGTAAAGACAACTGCCCACAATGGAATTTCCCA 

ACATTTAATCTTTCAACTTCTTGAGATCCCAAAG 

AGGG CTTTCCTAAACATTGGGAGCTCTTCATAAGACAGTTG CTACTTTCCTTAAAACAGCATGCCTTAAATAGGCAGTTTGATAAA 

7 0 TCACCTGTGCATATACTTCATTTTAGATGTCAGAATTGAAAAAGAAAA 

CACGAATGCGTCAGTTAGAGGTGCTAATATAGAACGGTTGCTGAGCT 
AGTATTCCCAGACTGGTTTTTGCCTAAAGCCTTGGGTATTCTC 
GGTCTCACTCTGTCACTCAGGCTGGAGTGCAGTGGCGAGATATTGGCTCA 
TCCCTCAGCC7CCCCAAGTAGCTCC 
7 5 CACCAAGTTGGCCAGGCTCGTCrCGAACTCCTGACCTCAGGTGA 
GAGCCACCGCACCCAGTCCTCTTTGTGA C TT 1 TTTG

TCAACGTGGCAGCTGTTCCCACCCTTCCCTGCTCCTCTTCTTGCCCAAGGATGAGCA 
TCAGGAAATGAGGCATGTTGCTTGAGGTCACAG^^ 

GCACTGAGGAGAGGACACGGGGAGGGCGTGAGTTTTTGACAGATTTCTCTCT^ 
5 TCCCTTCCAGrrCCTTOVCTCTATCTCAAGGATTCCAGAAAGGAAA 

ACTITGGGATGAAAAAGTTTC CCAAATG AAG ACAT ACTGG CACX5GTCAGGGTTGAG CGACTTTTGTGT ATCAATGCAC C CAAGC CT 

GATGTCTCATTGTGGGTTATTTCCAGCTCTGAAGAAGTACTGCCT 

GCAGGTGTCAACGCAAAAACTCCACTCACTCAATTCTCAAAATCrrATCCAATATCT 

GTAAAAATGAGCGACATAGTCCTGATCCTTAAGAGAGTCATAAGTTGGT 
1 0 AAGTCATATGTGGTAAGATCTACTGAGCATGGGGCTATCAGAGAAAGCTTC^ 

GGT AAATTTCCATCAGTG ACATTCCAGGGJVGAGAGGCCCCAAGACACAGGGG CAGCAGGTGGTCCACTGTGCTCCTGCC CTGGACA 

GGGAGGGCGGGGTCCTCTGCTGTTTGTTTGCTTGTGTAGGAATTTTCCCCT 

ACATGAGTGGGAACCTGTTTCTCTTCTCTATCCCAGGCCATTCCCTGAGAGAAGA^ 

TTCGCTCTGTCGCCCAGGCTAGAGCACAGTGGTGCAATCTCAACTCACT 
1 5 CTCTGTCTCCTGAGTAGCTGGGATTACAGGTGTGCACCACCATGCCTGGCTCCT 

CTATGTTGCGTAGGCTGGTCTTGAACTCOSAGCCTCAAGCCATCCATCTGCATGG^ 

GCCACTGTGCCCAGCTGAAAAGAGAATCTTAATTCAGTTGATTACTTCTAA 

TAGAAGCTGAACTTGAATCCAGCCCTCTrAAATTTCAGTTCAATATTCCCT 

TATTCCTTCATCCATCCATTCATGCATGAGGATTACAGAGATGAAGAAAGTTCCAGCTCTTATCCT 
2 0 CGGGAGATGAAACTGCTGCTGAATGGCTCCCATTCTGAAGTCAGCCAACAGTGTTTGCCAATAGCCCCCACTC 

ATT CT CAAGT AACT CACTCATTTAGGGG AAAACAAT AAT AT AATTGT CTCTAG AAATTTT AGGTO C 

AGTGTTTTGACTGATCTCAAGGCGACTCCGTGGTTTGCTTCTGTC^ 

AATGGTTCAAAACGCCAGCTGTACATTAGTGTATATCCAGGATGCTT^ 

G ACTCTCTGGGG AACAG AACCGAACAT CACAAT AC CTTCCCTT CCAGTTC CTTG AAT ACTTC CAAATCG C^ 

2 5 CACCAAATTAGAGAGATGGAATAATGATTTCCATGGTItjGTCT 

TTAATGCATTTGGAAATCATCCAGGGATCTTGTCACGGAGCAAATTCTGACT 
GCAAACAGGTTCCACAGGAGGCCTATGTTGTGGGTGCATGAACCCCATTTTC 

CATTG CTTTT AC CCACACAG AACTTGCGACTGAAATG AG AAAAT AGG CTG AG TG TG ATAG CT CATG CCTG T AAT ACCTG CACTTTG 
GGAGGCCAAGGCAGGAGGATCCCATGAGCCCAGGAGTTCAAAACCAGCCTGGGCAACATAGTG 

3 0 TT AGAAAAAGGG AAAACCAATCTGGGAAAGATCAAAT ATTG AT CAAACATTG CAAAT AATG C CAT AAATG AT CAG AAT AGGGAG AG 

TAACATGCTACCTGGAGAACTAGAAAAAAAATTCTCAGAGAAGGTGGCCCTGGGACCT 
C^GGAGAGGGGCTGGAGAATGGTGTTTTCTAGTGATGCTC 

AATT AGAGAG ATGAGGGGTTTGCC^GGACAACTACATAATGAGCCATATTTCAACAT ACATG CAGTTTTGTGTGTGTGTGC CAAGC 
AAATGAGTCTCAGATTCTCGGACAAAAACGAATACTTTTGCTC^ 

3 5 CAGTGTCTACAGGTGTCATTGGTCCCCXKSTAGACGTCCTCGGTTCAATGG^ 

GGATGGCAGCCAAAGAGTTCCTGGGTCTTGTTTCCCTACCTCCAC^TTTTTTTCCTGACCCAGCC 
GGCTGGAGTCCCAGTGCTTTTGTCCTGACAGCCCATTCT 

CTCCGCCTG CAGCCCTCG AC CCGG ATCCAGGCATCCTGCTT AAGG AGGGGTT CAGGTGGCAGGAAACTCCAGGGCAACAAAGTGTG 
GGTGGCATGGGTGGGGTTGACCGGAAGGTCACTGTGCTGGGAGGAAGAGGA^ 

4 0 ATTCACATCCTGTGCCAGTAGAGGGGCCTtSCCCCATGG^ 

CAGG CTTTTTC CTCCCACGCTGTATGCAGTGCAAGAACAAG CTGGTG CAATTGG ACT AGC^GCJ^ATTGAGTCCTTTTTACCCAAAAG 
AAGAAGAAAGAGCAATGTTTCAAAAGAATAAAAAGGAAGCTGC^ 

GGGCCAAGAGGGGAGGATCCCTGGAGCCCAGGAGTTTGAGACCAGCCTGGGCAACATAGTGAG 
GATAATTAATTTTAAATAAGGAAGTTTGTGAAAGCCA1TGGTTTCTTTATTCTAAAGAGCGTGGA 

4 5 GCCCTGATTTC CTTTGGTAAGAAAATGAGCAAAG CT AT AG A CT AGGAAAC ATGACTGGTTAAAG CCGG C CCATGCAC CAACT CCGA 

TACTGATAGTTATGTTTCCTTTCGTTCTTATTTGCTTCAAGCTT^ 

G AAT ATCCATTG CCT AAATG CT AT CTTTCAGGT ATT ATG TT CAG ACAT AG T ATT ACTTTG GTT AATTCT CAAAAT AAAG T AGGTCA 
TATG ATT ATCCCCATTTT'CTG CCTGGTGAACTGTGGTT AAATGGG CTTACAGAG ATT AAATGGGTTGCCTG ATGTAGCCCAACGAG 
GAAGGGGCTAGGATAGGAAGCAGGCGCTCTCATTCCAGAACCTGCCT'ITTTTCCTCCT 

5 0 AAAGGTGCCATGTGAGCAGGGAAGTTATTTTTGTTTT^ 

CCCCAGGTACATGCAGTGAGGCGGTACCCACAGAGCCGCGTGGCACTCCTCAACCT 
TTAAGGACCACCTCGTCCTTGGGGTCTGGGTTCTCCGGGTCCTGCTGTTCCCT 
GGCATCITTAGAGAAAGCGGCAGTGTCGCACCTGCCCCAG 
CCATACCACATTCAGGAATTGCCACCTCTGCCCZACAGAATCACCCACT 

5 5 AGCCGCTGCTTTCTGTACTCTCAGCAATAAAAAAACAAAGGCTAGGATAGGTGTGTC 

CTCCCCTGGGAAGGTTT'TCAGAGAGAAGTTGTTGCTACAGCAGTACAG^ 
GTCAGGGAGATTGGAGACAACCTCTTGTGTGGTAGGACTTCCC^CTATTGGTTTGT^ 
AATGTTCATTGAGGAACTGCTATGTGTCAGGCACAGGGTTTGAGACACAGAGATGAATA^ 
CAATTATGACACAGGCTAAGTGCTGAGATACAAGTTTAAAGAAGGAG 

6 0 AACCCAGCA.CATCTCACATCAGGGACTGCCTCATTCAC^ 

TCAACTACTAAAATTATTAAAACCAAACACCACTAATTTTGAACTTCAGGTAGACAGAC^ 
ATTCCTTTACAGAGAAAATGGGAACATTTCAAGTGGGAGTATTTCAATT^ 

CCAAGTATTCTAGAAGCACCTTTCATAATGGACAGCTCTG AC CAACTGCCCCACACACATGCATGTG AACCGTTCTC CAATCCTAA 
AAGTCTGTGACTTCTCCAAGGAAAGGAGGAAGGAATCGTATTAGTCCAGTTTACATTATTATAT 
65 TTTCCTTAAAAAATTCTATATATATATATATATATATATATATATATATATATATATATATAGCTGGGCATC 
TAATCCCAGCACTTTGGGAGGCTGAGGCGGGCTVGATCACCTGAGGTCAGGAGTTCG^ 
TCTCTACTAAAAATACAAAAGTTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTCAGG^ 
CCCTGAACCTGGAGGCTGAGGTTGCAGTGAGCCGAGATTGAGCCACTGCACTCCAGCCTGGGTGA 
AAAAAAAAAGAAAAAAAAAAATATATATATATATATTTGTGTGTATTTTCT^ 

7 0 ATCTTTTCAGAGAAAGATCAGTTTATTACGTCACCAAGTAAA 

GTTAAAGTTGACGTCAGCCTCTTCCTGCCCTGGTGCCCCGAGAGTTTCCCGGGAGTTTTGGGGATGACACCCGG 
CATCTCGCTCGCTGGGGGCAGGCCTGGTGGCTTCCCCGACAATGAAACCAOT 
CTCTGCTTCCGGTCCTTGTGGTAGCACCTCTGTTTACAGCCTGATTTAGAACAAAAA 
ATTAAATGGAAAGCAAAACAAACAAT-TACCAAAAAAA 
7 5 ATCTGTCTGACGCCCAGGTGGCGCTGAGACCACAG^ 
TTTCACTACCTTTCACCCACCGGACCATCAGTTTCACT^

AATCTAjGTGCAAGAGGACCTGTAGCCAGGGCACCCATCCTCCTCCTCCTCGTCCTG 

G C CAGGG AGGCCTTG CTGCGTG CT CTG CTTCTG TGAT CTT AGAG ATTC CT CATGG CCACCAT AGGAG ATGGGG ACTGTT AG CACT A 
TTCCCCCACCCCAACCACCTTTTTTTTGAGACAGAGTCTCACT 
5 GCAACCTCTGCCTCCCGGGTTCAAGAGATTCTCATGCCTCAGCCTCCCC^ 
TGACTAATTTTTGTATTTTTAGTAGAGACAG^ 

CACCAGTCTCAGCCTCCCAAAGTGCTGGGATTAC7iGGCTTGAGCCACCACX3CCCTG 
TGAGACCATGAGGCAGTAATTC^CTCTCCCAAGAAAGCAAGGAGCCAGTAGATCT 
GAAGCAAGCTGTCAACAGGCACTGTTCATCTCCreGGCTTCCTAGGCAGGC 
1 0 TCCCCAAGTTAATCTCCCCTGAAAGCACCTGTCGTGATGCCCTTTCGGCTGCAAG 

CITTGCTACCTCCTCTGCCTGGCACACAACATGCTCCCCAGCGAAGACTG1TCCCTCAGCA 
TCCTCCTGTCCTTCAAAGGGAACCTCCTCATTTGTGATTTCAA^ 

TTCCCCATCTTGTCTGCCTACCGGAATTTTTCCCAACCAGAAAAGCGTCTTGAGAGAATGGAAAA 

G ACAG CTTG AATT CAAAT C CT AGCTTGAACTTG TGAATTTCT ACTTT C CTTT ACG AAAACCTTTGCATGC CATTCTG TG CAGGCAC 
1 5 TAGCAAG^TGGTGGGAGGGAGATGGATACAGTAGGGTGAGTGCCGTGTAAAGAAAAGGGAGCAAAGTCT 
CCACAAAGCACAGACAGAAGAGTCCCGGAGGCTTCCCAGAGAAAGATGAC^ 

G AAGGTAG AAATG CATTAGGGAAGTGTGGCATGAG CATGATGGT CAATTT CAATGTGT C CACTT GC CT GGGCT AAGGG ATG CC CAG 
ATAGCTGGCTAAACAGTCTCTCTGGGGGTGTCTGTGAGGGTATTTC 

2 0 AGGCAGAGGAAGGTTGCATTCTCTCTGTTCAAGCTGC^ACATCCG 

TTGGACTTCTGGGACTrACAGCAGGG CCCCTCTG CTCCTCAGGCCTTC!AGTCTT AGACTGAG CCACACCACCAGCTTTCCTAGTT AT 
CCAGCTCCAGCTGGCATATTGTGGGACTTCTCAGCCTCCATAATTGAGTGAGCCA^ 
ACATACATCCTATTC^TTTTGTTTCTCTAGAGAAATTCTGTTTC 
TGAGCTCCAAGAGCCACGCAGAGTCCCGCAAGGCAGGAGAGCTCAG^ 

2 5 TGTGTCTTGGAGTCTCCCCATGTCACTGGGCTTAAACTGTGT^ 

GGCTTGTACTTTGGAGAGACAGCATTGGGTTTATGTGAAGAATGA^ 

GAAGCAGAAAGGCAAACCCATAGGGCATTGAGTAACCTCCATGGAAGTCCATGAGGGTA 

TCTGTTCGTATAAAATGGGAATAACACAGACCCTGCTGGGTGCT 

ACAGT AGGTCCTCACT AAATGTGACTTGCnK^CTTCT AGTCTG CATGGCCAAAGGCCCTTCCCTCGGAC CTTTCCGTGTGGATGC 

3 0 CTCACAGGCTGCTCAGTCTACACATCCAGAACTGTCCTCCCTCCCTGGAACCTACTCT 

GGCACCCCATCCTCCCCATCGCCCAGGAAATCTC^GCAAAATCCTCATTTCT^ 
CTCAATTTTTGGCAATTCCAACrrCTAAAATCTCTTAACGCCAGCAACCCCT 
TCCCATGACAGCCTTCCTGTGATGCATGACGGGCACATGACAAGGGCAGTTGAGGGTCAGGGGC 
CACATGAGGGAGGATAGGCC<3GAGGACAGGCAGACACTTGTGCGTAAGAGCACGCACT 

AAGAACACATCACCTCATAATTAATTACTGCATAAGTAATAATTTTAATTAT^ 

TTTATTATATATATTTATCATATAATATACTAATTATATTAATGAGTAT^ 

TATATATCTGTATAATTATTTTATCATATAAGATAATTTCATAAGCTAAGATGGTATTTTATA 

ATTTATAT ACTTT ATTGTATTTGAATTGTG AT G T CT AAT AAAAT AAT AG AT ATGTT AATTATC CTTCAT ATT CAT AATAAG CTCCT 

TGATTGCTGTTTTTGGTGGCCCGGAGGCACAGGGTGCATGCTACAGGCAC^AAGAGCTT^ 
TCTGGTGCCX3TGGCTTCTGAGGCCCCCAGGTGACAACCAGCCTTTGCCA 
TGTGGCACAGGCCACACTCCTGACCCCACCAGCATCAGGTGCrGTGC^CATGGTATGAGGT 
GGTTGTTCCTTGGGCTACCrrGTGTCCACATGGTCrGATTAACCCCTTCr^ 

4 5 GGCATGCTG CAGGGCATGCAGGGATTTG AGGGTCCTTGGACAATG CATGTTCTCCATGC AAGCATGCAGAGACATGGAGAGGGTCTT 

TGTCCCTTTCAGTCCTAACATTCTGTGGCTTCATAACTACT^^ 

GAGGAAGTCTTTAAACCTTCCAGTAGGACATGGGATCCCTGCCTGCATCr^ 

GAATTCAGCTCTCCAAGAAGCCCAGTGTGAATGCCCATTGCAG^ 

AAAGAATAGAACCCACCCACTGAAACTATCAGAGATCAAATGTTGTGTGTTGACTAAGAA 

5 0 CACCAATGCTTTCAAGGAACTCTraiAGAAATGCAAGCCTC 

GGCATTTTGACCCACTCACAGATGAATACAAAGACTAAACTCAGCAAAAGCCTTGA 
TGGAGCACAG^CATGGGCITTTGGTACCACGTAACCTACAACCGTAAGCTCTCTGCTC 
TGCCTGAAATGTCATTGTCTTACCCGATGTCTGGGGTAGCATCTGTGAACACACGGATTTGACACT 
TACTGAACCCTTCTTAAGCAAAATCACAGTAGACTAGCAGATCATGGTAGTA 

5 5 TGCTCCCCACTTCCC^TGGTTTTCTCCCTGTGTTTCCACAT 

ACACAGTGT ATGC AATGTGTGTCTTGTG T AAAACGGAGTGTG AATT CTTGAG^ 

GCGAGATGGGTGAGGAGCAGGGCAGAGTGAGCCTGGCTGCTGGGAACCCCTGAGGAGAGCTCT 

CTTGGTCAGCCTGACAATTAGGATTTGTGTTAACAC^ACCGGCCCTCACCTG 

TT AG CATCATG ACTTTGG AGG TTG CCAT CATTTTTG ATGACTTTTT CTTTTT CCTTCTTTT CTGTTT CCCT CTTGT CAAGTTTCT A 

6 0 CCTGTAAGTTATACTTACTTATTTTGTGTTTCTTTACAGTCTCAGGGGAAATTCCGTTGAGTTGAAAAT 

CCAGTTTCTCAGAAGAGTCTCTCTCCITGCAGAAAAGACTGTGGGTGGGA 

TCCATCCCAAGAAATTGGTTTTAAAGATTTGAGGAAAAACTGCTTTTAGATTTCAGAGAA^ 

CCTTTTAAATCTGATAACTTTTATTTTAATCAGGTTGACATTCT 

TGTCG ACCGAGAAGACTCCCAGGATTAGAACATAACCAG CGCCCAAC ATTTCATTTG CAGAAGCATCTG AGGTCCC AAAATGTG AT 

6 5 GGTTTAGCGGCAGCTGCTTGTAGGGGTATTTTTG^TTTCCTGCT 

CaVCCCCACCACCCCATCATTTCGTTGTTGTTTAACAAGTGTTGA 
TTACrCATTTTAGGGTTACGGAAACTCrGACTATTTGACCCAGTCAACCAGGCCT 
ATGGTGCTTCTGTGCTAATTACTGCAGAGATGTGTCCAAGTGCTCGGAGATCTCCTCCT^ 
CACCGCCTTCTTCTCCTGTCCCGAGGGAATCAGGCAACTTTTCTTTGATGAC^ 

7 0 GGCACTTTACTTCTCCCACATTTCCTTCAACATITGTGCTTGCTGAATGAAACAAGGA 

CAATGGGTAACAGCACCAGTGCTAATGCTAATACACTCAACTACTCAT^ 

TGGGTGCTGATCCAGCAAAGAGTGGGTGGAGGGGGAGTGGGAGGTGAAGTCCCTATACC^ 

TTTTCCrCTGGTCACATGCATACATAAATTITATATCT 

CT CAACCC CCGAGG CAGGGCT C AGACATL\iGACAAAATTG AGGACT AC CAAAAAC B .GGATTGGGG C A AG CAG CTTTCCAT AAGA 
7 5 CATGCTCATGAGTGTG CCATGTCAGTTTACCATTGCCATGGCAAC!ACCCAGG CGTTCCTGCTCCTTTCCATAGCAGTGACCGGATG 
ACCCAAAAGTT ACT ACCCCTTC C CT AG AAATTT CTG CAAACTG C CCCTT AAACTGC ATGT AATT AAAAG CAG AT AT AAAT ATGAAT
GCAAAACTGCCCTGACCTACTACTCTCTGCCTATGTGGTCGCCCTGCTCT 

T AT AACATTGCCACTTCAAT AAAG CTGTTTTCTTTT ATTCT ACCACTGG CTTGCCC CTGAATT CCTT^ CAAAGCCAAG AAC 
C CTTGCGGG CT AAG C CC CACCTCGGGG CTCG CTTGT CCTGCATCACT ACCACCTTCTCT CTAGG CTT AATCATTG 

5 TGTTGAATTCAGCTGCTGAGGACTGAGAGGTGATTTGGATGATTGACT^ 
GACTTGCCACTGAGTTGGGGGAGAGGGGGGTTTCTTTCTCT 
ATCACnTCCTTTGGAAGAAGCTTTGGTAAAGAATTAACATT^ 
AATTATTAAAATTTCAACCAATACAGAAAAACACAAAGATTA^ 
CAGTTTACCTTGGTTGTTTCATGTCCTTTTTGTTTATTTTA^ 
1 0 GCACCTTGGGAGGCCGAGGCAGGTAGATCACCTGAG^TCAGGAGTTCAAGACT 

TAAAAATACAAAAATTAGCCTGATGTGATGCAGGCGCTTGTAATCCCAGCTTCTTG 
TTGGAGGCAGAGGTTGCAGTGAGCCCAGATCTCGCCATTGCACTGC^ 
AAAACAAACACAAACAAAAACAAATAAATACATGCTGTAGATACA^ 
TCCAGTAATTGTTCTCTTGAATCTGGGCTGTATCTCATGATGCAATATACACT 

1 $ TAATATACTGTTTCTATTTGCAACTTTGCTTTTTTCACT 

CTGTACATTATTACATTGTATAATAATACAGAGTTCTTGATAATGCCAAGTTT 
TTGTATCTGATTTTTCCCTATTAGAAGCAATGCTCTAATAAACAGCC^ 
TACCTAGAAGTGAGATCACTCATCCTAGAACATGGACAGCTTCA^ 
GCTAArTTATAATCCTCATCAGCAATTTATGAAATTTCCTGAAT^ 

2 0 AACATTTTTTGCCAATCTGATAATATAAAATAAAATCTCATTACT 

CCACATGTTTTTTTGGTCATCTCTGTGX3CCTCCTTGTT 
GATTTGTAAGTCTTCCTTATGTATTACGGATACTGAGTCTTT 
TATAAATGACAATTTTATTTTTTGTTATCCACTTCITATTCCTCT 
ATACAGTATAGAATAAAAGCAGGAATTGTATATGTTTCCACC^ 
2 5 GTAAGCTTTCAGTGTCTACTGTCTATTGGTTAAATAAGTTTCTTTT^ 
AATGGATATTCTTTTCATCATCATTAGATGCTCrrCTTC^ 
TATTTAAGTACATTGAGAGATTTTCTAACGTTAAACCATTCACTATTGC 
TTTTTTTTT T TTTTTC CTTAATACATTGCTAGCATC 

TTTGTAATTCTTCACCTTATCAACCATATGTATTGTTGAGTGTATAC^TGTTCAAGATTAT 

3 0 TGTGAAAATGCCTTTTTGATCTAGTTTCTCTTATGTTATTGTAGCTA^ 

TATTGTTGAGTGTTTACAGC^TCAAGATTATTATAACTTCCACAGTGATAT^ 

CTGTTACTAAAGCTATCGAGCTTTCTTTTGGTTGCrAAATATCT 

GTCACTGTATTTTAAAGACGTCTCTTGACATAACAAAAATAGAC 

ATACCTCTGTCTTTT ACAAGCAAGCTTAATTGATTTATATTT ATTG TACT ACATTTGGACTT ATTT ATG CTATTTTACTTTG AACT 

3 5 TTTATTAAC CATTATTTTT CTTTTACTG TTGT ATTT ATCTTTTCTTGT CTT CTCT 

TTCCTTCTACTGGTTTGTGTATTATATGTTTTAATTCTACTTTTAGAATTAAGTCACCATTGGC 

T AATCCCAG CATTTTGGG AGG CCG AGG CGGGTGGATC^CG AGGTCAGGAGATTGAGACCATCCTGGCT AACACIAGTG AAACCCCGT 
CTCTACTAAAAATACAAAAAAAlTAGCrGGGCGTGATCXSTGGGCGCCTGO 

TG AACCCGAGAGG CGG AG CTTGCAGTG AGCC AAG ATTGC ACCACTGCACTCCAGCCTGGGAG ACAAAGC 

4 0 AAAAAAAAAAAAAAAGAATTAACTCACCCTTTAAGTTCTCAAAGTACTCGTC 

CACTTCCAGGATCTTCTGTTCAGAACTTAGCACATTTCTGGCAACTAAGG 
TTTTTGGATTGGTTGTTGCTGATATATAAAAAATGTTGAATTTTATAAATGAATTT 
AGTTCTAATACTTTAACCGTCTTGGGTTTTCTAAGTACTTAATTAAATTCCTC^ 
CAACAATTGTATGCCTTATCTATGTTACCAGTTTTATCAGATCACTAATAACGAG 

4 5 ACCCAAAGATGCCATTAAGTCCTTGAGTGTCTTTGGGGAAGAAAAGGTAGGTT^ 

CTAAGACAATACCATGTATCGCAAATTTAGGTGTGATTTAATGATGGCrrA 

TATATAAGGTAGTTTAAATAAATAACAGGTAATGTGGGCAGGCAACAACCATCA 

GTGATTTGGTTGGCAGCAATCAGAGAATTGACTATGCTTCAGTAGATTGAAATCCGTTATAATCT 

AGG AAACAACCG AAT ACAGCATT CTCATAAAT AG TTCTT AG AAT ATGAATTCAAAA CT C G TTGTG ACACAATC 

5 0 AAG CTTTAAGCAATG AACGCTTT AATT ATTGTTTTT CAG CTT AG AAAG T AACT 

CG CT CAAGAG GT CATTGT AGCAACAATCAAAATTTGTGG AG CATTTT ACAG ATTTCAATGT CACTT ATG TT ATTT CAACTGAT C CT 
CATATTTCCATCCTTCATACCCTCACGATTCATGCTTCAGCAAAACATTCTACCTGTAGTCTCTTG 

TGTTTT AC^CTTTT AC AC ATTCTCTGCCAGAATGG C CTG ACGTGC CT AACCTG C CAG TTGAACTCCCACCCACCTl'TCAGG CAC CT 
TATAGTTAAGAGCAGCCTTCATTCTGTTGCCTGGCTCATGGGCAGGGGCAGGA 

5 5 AAAGGACATATCAGCATGGTAAAAGGAAATGGCTTGGGGTGTATGAGAAGCTCCTATG 

TTTCGTGTTACAGACGGAGCATCTGAAATGAGAAGGGACC^CAGTGAGCCAGTGAGT 

GCTTTACCTTCAACATTTCCTTCTGTTTATTCCTTATCT 

TTGCCTTCAACTCCTCTACTGCCTTGTAAATGTCCCATGTC 

TT AATTCTGG TT ATCACATTTGATTG TTCTT ACTG ATTTTCT^TCTGTC CT AT CCAT CAT AAAGGAAGACACTGTT AAAATTT CTC 

6 0 ATTATGATAGTAGATTTCTAAATTTATCCITGTTTTGTCAATATTTGCTTTGT^ 

GTACGTAAGTGTTATGTTTTCCTGATTCTTGATTTGTATATTTTGAA 

TTTTCCTGGTGAATTGAATCTTTAGCAATACGTACTGACTTCTGTATTTCTATTAATGCTTCTGC 

AACTCCTTGGATTCAATTGTCCATTCTTTTAGTTTCAAACTTTTGGAGCATTT^ 

AACTGCATATTTTTAAAAATATATGCAGTCCAACATTCTGTAACTGCAGGGTTAATCTATCC^ 

6 5 GTTTGATGAATTTCCACTGTCTTATTTTGTTCTTTATTTTTTAC 

GTTTTTATCCTATTTTATTTTCTTCCAATAGCTTGTAAGTTATTC^ 
CATGAATCCAGACTTTACAAAGCCTAGATTTAACCAATATCATAGCCT^ 

ccacaaaccttttctacacagttccctcctcttcctccttgcttctttgtgcgcattta^ 
aagtaaaagtgagattttttagaaggctgacatgagggcagcaatagtagactttcacgtgactcctgt 

7 o tgtggtttagccaacttacttaatctctctgaacct 

atggtgagcattgaaggtaatatatgcaaagaagcagcacataggcac^ct 
aaatggagc^cctgggccctctc^cttcttcaaagataataa 
gatagtttagacatagctccacctcittgtaagagataaccttcaacia 
tctgggtgttggaaaagcactgaac-taggagtcagg 
7 5 accacaggcccagtgttcttatctataaaatgtagaggttggcctaaatgatc^ 
TACCATTAAAAGGGATGTTCTACATCCCTTTTGCACCCAA

AGGAAGCGAGAGACAACCTTCAATATTAGGCATATCTTTGGAGTTCTTATTAAACAG 

GGG AGG AATTCTTTTCATTT CATATT ACAGACAG AGCATTTG AAATGAGAAG GGG C CACAGGGGGG CAGTG AG CGG AACAAG AGG A 
5 AGATTCGGGTCTCTGACACCCAGAATCTATCTGATTCTGACACAGAAGATGAAC 

CCAGTGG CC CAAGCTCTTG ATGCGT ACCTGTGTC CCTTG T AAAG CTCAGTG AAAT CATCACCT AG ACATTTCCTGTGCTCCGTG CG 
TTATTCTAATCTrCACTGGGTACAAGTAGAGCrTAAAGTTCTGATATATG 
GGCTGGCTGGGCTAAGAGTAAAACAACTTCACCTGGATTCACCCAAGGGGCAGAGAATCTTAT 
TCAGCCCCCTTTTCCCTTTATCTTTCA^ 
1 0 AAAAAAAT AACG ACAAAAAG TT ACTCAACCGACAAACGTTTCT CAC C CIACACACCTTTC CTC CAG TCAATG ACGTTTCCACTGCAC 
CCTGTTAGACG CTGAGTGAGGAAGCATCTGGGCACAAACCGCTGTGGATG CAGATTGAGCAGCACTC ACTGTGCACTGGAGGGATG 
GGAGTCCTGTCCTCACCAAGGGGGCAGATGGAGAGGGGGCAGAGAGAGCA 

AGTGCGGCCTGGGGCTCTGCAGCCCCAGCATCATGTAGGAGCTTGTTAGAAATAGCCTCAATGTCCACACCTGCCCT 
CAGTCrGCAAGATCCCGGGGGCTCTGCACACACGCTGCTGCGAACAACCTCAG 
1 5 GGATATCCTTCTTCTTCAGCAGTGGTGTCCGCTCTCTAAAA 

CTCAAGGGACACAAAAATAGCTCAGGGAAAAATTAATTTTCATTCTACTCCA^ 
AGCTTTCAGATTATGGGAGTGCAGCTGCGGGTATGTGGCAAAATCGAAGAAAGAGGGCTCATAATT 

TGGC CTG CCCAGGCTGCTGTTT ATCTTT ACTG TG AATGGGG CCT CTTT i T TG TTTGTTTG TTTTG TTTTGTTTTCACTTAACCTCA 
GTGGCAAGAAAAATGGGGCCTCTTGCCAGGAGGAGCAGAGAGTGACGCGGAAGGCA^ 

GAAAGGAGAGAGCAGGTGCCTGGGAGTCAGAAAAGCTGGCTTG 
GGTCATCTGTGGCAGGAGGACTACAAATGCCTGGCTTTTCCCTTTCTAGCGGGCACA 
AATCCATTGAAGGGGAAACCCCACAATGCAAAGAGAAGCATTGTCGAGGGCCCTT 
CCATGATGCATTCCAGAGGAGGTGGCTCTCCGTCACTGTCACTTGCGArrGCCAAGGCGTCAGCCAG 

2 5 GGGGTGTTACCCTTCAGCTGCATACAGAACTCCTAGAGGCAACAAATCCACAAGGAAAGA 

ACAATGATGGAGGTGAGGTGGAAAAGGACGAGGAACAAGTGGGATTCTGCCCATGTCTGTCTAATGAGCATCCA 

ACGGAAGATGTGAAACACGCTCAACACTAGAAGAACTCX3G 

TTCAGGATTAATGATCACTGCTGGCTGTCTCrTTTTCCCTCT 

CG CAG AGC CCGTGG TTCAGGATTCT CCTCCTGGGAGCC C CG ATTTCTTTT CTTTT CTCACCTT AACAGTGT CCAGAGT ACTTTT AT 

3 0 TGCCAGAATAGAGATATAGCCTGCTCAGAGCCTCATGTGCGGCCC^ 

TGCAACAGAGAAAATCCAGCCCACTGAAACCGTGTGCAAGAGCAGCrrCCGTCCrc 

AGGGCAAACAGAAAGTCCCCTAGCX3GGGGCCCAAGGAACCCCTGGCT 
GACTAAATACCACTCAGTTGCCTGGTGGGAGTAGGTGCTGGGCA.GGGTGGAC^ 

3 5 CCTTTGTGTGAGACCCTAGTGCAGGGTGGGGAGCCCCAATTTCTAAGCCITGAGTGTTTACCACCCTAC 

C CGAATT GAT CATCT CAAATTTT C C CACAG T C CAT AGG CACCAG ACACTTCAAAT AAT CTCAGAAAAGAAGGTGATGTCT ACATC C 
AGAATCGTGCCAATACTTTAAAATTACAGGGAGAAAAATATTCCCCT 

CCCAGGTCCCTTGGTGTGAGCTGGGGGAGTGCGTAGAGCTAGGAATTGGAAAATATTCO^GGAGAGA^ 
CCCATTGGGAGAAGCAGGAAGTGGGTCTGGAAAGAC!AGGAGGAAGTGTAAAG^ 

4 0 AGTAATTTTTGGCATTGTTAGTGAGTAAAACTAGAGAATGAGGACAGTTGTA 

GGAAAGATGATTTCAACATTTTGGAGTCATATAAGTGGAATCCTCCTGTTATT^ 

TTCTTTTTTTGGATGTTGTTCTGTTGCTGCTTTTATAACACTACACT 

AGATTGAAAAAGAGCAACCCTCCAGCGTACCTTTCCACACACTCTTCTACCT 

ACAAGGCCTTTCAGGGAAAAGGATT ATAG CCAAGCTCTTT AGTGGTTACTG AGCT ACAGACCTGG AGGTGT CAAGATCCT 

4 5 CAGGAAAGATTCTTTATGAAAGGAAAGTCTOTGAGAATAAGAAOTTCrGG 

TCGGGCTAATTTATGATGACTCTGAAAATGCCCATGACCTCAT^ 

ACAAAATAGCTTCCAAGACAATTTGTTTCCTTCTTITCAAATTGATGTGTT^ 

TTCCAAAAACATCTGAGGTCAATCCTGGTTATTTGTGCTAATGAATA^ 

GTTG AG CTAAAATT ACTT AT AAAGAAATG CAT AG AT CT G CGG CAT AC AATTTGATG AG TTTTG ACAAATGTGTACAG CTG CAAAAA 

5 0 CGTTTTCATCACTTCTGCCTTCTGGTCAATACCCACCTACCACAAGAAACCACTGTCTGC 

CTTTTTGTGAACTTCATATAAATGAAAGCAAAGTGTATGCACTATTTCGTC 
TTTATCAATATTGCCTGTATCAATAGTTCAT^ATTTTTTATTTTTGAATGGTTTCT 
TGTTCTCCCAATGATGGATGTTTGAATTATTGAGTTGGTTATTATTAATAAAGCTACTATGTACT 
AAGAAAGAAGTTACTATAAATATTTTATAAAAGTCTTTCTGTAGACACATCCCTTCAT^ 

5 5 ATTGCTGAATTCTAGGGCAGGTATATATTTAACGTCAATAAAAATACTTTCTTT 

TCTTTGTTTGTTTTTGAGACAGAGTCTCACTCrGTTGCCCAGGCTGGA 
TCCrTAGTAGAGAAAGGGTTTCGCCATGTTGACCGGGCrGGTCTCAAACTCCTGACTTAA 
AAGTGCTGAG ATT ACAGGCGTGAG CCACCATGCCTGGC CAAATACTTTCAGATAGTGTTCCAAAGT 
CACCAGTAACATAAGAGAGTTCTG^TAGTTCTACTTCCTCCCCAGTATTTGACGTATCAGTCTTTTATC 

6 0 CAATGGGCTATCTCATCATTGTTTTCAACTTGCATTTTATTG 

GTATCTTCTTTGGGGAAGTGATTGTTCAAGTCITTGCCCAT^ 

TATTCTG G ACACAAG T ATTTGTG ACAAAT AT AT ATATTG CAAAT ATTTT CT CTC AG T CTAT CATTT ACTTTTTCATTTTTTTCTTT 
TTTTTTTTTTGAGATGGAGTCTCG CTCTGTC ACCCM CTCCCA 
AGTTCACGCCATTCTCCTGCCTCAGCCn'CCCCAGCAGGTGGGACTACAGGCGCACACCACCACACCCGGC 

6 5 TTAGTAGAGACAGGGTTTCACCGTGTTAGCCAGGATGGTCT 

CTGGGATTACAGGCGTGAGCCACCACACCCTGCCTACTTTrr 
TCTTTTATAGTCAGTGCTTTTGCATCCTCTATGAGCAATCTTTGCCT 

GAAATCTTTTT AGTTTTGCCTTTT ATG G TT ACATCT AT AAGATAT AGAC CT AG T ACAT AG AT CTTTAT CTTT ATATATAT AAT AAA 
ATTAATTCTTGCATAAGATGTGAGTTTTTTTCT 

7 0 T CATTGAATTT C (HTGG CACATTT ATTTGTTT AAAAAAAAAG CAGTTGAACAAACATG AGTTGAGTC CATTTCAGGACTTT CTG CT 

CTGTTTCATTACTGTTTGTCCTITATGCTAATACTACGCTGCCTTGATTTCCATA^ 

GAATCCTCCAAATGTATACATCTTTACGATTGTCTTAGATATTCTAGGTCTTITGT 

AAATTTGTACTAAAAAATTGTGTTGGATTTTTAAACTGTAGTTGCCTT 

ATCAATG TTACAATCT AT AAAG AT ACT AT ATTTTT C CATTTG CTT AATTCTTTT AAAATATGTCTCAG CAAT ATTTCAGAGTTTT C 
7 5 AGTTTAGTGGTATTAOUVGTTTTTGGTAGNNN^ 
NNNNNNNNNNNNNNNMWNNNNN^
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

5 NNNNNNNKNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNinraNNNNN^ 

NNNNNNNNNNNNNNNHNNHNNNNNNl^^ 

HIWNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNKNNNNNNN^ 
10 NNNNNNIWNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNinJNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNJWNNM 

NNNNNNNNNNNNITONNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNITONNNNNI^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNIWNNNNNNNNNNNNNNNNNNN 

NNNNNNIWNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNKNNNNN^ 

GCTGGGGTGCrCGGCTGACTGOUVCCTCCACCTCCCAGGCT 

ACATGTGCTGCCACACTCGGCTGATCTTTTGTATTTTTAGTATA 

GACCTCAGATGATCCGCCTGCCTTGTCGCCCCAAAGTGCTCGGATTACAGGCATGAGCCACC^ 
CTATTGAGCACTTTTTTGAGTAGGTCAAGTATGTCACATTTTCCTGTTTCT 

3 5 TTATGAGCTATACATTATAAAGGGTCTAGATTATCTTATGGTC 

CTAGCGCATCCrCTTGTCTATTAGGCTTGGTTTTAATCTTTGTAAGGGCAGGTCTAT^ 

CTCTTTCTCTAAAGCAGGATTGGAAAACTACAACCCACAGCTCAGACACCT 

TCTCAGCCCTCACAATGGACAATCAGGGCTAAGTTTCACTGAGTTCTGCCCTGTTC^ 

G AGGTGG AC CCCT AT ACAT AATTTTGGGGCGACCTTCATGCAGTGTCCTCT CCAGAGCCCCACCCCACAAATCCCAGTCATGTCAC 

4 0 GTCAATGAAACCACTCTGCTTGGGTTCCACCTGGGTGCrGCOSGGAAGGAAAGC^ 

GCCTCTCACGTGTTTC CCTCCTCCAAAATCATGTAGCCCTGAGCTGC CTGTTGTTCAG CACCTGAAAGCACTTGCTGTAT ACATTT 
GGACTGGTTTTACAGCTGTTTCAGCAGAAACGTAAGTTTTGGTCCCAGT^ 
TTTTCTGATTATTATCCATGTCTTGCTTTAGTAAC^GATTTATCTTCTG^ 
TGCCAACAAATGGTACCAGTGATATGAGAAGTACAGAGAGACAAACTTTTGCA 

4 5 CTCTATCTGAATTACCATTCTCTTCTTCTGGACAGTGAGAAAAAGGAAAGGAAC^ 

CCAGGCTTTCTTCAAGTTACCTTAGTATATTATTTGCTATAATCATTAAATCA^ 
CCTTAGAGAAGTAAAATAATAAAACTATGTAGCTTGTAAGCTGCTGGAGCTAG 
GCTGCTGCTCTTGCCTTATCATGAGATTACCAGCATTTATTGAGCACT^ 
CCCATTTAAATTGCTTTTGCTTAGTCCCTTGTTGGTTAGAGATGGGCCAAGCTGTCT 

5 0 GAGACAGAGTCTTGCTGGGTTTCCCAGGCTGGAGTGCAGTGGCGTGATCTCGGCCCACTGCAA 

ATTCTCCTGCCTCAGCCTCCCAAGTAACTGGGACCACAGGCACATGCCACCACGCCCG^ 
CTGTGGTTTTCACCATGCTGGCCAGGCTGGTTTTGAACTGCTGACCTCAT 

TTACAGGCATGAGCCATTGCACCCAGCCTCCCTACCTTCAGTTCTTAGGGGAGACAGGAGAGTCCT 
TCTTCCGGAGGACAAGGTGCTGGGCAGAGTCTAAGGGCACAGAGAAGCTTCACTCAACCT 

5 5 CCACAGCCATAAGTACAGCCAGCAGCCCTGATO 

CCTCTTCCTGT CCCTGTCATCTCC CCGTCCACTTCCTTT ATCCAGCATGTG ATATTTCCCCCCAAAATCT AATCTCT^ 
G AACTTGTTG CAAAGTAGG CAATG GAATT ATTTTTCTGATT AG ACCT G TG C CACTTTT CCTTTT AAT AG TTTG ATG TTTG ACC ACT 
GCTT CTGGTG C CTTT AG TGAG AT AC CAG AGT CCATTTTT CAGG CTG AGGG AGGGT AAGGT CTT CAT AAT ATAAG AAGG T ATTTGGT 
GAACT CAG AGGTGGG CAGGTGG GTGAG AAAT ACAC ATTT CCT ATTTG C C AG ACATTGAGTG AAGCACTTTCTG T AT ATTT CTC AAA 

6 0 AAATGGCGCACAGTCATTAATTATAGAAAGTCTAACAACrCGTATATCCTTAAAAGTATCTTGC^ 

TTAAATGTACGTAACTTTGCTTCTrCCTTTCTTCACATTCCI^ 

AATAAAACCTTGTTAATTTCTACTCATCTAGTTTGAAATACATTGTGCT 

GCTCTGATACAAAGCTTTCrAAAGAACAATGGTAGCAGCCTTAGTTG 

AGTGAAGAAATATGTGATTAAGTCATTATAGGATTAAATATGTGAGTACTTGCCTGGGGCCTCTT^ 

6 5 TG CTTTG CAGAGCTCTGTt^CG ATG ACCCGCCAG AG ATCCC^CACGCCACATTCAAAG CCATGGCCT ACAAGGAAGGAACCATGTT 

GAACTGTGAATGCAAGAGAGGTTTCCGCAGAATAAAAAGCGGGTCACT 
GGGACAACCAATGTCAATGCACAAGCTCTGGTAAGTGTCCCIT 

AGAGCCCAGCTTTTGTTCACTCAGCCCTATAGACACAAATGGGCCAGTCCAGTCTAC^ 
AGTCATTCACTAAACACAATTTTTTTTTTTTTTGGAGACAGACT 

7 0 CGGCTCACTGCAACTTCCGCCTCCTGGCTTCAAGTGATTCTC 

CATG CCTGG CTAATTTTTGCATTTTT AGT AGAG ATGGGGTTTCACCATC CTGGT CT CAAACT C CTGACCTTG AG T AA 

T CTG CCCAT CTCAG CCACC CAAAATG CTGGG ATT ACAGGCATG AG TCACAG TGCCCGG CCACACAATTTT ATTTTT ATTTTTGAGA 
CAGTCTCGCTCTGTTCCCCAGGCTG<K^TGC(^TGGTACAATCT 
GTGCCACAGCTGCCCAAGTAGCTGGGACTACAGGTC^ 
7 5 CGCCATGTTGG CCAGGCTGGTCTCGAACTCCTGGCCrCAAGTG ATTCTCCTGCCrGGG CCTTATAGGCGCACACCACCATGCCTGG 
CTAATATTTTGTATTTTTTGTTAGTGATGGGGTTTCCCCATGTTGTCCAGGCTGGTCTTAAACT

GC CT CAG CCTC C CAATGTG CTGG AATT AT AGGTGTGAG CCAC CACACCTGACCT AGACACCCTCTTTT ATCCTGAG CTGGAAT ACT 
GATCACCCCCCTCACTGG^^CACTCAGACTCAACTCCTTTGCCAAACT 
CATTGTGTTAAGATGGOSTCTCTAACTT'CAAGCATTCTGAACA^ 
5 TCCTGCTGCTGTCAAACAACACAGAGACAATTGACAGCAAGC^ 

TACATGACATACACAGACACGTAACATACATGGCATACACAGACACGTAACATAC^ 

CTCAGCAGCCAGACACCACCAGACACATCGTGGGCCCTGGACAGGTGTGCTTCT 

TGAAGCATTTGTGAAATTATCTGCATCATTCCATCTATATTTTCTAGCCACT 
1 0 AAGAACAGAAAGAAAGGAAAACCACAGAAATGCAAAGTCCAATGCAGCCAGTG 

TCTCCTCCAGCTAACTCCTGCTAGCGCACCCrTCTCCTGCAGGCAGATGA 

ATAAACACATGCATGTGGCTCCGTCCTCCCAAGACATCATTGCAGACAAGAAT^ 

CAACTAAAGCCAATCrGACGGCAGCCAATCCAGGTCTGGGCTTACTCCCT 

TCAACTCAACTTAAAAGCGTCTGTTGGGTCTGTGTGCCAGGCCCTGCTGATAGAAGA 
1 5 CCCTCAAGGAAACGCTCTGCGTCTTCCCCTTAAGCACAAGACAACGC^ 

GGCAAGGTTGTAAAATGTGCATGCCACACTTGTGAGTCTCAAAAGCTCCA 

GGGCACTGTGGCTGTCCCTTGTCTGCCCTCT13CTTTCCCATC 

GCAGTGGCTCACGCCTGTAATCCCAGCACrGTCGGAGTCTGAGG 

CAACATGGTGAAACCTTGTCTCTACAATCATACAAAAATTAAGCTGGGCATGG 
2 0 GCCAAGGCATGAGAATTGCTTGAAATCAC^AGGTGGAGGTTG 

AGGAAGACTCTGTCCCCCCTCAAAAAAACAAAACAAAAAAGAAAAAAAACCCACAGAACCTG 

GTCAGGCTAGCCTTAGGAGACITCAGAGCCCTTTGCTGATTTTA 

CTGGCTATACAATGAGTCAGGACCAGGAAGGGTTACTTCAACTTACTGAAGAAACT^ 

AAGCT CACAT AGT AGGGCTGGAAT CAGACCTCAGGTCT C CCAG ATT AGATG AGGAAACTTTCH^ CTGATT ACTTATTTTAAAAG AG 

2 5 TTTGAATTTACTTACAGCTTCCATTATTTTAl"rrrri'A 

TCCCCTTCCGACTTTGTCCTGGAATGGTGTTTCa^ 
GATTATTGGAAGGCTTCAAAGAGGCCAATGCTTGTT'GTC 
CAGTTTGGGTGTTTTCATTGGAAAGAAGCAAACCCAATTCAAACATGT^ 
TTTATCCTGAGACCAAAACACAAGTTCAGTCGTGATGTTCAAGGTGCCACATGGCAG^ 

3 0 TCCTGCTGGGG AAGAGGCTCTGG AGGATATTT AC AG AGG CTCTGTTGGGT ATGCAGGGTTGGGGAG AGTCTGG AAGTTCAGAG CAC 

GTGCGCAGTTTC^CAGGCTGGGGTTAGAGGGCACATACAATTCC^ 
TAATCAGAACAAGACATGGTCTCAGGCCACAGGTCCGTCAGGCCTGGGATGTAGTT^ 
TTTTGAGACGTCTTGAAAGGGCGCATGT<3TCTCAACCTTCAGGT 
TCAGCCCGTCCCTCCTCCTGGCrTTGGTGTTGTCCCCA^ 

3 5 CACTTCCTCACTTGTCCAG CAGLfiAATCCAAGGTGGTCTTCATGCTCCATTGG C CCAAAT AGG CTTG AGCCTACCCTTGTGTCTCAG 

CTGAGACTGCCCCGCCCACTCCTGTCC^CrGTCMCACCT 
CACCAACCAGAATGAATCGCTCATTCCTCTCTGTTCCCATTCCCA^ 
GTCCACCCCTTGCAGGAGTGACAGGGTCTTCGTGCCTTCCTACAGGG^ 
ATTGGTTGAATGAAAGGATGAGGUUSCAGTCAAGAGAGTAAGAACCAAGGTCCAGTC^ 

4 0 CGGTGCTGTCCCCAACTTGCCTTCAGAGCGTTCCTTCCATCTTCCAGGTCACT 

CAGAGAGAATTTATCATTTCGTGGTGGGGCAGATGGTTTATTATCAGT^ 

GAG AG CGTCTG CAAAATG AC CCACGGG AAG ACAAGGTGG ACC CAG CC CCAG CTCAT ATG CACAGGTG AAATGG AGACCAG T CAGTT 

TCCAGGTAGGGTCGCTCCCTTCTGGGGTCCTICAATGTTGCTT^ 

TGGACAAGAGTGGTCCTTGGACAAGACCCTTGATCTCCCTGGGCTTCCATT^ 

4 5 GACCAGATCAGATGGACTCTAAGGGCTCCTTCAATGAAGGTCCCCCT 

GGAAAGCCCTCTCCTAAGCACCCTAGATGCTGCCTCGTAGTTG^ 
GCTCCAAGGGCAAGTCCAGTTTCCCACTCAC!ACATTTTTCCTCTTACTGG 

GGTCAGCATGTTCCTTCCCCTTGTGCCATTCCCCAGCTGAGCTTGTAAAAATAAACATAAAATGTGCAGAGTC 
TGCACATGCGCGCACACACACACACACACACACATACTC^ 

5 0 TCTG AAATCTTATT AGT ATCACG ACAATCAAAAGTGACTTACATG CAGGCTGGGCACAGTGG CTC ACACCTGTAATCCCAGCACTC 

TGGG AGG CCAAGG CTGGGCAG ATCACTTTAGGTCAGGAGTTCGAGACCAG CCTGGCCAACATGGTG AAACCCCGTCTCT ACT AAAAA 
T ACAAAAATTATCTGGG CGT<WTGGTGGGTGCCTGTAATCCCAGCTACTTGGG AGG CTGAGG CAGGAG AATCACTTGAACCCAGGA 
GGTGGAGTTGCAGTGAGCTGAGATC71CACCACTGCACTCCAGC 
ACATTCAACCACAAGTATTCCTGGTTTTCTTGCTCAlCTGC^ 

GAGCCAGGGTTGCTGACCCCTOtfXs AG AAGTGGAGAGCCTGG CACAGAAGCAG CTGTC CCTT AGTG CATCTCTGCTCCATCC AGTG 

GTTCAATGGCATTCGTATTATTATGACCTGTTTCCAAGTCACAGATGGAAACCAGGACT 

GCTCAGAGTTTTGGGGACAGCTAGGGTCAGAGTTAGAATTCAGCCTCGTCTGCTC 

CCTGCTGTGGAACCATGTGGGGTGG CATCCCTGGCGAAGTGCCCACACCAGGATTCCTGCCT CCTCGGGCGTGT^ 

6 0 GCCTGGGGCAGGCGGGGTGGGAGTGAGGCCCTGACTCCTGTGTTTA 

AAGCCTCAGGCAAGCCCCGAAGGCCGTCCTGAGAGTGAGACTTCCTGCCTCGTCACAACAACA^ 
GACCACAGAGGCCTAGTCCAAAAG^GCAGGGGTGACCAGGAGCCAGGCT 
CCACAGCAGCCTCCTCTCCCTCTGAGCAGGGACAGGGCCrrCAGCAACCTGC^ 
GGACCAGGACCAAGCACGGTCCCCAGGCAGAGATGGAACACCTTCCCTCACC^ 

ACAGATTTTCAAATACAGACAGAAATC^KTGCAA 

CACTGGCITTGTGGACAAAATGTACACCAGGCTGAGATATGGACAGGTTGACT 
GTTGGTTAGTTGGTTGATTGGTAGGTTGATAGGTAGGTTCGCTAGTTGATTGG 
TAGTTGGTTGGTTGCTTGGTTGGCAGGTTGGTTTGTATGTTGGTAG^ 

7 0 GTGGGTTAGTTGGTTAGTTGGTTGGTAGGTTGGTTGGCAAGTTGATAGGT^ 

TGGTAAGTTGATAGGTTGGTTGGTGGGTTGGTTGATAAGTTGATAGGTT^ 
GTTGGTTGGTAGTTTTGTTGGTACTTTGGTTGGTT^ 

TGGTTGGTTAGTTTGTTGGTAGGTTAGTAGGTTTGTTAGTTGGTTAGTTGATTGGTAG^ 
CCTTACACTGCAGAATGTGTGAGTGAaAGACTAATCCAGATACTGACG 
7 5 GAGCAAAGT CTaUVGT<^GCCCTATCAGCrCCTCrCAAGCTGATCTGGAAGGG CTGT ACC CCGGGCCTCTGCCAGGGGTCAAG AGT 
GGG AGGAAGG AG CCACGT AG CCAGTCAG AG AGG ACAGG CTTCCAG AAACACTG CCATTG TTTC CTGGT AGGAGGG AAAAACAG ACT
CTG AG T CTG CATCTG C CT AGAAAGT AAACAAATGTC AG C CT CCAAAACCTCAGCAG AG CAGCATCTGG AAAACCCGACCT CTTTCA 
GGAGATGTTAATGTGCTTCTCATCATCTTCATATTTCAACATTT 
TCniAGCCTTTCTGAGGGGCTTTTGTG^ 
5 GTTCCTCAAGACTCCTAGGAAATCCCATGGGGAGGGACCTACTTCCTGAG ACAGG 

TGGGGATGTCCTCAGCCTGCCCTGTGGCCCTGGCTCCTGCTGCCCCCATGCCA^ 
ATTGACAGTGGCCGGCTGTGTTTTCCTGCTC^TCAGCGTCCT 
ATC^CCAAGGCAGCCCTTGGTCAGATCAAAGTCCTGTACCC^ 
CACCTGCAGTTCAGGGAXjTGGCAAAGGGAGAGGCCA^ 
1 0 GCAGTGGCTCACGCTGGTAATCACAACACnTTG^ 
CCAACATGGC^AAACCCCATCTCTACTAAAAAT^ 
CGGCTGAGGCAGGAAGATTGCTTGAACCTC^GGAGGCGGAG^ 

AGAGCAAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAAATTAAACGACCAGGGAAAT 
AGCACCCACTATA(^CCAGGCTTGATGCTTGTTTTATAAACATTATTG 
1 5 CTAGAGAGATGCAACAGCTTGGAGAAGATGGCACAGC 

GCCCCTACCTATAGAGGACCCAGGGCAGCATCTATGCTGAGCTTTGCCTATCCTGT 
CACCGGCATCCCCTTGACCAGGAATGTAAACCATGTGTGGTCTTTTT^ 

CATC CAG GAG ACAGTCTTCT AAAAT AC CAG TGAGTGG CAAG ATATGAGTAAATT CCACAGG AAAAC^AAGGCAGT CC AAAGGG CTG 
TGCTTCTCCTGTGGCTCAGTGTTATTCATAGCTGTGAC^ 
2 0 TGTGACCCACTTTGAATTTGGGGGGAATCITCATTTG 

ATGT AGGTTG TTCAG C CAAAACAGCAT AAT AATTT AAAAAGTT AAAAGAT AG C CTGT AAT CC CAGC 

ACTTTGGGAGGCTGAGGCAGGTGGTTACCTGAGGTCAG^^GT^ 
AAAATACAAAAATAAGCCAGGCGTGGTGGTGGGTGCCTGTAATCCCAGCTACT^ 
AGGAGGCGGAGGTTGCAGTGAGCCAAGATCGCGCCACTACA 

2 5 AAGAAAAAGGTAAAAGATATCAATGATTGGGCTCCAGTCAGAATGTCT 

AGATAAAATTCATCATTTAACCATCATAAAATGTACAGTTCAGTGGTGTCTTAGCC 
GACTC^TAACTTATAAAGAACAGAGATTTATAAAGAACAGAGATTTATAACTTATAAA^ 
GCTGGGAAGTCrGAGGTCGAGGGGCCCCCATGTATCAAGGGCGTTTGTGCT 
AGTGAGAGACAGCAGGAGGGGCTGAACTCAGCTTTTATCACAAAC 

3 0 GCCCTCGCGACCTGATCACCTCTTCAAGATCTCACCTGTCAACATTGCTG 

GGATGTATTCAAACCACAGTAAGTGGTATTTAGTATGTTTGCAATGT^ 
TTCCCAAGAGAAGCCCCACACTGCCAAGTTCCCCATTCTCCCCAGCCCCGGGAAACC^ 
GCCTATTCTGGACIATTTCGTGTAAATGGAATCACGTGGCCTT 
CTAATACATATGCATCCAGGTTTGACTUVCCCTACACCCGGCCATTG^ 

3 5 CATAAAGAAACGATAAATGTTTGAGATGATGGATATGCTAAATACCATG^ 

TACATACCTCATAAATATATATAACrATTATGTGTCAACTTAAAAACATTT 
AGATTGCTGGGTCCCACCTCTGGAGTTTCTGACGCAGTCGTCTTGCTTAGGTCC^^ 

AT ACTGACGCTGCCGTTCCAGGGAG CAGACTCTGAGGATCACCGCACTAGGCTAAC^GCATT AAGAATCAGGG AAG CTGGCCAG AC 
TCACGCTTGTAATCCCAGCACTTTGGGAGGCCGAGGCAG<JCGGATTACT^ 

AGGAGAATCGCTTGAACCCAG^IAGGCAGAGGCTGCAGTGAGCTGAGAT 
TCTGTTTAAAAAAAAAAAGAAAGAAAGAAAAAAGAAAAGAAAGAAAAAGAAAAAGTAAACT 

TG ATTATCCAGGGCCTCCCTGACCCCTGCCAGC ATCCACTCTGG CCAACC CTGATACAAACTCCACAGTAAG ACCCTG AG AAAATA 
CAGATCCACTCCCCCAAAACCAAAGCCCTTAGCCCAGCCAGAGAGAAAAATAAAATATCTCTTCCC^ 

4 5 TCCATAACIATACTCCTAATTTTCCTCATGC^AATTAAAAAGTATAAATTACTG 

AGCAGGAATGATTCATATACCCGGCAGTCrACCAGTTTCTCTAAGGGGTTAGCT 

GGGGAAGGATGTGACGTGGCTGACATGTTGTACGTGTACTCAGTACTTGC^^ 

TTAACACCTTGCAAGGATTTTTTTCATTTGTT^ 

AATGG TTCAG TGTTT ATG TTG TGAAACAAAT AAC TGTGTG AAC CT ATG TGAGTTCC CACAAGGT CT C C CAG AACT ACAAT CTTT GG 

5 0 GGAGACTCAAAC^TACATCTAGAAATGACCCCTCTAGTCAGTTTGTCCAAAAGCCTTCCT 

CTCTTGTGGCAACAGAAATGCTGCATTTGAATTACCGAAAG 

TGTTTGTTTCGGGCAATGACCAGGAGCCCGTGATGTGCTACAGAAGGCTCAC^ 

GACTTCTGCAAACTGGTTTTTAAGCATAGCCATTATGGAAAATTAA^ 

TGC^CTTCGGGAGACTAAGGTGAGAGGCTCGCTTGAGACCAGCCTGGGCAACAAAGTGAGACCTTGTCT 

5 5 ACTTAGCTAC^GCGAGGTGGCACATACCTGTAGTCCCAGCTACrcGTAAG^^ 

GATTGCAGTGAGCTGTGACCATGCCACTGCACTGCAGCCTGGGTGACAGATTGAGACGCT 
AAATATTTAAATTTACAATTTAATAAATTATATTAAAAACAAAGGCAAG 
CCTTTTACTGTTATCTTTGTTCGCTGGGTGGTTTACATCTATTGTG 
ATCTCTTCCCAACCGCGTGATCAGTGATGTCACATTGCCAGCTGGAAGCTGTGTGT 

6 0 CAACCGTCATCAGAGAGCTGCTTGTTAAACATCACCAGCACACCTCTGCTTGGAGT 

GGAGTAATTAGAAATTCCCATGAGCCAAATAAGAACTGGGAAAAGGCTAAGACCCT 
GTGGATACTTAAGTGTATTTAAACTCAAATAAATTAAATACAATTAAAAATTCAGTTCCT 

TTCCCTAG CCACAGGTGTCTGTGTTATCGGACAGCACAACTG C AAAAGATTTCCATCATCTCAGAAAGTTCTAT AGCACAGTG CTG 
GCTAGIAAACCAAGTGCTTTACrGCATGCACATCATTTAGCACAGTTAGTTC 

6 5 CATATAAAAACTTAGGCCAGGCACIAGTGGCTCACACCTGTAATC 

GAGTTCGAGACCAGCCTGGCCAACATAGCAAAACCCCATCTCT^ 
GGTTTAGAGTGAGGACCACATTTTTTTGGTGCCGTGTTACACATATGACOT 
AGAACAATCTAGAAAACCAAAAGAACAAGAATTTCTTGGTAAGAAGCCC^XSAA 
CAAAGGTGCTAAATGGTCGCCCAC^AGACATCCGTTGTGCITGCCTGCGTTTT^ 

7 0 CAGTGGCAACCTTGTCTCTATGCCAGCrCAGTCCC^TCAGAGAGCGAGCGCrAC 

C^AAGGGCAAAACCACTAGAACTCTCCATCTTATTTTCATCT 

CTATATGTAGTATAAAGAAAAGTAGGTTTACATTCATCTCATTCCAACTTCCCAGTTC^ 
ACGTAAATACACAACACACACACTCTACCCTATACAACTGGACATTGTCTGCGTG^ 
TCTCCCGTTCACGTTGCCTAATAAA(^TCCrTCAAGAACTC^ 
7 5 AGCTAACCCCCTTCTACTGAGCCTTCAGTCTTGAATTTCTAAAAAACA 
AGGGCAGCCAAACCCATGAGGCAATGTCAGGAAC^GAAGGATGAATGAGGTCCCA
TGTGCGTTACTAATTGGCCTCTTTAAGAGTTAGTTTCTTTGGGATTGCT^ 
TGTTTACAGGTGGACACACAAGGTGCAAATCAATGCGTACGTrTCCTGAGAAGTCT 
CAATGTITATGCAAGGAAGGAAAGAAAGAAGGAAGTGAAGAGGGAGAAGG 
5 AGAGCGCATCAGGCCTGGCACGGTGGCTCAGGCCTATAACCCCAGCTCCCT^ 

AGTTTGAGACCAGCCTGGGCAGCATAGCAAGACACATCCCTACAAAAAATTAGA 

TCCTAGCCACTCAGGAGGCTGAGG CAGG AGGATTGCTTGAGCCCAGGAGTTCGAGG CTGCAGTCAGTCATGATGGCACCACTGCAC 

TCCAGCCTGGGCAACAGAGCAAGATCCTGTCTTTAAGGAAAAAAAGACA 

AGTTCAGCATATTAGAATCACCGGGAGGCCTTGTTAAAAGAGTTCGCTGGGCCCATCTT 
1 0 TAGAGCCAAACGTTTTG TGTGTCTAACAATTCCCAGGTGCTGTTGCTGCTG CTACTATTCCAGGAACACACTTTGAG AACCATTGT 

TATTGCTCTGCACGCCCACCCACTCTCAACTCCCACGAAA 

TATCTGGTGCAAGATCTCCCCTCACGAATCAGTTGAGTCAACATTCTAGCTCAA 

TTCATTTGGGAAACTATCAGCCAGTTTTCACTTCTGAAGGGGCAGGAG^ 

AGATTCAGATTAAATAACTATTTTCTGTCATTTCTGTGACCAACCACATACAAACA^ 
1 5 TCAGGTATATGTTTTCTGAGTAAAGTTGAAAAGAATCTCAGACC^GAAAATA 

GATTACGACTTTTGGGTGTTGAGGGGTCTATAAGATC 

TTCTCTCAAAAGTGTATGGTGGAATTTTCCAGAAGCTATGTX3AT 

TATTG CT ATTG TTT AT AAAAGAAT AAATG AT A'l"rX"i"l~l AAAAAGTCTTGGTTTTT ATT AT AAAT ATG^ CT AG CTATA 
ATCTACATAACCAAAAACCTTTATAGl^'lTl T GGTGCX rr r i 'TTTTGri T l' ri^ 
2 0 CGTTGCCCAGGCTGGAGTGCAGTGCACAATCTCGGCT'CACTGCAGCTTCTGCCTGCT 
CCCAAGTAGCTGGGACTACAGGCACCCACCACCAAGTCCAGCTA^ 

CAGGCTGGTCTCGAACTCCTCACCTCAAGTGATCCACCCAACTCAGCCTCCCAAAGTGGTGGGATTAC 
CCAGCCTGGTATCCTCTATAGTTTTTAATAGTTTTAAATAACA^ 

TGGTCATTTCAGAAATGCGAATCAAAACCACAATGAGATACTACTTCATACCCATTAAGATGA 

2 5 CTCACAGATGGCTGGTAGGAATGTCAAAATGTGCAGCCACTGTGCAAAACAGAGTCTGT^ 

TAAATTTGTCCTATAACCCAGCAATTGTATTCCTAGGTACACACCCAAGAAAGACAAAAACGTGTCTAAATAC^ 
ACTCTTTATAACAGCATTATTCATACTCACCAAGAGGTAGAAA 

TGGTGG CTCATGCCTGTAATCCCAGCACTTTGGGATGCCAAGGTGGG CAG ATCATTTGAGG CCAGGAGTTGGAGACCAGCCTGGCC 
AACATGGTGAAACCTCATCTCCACTGTCTCTACTAAAAATACAAAAAATTAGCTGGT 

3 0 GCGTGGGAGGCTGAGGCACGAGAATCCCTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCAAGATGGTGCCACTG 

GGGCGACAGAGGGACTCTTCTCTCCAAAAAAAAAAAAAAAAAAGAATTGAT 

TAGCCACAGGAAGGGATGAAGTATTGATCCATGCTGCCATTTAGATGAACCTCAAAAACATTATGCTGAGTGA 

CAAAAGATCACTACTGTATGACTCATTTAACATGAAATATCTAGAATAGGCAATTCCATGGAGAC^ 

CAGGGGATTTTTCTGGGGTGGTAAAAAAGTTTTGAAACTAGAGGGAGGAGG 

3 5 TGGATTGTATACTTTATCATGGTTAATTGTGTGTTATGTGAATTTCGCCTCAACT 

T^AAATATTGCTTATCTTATTCTTGATGCTCAAAGAAGGGTCATCAGCCCAGAAGGATCCTCCTGAAGTC 
TATTCTACTAAGTGAGTTATCAGGGGACCCTGAATTTTTGTTTTCAAAAACCAAAATTAGCATATTACTA 
CCACTTTGGTATTCTATCGAGAACGTAAAATTTGCATAGCTCCC^GGAAGGCTGCTGTTCCTGA 
GTTGGTTTCAGTTTTTACTCGCTGGTATTCCTTCAGCT 

4 0 GGCTCCCAACGTGCATAATCAATTAATTTGCTGTXK5GTTGAAAATCTGAACAAACT 

AAGTAAGTAAGATTACAGGGTAGCTCACTGTCCCCAATATTGGAGAGGCAGACAGGCTGATACAGG 
CAGAAACCCAGGAACAAACCCTGCTACTGGAACCAGCAGAGGGGCAGGAAAACCTAAACT 
GTGTGGACACTTCTGAGAGATAAAAATTTCAGGGGAGCAACCCAGTCACAGGGGGTCCCCACTTTC^ 
CTCAATGAGGTCTTCACAGTGAAGATGGGAGAAAAACTTCCTTGTGCTTTTGACGGGGGAAAGAGAAAAG^ 

4 5 CCGTGTGAAGAGAGTCCACCAAGAGGCTTTGTGTGAGCAACAAGGCTG 

AGGATTCAGCAAAGGGTGGTGGGATTATCATTAGTTCTTATAGATTTGGGATAGGCATACAA 

ATATTACAAAGTACCTTCTTAAGGTCAGGGGAGAACATATCGTATCAGTTAGGGTGGGGCAGGAACAA 

CATCAGTTAAGGCTATTITCACTTCTTTTGTGGAtCTTCAGTTGCTTC^ 

TGATGGCTTAACTTGGGCTCAGAGGCCTGACAGGTACCATTCTGAAATATGCCAGAGCATTCAGTACTTC 

5 0 CTCAAGAAACAGTATTTTCCTCACTATCTTAAATTTTACCATA 

GAAGGGAAATAACCAGCTCTAGCCTCTCTGCCCTTCJITGTCTCATCTAAGGGAGGAGGCGAATGAA 
GCCCAGGGGTACAGGCTTGCTGAAGGCTGAACCTAATCATAAGACTATGGAATGGTTCCCCTCCCCCTGC^ 
AGTGGGGCTCTTGTGCGCrCACAGGGGACTACAGTGGAAACAACT^ 
CCCAAAGATGAGAGGGGAGACAAAAACAAAGACACTAGAGGGAATTTTAGCCTCTGA 
55 AGCCTAACTCCTAACCAGATAAACATAAAGGCTCACACTAAAGGCCTAT^ 

CTTTCAAC AAAAAATT ACAAGGCATG CTAAATGGCAAAAAAACACAGATT AAAG AGGCAAAGCAAG CATTGT AACCAGACTCGGAT 

ACTACCATGTTGGAATTATCAGACTGGGAATTTAAAACAACTACACAAATAAGCCATTATGAC^ 

TCCATTCTTCCCTTCCACCATCAGTAATAAAATCCCCAAATTTTAACCCAACACATTTCTGCCCCTAGCTAAAGACAA 

AGC CTCCCTT GCAG CT AAGTGTGG TCATGTG ACT AAATTTGGATG AG ATGTAAAC CAAAAAGT AT CTG AG ACAGGT CT AAAT CC AT 

6 0 TCAGAGGTTTATATTGCCAAGATTAAAGACCATGACCTGTGATACAGCATCAGGAGTTC 

GTTACAGCTGGGTTTGATATTAATGCATTTTAAGGAGACAGAAATTATAGGCGAAGA 
GGTTCAGCCTAGAGAGGTGGGACATGTCAAAGTGGGAGGCTTCCAGGTCATAGTT^^ 
GAAAGAGTTAAGCTATGCCTGAGGAGTTGAAGTCAGTAAAAAGAAATGCTTGAGTTAAC^ 
CTGTCCTTGTCATGTACACGAAGCTTCCAGATAGCAGGCTTCA 

6 5 GTCAGACTCTCTGGAAAAGACCCATTAAGAGAAGGGAGATTCCTCCAGTATGAAAAT^ 

AATATGTCAAAAAATATTITGGGGTAAAATAATTTTATTTCCTTCTTGGCCT 
CTGGGArrGGGTATCTTGTTGCTAAAGCATCTGTTTCrrTAGG 
GAATTCCAAAGGAAGGGATTA<^TGAGGCGTGTCAACCCCCCCACTTCrrc 
ATGCCCTTGGCCAAAAAGGGGCTCATTTAGTCAGTTGGGGGCTTAGAATTT^ 

7 0 GTGTGTGACGCCCAGGGTGTGCCCTTGAAGAGAATGTGCACACCCTCCTTCATCTCCT 

CGTGAGCTAGAACCACCGGGAGGATAGAAGCCACACATGAAGCCTG^ 
CATGAGGTGCCCCAGCAGCAGTGCACACACTCGGACTGCATATGAGACAGAAATAACCCT 
TTAGGGCAGCATCCTAACTGTATCAATCACCCACACTCCTACTAAAATGACACC 
ACAAAGGGGTCCTCCTCCAGACCACAAGAGAGGGTTCTT^ 
7 5 AAACAAGTTTATTAGGAAAGTAAAGGGATAAAAGAATGGCTACTCCATAGACAGAGCA^ 
T ATGGTT ATT CCTTGATG AT ACG CT AAACAAGGGGTGG ATT ATTCATG C CT CCC CTTTTG AGAC CAT AT AGGG T AACTT CCCG ATG

TTGCCATGGCATTTGTAAACTGTCATGGCAGCGGTGGGAGTGTAGCAGTGAG 

TTTGGTGGGTTTTAGCCGGCTTCTTTACTGCAACCTGTTTTA 

T CATCTTGTGACTT AG AATG CCTT AACCATGTGGGAATG CAGC CCAG T AGG TCTCAG CCTT AGT CT ATC CAGCCCCT ATTCAAGAT 
5 GGAGTTGCTCTGGTTCAAACACCTCTAAOVGTTGTre 

ACCTAATAAGAGGTGTTTGTGTCATGAGGGTGGATCCTTCATC 
TCATTCTATTCATTCCCGTGAGAGCTGGTTGTTAAAAAGAGCCTGGCA 
TCTGCACACAACAGCCCCCCTTCACCTCCCACCAGGAGCAGAAGC^ 
TTCCAGACACCAGAACTGTGAGGCAAATAAAACTTTT^
1 0 AATCCCAGCTACTCXKSAGGCTGAGGCAGGAGAATCACnTGAACCCAGGA 
CTCCATCCTGGCCACAGAGCGAGACT(nt3TCAAAAAAAAAA 

GGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAAAAAGCAAAGAAAGAAAGAAAGAAAGGAAAG 
ACAAAGAAAGAAAAAGAAAGAAAGAAAGAAAGAAAAGGAAGGAAGGAAGAAAGAAAGAAAGA 
AAGAAAGAAAGAAAGACCAGTCTCAGGTATTCTCA.GGTTACA 
1 5 TTGTATAGCATATTACTATCAAAATCTTTTCAATC^AAAATGGACTCCAC^ 
ATACATGTTTTCTGTTCCATAAATATTATCTGAAAAAG 
TCTCTTTATTAATTTGAATACTCCCCTACTTTTCTTCAGAAAATGA.CT 
GTTGGCTCACATGGAGATGTTCCTTTTGCCTC^TGTCTTCT 

2 0 CTTCAGAGCTAGCCTATTGATATGAAGGTAAGGATGAAAAGAAATCTG 
CAAATGCATCACTCGAAACTAAATTGTAAATATAGCCATGC^ 
CAGTGGTTCCATAAGATTATAATACCATATTTTTACCATACCTTTTC 
AC^TCTGCAATCCCAGCACTTTGGGAG^CCAAGGCGGGTGGATCAT 
AAACCCCGACTCrACTAAAAATAOVAAGTTAGCCAGGTGTG 

2 5 AGAATCGCTCGAACCTGGGAAGCGGAGGTTGCAGTGAGCCAAGATTAC^CCGTTGCACT 

CTCT(ZAAACAAACAAACAAACAAACAAAAAGATACACAAATA 

CTG CTTT ACAGGTTTGTAGCT AGGG ACGACAGGCT AT AG CAGCCCCT 

ACTCTGCGACGTTTGCACAAAAAGAAAATCACCT 

ATAGAAAATGAAAACCCACTGTCTCTCATAACTGTGTACCCTCCAGTGGTGCTTCCTGCATGGCT 

3 0 GGTTATGGGTATTTTTTCTTCrcrCTTTTCTGTCT 

AGGTTGTAGCTATTCCAGAATTAATATCAGATTATCACATCTAGAAA 

AAATGTCTTCTCCAGTCTTGTTGACTCAAACTAGGTCTCAT AT CT ACTCCTG CACTGTG ACCAG AAGATGTGAG ATGATTCTGATT 
TCGTAAGGCCCAAGCCACAGACCTCACTCCTAGAACCAGTAATAAA 

CAGAGAAATATAAATAAGGACGACAGCCAGACCCGATGGCTCA.TGCCTATAATCCCAGCACT 
35 ACrCGAGGTCAGGAGTTGAGACTAGCCTGGCCAACATGGTGAAATCT^ 
GCACATCCCTGTAATCCCAACTACTCAGCAGACTGAG^ 

T(X3CACCACTGCACTCCAGCCTGGGTGGCAGAGTGAGATTCTGTCTCAAAATAA^ 

TATAAGTGGAGCAATATCTTTAAACTACTGAAAGAAAAAACTGTCAAC 

GAAGATGAAATAAGTATTTTTAGACATATAAAAGTTGAAAGACITC^ 

4 0 CGGCTAAAATTTCACrcGGTTGAATAATGTCTCCCCAGAATTCATGCCTA 

AATCTTTGGAAATGTAACAAGTTAAGATGAAGTCATACCAGATTAGGGTGGACCTTAAACCCAA^ 
AGAGGGAAATCTGGACGCAGACCCACACAGAGGGAAGACAGCCAGGTGAAGAC^ 
GAGACAGAGTCTCACTCTGTCACTTAGGCTGGAGTACGGTGGCACCATCT 
ATTCTCATGCCTCAGCCGCCCMGTTGCTGAGATTAGACGTGTATGCCACGACACCTGGCT 

4 5 ACCGGGTTTTGCCATATTTCTGAGGCTGGTTTCAGACTCCCGGTCTCAAGTAAT 

AGAGAAGTGAGCCACTGCGCCCCGCTGGAGAGGTTGGAGCTTGATC 
GGTTATACACTGTTTGACTCCATTTATATAAATTTCTAGAAAATCCCAACT 
AGGGGAAGATGACAAAGGAACATTATCTTGATTGTGGTTT™ 
ATATGTGCAGTTTTTCAATGTTCACTAAACCCCCAATAAAACTCTTTCT 

50 

HUMAN SEQUENCE - mRNA

GAGAGACTGGATGGACCCACAAGGGTGACAGCCCAGGCGGACCGATCITCCCATCC 

CTGACGGCAACTGGGCCTTCTGCAGAGAAAGACCrCCGCTTCACT 

TGCTGATGTGGGGACTGCTCACGTTCATCATGGTGCCTGGCTGCCAGGCAGAGCTCrc 

5 5 GCCACATTCAAAGCGATGGCCTACAAGGAAGGAACCATGTTGAACrGTG^ 

ACTCTATATGCTCTGTACAGGAAACTCTAGCCACTCGTCCTGGGACAACCAATGTCAATC 

CG AAACAAGTG ACAC CT CAACCTG AAG AACAGAAAG AAAGG AAAAC CACAG AAATG CAAAGT CCAATG CAG CCAGTG G ACCAAGCG 

AGCCTTCCAGGTCACTGCAGGGAACCTCCACCATGGGAAAATGAAGCCACAGAGAGAATTTATCATTT 

TTATTATCAGTGCGTCCAGGGATACAGGGCTCTACACAGAGGTCCTGCTGAGAGCGTCrc 

6 0 GGACCCAGCCCCAGCTCATATGCACAGGTGAAATGGAGACCAGTCAGT 

CGTC CTG AG AGTGAGACTTC CTGCCT CG TCACAACAACAGATTTT CAAATACAG ACAG AAATGG CTG CAAC CATGG AG ACGTCCAT 

ATTTACAACAGAGTACCAGGTAGCAGTGGCCGGCTGTGTTTTCCTGCTGATC^GC 

GGAGACAGAGGAAGAGTAGAAGAACAATCTAGAAAACCAAAAGAACAA^ 

TCATGAAGCCCAAGTGAAATCAAAGGTGCTAAATGGTCGCCCAGGAGACATCCGTTGTGCTTGC 

6 5 TCACATCACAGGACACGGGGCAGTGGCAACCTTGTCTCTATGCCAGCTCACT 

AGCAATTTCGCCGTTGAAGAGGAAGGGCAAAACCACTAGAACTCTCCATCTTATTTC 
GGTATGGAACTCTCTCGACCCTATATGTAGTATAAAGAAAAGTAGGTTTACATTCATCT 
CAAGG AAAG CC C CAG CACT AACGT AAAT ACACAACACACACACTCT ACC CT AT ACAACTGG^ 

7 0 ACCCTTGGCTCAATCCTCTAAGCTAACCCCCTTCTACTGA^ 

T CTTTGGGT AACTTCAAAACGG GG CAG C CAAAC CCATGAGG CAATGTCAGGAACAG AAGG ATGAAT G AGGTCCCAGG CAG AG AATC 
ATACTTAGCAAAGTTTTACCTGTGCGTTACrAATTGGCCrCTTTA 
TTTGGCCTGCACTAATTTGATGTTTACAC^TGGACACACAAGGTC 
CAAAAAGGGATCCGTACATTCAATCTTTATGCAAWlAAGGAA 
7 5 GTAGAACGTAACCACGGAAAAGAGCX3CATCAGGCCTGGCACGGTGGCTCAGGCCT 






WO 03/008583 



PCT/US01/51291 



GAGCATCTCTTGAGG CCAGGAGTTTGAGACCAG CCTGGGCAGCATAGCAAG ACACATCCCT 

GTGGTGGCAT ACG CCTGT AGTCCT AG CCACTCAGG AGG CTGAGGC AGG AGG ATTG CTTG AG C CCAGGAGTT CG AGG CTGCAGT CAG 
TCATG ATGGCAC CACTGCACTCCAG CCTGGG CAACAGAG CAAG ATC CTGTCTTT AAGGAAAAAAAG ACAAGG 

HUMAN SEQUENCE - CODING

ATGGATTCATACCTGCTGATGTGGGGACTGCTCACGTTCATCATGGTGCCTGGCT 

AGAGATCXCACACGCCACATTCAAAGCCATGGCCTACAAGGAAGGAACCATC 

TAAAAAG£GGGTCACTCTATATGCTCTGTACAGGAAACTCT 

ACTCGGAACACAACGAAACAAGTGACACCTCAACCTGJU^GAACAGA 
1 0 AGTGGACCAAGCGAGCCITCCAGGTCACTGCAGGGAACCrC^ 

TGGGGCAGATGGTTTATTATCAGTGCGTCCAGGGATACAGGGCTCTACACAGAGGTCCTGCTGAGAG 

GGGAAGACAAGGTGGACCCAGCCCCAGCTCATATGCACAGGT^ 

AAGCCCCGAAGGCCGTCCTGAGAGTGAGACTTCCTGCCTCGTCAC^^ 

TGGAGACGTCCATATTTACAACAGAGTACCAGGTAGCAGTGG CCGGCTGTG 
15 CTCACCTGGCAG CGGAGACAG AGGAAGAG TAGAAGAACAAT CT AG 
Table 32

MOUSE NOMENCLATURE 
ICSGNM Nfkb1
Celera mCG22374 

5 

HUMAN NOMENCLATURE 
HGNC NFKB1 
Celera hCG39336 

MOUSE SEQUENCE - GENOMIC

TCTTCTGAATATCATGTTTAGATTATGTAAGTATTCACCGGCTTTCTGTCACT 

GAAAAAAAGGTTTAGTTGATGAGTTTTGAAGGTTCCAGTCTACCAGTCTGTT 

CAGAGCACCTGGCAAAGCATAAACTACCTGCTCGTCAATCAGGAAGC^ 

TCCAGGTCATGTCTTAGTAGAGCACTTCTCCCAGGAGA^ 
1 5 TACTGAGGTAACAGTACCCATTCTGCATACATTCCCTGAGATTAACTGAGACr^^ 

GAGTGTTCTGCTTGCAGCCCTACTAAACACAGATGGCCAATGTGGTATTCA^ 

CACATACCCTGCCCCTGCCATGAATAGGATTTTAAAGGGGGCTCATGAAGGAG 

GTGACAATAGAGTGATTGTTTCCTCGTGACATTCCTCATTTGrc 

AAACTTATGTTTTTTATAAACTACCCAGACTCAAGTACTGTTTAACAGTAC^ 
2 0 GCAC CG AAAGAGG AAAGTTGAAAACTT CCACTCT ACTTGTGGATTT CTGT CT CTT CCACAAATG TTT AGGTCTACTACTTTGT ACT 

TCATGTATTTTGGAGACCTATTAATAAGTGC!ATGCAGGTGGAGGTCATTA 

TCCAACACTCAGCATCTAGAGGCAGGAGGGTTGCTGGGAGTTTGAGCCC^ 

CAAAATTATTACTGCCCATTGATGACCTGGCCCCTTTATTCTTATATATATATCCT^ 

ATCTTCTCTATTGGTGGTTATGATATAAACTACTTTGCTGTGTTTAGTAT^ 

2 5 TAAACACTTTGCGTCTTTCATGTTAAAAATAATTCAT^ 

ATG C l' T l T TATAGCTGAGTTTATTTTATTTGCATTTAATATTGGACCTCrTC^ 
AAATGAAATAAACGTAGCACTTGAAGAGTTCCATTTTATCTCCCGTCTGG 

TTCTGCATCCTGCGTCTATAGCGAGTGAAGCGCTATGTACCTAATGTGGAAGCTTTGATTTGTTCTCAC 
TAGACTTTTTTCCCTCCATGGGTTCAGAATACTCCATAATATATTGTGTTGT^ 

3 0 CrTGCAAAGCGAAAGCCCCATGTGCCCACACCTTTATCAC 

CCTTCACCACAAAGATCTGCrrTTAGGATGTCTTCTGTCACT^ 
TCAGTTTACCTTTATTGTCAAACCATTCn3GGACAAAGCTACC 

TCATGATGTGAAGAGTTTGCATGTTTTTCTTTTATTCAGAGTTGAGTTTTGTCCTAGTAGGCCTTACT 
CCCTTATTTTATGATGTGTTCnTGACTCrTATAACACGGTTTTCCCCT 
35 TGTCTCTT C AGCCC ACGTG ATTGCACCC!TCAGTGTCTCTCAGCTCTGT ATGG CGTCTGT AATCTTAT CAG AGCATTTAT AGAACAA 

CAGTGGCTTTCACTGAGCTTTTTGGGATCTCTGTTAG 

ACATATACCATTTCTCTGACCAATGAGATCACTGCTCTTTATTTGAACTCT^ 

CAGAAGGTGAGCTTGGAGTTCACTTAAACrGTGCAGCCTGTATCTCTTGTTTAGTGTCTCAGGCTATTGTC 
TGAATTTGGCAAGATCAAATAGTGGTTGATTTATAATGACTGAAATTAGATTTGTC 

4 0 AGGGTTTCTCTGTGTAG CCCTGGCTGTCCTGG AACC CACTCTGT AG ACCAGGCTG ATCTTGAACTCAG ATCTACCTGCCT 

CTGGAGTGCTAGGATTAAAGGCATGCTCCACCACTGCCCAGCTGTA 
TTTATATATTCAATATGTTGGCTAGCTAGCATTTCTCTACAGCTTTGTTTTGAAACTT^ 
TGTGGTAAATGATGGCTCCCTTCCCAGTCTTCACTCACTCCCCTGGCCTCTGGCA^ 
GCTTGTTCTTGGTATTCAATGTGAAGGGAATTGAGTATAG^ 

4 5 C TGG TTTTG AAATACAAAAT CAAAACTC C CATTG CTGG CTGTAAGGCACACCIATTT AAGG AG ACTGT AAATATAT AATGT ATGATT 

AAG TTG CAT AGAAACAG TG AACCAT AGTG C AAAACCACCAGGAAATGTT ATAG AC CTT ATGG ACTGAGG CG ATAACAGTGGAAGAG 
AAAATACAGGTTTTATGGAAGAAGTTGATTTTGAAGTACCATGTTGT 

GAAGAGATGTACACACTGGAAAGAAATGACATAGAGCTGTTACATGTGTGTGTGTACACATGCGTGTGTGCGTGTATGCAGTGAG 
ATTCACATGTGTAGAGGTCAAAGGACAATTTTGGAAGCTGGTTCT^ 

5 0 TGGG CTT AG CAG CAAGTbT T l ' TT ACCTG CAG AG C CCT CTTGTTGG CCAAG C AATG CATCTTT AAATG ATTGATTTCCCT AT AGTAA 

AGTGCTTGTGTTGGATCATGGTGATAGTTGTGGTGCTTATATAATATATAATATAATATAATCATATGCT 
GACTTCACTATGTAAAGCACCTGGCTGTGATCTGAAAGATATTTAATCAGGAAGA 

CTGTTT CTGT CAG AGTG AGGTAATAAGGTCTG CC CTAGGG TGTGG CT ATG GACAT AGAAAG GAAGTG CACACAC AAG AAACAGTGT 
GAGGAAAGAAGTG AGCGACAACAAGACTGTTCACTG C^TTGGTTGGAGAAGGCTACATTTG CAAGAATGAGACAGTCAGGAGTCGG 

5 5 G CTTG TTGTCGAGT AGT AAACTGT AG ACATT AGGGTTG ATGGGTT T AATGT AT C ATT AACTTG TTTT CT AATGG CATC G TT AC CT A 

AAGCTAACAGGGGCTCATTACCAAATACACCTTCAGAGC^ 
TCrTGGATAGCTTGGGTTTTTTGTTTTGTTTTGTTT 

TGAGAACTCAGTAGCCGATGGAGAAAAGCCAGTGCAAAGTCCAGACCGTGACTAAGAGTGTAATTAAAG 
CAGGGTAATTAACCCATGCTCCACACGCCCTCTGTTCCCATGACACTGACCAA^ 

6 0 CT AACCGTT CTGATTGGCTAG CCTTTGCTGCGGCGAGATAAATCATG AAG AAGCACATAGAAAC^ C 

TTGGCTG AG CTCTGCTGCGTTTCCACTCCACACG TGGG TTCT CATC 
AATCCAGTCATTGAACGGCTTCTTGAAAGCCATGTTCATGACTCATGTCTC 

ACAACATTCAAGGGGCAGAGATGTCAATTTTATTGTCCCTGTGTTGTGAGGGATAGAAAAGGGCAGT 

6 5 CTACAGCTAT AAGC CCTGAGTTCAG ACTATG CCACTCTTGTGT AGTTTTGTTTCTTTGG AAACTGACAGGGT AGGC^ 

CTGATGGGTTGATGAAGGGCCAAAAACATCCAGGTAACATTTCAAAC^ 
TCACAAATCTGTTCCTACAGCACTGTGGATGACACACACACACACTCT 
CTCACACACACACACACACACATGTTTTAATTTTTAGTTTAGCT^ 
TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCTCTCT 

7 0 TAAGTAGCTCATCTGAAATTCATGCACTGTAAGGTCCTATTCTGTTCTCTGGGTTACT 

GTACTAGATGTAGGAACAGAAAGCACACTCGGGAGAAGGCATGTTGC^ 
TCCCCAGTC^GAGACAACACAATTTGACTGGACAGTGTGT^ 

CG CTCCAGT CAC CGT G ACAG CATG CT ACTTCGTGTG TTGG CATG CACAAAATTGG CAGAACG CCT CAAGTGTGGAG AATT CACCAT 
TATAACA AGATTATAGTAAGCAAAAAATCACTCCACATCACGTAGAAGAGGGAGTTGGCAAAGG CGGCTGGTTT AAAG CAGCTCTT 
7 5 TGCAATCTTCTCTAGAGGGTCTTATCTGAAAGCAAACAAAAA^ 
CACTCCAACCCAGCCCAGCCCACCCAGCCCCCAGCACAGGTCT
GGCTTAACTAGTAAGAGTGCTTACTGCTCTTGCAGAGGACTCAGGCTGGTTCCTAA 

ACG CTG GTTCCAGGGG ATCTGG CACTC CCCTCCTTG CCTCTGC CAGT ACC CTT ACATTTGTGGTG CACAG T CATG CTTG CAGG CAA 
AACACTCAAGCCACAT AAAATAAAAATT AAT AAAAC CTTGAAAAAAAGAT CCT AGCT AG CAGTT AAT AT CATGAGT AGACTAG CT C 
5 CAG CCAGAACTTTTG AGAGGCTTGAAG CAAAGATGTTITTAAATGAAT AATACTTCTGTCAAGAGG AAAAT CACATCAC C CATT AC 

GCTAGTACAAGGGCCAT AAGCTGTGCAGTTAGAGAGGCCAG CTCT AGTCTGTCAGCCTG AGCAAATGACTCATT CTCTCTGAAGCT 
TGGCrCTTCAGGTGGACTCTACrTTCTGGGCT 

TGGAAGCAGTCTGTCAAGCTTAGCTTTCCCCATACAAAGGAGAGCAC^ 

AGCGTAACTGGCACCAGGGATTAGGTAAATTGTCCCTGGGTTAATCCTGAGCCT^ 
1 0 TAGGACCAGCCATCCAGTCCTGTGAGCCCAGAATACTTCAGAGAATAACTTCTAAACT 

GGTAATGGCATGCAACCACTTCAGACTCAAATATGTTAGACTGTCAG 

CTATTGCTGTGAAGAGACATCATGACCAAGGCAACTCTTCTAAAAGA 

AGTCCATTATCATGATGGCAGAGAGCAGAGGACACAGGCAGGCACAGAATCAGGAGCT 

GAGAAAGACAGACTCn^ACTTGGGATGGGCTTTTGAAACCTCAGTGCTC 
1 5 TTCGTAATCCTTTTAATCCTTTCAAATAGTGTCACTCCCTGGTGACTAAGC^ 

ACGAAAAGTTTATCTTGCATCTTTCTTCCTATAATTTATCATGAGTGAGCT 

TCTCCCACTCACTAATAAGCTITCGCATGGATTAGCTAGCTGAGTGTTTC 

ATG AATGGGTTTCTTCG TC AGCTCT CAG ATTTGCATTTG CAGTGCGTG CATGT ATTGTT CT AGGTCATTGTCAC GATG CT CCTGGT 
GAAATAATATTGAAAATAAAAGAAGTGGGAAACCAGGTTCTGTAAATCTCAGrc 
2 0 TAAAAGAAAGCTTTTTAGAAATCAG CATT ACTG AAA 

AGGAGGACACACAGAAAGGGGGACGCTTCTGAAGAGCAGTCAGGAATAATTAGGACT 

TAACACTGTOlGGGCATTTTAG<3TTCAATG ATTTAAATTTAGA^ CAG CCCCCTTTTT ATTCTAATACATGTCTAATGTGTATTAG 

ATAAATTTCCATGTCACTGTACAAATCTCCCCATTGTTATAGCAATTCCATAAT 

ACAGTTAGATCAGAGCACATCACAAACTCCTGAAAATCGACTTGCAGAATTTACGCCCT 

2 5 ATGTGGCCATAGCCCTTACATTATGTTTTCTTCAAAAAATTCrCrATGGAAAAGA 

AATTTATTGTGCGTGCAAGATACAAAACAAAAAATGTTGTGAAAAGTCT 

AGT ACATT AAATGACACCT AAGTG ACAC CTGTT ATGGTT ACT AGAATTTTAAACCACC CT C CACCC CACCCCCTCCCTG ATTTC C 
AAGTGCACCTTCTAGCTGGTGTCTGACCGCCTACTTTTCATAG 

3 0 TTACAGTGAGTCAGGCCTCGGCTGCCTATATCTCTGAACCTAGATGCTATATAGTTACTG^ 

GATGGCATACAGCC ATGCCTTTAGAATTT CTCCACTCAGT CCAGCTG CTTCACCAAGATGCCCCTGTGTCAG ATTGAGTTACTAGG 
AAGATTGCCACAGGAATGCftfiGAAACAGTTCGTTCAGCATrCAT 

CCTAAAG ATT CCTGGTTGGGGT AG ACAAATG CATAAATGAG ACCCAGG (nTGTGTCTCCCTTTGTGTCTGAAGGTCCJTTCCTC AAA 

3 5 AGCGCAAAAGGAAAGTGGGGGGAAGGCTGAGAAAGCAGGGGTACATAAGGGACTTTCCCACTGCT 

CTT AGTCGATATTTC^ACATTTT AGX3T AT AAACre CAAAT ATT ACT C AGTG ATTCATCCTTTCCAAGTG A 

CTCTCGGTTTCTTTCTTTTGACATTTAATTTAAAATTAGAAATTAGCATT 
TCAGTAATGGCCAAGGCTTTATGCGTCAAGATTCTAAGGC^CCCGAGTGTTCCTTG^^ 
TTTAGCC^CATATAATCTAGAAACCGTAGGAC^GGGTGCCATGATGACAAGGTGCCCAGTATGGTCGTTTG 

4 0 GGAGCCGTCTTCATTAGGGCCTCATGTAGGGAATCX3ACCA.CX3AGGATACCCTTACGTCTGT 

CAAGTTGGAGTGAGACTTCTTTATGTTAAATTCCCC(ZACTCAAACTTCT 
ATAATATATTATATCATTTCCCCTTCCCCTGCCTCCAAGTTCTTTCAT 

AATTG ATGT ATCTTTTT CT AAGT AC AAAAAAAAAAAAAT CAATCTG CTC CATCCATCT AATG TT ACTTGTATGAATG TTT C CATTG 
GCATTGAGTAACCIAATCAGTGTACATTTTTGGGAAACACAATTTCTC^ 
45 TAGGGTTGAGACCTCCTGAG C AATCrCCCTTTCACATTG CCTGTCTGTTGGTGCCATC CTTGTTCAGGTACACAC AG AATCAATGA 

CATAAAGGTACTAAGTGGGTTTGGACAGTTTAATAAGGCAATAATGTCACCTCCACCITAGAAG^ 
AGAATATCATCACCCAGGTTAATGGAAAACCTAGAGTCATAAGGATAGTGTAAAACATTGTTTG 
CTCAGATACCAGACAGTGGCAGAACCCTGGTATTTTCAATC 
ACTTTTTTGTCTGAATAAATATGTTTAGTTGGAGCTGCTTAGAGAGTT 

5 0 TTACTGTGTAGCCCTGGAACTCACCATGTAGACCAGGCTAACCTCTACCACAAAGACATCTACTTGCCT 

AGATTAAAGGCGTGTGCCACTACACCCAGCTCGTGTTTTCTTTTTAAGTATGTATG 
GTTTATGTATACITTATGTGTGCAGGTACCCTC^GAGGCCAGAAGAGGGTGTCAGAGCCCTTG 
GCTGCGCIATGTAAGTACTGGGAACTGAATCAGGGCCrrrCTGCACGCTTAACTG 
TTCAAGTGTTTCTTGTTTTAAACAACCCCACTTCCTTGCACACACCT 
55 CCTTTAGTTGGTGCTG<3CCCTGCTCCATGTGGCCCTGGGAGTTCTCCAGAGCCTCCCGTGCATCCCATTAGT 

G C AC AGAGCCTTTCACTGGAAATG AAGCCTO^GTT ATGGGAATGTCCCGAG ATGTT^ CTTTTG ATGG 

AGGCTAGACAGTACATTGGATTGAAAACTAACTAGCCTCCATTCTTCCATCAGCCT 
CCCATCCTTTGATACAGAGTCTGTTCCCTCATGTGCTAGAAGCCAGGGCTTTTCTACATTTGG^ 
TTTTTTAAAAAAAGATTTATTTATTTATTTAATGTATGTGAGTACACTGTAGCTGTACAGATGGTT 

6 0 TGGGAATTGAATTTTTAGGACCTCTGCTCACTCTGGTCGGCCCT^ 

TGTAGCTGACTTCAGACGCATCAGAAGAGGGCATCAGATCTCATTACAGGTGGGTGTGAGCCACCATGTGGTTGCT 
CTCAGGGCCTTCAGAAGAGCAGTCAGTGCTCTTACGCGATGAGCCATCTCACCAGCCCAGAATGTGATTTTGAG 
GGTTCTATGTXn , TTATOVGGGTCTCTGGGTGCCTGAAGGCCGCTGAGGCTGCTTATATTAATA 
AGACTTTCAGGTTTGATGTTTTCACTGGTTATTTTTTTAGTGCACTC^ 

6 5 AGAATGACTCTCACCAGTCCTGACAAGAGGGCAGGGGGTGTCGTTCTATGCCCACT 

CCTCAGCCTCTCAACCGAGTCTAGACAGTCCTAGGACCATCTCAGAGTTCATGAGAAAA^ 

GGGTTCCATCTGGACCTGATTTTTACCCAGAGGTAGGACCAGCTGTATCAAACCGAGGACCCTGAGCTTAGCCAAAGGGTT^ 
CCCGTGGTCTCCCTACAATGAGAGGT AG ATGC CAAGG AGGTACAGCACGCATGATG CCCTCCTACAGAGGTCTTTTTTGCAGACAT 
TTGTGGTCCCTGTAG AAATACAG CCTAGCCCTGCATATCTGCCAGAGTTGT AGG ATCTTTAAGATTCACTCCAGTG CTACAGCTTT 

7 0 CATTTCTTGTTCTGTTTTGTACAGCTTCACCATGGCAGACGATGATCCCT 

ACTTTGTTTCAACGCGTT CTT AT CCCAAACACACAGAGG CAACG CT AG T AACT CAG T ATCAAGCAAAC CCT AAATT AG T AAACACT 
ACTCTTAATTTATTTTCAGTTTTTCTGTTCTTCCAGAGTTGGAGAGG^ 

ACATGCCAGAAAGTAGCCACTGAGTCAGTTGTGGTGTTGAAGGTCTGACCCCAGTATTTCATA^ 
TTCCTATTCAGT^jAGATTTCTAGGGCTC 
7 5 GTCTGGAGGOlAGAACTCrGAATTTGGGGCTGGCA^ 
TCACACCAGCCACAGTCTGTTGTTGTGTGGCTTTCCTCCACGTCTTCACACATACCATCT

TTGGATGTAGGATCCATCTCTGTCTAGTGTGAATTCATCTC^ 

AAGGAGGAGCACTTATGAATTCTGAATATATTTTTTTTCAAAGGTACTGTAGTTAA^ 

TTTG AG AG CGG AGCAGTGTTT AT CAT CACGT CATTGTCG CTGT CAT CAT AGGTGTGACAGCAG CCATG ACACAGACTGACACGAGG 
AT ATCTGT AT CTTCACTTT CCTT CT AGTTTTTTCTCTT AAAAAAT CT AAAC C CAT AGAAGTGTT AGACAGT CTGATG AT ACTTC CA 



GCAATATTAAAAGTGGACTCTAATGGTCTATTAAATTATTTAAAATACrrGGTATCT 
ATAGAAGACTTTATGCCCAGAACATTAGCATCCTAGAGACGAAGG 
TCGAGTAACGGAACCACACTTTACACAGTCTGTGGATGTTGCCA 
1 0 GTT CCTTT AGGG AAAGAG CAGGTT AAGGATGACTTTGCAAAG^ AGG ACT AAGCT CTTTGAAAGCT ACAACCAGTGCAGGGG CA 

CTATTTACCAAGGTAGCCCTGCCACAACATTCATTACTACTTCACATTTA 

AAATTTATGCTCG CAAAGAGT CAG C CAAAAATAAAACnjAAAATGAAATATTTGGGGGAGAC CTGGTG TTT CTATAG TCACACAG C 
CCCAGAGCCGCAGTCJU3AGCTGCCATGGCCTCCACC<5TAGTGACAGCTGCA^ 

CTCTT CTT CTCAC CCAGCGGG CGG AGTG AGCATG CAC CATCCG CT C CTG ACCATGTT AT AAAG ATGG TGGT ACTGGGT AAACCT AA 
1 5 GGATGTTTTCTTTTATGGCAGCATAAAAGACACCTGGCAACCATACGCTCAAACCC 

CCTCTAACACAGGCCAGCCCTCACCACCATGAGAGCTGTCTGCT^ 

AGCAACCAGAGATGGTCTTTTATGCCACAAAAGAGTAAGACCTGCCAAGGACAGTCATT^ 

TCTGAGCTCATTAGCAGGGTGGGCATGTTCTGATAGTTCAGTTGTGGC-GGAAGGGGAGGC 

TTGGGTTGGTTTTTTCTTCCTGTCTTAAGCTTTAA 
2 0 TAGCCACACTGGAGTTTGTTCTCCACCTGTCACCAAGCAAACACCACATGCCGCC 

CCCCCAGCCAGATCACTGCAGACAGAGGCGGAACCACAGCAAAGGACT 

CTGGAG CTTATTATAAAACCT AGTT^CCC^TTCAGTGTGTCTTT CATGGTTAAT AGCG AATGGCACAGATGCGCTTTGCTTTTAAG 
GATTGAAATATAACACACTCTTTGCTCCTCAGTTGAAAA 

ACATTTTAGGGGCCAGTAAGATTTTTCAGAGAGCAAAGTGCTTACTAAGTTAGCTT 

2 5 AGCACACTCATGTTACACACACACACATATCCATCrCATGTrAAACACACATA 

ACATACACACCITACACAAACATATATGTGCATACACTCAAACACA 
ACTATACACACATACATGCACACACACTCACACTACATATACATACATACTT^ 

ACATGCAAGCATTACrCTTCrTATTCTTGATTTAACTAGAC^T 

3 0 AAGGACAAAAAATCCCAAAAACCAAAAACCAAAAAAAAAAAGAAAGAAAGAAAAAAACCGTCTAGCT 

GTAAAAGAAATAATTTGGTTTCTATGTTGTCCATAAACATTTCAACAAGTCACTTCTGCT 
ATACAAATGTGATTTTCTAAATGAGCTAACTGGGAAAATTCCTTTTGGA 
TTCTATCAAAACGTTATATGTTCACITTTTCAGTTTTCT 
CACAAATATCTTCTAAAGATTAGAATAAAAATGTCAGAGTCGAGGCATC^ 
' 3 5 TTCCAAGCTCTAGGGGCTGCCTGTAAGTCCTGCTATCATCCCC 

TG ACTCCTG CTTCTGTAGGCACACTG CACTTTGATTTCTCACT CTCCT AAGTAGCTCACTTGGAT ATTCCA 

TT AATTGTC CT ACATTG ATGGCACATGCAAT AT ATCTTTTGCCACAGAAAG C ATAGGTTCTGGGG AATAGGAAGGTTCTGTGGAGT 

AGCATGTAGACATCTTTAGAGCCATTGTTCAGTGTGCCTTCATCTACTAGAT^ 

GGCAGTGGTC^CACATGCCTTTAATCTCAGCACTTGGGAGGTAGAGGCAGGTGG 

4 0 AGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAGACCCrGTCTCGATAAAAAAAAAAGAAA 

GAAAGAAAGAAAGAAAGAAAGAAAGAAGACGACX^CGACGACGACGACGACGACGAAAAGAAAAAGAAAATAG 
TGAGCCCTCTGCTCCAT CATGTTCTCAGTTGT ATTG CCAGTCATTTTGTAAGTTGTGTAACT 

TTGAAGTT CTG T AG CT CAG TAAGT AAT CT ATTT ATCATCAG AACAG AAG CAT ACAT ATCT AAG ATCATG TG CACG TATGTG CTG TC 
TGTCrGTACGTGTACATTCACAAGTGTGTGGAGGCTGAAGGTGGCTCC^ 

4 5 ACAGCTCCTGG ATTCAGCTAAGACCTCCAGCCAG CAAATCTTCATGTCTCCACCTTCCTGTAGCTGGGACTG CAG ACAACTGCCGC 

CAGCCCAGCTTTTCACTGGGGTACTGGGGCTCTGGACTCAGGTTCT^ 

CCACCCTCTGGGATGTAATGAACTCAAGTTCCTAAGAGGGTGCGAGATTCA.GATCACTTACT 

GCATGCCTG AAG CTCTCTTGGCTCCX5AAGATGCACAGCTTG<X3 CTTTAGGG ACCCCATAG CCATTAT 

TCTCCTCAAATTTCTTTGTTACATAAGTTACACACACAGGGGCCT 

5 0 AATCCTATTTTTGCGTTTTTATTTCTAGTTTTC 

AATATTTGT ATTTTCTTCT TT ACAG ATGTTT CATTTGAACACTG CTTTG ACTCACT CAAT ATTT AATGCAGAATT AT ATTC ACCAG 
AAATACCACTGTCAACAGGTAAGAAAAACACCCCCITTTCCrGT^ 
GAGATGTGGATCAACGAAACTAAGTATATATGAAATTACTATAAGGAAATCT^ 
CCTCTGGAAGAGCCACAAGCACTTTTAACCACAGtAACC^TCTCTTTAGCCA 

5 5 gaaggaagtggtgatccgtgttttaaaaccaccctttcagta 

ctt ctccgtatctcattacccatt agtg ctcctttttgctttaccgcagctca 

CTG aggg ccaotcttgtcagccaggacacgaggctctggggaaacagtaggaaagtaac ctttatattctgaccagaccaagcgta 

CAGAACTAATGCTCCTGACATGACTGCTAATTAACGAACCCACAAACTGTT^ 

6 0 TT AG ACTCATTATTG ACTGGCATGTGTGTAGCCTTGG CCTG ATGG CTCTTTTGCTGCTTTGATCACCTGTTGGAGCT AAG CCGT CT 

TCAGCAGCCACTTGTTAGTTGGTGGAGATGCGTATGGAAGTGATTGTCCATTACTGAGGTGGACAGCAGGTG 
GATTTCTGTGTCTCGTGATGTAGTTTTTATCATTCTCATTTAAAATGACT 
TACAGACGTGGGAGGAAGGAGAGTTAGTATGTCACCCTCCTC^CTGGTC 
CCCAAATAACCACCACAGTTTATCTGTAATCTTAAGCTGTCCAA^ 

CTCGGTTCrrGCCCTCAGGGAAAGCACAGGTTTCTAGAAAA 
ATATTCTAATGTGTTCTCCTTTTCCTTTGGTTTATGGACTTAGAAATAAAG 
TCATATCATTGCGCTTCGGGAATCTTCTTTTCCATAGTGTTAGAGAATCCrTC^ 
GTAATGGCTCCTTGCTTGCACT AG CTCTCCGGCTCTGTTTCAGTG CGTTGTCCTGTTTTTATAGTC^ 

7 0 GTTCAGATTTTAAAGGACTAGAAAGGCTATCTGTGACAGTGTTTGCTTGTAATTT^ 

T CTT AAAATCTC AACG T ATT AGT ATTTCT AACCCTGTGTTGGG AACT CT CT AACAAGGGACAGCTGAT AGGTCTGTG T CATTTTG C 
TGTGATTTAATTTTCTTTTTACAATGGAAGATCATTTTTT^ 
TATCTGTGTATCAGCTCCAGTGTTCrACrrCTCTTACCCrTGGCTGTGCTCT 
A CTTCACATATGTAACGAAAJ i -CCT&AAACCAA 
7 5 CGAGGATGTCTCCTCTGCCrrCCATATTTCAGCAGCATCTTATCCAAATGCT 
TTGGATX3CCTAGAGGAAATGTATATTTTTAATATTGTTATA

TT ATTT C CT CAT AAAAG TT AAATT CG AG CAATTTT CTTTTTGAAATTT AAAGTT AACAGTT ACCTTTCT AAA^ 
AGAGATGAATCCCATGAGACAGAAGAAGGTAATTTGCTACATTACTGACCCCAGTAACCACCCATCTGTC^ 
AAATT AGGTTTCCAATTGCTTTCCTGGTACAGTGTTTAGGCrr^ 
5 TTCTCTTTGAAGTT ACAGTTT ATT AAAAATT ACAAC CTGAACTTTT AT CT CAC CTCAAATTTACAT ATGCAAAAT AAGGCTT AAG C 
TTCAGCCGAGAAAGTGAATTTTCAAATGAGCACAGAAGTC 
AACTATAAAATGTCGAATATGTTTGTGTATTACACAGCTATTGGTTATCATACT 

ATTTTTTATCATAACATTT AT ATTT ATTT ATTTTGA CAAG T CAGAT AT CATTGGT AT AT AAATTGAGCCTTT AGGTCCTTT 

TCAT AGT AATT AATTGG CATGCATTT AT CTT CAATT ACATT ATTTTTGGT ATTTT ATCAT AACT CTGGG CTGTTTT AGAACAG AGT 
1 0 TCITATGAGCACACACTGTTAGATACTlTATTATATCrGCATCAAAAGCTC 

ATCCCAGCACrTGGCAAGAAAATCTCrGTGAGTTTGAGGCC^ 

TGAGTCTCTGTCTAACAAAACAAAACAAGCCATGGAGAAGCAAGAA 

TAGGCAACAGTG^CTGTCCCTTGGCTTGAAGTAAGGGAAAGACTACATTT 

CTCCCTTCrGTGGGTTTCTAGCTTCAGTTCTTCATTTACT 
1 5 CCAC CTG ACAC CACAG CACT CAGAGGTGCTCC CTGCACTCAGCTG T ATTGTGCCCT AAATGTTTGTTG AC^ T CTGAGG AC 

GGCTCGGATCAGAATCTCTGTCAGCATTCAGACCITTTGT^ 

TACCGTTAGTCACTTAAACTATAAAACrANNNNNNNNNNNNNNNNh^ 

AAAGCACTTATGAGCAAAAAATAAATATTTTTTAAAACATTTAT^ 

ACCTATAAAAATTGATTTGAAATCTCTGTTrAAAGCA 
2 0 CTCACAGTTTTCTTTTGGTTTTCTTGTGTT 

TAAGCGTAAAGGGGTCGGGTGTTTGGTAATATTACATCTCATGGTAAGGCT 

GTAAGGGTTTTATCTGAATAATCC^CAGCCTGCCGTGTTATGCAATAACTTCCTGTGTCT 

ACATCAGTT AATT AT AAATC CTTGGT AGT CTGT C CTTTTT CCAGCTT AG CTCAT CAGCTG CTTCCAGGGAAAG CACT AAAGAGTCT 
CATTCATGAAGAAAT ACTCACACACACACAGGTT CACAG CCACTCTG A CTCATGG CCATGC C CACAGGGT ACC CCT 

2 5 CCTGCCrTCATAAGCrCTGGCCrTTTCrGTGCATAGAATGTC 

GGAGATCrCCCCTCCACCCCTCItSGCTl^lAATCCCAGCU"! r l-i'iGAAGCAAGGAGAGGATl'lTrntSTTTGGTTGTTGUTGTl'GTT 
TTTGTTTGTGTTTTTGTCTGGCTCTTTCGCTTTCTACTAAATA1TCT 

ATATGCTCATTAGAAGTTGCATGGCTCTAGTACTGTAAGAGAGAAGAGAATCTTCAATATCTG 
TGGGATGGAAGTTATGGTATAGCCTCCCACAGAGTGAGAACCAAGTCACTAGATCTCT^ 

3 0 GCTGAAGGACTGTATACCTGGCCTACCGAGTTGTGTCACCCAATGTCTTT^ 

TTAACACATTGTGTTGTATGACACTCAATGCTAAGAAGTGAATGCT 

AGCTGTTAAATAGG AG AAACATCTTAGGGATGTCCATG CTACTTTGTAAAG CTG AAATCAG ATGGAAATGTTAATTATAAAGACAG 
GAATGTCACATGGAAC1TTGGGGCATTGTTTGGTTGCITATAGGCGGCCT 
ACACACACACACGCAC^CACACATACTTATACACATGTACACCCTCCCTGGGCTG CAGAT AA 
3 5 GTCCATCTTTGTAAAGATGATTTCTCftCATAAAGTCGCTGTTCTCGGAGCT 

GTAATCTGGAATCCrGGAAAATGTCCTAGCCrGGAAGATTTGGTGAATTCATAAAA 

AATAGATTTTTCAG ATC CTTTAG AAAATTTT AAATAATTT ATTT AATTTTATTTTGTG CGCATTGGTGTGATGGCGTCAGAGCCCT 

TG(^AACTGGAATTACAGACAGTTGTGAGCTATGACGTGGGTGCTGGGAATTGAGCGCAGGTTCC^ 

GCTGAAACATTTCTCTAGCTCCCu l Ti i l AACAATATTA 
40 CCCGTGTGTGTGTGTGTGTCTCTGTGTGTGTTATTATGAACCCCAAAGTC^ 

TTCCAGAATTTAAAATTTAATTCATTTTGTGTGAATATATAAAATCATATATGTATAAAACAT 

TTATTTTGGTATCTGGCTTTAGTrGAAAGTAAGAAGGTTGTATGTAAGCCAGGCTTTT^ 

TAATGACTCATTTGAAGTAAATGGCGTTCCTGAGATGAACAGCGATAGATC<BTGAACGTTTAATT^ 

TGTGTCCAGAGCGTGCATTTTAAGACTTCCTAATTTTAGGTTGTTATTTAA 
45 TCTGTAGTCATTGCTGGCTTCAAACTGGTGCCAGTTTTCCTACCTCCCCTCTCCAAGTGATAAGATGA 

CCCAGCT CAAAGGCATCG ATG CT C CAG CCGAACAG CG CAATACACATT ACAAC CTT CAGAAT AAATGCTGCCTTGG AT AGG ACGTT 

CAATATTCTTATTCGACCGAGAATAATACACAGTGTCACACGCTGTGCAATAAA 

CATAAGGTATGAGAGGTACAGCTACAGATTTTTCCAATC 

TTTTGAGTTAG AAATGTTCITTTTCITGXSAGGCCnTTTGGG CACAG ATG ATG AAAATGAAC CAAACCACAGACTAC ATGTCTTT AG 
5 0 ACrcTCCAGTTCCGTGGCATGTTTCTCACTGTTAACTGTTTATTCATGTTGTGTTGTGTTGTGTTGTGCT 
GCTTTGTTTGGAGGCAGGGTCTCAGCAGTGTAGCCTTGGCTC^CCCCAAACT 
GCACTC^GGTAAGATGTGTGGCAGTTATAATCTGAAGGTGTGTT^ 

C^TTTGGGGCTCTTGGTGTTATACATG CT ATTATTGT ATAGTTTGGGGGTGATTG AGTT C^AGCGTTATCCAGACTGG ATCCCAGT 
AATATTAATTTTTACAAACTTTATGATCTTATTCCTTGAAGTGCTCAG 

5 5 TCTGTAACACAGATCCACTTAATCCTCCAGAGACTTGA 

AGTCGAGGAGGAGGAGGAATGGGGTGAGGAACTTGGGGGGGTGATGTAAATAAATAAAA^ 

CTGTGATCAGAGCAGATAAAGGAACTCCTTTAAGTAATTAGCATAACTTTTACTGATGGCAGAGCGGAGAGC^ 

TGGTAACGAGAGGCTCCAGTTGAATAAAGTGCTAACGTATATCCAAAGGCTTAAACTCTTCA^ 

CCTGG CTT CT C CG C CACTG CTCAGTGCAT C ATTTT CAT AGTGG CTGTT AG AAAG C ATTT CTC CGTCTG C AG AGTTGT CTTCCACAG 

6 0 TTTTGCATGTTTAGAGACGTGTGTGTACATCTCTCATTGTTG 

GATTCCGCTATGTGTGTGAAGGCCCATCACACGGAGGGCTTCCGGGAGC^^ 

AAAGTAAGTTTGAAACCTCTGCTCTCACTGCCCAGGAGCAAAGCCAAAACGTTCAAGTGAATGTGACT 
TCAGCC^AGTGTTTACATGGTTCTTAGAATTTTTAAATAGAAATTCACCTCATGAAAATGACGAATC 

TCTCCATTT AAGTG AG ATGGTTCCAACAT C AACT AAGAAAAACTT AACT ACCTCTC CAGTC ATCTTAAAGCn^TCTTTTCATCTTA 

6 5 AAGTATTTTTAAAT ATT CTCTGATGTTTT AGT AAAACTTGGAGCCAGGTG TCAT AGT 

CTAGCAACAGGACTGTTAGATATCCCAAGCTAGCCTGAGCTACATAGCATGCTTGAGCT 

TCCT AAG AAAACG AAG AAAAT AAAAAATG T CTTT GAGG CIAAATTTCAAATACTGGGTTT AT AGAGATTTCATGACAAG ACC CTTGA 

CAGAATATATCGGCTATGGAGTATATTAATTATTTAAGATCATAAGTTCAGACC 

TTATCTCACTGACCAAACATATAAATACACAAATACCACTAGCATGT^ 

7 0 CTTAAGTTCTCTGTAGAGAGCCTGAGAGAAACCGTGTGCAGCTA 

AAGCTTGACTGTCTTTACCTTGGGGTGTCGGGTGTAGCTTCGCTGC^ 
CCTCTAGTTCAGAGTTTCTACTTTCCrrAATCCAGTGTTGTGTTTCAGAGA 
TCGA(rrGGTTTA,AAAGGTTTTATTTATTTGGTAATCT 
CTGTCTTTCTGCTC^CTTGAGATCTCAGACGCTCTTGCCCTCAGCTGAG 
7 5 GGTTGCTTGTCCCCCT1TTAATGTTGTACTTCCAGGCTCAGGCCTTCCAGGTTCTTGCCC 
AT CCTTTATCTTTCCAGTGAAAAAAATGTGAAAAAGCCATACCATCCTTT ATCTTTC CAGTGAAACTCAGCTCCTGGCTTGGGTCC
TGAGAGTCACATTTTCATTTTAAGGCTCCCAGCTCTAGTTGTGGAATGAGCCG 
GAAGTCCAGTGGCTTTAGTGCACAGAAGGCCAC^GTTATATTAGAC^ 
GGAGATAGTTAGATGGTTGACTGAGCGATAGCGGAGGAGATGAGAGTAATCGGCCTCAGCT 
5 CCCACTCTGCAGCGTCTGTAAAATGTGCGCAGGGACACTGCTAGGATTCC^ 
TClTCACTAAATCTGGGCGTGGCrATGCTTTTCCTCTAAAGGC^ 

AGAGTGTCCTCCCTCCCATTCAGGTGGGGCAGTAGCCGGAGGGAGGGAAGGAGAGGATTCT 
CTAGCTGAAGGGAAGGTCTTCCATCAGTCCJVGCAAACTTTCCAGTGT 
ATCAGCTTTCAGCATCTGCCCAGTGTAGTCGACTGCACACCGGAAACTTCCCT 
1 0 CCAGGCAAGGAATGCAGAGCAGAGCACCGGCGAAGAAAACGGTTATTCCACACCC 
TCCTGTAGCCTTATTAACACTGAGGAGCTCAGGCTGAGGTTA^ 

ACTGGCCCTCCCATGTGAGCGTGTGCAGCTGAGGGGCCAGGGGCCAGGGGCTGAAGAACAAGTCAA 
CAGGGAGGATGAGCATTCTGAACAGCAAAGAAGAAATGGGGGCAGAAGTCATCAGGG^ 
GTCCTGTACAAATATTGTGTGCAATTATGTGTCCTTATCTGCACCAC^ 
1 5 AACTACTTAAGAACCGATGTTAGGAGAAAAAAGTAGGGTTTTTTTCTCT 

AAGAGGCTTCCAGGGG TATC ATGG CTT ACTGCAT AC C CAC CCATAATTCAT CGT AG AGG T GGGT C CATGT ACTTC CTGAGGG AG CG 
ACTCTTCCGTAGCCTTGCACATGCAAGGCTGTGACTTGCATC 
ATCTCTGTCTCCCTGGACTCAGGCCCTCTTTCCTGCCCTGCCTCTTCAAGCACC 
2 0 AGGTTAGTGGTTGATGTGATGTGAAGATTGCAGTGAAGATCTGAAGGCTCTTGAAA^ 
CAGAAACCAAGCTTTGCTTTATGGAGCTTGTAAACrTA 

CTTGCAAGCT AACTCT AGGAAT CT CC C AAAACCAG CACTT CCT ATT AGAAG T ATTTCCACGCTG TGGT AGATG CTGGG ATGCAAG C 

TCAGTCGTCACCCCCTCCCCCCACTGAGGTACCACCTTTTCTAGAACTCAGGCGGCTGTACCATGTGT 

TGAGAAGAAAGAACCTCTTTGGCTCTCTAGGATCTTGTTGTACAAGTCTTC^ 

2 5 CTCTGGGATTACTGGTTCCTATGTCCTCACAGGTATCATGAATGGTTTCTTCAGATTGA 

TCGGGGTGGGTGAATAGTCTCTTTTAAACCAGCTACTTCATATTTCAAAAT 

AAAT ACCATGT ACCAAAATT AAT AAATTT AATAGATGGT AAT ATTTTG C CT ATTTG CTAT CTG T CT ATCTGTTCTCT CTCTCT CTC 

TCTCTCTCTCTCTGATTCTCTCTGAAGTACAACATAGTTA^ 

TCTTTGCCCAAAGGAAAACCACTGTTCTGAACATTCTTTATCT 

3 0 T ACTTTTT ACT AT CTTT AAT CCAAAG IT'rr'ri ATT AT AT ATGGTGG CTCTTTTT CAAACT AT ATCCAGGTTAAAACTGGG CC C CTG 

AAGGCTTAAACATTGATTGCTTTAATCCAGTCTCTTCCT 
GCTGACCCTTAAAGCAAAAAATCATATTCATGAGGAAGCTAGGGA 
AAATGGAGTAATCCACTGTCTAGCCTGTAAGCTCTCCCTCCAGTGTTTGCTTGGTT^ 
GGAGAGACAGCAGAGCTCCTCTGGGCCAAGCCTGTCGCTACCTGTTCTTGACTCAGCAAG 

3 5 CCTTTTCCCGAAAGTGGTGCAGAAAAATTTTAAATCAAAGTGTCTTCCTT^ 

GTGGCTTTAGGAATGCGCCTGCAGCCCGTCCTCACTTGCnTGAGTCCTTCTGTGCrc 
CAGTGACTTTCATTTCTAAAAGACAGTTTGTTTGTAATGATCCA 
TCATCAGCTTTGCTGGCAGCTGCCCTGCCTGCCACrCTCCTGGG^ 
ACAACAGCTTTGGCTCCGTTCTCCCACCTCCTGTAGTGTGCACTCrrACGTG 

4 0 GTCAAGGTTAATAAAGCCCGGCTGATCTGCTGTAGGCTTGAGGATATCACATATCT^ 

rrGTCTGCATCCGGAGACTTGCCATCCTAAAGATCrAGrrGGAATACTAATTTTG 
GCCAGGGGGGAAAGATCACCAGAACAGATTGGGCATTCAGAACCTACCTAGT^ 

CAAGTGTTTACTGTGCTCAAATAAAAGAAAAGAGCTATGTTC CCT ACCA^ CTTTGT AACAC ACACACACACA 

CACACACACACAAACACAGAGTTTATTAAAAC^ 

4 5 CAACCTCTGACTCTAC^CATAGAGTTTTATGCATATTTGTATGCATTTTTAAAG 

CATTGGCTTTTTTACCACAGCTGATTAAATCGATCTCTACAATTTTGGAG^ 
TCACrGTGTAGCTTAGGCTGGCCrTGAACTCATCTTCCTACCrCTGCCTCCAGACT 
GAATGTTTTTCTTCTTTCTT1 W 1'U4'CATTTCAAAAATTTACATTTAC^ 
TGTGTTTCTCAGCATGTAATTCAAGCTAGTTGCTCTTTAGTATTCCATTGACT 
50 ACACACACACACCACACCCCACCCCCGCCTTTCrGTTCTCTCCTT^ 

TGGCACTGACTAAATACGATGGTTCAAAACCAAAAAGAAAAGAGAATCATT^ 

CACTACAAGGAGGTAAtZATCCTCTGCGTGGCCATAATAATCTAGACTTCT 

GATTGCCTCCATGAGTAAAATACTGACTTGTTAAAAAACAAACAAACA^ 

GAAGTTAGAG AAAG CT G CTT ACTG AAGG AAACTG TTG AG CAGTGAG AATTG ATCCTGAGAGTT AGG AGAG C C AAG CTCCTG AGGGT 

5 5 TGCAGGTTCTCTGACTTTCCAGGCITGGTATGGTGGCCTGCCTCGCCTGGCA 

CATTGCrCTTGGCATGCTTGTTCCTAGTGGCAGTGTTCAGTCTGAAATCATCT 

TTACATTCAGGCCGGGAATCCCGGTTTCGTAAAGCCACATGTGAAGATTGGTTTTGATGAGGCAGAGTGAG 

ATCCTCATGTCCCCAGCCACCITCTCCATAGTCAGGGTCTAGGGTCTTTGGTGGGTCTAGGGATG 

CCCCATCTTCCCCCTCCCACTCTAAGTGGATTTTAGGTCAGTCTTCCAAACAACT 

6 0 GGGAGGATATCTGGGCAGACATGTGAAACAATCCAGTCTGAAGACCAGG 

GAGTTATCTATCCATTGCITGATAGATAAGTACAAAACAGGTTTGCT^ 
AAGGTAAGGCCAAACACACATTTTTC^VAATCACATATCATTGAAATG 

GATAATGAGGTTTCTCGGGGTGGAACAGAGAACCGTGGAGGCTGGGAGAAAGGGAGAGCTGAAAG 

6 5 TTGTGGGCAACATATTCACGGTGTTAAGCACCAGCTGTGAACTGATGGCCAGCG^^ 

GGCTAAGCTTTGTGAACTGATGGCCAGCGTGGGGTGTGACGTCACITCCTA 

agtgtggagagtctcacttggctaagctgcagcttcttcatctgt 

attcagttgctcaatgccx;gcacacagtcagcctggaaacacaattacac 

ctccaaaagcttgtgtttagaggtaacacatgatgtcataggaaact^ 

7 0 ttaggtcccttcactgtgtcctggtaggccctatgtgtctaatttgcacaacgctgtattc^ 

ctgcaacctgggtcatattaaacaacagagtttattgttccaaagttctggaggcccag 

CTTGCTTCCCACAGAGGCTTCAGGAAAGAGTCAATCCTGGCCTCT 
GCTGCAGCCACTTCrCTGATCTCTGCTTCrGTCTTCACAGGGCT 
CGTAaAGACACCAGTCAGCGTGGTTAGGGACCACCACGCTGTGTA^ 
7 5 tGTCTAGATGAAATCATGTTCTTAGCTCCTGCATAGGCCCGAGTCCTGGGAGATGCTCTTCT 
GTGGTTACAGAGTGTTCCCG AGTGTGTGCATTTAAGG CCATGTCGTTCCGTG CTGTG CTCAG AAAGAAGCCTGTGTGCACCACGTG
CTCCCCTGTGTACCTTTGTTAGATTCAGGTAGTGGAAGAAAAT 
TATTCTAGTAC^TGGAAGGTGTGCTAGAAAGTGTCTGGCATCA^ 
TTCCCATAGTAAATTTCAGGCTGCTAACATGAAGTCAACTGGCTTGCAA 
5 AAACAGTGAAAATTTAACTATTCTGAAATTAACTCC^T^ 

TAGAGCGAGAGAAGGTGATACATTGTTTCCTGTAGGAGCTACT^ 
GGGAACCCGATAGGTGTTACAGCTGGGACTTCATTGTC 

GCAAGGCTAGACCTCGGTCCCATTCTGCCACTCGCTCGCAGGTGGGGTGTTCTTC^ 

TTG TTT ATGAGATG AAAGT AG AT^CAACTCCTTCACAGGAACC ATG CG TG CTT ACGT AAAAAACTGAC CT CTT AG AAAGG CAGCTG 
1 0 CCATAGTCCTGCCACATCrGCGTCATTAAGTGCTCATTC 

GAGCAGAAGCTTGCAGGTACTITGTCAACAGCACCTAGAAGGATGCCCATG 

CAGGTGGTCAGTGCTCAGAAGCTGGAAGGATCATCTAAGGAGGCACTTGTCCT 

CCACCAGGTCTCCACCreTCACTGCTGCTC^CrCCCCACCAGCC^ 

GACAGCCACATCCCAGACCTGGCTGTAC^CTGTAGTGTAGCTCTTAGACATAGCACCAATCT 
1 5 TGCCTTAGTTTCTAAACCTATTTTAACTGTCCAATTGCAAAATA^ 

ATTTATTTTAAAAACTGAAAATTCTAAGGGAAGAAGTAGAAAAAAGAAACCCAAACAAA 

AACTCCCATTGGGCTTTTTAGCATCTCGACTTCTACCTT^ 

CCAAAGCTGACATCAAAGCITATGAGACACCCAACT 

CAGCAGTGAGACAGGGGATAGGGTTGAGCCCTGTGTCCGTCAGTCGCATACTCTGCCGCTGGT^ 
2 0 ACTAGGAACATGGGGAATGAACTGAGGTTAGGATTCACTGAGCAAAGAGCTATG 

AGGATTAGCTCATTCGTCCATCCCGTGACCAGCCTACAGTTTCTGAACTCTCATTCCTAACAGTTGG 
TGAGAATGACTGTTCCATGCTACTCCTAATATATGAAACTTTTGTTTTTA 
TCACTTGC^TTTAATAAAATTTCAAGAAACACAAAGTTATAGGTAGTAAGTCCCCT^ 
GGAGAGCTGTTAGGGGCTTCTCAAGTGCACTCCCAACCGTGCCTGCCGTGTGTTTAC 

2 5 CCACTCACGCTTCCTCGTGTAAGTAAGCATTAGTGTAGCGCTTGTGAGCACGGATGTAGTCTACT 

TTACTTT TTT CACTTACTG AGTCTG AAATTTG CTCAG ATTAACAGCAAACrCCAGAGCTGGCAAATCCAGATTCAAGTCCA 
GTCCCCTGAGTAACTGTGTAACCTTGAGCTAACTGACCACCTTCrCTGTGCCCCTATCCAACTAG 
CATAGTGTTGTTGGGAGTACTAATTGAATTAGTGTTGAAAGAGCAATGTCTGCTCCAGGGAAGAGTTCrrGATC 
CCATCTACTCTGTAAAGGCTGTGTGCTTTCCATGGCTGCTTGGGGCATGCTGTCTTCTCTAAGC 

3 0 CCAACCTTAGTAACCCTGCATGATAGTAACTTCTCCCTT 

TTTTAAAAATACTTATTCATTTTTATTTTATGTGTCTGAGTGT^ 

CACGGGGGCCAGAAGAGGGCGTTGGAATCCCTGAAACTGGCGTTGCAGTTGGTTTTGAG 

G AAAGTG TTT TT AACCACAG AG CCATTTCrCCAGC CCAG AGAT ACCATTTTAAAATTAT AAT CTGTG ACTG TTTTG TTTTGTTTTG 
TTTTGTTGAAACTAGGTTTTACCATGAAGCCCCCTAGCTGGCCAGGA^ 

3 5 GTTGGCCTGCCTCrGCCTCCCAAGTGCTGAGATTAAATGCATGTGCCACTGTGTCTAACCCT 

TTGAAGAGTCAT AAACTAGATTTTCAC CATATTCACTTCCCACTGTT CT TTT TAGCTCTTCTG AGGTCTAC CTCCT ACCTCCTCTC 

AGTTTCTTCCITCCTTCCTCCCTCCCTTCCTTCCTTCCT 

TATGTACTAATGGATATGAGACTATCCACTGACCATGGTCATTCCT^ 

GAGGTCAGACGCAGCTTTCTAAGGTCAGTTCTCACT 

4 0 GGCCCTGGAGCTCCCAGCTGGTTCTCCrGTCTCTGCCTCCCATCTCTCTATGGGG 

GGGCTTTTTACCTGGGTCCCAGGGTCTCAAACTCAAGTTGCT^ 
GATGCTGAATTTTTTTTTATCATAGCrrrTATTTTC 

TGTGATGACCCTTTCAGAGAGTCTGCCCTGAGTTGTTCCCAGGCCTCGCCCA.CTTGAGTGTGTTCT 
CCCCATGCGGACTCTGATGGCTGCCGTACTATGTCTTACTCAGATGTGCCCCTCCT 

4 5 AACTGTGTCTTTTTAGATTCCCTAACACTGTGCTCAGTTAAATTCCTTT^ 

TAGAAATATG CAATTAGTATT AGACATAAG AGG AAACATGTCTCTCC CGTTCAT CTCTTACTG AGCTCITGCTGTGTCTCACAGCC 
AATCATATTCAGCTCCACCAGGGAGGCGTTGGTATCCTAGTTTTCAATTGGG^ 

T CT AC CTTTCTT ATGT ACATG CAC ACACACAT ATCT C CT AATT ACTTT CCCCAG C CT AG ACTT CTCCAGAAAAC CAGG CTT CAG AT 
CTGGCTACTTGGTCGACTTTACCATTTGGATACCTGGTAGGCT 

5 0 TCCACTCTTAGAAAATGGTAACCCATCCCCCAGTTTCT 

CC ACTTAT AATCTACC CT ACTGGCTATTTCTTCAAACTAATTAG AAT CT ACTTGCTTGTTGCCAGCT CCACCCC ATG AGCCCTGGT 

ATAAACCATCCGCCTCCTAATGGCTGCCATTTG 

TrrGCTCrATCATTTTTTTTGTTAGATAGCATTTATCTTTT^ 

TATTTC^GACTAGTTTAAATGTTTCTAATCTAAAAACACAAAGTCAAAAAATGCTCT 

5 5 GGGTGGGGGGGTAAAGTACTTGTCATAC!AAGCATGAAGAGCTTGAGTCCCCAGAACCCA 

GCAAGGCTCTATAATCTCATTGCTCCTGTACIAGAAATGGGAGGTAGGCACT 
GTGTCAGCTGAGTGCAGTTCTGGATCCTCTTCTCATCCGGGAGCCACGCATCA 

TGCTTGTCACCAACCAGCAGAAGTGTTGCATGCCAGCTCTCGTGTC CGCTAGACCTCGCT CCCCCACTGCTAACTGACTGGGCTCA 
TCCTCCATCTCCTGCTCTGTCCTGAAATGGCTCATCTGTCCAAGGGACCTGATTCCTTAGCAGA 

6 0 GAGGCTCCATCATTGATCTTTTTTGTTGTTGTTGTTTTTAAGTCTT 

TCACAGACTCTCCTTCCCTCCCCATCCTAGGTTTGTATCTCCTTTTCCCAATAAAATG 

GTGCCTACTCAGTTGATTGGTCCTACAATACTTCCAAACACATGGGGAGATAGTATGGTTTCAGGATG 

CTAGCAACTTAACAAAGAGTTGCGTTTTCTTTTTATTTCT 

TGGGTTACATTC'UTTTCCTTCTAGGAAACTGCGTTACCATTGAACTATAGGGTTAGAA 
65 TTTCCTATTCCTGCTGATTTTGTTTTATTTTTGAATGGTG 
ACGATTTGGGAACTATCATAAAGTCGGTTTGCTTT ATTACH 

GAATGTGAACCCAGTAAGCrCACGTTGTCACATGGGGTGTTCCATTTCTATATTTGGAGCCCT 

AGTACACACATATATGGTTCTCGCACATAGGCrTGGTGCTGGCTCTGCGGTCACAGA 

GTTACAGAGAGTAGCCAAGCAGCAGACACGTATGAGATGATACTC^GGTGTTTCCTT 

7 0 GTGCATATGTGTATGTGTGTGTATGTTCCTGAAGTGGGTGGCCTTCCCT 

ATGGTGGAATGCCAC^ATCACAGAGCACTTTTCATACTC^TG 

AGCX3CTGGGCCTGTCACACTGCGTGTGCCTCAGAGATAACATAATCTTCCAAGCTGGTG 
GATGAGCCCCAGGGGATTGTTCCTAACCATTTTAACAAGTCT 
AGTGGTAGATAACAAATACTTTAAATTCAAATTAATCrTGAAAACTGGAGCGA^ 
7 5 AGACACACACATCCAGAAAAAAAACAATGAATCCACAAAAGTA 
AGAGTTGACGTGCCTAGCAAGAGCTAGCAAGGCCTGCA^

TGTATAAGAGCCAAGAGGTGATAGTCTCATGGCCACCCTGC^GGAAGATCACACTAGATGCGTGCAGC 

GGTGTITGAATAGCTATTATAAGAAAATGGAAATGGACCCCACCATCACAAGACA.GGGGA 
5 AGAAGACACCAGCTTCGTGGGAATCACAGCCACCACAGGACGGGGTGACTTTGTCGTCACT 
AGAGGAAGAAAGGGAGGCTGCGTGGGCTTCAGCTTCACAGCCTCGGAGTCTCTGGCTCT 
AGAGTTTTGCrTAGTAAATCATACCTTTGTAACAAGGGCTG^ 

TAG AT AAGAAAAT ATAACAAT AG ATGCT AGTGACTTGCTGG C CATTTTCTC CATGGGT C AAAGTG Ci^TGACIT ATl"l w ri w ri ACACT 

AGTATGTCAGTTTATTAACACAGGCCGAACAGTTTAGTCACAGCriTGAAAGGGT^ 
1 0 AGAGTGATACCAGATTTTTCCAACAACTTGCTGGCATTATCCTAGAAAT 

TCTACATCATGCTATGCAGATGGCACACTCCTTTGGTTAACCT 

GTCTGGGGGCGTGATGTGACCGATGAAGTGTCACCAAGAAATAAGACAAA^ 

TAGTAGGGGTTTTCCACAGGAAATTGCAGGAGAAGAGTTGAAGTGCTCTTTGTCTT 

AGAGTGGCCCACATTGCCTTGTAAAGGAGCTGTGTTGACCATGATACTGATCAAAGTCAGAGTGC 
1 5 CrGATTGCTCCTCTGCAGTGGACTTCTGTAGTTAAACT 

CAGCAGCTAAGATCACATACTTCTTCTGCAGAAGACCCAGGTTCCGTTCTAAGCACCCA 

CTCCAGCTCCAGGGGACCCAACACACTCCTCTGGCCTCTATGTGCACCTG^ 

TTAAAATAAAAATCTTTTAAAAATTAAGGAACATATACCCAGAGAGTAAGTATTTGTCTCTA 

ATCTGAAACAAA.TTTCGACATCACAGAATCAAAAATATTTCTGATTATCCT 
2 0 AGAGGTGGCTCATTGGCTAAGCTCTTCTCTGCTCTTGCAAAAGACCCAGGTT^ 

CACAGCCATCTGTTGTCCCAGTGCOVGGAGACCTGACACCCT 

TCAGGGAAAACAATTATAGACATAAAATTCAAATTAATTTTAAAATGTCATTGTAAGATCATGCTAGATA 

TGTCTGTTTTCAGACAGGGATTCTGTACAAATCTAACATGCATTT^ 

ACTTTGCTTAAAATAGCTAATTATGACTTCATTTGCCTGTTTTC 

2 5 CAATAGTCCCITATTCTCAGTACTATAATAAGTACTGAGCTCCCAGG 

AGTACCCAGACCCAGCCTGGATGGGAAAAGTTAATATCATTGCCTAACATGAAGGTT^ 

TGCTATCAGAGGCCCCCTTCCAACAGGTAATCTGAGTGTTGTCTGT^ 

TCACAATTGTAAAGCTAAATTGACTTGACAAGGTTCTTTCCGACCCAAATTCTGTG 

TGG ATTTGTGATTG TTG AC CATTG TCCTGT ATG TCTT ACATAT AT ACTTT AGGG AACCAT AAG C ATTTT AGAG CTGGAAGC CATG C 

3 0 TAGCTGTGGGCTAGGACTAGACATTTTGCAAAGAAAGCATTAGGAGAC 

CTAAAGACTAAAATTCTGACCATCTTCAGAGGACTAAAAATAATCAAAGTGTTATCT 
ATATATATATATATATATATATATATATATATATATGTATGTATATATATTTAC 

ACTCATATATGTATATGGGGTGTGTGTGTGGAGAGAGAGAGAGACAGGGAAAGAGAAGGGGAATTAAATATT^^ 

GCTTTG AT AAG AT CTG CCTG AC ACAAAAT ACAAAATG CATTTGGTAT AAACAG CATCATT CTC CCTTT ATACCAAAAACGGAAATT 
35 TCTTCGCATTCTATACATTTGAGAATGACACCTCXJGGNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

4 5 GGCTATAACCTGCACAAGTAATGGCCTCAATACAACTGTTTTCTA 

TTTATGTAGGTCTGCC^TTCAGAGTGGTTATGGGAGGATTCCTGTCCAGCATTGGGACrrGTATCT 

AAAAAACAG CAC CT GTACAAAAAAACAAAAACAAAAAC AAAAAAAG CAG CAG ACTCAAAATAGG AATCTCACAGGTCTAATGGGAA 

TCCTGAGTACTTCTGCTGCAGAAAAGTATCCTTAAGAGAGCCCAGGTGAAATAGTGTTCGAAAGA^ 

AAAAAGATAGACTATTGCTTTCTTTGTAAGGAGATGGGTCAGCCGTACGCGTTTTAATAGAGTCG 

5 0 GAGCTTCAGAGCACAGACTCGGCGTCTGTCTGCAAAGGGGGCTCAGTTGGAGCACTGTT 

AGAGTTCAAGCTAAGATGGATAGGCAAGGCAGTGAGATCAGAGGAAAGGGCT 

AGTG CC CTGGCT CTG AAATTTG G CAGTT AAGTGG AG AT AAG AGG CAGCG CACT CC CACACACT AGGACGG TTTTGGT ACAGTGG CT 

CTGTTCAACAC^AGGCAACGTTAGGTTTAAGAAAAAAGGTAAGCTGGGCGTGGTGG 

CAGAGGCAGGCGGATTTCTGAGTTCAAGACCAGCCTGGTCTTCAGAGTC 

5 5 TCTCGAAAAACCAAAAAAAAACAAAACAAAAAAAAAGAAGGAAGGTAATTAGTT 

GAACAGAAGGGCTTCCTGCTAAGGGCTAGAAATGTCACACATGCTTATCTGAGAGATGAAAACACTGGGTTAT^ 

GAGAGCGAGCGAGCACAGCCCGGGCATGTGTTTGAGGACAGGAGCCTAAGGAGAGCCAGAGGTATCAG^ 

CTGCAGGCGGTTGTGAGCCTCTCCAAACAACGCGGCCATGAACA 

ACAAGCATCTGTCCAGCCTGTAGACTCTAAGTTAGTATCATATGGAGTAGATGACACAACTCC^ 

6 0 ATGTGATGACTTCCAGAAAG AGACTAAGAGTCGTGGACCGGCGCTTGTG AGCT CTTTGAG AAGGGGAATGTAGCCAAGCAGCTGG T 

GATGGGGGAGAAGGGAGGTCAGAGGGACCACGTTTCAGGGATTGTCIATA 
AGGCAGTOlGAACCGTTAAGCAACCrTTATTTGAAGACTAGTAGAAGTAC^ 
CTCCCTGGTGACTAGAGTTGTTCTTGAACGCTTGGGTGTAGAGAAAGGAA 
TCTCACAGACTTAGAAGAAGAAAAAAAAAAAAAAAACGTCAAATGAAACAAACAACCCCCCA 

6 5 GAGTGGCACCTGCCGCCATGTGTTTGAATAAATGATAGCCTTTACGATC^ 

AAC-GAACTATGTGGATAGCTAAGGGGAAGGGTCATTTACvATGCITTGTAGGGGTGGAGAG 
GTGGGTAAAACAAGATGGCTTTTCCTGGAAAAGGGAAGAGACACAGTTCTCTG 
AACAGAAAGGAGAGACATTTGTGGAGGACCATTGTGGAGGCTTCTCGTTAGATCT 
GCAAGAAGAGATAAAGC^AATTATTTTGCATCAAGAACTGTTCTAAGCAATTTAAATTATCT 

7 0 AGAGACTATGACATTGAAAGTTAAACAACTTTCCCAGAGCCTGATATCAGCTTGTTATGG 

TCTGACTCCAACGCCTGTGTTCTTATGGTGAGAGGACTCACTAGTTTCAGGTCTTO 
GAACTTTGAGGGAGGTCCATCTGTCCATCAGAGAAGGATGCTCTAATATATT 
TACAGTTCCTCATGTTGTGGTGACCCCCAACTATAAACTTATCTTTGTC 
CATAATGTAAATATCTGATAGCAGGATGATCTTGGGTAACCCCT 
7 5 TTGAAAACTGCTCCTCTCGTGAGAAATTGTCAGGCTGAG 
TAAGATAAATGGACAGAAGACAATGTGAGGC^

GCAAAGGAGGGTTCTGACTTCATGGGGACAGCTGTTGGCTTGACTTG^ 
CTGCTGTGATTTCTTTGGAGGTTTCGGTACCAATTCTGGTCTAX1A 
TTCTCTGGTATTTAATAAACAAGCTGTAACTCAAA 
5 AAAAGCATATATOVCrrrAATTTCTTCCTACCTAATATAACTGTTTA 

TATCTCAATTGTTAGTCACGCCCACGTTAACAAGGTAAAGTTACAGGAAGTACrc 
GGAACGCTTCTGTTCTTATAGTeCTTCCATTCTCACATAG 
TTATTAACATACMAGAAAAGATGAAGCAGGGCAGAGAGCAGAAGATGGGGTG^ 
ACTTGTCTGAGAAGAGAGCTGTGCACAGACTCGAGCTGGA^ 
1 0 AGTCCAAAAGCTCTCTGGTGGGAGCCCTCTTGGCAAGTTCTGGAACAGC^G 

AGGTTGAATCAGGACCCCACTCCCCCAAAACCCCCGAGGCTCCCTCAGCCGCTGTCGAGGCT^ 

GTTTTATTTACTTCTGTTCTTTAACATGGGAAATTGTAACCATGTATTCCC^ 

AATTAACATGGGAAATTGTC^TGTATTCACACAACTGTGTGTTGAGCCCCTGTGCTAATC^ 

GG AATTTTGAACTG AGG AATG ATG ACTT AG CATGTCCTGACTTGAACAAAACT CACT CAG AACAGG TT CTG AGG AGG CAGG CAGGG 
1 5 AG C CAGGG AG T AGCCTG AG AAGTGGTT ATTTGGGG CTT CGTTTGAAG AT CTGG CTTG A IT i 1 ATTT A llii CT AATTTTTT^ 

AGAAGAACTGAAAAGGCAAATAGAACAGTGAAAAGAGAAAGGTAATAAAGAT^ 

CTTCTACCTAGACGTGTACCTGCCTGGCAAAGGGCTGGCAGAGGGAATATTTCT 

TATGCTCTTATGGAATCATGGAGTATTAAGCCCCCACACAAAGGTGAGGATT^ 

TACCTGAGGTACCTAGGGGCGCTTGACCTCTAGATATTTATGATCCATGTC 
2 0 ACAAGGAAGCCCTCCATCAAGGCTGATCAGGTGAAACTTTCAGAAC^ 

CTTCTCTTCTCTTCTCTTCTCTTCTCTTCTCTTCTC^ 

TGTGTGTGTGTCTCTGTGTGTGTGTGTGTCTGTGTGTGTCTGrGTATAGACTAGAGTTT 

AGGGTAGTGGAGTCAGAGAAAGG CAGAG AGAGGGAG AG AGT AGAG AAGT AG TTGCTGGC C ATG ACCATG TGGAGAG AGAGGAGAAG 
GGAATGGGGAGAGAAGGGGAGCAAGAGGGAGAGGCAAGAGAGCAAGGCAAGAGTG^ 

2 5 GCAGGTAGAGGTATTACCITCTTTCAGAGCAGAGGTGTGGGCAGTTAGAGGT^ 

AGAGGT/rTTACTTGCCTGCTCTATATGAAACCCTCCTATCT 
TGATCACACTAGTAGGGAGTCTCAGGAACAGTGCAAATCCATCAACCCTGGTG^ 
AAAGAGGTTTTCATGTATCCTGTCCCTGCTTTGCATTCCCACACCAATATTCAG^^ 
TCTTC^CTGTCTCTCCAAAGCTTACATCACTTGCTCAGGCTCTCT 

3 0 C^CCCTGCCTATCTCAGGGTCATCAGAAAGCCCAGATGTTCTCCATAGATAAGAGGGAAGG 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNmJNNNNNNNNNNinWNNNCAAGACGA 
GCTTCATCTGTCCITCCTGGTTGCACAAATAAATAATAAAGAACAGT^ 
TAAACCAATATACTATTTCGTCTCTCTCCCTCCCTCCCTCCTTTCC^ 

3 5 AGCCTTTTTTTTTTTTTTTCTTTAAATGGAAAA 

GGCCTTTCTCACAATATGCCTCACCAGTGCCAACAAACAAGCAGGAGTCTCTGTAGC^ 

CCATGGCGCATTAAAGCCAGTTAGAGAGAGAAAAATAACCGAGCCGAGTGTTGACATGGTGCCGTG^ 

GGAGGCAATGGCAGACCCCGAGTTCAGGGCACCACCACCTTACCCCAAACCATTGGAACCAGGTG^ 

4 0 GAGACAGAATATGCACATAGACTTAGATGAGCACX3TGCT 

TAGCCAGTCACATATCACTTTCCTACCCTATGTCCTCCAGTCATTCrCGA^ 
TGTGAGGAAGGTGGGAAACACAAGAGACACGGGCTATTGGTGCAGCTGCTTGAAGGA^ 
ATGAGGGCTCGAGAGCACTGAGGACCCTGCGAGCATGACTGC^ 
CCGCAGAGAACGCCTGCCAAGCTGCCTCATCCTCCACTTGCTCCTTGCTTACT 

4 5 TGAGTTAAACACACGGAAGCTTTCTCTCACITCCCAACCGTGCCAGACACT 

AGGTTTTACCTGTATTGTTGAAGTTATCCAGTTAGCGTCAGTCCCACTGTTTAATTCCGTGAGAGAAA 
AACGAGGTGTAGCCCCCACTCTCTGACCTTCTACAGTGTGGGGCTCTTAGA^ 
ACTCAGTGAAATAATtXJGCTAGATAAGAAACAATTTTCTGA 
ATCTTAAAACAGTCCGGTGATACTTCGTCATTAATTTGATTTTTGAAGGTG^ 

5 0 GGATTTt^GCCTGGCTTTGAGAGATCTTAACCTAACCATGACCTGCT 

CCCTGTTAGAACCCTGAGGATGTTCGAAAGAAAAAATCAG CAGAAAAGAATGATTTT AGGTATGATCTCTTACATTTATGCTG CTT 
CTGCTTTCTGTGAGAGTACGTTAGCCGAATT 

CTCATCT CTTGTT AC CCCTTTGAAGAAACAGCAAATG CTTTT CCACAGT AGGTC AGGG CT AT CTG AGGTT AAAAAAAAAAAAAAG T 
ACGTGG CTGACT AAGAGGGCT AT ACCTG AGCCCATGACTTGT AGTTTTT ATTCT AAAAGG CTCTGACCC ATGGACCCAGAATTC 
AACCTTCCCCTTt^AATTGATGTGGGCCAAGTATCTTCCAGCCACTTGCTTCAAATGGACCT

GCATCTGACAGTCTGAGATAGCCCCTGAGGGTTGTCCTCTCCTCTC-CTGAGTACTGTTGTGAAGTGTTTGTGT^ 

TATTCCTCATTAACAGCCCCITGAACATTACTGAACAGGTGAAG 



60 C^GCCAGCCT AGCCTGTCTCTGAAGTCC ACACTCTTCCCATTCCGAG CAGCTX3ACTA CAGG ATGG CTCCTTTGGCTTCTTACCT 

GTG CTTG AGTGCTTAC AATT AAG AGTCT CTGTTCAG ATTCT ACT CT C AG AAAGAC CTT C CT AGT AG ACAGTTG C C CTCT AT AAACT 
TTATCTTCAAGAGACGTAATACTTCCATTCCCTTCCCA.GAATAAGTTTCTACATT CTTGTT 
GATAAATGATGGTTCCAAGAAATCGTTGTC7VAGGTTTGCCTGTGCCGCACAGCTGTCCC 

TGGCAT CTTGG AAAGTACAG AAAGC CCAGGGCCCTCCCTG TTTT CATTCTGTGGTCTGTGTAAAAAC CAAATGCTACCATCCCGAC 

6 5 TACACTCTTTCAGATGGATCAAACAGCACATTGGTCACTC<3C^ 

TCTTGCCGCCCTTCAACCATGtAAGTTAGTCGTAAGAAAAACAGTAAGTGCTTCTTG 
CAGTGACTAACTAGCCTX3TCAGCGCAGCTTCAGAAGGCT 

GTGGAAAGCCTGCCTGATTTCACCCAAGTGAAGCGCTTCCTTGAGCCrTCCTGTC 

TGGGATCACGTG ACTCGCT<jCCTCT<3CCGATCATTAGCAGACCG ATGTC CAGC CATCCAG AGCTGG AATCAGCCAT CCAGGGAC 

7 0 TTTGT AGG AAGAG CATCATC AGCCTTGG AGCAAAACCAGAACTTGGT ATTCAT^ 

GTGAAGAAAGACGACTTCCTAAATTCACTTCATTTTAATATCAGAGAGAGT 
TGGC^CAGTCAAGGTTAAAGTTAGTACAAAACCTTTCTCT 
CTCGCCG CTCT GGAAG AAGT ATG CCCTtK3TCTGTTTTATX3TTGTGTT 
GCTCGGUTTTGAAGCCT^ 
7 5 ACATAACAAGAGAGGCCATCTTGGCTGAAGTATCTCTTCCTCTTTGTATAGCTCA 






WO 03/008583 



PCT/US01/51291



TGTATTCGATTTTACCTCCTAAAGATCAACTTCCAGACACCATCACTGGATTACATTTCT 

GGGTGGGTTCAGAAGCAACTCATTATAATGTTGTATCAGGGCTCAGG 

TGGCCAGAAG<5GAGACCX3TTCGTTGTTC^ 

TCCCGGCTCTGACCATCAAAGGTTAC^CTGGAGAAAATCCATATTG AG AG CAAACTCTGT AGAGCCCTGCCTCCGTGGAGT CTGGA 
5 CCCCGCTGATAGTAGTGATATTATAAGTGAGCAGGCTGAG^ 
CTTCACGTTCACGTTGACATTTATCCAGAGAGGCGACGTA^ 

TT AATCGAATTCAGAGGGGAGTAATGCCT CCG AG CT ACTCATGGTGTG AG T CTCATGGTGTGATTTCCACGG TGTG ATT CTCATGG 

TG TGGTTCTCATG TTGGGC CATCTTG ACGAAAAGTTCT ACACTGT AGAGG TCGGGATGGT CAACT AGACAAGAAAGACTTGGTC AT 

CT CCTCTCT CCTGTCCTGTC CA.TTGCTGTTCTT ACCACGTT ACAAATACACAAG TG AAATG AT CG CCT CACATGGAAAACTT AACT 
1 0 TTGTAAAACAAGAGCTAAACAACCAAATGTGAAACCTACAAATTTTT 

ACTCCTGAACATATACAGTATAATGTCAGAACCCATGCCTGGGGAAACTACTGAAGAGT 

CCAAGCCATGGGCTTATCTCATTAAAAACAGCCTCCAGAGCCAAGCCCCTGA^ 

AGTTAGGACTCAGTCTTTGTTTGTCTCTCTTACTTCATCAGTC^ 

GAAAATCGTGACAAGGTTTCAGAGAAGAAACAGTGTTGGTTAGTAAGGACCCTTG 
1 5 TTTCAGTGACTTGTTTCCTGGAC^TATAAACAACCCCCACTACCATTTTC 

TTTT AAGTTGAGCT AT AGTCAAAT ATGT AT AT AT CTG CAAAATG TTGT ATTT ATGCTG ATTGG TT C CATGTT ACAGT CAGGT CAGA 

TGTACTAACTGTTGAAAATGTGCTTCTCATATAGATTTGCAACTATGTGG 

GAAAAAACATCCACCTGCACGCCCACAGCCTGGTGGGCAAGCACTGTGAGGACGGGGTATGCACC^ 
ATGGTGGTTGGGTAAGTGGGCTGTGTGGTATGGAGGGAGCGGGAAC^ 
2 0 TGTGCAAAGCCTGAGAGGGTCTCACACACGCGCAAGCAG 

AATTATAAGCCCTItlACCTAAGAGGCATTTGTGCCACTTTCAATTATATT^ 

AAACAACTACCTTAACTATATAGAAATGATTACCATGGAG^ 

TACCTTGAACTCACATAGATCCTGTTGTCTCTACCTCCCAAGTGTTTC 

2 5 ACAGAG CT CTGA TTT'TT G CCT ATTT GGG ATTTTCTTAAT AG AG TTCT AGTGGG AACATTTG AG ACATG AAG CAG AT AAGT AAAT AT 

TGTTTCAACTCTGCGCTACTTATAAAATGTGTGCTTTTTAAGTC^ 

ACATAAGT AG AAATT AG TG T ATTCTTTCAAAGTCAGTCAAAT ACAACAATTTTTAG TGAATGAAATTTT ATAGAAGT AATCTTACT 

TCAAAAGAATCACAAAATATTTAATAAGACATAATTATGGGCTGAGAGAGGATGAG 

AAACCCCrGGCCTCTCATAGCAGCGCTTTTCAGAGCACAGGC^ 

3 0 GAGAATCTGGAATCTCATGCAGTAGCTGGCCTGGTGTGGTGGAC 

GACCTCCATATGCACAGCACATCACATAGTCAGCTGCGTTCGCACACATA^ 
CATTATCCTGAAACACTAAAATAGCAGACAATGTAGTAACTTAAACAGCAAA 
ATAACTCCTATTTTCCAGACTCTTAAAAAGCAAAGGGGATGATTCA 
CGAACTCCTGTAAGCCACATGGCAGGAGAGAACCAACTTC 

3 5 CACACACACCATGTAATATAATTTTTAAATAAAAAAAAGTTAACTAAAATAT^ 

GTCAGTGGGTAAAGTATTTGCAATGCAAACTTCAGGGCCT^ 

AC CAAACTGAT AAAT AAAATAAAACCGGG AAAGGGCAGAG CACAGCGGTG CATG CCTTTAATCCT AGCACTCAGG AGG CAG AGTCT 

AAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGACCGCOVGGGCT 

TAGTCOVTGCACCTGGAACCCCAATGTCTC^CCACAGGGATGGAG 

4 0 GTCCATGACCCTGTCTCAAAAACCCGAGAAGTGACTGAGGAAAATTCCCTGCCTGTGTTCT 

GTTAATTTTATTTAATTGTGTAC^TCATCATTCAGAGTTCTCCAG 

TG CTTTTTTTAGGAG ATTGCTTAGCCAG CCTTG ATG AAGTCTGACTCTGTTGCC CTGCAATCTTGGGAGGTTCAGCTTTCAAAGTC 

CCAGGCATCATTTCTTCATTTGACGACGAAGATGCTCAAGTGTGAGAAGCTCCCAGAGAGAGGG 

CTCTGGTGCTTTTTGACGTTAGGGAGAAAAGTATCCGAATTGGAGGAAGCTGAAGTTA 

4 5 ATCTCCKKOTGCCCTGCTTCTCAATCTGCCGAAACTGACATGCAGGG 

CCCTGATTGATTGGCCACATCATAGCCAATCGTTTTGAAGCATTTCCGTTC^ 
TTCTTAAGGAAATCACAGGTGAACTTTTCTATCAGTATTTCT 

CTGTTGGG TTTAATAATCTTT CATG ATTT AG AAG AAACTAATGG TGG CTGTGTCTG CTGCT CTGAGGT AGACACAAGG CCITGGGT 
TGCTCTCC CCTGCAG CACTACCGCCACT AAAAACACT AATGGCTGTTTATGCTGG AGCCT 

5 0 GTACCTGTAAGCAGAGCGGAGAGGTCCGTCGGTGTCTTTTAGTGTA 

ATG ATTT AGCAAACATTTT CGACATC CG ATTCCTG ACT AGG CT C CCTGTGGTTTG ATCT AG T CAT AG CTGGG TGG GATTTT CT AAG 

GTGAGGAAACTTTCTCGGAGAGATGCAGTCGAGGAGATGCCGCCTGAGTGATAGGGGGTAACCACAGTAGCTTT^ 

ATCTCTC ACTGTAGAATGTCTGCAGTCG CTTTCCCGGTTGG CATTT AGCTG AGAG AATTCTGCTCATGGT ATTTTCCCCAGAC^ 

ATTTTCTTCGTGATTCCCTTGTCTCTGTGTCTCTGAGTTTCTAACACGGTCCTA 

5 5 TGGAAAAGCAGCGTGCTAGCAGCCGGGAAGTCAGAGCTCCTGTATGCTTCTGTCACTCTG 

TGCATGACTTTCTAAAGACTTGAATTCCTTTCCTTACAAAATCAGAAAGAC^ 

ACCCTCT AGACCAAG C CCACATT AG CT AATGTT AT AAAAC AGCAGGAGGAAACTTG CCACT AT CT C CT CATGGT ATG ACATCGGTT 
ATTTTATTTAGTTCTGACTTGTAGGCCAGAAGTTTAGGAAGGCAGCCAAAG 

CAAG AT AACATTTT C CAAT ACT ACAG ATTAGACTGAAT AT AGAT TCACAAACAAGGCACCAGT CCAAAC T AT C AACAT ATG T AAAA 

6 0 ATGACCCCAAGAAGACCAAGCTTTGAACACATCAAAACCAAACGGACCCCCrTTGTTTACAGTG 

TTGAATTCTTAATAAGTTTAGTTTCTATTTTTCCAAATTATTT^ 
TTGATTGTTTTAATATAGGGTCATTAACTTCAGTTAGCTTTCA 

TTTCTT AAT AAAAAAAAGAAGAAG AAT C CTGTTT AGT ATT CT ATTTTGAC CAGAATGTT AAT ATTTTT AG CAACACTCT AG TTCTC 
TGAAATTTTTTGTTCAGTGGAAAATGTCTGTGCATTTCTACTTTGCT 

6 5 TTACTGGCAACCTGGGTTGAAAGAGATG T ACAG TT CTTTTT ATTTC CCCCTCCCAACT AATTT ATGT AG CTT ATTG CAG TT GGT AA 

GCAGGAGAGCAGAGATTCTCAACCTGTCGGTTGCAACCCCTTTGGGC^ 
ATATTAAGCTTTATAACAGTGGCAAAATTACAGTTATGAAGCAGCAATGGAAT 
AACTGTTAAAGTGTTGAAG CATG AGG AAGG CTG AGAGCCACTXSC CCT 
GGCTCCCTCAGGAGAACTGCTACCTTAAATTACTGCTC^ACAGATCCAGT 
' 70 CACTGCCTAGAGAAGGTTGGAOU^TATGGACTTAGGTrCAAGGTC 
GCTTTTGAACACTGAAATCAAGGTCTGAAACAACAAAGGCCAAAG^ 
AATTTGAGCAACAGCTTCCAAATGATGATGAAGATGAAACATCTGACCTA 
AGCCTGGGTCACTTCATAGAAAACTGATGCCAGCCTCCATCCCATAGAGTGGGATT^ 
GCCT^GCATTCCTCTTCvAGTTAATGGAAAGGTTTGTTTGAAAAGTC 

7 5 AAATTCTAAAACAACACTGATGTTGCTACTTAAAGACTCCCATGTTGTGAATATGAGAGGA 
TTTT ATT AGTGGTTGT AGC CTTT CTG TTTTT AAAATGTCATTGCTGCAAC ACTCCT CCTCT C CAGTG TT GATTTCTGTG AATTTTG

ATAATCATGTGGCTACTCCTGATCTGAGGCTGATGAGTTCTCTTTGGTTT^ 

AAGGTGTTTTTTGAAGAGCAGAGGATTCTAAGTTGACAATATGTACT^ 

rTGACCTGTTTCCAGCAAGAGCTCTCTGTAGGTCTGCCTCGGCTCT 
5 ACTTCTCTTTAGCACTCGTAAGCAGTGTGAGTTTTGATGGGTTTGAGGGTAGu 

AGAT C CTTGGATTT AGA'riTCT T ATT AAAT CTT AAAGCTTTTCAACT GTT AAG 1 1 i i CTGTGACTGTC^CTGCCCCTGCTCCCCAT 

TCCTGTTCTCTGTCACTTTCTCTCTTCCAGACTAG 

TGCCTCATGTATTCCGAGGCTTCCTGTTTATGAGAATAGTAGCTCTT^ 

TGGATTTTTGAAAATCTGTGATGCTCTGGATATAAATCCTAATCTTCTTAACC^ 
1 0 TG C CTGGGTTCTCCTIT CCCATC CTGCAACCT AAG CATT AAGCCAGGT AATTG T AAG AGT AC CTTTCTATTTTTTTTTCTCGG ATGA 

CTGTTGTGTGTCACCTATCTCCAGAGC^TGACAATCGTTTTCT 

ATATTACTCCACCATGGCCTGTGGTGGAACACTGGTGTGTTGTTTACTAGGG 

GCATGATGCCTTTCTCCAGATGTGAAGCTGGGTTACGTGTAAGAAGTGCAGC 

GCCTGGGCTCCTAGCGGGGGAGCAGTCTGCACAAGAGTGAGACCCA^ 
1 5 AGCACATTAGCAGGTTGGGTGCATAATGTTTGTGATGCTCCCT 

CACATGTCAGCCTGCATTCTAAGACCTGGGGATCTCTGTCCTGAAGGAGCTGTG^ 

GTAACTCACTCCACATCTGGCCTTCTGTGTGTGTCTGTTTCCTTCTGCCTT 

CTTTGCACAGTAGCTTGTTCTCATGTGTGCAGGCAAGTTGTC^^ 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTGTATTT CTG CTTGTCGGGGG AGGCTATCTGTT CTCTCTCTCTCTCTCTCTCTCT 

CAACTTGTGGAAGCCGCTTCTCTTCCCCTCCTATGTGGGTTCTAGGGATTGAACTCATG 
ACTGACTTAGCCATCTCACTGACCTGTATCATTTTCTTCCTTCT 
ATCTATATGGCTCTGAGAGACCCAGGCCCTGCACITTTCTCTTTGAGTTCATCT 
CATCTGCCACCTTTCTGCCACTGTCAAAAACAGCAGAGTGGCCCT 

2 5 GGTCCTCACACTCACCTGTCTGCAGCTCTGATACTTAGAATCCACTTC^ 

GGGTCCTGTCTGGGTTCTCCCTGTCTGGCrCTCCrrACCCTCTCACCCCTCCCAGGCCT 
CTCTTTTAAGTTACACTGTGAAATCTATCATCGTGTACGCTAGCATGTG 

CCTAGTAAAGCATACGTACAAGGGCACCCGTACTCTGTGACTCCTGAGTTGACATCTAATATGTTCTGAG 
GTTTTCCTCCAACTCTTTTAGTAACCCAGTTTAACCTGGTCTGACAAATCCACTTCTGT 
30 AGAAATGTCAATGAGCTGAGTCCATGGTGCACCCCAGCCCTGCCAAC 

CTGACCTGAAGCCCGCCCCCCGGCCATGACCTGCCTGAC^ 

CATGTG ACT AAGAAAAAGGT ATTTG AAACACTGG AAG CACGGATG ACAG AG G CGTG T ATT AGGGG CT AT AAT CCTGG ACTT CTGGT 
GCATTCTGACCTTGCCTATCTACAAGCAGAAGGCGGAGGAGACCGGCAACTCACAGGTGAGCCAGCTGACCCAGCCTTAATT 

3 5 CTTCTATGTTCCAGGATGACTGCAGCCATCAGCGTGCCAGGCTATCAGAGGGAAGGGGA 

TGCTGGGCACCGTGGAGAAGCGGAGCCCACACACAAAGACTCTGCATGCTTGTTCTCACCT 

G ACGTG CATTTCCCTGTACTCCrCG CAT^CTATGTGTG CCTTAACTCTT AG CAAGCGT AGG AG AGTAGATGGGTGAGCATGGGAG A 
AGACAATATTTACAATACTCCAGAGAAAACGCAGCTTACCTTTGATGGGTTTTTAGGCAATTTAATAAGACC^ 
TACTAACAGTTTAAAGAATTTCAGGCCAAAGTAACCAAAAGAAAAAAGAAGACTCA 
0 AACATTACCAGAGGTAATGGTCAAATCCAAAGAATCATTAGGGCTTTGAAAATATGTATTACAGCAAATTAGAATA 
ATGGGTAAGTTCACAGGTACATCTGACCTGCCAGTGTTAAACTAAGAAGATAGAAACAACTTACACAGAC 

AAT ATCTCCCAACTGAAAAAT ACCCGK3TCCTAAATGG AATCATGGCTGAATT CTGT CAG AT ATTTG AAGAGATAACGTAAATCCTC 

CTTAAACTGTTCCATAAAGTAGAAAGGGGAAGAATATCAACAAACTTATTTTCCAAAGCCA 

TAAAAGAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAA 
5 TCATAC3VTTCCCAATAAAATAATAGAATGAAATGAAAAATATACAA 

TATTGTCCCACACACJUVGAGCCCAGGGCTGCACTGGAGAGGCTTAGCTCTGATTGAGGATCAGCCCTGCGTC^ 

CCCGTGGTTTTCCTATATCGATAGTGAGTGTGTCGAGAAAGAAGTC^CAAA 

TAGCTTCAACATGAGCAAATAAAACTCCTGGCAATAAACTAACCATGTGAAAAGTCCTTTACAGT 

AAGAAAGAGATGTCAATCAATGATGGAAAGGCCTCCCATGTTCCTGGGAGTTAACAGAGTTACTACCACAA 
0 TGGTCTGTAGATTGAATGCAATCGACAATGCCAATACCGT^ 

TAGCCAAAGCTATAGGATAGTAAGAATATTTCAGAGGTATCACAGCTGCTGATATCAGAX3GTATCATAGCTGCT 

CACTACAGAGC C ATGGT AACAAAAAAAG CATGGAAG TG GG GTCAAAACAG ACGTGT AAATC AG CAG AAG AGG AG AGT CAG AAAGGA 

ACACAATGTAGCCACCTGATACCCGAGAAAGATGCCAAAATGTGCATTAGAGAAAAGACGGGCTGGAGAAGAAGTG^ 

GGAACCGGCTAGCACATATAAGGAAGGXSAACCAGATCCCAGGCTCCTACCCTA^ 
5 GGAAAAGGTAATGGTCTG AAACAGCTAAAGGAAAACATGCAGAAAACGCTT CGTG ATGTAGG CAAGCCCTTTCCG AAAAGGGCTCC 

AGCAACTCAGGAAATAAAATGGAGAACTAGCAACTGAGATTCAAAGGCGTCACAGCAGA 

GACAGCCTTCAGTOTGGGAAAGAATTCTTGCTAGCTATGTAGCT 

AAATTCTGG AAACAAACAACCCAGT AAATAAATGCGCTC AGAAGAAAG AG AGGGTTCT C CAAC AATG AGTGCAAACAGCCAACAAA 
TGTGAAAGCATGCITCACACCCTTAGTCTTTAAAGTTCCTCTGAG^ 
0 C CAGG AG CCTG CCAAGT AGCT CAGGTGGT AC AG T ACTT AC CTCTTGTG T AAG AAGT CCTGGGTTTG ATTCCCAG C ACTCCCTACAA 

ACGGCTGCACTCCCAGATGAGTTCAAGGTCGTCCCCAGATGTGTGTAGCAGATC 
CCTGAATTCAGTCTCCAGGACCCAGATGGTAGAAGGAGAGAACCCCTCCCCTCAGGTGGTCCTCT 
TGACCAACAAATGCAAGAATGTAAATAAGAAAATCAAAATGGAGATTTAGAATA^ 

CACTC CTG CGG AT ATTT ACAAAACTCTAACT CCACG T ACT AT AACAT ACAT AT^ C CCTGC ACAT ATGTGTTT ATTTCAGCCAT AT 
5 TCAAGAGCTACATCAAACAACCACCGTAGCTCTCCATTAGCAGATGATGGATT^ 

TTTTTTCAGCCATTAAAAAAAAAAAAAGAAAAAAGAAAGAAGGG 

CAGAGGTCCTGAGTTCAAATCCCAGCAACCACATGGTXK5CTCACAACCATCT 

GAAGACAGCTATAGTCTACTCACATACATAAAATA 

TTX3CAGGGCAGTGTATGCAACTGGAGATCTCTGTATCAAGAGA 
0 TCGGTTCTAGATCCTAGGTTTTATATAGATGTACATAAGATATTTTGCATATGCGTGCA 

TCCGTTGGTAGGGAGGGGACGAGTGGATATGAGGAGGGGAGAAGGGGCCTTAC^TACGGTCACAGTG 

GTTTTCAGGGAGTTATCCCGATGAGCACTGAGTATATGCCAAGAA^ 

TGAGGCCATTTGCTGTCAAATGTTCACACTCTGACCTGCCTGGGCATGAGC 

naTrzrrcCGCCAGGCAGCCGTGCA^ 
5 GCACTGGCAGCTTCACTCGGAGACTGGAGCCTGTGGTGTCAUACG 
AGTTTCATCTAGCTGCATGGAGGGC^GGGCCACC
GAGCCTCTAGTTTATTGTCTACAAAGTTAATACTGC^^ 

GCCAAGGTAACACGTCTTCCTCAGAGAGTGCATTATATTCC1TACATGTAACA 
AT AG ATTG CTGGTGCTCTAAAAAAAATT ACCTT AATTTTTGCAC CTGG CAGC^ 
5 CTTGTCrCrcTTTATCCCCAGTTTGAAATACATTTATTAAGAC^ 

CATCTTTCAGGCTGCAGATAATAGAATGTGCAGGGTTTCCAGTAGCTTGT 
TGGGCTTAACAAACAGTTGTTTCAGGAACCCTGCA 

GCATCCAGCCACGCATGGGTCTGTCTGATTTAAACAAGTGTGAAGGGAAGTGA 

CAGAG ATGAG CT ACAG CTGTTGGAGTTTTT AAAGGGGG AAGCTTTGCC CCACT CAC CACT ATGGGCAG CTTTG CCCCAAACCACAC 
1 0 AGAATGCTAATTTAGGTTGGGGGAAAAAAAACCAGAACATCTTCACrrrrC 

AATGTGTGTGTATATATATATATATATATATATATATATATATATATA^ 

TTTACAAATGACATAAAGTTAGCATGAAACAAGCCATTGGTCCAGAAACATCAGACAGCTCT 

TAGAGAAGCCATATGCCACCATCCCTCCCTGACCTGCTCCTTTCTGGGGAACCATGAGTGCCCCAGA 

ACCTTGGAAGTCTTTCITGCTCTATTTGTC 
1 5 TAAGCCAGGGTATGCATTCTCTTCTCCCTTTGAACAGAAGCCCCGAATGCATCCAACCT 

AGGATGTGTGACGGGAGGGGAGGAGATTTACCTTCTCTGTGACAAGGTTCAGAAA 

GTGTGGCTGTCAGAGTTCTCTGGAAACAGCGGACCAGGGGTGTTCTCCTTTTGCT 

CAC CATT ACACGCATTGCAATCACATGCTTG AAT AC G CCCGTTTGCAGATGACATC CAG ATT CGGTTTTATG AAG AGG AAGAAAAT 
GGCGGAGTTTGGGAAGGATTTGGGGACTTTTCCCCCACGGATGTTCATA 
2 0 ATGCACTTCTGCCTTGGTGAAATTCAGTGCTCAGTTTC^ 

CAATGCTATGTTTATGGCTCTTCCAGTACTCTGACATAGAGGCTGAGAA^ 

GGTGTCTCTTGGCrGATTGGTTACATGACCAGGGTCrCCCAGCT 

CTGAGAAATGGTGTTTCIAAATAATTGGCAACAACCG^ 

GAACATC^TACAGTTACTAGAAAAGAAGGCTCTGTGAATGAACAAAGACCTCAAAA 

2 5 GGAACACACTGTATGTAGAAACTAATTCTATGTAGAAACAATTCT 

CTCCCTCTAGGATCTCAAGTTAGTGC^TGAGTGGGAAGCGGGGAGGCAG 

ATATITGATGACATCTTGATCCTTATAAACATAGAAAGACCGAAACTAAGTCTGTCCAGTTC 

ACTAGCATGTG ACACCTCCTGCGCAG CCTCACTTAGCATG ACTGTCTGCGGGCTGTCAGGGGCAGCCCTGGTTCT 

CATCCACTCCTTCTCTTGCTGCTGGCAGTGATCCT 

3 0 CGATGATGCTGTTGTGTTTTACTTTCACGGGGGC^ 

TTTGCACACACTTTATTTAAGTTCAGAGTATACATAGACGTGTATAGAATGTGCCTGATGCTCAC 
CTACACTAATGTTTGGGTCTTTTG(ZACCCn3CAGTTTGCC^ 
CAGCTTCCGTGTTTGTTCAGCTTCGGAGGAAATCAGACCTGGAAACTAGTGAAC 
GGTACCTGGGTGGTTTAGACrrcrTGTGCTTGCTrcrGAAAGACCA 

3 5 GATGCCTGGATGCACGCTGCATAGTGTTAGGGGCAAGGAAGGTCATATGCC^ 

TATATTGCTGCCAAACTTATATGAGACTTAGACATTTGGCTATTGTTGGGTA 

AAGCACTTG CTGTCAGGGCTGAC CTG AGCCTG AGTCC CAAAG CTGTGTGCTGAAAGG AGAGACGTACCTGCCAGAAG ATGTGGGTG 

GTCCTCCACCAGGAGGGGGCTGGGCGGGGAAGAAAGGGAAAGTAAAAAAAAAAAAAAAAAAAGTAAGGATTTTTTC^ 

AAGCCCITAAGAATTAACCAGAAGGCTTAGCCACTGTCCAGCCCIAGAACTATAAAATGA 

4 0 GACACCTCAGGAAAGGTGCACTATAGTTAAAATGCAGATTACGATT 

GACACTAGGCAGCTGCTCTACCATTGAGCTGTAACCCTGGTTCTCAAATGATAAGTACCC^ 
AAATAGTTGAGGAGACAGTCCTGGATTCCTTGGTCTAAAGAATCAGCCAAA 

AAATGATTACTGTGAATCTGTGTGGACTGAGGAAGGTTGATTTAGGCX3AGGGAGG 

4 5 AGCTGGAGGTTTTTGCTTTGACTCTGATGGCTAGGC 

GTGGTGCCTCAGCAGAGTTAGAAAGGGTTCTAGCAGCCAGAAGTTAGCATCCAGC^ 

GGACCTTGGGTGAAGAGAAAGGAAGAGAGACTCGAGAGTCTCAGATTGTTCTAGGGTTTCCAGACT 

GTTAAGTGGATAAGAGGAACAAGAGAGAAGAGTCTGAGAATGAAAGAGAAGCI^ 

AGGAAGGGGM3CCAGTCAAAGGCIATGACTCAAGTGGGTGGGGG CTC AGC^ CCCGGC CCCACAG AGTTTAAGTGTAG 

5 0 GGACGCACAGTGCTGCCGTCAACCTCTCTGTACCTCTGTTTCGTAAGTTTTGTCTTG 

AACTTGGAGGCTACACAAGGCCTGGGATCTGATAAAATCAGAAATGT 

G T ACAGGG T CCT AGGTC CAGGG C CACC AAGGCCCCCAGG TAT CGGT CC CTTGAAGGCAAAGCGTG ATGGTGG C CAGG AGTTT CTGG 

GAGGCCATGTCTCTTTTTGAGGTATGGCCTCCCCATTCCTGGGTCAGGCT 

ACGAGCTTGAGGTCTTCTAGATCCTTCTGTCGAG^ 

5 5 AGGAAAAAAAGATTATTTTTCTAAGAAAAAAAAAAAAAAAAAAACAGTG 

TGCTGATGCAGAATCATGAGAAAAAGAAAAGAACTGCAGGTTCCCTGTGCAGGATGTGA 
CAGGCAACCTCrCCTGCTAGTGTCrGGATCTGAAGAGTACAGTAGACAAGGCCCAGCTTCCT 
GAGTTITCTAATGCCTGGATTAGAGGATTGGGGTTCCCn'GAAGTCCTGGTTACCrCACATCTT 
GTATACTTGAAATACAAATGAGGCTCCACCTACACACAGGATGGCTGAGGTACCTGTTGTGTT 

6 0 TTTCT CTGTG AG AT CCACAGTTCATC CACTGACT AAATG AAT AT AGCT AT TGG TTTTGG T AT AACT GGTGGG AAAACTTTAT AAAT 

ATTTCTCCGAGCCATClAGTTAGAAAGTAGACAGTCTCTTCACCrCAGCCTCACT 
CATCCTGGTGTGCCCAGGGTTGGTTTCCCAGGGACTGGCAAAGTCCACC^TGCTCTCCTG 
TCCAACATAGTCO^AAAGTCCTTGTGCCACACCGTGGGGTGATGGAAACACCCACAG 
TAGACCATGATGAAAGCCAGGATTAATCAGTCAGACACAGGACGATTC 

6 5 TTGACTATTTAATGTAAACACAAAACGACCTTCACre 

CTCCCAGACCACAAAGGAACAGGAGCACCATGTCAGATCTCCCACACAG 
GGCCATGACAGAACTGAGGGCACATGGCTGTCGATCACATTGCAGTAACAGAA 
GACAGTGAAAGACAAGAGGAACAC^AACTCTGTGAAGAGCACCAAGATAA 
CCAGAGGTCACTCACTGTCCCAGAGGCCATCTTTATTTTGGAGTCCGA 

7 0 TGTTTCTAGG AT CG ACATT AT AT CH" CTT ACCACAAT AACCAGT AAGGACTCGG TCTC C ATTG CCTCATG CCTGC!ACnX3TCTTCCTT 

AGAGTTTGACTCTAGATGTCCTGTATACAGTGATAGATGTAGCTCCTAAAATGCT 
AAGTCCCTCACATTCCTATCCCTGAGGCTCCTCAGCATGGGTGACCCCAGTCCT 
ATCTTATCCCTCTGmGAAGCCCTCTGTCT^ 
CTGTAAATCTGTTTTCACGTGATTTCACATGATGGTATGATC 
7 5 ATTGACGGTCCTTGAC!AGCAGAGATGTGTGTTATGTTTCrcTCCATCCCAGC^ 
CTGGTTATTATAGTTGTCTTAGGTGAACAAGGAC^

TTCACGAGGCAAAGGCAGTTTTGAGTCTTGTTCAAAACCAAGCTAGTCT 

GACCCTATCTCAAAATAAAGAAGTAAAAGTCCTTTAAAAAGAAAAAAATATGTC 

TGAG ATCTCTCACCATCTTGTTAGCACTGGGAACTTAGGAGGGGG CX5CT CCGCACTGTGGTCTCTGA 
5 GGCGTGCTCTTGGTCTTCTCGTCCATTGGAATAAAAACCATGCTTCCrC 
ATATCATGAGAATCAATCGTCrCTCTGCGGCTGCTGGTTO 
ACAAAGAGGAAGTGCAAAGGAAACGCCAGAAGCTTATGCCGAACTTCT 
GGTGGAGGCATGTTCGGTAGTGGCGGTGGCGGAGGGAGTACCGGAAGCCCTGGCCCAGGTACG 
CCTTTCATGAAGAGGAGGGGGTAGTTTTCTTTCCCAGGTTGCCAAACAT^ 
1 0 AGATTTCAATTAATGGAAAATTTCCTACAGATGTAA 

TCC CCACCCACCCAC C CAT CAAGTTT CACTTTCTGTGG TTT CAGTTACACG CGG TCAGCTTGAGT CTAAAAAT ATT ACCATTGTGT 

TGACAGAGAGAGACCACACACACAAAACTGTCTTCACAGTCTGCT^ 

GTCTTTTTACGTTGTGAAC1TTACCCAGGTCTGTATGCATCTG 

GATAATATGTGTCTAGTCCCGAAAAGTGTAAGAAAAAAGTCAACTGAAGGGTTTCTGGAAGAAAA 
1 5 GTCTTTCCCTCCTCCCCCTCCCCTTTGCTTTTGGAGA^ 

ACCG CATGGT AGATGGGT AT AAT ATGC CT ACACAG CAT ACTGTGGGCTGG AG ACACTGT ACTG T AAACAAACTGAGT CACAGAAAA 
TCATGCCAGGGAAGCACTTCAATGGTGTGTAAACTACTGCTCCTA^ 

TGCTGAGTAACGCTGCTTTTGCCTCAGATCTAGCT CAATGGAAATACAGTTCTTATAGCAGCATAAGTGTG CT 

20 ' GTITAATGACTCACACCTGTAGCGT AAGTGACTTGAATGCTGTTTCCTGT CTCCTGTGTAAAATAACTGTGCTG AG CATGGCCTTT 

CTTTATTCCAGATAACATGTAAATGCTTTAAATGTCAGA 
TCTCCACTAAAGTATCATTATATTACGTATTGGGTTCTTCTCACC^ 
TCTTGGTGTGAGTACHTACCGTTAGCTTTTC^ 
TGGGCACTGCCTGTTTGTTCAGCTGTCTGTTTCCCACA^ 

2 5 T CCCCATG C C ACAGCCTGCCTC CCTGAGTGGG TC CC CAT ACTGTT AACAATCTC CCTGATGTTCCCTTTGTCTC^ CTGGTTCA 

GGGTATGGCTACTCGAACTACGGATTTCCTCCCTACGGTCGGATTACATTCCATCCCGGA 
CCATGGTAAGTCGGATCG^AATACCCAGAGCACTGCTCGGGCAACAGGATCATCTCT 

ATT AAT ATTCTCAGGTTATTCTAGGACAG CATAAGGGACACCCCTGACACGTATTTGCrCATCACATTGTGTGGAAG CAGGACAGA 
GGCTCTAGACAAAAGATGAGACTTGACTACGCAAACACACAAATACCACCTTC^ 

3 0 TXjAAATCAGTTTTCATTTTGTAGGGTTGGCCTTTTCCCACTCTTC^ 

TATATAAAGCTGCATTCTTCTGTACCATGGAAATTATTCTATAAAGTC^ 

TCACTGT AAT CTTTTTT AAAG ATTTTT AACTGG CATTGTCG TTT ATCCTAT AAT CGGGT AACCATT CATCCCAT AAAATAGAACT A 

TTCTTTCCATTACCTATAACCTCATGACCCCTCTGCAA^ 

CTTGACTTTCAGAAGACACTGTAGCAGGATTGGGTT^ 

3 5 GTTCCT ACAGCAACGACAGTCAGTCT AGTG ATGGTGCCCTAG AGCCC^ CTC 

CCITTCCCTTGCCACTCAGGATTAGGAAAGAAAATAACTGCTTTCGAGAACTTTC 
CATTTTTCTTCTTCACAGATAAACAAAATGAGATCCAT^ 

TATCACTGTGTAACCTTGACCTTCTGGGACATTGTGGGTGGAGCTCACTACAGCTGGCT 

TTT AAATTG TTTCTG AAT ATTT AAAG TATTTC CGAAACAT CC CAGGTT^ CACTTTTTG TGA 

4 0 GTTTAAGGCCAACCAAAGCTACACAGCAAGACCCTGTCTCAAAA 

ATTTGTGTTATAAACATTT AT AGTTTTCC ATTTCCTGCAGAT AAGCCTT CTTCCTATTTTCC'lT'rriT L'TGC CCTATGTTGAGACA 
GTGAGCTATAGCTTAATAGATTTTCTACGTTTTTTTCCCT^ 

AAGAGTGATGACGAGGAGAGTCTGACTCTCCCTGAGAAGGAAACTGAAGGTGAAGGGCCCAGCCrc 

GG AACCCATCGCCTTGGCATCCAC CATGG AAGACAAGGAGCAGGACATGGG ATTTCAGGGTAGGTGAG CTTGCCCACGAGGG CTTC 

4 5 GAGGGACAGCTGGGGTGGGGTTTCAGGGTAGGTGAGCAAGCCCACAAG^^ 

CAGG AAGCTGTCT CCJVGAAGACAGGTTTCCTTCCTCTTAATCTAAAGCTG ATt3 AGTCAGC ATCTCCTCT CTGCACCGTG 

ACATTGCCTAGTCTGTGCAGTTCTGGGTATGAGAAGAGGCAACTCA 
CGCACCTCCCTCCCAGCAGCCGGCTCJ^CTGCCACACTGAAGACGTTTGCCTG 

5 0 CITG AAG AAG TAG AATTG AAAGGG AACGTTCCATCTG AG CAGG CCTCATTG G CAT ATG AGTCATG ATTCTCG ATCAGACG CCATCT 

GTAAAGAGATAGTAACTATTTACACATAGTGGCCGTACACATGGTTACTATCACATATTTTTTTCCCT'rT 
ACTTCTTCTGAGATGCACAGTATATGTGTACAGAGCCATATGGAAACAGGGTAGATT^ 
TCCCTTGACCTTAAGATAATAAAATTTCCAAAACCCCTGGGGTTTCT 
ACTCAGACAGGATTCCAGAAGAGACTTTGTTTGTAAAGTGATTGGCTATT^ 

5 5 TTGCTCCCCTTCAATCATAGACACTGATTGTTCTTCTCAAAATAGCA 

ACCTCTTAACTGTATTATCTGTCAGTGAGACAATGCCCGAAATGCCCG 

AAAC^GAGCATTTTCAGCTAAAATGGTGTGCCATGCCGCACACX5CGC^GCGAGCAAATGACACTCA 

TTGCAGATAACCTCTTTCTCGAGAAGGCTCTGCAGCrCGCCAGGCGACAC 

GTGAAGATGTTGCTGGCCGTGCAACGCCATCTCACCGCCGTGCAGGATGAGAATGGGGACAG^ 

6 0 CAGCAAGCTCTGTTTCAGG ACGGGG AAAGG CAAGGATACTGCCCCCACAAACTG CCTTTGAAAGGGAGCTG CCCTCATCTGACAAG 

ACCTCGAGTATGCTGTCTCCCTAGGCAAGAAGGCTCAGATGTGTGTAATCATGCTAATTTTACACCCC 

AGTAATTGAGAAAATTATTGACAGCCCTGCAAAGTCCCTGAAGTGTTATTGTCATAGTGTACC 

ATTAGTAGAGTAACTAAGAGGAAATGCCATTCTCTCCTAGCTACTAGCT 

6 5 GTTCTGTGCACATAGCTACCTGGCTCCTTCCTAAATGGTTCTCTTA^ 

AAATATAGCCAGAGACAGCAGTCTTGAGATTCAAGTCCTTTATCT 
TGCTGTGGAAGCATTCTGGAGTGCTGTGTGCTGAGTCTGAGTTT^ 
GTACAAGGACCTTTGTCCTCGTCACTTGCAGTGTAACTAAGGATGTGGTATAAC 
CCCAGACGACCTTCTGTCAAAAAAGCTCTGAATAACAGAAGGCGCCTTTAGAAGGAAATCATCC^ 

7 0 AC CACAAAACAGAG CAGAT AAC CAG AG CAGG ACAC CGGTGCTCCAAGGGTCAT CTTTAAGGACTCGCG AAATTCATG AGT AGTGAG 

GGCGAGTGGATTAAGCMTTAGGAAGTGAGATGCTGCTGTCTGAAGGAGAGGCT 
CAAGTGTGTGCAGAGGAATGCAGGAACATGTGTGTAAGGACCTGGGTX5 
TC^TACAATACCCAGAGCCTCGTCAAGCCAAGACTTAAACAAGCAGGCTCCG 
AGAGGACCCACTCAGGAGTATACATAAGGGACTATTCTGGTTTCTCTTCTTTATTC^ 
7 5 CCTCTCACCATTAGTCATCATAGAAATACAATAAAGAGAAACT^ 
GATCTGAAGGGGAGGGGACGTGCTTCTGTGAAGTGCTGAGCTGCACATGGAAATGTTGA^
CGTCTGTCTGTCCX5CATCACCTCTCACAAGCCTCCTGTCAGGGTCCCAT 
CTTCTCGTCAAAGCGTGCGTGCTTCACTTGCAGTTAACACAGGC 
TATTGCTAAGTGAAATATCTCAGGGATAAATAACAAGACCTTGTTrAGAAA 
5 CTCGGCAGCTGAGTTTTCCCAAGCTGCrCCTAAGCCTAG 

AGCTGCCAGGGACAGAGGCATGGAGACAGAGCTGGCTTGGAGCTTAATC 

CATGTGTCTGGT CAG ATGTT AACACTCCGGGGGCAGG TGGGCACGAT CAGTG TG TAGT AG CCTG CGATAGGCAGTGGAGAGG AAGT 

TCTCCAGAGAAATAATGTCACCTCTGATCAGTACATTCAGTACATGGTAGGGCCAAGGAAAGCT 

CTCCAGTCATTTCTTGACATGTTTTTGTTTCTGTCGCTT 
1 0 TACACTTAGCCATCATCCACCTCCACGCTCAGCTT^ 

AACATGAGAAATGACCTGTATCAGGTAAGCAGCCACCACAAGGCCGTCGAGAAGAG^ 

CTGTTCTCAGTGATGGGAAACAGGAAAACAGCATCTTCTCAGAAGT^ 

AGAGAGATGGGTTGGGATCATGGGTTTGATGGCAATCTICACTCCATGGAAAT^ 

TAGAAGTCAGAGTTGATAAAGTTTTGAGGGCAAACCTCAGAGGGGAAAAAGTCACAGCrGTC^ 
1 5 CCAAGTTTCAAAAGCCAGAGTTACAGAGACAAAGTTTGGAGCTGA^ 

TCCATCCCATAATCAGCCACAAAACGCAGACACTATTGCATAT^ 

TGTGAGGCTATGCTGGGGCCTAGCAAACACACAAGTGGATG 

GCTAGAGAAAACACCCAAGGAGCTGAAGGGGTCTGCAACCCTATA^ 

ATGTCTCTAGCTG CAT ATGTAGCAG AAAATGG CATAG TCG ACCATCACTGG AAAGAG AGGC C C CTTGGT CTT ACAAACTTTATATG 
2 0 CCCCAGTACAG<X3GAACGCCAGGGCCAAGAAGCAGGAGTGGGTAGG 

GAT AG CATTTGAAATGCAAATGAAG AAAAT AC CTAATAAAAAATTGGAAAAAAAGAATATTTTCAAATTAAAAAAAAAAAAAGAAT 
TTTGAGGTCTGCGGAAGGAAGGGTATTCGCACAGCAACTGGATCA 

GGTGAAAGG CTGTAAG AACAGGAAGAGGCACCTCAGAGTCCCATCCAGCTCAGGTGTGAGAAGGTAAGGG CA 
GAGGAAACCACCTGGAGATGGCACAGTGAGCTGAGGACAGAGGCAGATGGTAGGGO 

2 5 CAGTCTGGGAGCTGTGGCCTGGCCCAGGATGGTGGCCTGCTCTACGG 

GTGCCCCTACCCCTCTCTCrGGGCCTGTTGGTCTTATCCTCrrGTTGGC^ 
TTCTGATCCCCACAGCCAGGCCCATGCTGAGACCTCTGCTTC 
AATAATTTGTTATTTAGAAGAGATGAGATGAGAACTAGTGGAATTCT^ 
GTTX3TTGGTTTTTTTGGTTTGGTTTGGTGCTTGGGACTGTACCCAAAGCTG 

3 0 CTCCATTTTTAAAACAAGTTATTCAAGTTGTTATCTCTT 



AGGCTGGCACTG AGCCATCAGTGTCCTACCTCAGCCTCCCIAAATCAT AGG CTTGACAAG CATGTG CCTGGCATGTTATCTCATTTT 

3 5 TAATTTAAAAAAAAATCATTATTTTAGAAGAAATTAAAAAGAAACAAA 

TGAAGACCATGTTCAGATCCCCAGACCATCGCCAGCCAGAT^ 
AGTCAGGGCAGGAGACAAGCATCTTGCATATGCG^XGGGGGGGGGGGGACAAA^ 
GGCAGTACACACGTTGCCCTCTGACCTCATGCACGTCCCTGCACC 
GAGGGAGGGAGGGAGGGAGGGAGGAAAGGAAGGAAACTAACTGCACATACACACTAA^ 

4 0 AAGGAAATCATTAAATTCTTATTTTITAATACAATATTTTAAAGTTAC^ 

GCTTAAAGCCTTGAGTTCGTTCGCTCCCCACACCAAAGAAAAGGCTC 
CGTTTTCAGAGGAGCACAGCTGCTTTCTGCACAGCACTCTGCACTTCTCTGT 

TCATTGGGC CCCG AAAGTTGAATTCTCCAAACTTTTAGG CAG ATGGCACTTTTTAAAAACTITGTATTG ATTCTCTC 

4 5 CATCC m T T TTTT lT TTTTT n TTn ^ 

TTTGTAGACTAGGCATCATGCACCCCAGTCCCACTCATCTCCCTGTCCCCrCATATCT 

CAAAAT AAACACT CAAGT AACAGG AAAG ACAAAAACAAAAA^ CAT AG AG AACAT CT CG ATGTGGGAG CGAT AGTGT GT CAC CGG 

GCATCCCACAGTGTATCCCTCTGTCCACACATCTTCACTTGCAATGAGTG^^ 

AAAGTATAAGCTTTTATGTTATTTCAAGATTATGCTTATACTGAATTT^ 

5 0 GAACTAACGACATAAAAGGCCGTGTCACATGCCGGAGAATGCAAGAAGAGAACGCAGTTTAAAGACTCACA 

AGAGTGGGTGGGTT C CAACAGGGAT CTG ATAG TG ACAG ATGGTC AC AG AAATCT ACT ATGGTT AGTGG AAGATGTTT G GGG AG AAT 
GTTTGAGATAATTTACAAATAATCCAAATTACTTAGAGAACTGTATACTTTTCTGCCAAGA^ 
TGCATATCTATAATTCCAGCCCAGAGGAGTCTGAGGC1AGGA 
CAATTTTCTCTGTATTATATTATAGGGTCTTGTTTAAACAACAAGAACAAC^ 

5 5 CCTCCAAATTAAAAAGAAACTAGTTCCCAAGCTGGGTGTGGGGCACACATTTGTA 

GACAGCTTCATGTCAATCTGGGCTACACAGCAAGTTCTGGGCCAGCCTCAAACACA 

AAACAACTCCACAAAG C CAAACAAAG AAAC C CAAAAGTT ATGAG AAT CACAAACG CTG C CTGTCT ATG T CCTTG AGGCT AAGAAG C 
TGGT ATGGCAGGGAATGTTTGTGCCCATTAACTGAGTGAAGC CAAGGTGGGAAAAACTGGGGAGAAAGGTCTG AGGGCT CCAAAT C 
AGAGCAGCTTTGCCTTGAGCTCAGGGGAGAGGTTCCATCCAC^ 

6 0 ATCACCAACAGCCAGCAGGGCTTCCTG AATCCAACCTCCAGCAG CCGAGGG AAG CAGGTTTTG CAAAGAGG ATTGGGAAGT AG CAA 

CITCCTCTTCTTCAGTTTATTCTTATACTTGTGATGGTTAAAAA 
GGGTGATGCCTTCCTCCACCTCGCCTCTCACTACCTGTACAGGCAGTCAGGAGAGCT 
GCCCTGTCCTTCACAGGCITTCGAGAGCAGGCCTTGC^TCT^ 
TGAGCAGCCCTGAGGGTGTGAGAGCAGGAGAACTGGCCCTGCCCCTCACTC^ 

6 5 TTTGGGCAGCACAGTGGAGCCAGCTCTGGAGGCATAGGGAG 

TTTTATACATACATTTCCTATCCCACATCAGCCTITCCCTTTAAGT^ 

ATTCAACTGAGAGGACCTGTATATTTCCTGAGATTGAACCAGACGCTTACCT 

ACAATAATACAATTAGGAATTGCAACAGATTCTAGTCAGATCT 

7 0 TTTTTTTTCCTAAGCTGAACATGGTGGTATTGTGTTT^ 

ATGTTC CG ATGT AG CCTGGG CAGAAAGCAAGACTGTTTCT AAG CAGAGTG AATG AATGAATGAATG AATGGTCAATCCATGTG CTT 
GTGGATGTGTATGTAGCATTGTACTGTGTCCCTGTCTTGGGTCTTCACGCTGTT^ 
CTCATTGCTCCTGAAGAGAATGCTGTCATGTCCCCTAACTAGCTGCTCACTC 
TGTACCCATXSG^TGCCACAAGTGGGTTTCAA 
7 5 CATTTT ATGCGTGTGTAAAATGTAACCCTTGGGTGTTGGTGG CCCACCAGCCACTG CTT CCCACCAGCAAGAACTCCCTG AATGGG 
AGG T CGGG CAAG T C CTCACT ACACAG CCTGGGCTTCAG TGT AAAG CG C CT ACAACACAG ACTGGCTGG CATTGAGAGTTGGTG CT A

CTCTCTACCAGTGATCTCAGCTTTTTGACAGACACTTTTGTGATTGGGTT 

CCTACTTACAACCATGGAAGCACAGCTTTGAGGTCAGGGTAG'TCAGAAAGACCT 

GTTTTCTG C CIXIAACTTTCTT AGGGAGAG TG CTGT ATTGG CT AACTATGCCCTG AGG ATCTGGT AG CACCTGTGAT AGT ATT ACAG 
5 TGGAGCTTGTATCTGTGCGTCTCTTGAAGCCAGCCTGCAGC^ 
GATGACATCTTT C rTGATTGCTTriTT^ 

AT AGAATT ACCAG CT AT AGAT AG AT AC CT ATCAGAAATTCAGAGAAATT AAATTTT CTTCT ACAAGAG TT AAAAAAAAT CATG TAG 
CAAAOSTATCCCATAACCCCGACTGTTGAATGCTGGTTCCCrrCCGTGGGCC^^ 
GTGACAAATGTCCCTACAGTATGACTCCATAACGTAGCCTGAAGCCAGGGGCT 
1 0 TTCTCTGTGTAGACACCTCTGCACTTGGCCGTGATCACCAAGCAGGAAGATC 
GAGCCITCrGGACCGCTGGGGCAACrCTGTCCrGCACCTAGCTGCCAAA 
GCAGAAAAGCAGCGCCCCTTATCGACCACCCCAATGGGGAAGGTAAGAGCAGTCGCC^^ 
AGCATAAGGGCTTGGCATTTGACAAATGCXn'CTGACATC 
TTGTGGTTATTTCTGCTGTTTATTACAGCAAACGCCCTTAAAGAGTATT^ 

AGCTCAGTGTCTCCCGAGAACACAGTGGGAGCTTCTGC CTTCT 
GAACAGAAACCGGAAGTGCTGCGACCTGGTGGTGTCCCTCCTCCCCGCAGCCCAGCT 
GTTCGGCCITAC^TCTAAATGCCATCCACATAGCTGTGATGAGCAATAGCCTGCCATGTCT 
GAAGTCAATGCTCAGGAGCAGAAGTCTGGC^CACAGCGCTGCACCTGGCCGTC 
2 0 GCITCTGGAGGTAAAGGCGTACTTGTGATCrTCACCTA 
GCCAGCTCGCTGTCAGTATTCCACTGTCTCCTTTCT 
CCAGCCCAGTATAGCAGTTGGGTGAATGATCTAACCTGAGGTGCAA 

TAACTCTTTCTTTCO'GGGCGGATTTAGGGTGATGCCCACGTGGACAGTACCACCTATGATG 
CGGAAGAGGrGTCCACCAGACTGGCAGCTCTTCTCAAAGCAGCAG^ 

2 5 GCAAGTTCAAAGCTGCCTCCGGAAGCTGCGGTTCTGCCCTC^AAAGTGCCGGTAAAGGAGACCT 

GGCTGCACTGGAGGCTCTGCTTGGCCTCGGGTITACCrTGATCCT 
GTTGTTGAGGTTTACTCGGTACCTATAAAATCTTGAGCTGTGTCAGCTTCT 
AAAGACTTCCTTTTGCAAAAAAAAAAAAAAAAAAAGTCTCTTAGAAGCT 
TTACTGTGCTVCGAGGCACTAAGACTTGCTGGCCTCCACTGCCTTTCCrCG 

3 0 CTCCTTTACCACATCTACTCCTCAGTAACTCTCT 

CAAAGATGATCAAGCCCCACTCCCAGTGTCCCGGAAAGCTGAGTCACAGCACCGCAGCCTGA 

GTCTTTCTTTCTG CTCTGCCTGGGGTTCTAGTCCTTCCTGTAAAGC ACTCAG AAACATTC CAGCTCATCTCCTGTCTCCGGCGCTT 

TGTGTGTTCAGTGTTAACACATGCTTGATGAGGTTGGCTGGCCTTCCGGGCCTGCCCA 

TAAC^AGGATTTGTGAACGACACATTCCTTATAGAT^ 

3 5 CTGATTATACTGGGGGAAAAAAAAAAGGTTGTCTCTTAGTCAGACACCTGAAATCTCAACATTCAGAAGGCTGAGGC^ 

GC C ACAAG CTT AGG CT ACCTTGG TCCACAT AGTT CT AGG C AG AACAAAG AT ACAT AG TG AGAT ACTG T CTCG ACAGT AAATAT AT A 

AATGTAAACACTTTTACTTGAATTCACCATGAAAAC^TAGCAAATAATATCTTGTGTGACATAAACAAAA 

GCCCTGACCTCCTCGGAGTCACAAAGGTCACTTCAGGGACATGTGT^ 

GG CCATGTTTTGCT CACGTT AT AAAAGAG CAAGTG AGGCAG CAGAG AGTGGCAAGGCAGCAGTGTTXK5CCCAAGTCAGAGGTTCTG 

4 0 TCGGCAGAACTGAAGCAAACACTTCATTCCAAAAGGCTGTGCT 

TG AAGAACAGG CAAT AT C C CC^U^GATTTGTT AGCACATTTTTT C CC CAT AACAACACATC CTGCCT AG CAT AGCGGCCTTGTGG TG 
TCCATTTCTCTGTTGCTTAGGAACTAGAACTAGGGCCCCAACAGTACAGCACT 
CACAAGTAAGCGAGAGTGTGAGAAGTGTGTATTGGTAGATGTAAATGTGAAGGTTCAG 
45 CCACCTACTAGTCCTTAGGTTAAAATGTATCCTTTGTAGTATAATAAT^ 
AGTACTTGAAGTCTTTCATGTATTTACTTTACATCCTGATCACGTCCTCCC 
CACCCCCTCCCTCTCTCTTCACAGGGTCCTTGGCTACCTACTTCCATAGA^ 
TCACTAGAAATGAAAAGTTATCCTGTAATTGCCTTAATGAGCTAGAC^TAACCTGTCCCT 
GAGGCGGACAAGCTTTTTCTTTTACACAACACTGTATTTCCCTGACT 

5 0 TCT ATGACCTGGACGACrCTTGGGAGAAGGCTGGAGAAGATGAGGGAGTGGTCC 

TGGCAGGTGAGCTCTGGCCAGCCCATCTCATCGGTATTTACGGGGGAGCCAG 
CATGCTCCTCCCTAAAGAACATGCCTTTGTTTTTATGAGGTCTTGTT^ 

TGAAAGATGAAGGTCCGGTGAGCCAGAGTGTAAAGCTGTTTGCGTAACTGGATTAGAGTTCAGATATGAGAGGTTAGAA 
AAATGCACAGAAG AGTTACCCTGTCAG CACACGACCAGAGTTCTTG CCATTTCTGTCAG AAGCTAGCTTG CTCAG AG CTTAAGTAA 

5 5 GAGGTTTCTGAACATAAAAATAAACTAGAAATTCTCTTGTGGATTTCCATAAACATATTAA 

CTTACACTTAAAGGAGAAAACGCTAGGTGTTT<3GACCAAG 

GTCGGGTTTGTAGGAATCTAGACTCTCTtn^CACGTCTAGGAGGCTGTTTGCTGG 

C CC CTG ACCTG ATT CCT AC C CTG TGT ACCATCAG CAAATT CAC CT C AG AGAAC AC C AGGACTTTT AAAAAT CATTTCTG TG AAT AT 
TTTCTCATGTTTTTCTATCCGACAGGTATTTGACATACT 

6 0 CACAAGGTG AGGTGTGG CAGG ACAGATTCTCTC AGCCAGTGTTG CCCCATG CTGTGGTTTACAGG AGGAG AGG AGAGG CCAGCCTG 

GGCG AGGGTGT ACTGCAAAC CGTCTAG ATCTTATCTCAG AATCTG AGATCCCTGGAGGTGTGGCC CTG AG CTG CTAGAAGCACCTA 

CTGCTGCTGCTGAGCAGGGCCTCTGGACTTGGCAGAAGCTTGGTGGC 

GATTGTCCCTGGAAGGTTAAAGCAGTCATCTTTCTTCCTTAGG^ 

AAACTGCTGGAAATTCCTGATCCAGACAAAAACTCC^CCACTCTGGCACAGAAGTTGGGT 

6 5 GCTGAGTCCTGCTCCITCTAAAACTCTCATGGACAACTATC 

GGTAGTTGACCGTGTCAGCTCCACTGAGCAGCACAGCCTACCCCCTTACATAGTA^ 
AAGGCAAAG>GGAGCCAGGCCAGGATGGTGCCTTTTTCTTG^ 
CAGAGCCTTCCTGTGAAAATACTACTGCCCACAGGATGGGGACCCCTCCAAGCT 
GGAGCCCCCCAGGGCCATGGCTTCATTGTTCTGTCGGGTGACTCCTAT^ 

7 0 GAGCTACrCCGAATGAGGCTGTCATTGCrAGCTGCTGTCrTCCCTGGATTTCT 

TCCTGTCTCCATGAACTCACAGGCTTCTTTCCCTTCCT^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

7 5 nnnnnnnnnnnnNNNNNNN^^ 
CAGGCAGCCTTCCGCACCCCGGCAACCACAGCCTCC^
CACGAGGCAGCACATAGGTAAAGAGGCTGAGAGAGTTCCTTTCCT 
CCCTTTCCCTAGT CTr rfTCCTCACATGTGTTCCTCTGTGGCATGCCACCCGCT 
AACAGGCAGTTCCCACAGACACTCACATTGGCATATAGGTCACTTAGAATCCCAG 
5 AAACACTACTT CT CAT ATTTG TT AAATTTGG AG AAAGATTCT AAGAAAGGTGCACAG CC CTGGGC CACAGGACAAG AAG T AGGAC C 

CAGGAAATATGCATACTGTATGACCCTTCCTTTCATGCCACCACCCT^ 
CCTTCAGCTCCCACCCTGCTCATGGTCCIGCCTTTGTCTTCCCACACT 

TCTACCCTAGCCACAAACTTCACTGAGACCAGGAACAACT^ 
1 0 TTCCATCTGGGCTCAGTAGGTTATGTCTGCCTTCCACCTGGGGCAGTCCTCCCCT 
GGACACCTGATTCTTGCCTCTTTGTCCTGTACTGTCCTTC 
ATCTTTAAACACTGGCCCGCCTGGCCACAGGCTG<nTAGCCCCGCTrTG 
ATAGCGCTTTGGTTAGCTCATGTTTTCACGTTTGTATGAT^ 

GGTAAACACAGCCTCCAAGTTCAGTAGAAAAATGTCATCATTAAGAGTATTATATCCAATAAACAGTAAAC^ 
1 5 GAAAGAAATTTAGAGAAAAACGAGACATCACTTCCTTGTTACACCCT^ 

CTGGTCAGCTCACCACCTCTGTCTGCCATCTGCTTCTGTCCCCTCCTCTCCT 

TGTGTTCCCTACCCTTTCCATCCCTTTAAAGCCTGTTTTCCGCCCCCTGGAGG 

ACITCCACATGAATATACATACGTGACTGTTGGAA 

CCTTGATTAGTATTCTCCATTAACCTGCAAGTTTCATCTCTCTTTAC^ 
2 0 TCTTGGACAAACT CAT ACTGGCCATGACTGTCAAAAG T ATAG CCAGGCTGT AACAAAAT CT AAG AG TGG CAATGAAT ACTTTCTGG 

CATTCAGGTCCTCATGCTTAACCCTGAAAGGGCATCTCTTCTTC 

CCAACAGTGGCTCCTAGTCAGCCrCTTTGACrCTITGCTTGCCCCT 

CTCCAGTCACAATTAAAAACTAAATACTTAGCAGCTAGGAACACGGCAAGTGTCCCCT^ 

TGACAAATATCTACAACCCGATAGCCCACTCATAGAGGAAAGGAC^CCGAGAGAGGTGGAT 

2 5 AAAAACAAGATGGAGAAGGAACAAGGAAGTCACACAATGTTGACCTCCCCACAC 

ACACAAAGCAAATACTTAAATACTGTATTAAATTTCCATCATTGTGAC 
AGGTCCCATGGTCC1ACTGCCTTCGGGGTATAGCAAAACCCTTCATCACGGAGGAAGCA 
GGTGTTTGGAAGTACACAGAGATGGGGAGGAGCCCCAGTACAGATCTCTTT 
TGAGGCCCCATAGTCAAAGAGTTGTATCACCTCCAGATAATGTCAGAGGCT 

3 0 GGGACATTTTAAGATCCGAACCJVTAATACACAGTAAGCCTTTCAAATGCTGCCC^ 

TGAGGTAACACCAGGACACCGCTCTGGCATGGTCATTGAGTCTCAGTGTTCT^ 

GCTT AGT AAGTGTTATGCTTCTGTTTCTTT AGT CACTCTGT ACAG AAAT AATCTGCTCTGGTT CCCAGGAATGCTTTTCAGGTGAT 
AGAATTTAACTTAAGCTTTTGGTATAGACAC^TTTAAAAAAAAAAAAA 

GGCAATGTGTATTCTTTAAAAG CTTCAT ACAGCTCAGAGAAAGGTGC CATTTTTAGTTCCTGCTTCTGTTATC 

3 5 TGTATTCAGCAGTTIGCCCAGCCGTTAAAAAAAAAAAAAAAAAAAAAAAA 

TGC CTCACTTTACAAAGG AAAACATGTTTATCCTG AACC AAG AG A CTGTTCAACAGAAAGACTGATGTACCC 
TGTAGCACTCTCTAATGTCATTTGATAACCCTGCCATTCACACAG 

G CTGCGCAGGAGCTAGCAAGCACAACCCTCCAT ACCATAGTCCACACGG AG CCCACTGCTGTGGATGGTGCTCT 
AGGGCACTGTGACAAAGCTCTCTGGGTTTATGTCAGAAGGGATTCAGGGCAGGAACCC^ 

4 0 ACCCCGGCTTGCTGCCn'CATGGGCTCATGCrCCGCTAGCTTTCTTTTCCT^ 

CAGGGTTTCTCTACATAGCCCTGGCTGTCCTGGAGCTCACTTTGTAGACCAC^ 

CCTCCCGAGTGCTGGGATrAAAGGCGTGCGCCACCATGCCCGTCTTCCGCTAGClTrCl rACACAGCCCTGGACTGCCTCTCAGGG 
AGTGGTGCTGTCTACAGAGGATAGGCCCTCCTCCCGATCAGTTAACTGTCAGGACACGCCCACAG^ 
CCTCCCTCATCTGCACCTCCCTCCTCAAGTGATTCTAGACGTGTGAAGTTGAC^ 
45 AGCCCTTGCCATGGCCTAACTCAGCCCCTCATGAGTCTGTGCTTTCTCCGCAGATC 

GTGGTGTGGAGACATCCT^CCGCAAACTCAGCTTTACAGAGTCTCTTACTGGAGACAGC 
CACGGTTATGGGCAGGAAGGACCTATTGAAGGCAAAATTTAGCCTGCT 

GTCCATTG CATCGTCCCAAAGG AGGAAGGC AAAG CGAATCCAAAGGTGCTGGAGAATCG CCGGCCT 

TCAAGGCCTTCCGAATTTGGCGTCCTTTCTTGGTTCTGAAATGAAA CACAGACGGTGTCTAG CAATCATGGCG 

5 0 CTCGCTCGCTCAGCTGCACTCTATGGCTCAGGTGCAGTGTCTTGAG 

TACTGCTGTCCCTCCGCTGGGTTCCTGCTGTCATTAAAAGGTGTCGCTGTCCCCACCCGG 
TTGTGCATTCAAATTAAGATTAAGGAAAAACATATTTTTAAATGAGTACC^ 
ATGTGGTTTATCTGTGATTTAAAAATAAAAAACACATGAACTTACAA 
TTCCCCGTTTTATGCATTTTACTATTGTAAATATGTTTTCrAAATCAAATACTTTAAAAG 

5 5 TTTACTTTTTTAT*TTTACTTTTATAATAAAAGTACA 

GGAGTCCCTGTGACTGTGGGAACAGCCCCACAGCCCGTTTGTCACATCCACAT^ 
TGTTCAAACTGTGGAAATAAAAGTGTTCTAACCCAAGACTGAGGTTCTGrACCCAGG 

TCCAATCTCT ACACCGTTCCTCTCCAG CGCATGCTGTCTGTGTCTATG AAGGATAACAG AC CCCAGCTTCAGCACTTCATCTTCTG 

6 0 GAAATACACACACACACACACACACACACACAGACACCACAGCAACTGGCTC 

CAAATGGTATAGCACATACTCCAGTAACTATTCCCTCCCCATGTTCATGCAGGCAGCCCTAATTAAACTCACT 
ACAGAAGCAGGAAAGTAGGAGGCGGAAACGAGAAGGCAGCGGGGTTTGTGGATATGATCTCAGTA 
CATGAGAAAATATACATTTTAATTAAAAACTAAATTCCAACATATGTAAAAGATT^^ 
CTGAAAGTAGAAAGAGGGAAGATTAATGGCTTGAAAGAGGTCTTAGTTTGGACrCTGGTGTGTGACCAA 

6 5 TGTAAAGATCAAATCCAAAACCTTCTGGTGAAGTGGCTTGGCAGTTAAGAGCACTGA 

TTCCTAGAACCCCTCACAGCTCACAACCACCTGTAACTTCGGTCCCAGGGAATATGATGCC 

GCATATAAATGGTGCACAG AT ATACATGCAGG CAAAACATC CATACACACAG AAAGAAAAAACCAAAAACCT AAGGCCTTGTCATG 

CCACACAAGTGATCTACTACTGATCCCACGGAAGGGCCTGGAAACCCACCAGCTATTACCT 

GCTCTCACCCCCATTTCCATGGTAGTCATATGAATGCCTTTCCCATTGCT 

7 0 CTGGGCTATCTCCCTCACACAGCTCAAGGTACCCCAAATAGACATCTTGTATGT 

CAATGAAGAAAATGGAGGTCCAGAGAGCGCAACTCAGTTGTGGCTAAGGTGCATCTGTGTTTAT^ 

CTTGTGAGTTAGAGAATGATTITATTGGGACAAAAAAGATGATAATGA 

AAATATCATTAATGAAGAAATGATTAATTATTTAAATATATAATACAAGCTACTTA^ 

AAG TG CCT AATC!ATCAATT AATCC-GAAT C CCTTTCT ACCAAAAATTC C AT AT GT ATG AAAATTT AAAC CAT ACGAATTTT AG ACAT 
7 S AC CACCAATCTT AGACAAACACCCTTG ACATCGAGAAGG CC CT AATCGAG ACTT CATT CTG TGATG T AAAAG GAG AG C AG ATI' C"f A 
CCTCITAGAATGCATCACAAATGAACAGAAACTGGTTAAACAGCAA^
CACCCTTATTCTGCGTTGGAAACTTTAAGACAAAACTGTCTTGCTT^ 
GCCCCAGCCAGGACTTTCTATGGCTGTTAGCTGTGCGTCTTAAAACATGGTGCTC 

C CTGGG TTT AGG CTCCTG ACT CCAGG C CAC CTGTGG CAAGGGAGGAACGATGCTGAG TGACTTC CGAAACT AGGTC CGAATTTCTG 
5 CCAACCTTCCCGGAACTACCTTCTCCTGGAACTCAGTCACCCTGTTCAGAGGOT 

GTGTTCAGCCAACAGCCTTAACCGAGGTCCTAGCCAAAGCTTGCCAAGCCGACGGCTG 

GT CTCAGCAGATG CTGAC CACAAT AAACATGGGCTGTTTTG CTGTTCTCAAAGG AGAG ATTTG AA^ CAAATT AAATACT AGTGGG 
GGTTTT AAG CCCTGGGTGT AG AGTGG ACACCACCATTCT C CAGGCATTTG AAAAGACAGCTTGC CT AC CAT AAGTTCAG CAGAAAA 
GATACTGACTGGAGAGACGC 

MOUSE SEQUENCE - mRNA 

AGCGGCa3CCGCGGGrcCGCTCTAGCAGCX3CAGGCCraiAG 
CCGCAGCC^TGGCGCGCCGCTGAGCCGCCCGCCCGCC^ 
AGCGGAGCCCGCAGGCGAGGAGAGGCCGCGCGCATCTCCAGGGTACCCT 
1 5 GAGTTTGACGGTCGTGAGCTGCGCATCTTCACCATGGCAGACGATGATCCCTACGGAACT 
TTTGACTCACTCAATATTTAATGCAGAATTATATTCACCAGAA^ 
AACCAAAACAGAGGGGATTTCGATTCCGCTATGTGTGTGAAGGCCCATGAC^ 
AAGAAATCCTACCCACAGGTCAAAATTTGCAACTATGTGGGGCCTGCAAAGX3TTATCX3 

CCACCIX5CACGCCCACAGCCTGGTGGGCAAG CACTGTGAGG ACGGGGTATGCACCGT AACAG CAGGACCCAAGGACATGGTGGTTG 
2 0 GCTTTGCAAACCTGGGAATACTTCATGTGACTAAGAAAAAGGTAT^ 

GGCTATAATCCTGGACTTCTGG TGCATTCTGACCTTGCCTATCTACAAGCAGAAGG CGG AGG AG ACCGGCAACTCACAGACAG AGA 
GAAGGAGATCATCCGCCAGGCAGCCGTGCAGCAGACCAAGGAGATGGACC^ 

CTGACAGCACTGG CAGCTT CACTCG^AGACTGGAGCCTGTGGTGTCAG ACGCCATCTATGATAG CAAAGCCCCGAATGCATCCAAC 
CTGAAAATCGTGAGAATGGACAGAACAGCAGGATGTGTGACGGGAGGGGAGGAGAT^ 

2 5 TGACATCCAGATTCGGTTTTATGAAGAGGAAGAAAATGGCGGAGTTTGGGAAGGATTTGG^ 

GACAGTTTGCCATTGTCnTCAAAACGCCAAAGTATAAGG 

AAATCAGACCTIGGAAACTAGTGAACCGAAACCCTTTCTCTACTACCCT 

GAAGCTTATGCCGAACTTCTCGGACAGCTTCGGCGGCGGC^ 

G CGG AGGGAGT ACCGG AAG CCCTGG C CCAGGGT ATGG CT ACT CG AACT ACGG ATTT CCTC CCT ACGG TGGGATT ACATT CCAT CCC 

3 0 GGACTCACGAAATCCAACGCAGC<X3TCACCCATC 

TGACGAGGAGAGTCTGACTCTCCCTGAGAAGGAAACTGAAGGTGAAGGGCCGAGCCTGCC 

TCGCCTTGGCATCCACCATGGAAGACAAGGAGCAGGAC^TGGGAT^ 

AGGCGACIACGCGAACGCCCTTTTCGACTACGCAGTGACGGGGGATGTGAAGATGTTGCTC 

GCAGGATG AGAATGGGGACAGTGTCTT ACACTTAGCC ATCATCCACCTCCACGCTCAG CTCGTG AGGG ATCTG CTGGAAGTCACAT 

3 5 CTGGTTTGATCTCTGATG ACAT CATCAACATGAGAAATG AC CTGTATCAG ACACCTCTGCACTTGGCCGTGATCACCAAGCAGGAA 

GATGTAGTAGAGGATITGCTGAGGGTTGGGGCrGACCTGAGCCTTCTGGACCGCT 

AGAAGGACACGAC AG AATC CTCAGCATCCTGCTCAAG AGCAGAAAAGCAG CGCCCCTTAT CG ACCACCC C AATGGGGAAGGTCTAA 
ATGCCATCCACATAGCTGTGATGAGCAATAGCCTGCCATGTCTGCTGCTGCT 

CAGAAGTCTGGGCGCACGCCGCTGCACCTGGCCGTGGAGTACGACAACATCTCCTTGGCTGG CTG CCTG CTTCTGGAGGGTGATGC 
40 CCACGTGGACAGTACCACCTATGATGGGACTACACCTCTGCATATAGCGGCCGGAAGAGG^ 

AAGCAGCAGGAGCAGACCCCCTGGTGGAGAACrTTGAGCCTCTCTATGACCTGGACGACTCTTGGGAGAAGG 

GG AGTGGTC CCAGGT ACCACACCCCTGG AC ATGGCTCCCAACTt^CAGGTATTTGACATACT CCTGT 

GTTCA.CATCTGATGATATACTACCACAAGGGGACATX3AAGCAGCTGACAGA^ 

TTCCTGATCCAGACAAAAACTGGGCCACTCTGGCACAGAAGT^ 

4 5 CCTTOTAAAACTCTCATGGACAACTATGAGGTCTCTGGGGGTACCATCAAAGAGCT 

AGAGGCCATTGAAGTGATCCAGGCAGCCTTCCGCACCCCGGCAACCACAGCCTCCAGCCC 

TGCCTCTCTCGTCTTCCTCCACGAGGCAGCACATAGATGAACTCC^ 

TTCCGCAAACTCAGCTTTACAGAGTCTCTTACTGGAG 

AGGACCTATTGAAGGCAAAATTTAGCCTGCTGGCCGTTCCCCCACACTGTGTAAACCAAAGCCCT 

5 0 AAAGGAGGAAGGCAAAGCGAATCCAAAGGTGCTGGAGAATCGCCC^CCTGCAGGGTCACTCGATTT 

TG^CGTCCTTCTTGGTTCTGAAATGAAATGTAGTTGCCACGCACAG 

CTCTATGGCTCAGGTGCAGTGTCTTGAGCTTTCTCnXaCTGCT 

GGGTTCCTGCTGTCATTAAAAGGTGTCGCTGTCCCCACCCGGTGTCCTTT 

ATTAAGGAAAAACATATTTTTAAATGAGTACCTTGATGCGCAATAAAAAAAAAGACATTTCTTT^ 

5 5 TTTAAAAATAAAAAACACATGAACTTATCAATATTTAAAAC 

TTTTACATTTGTAAATATGTTTTCTAATCAATACrTTAAAAGAAGAATG 
ATAAAGTACAGCACATGTGACT 

MOUSE SEQUENCE - CODING 

6 0 ATGGCAGACGATGATCCCTACGGAACTGGGCAAATGTTTCATTTGAACACTGCTTTGAC 

TTCACCAG AAAT ACC ACTGTCAACAGATGGCCCAT ACCTTCAAATATTAG AG C AACCAAAACAG AGGGG ATTTCG ATTC CGCT ATG 
TGTGTGAAGG CCCATCACACGG AGGGCTTC CGGGAGCCTCTAGTGAG AAG AACAAG AAATCCT ACCCACAGGTCAAAATTTGCAAC 
TATGTGGGGCCTGCAAAGGTTATCGTTOU5TTGGTCACAAATGGAAAAAACATCCACCT 
CTGTGAGGACG^GGTATGCACCGTAACAGCAGGACCCAAGGACATGGTGGTTGGCTTTG 

6 5 AGAAAAAGGTATTTGAAACACTGGAAGCACGGATCACAGAGGCGTGTATTAGGGGCTATAATCCTGGA 

CTTGCCTATCTACAAGCAGAAGGCGGAGGAGACCJGGCAACTCACAGACAGAGA 
GACCAAGGAGATGGACCTGAGCGTGGTGCGCCTCATGTTCACAGCCTTCCTCCCTGACAGCACT 
AGCCTGTGGTGTCAGACGCCATCTATGATAGCAAAGCCCCGAATGCATCCAACCTGAAAATCGTC 
TGTGTGACGGGAGGGGAGGAGATTTACCTTCTCTGTGACAAGGTTCAGAAAGATGACATCC^ 

7 0 AAATGGCGGAGTTTGGGAAGGATTTGGGGACTTTTCCCCCACGG 

ATAAGGATGTCAACATTACAAAGCCAGCTTCCGTGTTreTTCAGCTT 

TTTCTCTACTACCCTGAAATCAAAG ACAAAGAGGAAGTG CAAAGGAAACG CCAGAAGCTT ATGCCGAACTTCTCGG ACAGCTTCGG 
CGGCGGCAGTGGAGCGGGAGCCGGTC^TGGAGGCATGTTCGGTAGTGGCGGTGGCGGAGGGAGTACCGGAAGCCCTGGCC 
ATGGCTACTCGAACTACCGATTTCCTCCCrrACGGTGGGATTACATTCaVTCCC 
7 5 GX3CACCATAAACACCAAATTTAAAAATGGCCCTAAAGATTGTGCCAA 
AACTGAAGGTGAAGGGCCCAGCCTGCCCATGGCCTGCACCAAGACGGAACCCATCGCCTT^
AGGACATGGX5ATTTCAGGATAACCTCITTCTCGAGAAGGCTCTGCAGCT 

GTGACGGGGGATGTG AAG ATG TTG CTGG C CGTG CAACG CCAT CT CACCG CCGTG CAGGATGAG AATGGGG ACAGTGTCTT ACACTT 
AGCCATCATCCACCTCCACGCTCAGCTCGTGAGGGATCTGCTGGAAGTCACATCT 
5 GAAATGACCTGTATCAGACACCTCTGCACTTGGCCGTGATCACCAAGCAGGAAGA^ 

GAC CTG AG CCTTCTGG ACCGCTGGGG CAACTCTGT CCTGCAC CTAG CTG CCAAAG AAGGACACGACAGAATC CT CAG CATCCTGCT 
C^AGAGCAGAAAAGCAGCGCCCCTTATCGACCACCCCAATGGGGAAGGTCTAAATGCCATCCAC^ 
TGCCATGTCTGCTGCrcCTGGTGGCTGCCtjGGGCAGAAGTCAATGCT 
GTGGAGTACGACAACATCTCCTTC^CTGGCTCCCTC 
1 0 ACCTCTGCATATAGCGGCCGGAAGAGGGTCCACCAGACTGGCAGCT 
TTGAGCCTCTCTATGACCTGGACGACTCTTGGGAGAAGGCrGGAGAA 
GCTGCCAACTGGCAGGTATTTGACATACTAAATGC^AAACCGTATGAGCCT 
CATGAAGCAGCTGACAGAAGACACGAC^CTACAACTCTGCAA^ 

CACAGAAGTTGGGTCTGGGGATATTGAACAATGCCTTCCGGCTGAGTCCTGCTCCTTCTAAAACT 
1 5 TCTGG<XK3TACCATCAAAGAGCTGATGGAGGCCCTGCAACAGATGGG CTACACAG AGGCCATTG AAGTGATCCAGGCAGC CTTCCG 

CACCCCX^CAACCACAGCCTCCAGCCCCGTGACCACTGCTCAGGTCCACTGTCTGCCTCTCT 
TAGATGAACTCCGGGATAGTGACAGCGTCTGTGACAGTGGTGTGGAGACATCCTTCCGCAAACTCAGCT 
GGAGACAGCCCACTGCTATCTCTGAACAAAATGCCCCACGGTTATGGGCAGGAAGGACCTATTG 

HUMAN SEQUENCE - GENOMIC

TTGACAAGCTGGTTCTAAAATGAATTCAGAAATGCAAAGGACCTAGA^ 
GACTAACACTTCCTGATTTCAAGACTTATTGTAAAGCT 
TCAATGGAACAAAATAGAGTCCAGACATACAGATACAACTGATTTTTAAC^ 
TTTTrTTTCAACAAATGCTGGAGAATCAGTTGG 

2 5 GCAAAACTTAACTCAAAATGTATCATAGACCTGAATTTAAGTCCTCAA^ 

TAGTGAGTTTGAGTCAGGCAAAGAGTATGTGGATACAACACT 

ATTTAAAATCTTCTCTGCAAAAGAGGATGAAAAGACAAGTCACAGGCTAAAAAATATG 
ACTTTTATCCAGAATATAAAATGAACTCTCAAAATTCAATAATAAGAAAAAAAATAC^ 
CATTTCACCAAATGGCAAATAAGCACATAAACAGATGCCCATCATCATTAGTCATTAGGGTAA^ 

3 0 ACCACTAAACATATCAATTGAGACGATTAAAATAAAAAAGACrrGACCATACCAAGGAT^ 

CATTTCCTGTAGTGGGAATATAAAATAATATACTCTTTGGGAGAACAGTCTGCAGTT^ 

ACGCTCTAGCCATTTCACTTGTAGGTATTTCCCATGACAAAAGAAAGTATATGTCTATATAAACAGTTAAATAGTTAC 

GTTTTAGTGGTTTTATTAATATATCCC^AAACTGGAAACAACCTAAATGTCTATGAACAGGAAAAT 

TCTATACAATGGAATGCTACTCAACAATAAAAAGGAATTAACTACTGATACAACAAGGTGGATGAAT 

3 5 ATGAAAGAAGTCAGATAAAAAATAGTACCTACTGTATGATTCCATTTATATGAAACTCTAGAAAATGCAAACT 

AAGGAAATCAGTGGTTGCTTAGGAATGTGGAGGAGTGAGGAATGGAAATGAGGAACTGC^ 

G ATG CACATGGT CACTGT CTTTATT ATGGTG ATGGT CT CATGGGT AT AT AT AT AT AT CAAAAAGAACT CTCT CTCTCTCT CT CTCT 

CCAATCCCCCAACCCCATCTCTTAC^CAAGAAAGTATGTCCCTAGTATATCTGG 

ATGAGGAAACTGTTAACACAGGGACAGAAGAAAAGAAAATGTCTCTGAAGGGAAGAGAGCCAGA^ 

4 0 AGATTTTCAACGATGTGACCTGATGAAATTCCATAATGGTTAAGTC^TTTTGA 

TT AATT AT ATTA CTTATTT CAAAG AAAATGC AGT AAAAAACTGAT CTG T AT ATCT ACTG AT ATGT CT AT C AGT AAAACAG AT AA^ 

T C CTTTTTTTG AT AACAGTG C CC CAATTT ACG CTT AG CT AAC CAT AAT CTAG CATTACTAATT CATGTCTT CCAGATGAGAT AGAC 

4 5 CCTCCTTGTCACTAAAAGTAGACATGTGACTCACAGCATTGTAAGAAGATAGTAATGGAACACAGGTCCCCAAATT 

ATCT CTT CTCAATT ACT1T CC CCAACAAG AAAAAATGCT ACTGCAGGT AAG CAG GGG CAT CT AAAG CAGGGTCACTTGAGGT C ACA 
GG CTCCAGGGTCTCTAAAAGGAAAGAGATACTCTCACTCnTCTCT 
AACAAAGGTCTCAATTTTATGGCTTCCAGTGAAAGTTAGCCTGA^ 

TTTGTGTGTGTTCTTGGGAAGGTCAGACTAATCTGGGTAGTGACAGGTCATATGGAGGAACGCTG 

5 0 TAGCCCCTGAGGTTAGG CAAG CATTCTCCCTTGCACCCAGGG AG CTCIXX3G AGAAG AAAG ACTTTTTTT AAAGCTTCAGCTGGGTG 

TGGACAGATTGCCAGTTGGTTTACTGCCAGCCCACAAAAGCAGAAACCTTTCCCCTGG 
ACTACTTGTCCTGTAATTATAGGTCAGTGTATACAGAGCCTGAGGTCTGAACAGGAAATGG 
CTAGCAAGCCAGAGAGGAAAACTAAGATAAACACTTCAAACAGGTGTTTTGTTGA^ 
TTTTCTAAAAAGTAATAAATTCACTCAAGAAGATATAATTAAAGATTAGGA 

5 5 TACATATTTTAAAAATAGGTAATGACTCCATGTGTTTCTGACAGTATACCCTTCCTGGTTGAAATAACTAA 

GGAAATAACTGAAATGCTACATAACATATTTTTAAAAATTAATACTGCTTAGCCAG 
AACTGAAAAACTGGAGGGAATAGCTGAGAAATTtjACTTAGTCTC 

TAATGAAAAGAAATTTGT ATCT CACAGTT CTGG AGG CTGGG AAGTCCAAT ATCAAGGTG C CAGC ATCTTGTG AAAGC TTT CTTGCT 
ACATCATCCCATAGTGAAAGGCAGAAGAGCAAAAGAGC!AGACACrCA^ 

6 0 CCATTTATGGGGGTAGAGACCTCATGACCCAATCAGCTCTCAAAGGTCTCACCTCT^ 

ACACAGGAACTTTGGGGG ACATG TTC AAACG AT AACACTG C CT AT CT AGT CCTTGTCACTG G ACAG AAAAAGAAG AAATTCTCAC C 
TGTGAATTCATAACCACTTACACTATATGTGATGATCAAAAGCACAC 
TATACCAGATAACAGGCAAAAAGAAGCACAAATATTCCTTACTCAATTTTAAC 
AGAAACGTAAGCAATTTATTATTTATAATTCAGTTGACATATTAAACAGCATATGAAATA^ 

6 5 GAAGGCAAAT AT AAAGAAATT AT CCAGAATGAAAT C C AAAG AG AT AAG AG AT ATT AAAAG ACACAG AG ATT AGATTG AGAAG ATCT 

AACATATACCTAATTTAAATTCCAGAAAACAAGTCTAGGGAAA^ 
TAACTGTTGAAACATCAAATCTTGAGACCCAGGGAGCCATATAACATGCT^ 
CATTAATATTAATATTATATATTAATGTTATTGTTAATATAATCATTATATTA 
ATATATAAATGTATATGATATATAATATACAATAGTTATAATATATAAT^ 

7 0 TAACTTATAAATTATATATAGCTTTATATATAAATCAATAATTATATATTGATGTGTACAATTATTATTTTA 

ATTATATAATTGGTATTATATAAATATTAATTTTATATTGCAATAGTTATAAAGTAAATAT 
TATAGTATATTAATATTAATAACTTGTATTTATTAACAATATATTAATAAATGACATAATTTCAC^ 
AAGCTTATCAGGAGTTATATATATGCAAAACCATATTTATCTTATTAAAGAGGC^ 
GATTTGACAGCCAGATATACAACCTTTCAGAAAACAG 
7 5 GTCCAGTAACTAGCCTTATTTCACTCAGAACAGTTTTGT^ 
GGTTTCCTGCGCAAGGGACTCTCTGTGCTAAAAGCAAGATAGTC
CATAGAAATCATTAAAAGATATCTCTCTGTTATGGGC^ 

AGTACCTCAGAATGTAACTGGATTTrGAGATAGGGTTTTTAAAGAGGTATATAAGTTA 
CTAATATGACTGATGTCCTTATAAGACGAAATGATTTGGACGTGGACACAGAGG 
5 ATCTGCAAGCCAAGGAGAGAAGCCCTCAAGTAAACCAATGCTGCCAACACA 
AATATTTCTGTTGTTTAAGCCACCCAGTCTGAGATATTTTGTT^ 

CATTAATCTTTCTGTTATTAACGCTATTAACT 
1 0 GTGTTAGTCAAATAAATGAAGAATCACACTAACTCCTCTTTCCTTCCCTCTCACT 
TGCCCTCGTTTCCCCATTTCCCACTCACTTGTCAGTCT 

ATGTAAGACCACAGCCACTGCGTATCAGTGCTATTTGAGAGTTAATAAAGTTCCCTTT^ 
GAAGAAAGACTGGTTTCTGCCCTTGGCTGCACAGCCATGCC^ 
CTTCCCCTCTCCATCCTTTAGGAACTGTCTGCAGTAT 
1 5 TTTCTGAGCTTTCCTCCCACTTTTGATGGCACTGCAG 
CCTCGAATTCCTGTACTTAGCCCCTTGTCTC 
TTGCAGTTGTTTCCATTTCCAACGCCATGTCTCr 

GGACATCTCACTAAATCTGTCATACTAAACATGTCTAAAATATGTATGTTTACATGTT 
TTACATTCCCACAGATTTTTTCCCAGTTACAGTGTTACTATTCTAATCACT 
2 0 CTCTCACTCAAACIX3CATATATAATCTATAAACAGGTCCTGTTGACCT 
ATTGGATAATAGTTTAGTGTGGTGGTTAATATCATGCACACTG^ 
ACTAATTATGAAACCTTAGGAAAATTATTCAACTTTTC^ 
AACCAAATAAGTCAATACGTGTATAGCAGGTATTAACCATAT 
TAGTTCCCTGCTCTCTTGCATCTAAGTGGGGCCATGGAATGTGGGCAGAAGTC 

2 5 AAGACATACTTTTCCTATGTTCTGATTGCCAATGAACA 

AAATAGCTTCATGAGAAGGGAAATTTGTGTAGTTTCAATGATTT 

GTTGAAATGTGACCCCAATTTTGGAGGTGGGGCCTAATGGGAGATGTTTCGGTCATGGA 

CTATTCCCTGCAAGGAGGAGAGGATGAGTGAGTTCTCACrGCATTAGTTCCTATGAGAGCT 

CCCCTCTCTCTCGCCATACGATCTCTCTGCAC^ 

3 0 ACGCATATGCTGGTGCCATGCTTCTTGTACAGTCTGAAGACTGTGAGCCT 

TGTATTCTGCCATAGCAACACAAATGAACTAAGACAAGTGACAAACACAAAATAACTT 
TTCCACATTGAGGCATTTTTTTCCTCTTCGTAGC 

AT ATTC CTGGAAG CAGGTTTG T ACAGT G T AG AG AAAATGITGX3GTAACAAAG CAGGAATGG AGGAGGGTT AATCAGCAAG C CAAAA 
AGCTTTTATAGTTTTGCCTGCTGTGCTGCATGGAATTACATCCCCCCAAAACATATGTTCAAGTCCT 

3 5 CATGAGTTTATTTGGAACTAGGATCTTTGCAAATGTTATCAAGTTAAGATGAGGTAATACTGGATTGGGGT 

TGACTGGTGTCTTCATAAAAGAAAGGTGAGGGAGAGTTGAAT 

GTAGGCACAGAATGGAGTGTTGCAGCCAAGCCAAGAATGGCAAGGATTGCTGGAGCTACCAGAA 

TTCCCTAGAGCCTTCAGAGGGATCATGGCCTTAATGGCACCTTGAGTTCAGAATTTTAGG 

CCGTTGTAAGCTGTTCAGTCGGTGGTAAATTTTTACAGCAGCCCCAGGAAAATATGCCTGCT 

4 0 TTCATTTTATGGAATCATTTTGAGTTTGGAGTTTGATTTAA 

TCACACTGGCrCTACrCTTATAACAATTGTATTTATCTATCTTTTGGTAATGGGGCTGCCACC 
GTGTATGCTCACACCTCCCTCCCTCCCTCCCTCTCT 

GGCTGGAGTGCAGTGGCGCAATCTTGGCTCACTGCAACCTCTGCCTCCAGGGTT 

GCTGGGATT ACAGGCGCC CX3CCACCACGCCCGG CTAATTTTTTTGTATTGTTAGTAG AGATGGGGTTTCACCAT ATTGGTCAGG CT 

4 5 GGTCTCAAACTCCTGACCTCAGGGGATCCACCCGCCTTC^ 

CACAGAATTTCTTAAAAAGTGATTGTGTTCAGTCTT^ 

CACTTAAAGGTCGACTGTTGGGGCCTCCAAGGTCGCCT^ 

GTGTGGGTTTTTTTTGTTTATTTTTTTTT^ 

GAGTGGGCACGAAAGGACTAGGATGAACATGCCATAAGCAATCTAAATGCCACTGCAATATTC 

5 0 TGTTTTAAAGCTAAGCTTTCAGTTGTCACTCCAC 

ATGTGGGGAGGTAATCCACCCGAAGGTATCCCCAGCCTGTACCTAATACAGTGCCCAGCACTAAAGCAGCTCA 
GGTGGCCACTGGGAGGCCTGTGAGTGGGTGCCAGTAGCGGTCTC 
ATGAGCTGTGAACCAAAACCATTGCTACCACCATCACTATAATTCT 
GATGAACATTCTGAGTAACTGCTCTATAACCAGGAGATTTAAGACCGC^ 
55 AGTCGTTTATGATAACGAAATTGTAGTTTAATCTGTGAAGAGATGTG 

GCirTATTTTTATATCTGGCATGCTTGGATCCATGCCGACCCTCCAGCT 
TCTATCAGCGGCACTGCCACCGCCGCCGCCTCCGTGCTGC 

GCTTCCCTACCGGCTCCAGCCCTTGGGATTCGGGAGCGCCCTGCTAGGAAGCCAGAGCCCCGCAGG 

CTAACGCGCGCCCCTCGCCCGGCGCCCCGAAGCGGCCCCGAGGGGCGGGAGCCGAGGCGAGCGGCAAGGCCGGGCCGGGGGCGCAC 
60 AGCGCCCCTAGAAGTGCGGGCTTCCCCC^CCCCCGGCAGC^ 

CTGTATGCTCTCTCGACGT(^GTGGGAATTTCCAGCCAGGAAGTGAGAGAGTGAGCGAGACAGAAAGAGAGAGAAGTGC^ 
AGCCGGGGCAGGAAGAGGAGGTTTCGCCACCGGAGCGGCC 

GCCAGCCGCCGCAGCCCTCGGCCTGCACGCAGCCACC^CCCCGCTCCCGGAGCCCAGCGCCGCCGAGGCCGCAGCC 

GT AAGG CGGCGCCG CCGCCCGGCCACCG CG CGCCCTGCG CTTCCCTCCGCCCGCG CTG CGG CCATGGCGCGGCG CTG ACTGGCCTG 
65 GCCCGGCCCCGCCGCGCTCCCGCTCGCCCC^ 

CCGGCAGGCCCG CGCCGCTrAGGAGGGAGAGCCCACCCGCGCCAGG AGG CCG AACGCGG ACTCGCCACCCGGGTAAGCCAAACTCG 

GGCGAGTGGGGGCCCGGCAGGGGACGCGTGGCCCAAGTCCTGCCGCCC^ 

CATTCCTGCGCTAACCGCTGGGCTAGACCGTGGGAGGAGGTTG 

ACACATCCGGACCTCGCAGGGGCTAGTCGGCCGAAGGGCCGCG^ 
7 0 GAACGCGAACCGGAGGGGAACTCTTACAAACTTTTAAAATCCCCAACCCCAGCCCCACCTC 

AGTGCTTGTCAGTCCTCTTGGCTGGGAAAGCCCTCCCCAGCTCCCCAGAATTGAATAGGAG 
TAAAAGAAACCTGGCTCGCTCTCTGTCTCTCrCTCT 

CACACACAGACACACACACGATATAGTCACCTGCTATAGGACTTGATTCTGATC 
7 5 ATTACCTACCTGGATACTTAGAGCACTTTTTAGACTTGATTATGTAAAACTCrT^ 
TG AAA CAAAAAT AATGTAAAAAC AAGGC CT AT C CT AAATGCAAG TTTTT CT ATGG TCCTT AAAAACAG ACACCAGGO^TGTTTGC

CCTGACTTGCTGGTTTATCCTGAAACTCAAATGGGACTTAATGATACAG 

TTAAGTAGAGGTTTTCAGCATTACAGTTTATGAAATA^ 

TGCTTATGGTATTACTAAAACATTTAATAGCTCTGGCATCAAAATATTCTACATACTC 
5 GGGTCAAGATATTCAATAATTTTTTTCTAGCAGAATCCCACAACTGAATATTC 
AACTTTAAAAGGGAAACTTAAAACTAAAGTATGTTGTTTTTCTGATTTTA^ 
TGAGGCACTATAGGAATTATGTATTTAAATGTATGTATTTAAATTTGAAGCA^ 
AGGAAATAAAAAGATTTGTTTCCTGTAGAATTTTACA 

GGAGACTTGGAATCTGAGATTATTCATCCATATTGATACACCTGCAATTCCTAAATGCCCT 
1 0 TTCCTGGGCTGCAGCCTACTGGACAAACTTGAACACAAAGGAACACArc 

TGCAGTGAAAAGCAAACCACTTCTAAACTCTTTGAGACTTTT^ 

AAATCTATTATCTTTTTGCTGAAATTTCATTCTTATGTTAGGCATTGGCACT 

CCCAGATAGTAAATACrrGGGGGTATACATACAAAAATAGTCCITCCCCTTT 

CTCTACCTCTTAAACAAGATCTGTGTTGATTGAGTTGATTAAAGCACAA 
15 ATATTCCTTACATTTCTGTTGTTACTGTTC 

ACACTTTTAGACTTGCAGATAATTGATTAAGTAAGTTACTACCTTGACT^ 

ATAGGGAATTGGCAGCTACATGAATGTTGGACTATCATTCCATCAGCAAGACCTT 

TGTTCAGAAGAGGTGAAAGTTCTGGCTTCTGGGGGGAAGTGGTTACTTCATAACCTTCAATTGG 

AAAGTAGTCGATATTTTCAAACAGTAAAATAATTGTATCTGAGTTGCT 

GTATAACCAATAGAGTAAATTCTCAGATTATGCTCTTCTGGTCCACAGATTGATCAGAAATCACAT^ 
GCAAAATTTCTCATTAATTTTCTTACTTTTAGTAGGAAGTTTGTTTTATTCGG 
TGTCTACAGTAGTGGGATTGTTATATTAGAAGAGAAGGAGAAGGAAAAATAGAAGTC 
CTCCTACATCTACATTTGTGTCTAATCAGCATTAATTGATGACTACGTCTTGGTAAACAGTAGACC^ 

2 5 GCTTTTTGGGAATTAGTTCTCATTTGCACTCTTGCACCTCCCATAAATACT 

AACCATCAAGAGCATATGATCTCTCTTTGAATTTTGTCACTACTTTGGAG 
AGGGAGAAAGGTTTGAACGTTTTGGCATAGTCCTGCAAGTCAGTTTGGCTCCTTGACT 
TATTTGAAACCACATAATTTGGACAGTTAATAGTTCTGGTAACATAACCAAGC 
CCAGTGCCTCACTTTTAGAAGAAAAAAAGAGAAAAAAGTAGTAACATTCTTTTAATGATG 

3 0 CTGCTGTATTAGGGGTTGTTCCTGGGTGCAAATTAATATACTGTei riiAC^TTTCCGGAGGTTCTTGACCTGGCCTTGAAGATCA 

G CCAACAGTTT AG AAAAT AAGTTGTGCAAATTG CATTG CTAC AC CTCAAAT AATTT AAAGCCTGACAGTTCTGT ACAG AAT AAG AG 
TCACATAAGATGTGATTTGTTAAAGATTATAGAAATCTCAGAAGATAGATTATGGATGAAAATAT 
TAATAAAAATAGTGCATTTTACATTTTAAAAAGAAAACAATTTTTA 
CAGTAATACAAATTTCATCTGCTTTAATTTATTGCCTGTTTTTGAAAT^ 

3 5 ATTGCCTTATTATTTTTCATTCATCTTTCA 

ATCAACAGACTTGAGTTCATCTATGGATCCTATTAGTTTAAACTGTGACAAATTTTGACTGC 
CTTTAAAATAGCCAAAATTATAAAAAATGAAATAAAGAAAAAGCTGTTTGAGAGAATCAA^ 
AATATCTAATCAATACTGACGAAAGAAATATCTGATGCTTAGCATTGTACTTGTTGGTACATAGT 
TGAATGAATGATATGAAGTTGTTGCAAAATTTTTTTTTAAATGGACAGCCATCT^ 

4 0 TTGCCTTCTGGAAAGAAAGGAGCACAAAATAGTCCTTGTGTCTTAGATATTATGGAAAT 

TAGAACAAATAAT1TATTAAAAAGTATCTGGCGCCTTCCAAAAGGGGAACATTT 
AAAATGCCTTTTAGTTTCAAAGTTTTAGAGGCACCAGAGCAAGATTATAAAGTAATT 

AAAATGT AATTTTTTGGTG AAG TT AGAATTT CTGT AC ACT C CATTG AAACTGT AAT AT CAAAT ACT AAT ACCT CCAAAAAAGAAAC 
AATG CT ACAAG C ACTG CTTGG T AAT CT AAG AATTGTT AAC ACTT ATCT AAATT AAT CAACCAAAAAGTT AAAACAAAAT ATGTT AT 

4 5 TTTGTTATGTTACCTCTAGAATTAAACACCTATCACCCATAAATCT 

AG CAGG CAGTT AG CAGGGTTTACTGTACTTTCCACTTT CAT ACTTT AATG AT^ CAACACT AT AG AAG CTGGGGCCTCATC CT 
AGCACAGTTTGTTCATTATTTTTGAGTTGTAGCATTTTTTAA 

GACACAGTTTTTT AAATG AAAT ATACCATTT AG ACT AG AAAAG AG CAAAATG CCAATACT CC CTG TC CT AG TG TCATTGTGC CC CC 
TGGGAACCCTAATGCAGTCCTGTGGGAAAACAACCATTTTATTGAGCGCT 

5 0 TT CTGGTTCTGACCCTGATCT C CCCTCACAT CACTC ACAACAAACATT CAAGTTT AAAAATT AAG TT AAAAGTT AGTT ACAT AT AT 

ATGTAACTAATGTGTATATTAAATATACTGATATATATTAGTTACATATACAGATATGGCA^ 
ATATTTGTAACTAATATGTATATTAAATATAATGATATATGTATAAATACATATTTCT 
TAG TT AC AAATATG TAT AT AT AAT ATTTTTTGTTGTTGCCAG CAAAATG AGAAAGAA^ 
CTGTAAAGCCGTCATTGTTGAAAAGTACrrCCGAAGATTAATAGCGGCAGTGTTTGTTAAAATA^ 

5 5 CTGTGACTTCCCCATGAGATGGCTTCCAGAAAAAGATAAATTTTTGTGTGTTCCCTTAC 

AAGAGAAGCATAAAGAAAACAAACAAGGCTCTAACTTTGTTATAATCCTTAC^ 
TTCATATTGGAAAATAAAGCTATTAGATTTTTCTTTGAGATGAACCTTTTTT^ 
GTAGAGATGGGATTTTGCTCTGTTG^CCAGGCTGGTCTCAAACrCC 
CGGGATTACAGK3CGTGAGCCACCTAGCTTG^CCAAGGTGAACCTTTT^ 

6 0 TCACT AACTTG AACTAAAG AAAGTATAAGAC CTG CATAAAGTTGAGCATACCTCTAT AATGTCCCATTTTAAAAAG CAATTTGTTT 

TTAATGACTTTTATTTGTTAAAAGCAGTTATTTCAAAAAATTA 
ATATTTCAGTGTTAGGGCACACCTATCCCTCTTTATCCCAGACGTCA^ 
AAGTGATGACAGACTCCCTAATCAACCrATTTATTGGTTCTAAGCCCAAGCCTGC!AGT^ 
AATTGTCTTAGAAAATATGTCTTCCATGACAAGAATTTTAGG 

6 5 AATCATGTTTTATTGATAlTTGAAGAACCrcrTTGTCrACCAGTTCT 

GTGTTTGACTGTTAAGACTTGATCTATAACCTTCAGTGTCTTTTTCTTT 
AATACCTTACAATTTATAGCATCTGCATATAATACAAGAACCTTCATGATTTATAAA^ 
AAGAAGCATGTGTGGGTGGCTATTTATGTAGAGGAAATTAGAAGTT^ 
TTTTGCTCAGTTGGATAGTCAGGAAAGTACTTAAATCAGCAATTGATTTACGTAAAATC^ 

7 0 GGAAATAGGAAAAAAATTTTTAGTGTGTTCATACrGCTAATTTCAAACCAAG 

GG CTT AAAATG AACC AAAAATATAG CC ATTT CT CTAC CCT ATT CATTTAATGCATTCATTT ATT AT ACTG CTAAT ACATG AT AAGG 
TGAATAGGTTAAGCATCGTTTGTATATGTGTATGTGATTTTAAACCTGAGTCT 
AGCATAAAGATTTAGGGGGAAGAGTCCTCTATAAAACTAAGACTTAAACT^ 
GAAAGTGAATGAGAATTATAAATAAAAGGACAAATCATATATCT 
7 5 CATGGAGGGTCATAATGAAAGAAGTAAGAGAGATTGCCCTTC^ 
TTG CT AAACTGG T AAG AATGGACTTTT CAC CACATT ATG AAAAAACACTTT AG AAG T CAGTTGCG ATCAAGCAAAT AT AAG T AC CT
GGGAAATT AAAATCAAAAG CACAG CAGTCCAACAAT AGTCAAATG CATTT ACTCACT CT CAT AC CAT AT CATAGCGTT AATTTGAG 
ATAATGTCrrTTATGGTGTCTCTTTATATATTTACTCGATTrCCCAT^ 
CTTTAAAAATATTTCCTATTTTAAACCATTACATAATCATTTACC 
5 CCCATTTCTCTTAGAAAGTCTTCTGTAATAGCCAAATAGGAGTCAATC 
TAGATGACTTTTGGAGTAATTGTGCTATATTTACTGAGTATCTGTTG 
TGATTTTATTGCTGAGGCACTTAAGTATCTTCACA 

ACAGATTGGTATCCCCAGCACATTAGGGAATGCG<rrTGTTAGCCTGTTAGG 

AAGCCCTTCTTATACATTCAGCTTTCTTCTTCAGTCAGTCACCAAGTTCT 
1 0 TCCTCGTTGAATTTCn > AGATTTTATTrTTGC^TTGTCT 

GTGTAAATCGGATCATGTCTCTGCAGTGAATCCCAGTTGTCTTCACAAGCT 

TGTTCCCCAACCATCCCCTAATCTCAAATCCTCCACACCGACITTCAG 

TATGCCCTCCCTGTATGAGAAGTGCCTTTTCITTCCrAGCCATCCTC 

TTTCrCTCACTTGCCnTTCTGin'TTCTCCCTCCCCTTTCCATAAAGA 
1 5 GGTCCTAGAAAAGAAACATCTTGAGGGCAAGGAGTTCATC 

GTAAATACCGAACAAGATATCAAGAGACAGTGCTGCCAATTAACTGTGTGGCT 

G AAGACTT CAGATTTT CTTG C CAT ATCAAG AT AGK^CAACTT AAG AAAAATGCATG TGAGT AAG T AC CAAGC CAATTCTTGATTAT 
CAGTATGATAATATGAGGAA,CTGCCCATTTTGTCGATAATAAGTTATAA' r ' l ' l ' 1 ' 1 'GGTCTCAGTGTAGGTTCITTCTCAAATCCAGA 
2 0 TGTTTCTGGTGTTGCATTTTTTCATTATTATTCAGTCTCTCG 

AATTGAATGTAAAGTTTAATATTTAAATGCTAGATTGATTCAAGGTCTCACTA^ 
GGTGGTGGTAGTATGAGCTGAATCCTCCTCTCTTCCTTCCTAGAAG^ 
TCTTGGAGATATGAAAAGAACCACCAGGTAAACTAAAAAGA 
GAGGAGG<MTGGAGTAGGGCAAGCATTGCITGTTTATAGAAGCTCIT 

2 5 TTTCCTTCCTTTTTTTTGACCTCCAGCTTTAGTATAGTACAG 

GATATACATATGCATTGTGAAATGATTAAATCAAGCTGATTAACATAAGTAGGGTTTT 

TATGTTGAGTCCTTTCTATGTGCCAGACACAGTTCTAAGG\ACAAGGCACACAGTGC 

AGCTTAAATCATGTAGTCTTTTCAGTTTTTAA^ 

CCGAGAAGGGAGGATTGCTTAAACCCAGGAATTTGAGACCAGCATGGACAAAATA^ 

3 0 AATTAGCTGGGCATGGTGGCACATGCCTGTGGTCCCAGCTCTTC 

GGCTTCAGTGAGCAGCGATTGTGCCACTGCAGTCCAGTCTAGGCAACAAAGCA 

AGAGAAACTTTATTTTTACATTAGCCrTTTTTAAGTTGAAGAATCCCATAGC 

AGTGATTTATCTCAAGGTGAACACGTTTAACCTAGTGGGTGGTCAAGAAATAGAACACTGGC^ 

CC C TT l ' C ' TT AGTTAGGAC CCCTTCCCTCCACTGT CATTT AAAAGATTTCACAAACAACTTTTGGATTATATGTATGTCAATTC^ 

3 5 G ACTCAT ATATAT CCACAT AGACAATGG AG CGTTGACTG AATG AATT CATT ATT C CAAGGTG CT AAT AATGAG CTT CACAC CAG CG 

GGAACTCTGCTGTGTCAGTTGGGCTCTAAGCTCAGTTCATTTCAGATATTCCTCACTAAAC 
AGTAAAAAATAGAAAACAGATTACATTCTCATTTACTTATTACTTATGAATGTAAAAGTAAATATTTTGCTTAA 
AGGGTCACTGACAGAGACATGGGAGAAAGAAGTATTCACATCTGATGTTATCAGACATAGAAGAGAGA 
CTCCCCACTCAGTTTATACATTGAC^TCTGTATGTGTGTTGTTTTGTTTTGTTTTGTTTGT 

CCCAAGTAGGTGGGACTACAGGCATGCACCACCACACCTGGCAAAATTT^ 

GGGTCTCCCTATGTTGCCAAGGCTTGTCCCCAAATACCTGGACCTCCCAAAGTGCTGGGATTATAGG 

C CATGT AT ATGTT AAAG CGTTT ATCTG T ATG AGTGGTCAGTGTAG AAAGGG CTTC ATGG AGC CTGATT^ CTT GC AAATT A 

TTATATTTGTGTATGTGTAATCGGAAGGCCTTCGCCGTGAT^ 

4 5 ATGTTCATAGGACCAAACATTGCTGGTTCACATAATGACTCTTTTTCTTTTAAAACATAAAGGACA 

GATTAATAGAATAAAAATCTAAAGTCATATTCAGATTCAAGAGAACTGGATCAGTTTCTCT 

TCTAATTCATAATGCATGAGK3TCTACTTCTTCCTCAGTCTGAAAATATGAATA 

TCAAAATTAAAATTCTGTGATACTATGTTTTTAGXX3AAGAAGAAAACTCTAG 

GTCCAGT ATGTCAAGT CC CTTT ATAATGCCAT CTGAATG AAAG AC CT CACACCATGAACAG AGG TATTG AAG CAGTTTTT ATGTGG 

5 0 AAATGGTAGGCAGCAATATCTCAGGAGTGACTTTAATTATAACATTTAAGGTTTTATCT 

TCTAGCTCTACAAGTTATTCTTGGCAGAATT^ 

T CTT AATT ACAATATAATTTTAAAATG C CCAAAACATCTTTTG AT CTT ATGAATTT C AG TCAGCTATG ATT AAG AAGACAACCAT A 
GGCAAAAATTTCTTTCTTATG<IAACACGTTGGAATGCATT 

AT ATG AT ACAG CAAT AATTTAGATG CTT CTTTCAGGGGC^TG AGGTT ATGT ATTCT ACAGT CT ATTGATTCT AAGAG GCACTTTTT 

5 5 TCTCCCCCAATTTAACAGTTTTGTTTAATTATATACATTTCAGGATTTTTTGGGGATA^ 

TCCACTGTATTTAGATAACATATTCCGCATGCTTTCAATTCTTTGAAATTTATTGAG 
TCTTCX3TGAATATTCCGTGTGTATTTAAAAATAATGTGGACTCTGGGCCGG^ 

GGAGTCCGAGG CAGGCAAATCATGAGGT CAGG AGATCGAGACCATCCTGG CTCACACGGTG AAACCCCATCTCTACTAAAAAAATA 
CAAAAACTT AG CCAGGCGTTGTGGTGGGCGCCTGT AGTCCCAG CTACTCAGG AGGCTGAGG CAGGAGAACAGCGTG AACCCAGGAG 

6 0 GCAGAGCTTGCAGTGAGCCGAGATGGCGCCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCATCTCAAAAAAAAAAA 

TAATAATAATAATAATGTG<3ACTCTAAAATCATTCAGTGGGGTGTT 
AAGCCTTCTATATCTTTCCTACCATGGTTTGAATGTGTCCCCACCAAATTC^ 
AAAAGCTAGAGCCTGTAAGAAGTGATGGGATTAGGTGCCCTTTTAAAAGGGCTTGCT 
GCCCrCTGCCAGTTGCGCACACAGCATTCCTCCCCTCTAGAGGATGC^ 

6 5 GGCTTCCCGTTGCCAGAACTGTGAGAAATAAACCTGGGCTCT 

CAGACTAAGACATTTCCTGACACTCTGTTGATTTGCTITGATCAAT^ 
C^TTTGTCATTTTTTCCATTAGTTCTGCCACGTTTTTG 
ATTGTTTTTCTTTGATGAGTTGGCCCTTTGATTCTTATATGCAATGTTCCT 
ACTTTATCTGATATTTTAGAATGTAGCCTGCTTTTCTTATATTTAGCATTTGCATG 

7 0 CATTT AAACTGTGTCTTTAATATTAAAGTACAATTCTTATGAAAAGCGTATACT 

ATCTCTGTCTTCTAGATGAGTTCAGTCCAGTTCCATGTAATATTAT^ 
GTCTCX3CTCT<5TCGCCCAG<5CTGGAGTGCAGTGX3CX5 

GCCTCAGCCTCCCAAGTAGCTGGGACTACAGGCGCCCGCCACTACGCCCGGCT 
ACCGTTTTAGCCGC^TCGTCTCC^TCTCCTGACCTCCTGATCCGCCCGCCTCGG 
75 CCACCGCG CCCGGCCCCATGTAATATTATTACTGG^ CTA 
TCTCT TCT TTATT CTT TT n 1 1 CTCCTTTCCTGCCATCCTTTGAATAAATC71AGTTTAGTGTTTTAAATTCCATTTTATCTC
ATTGGTT ACTTTCT AAAGTGG TT ACTTT ACCACTTT AG AAAAG T ATTTG TTT ATT ACAATATG CATCTTTGAGTCT CT AATT AATG 
TGAGAACTTGACAATGTTATACTTCTGTITGTCCTT 

5 CATATTTATCATTATTGATGCTTTTTATTCTTT 

GCATGTCTTTTAGCAGGTCTGCTGGTGATGAATACTCTTAGCTCI^ 
ATTT ATTTG CTGAATATTCTGGACCAATCTTTTTTTTT 
TATCTAATGATCATCAGAAACATCATGCTTATCATGCTTATCAATTC 
TTTTGGGATTTTTTTTTTCTCTAGCATTAGTTTTTA^ 
1 0 CAGAGAGATCTTTGAGATTCTTGGATCTGTGGGTTGATTTTTTA^ 

T ATTT I T T CTGT C'l Tl ' l'l T CTG TTTCTGGGATT CTATGTT ACAAAT ACGT AAGACTGCTT AAT ATTGT C CTGT AGGTCACTGAAGC 
TGTGTTCACTTCTTTCTGCCrTTTTTTCT 

TTCTATAATGTTAAATATGCTAAGTCCATCTAGTCAAATTTCATT^ 
TGCATTTCTTTGCTGATATTGCCTATCTGTTCTCTCATTATGTTCA 
1 5 TTTGTCATTTGTGTCATTTTTTTTCTCAGTCTTATTTTCCT 
TTTGGGTTTTTTGTCTTCCTTTAAAACGTGTTGAGCTTTGT^ 

GATAGGTTTAGAGTAGCCCTTATTTTAGGATGTTTCTAACTCCCATCGTGTGGTC^ 
TTCAGTGATGTCTTTCCACCCTGGATTATTACAAATTCAGTGTCTCT 
GATGCCCAGAAACTGTTTTCTATTAGGTTTTGCAGGGTCTTGTCCT 
2 0 ACCCCTATGTAGATTTCTGAGGCTCTTGCTTTTTGTGTAAAGCCTTTCTTTGTGATATTCTGC 
TCTCCCTGTGTCATGATTTGGAAAGTGTCCTCAGTCAAAAAGTGAGGGCGAATTTA^ 
GCCCTATGAAGTCCAATGTCTrAAATTACATCTTTAGGTATTTTGTATGGTTTC 
TAACTGTTAATCTGTGATAACTAAAACCAGAATTTGACATTCTGACAAACTTGAAATT 
TACTTAAAAAACAAAAACAAAAAAACCAGTGTTTTCCAATTTGCCA^ 

2 5 TTCTATCCATTTGAGAAGATGATAAATATATAGGTAAAGTTAGATTGCAGTAAAACTATGCTGTCTAGTGTTA 

TATAAGTTTTATAGATATTATGTGACTACAGAGCTATAGGTTATAGGAAAAATATAGGATTTCTTGGCAGAAA 
GAGCCTTGAAGAGTGGATGCTATTGAGGTAGAAAAGATGGCATTTOVG^ 

ATGAGCAAGTCATTCTGAATGATTAATCTGTCTATCATAGAGGGTTTTGATTAGGACATAGTAGTAGT 
TTAATATAATAATGTAATGTGCTGAGTGGTTGAAGTCTTTGGACTTTATCTTATAAG 

3 0 AAGAGGAAGAAACCCTGCAGTGAGGGAGAACAGTACAGAGCCTATTACTGTCATTCAGAGGTGAAGCAATA 

ATGTTGG AAAGAAGGGG CAG ATCCAAGAAACAACATGAAG AAAATAATCAG CAGAAATTGGTTATTGACTAGATATGTGTGAAAAT 
GATATATTGGCAGAAATTGGAAAGTCAAAAATGGGGCTCATTGCAG 

ATTTAAAACATTAACTTGTTTTAAAGGTATCACTGCTTAAAGCTAACAGTGATTTATTAATATT^ 
TTTTAATTTTATGTATACACCACAGTGTTTTGGCTACTATACTTGCCT 

3 5 AAAAAATGAGTACAAAGTGGGCTTTGACTAGAACTCTAACAAAGTGTGATCACTGAATTAC 

TATACTCCATTTCCCATGACACATATTATATTGCATTGTTTTTGCCTATTTG 
TTTTGTTGCATTAAAAAAAAGGAGAATCACCACAAAGCAAATTAAAAG 

TCATCTGGCCTGGCCTGCTTTGGCTAATCTCTGAGGTCTTTTGGCTACTTGACTGGTACTGTTTGTAGCTA 
ACTGCTGTTTGTCTGAAGTTGTTTTGGCTACTCAGCTGGTGCTAGGGAGCCTCAGCTAGGACTGCCTGT 

4 0 TTTGATTTTCCAGTGGCTGGTGCAGGCTTGTTTGCATGGTAACAATGTTCCAAGAAAGCAAG 

AAGCTGGGTTTAGAACTTGTACTT CTACCTCTTCCTATrGGTCAGAG TATGTCCGTAGGTCTGCCAAGACTCAAAGCACAGAGAAA 
TAGATGTTTCCTC1TGTTGAGTGGCATATAAAAAAAAGTGTGGCCATTGATTTCAATCT 
GGAACTGCATCACTCTTTTGCACTTTGGTGTTGTTGGCATAGTAGGCATAAATATTTTT 
GGAAAACATTCAGGTAGTATTTCAATATACTGTAGTTTTAAATTATATATATTATATACATC 
45 ATACACATATAGTTTTCATATATAAGTATATTATATATTATATAAAATATTTATATGTTATATATAAA 
TAGTTTTAATATGTAAGTATATGTATATAATTATATATAATATT^ 
ATGGTTCAGGGAGAATCTCTTTATTCTCCITCTATCTTTTGAGATAGACC 
GAGTTAACATTGTTTGGTATTTTTATCTTCATAGGTGATTGCCTTGTGAATAACT 
TTTTCAGGCTGTGCTATGGGTAGACTAGCTCTAACCAGAACACTTGGGAGACCT 

5 0 TTATATGCCAAAGAGGAAAACCACTTCTGCCTCCTAAACAACTGAAAGAACCAGAGACTGTGGGGTCAGATTAG 

TATGAACCTGAGCAAATGACTTACTCTCTTTCAGCTTCAGTGTCTTATCTAGGACATTC 

AGAAATATGTGAAGTTCTAGCTACAGTTCTTTACCCATGAGGTAGTCAGTAAAGTTTACTTCTTTCTC 

GTCATGATGTCATGATGTTCATTAG<K5TTTCTCATGCTCAGCGTTATTGATATTTAGGGTT 

ATTCTGTGCAGTATAGGATGTGTAGCAGCATCTGTAGCCTCTATACACTAGATGCCAGTAGGACCACCTATCCAGCTGTC 

5 5 AAAAATGTCTTCAAACGTTGCCAAATGACCCCTGGGGGCAAAATGCAAACAGTTTATCTAG 

GAAAGGGAATTGTCAGCGTACCCATAAGTGTAGAATGGGAAACAAAATACTTGAGTCITCTCAGAATGTC 

T AT ATT CCTCTTTTT AT AATTT AC CA.CAAACATAT ATTG CTT ATT ATGC CAT CT AAT AAAATGG CT AACAT TTG CTG TGT ATTAT A 
CTCGGTGTTAATAAGCTTTGTATGAACTCACTAGCTGAATGTTGTATATGTATCTGCCATACCTGTT 

AACAGGT ACCTT ATCTTT ACCAGCTCCAAG C ACAATG T ATTTGG T AT AT CAG G T AGCT ATTCT ACATGTT AATTGAT CACATTGCA 

6 0 CCCAAATGAAACATGGATGTGGAAGTGGAAGGAGGAGCTTTAAAACATGGAAAATTAG 

CCAAATTCATTCCTTTTTAAAAACTTTCAGAAATCAGCTTTTAAAAAA 
CCACTAAGCCAATATCAGAAAGTTGTCAATCAACAGCTCTTTTAAAGTTGAAA 

AG TG CTT AG AAG TTTTTGG CTCATTT AT CATGTTTT ATT CACTG T ATGT CATT AATTGTT AAAG ATGT CAG AGTCAAACCTT AAAT 
TCAGGAGAGATTTAAATTTTTCCTTTTTCCTATTTAAATATATCTTTCTAAT^ 

6 5 CATATAACITAGTAAAAGTTTCCAGTTACTGTATACAGATTATATTGTATA 

AAATGTGGATAATTATAATTAAACAATCTGTTATCAAAGGACATAC^ 

CTTATTTTTTTACACACTTGCTCAGATAATTTTTAAGTAC^ 

GTGTTTCACATTTTATGCAAAGTATTTTGTTTCCTCCAGAAAAT^ 

T AG CCAGCAGAGAAATGATG AAAAGCCACCCCTTTTCTTGT AGTTT CATCCTCATTGCT ATCCAAGATG AGGT ATTAAAATCACAC 

7 0 CTAAGTGGTGTCTGTTGTGGCTGAATATAATCTTGAAAGTAGAGACACTTCTCTCT 

TAAAATTTTATTTTTTCAAACTGAAAATCATTTTAATGAGTTCGAAAAGGTTT^ 

CATTTTCTTACAGTGGGTCATGGTCAAAAGTGTGAATATCrCTGATCCT 

AACrCCATCCTTAATGGCTCAATTCTGTGGCTTTCAAGTTAATCC^ 

CTTTGGTGTT AG TGGGG C CATTGTCACACAGGTTCAGAAAAT AACC CATT AG<3CATTTTATAGAACTCATCACATGTTC^GAACT A 
7 5 AGCTAGTTGTAACATT ATGTTTTTCATGTGAAAAAACACAAG CAATTCACTC^CTATTTTATTTGTTG CT AGCT AG CT AT ACTTT C 
TATT T CT'n TT CCATGGTGTGAAAATCTTCCTAGAAAC

GGAAATTTACTATCGCCCTTTTC^CTTCAAC^TCCTTCCT 

GAACAC^GAGGTAGAAGGGGTTTTCCCTGGTCTTTTCTAATCCT^ 

AAAACCTGCTTTT AT AATT AATG CAAAT ATT ACHTAGTGATTCATC CTTTC CAAATGATTCTTC^ CATTTT 
5 T AAATTG TGAATT AACATTGACATCACCAT AGCG TT ACTGTT AC CTCAG CT CTTT ACAGTG AACTGT ATTG CAAAGG CT AT AAAAG 
TTCAAATTTT AACT AT CACCCACGTGAG CCTTGGACTG CTG TT AGG CTTTTG AAACTGTGATCCTTTGG CAAGAATTGGTGT AT AA 
ATCATGGGGAGGAT ATTG AAGGGGAAGATCATGTTT AATTTGG CAGTTGGGTGCCTGCAATTGACAGATTTT CATTAAGGTTTTAG 
CT AT AGT AGTGATCATGCATCAGTTCTAAT AAAG TAGGTTT CGTTTGAATTTTAAAAATT AGAG TGAGGACTGCTTT ATGTTAACT 
GTTTACATTCAGAAGTCTGGAAAACC^TCGAC^GGTTTAAATGT^ 
1 0 TTTTGAGGACCATAGGCTAATATTATTTATGAAGGTTCAGGAGATCAGTCT 

AAT ATTTTT CTTGG AAGTTTGAG CTT CT GTGGT ATTCTAAGGAT AGGG ATT1TG ACATTG ATTG TTCT AG CTGGTATTCAGT ACTG 
TATTTAAGTGTAATGTTTTTACTAGTATTCrTCTTATAACTGAAAACT 

T ACGTG T AAAAATC CAAGTTTACCAAAAAAT ATT AAC^CTT AGGGG AT ATGTTC TTTG CATTT ATTTG CT AATT AACAT A 

TTCTAATCTCAACAAGGTTATATTCTTTTCAAAAATCCATTCC^ 
1 5 GTGAGTTTAGTTAATATCCAAGATCACACAGCCAGCATGATAGAACTC^ 

TCAAATTTCCCITTTTTAACrccriTCT 

TTTAGACAAATGTTTCCCrcCAAATCTGCATGAACrr 

GACTGCTCTGCATCCCCCTAGAATGCTCATCAGCTTTCTCA 

AAAATCAGATTCTTTTTTGGAGGGCTTTGTGCTTATTGGAGTG 
2 0 AAAGAGACCTGACAGTGCATTTGGGTTAAAGAACAAAGTTTATTTCTT^ 

GAGCAGATCTGTCTCATGGCATTGTTCAGGGATCCTCATTCCTTCCCT 

GTTTTTGGTAGAAGCCAGGTCTTCTCCATGTCniA.GTTTTTATGG 

TATGCAAGAGTTCTGACATTGCACTGGTGAGCATTTGGTCAGATGGCCATCTCTAACT 

AAGCAGTCACATTTCCAG CTACAATACT7VATTCT ATGG AAAAGG AG ACAACATATTTTG ATG<3GTAG CTGACATTCTCATCTGCAG 

2 5 TCCrAAGTACTCATGACTGCTAGATAGACTTTATGAAAAAAAAAAAGAAAAATAACCCCAG^ 

TGTTTCTTTAATTTTCTCTATTTTTCATGTCACATATCT 

ATAATAAGGAGATTTCTCAAGGAAGTAAATCTCAAATGATATGTGATTTTAGATACT 

TCTATCAGGGAGGGCCTCTGGGTAATAGAAGATTGCTAGGGCCTGCAGTATATTAATC 

CTATAGCCTTGATAATAAGGACTTTTATTTTTGTATTTTTATTGCACTT^ 

3 0 CAGAAACATGGCTTGCTTCTCTGGAACACGCTAG 

CGTTTCCACAGACTGGGGGCIXKKK3GTGATTTTGTGATGATTCAAGCACATT 
ACATTGTAATATGTAATTAAATAATTATACAACTCACCATAATGTAGAA 

CAGTCCCATCTGGGAGTGACGGGAGACAGTGACCCATCTGGAGACAGTGACAGTGATCIATCAGGCG 
GGCAATCTAGATCCCTTGCATGCACAGTTCACAATAGGGTTTATGCTCCTATGA 
3 5 CGGAGCTCAGGCAGTAATGCAAGCGATGGGGAGCAGCTGTATATACAGATGAAGCTTGG 
TGTCCTGCGTTATCCATAGCCCAGK3GTTTGGGG ACGCCTGCACT AG AGG ACAG AGC CT 
TTATGCAGAGTACTGATCAGTCCCrTTTGAGCTTCTCAACCTTGTATAAAACTTTGAGACAGTT 
GAAGGGTTAGAAAACAATACACTTAAGGAACTAAGACTGTAAATCGTTGAAGTTCA^ 

ATCC CAG T ATGT ATGATTTTTT ATACAAAG CAT AAAGATG ATTTGTACC AAACT CAGG ACACT GTGG CG AGCTT AACACG AGGG T A 
40 TTGAATCATACX3TAAGTGGTAATTCCTAGATTATGCAATCTCCTTGCTGTTAGATTTAGATACCAAAGAOT 

TCCTCCTATGGTCTTCAAAAAAGGATTATATTTGGTCTTACTGGTATAATACAGTTTATTCCT 

TTTTTAAAGTTCATTCTAGTGTTACAGTTTTGTTTTGTTTTC 

GGAAGGCCTGAACAAGTAAGTGTCATAATCTCACTGATAACTTTAT^ 

TAGTAAGTTAGCATTAGGAAAATTCTAAAATTAGTAAATTTTTTTTTAAT^ 
45 GATATGTCATGTGAAATGAGTAACTTACACCCGCTTTCACACATAT^ 

ATTAGCTTTCTAGAGCTACCATAACAGATCACCACAAACCTGATGGCTTAAAACAAC^ 

TAGAAGTATCAAATCAAGGTGTTGGCAGGGTTGGTTCCTTCTGGAGGCT 

CTGGTTGGTTGCTGGGAATCCTTC^CTTGTAGCTTTATCGCT 

CTCTACTGTGTCTCTAAACCTAAATCTCCTCTTTTTTTCTTCCAAAG 

5 0 TTCATCTTAATTTGTTTATATOTGCGAAGACCCTATTTCCAAATAAAGTCAC^ 

TCTTTTTGGGAGGGAACACAGTTTAACCCACTACAGCCCCITGACAT 

GACAGACCATGGAGATTC^GAAAGGTGAGGGATGGGAGGGCAGTGGGATGGGGGTGGATGATAAGAAGTTACTTAATGGG 
GTATGTTATTTGAGTGTTGGATACCCTAAAAACCCTGTTTTGACTACTATACAA 
CATCAATTTATACCAAAAAAAAATTTTATTTAAAACTCATCT^ 
55 AATTTGAAGATCTTTATATAAGTACTGACTACTACAGATCAGCTCCTCA.GACTTGAAGAATCACTTAGTCCC 
TTCTCATTTGTAAAATAGAAATGCTAGCACCACACTAACAGGGATGAGTCAAAAATATTC^ 

CTGTT C CT AT CATT AT ATT AGT AGTT ATG AAAGGAT AG CAG TGTC CTTTTTT CTTTTTCT C CT AT ATTTT ATTTTGAAAAATGTTG 
GAAAAAATAAATAGAAATGTTGGAATAATCTAACAAATCTACATTGGTTCTTAA^ 

TTTTCGTGTGTGTGTGTGTGTGTATATATGTATATATGTATATGTATATACATAATCATTTGGAAGCAATTTTCAGACT 

6 0 aattataatggcaatattaaattatttatgtaatgattccaagaag 

ttgaggtattactacgtgccagaacattggcttagtggggaaaaaataagtgatatatacactct 
taataggaagaacagagacattgttttcatatggtgatatatgctgtagtagtag 
gcagcatcactaggtgctaccttggaggaagagcagagttacggaggactttccgaaggaagtc^ 
atgtagtttgctagactagaaaaggtgaaggaac!actaggtgccaggatagcctc^ 

6 5 aaatagtggatttctgaaattacacgtgcaaagaatcagccagaaaaataaaaac^ 

tcatttctataaataaaatggacatatatagcatttatcacttcc^^ 
tttcacattaagatgctcctttctaattttattttc 

agacatttc^ctaattgtattatcaaaaccgtttgttttacctttcgcttttaatgtg 
ccagcatc^ggattgtgtctgcttgattttaaaccatcctttcctgtct 

7 0 atgccacagaagagtaaaacctgctgaaaacattcattttgaa^ 

gtggxjGCatattctg acatttccatttttgagg caagcggtcitgaagtttttttctttcttggcttg ag cttggtggtttatatt 

TCCTAAAGTTTGCATGTCCTCAGTGTTTTAGTGGAGTCTC<5AATTTGCTCTC 
CAaCACAmCTTCTCTCTTAGTCTGAAGTGAAGAGTATATCTCTCT 
ACCACAGGAAAGAGTTTTCCTGCTAGGTTATCACGTTTGCT 
7 5 AACATGGTTTTTC^TTCCTGTATTTCCTTITCAGCAA^ 






WO 03/008583 



PCT/US01/51291



GCGCAl'rCTTTGTTCCTCAGTTGAAAATTTGCAT^ 

TAATTTAACTAGGCATACGAAGATCTTCAGTACCCCAGAACTTTCAATAATTTACTACT 
GCTACTAAAAGAAACGAAAAGCTAGTTCAGCACACTTACTAGTCTCTCATAAAAAGTAAAATATG^ 

AGACATTT CAAACAT ATAACTTCTTT AGG CTAAAATAAAT AGCAAG CTGAGAAATACAGTTTT AAAGTCCTAAATGAG CCAGCGGT 
5 TACAATGTTTTTTGATAAGTAATGAATAGTAATTTATT 

CTGCCCTAACACATTACCACATACTTGGGGCTTAAAACAACACAAATTTATACTT 

GGTGTTGGCAGGGCTGCTTTCCTTCTTGCGGCTCTCATCTCTTTGCCTTTTC 

CTCTCATC^CTCCAGCCTCTTGCTTCCATTGTX^TACTTCCT 

GGACAATTCAAGACACTCTCTTCATCTTATTATCATTACCTTAATGACATACGTAA^ 
1 0 CAGTTTCTGAGGATTAGX1ATGCAGACATGTTTAGGGGG 

TGCTAGGAAACTGAAGGTAAAAGAAATGAGTGAGCTCAGTGCTTAGTTGTCTACTCCTTCT 

CTGTAAATAGTCCCAGAGGTGCTCTCTAGCATTCTTTCAGGCCTTGACATTGGA 

AGGAAATACTGATTTCTAGGATTATATTATTTTGTTTATACrrTCTAAGAGAGTACAAG 

AAGTATCTGTTCTGTGATAAGCATGTACTTGGCCCTGAAGATATACCAGTAAGCAAGCT 
1 5 TATAGCCAAGTTATGCTTCTCAAATCCCTTTGTTACGTAAACGTTGTCCAACCCTATGTAAAGTTTTOT 

AGGTCTCATGAAGATCTTTTAAAACCCIX^TTTAAAAT^ 

TTTAGGTGTCCCAACTTGAAATTTTTCATATAATATAGGAAAAA 

ATTTGG ATC CTT CTTTG ACTCAT ACAAT ATTT AAT CCAGAAGT ATTT CAACCACAG ATGG CACTG C CAACAGGT AAG AAAACT CAT 
CCCTGTTACCCTGTTGTTCTGCTTTCAGTCTTAGTAAAATGCAGGATTTGTTAATAGTCTGTCCT 
2 0 GAAAGGAAATGGTGATTTGTTTTAAAAACCAATCTTTT 

ACCCCTTGT ATGGTTT ACTGCATTTCATATCTGCT AGGACTACT CAGAAGGAATTTCTTTAGTCAACATAAGGAT TT T CTTTA TTA 
TCTCTGTATTCCTTTTAAGTCCTTTCTCCCAGATTCACACGAGGCAGTCTCCACT 
CTGTTGTGTACTACCAGTGTGCCAGTCACTGGGGAATCAAAGTGTACAGAAATCATAAAAA 
ATAAAAGCTTATAACTTTGCACTCTGACAAGTTGGATAAAACTCATAAT^ 

2 5 GGCAACTGTTTAATTTCGTAAGAGTAATAAAAAAAAAACTCTCriTCTTAAAAAAAGTGA 

TGGCTCTTTTTATGTTCTGTTTCTCTCCTCACTTCACCCGTCACCAGTGGGACrTAAC 

ATGG AAGTT ATG TG T AAAACAG AGTTTGGT AATCG CTGAAGGAGGCAT CAG CT CTTTT ATTT AC CAGGG ACTTG CTTGTCTT CATT 

GGGGGACTTTATCCCATTTGTATTATTCTCATTTAAAATTA 

AACTTTTTCTGCATTrATATCTTTAGTATTTCTAGGAGlTCATGGGAG 

3 0 TAAAGGTGTATTTCAGAGGTTTTTCTGTCTTTACCCCTACAA 

GTC CAATTTG AGCT CTAGGTCAGCTTCCTGTG AACAAGTAAATGTGTTGGTAGTGTGTC 

TAATGTCTTGTGAATACAAAACAGTATCCTGAGG CAGACGTTTT CTT AATTTTATT CCTTCGG ATTT ATGGG CATG AAAAT AAG AT 
TATCAAGATAACCATCTCAGTAAATATTCTGTGGTCACAGCTTTATGTTGTAGAAATTATTTTGCC^ 

3 5 CAAATTACACCTAAAAATCTAATTTGTTAGTTGTTGTTACACCATCA 

TAGTACAATTTTTATAGTCCAAGATATATAGGATGTTGTATTCAAATTTTTAAAGAGGTAGAGAGG 
TAACTTGTAACTCTAAAATGGATTATTTTAGTCTTCTAGATTTTTTGAAATCTAAAACCTTT 
CTATGGGATGCGACATAACTCTTAGGTCTAAAGTTAAATTTCTCTCCACAGTGG 
AAAGAATGITGGCCTTAAAACAACAAATACATGCATATCAAATCTAATTTTCTAGTTG^ 

4 0 GGAAAAATGGCTCTATTTTACATGAATTTTCACATATTTAAAGGAAATT 

TTTTTAGCAGAGTTAATAAAAAGTGAGAGACTTGAAGTTTAATGTAAACATGCTTCCAACT 
ATATTGCTCTGTTAGAGGACTGTCTTGATATGTTCTTTCACCATTTTAACAGGGAATGT^ 
GCTGT ATCTTTAGTGTG AAAAT ATTT CTATTTCTAATCAATTTTATTTCC^ 
TTTTTAAATTATGATTATGTTTCTAAAAGTAATTTAAA 

4 5 TTGACCAGTGACTCCCCCTTTTTTTTCTTGTGTATGCTTCATTTCA 

GAATAATAACAGGAAGTAATTATACCCTTGTTTTCTCCAGTTCT 

TAAATTCTCTGGCCGGGTGAGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGG 
TCAAGACCAGCCTGGCCAAGATGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAATTAGCTGGGCA 

ATTCCAGCTCCTCTGG AGG CTGAGG CAGAGAATTGCTTG AACCCAGGAAGCAGAG ATTCTGGTGAG CCGAGATTGTGCCATGGCAC 

5 0 TCCAGCCTGGGTGACAGAGTGAGACTCTGTCTCAAAAAAAAATAAAAAAAATTTT 

TTTAATTTCAACCTGAAGGTTTATCTCCTTACAAATTTATTTA^ 
AAAATGGACATACAAGCATTCTCCTAAAATATTTTTAGCACCAAACATCTTAA 
CTTT ATTT ATTGCAAAGCTATGGATTATTGT ACT ATTGACATAAAAAT 
TTGTACTTATGTTGTCAGAGATTCTGATTTTCCTTTGAATT^ 

5 5 TTIGCGTTTTTACAATAACTAATTTGCATGTACCTATCTTCAATTAAATTAT 

GAACACAGTTTATATGATAGCACAATATTCAAAATTGCTTATAAAC^ 
GATAGACCAGATTACATTGGGAGTGTCGGGGAGGAAAGGGGAAACAAACTAACCAG 
AGTAAAGACGGGAACAGAAAACCTAGTGTAAGGAAAAAGAGATAGCCTGAGATCATTAT 
TGCTAGATCAGTATCTTTTAATTTCACATTAAGCAC^CTCTCTTTGAGGAAGG 

6 0 AAATGTCATAGTGCCCAGAAGTGGTTGCTGCTTTTAATTGGTATTTGTCATTTTAAAATGCTTTGTT^ 

GCTGTTGACATCAATTCTTGCCGGCCATATGCCTTTTATTGGGTTA 
TGGTTCGCTAAACTCGTAAAACTATTGTTAAAATAAGGGTTAATTTAATATTTCA 
TTTTAACCAAAAATTGATTTGAAATTTCTACATAAAAACATAATTTC^ 
AGCCTCACAGTTTCTTTTGGTTTCTGTTTGTTGTTTTTG 

6 5 GGTAAGATTAAAGGGGTGGGACTITAAATGTTAGATTCCAGTGTCrAATATTGAGAT 

CTATGAAGGAAGTAGTTTATCTGAAAGATCAAGAACCTGACAAATTATGTAACA 
CATTGTTCACACTGTCCTGAGCCTCAGTATGTAGTTCT 

TTCTG CTGTCT CT ACCAGCTT AACTT AC CAG CCACT CCCAGG AAAGG CAGAGGTG ATTTCCATTCATTTTCATTT ATAAGGAAAT A 
CGTATGCATACAGTTCACATCCACTGAGCGATATCCATCTArTATCCTTC 

7 0 GTAGCCCCAGCCTTTTCTGTGCTGCCTTTGGAACATGTAACCTAACATACTGAAAGTACC^ 

CTATTGAATTCTCTTCCCTCTGGTGTCAGCCTCTTTGGAGTCAGGAGAGGAACATTT^ 
CTGTAGAGAATAGCTCAGAACTGAAATGGCCTCCTCACTGTATC 
TGTGAAAGAGAAAGAAAGGCAATCTTTAATGTCTGCCTATAGGAGGTT^ 
ACAGAATGAGGACAGTCAGTAAATCTTTGGATGGATAAACTAATC^ 
7 5 CAGCTGTATTACCa^TACATTTTCTAAGTArrCTTT^ 
CATGCrAACTAGGTCACAAG CTT AAATAAAG CATCCACTGCGTTGTGGTTTCCAG AATCGTG ACATT AT AT ATGTGTTATATTTGG
TCAGCATTTG AT AT AAATGT CTT AATGATTG CTGAACAGG CTATCCTT AT AAGGTTG AAATT AG ATG AAAGTTTTTG CTGTGGGGG 
CAAGAATGCCACATGAAACTTTGGGGCATGGTTCTAGTTCTTC 
TTGCTCTACCTCCATGTTCAACTCTTTTAAGCAGTTATT^ 
5 TCTGTGGGCTCTGGCATCCTGGAAAATTTCCCAGCTTTTAAGATCT 

AT ATG C CAAAGT ATTTTTTCCAATAT CATTTT C AT AAG CAT AG CACTACAAT AAGAATTTTT AAATGTAATTCCTT ATTATGGATC 
(ZAACAGTTATAAGGAAAAATTGGCATTTATGATTTACCT 
CTTATGTGAGTAAAACAATGTGTTTACCAAAGTGATGCTATT^ 
AGACTTTCTGACTGAATGGATGTAGCCTTGCCTTTGAGGTTGATGACTCA 
1 0 GGATTAAAGGCAAAATTTTGATCTCAGAAATTTAGAATCAGAAATC 
TTGTAAATTTTCATCCTCAGATTAAAAACTTCrrGAGGATTTAGATA 

ATTTGACAC CACT C C CAAT AAGAT AT AAACACAT ACG TGTCTAT ATTTTT CCAACTGT CCCAAAGGAATTTTGTGATT AAAGAT AA 
TGAATTGTGTGTGTGTGGTTTTGGGGTTTTTTTTGT^ 

CCACAGATTTTCCCCAGCTCTGACGGAGATGAGTCATAGCATAGACTACTAATTTT^ 
1 5 AGTGTTAATTAACTAATACTGTTCAAAAACTCAAGTTGTG 
TCTGCAGGCTATTATACATGTGTTGTTTTTTTCATGGTTT^ 

AAAAGGCATAGGGCTGAAATTGTAAATTATTTITAAATATGAATTGTGTGAGAACCA 
CTCCAAAGGCTTCCCITGTCTTrAGATCroGAATCTCCCAAG 
GACATTTGTGTGCATTAAGGAAATTTCCCAGACCGGGGAATTTACTTTATGC 
2 0 CTTTGGAAATCAGGAAGTTTTTGGATAATATAGGAACAGTTCCAGATG CTTCCCAGAGATTTAAATCAGTTCAAGGCACCTGGACT 

GCTCAGAGACTGCAAGACCCTACTTGGCATAATGAAATGTAGTATAGTCTAAGTAG 
AAAGCTGTCAAAATCTGCATGTTTGCTGGGGTTCCACAATTTAATCAGTAAGGGAGTGAAC 

TGAGCTGATGGG CAGATGATCTAGTG CTGTATATGCAG AGTTGGTCCCACAAT AG AAACAAAAGGTG AGAATGTTCCTTAAAAAAT 
CTTAACGTATTTTCTTTTCACCAAAAAGTGAATCTGTAAAAAGAAAATC^ 

ACTACCACAAATTGATTGACATCATTATAAAGTTGAATCAGGAAATACAGCGAAACCT

ATCCTTGTTTrATTCTTTGTATACAAATATGAATAAGTTAGCTTTATAAATCAT 
GGTCTGGTATAATCTTCCCTTITGTACAAATGAAAAAATATGACACTTCTTAGGTGTTT 
AGGCTTCCAGCTAACTTCATTAATTAACTATATCTATAAAATTTCTGGGTT 
CATTCTGGTAGGTTAATATTCAATCTTGAGCTTGACCATATTAATA^ 

AAGCTTAAC AGATGCAGGGTCTAG C^CAGGGTGTGTTCTACAACGCTGAAACAGTATATCTAAATAAACATGTCTGTAGTCCCTG C

TGAACCAGCTGTAATCAGAGTAGATAAAGGAATGTCTTTAAGTAAGAGTCAAGGAAGCATAA 
GAGCTTGGCAGCAGCAATTTAAGACAAGGAAGCTCTGACTAGAACAAGCCGTAACATGTTAAGTCTA 
GATTACTCTCCACACATGCATAGCATGAGAGGTTCCATGGGCITAGGTACCTGGCTTTTTA 
ATTAATACCATTTTTCGTAGTAAGATTACGGGAAAAGTGATTCTTGTTTACAGAGCCCTCTTTCAG 

ATTT AGT AG ACAT AAG ATTT AAAAATTTGT ATGT AC CTTTGTTG C C GT AATTTTT AAT AAG T ATTTCT CAAACTT AATTGGCTT AA

CGTTCACCTTTGCAGAGAGGATTTCX3TTTCCGTTATGTATGTGAAGGCCCATCCCATGGTG 
GAACAAGAAGTCTTACCCTCAGGTCAAAGTAAGTTTGTGGTAGCrCTCCTT 
TTCCAAGGAATTG CTTT AAATG AG T ACGGGTTG C CTT CG CTC CT AAG CTG AAGTGTTCACATG ATT ATT AAAATTTTT AAAT AG AA
ATTTGTCTCCTAGCAATAGAAGTGACAGATACTAAAACTTTGTTAA 

AACTT ACTATTTCTTGAT CATCTTAAAT ATTTTT AAAAATTGG AT ACTTCCTGAAACTTTAGTAAGT CTTTGAAAGAAATGGTTGA

TTTT'GACTCTTTGGAGATTTAGTGAAAAACCAAACAAGTGACTGAGTGTATGGATTATATAGTATATT 
GGATCCAGACTTTGTTATTAGCTGTGTAGCTTTGAATAGGTTTAACC^ 
AAAATATTAATAGTATATACTTCATAAGGTCCTTTAGAGATTAAATGAGA 
CCTAGGGCCTTGGCCrGAGCAGATGTGTGGCCCrCTTGGAATGAACTGC^ 

TACCATGTTCAGGCTGTTTGTTTATTTGAGAATATTC

TAGACTGACTGATGATAAACATAGACTGATAATCATTTTGCAGATAA^ 

ATTATTTAGTCAGTAATTCAAGGCnTGGACAAAATAGTTCrCACT 

AGTCTTTTTATTGGTTTGTTTTTGTAAGTAGATGAAGCTCAGTAGAGCAGG 

GGGCTCGTTGACTCATGTCTGTAATCCCAGCACTTTGGGAGGCCGAGGCTAGCAGATCACAAGGTCAG 

GCCAATATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGTGTACACCTGTAGTCCCAGCT

AGGCTGAGGCACAAGAATCGCTTGAACCTGGGAAGCGGAGGTTGCAGTGAGCCGAGACTGTG 
AGAGCGAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAAAAGAAAGAAATCAGTCTT^ 
CCTTTTCCTCTTATCCATCTTCTAGGCCCAGGCCTGTTAAATTTCTTATCCTTATTCCCA 
GTTCTTTACCTTTCAAGTGAAACTCACCCATTATTTAGTATGGGTACT 

TGTGGAATGAAAGATGTTTTTTCTATTAAAGGTAATTAGGCG

ATGGCCTCACATCCrTT AGAACCTTGATGAGGAATG AGG AG CAG CAAT AAAG AACTAATTTG CACTT ATT AAACAC CAAAGACTTG 
CCAAGTGGTTTGTATAACATTATCTCTTTTAATCTCTTA 

TCAGAGAATGGAATT AT AATAACTGGCCTTTCTCACAAG ACT AGTGGCTCT ATAT CCTG CAGCAGGCT ATTGCTCACCGAACCTCT 
CAGATATAATGTGTGCATTTGGGGTACTGGGGGCAGGGAGATAAGGGACAAATTATGGGC^^ 

AATGATAACTCTTGAAAACAAGGGAGAGAGAGGCTCACTGGTCAATGGGTACTCAGGTATTGGGGAAGCCAGG

TTTTT AACAC ATGTGT CTGTGTGT CACATGT AT AAAT ATTGTG C CTTT ATGT ATG TG TAT ATTTGTG AGAAGGGG ATTG AAAG AC A 
CATGTT AT C AG CATTG ACACTG ACT AT AATG ACTGAAG AATT AACTTT AAG CG AAAAACT 
CAAAAAATATAAATTATGGTATTTTGTTTACTTCTATTGCTCTA 
ATGGCATTAACTTTCTTATAAATTTATTGGCTAAGAAAAAAC^ 

T AG AGGTTT CAT CT ACT CAAAAACAT ATTG CTGCCTCAG TCACAAAAT AATTT ACAAATG AAACAACTT AG TGGG CATTAGATGTT

CTTGTAGAATGAAAGTTrCrATATCAAGGAGTTTAGTACTTACTGGGAAATAT 
AACTACAGTCACATTATTTCACCAAATCATTTATTTTTTATTCACTAATT 

GAGGAGGAGGAACTTAGAGGTTAGAGAATATGACTATACAAGCTGTCATTGTTCCCTTAGCTTGTCAT 
ATGGTTCTGCTGTTCCTAACAGTTATAGGAAGTAATTITATTTATCCAAGAAATTCT 

AAGAGACAACGGCCAGGTATTTTTTTTTTCCAGAGCTCTTCT

AAGTTTACAAAGGAATTTTAAATCCCCTTTTCCTCTAATGTTTAAGC^ 

AAGACCITTAAAATTGTGCATGTCTTTTCTGTGGT^ 

GATACATCTAAAGTTTCTCAAACTTGGCArTATTGACATT^ 

AAAATATTTAGCAGCCTCCCTGACCTTGACrrCACTAGATGCITAATAGGATTCCTATAATTGTGAC^ 
ACAAATGTCCCCTGGGGGGCAAAATCACCCCrCITTGAGAACCACTGATGT
TTGCATACTTTTCTrACTATAAAGTTGTCTCGCATAGATTAC^
T ATT AG TAG AT ATTT CTTT ATTTTCAAGATXX5 TG AT CATGGTG T CTGT AT A^ 
ACTAAGGAGAAAGATAGAAAAGATAAAATATGTATCTTCAACAT^ 
ATTGACTGTTATAAACTATTACIAGAACvAAAAAGTGAAAGTGGAGAG 
TGATGCTTTCAGCTGGGCAGAGGCCTCGTGGAGGGTC^

GAGGGAGAGCATTTACATTT ACAGGAAAGAAATGATACTTCTGAG CCTACATT^ CATTTGTTCAGGC 

TTGTGAGAGCGCCACCACrCTGOKSTGCCTGGAGA^ 
OVGGAAAGAGTCAGGGGCC^CTTGAAAGATTTGG 
ATGTGAAAAT ATAG C CTGT ATGGG TT ACTCTCTT ACTG ATTGTCT AGATTTG ATGC CTT ACATT CT
AGAAGTCTACTCAAATCCnXATTTTCTAGTGGAATAGTGTCCCATGCAT7U^ 
GACCCTCTTACCATGTTGAGATTATTTCTAGAACTCAGGCT 

AG AAAATTTC CATTTAG CTTTCTG AT AT CTTGCTG TG CAAGTCTTCATG CTCT TTT1TT AAAAAAAATTTT AATT AT AAATTCTTT 
CAAGCATATAGAAAGTTTAAAAATTGACATAACAGACACTATGTACTACAACACAAATT^ 
TGCTTTTGATAGCTCTCTCTCTCTCTCTCTCTCCTCTCTCTCTCT
TTTGTCCTCCTTCCTTCTCTTAGGTAAGCTGCTACTCT^ 
CATCTATAAATACTGGCATTTTCAAGAAAGTTAATTTTTACTAACTTCAACC 
GGGCAAACATAGGTTCAAACTGGACGCCATTGAAGATTAACTGGTACTCA 
AAAATGTATTTGTCACAATGTAGTTTGTACTAACTCI^ATAGCAAAAAATAATAT 

AGACTAACTCTTATATCrTCTCCTTTCATAAGAAAAATTTCCCATCATTAAACAT^

CCCTTTAAAGTTTTTTCCATTCTAATTTCATC^ 

CCAAGCTTGTCCCTGCCTGCTITTTGACTCAGCAAGGAGTGCCAGGGAAGGCTCACTTC 
GTTTTAAAAATCAAACTGTGTTCCCTTTCACAGAGGTGGGTGACAAGAGAAGCAGCCCCATC 
CTCCTCTAGAGTGACCAGAAAGTACCCCTTCCTATCCTAACTAATGTGACACGTCCTTCTGTGTGGTCCACT 
AAATTGACAAGAATGACTTTGATTGTAAAGTACAAATAGCTTACCAGTTAGTATTCTT

GCAAGGGACTTGAAACCCAGCTGATAGAAGTGTAACTATTTTTAATCAAACATTTTCTGAAAGCT 

TCCTGAGTGTTTTATGTTGAGTGGAAGACGTGTTGAAATGTTTAGGTCAACAACAGCTTTTTTGT^ 

TAAAAAAAAAAAAAAAAGGCTCCACTCTTCGTGCCTTTCCTTAAAAATCCTA 

ACCCGATCTGCTGTGGCATTTGAGGATATCACATATCCTTGCATAATCTCCCTTTCCTTTGTATAGACT 

TTGGATCTATTTGGAGTACTAATTTAAAATAATTTAACCTGACTCAATTACTTAAAACAAC^

CTATGTGTGCGTATAGAACCTAGTTCATATCATTTATTAGGATTTCCATTGAACCCCTAAGCTGTTCACTG 
AGGGAAAATAACCATACCCCTATGTACATTATACTGTGTGTTTTGTAAACCATGAGAAAAGCTGCTTATTC 
TAACCTGGGGTCTATGGATCCCTGTTGGTGGTTTCAGACGCTTTTTAGATTCCTAAATTTATAT 

GTG CATTTTTT CTTGGG AG ATG AGT ATAACTTTTT AT C ATATTTT CAAAGGG AAAG ATTT AACATGGT T AAG AAT CATGCT AT AG C 

TTGTATCACTTTAGCTTTTTGATAGCAGCTATCTTTT^

CATTrTTAATCAATTTTAAATGAAGTGAAATGCTTTATGTTGCTCAACACATAG 
TGGAAAAAAGATGAAACTGAATTGATGGCCAAAAGAGTTGAATGGGCTTCACTGC 

GATTAAGCATGAAGCTCACCATTAGTGATTGAAGCCAAAAGAGAGAAGGCAATTCATTGAACAAATATTGATT 

CAGGT ACTG CTCTAGACAAG CAG ACAAAACTCCCTCCCATATG TAACCACAATTG AGACTTATAGGTAATTTTTGTTGCAAGT AGT 

TCAGAAGTTTTGTATTTTGTCCTAAAACAAAATATTACTTTGTAAAATATTTTTTT

CAAAGTCACTATTATAAAGTGGAGAAAGCTACTCATTAGAGAAAACCTATCACACAGATAATAGCrrGGGAAGCTT 

ATT ATT AGG AG AG ATATACACT CCTTG AATTT AAG AACTGTCTG AATTTT AAATT TTG TTTTGT CAG TT CT C 

CTCTATTGGCCATTCCCATTTTTCTGCCCTCCACTCTTCCTTGACACATGATT 

CTCTGGTGG CTTCTTTTAAACCCAGTGGTCTCAGTCTGAAGGTATAGTG CATT CCATTCTGG AGTCTTGACCT 

CATAAAGTCACATATTGGCAATTCGCCTTGATAAGGTCAAGTGAAC^

CCTCCACAGACAAGTTGTTTTTCAACCCATCTGCAGOT 

ACCCTCCAGATGGATTCCAAATAAGAAGCCTCCAGACTACCTTCACCTCCCTGGGGTC 

AGACTTTCACAG ACAATG CTTTTTGCCTIT AGGTCCCCTTCTACCCAAC AGAATXIAT^ CTTGATGTG CTTACTTAC 

ATCATCTCATTTAATCATCACAGTAATCCACTGATGCAGGTTCTGTGATCCGCATTTTATAAATGA 

ATTAAGAAGCTTAACTTGGGTCTCACAGGCTTTTAGAGATGGAGCCCAAGATTTCAGTTCA

TCTTAACCATCGTGGAATACCATTCATTATCATAATGT 

AATCATGGGTGGGCTTCAGAATCGTCCAAAAATGTTTCAACCACAGAGACGCCTGGGAA 

CATCATCTAGCCAATCTTGG ATTCTACAATAATTCTATG ACAGTTTCTAA CATTAT AT AATGG CAACATATTACAA 

CAGATAATTGAATAGATATTCAATGTATACCGTGGAATAGAATATTATGCC7VCTGTGTTTTAAAAAGATAATTAAGAGGATAC 

ACAACGTGG AAAACTGCTTAT AACATAAT ATTAAGG ATAAACCTCAGCTGGGCATAGTGG CTCACG CCT ATAATC CCAACACTTTT

GGG AGG CCAAGGCAGG AGGATTGCTTGAG CCCAGGATTTTGAGGCCAGCATGGGCAACATAGTG AG ACCCTGTCT ATAAAAA^ 
ATTTTTTAAAGAAAAAAAACAGGATTTCAAATTATTGCTAATATT^ 
GTGGAAGCATATAGATGTATCTTTAGCTTTATTTATTCATCCATGACTTAAAG 
GCAGAACATTGGACTGAAGGAAAAAAAAAAATGTGAAGCTCTCTTACTTAGCTACAGT 

GGGAGACAGTGCAGTGTGGCTCTCAAACCTTACTACCCATA

TCCAAAGCCTGGTAGAGTAGGTAATTCTCCCACTACAATACACTTAGTAACACTAGAAAA^ 
AGCTGAGTTTATTAGAAAGTATGAAAACTACTGGGGCGCCGTAAGACAAAAGAAAACTT^ 
ACTTTGGGGCCACAGCTGCCCTGGAAATATTTGCTGACCTAGATAATCAAGAAGCTTGGT^ 
ATAAGGCCTTGGGCCTTTCCAATCTGAGGAATTGAAACTGCAATCACCCACTTAATTC^ 
TGATGGGTGATCCTCAGGCAGAGAGGTGATGAGGAAGTTGCCTCTCTCAGCCTAGGGGGAATCAGA^
GAATCAATAACTACAAGCCCAATTCTTACAGATTCAGAGTTAAG 
TGAATATGAAGTTGTTCCACATTGGGGATACCCTCAGAGCC 
TCCTGATCAAAAATTAGAAAATATGTATGCAAATAAGCCACTCTGTTTAAGAACT 
CAG AATCTT CAGAT AATTGAATT ACCAATTGTTTGAT ATATAAAT A 

AAATAAAACCAAGTCGAATTTTTAAAATTAAATGTGCGACACTGAAAATGATTCAAC^

GATGAG CTGG CAGGTACAAAAGAAGG AGTAAGAGGTGTCTAGTTG AAGTCCT 
TACATTTACAGTTTCTAGGTCTGGGGTGAGGTTTAGGTGGTTGTGGACCA 

TTGG TT CkGGG CTGGG CATG ATGTTGG TT CCT AGTTTTGCGACTTG CAGTG TTACTT AACTTGG CT AAG CATCAG CTTCCTT ATCT 
ATGAAACTGAAAAAATAATACCTATGTCATAGGATATATGTGTAAATGAAGTTATC 
ATGACAGTTGCTTCTATTTTTATATAACCAAATACACTACTA
GAGCTACACAAAACCTTCACCAGTCTATTAATTTATTTTTACT

ATATC C111 ' C1 " 1 'CTGTCTGGGGTTAAG AT AGGAGACTGAGATCAGCTGTGTCAAAAGC TT T C TTT1' AGATGTAACACATGATTTCT 

TATGAAAATCTCATAGGTAGGATACCAGAAAGGAATTTATTAAATTCTTGTTAGGCT 

GTTTTAATTTAGGATACTTATGATAGAGAGCACCCCTTCCATAATCAGGAGAA^ 
TGGTCCTGGGGAGCACACAGCACTCCAGGGAAGGCCACACACCAAGA

GGGCCATTGCCTTTACTGGGTCCAGGACATTATCCAAACAGGTTTCCCACAG 

ATGAGTTCCAGGAGGTCACAGTGTGATGAAGAAGAAGTCACTGTGGCATATCTCC^ 

GACCAGTCAGGTAGGCTATATGTAGCTGGCCCATGGGGAGGCGATCACCAGGTGGCAATTGTATAAT^ 

ACATAGAGGAACCGGGAGGAGGTCTAGTACTGGAAACTGCGTCGACGGTCACTGAGCCCT 
ATATTCAGAATGGATGCAAAGGCAGCATAAAATTAAAA^

AAAGCAACCGAAATTT ATTCTTTCACAGTTCTGGATG CTAG AAGT C CAAT AT CATGGT CTTCG CAGGG C CGTGC C CCCTTTG AAGG 

CrCTAGGCAATAATCCTTCCTTGCCTGTTTCTAGCT^ 

TTCCAATTTCTGTCTCCATCTTCACATGAATACTG 

CCTGTCATTGTGTTTACGGCATCCATAACCCTATGTGACCTCATCTT 
CAATATTCTGAGGCTCTGGGTAGATGTGAATTCTCGG

GCCAGTAGCCCTACAAGTATAGGATCATCCCCATTTCAGCCTGATAATACACTCCGAA^ 

AGTTAATCATAGAAATACTCAATTAGTATAGAAATAACAAGAAAAATGCTTTGTTATTCT 

ATGTATCTGTTATTGTTATCCTTATACACTTCTGCACAAAAGATGGTTTTAT 

GTAACTTGACCAAGAATGGCTTCTCTTTCTCCATTTC 
TATTAATATAGATTAATATATGTATTTATATATC^

ATATATTCTGTTTATATATTGAATTTATATATATATAAAACTTAACACTATTAAGTCTTAACACT 

TTATATAGTTCTATTAATTATATATATATGGGGGGAGGGGTGTTTCTCCAGCCTAGACTTCCA 

CAGCCACTTGTTTGACATTGCCACTTGGATGTCTGGTAGA 

TCTGCCTCATAGCTTCCCCTTTCCGGAATGTGGTTGCTCCATCCT^CCAG 

CCTCTCTTCTTCTCAAATCCCACATGTAATCCATCAGGTAGTCTTATTGGCT

TTCAGCACCrCCAATGCTGCCACCCTGGTATAAGCTACCATCCT 
ATTTCCTAAATCAATTATOUVTAATTGTTATAAATTTGTGATTTTTCTAT 
AAGAGCTTTCTTCATTCTTCTTTAAAAAAAATTAAAATTAACATCAGTA 
AGCCATTCCTGTCATTATTCATTTTtlATTCTTAAATTGTCCT 

TTCATATCTCCCTATCTlTl'l'lTrCCTAAC^C^TTCCrTAi liuiGAAATTATAAGCTA^CrAGGTTCACCCTATACCTTCACTA

CCCCAGTCCAGGCATCGTCTGTCTCTCTAAGAAGTCCTAGTTCCTTTTAGCAGGAGAAG 
GCTACAAAAT^GATTATTTTAAAAATCTCGAGTTCATACACTAATTCATA^ 

TCCCTCCCCTCATCCCATACTTGTATTTGGATTTGAGATATGAAATGTGTATCTCATATCCCATATTTCT 

CTCATCATCAATGAT AGCAG CATGCTTACTCATTTTGCT CAGTCCAATAATTCAC ACACATTTGTAG AAAAATAGAAT AGTTTTAG 

AATGGCTATACCAATATCTCCAAGCTTACTATATTCGGTTCAAAATT^

CATTCAGTCAGAGTAATGTGTACAAAAGTTACTTGGATTAAATTTTTTCCTTCTGTG 
AAGCTATTTT T TTATTTCTATTCCATTTTAGGGGTTTTCCCATCCCTGTTGATTTAGT 
TCAGTTTCACATCTTTCATTTATATGAGATTAACTAGACCCTTTAGAAATCATT 
TAGGTGTATAAAAACCAAAGGGATCAATCAGAGCCTTGATTTAAAG 

CCAGAACATTTCATTTGTAGATTTCAAAGACTTCTTGGCCTTCCTACC^

ACCACCTG CCAGGC ACAGTG CTGG G CTCTGTGTTGACACACATG AATG TG ATC AT CITTCTGT AG CCCC CAGG TT ACATGG C AC CA 
CAGCCAAGATACAGACATGTGTGAGAGAGTAATCAGGGTATTACTCAGGATTTGCTGGAGGAATC 

ATTGGTGTGTGTGTGTGTGTCTGCGCGCGCGCGCGCATGTCTGTGTGTGTGTGTGTTGAGGTGGACTGGGAGAAAATATACCCT 
TTGAGATCACCTCCTGGCTAGATAGTCTCTCCCTTCTCTGTTTCTGATAGAATGCAAGTGGAATT 

TAGAAACTAGAGAGCTGCTATCTCAGTCACCGTAAGAAGAACGAATCACACCAAACCT

TACAAGGGTGAGATAGTCAGATGCTGGAGATCCCTGTTTTCGTAGTTCAGQ 
TGAAGAGATATTTCTAGTGATAGGTTCCAGGGATCTTTCACCXSAACACTTTAAT^ 
AAGGTTCTTACTAAATTTGCAGGTGACATGAAGTGATAGACAACAAA 
GACCCAATTCTTCAGCAATTATATTTAAAAGATATAAATACATGCAGAAAACAAAT^ 

AGCAG CAACGCATG CAAAG AG ATCT AGG AGTTTT AGTTAAT AACAG ACTCAACATG AACCAGCAATG CAATGTG ACTACG AAACAA

GTAATTCCTCATTAGGCTGCATTAATGGAAACACGATACATAAGACCAAGGAAGTGGTGGTCTCA 
CCACACCAGGAATACAGTGTAGTACTGCTGGGCACCACATTTGTAGAGAAATAGAAAC 
TGAGAGAACrCAAACTTCATTCAAATGAGGAGTAGTTACTGAGCCTGAGAAATT^ 
TATGAATCGCTGTGATGGGGAAAAAAGGATTAAATGTGTCCTGTTCTCACAAGAG 

AGGCGATAGAGGTTTOlGTTGCTGCTGACAGTCAGGGCTGTCT'GAGGAT<^

ACTGAATTGGTTCAAAAATGACCAGGAAGACTACAGGGGAGG 
TGTATGCCCTTAGAAAGATGTTTACITAACTTGATAAATCTGTAACAAGGG 

T CTG ACAGATC CC CCAAAAT AAT AG AAATT AAT AG CTT ACACACT ACTTT ATC CAAG CTGT AG AATGT AT AACTT ACTT CAT AT AC 
TTTGTGTTTAGTCAGTTTATACAGGCTGAACAACATAAAGTTAAGACATAAAAGGGTT^ 

TTTCCAAATTACTCGOTGATCCTAGAACTTTGAGA

TGGTACAATATTTTGGTTAACCTC^GTCTGACACTTAGATTGTGT^ 

T AAAG TAG CAC CAAGAAATCAAACxAAAATTT CCCAGTTCT ACT CACTTGT AATTCTCTT AATTT AGT AGGG G CTTTCCACAC^ AAA 
T CACAAT AG AAT AG TTG AAATGCTCTTT AT CTT ATGT ACITTC CTTCT CAACATG 
G TCCAAATTTTTATACAGGG CAGCTGTATTTACTGT AGTACAAAGGAG ATTCTACACCCAATG ATGTCTCT AAGTATTTT 

TTAAGAACCTAATTGCTTCTCTAGCCAGTGGGTTTCA

GTGGTTGTATACCTTTAAGAAAAAAGAAATCAGTGATATTCAATCC 
TTATTTTAAATTAAATTTGAGTCAAAAACAATGTGTTTAAAGACTTT 
ATTTCTATTTTCAATGAATAAATGAGAAACCTAGAATTT^ 
AATTATTTTTATGTTATTAGGATTTTTAAAAAACCTCTTAGATT 

TCTCCACTTATGTAATTGAAGCATTTCTGACATATCCT

TCCTTGGATTTACTGAAAATAAACACTTTGG<3TACTCAGT 
TTTTCTTTGTCTCTAAATTTTGATTTTCCA 

AGTTTGTAATTGTGGTGAAATGTATTTAAAATGTTCACAATTTTAAAGT^ 
CTGTGATGGTAAACrrTTTTTTTCTAGTTTGAAAATC 
7AGAATTT7AGAGCTGGAAAATGAGTGAGACATTAT
TCACGCAGTATCATAGCAGTACCTraTGACTAGAACT

AAATTTGCAGACTTCAGGCTAGGCCTGTAAATAAATAAATAAGAAGTTAAAACATATT^ 

TTGGGAAAAGATGTGCCTGACACAAATGTATTTGG 

TCAAAATTCTTGAATITTGAGACATTTATGTTAAGCACAGTT^ 
TCCAAGCGGAATCAT ACTT AAATT ACAACTTT AGGGAGT TTAATGTTT CCT ATT^ TG CACTC CTGGG ATATTTTT CAGTG ACTCA

GAAGCAT AAACTG ACACTGTGAAGTT AAGGCTGTGTT CCAGTG GGAATG A(^ C CCTC 

CTCATGCAGCTGTCTTGCGGAGTCTCCCTGCT^ 

TCTCGCTTCTGGTTTTACCTAGAGCCAACCCTTGGTGATTGTTTGCCT 

CAAGGTCGAATTCTATTAATACCTGAGAGGTGAATTTATAAACATTTAAGATC 
TGAAAATTGCTTCTCGAAGTAGAAAAGCATAAGGGAGTACAAGGGCTTCAAAAAA

GTTAGGTGTTCCTAACTGGTTTCAAGCAGAGGAGTTAGTAAAATTCCAAAA 

CTTTGCAGAAACTCCTCAGGGATCCCTCAGAGTTAGATTA^ 

AGCCTGTTGGGCCCCACTCTCCTCAGCCTTTCCTAATGTTGCCTGGGAA 

GGAGACAATTATTTATTTATTCACTAGCT ClTrf AACAAATATTGATTAAGTAGGTA 
GAGAAAATTAAATAGTGCCTTTGTCCTCAGGAAACAATCCATTTAATAGAGAGACGAAGATATGCC

GCATAACCAGCACCTGCGGCTGCCATGTGGGAATTGTTGAACAAGATAATTTT^ 

TGTAAAGATAAATTTCTAAGAGTCGGCTAGTTGGGATGATTATAGAAGGATTCAAGGGAGAGATC 

AGAGAAAACAGCACAGCAGAAAAAGCAGCAAAGGCTCAAAAGTAGAAATCTCTC 

GAATGAAGAGCACGTGGAAGGCAAGGGTGGTAAGAAGGCTGGAGATAGTAGATAAGGCCAAATC^ 

TTTTTAGAAAATGGGTAAGCTATAGAAATTTTATATAGGAA

AGCAATCTGCTAACTGATATTGGAAGGGAAAGGCCCAGAGACACTAGTTGGA^ 
AGTTCAAGCTAAGATAGATAAACTAAGGAATGAGAATGGAAGAAAGGGACAAACTGAGATTT^ 
TGGCAACTGGTTAAACATAAAAGACAAGAGACAGAGAGGAATCCAAGATGAAAAGACGATTITG 
ATTTGTATTTCCATTTCCAGGAGGTGGAGTAGCTTTGAGAAGAAAGGTGAT^ 
TTATGGACAGGAAGATACCCTACAAACAGCTGGAAATTTTGCACATGCTTAAAAAGAAGATG

ATCATCTGCATGGAGTAGACAATTCAATCCAGGAGACTTGATAAGAATGTCAAGGGAGTGATCAC 

GAG CATCGATCTCTGCTGAAACATCTCTTTGAGCAGG AAGAAGAGTGCCTTAGCAAAAG CAGCCAGTG AAATGGGGAGTTGGGGG A 
GACTTTCAAGGAGGGAATGGTAACATATCCCATTCCCAAAGAG^ 

AACATGG AAT CTGT CATGAC CTTT ACAC ACTTTTTCATTTG ATGG AT CGG GGG ATTACTT ACATTTGCAAAGGGC CT CAG ATT AAG 

TGGTAATGGTTCAGC ATTAGTAATT AGAGATTTTCTTATGTACTTTGTAGGGAG AAATAGAATGG CATT CAAACAGGTTTAAGTTC

CTCTTAAGAACCTTTCTCATTGGATTGTAGAGAAGAGCAAATGAAAGAATCGTTCA 

GTTTC^UVTT AATGTT AACCTTT ACAG TTGTC ATCAT CATT AT C CAG AAGGGGCTrrriTTrAG AAT AAG G AAG AAAAG CAG TG C TC 

ATCTGTAGACAAGGGAAAAAATCCTTTGGAAGAGAGAAAGAGAAGGGTAATTCAAGAAGATCAGGAATA 

CAGAATGGTTAGCCCTGGAAAAGGGAACAAAC^TGTTTCTCTGAATTAGGAGAGGATATGT^ 

CAGTAACAGTAAGGAGAAGAGAGAGTGAAGGAACTTGCACGCAGTGACTTCTACCTTCTCATTAA^

GCAGATATCTGCTTAAGGTAGGAAGAGAGAAATAAGAATTGCrGTCATTTATT^ 
CACTTTACCTCCTTTAATCCTCAGAAAACTTAGAGATTATATATT^^ 

AAGTAACTCTCCCAGAGTCCCCCAGATGGTCATGGGAGCCCAGATTCAGAGCCAGCTATGTCTTACTCCAAAG 
TTGAAACATACTCCACTAGTTTGAGATCTTGGAGGTGTACCCCTCCACTACT 

GTTGTTCCTATCCATCACCACAGTGAATATCAAAGAAGGGCCCTCrCATTTGAGGAATAACAG^

AAATGCAAATCCnX3GATCCCTGAGGGAAACAACAAAATGAAGCATAGAGGAAATATCTGAGTCTGGTGCC 

AGAG AGAATGCCTAATGGGCAATAG CAGTTAGGG TGGGAAAAG AGAAGTACAAGGGGGTGTCTTGGAATTTGAGGGAAGAACTGCT 
GCCTTAGTTTTCGAAGGATGATTATTCGAGAGTTGCCCTCCCT 

GATTCCTTTTTACC AGATACTGTAGGG CCAGTGCAGCAACCAG AGCCCCCAG CCAGAAACCCTGCCTCCAATTTAGACCAGTGTTC 

ACTTTTAAGTTTTATGTGCAGGATGTGCAGGTTTGTTACATAGGTAGACXSTC 

TCAGGTATTAAGCCTAATATCTGTTAGTTCTITTTTCCrGGTCCTG 

TGTTGTTTCCCTCrATGTGGCAATGTGTTCTCATCATTTAGCTCCCACTTATAAGTG 

TT CTGTTCT CGTGTT AGTTTG CT AAGG AT AATGACCT C CAG CT C CATC C ATGT C CCTG C AAAGG ACATGAT CTTGTT CTTTTT CAT 

TTCTGCATAGTATTCAATTGGCCATTTTTTAAAAGAC

TG C CACCT AACC C AAC C CTTT AT ATTTTAGT AATTG C CTT C CAT AAAT AACATTTT AT CCTGT CTT AATT ATGCATT CATTAG TGT 

ATTAATTATTGAATGCCCACAATATGACAAGGATTATTTCAAGCRCTGAGGA 

TTArrCCATTTrrTTAAGAGACAGGGTCTCACCCTGTTGCCCAGGCT 

ACTTCTGGGCTCAAAGGACACTCCTACCTCAGTCTCCCAAGTAGCCGGAACTACAGGTGCA.CACT 

CTGAGCCTCCATCCTAATGGAGAAGAGAGGCAGTAATCAAAATACATGAAATGTGTAGTTATTAGATGGTTATTAATC

AAAATTAAGCIAGAGAAAGGAGCAAGGGATTAGGGGGTCAAAGTTTrGGTAG 

GG CAATT ATGT AAAG CCTG AAAG CAGGT AAGAAAT AT ACAGGT AT ACAGATG T C CCTGGAAAGAG TGTTCT AGG CAG AAG AAAT AG 
AAAGTGCAAAAGCTAAGATTG CTCACCTGGCAAGTTCAAGAAC AGTGAG A CTGCATAGGAGTAAGAAGAAAAAGG 
ACGGAGAGGACATGAGATGGTCAGATAGCCTAGGGCCTTGTAAGCCArrATAACGATTTTGTCTTTTAA 

GG TTTTTG TTTT AACTTTTT AACAT AG AAAAATGT AAT CAT ATAC AAACACAG AAT AGT AT AATG AG CCCC CTTG C ACTAATC ACT

CAGCTTTAACAGCTTTCACTTTGGGAGGATTTTGA 

CTCAGGAAAATAAGCCATGGGGAGACAGGACAAGCAGAGGAAACACAGGA^ 

CCAAGGG AG TAG CAG TGAAGGTGGTG AG AAGTGATCAGAT AG AGG AT ATTTTITTGAAG AT/ITG AATT AATA 
CTTTAAAGAAGAATTTGAAAATGCATGTAGTGTAGTAAAGATGGTAAGAAATACAGAAAAGAGG 

GTAATCCTAGCACXrTAGGAGGCTTAGGTGGGAAGCT

TGTCTATATGGAAAAAAAAATTAACCAGGTGTGGTGGCATGCACCTGTAGTCCTAGCT 
CTGAGCCAGGAGGTTGAGTCTTCAGTGAGCCATGATCATACGA^ 

AAAAAAAAAAAAAAGAAAGAAAGAAAAAGAAATATAAGAAATGGCACTAGAATATCAGTATTATATA 
ATCCACACAACTGGCAAAAAAAAGTGACATACATGAACATTTATAA^ 

TATTATTTTGCAGTGATGAATGATTAGGCCATCCAAACGTAAAGATTTATTTGTAATTTG

GTACAAGTTCCCAGCAATTCTTGGAATTAGCAATAGTGTGTTACTTCITT 
ATGTAGTAAAACTGAAAGTTAAAATAAAAAAGGAAACXGTGGGTGAAAGTATGAGGGAA 
ACATAGAGATTTTATTCATCGTCTCTGTGTAACTCACGAGAACTTCrr^ 
ATCCCAGCCTGATATCATGAGCCATATATGCTGTTGGTCATTOUVAAAGGGCACTAA^ 
GCAACTCCATG AG T ACT AC CAGG AAAAGAAAAACATUVLVL^ CATG CCAT C C
CCTACCTTACTGAGAATACCACATTTATATTCAGAAGACTCATAATGA^
GTTATGTCACTTTGTTAATCTCTTTGAACTTCTGTTTTCTTATCT 
GGACTTTCGGAGGCCCAAGGGTCTTTACAAATAATAGAGAAGGAAAGA 
TGCGTAGTCCCCACGGCAGCCCTCTGAAGTCATGTX3ATTGGCCCATTGTGTAGATGTAGA 
TCTGTGTAAGGCCACAGAGCTAGGAGGGGGAAAAGCTGGGATTGGATCACTGACTA
AAAGAAGAGACTAACTTTACTAAGC!ATCTTTTTATC 

ATTCTGTCCTTCCTGGTTGCCCTGGTTGCTCAACAAGTAATAAACAATAAAGCCT 

AC CTCACAGAAAAGAATG AATG CATGGTTT CATGTCTGTGG CA CTCT A 
CCACTTCTTATAGCCATTCCTGAGACCTCTCTTCTCACAACACATACCAGCAAAG^

CATCTATCAACACTCTCTAGGCAAAGTAGAACTATGAAACACTGAGGAGACT 

TGGTAGCACAGAGCAATGGGAAATGTGAATAGTTTGTTGAGGTGGACAGACCCAAGCT 

GGAACCATGGCAACCTACCTTATCTTACTGAGCCTCAGTTCCTCCAGCTGTGAAATGG 

TGTGGGGATTATCTGAGTTAGCATGTTCC^TAAAACACAGTAA 
AGAGAAATTTGACAGTAATCATTTAGTTACTGAACCCTGTCTTCATATTGCGCTAGGTTG

CAATCACATTTCAGAAAAAAGGAGGCAGTAGTGGAGCCTTGTGGGCTAAAGTGGTGAGGAGGGTGTC^ 

GTCTTCCTCTTGAAGAATGGGTGCTTCTCAATAGGGAAGTGGAGGAACAACCTGGGAGTTGTCA 

TGATAATGGGTAGGTCCAAGGAGGTCCTGAGTAGGACTCIXIAGCACAGAGCCTTTC^ 

AC CCTCT AAACCCTTTGG C CTCACC CAC C C C CTTG C CTGGTT AT ACTC CT AG C CTT Lin CAG ATCT CCACCCT CAGT ACTC CCTC 
AGGAAGCTTTCCCTGACCTCCTGACCC^TCAAGTGCTTCTTATATATCrC
ATTTGTGAGATTATTTGATTAGTGTCAGTCCCACTCTITAATTCT 
ATCGCTTCATCCCCTAGTTTACTGCAGTGTGTGGAACATACMACGGGCm 

TT AGTGAGG T ATTACCTAGGTGGG AGAGAGGATTT AT ATTTGGG ATTG CT AAT C C CRTT AAGTG CCAAG CACTG CATT AACT CATT 
TAATCCTAAAAAAAACTCCTATGAAATCAAGTGTATTATTCCTT^ 

CATCTTTCCCCX3TTGACACAGCTAAGTGGTAACAGTCGGTCTTGAGCCG^

TCTTCTCCTGCITAGTGCIAGTAGAGGACAAAGTGAGGTC 
CCCTCCTCTTCTTCATTACCATTGTCATCATCATCATCATCAATC^ 
ATCACCTCTTACATTCATGTTAACTCTACATTCTGCCAAAGTO 
CCTGATTCCTAGAAAATTATAGACTACATCCTGTAACCACCTTTGAAGAACCAG^ 

TTAAATAAAAAGAGGTAATTTGACTAAGATTGGTATGATCTGA

GTGGTCATCAGTAGGTATTGAGTCATAAAGCTCCTt^A 
CAAGAATGATAGCTGTTACTATAATAACATAATTACTACTC 
TACCCTGGAAATAGAAGTCACCTAAATCAGTCGTTTATTCCAG 
GGTCATATTGTTTAAAGTTCTCTAGGTGGAAGAGTACrTTACAGTAA 

TATTATTTGTTGAGCATTGTTGTAAAGGCTTTACATGCATTA^

CATCTCCCTTTTACAGATGGGGAAACTGAGCAGTGCCCAAAGTCA 
TTGCGCCTAAAGTCCATGCTCTTACAGTGTCCCTGGGTAGCrGAGTACT 
AGCTTAACTCnTTTTTT TTlTTT T T rTTTTTAAAC^ 
CACTGCAACCTCCACCTCCTtKXjTTCAAGTGATTCTCCT 

GCT AAT ATTTTGT ATTTTTAG T AAAG ACAGGG CTT CACC ATGTTGGC C AGGCTGGT CT C AAACTC CTG ACCTCAGGTGATCT ACC C

CTCTCGGCCTCCCAAAGTCCTGGGATTACAGG 
TTCAGACGCTTTGAAAAAAACCTAACAGACACAGTTGCT 

GC^AGACTGAATAAAAAGAACTAAGTTCTCCGTAGAGCAAATGAATTTCTGAATCGATTC 

CCT ATG AAGT ATGGT CT C CCCCATGTT AG AGTG CCAGTG TCTTCATG T AC CAT AT AACTTG CAAAT AGT ATCACTG TCTTCT CACA 

CTTCAGCrTTCTTCTTTCATCAGTATTTTGGA

ACCAGATCATGCGATGCTGACTTGGACTCTTTCAAATGGATCa^ 
CAGGAATTACCCAGTTCCTCATTCTTGCCACCCTTCAGCCGTGAAGTA 
TCC^ACCTCTTTCCTTCAGTACCAGAGACTAATTGCCTCATAATG^ 
AATTATC^ATTCCTTCTC<jTGTCCTATTAAATTGGAAAGCCTAGCTGATTTGA 

TTGCATTCCTGTGCTGGTGTCACTTAAAAGTGTAATTACATC^TTTA

CCCTCTGGGATATATCATCATAACAAAGACTGATTTCAGATGTCAACACCAAGAAACACAACT 
ATCTAGGGACAGCAGTCAGCTTTGGAGAAAACCCAGAACTTCTCATTCCCCATCCCACA 

AAAGGG AAAG CACTGAGTT CAT AATT AG CTTT AT AC C CAGAGTTTTTTTTTTTT AAATGTT ATAC CTGTTTAAGAAATTTTTTGTG 
CTTTGAGGTGGTCAAGATCAAATTTAATTAGCAACCCTCTTCGTGATT 

AGAAGGTTTAGAAGCAAACCTTTGCAG CAGT^CTAG CCAAGTGCATGCAT ACAGGCCTTGAGGTGGAGCTG CACAGTTGACTGTAG

GGGAGTCTGTCCTTTGTTGACGAGTGAAACTTCTCTCCCTGCAAAC^ 

CTTGGATGAAATATTGGGTTGGGAAAAAGTCCCTACTAAGAAAATTGTACCAAAACCCTGC^ 

TGAAGTTTATAGTAGAAGTAGGCTAGTCGAAAAGGATTCTGTCTCTTTCCAAGTATTATTGGCAC^ 

TATATCATTAGGACTTCCATCAAAGAGTTATCTTTAGGTGTGATTCCCATCCTCCCTCCCTCT 

CGTTCCTTCCTCCCTTTCCTCCTTCCrrTTCTACCTTTCT

GATTTATGCATTTTGAATTACTGCATAGTTGAGAG^ 

AGATAGGGTGACCAACTAATCTAGGTTTTGGCTCCAAAAACCCTGTGTCCCAGAAAA 

AGTTCATCTTGTGAGGAGATTATAGTGCATTTCCAAGTATTTAGCTGAATTTATCACAGGGCTCT 

GTTAAAATAAGGTCTTTTCTGTTATGTTAGTGTGACTTTAGTCTT^ 

AGGTGACTAAGGAGAAGAAAGAGGATATCTAAGTGTTTTTTGCTAAAAATGTCATT

TGTCATCATTTTAAAGACATACCAATAAGAAATAAATTACTTTAAATAGAAAACTCT 
GAAATCrAAGGATTTGTACTTAAGGAGTACACTTTTTTGAA 

GAAGTCAGAGAGGGCTACTGAAGGCTCAGGTTTTCTAAGAGATGACTGGAGGCCAAGTCCTC^ 
TTCAGAACCAGGTCCCrGGCAAGATTCTCCTGTCTAATCACA^ 

CCTTTGATTTCCATCAACCACCAGTTTCTTTCTCATCAATCATT^

GAAGGGATGTTCTTGGTTAATAGGAAAGGAGGCTTCATGACAACACAGCATCCTCT 

T ATTG TG TTTT AAGTGGC CTG CTT CTTXK5AGTGT ATCTTCCCACT ACACAAAAC AAC CAC CA 

CTG TCATTTGTTCAT CACAT ACTTTT AAAAACAGGG CTG T AATCATGT AT ATACAT ATATGAATAACTTTATATTG T ACT'OTT AT 
ACTT ACTGG CTT ATCAT AAACATGTTTCCATX3TTG CTC<j AGAG CT AACTTTTGG AATGTG CTTCTTAT AT AG AT CTG 

CAACTATCTTGGGACC^GCVJUKs^
AACACTGTGAGGATGGGATCTGCACTGTAACTGCTGGACCCAAGGACATGGTGGTCGGGTAAGTAGGGOT
GGTAGGAACAGATAGAATATGGGATGCAGGAACTTAGAATGAACAGGCCCCTTTC^ 

CT CAAAAAACTG TGTAAACTTGTGCTTT C ATTTt^G ACTCCAGGAAGT AAGG TG AC CACATG TCTGGATTTG CTTG AG AC AG C C CAG 
TTTGCCCCTGACCrAATTGTTTATAGCACCCrCTTCTACTCTC 
ACCTACAAGGTATTTGTGCCACTATTAA ITi'T 11'rrCTCCTAGAAATAAAAATAAGGAa^TTTAAAGAAACATAACTATAGCAG

AAATCATCACTTTATGTATATTTTTACATTCTTGATGATTTCTrC^ 

CAT AAAT AT CATTTCAGCTCTCTG CTG CHT* AGAAAGTAATGC^CAG AT AAGAAG CCAAGG AATG CAAATTT ATTTTGGCATTCATT 

AGGCTTGAAATGTAAATAAAAAATGACGTATTCATTTAAGGGTTGATCAAATA 

GAAGTAATTTO\ATCCAAAAGAATTAAGCTTTTAAA 
ACTAAAATGGCACAGAGAATAATAGTGTAGCACACAATAC^

CTTTC T l " r GG AAGTCCTT AAATGCAAAAG AAGAAAT TT T1 A ATTTTGTTTAATTTTT AAAAACATTT^ 

GGTTTAAATAGTTTATGCrGAAATCCCAGTTAATCTGGCrTAT^ 

TGATCGTTGCATTTCAGGCTTTGAATTCCXjATCAA^ 

TTCTTTGTTTGACTGTGAAAATATTTAAGTGTGAA 
TGTTAGAAAAAAAAAAGACTCTAGATTGAAGGAAGAAGCrTAAATTGAT^

AACATCCTCTTGAAAC^CTGGCGTGTCTCCTGTTGTATGTCAC^^ 

TTGACCTAGGAGAGCTTTGAAATCCAGGCTAATTATC^ 

TAGC^TTCTGTCAGTTTGAAGTGTTTATTTTCTTTGGTCTT 

GAAGTTTTCTACTAGTATTTATTATTATAAAACCTCCAGTATC^ 
TCAGG ATTTTAG AAAGAACTAATGGTT ATTT AAATTTGAGAC CTT AAAG CGGGAAT AG AAGT AAATTGTTTCT AAGGAAAGTGAG G

GATTCTTTCACTCTTGTAGTGCAGTGGGGTGTGGGCTCAGTAGTTGGGAGAA 

GAGCATCTCAGCTCCTGAATTTGATCATTATGATG 

ATTTAATCAAGATGCCACTTGAGAGTTACAATAGACTCAGTAGTCrTAA 

TT CT AATTTG ATTTTT AG CTGAGAAATTTTTGCTT ACAATGTTT CCCAAAATACACT ATTTCTTTCAT CATTTGTTTCT AT C 

TCTCTCTATATATACATTTTTAAGATTTTCAATGTCAAAACT^

CCAGTATTTTATTCCCAGTACTGTTGCTCTCAACAGA 

CAT AAAATGAGAGAGTTTCACT ACAG AAT CTT AT CTT C AGG CTGTTT AC AT CTC CAG AATTCTGGACTGAACT AACATT ACGTAAT 

ATTATGAAATAACATGAGCAAACTTGCCACACACATATTTATGGTATGACATTGATTAT^ 

GAGTATCACATAGAGAAACTTAGGAAGATAGCCAAAATTATGTCrrAGACTTGTCA 

TTCAATAATACAGATTAGGCAAAGAGTATAGATTCACAAGCAAGACACT

ATGAGAAGACAAATAACTTTTAAATTGTCAAAAC CAAAAG AATT ACTTT CTTTGG AGTG TTT ATCTTCAT CTTTC C CTCAG T ATG A 

ATTCTTAATAAACTTAGTTTCTACTTTCAAAAC<3ATTTTATCTATGT^ 

GGTTTGATTGCTGTAATGTAGGGGATTTGTCATTTTAATATTTTTTAAGT^ 

TAATTAG CCTTT ACTG CCHTT CTTC^T AAAAAAT AAT AATAAT AAT CTT AG TTTGGTGTTTT ATTTT AAACCCTG AAT AT CAACAA 

TTTGTTTCAACATATTTTAGTTCGTTGACATTTTTATTCCATGGAAAATATCT

TACCAGTAATCACCACACTCTCTCTGCTTTCTGGCAGTTTAGAG 

CTGGTAAGCACT AGG AGTAGATATATAAAT ATCAAAATACAG AGG AG CTGTTGTTCCCCTTTTTG 

TCCATGAGTTCTCATCATTTAGCTCCCACTTACAAGTGAGA^ 

CAGTGGCCTCTAGCTCCATCCATGTTCCTGCAAAACATATGATCTTATTCT^^ 

TACCACATTTTCTTTATTTATTTATGTTATTGATAGGGAATTTA

AACATT AACATGCATCTGTCTTTATGGTAGAATGATTTATATTCCTTTC^T^ 
TAGTTCTATTTTTAGCTCTTTGAAAAATCACCACACTGCTACAAAATCAATGTGCAAAAATCAC^ 
CAGACAAACAGAC^GCCAAATCATGAGTGAACTCCCATTCACAATTGCT^ 
GGGATGTGAAGGACCTCITCAAGGAGAACTACIAAACCCCTGCrTAATGAAATAAAA 

TGCTCATGGATAGGAAGAATCAATATCGTGAAAATGGCCATACTGCCCAAGGTAATTTATAGATTCAATGCCATC

ACCAATGACTTTCTTCACAGAATTGGAAAAAACTACTTTCAAGT^ 
TAAGCCAAAAGAACAAAGCTGGAGGCGTCACACTACCTGACTTCAAACTATACTACA^ 
TGGTACCAAAACAGAGATATAGACCAATGGAACAGAACAGAGCCCTCAGAAATAATACC 
CAAACCTGACAAAAACAAGAAATGGC^AAAGGATTCCTATTTAATAAATGGTGCT 

GAAACTC^ATCCCTTCCTTACACCTTATACAAAAATTAATTCAAGATGGATTA

CCCTAGAAGAAGACCTAGGCAATACCATTCAGCACATGGGCATGGGCAAAGA 

AAAG CCAAAATTGACAAATGGGATCT AATT AAACTAAAGAG CTT CT CCG CAG CAAAAG AAACTACC AT CAG AG TG AAC AGG C AAC C 
TACAGAATGGGAGAAAATTTTTACAATGTATCCATCTCxACAAAGGGCTAATATCCAGAATCT 

AGAAAAAATCAAACAA CC C CATCAAAAAGTGGG CAAAGGATATG AAC AG ACACTTCT CAAAAGAAG ACATTT ATG CAGCCAAAAG A 

CACATG AAAAAATGCTCTCATTATCACTGGCCATCAGAGAAATGCCAATCAAAACCACAATGAGATAC

ATGGCAATCATTAAAAAGTCGAAACAACAGGTGCrGGAGAGGTTGTGG 
AACTAGTTCAGCCATTCTGGAAGACAGTGTGGCGATTCCTCAAC^ATCTAGAACTAGA 
CTGGGTATATACCCAAAG<3TTTATAAATCATGCTGCTATAAAGACAC^ 
AAAGACTTG<*AACCAACCCAAATGTCCATCAGTGATAAACTGGA 

CCATAAGAAAGGATGAGCTCATGTCCTTTGTAGGGACGTGGGTGAAGCTGGAAAC

AAAACC!AAACACTGCATGTTCTCACTCATAGGTGGAAATTGAACAATGAGAACACTTGGAC^ 

GGGCCTGTGGTGGGGTGGGGGGAGGGATAGCATTAGGAGATATACCTAATGTAAATGCT 

GGCACGTGTATACATATGTAACAAACCTGCACATTGTGCACATGTACCCTAGAACTTAAAGTATAATAAA 

AAGGG CAAAAAT AAAACAAAAAAG AAAAAGCAC C ACACTG CTTCCACAGTGG CTG AACT AATTTGCACTCCCACCAG CAG TAT AT A 

AGTGTACCCTCTTCTCCACAGCCGTGCCAGCATCTGTTATCTTTTGACTTTTTAATAAAAGCCAT^

TCTCATTGTGTTTTAATTTGCGTTTCTCTAGTGAGCTTTT 
TAAAACAGTCACTTATCTTTTAAAGAAACTTTTTAAATCAG 
TGTTCATTCTGTTTCTTAC<jTCAAAATTTCTATCTC^TATCATTT^ 
GATGCTGATTAATTCTTTCAGTAGGTGTATGTTTTCATAGCT 

AATTTTAGGTAGACAGGCCGGGCX3CAGTGGCTCACGCCTGAAATCCCAG

CAGGAGTTCAAGACCAACCTGACCAACATC^AGAAACCCCGTCTCTACTAAAAATACAAAATTA^ 
TGTAATCCCAGCTACTCGAGAGGCCGAGGC!AC^AGAATCACTTGAACCTG 
TGCACTCCGGCCTGCGCAACAAGAAAGAAACTCCGTCTCAAAAAA 
AACTATCTTGTTCCACTGTCTTCTnATTTGCCTT^ 
TTTTI I CTCTCTAGCAGCTTTTCAGATTTTCrCTTCCTCACTCGTTTTAAG
GATGTTGCTTGTGCTTGGGGTTCATCGAGATCCT

TTTTTCAAATATATTTTCTGTCCACTCCTCTTCATCT^ 

CAGCTCACTGATGACCTGTTlTTTAAAAAATClT'rn^ 

ATATTTTTGTCTGTAGTGTCTAATATGTTCTTAATTCCATCTAGTGTATTTT^ 
GATTlX » 'r ri 'GGGGT Cri"l"l"l I 111 CATATCTGCCATGTTACTCCTTAACATAGCAATGC I' l ' l C 'l 1 C 1 I T I' IVIXS ATCATATCGAAT

ATATAATAGCTATTCAATGTACTTGTGTACAAATTCTGTCTTCT 

TTTTTCTAriTCriTCCATGC CTGTTAG 1 1 1'l'lHATTGGATGACAGGTATTTTGAATTTTGTATCGTTAAGTC^ 
!TT T l " rrrri M r TTTTTGATGTGCTTTTAAATACGTTTGA 
TTTCTAAACTTACTTTTAAGCTITATCAGAGAGACCA^ 
CTTTCTGAATTTTCAACCTGATGCTCCATGTATTACAAGATTTCTT
CCTCCACCTCTGTGGTGGTTCTTTTCCCAGCTTTGACATATTTCCT 
TGTATCrCT G 'lU'UXSCG U TTr C l'TCCTTTCCGGTACTCTGCTTGTGAGT^ 

TCCTCAACCTCAGAGATTGCCAGGCTCTGTTGGGGTTCCTCCTTCCTGTG CTGCAGC CTGGG ACTCTTTAGGCATTAAGCCAG AGT 

AATTACAGGGTTCACCTTATTTATTTCCCTTTTCTT 
TTCATGTATTTTAGATATTTAAGGTTCAAACCAGTCTGTTTTACTCT

CATTTGTT AACCAGAG T AAAT C CTT CATT ATTCACATAATAAATT AAT AAGGATG ATGTTTTTCT CACAGGGACT AG^ CAA 

TATACATGAAAAGCATATTATTGACAGTAAAGTGTAAGATGCTACACAGATGTTTATCATTGCTAT^ 

TTTCCTGCAGTTAGGGAAGTTCTATATGGGAGTAAGGCTGAAAGGGCCAAAAGATATAGGT^ 

TTCTATGCATATAAGTATGTCATGTTGCATATTTTTCTGTGCTGTATTAATTCATGCATT 
ACTCATATGTCAGGCATTGTTCTAGGGACTAGAGATCTCTGCCTTCAAGGAGCTTATT

CTTAGCTATCCACTTTTTTCATCTGCCTGGACACGTGACTTATT 

TTATAACTCCTATTTTCTTCCCTACTTTATCTGACCTTCCTACCTTAGCTTG 

CTCT T TCTTTCCTCTACTA ATT T TTTTTTTCT T TC TT TC 1 IT C T GAGTAAAAGCCAG AGATCTGGCCC CCTGCTTACCTCTCTGAA 
TTCTTTACTTACTTCTGACAACTTGCTCATTTCACTCCAGCTACATTGAC 

GGTCCTACCGCAGGAATTTTGTACTTACTGTTCCTTTTGCCACATTCATC^

AGTTTTTGCTCAGTTGTCATCTTTTTACTGAGTGTCCTTCCTGACCT 
TCCTGTTCCTCATTCCTACCTTATTTTTCTGGATAACACTTATTGCCT^ 
TCCGTTGTCACACTAGCATATAAATTCAGTGAAGGTAGAGATTCTTTTACTGCTGACAA 
ATAAGGATCTGCATAAATATTTGTTGAGTGAATGAATCTCCTTGGTAAAGTCCTTT^ 

CTTCTTTGTTTCATCTACTTTAATTTCAAAA

AAGCTTATCACATATTTTGTTTTATAAATATGTAGCCTTCGTGAGATAAAAGATTCTCCATCCCT 

ACAACTCTAATGGTCATAAATAATTCCAACCTTATAGATAACTCAGGAGAAGATGAGATTATAAGTAGAC 

AAACCCAAGGACAATTCAAAAGGAAAATAATCATTCCAAATATAATATTCTTCT^ 

TGAATTTTT AAT ACAG CT C CCATAGAAAAT ACTTT AAAATGAACC CGGT AAG ACTT C CT CATT AAG ACAT AT AG ACACAT ATG CTC 

CCACTCCCGTAATCAAATTTGGAAGTCAAATAAGCTCTGAAAACCAAAACATTCTTCCAAGT^

AAAACCAATCTGACCTGATCTGTAGCTGTTTATAGTCTTTATTTTTCC^ 
CTAACATGTTTGACTACATTGTGTGCCCCAGACCT 
ATCAAAAATACCCTAAATTCTGAAATGCATGTAGCCCCAAGAGATTTG^ 
ACAAGTGAAGCCAGATTCCCTAGACATTAAGTGCCTAACTTTTGGTTGTTGTTGCTGCT 

CAAACCTGGGTATACITCATCTGACAAAGAAAAAAGTATTTGAAACA

AATCCTGGACTCTTGGTGCACCCTGACCTTGCCTATTTGCAAGCAGAAGGTGGAGGGGAC 

TCCTGGCCTTGATCCTCCAAGGGGTCCAGGCTTTGGTTTTCATCTGTATGAATTATATC 

CCCACACTGCTGTCACCTTTTCCTTGCTCAGAAACCTTTGATGG 

GTCATACAAAGCCCTGCTTCTCTGGCCCCAGCAACTTrCAGCCTCATCTCCAGCCCACCT^ 

CT CTGAAT AACTGAG G CAT ATTT AT ACCTTTGT ATGTGTCTTC CTTT AC CAAAAACACCCCAC CGTTTC C ATTTCAC C CT CTCG AA

AACCAATGCAGAAATCAAGAOTTTGTCTAAAGAACACCrCTGT^ 
TTTCGGCTCTGCAGACACATTATAAACTATTATCTTTATC!ATAAAAA 
TTTTTCAACCCCATTCCCTGTCCCACCACATGCCCCATTTCTACT1 1'i'GTA 

GATACTCATGTGATTCAAAACCGATTATCAGTTTTCCIACTAAATATACTCCATCTTCTGATACCCAGAC^ 

TCACTGACTCCCCCTCCTCGCCTGTCCrGCGAGGCTGTGGTC

CATTGTTCCACTTATAGTAGTCTTCTCCCTC^TCTCTGTCCCTCCCCACTTCT 

TACTGAAGTTCTCTATCTTGTTGCTGCTCCGTGTCTC^ 

CAAGAGCTGGCCTCAGGCTCCATTTCCAATCTATTTCCCGTTTTGCC 

ATTTCTCACTTTCCTGCCTTGGCACATTCTTTCT ATTTTTG CTCT CTGGAATGCCTTTATCC ACTCTCTTATATG AA 

TCTGTTCAAATTCTGCCACCTTTTCAATATTCAG

TCCCAGATCACTTCATTTGTAGCTCrCTTGTAG 

TATCCATCAGCGTGAAAACTCCAGCAGGAGGGAAGGAAACAGATGTCCCTGAGTCAT^ 
CCTACTATGTGCCAGGCTGTCTGCrTGGCACTGTGGAATGAGCrCACTCCCT 
TCTCAAT ACAG CTGGTGCCTAGTAGC^TI^ATTCAATTC CATT CCATCACATCCT 

GGGAAAGGTGGGGGAAGGACAGGAGAGGAGGGAGGCGATCTGATACACGGAAGAGAGTTT^

TCTTCTITGATGCCrrTCATATTGGATGACATAAATGAGACT 
AAAATXSC^TTTTTATTGTAAGTTTACATTATTTGGGCTTTATAAA^ 

GGACTGTGCTGTATGGCCCTAGATCGGGAAAAAGAGCTAATCCGCCAAGCAGCTCTGCAGC^ 
GGTCOMCTCATGTTTAC^GCTTTTCITCC 

ATGACAGTAGTGAGTACTTCACTTCCAACAGGGGGCACACCAAGAATAGACTTCC^

GTCCTGGGGAAGGTAACTTAATCACTTTAAGCCTCAGCTTTATCA 

TGTC(^^TTAAATGAGGTATTTTAAGAGGGGCTCAATACATGTAATTCTTTT^ 

AAATGTCACATCTCATCTTATTTATCAATGTCTTGCTATAAAGATATATGG 

TATCAAGAGAGAAACTTGAAGTTAAATGAATGAGTAGCATAGACAGCA

CATGCCCTGTAGTCTCACCTACTTC<5GAGGCTGAOTTCGGAGGATATGGAGTTTCAGCC 
TCTCAAAAAAAAAAAATATATATATATATATGTATATATATATTAATTTATAGTTCCTAG 
TTCAGCATTTACCATGCAACTTTL~rrGi~rrGATTA 
TCAGACTCATTGAAAGTTTATAGAATTTGGCCTGTGTGGAAAACTCT 
AACAAGTTTCrAAGGGAAGTGAAAGGTT AT AAAATGAT ACAG ACTCATAC CG CAG AATC^ CAGCTGTTGGAGAAAAA
AAAAAAGGAAGCITTATGCTATTACCACAAGAAAGTTTTG^

TAACATCCCCATTTTCCTTACCTGTAAAATAAAAGGTAGTAAGTAGTTTAAGAAAACTC 
TGTAAATAGAATAATAGCAGAAT(^TGGAAACAAATTTGGAATTGAA^ 
GTACTTCCCAATACATCAGAAAAGCTCTTTCTCTTTTrcTGTCAGT 
TACCTCATTTGTTTTGGTGATCATAAATGACCCATGCCCAAGAAGTGTTCCGAGAATTTCA
TGTTAGTGAAAGCATGTATTTAAGCTTTGTTTTGGTCATGTGTGCT 

AGGATT AATCATGTTATTTGTTGTTTTTCC C CTGTGAACAGAAGCC C C CAATG CATC CAACTTGAAAATTGT AAGAATGG ACAGG A 
CAGCTGGATGTGTGACTGGAGGGGAGGAAATTTATCTTCTTTC 
TGAAGAGGTGTGATCTTGAGAC!ACTACAGTTCT 
CCACAGAAAGTCTTAAAAAGCATTTACGTTTTACCTCTTC^

CITTTAGATTTCCCATGGAAGT AAAAAGAAATAAG AAAACCX5 CAGT ATCTTT AT ACATTAAAA 

TTAATATCTGTACTTTGTTAAACAAAAATAAATAATAGCCCATTCTTGAC^ 

T AGT ATT AAATTG TTTTTATTTCTTTTG CAATG CTTTGAT AT AACACTGAG AG AACTTCTCTG AAGA 

TAAGAAAAGGAAAGAAAAGAGGGAATACAGGGAAATTCTCAGAGC^ 
CGTTTATCTTTC^TTAGATATATTACAACATTGATAGATTAGTAAAAGCTG

TTTTAAATAAAATGATAGTTGGAAAGTGTAGTATAATATCTATTTACT 

GAAAATATAAATAAGTAAAATAAATCAATAAAAGACATGTGGCTTTTAAAAA 

TAGTCAGAAAAAAATGGGATCAGATTTTAAAGTAAACACATATATACTTTATCCAAA 

TGCCTGTAATCCCAGCACITTGGX3AGGCTGAGGT 
2 0 CTGCTGTACTCCAGCCTAGATGACAGAGCGAGACCCTGTCTCAAAAAA 

CTTCTTCCAATCCACTCTGTCCAAGTAATACTCATTTC 

TATTTGTAAATGAGATCCTTAGTCATTATCATTACATTGGAATGGCTTAGTA^ 

TAACTAGGTCCAGTGTAGTCCTACCTGAAGGCAAAGGAGATTGTAAGATC 

CATAGAGATAAAAACTTTGAACAAAGTCAATAGTGTGGCCCTTTCTA 

2 5 TACATTGTC!ACATTTCIGTAGAAGTGAATAGAGTATTTATAAACATGCTGAAC^ 

GTGTGCAATTTAAGCATTGCATTTrAATTAGAAGGGACCT 

TATTATAGTGAGACAGTTTAATAGTGTGTTAAGAGCCTGGACrCTGAAATTATACTGTCT 

ACTAGCTGTTCAGCCTTGGATAAGTACTTAACCTCTTTGGACCCCAGTT^ 

TACATCACAAGGTTGTGGTGAGGATGAAATCAGTTAATATGTGTATATATGAAGC^ 

3 0 TAAAACTTATATAAGTTAATTATTITTATGTACATCTATATATTGA 

GTT CK3TGTTT CTCAGG ATTGGCTT AAG GATG GAG AAGC AGT AAGGT CAAAG ATT AAAAG AAAATT CAAG TTAACT AGGTAAAT CT A 
GT ATT CTGTCATTGTG AATG AAAG TTGGGGCGCATTACTGTGG T AAAT AATG AAAAAAGTC AG TGATTTG TGAAG ATTTT AAAAGA 
CrGAACCTTTTGATCTTGTTTTTTAAAAGTGTAAATAGATAGTAGGTAGAATATTAAACCA 
ATACTATTTACACTATGTGAAATTACACACTTCAATGTGATTGTTTGCAGATGACA 

3 5 ATGGTGGAGTCTGGGAAGGATTTGGAGATTTTTCCCCGACAG 

ATTATTTTTAGAGATGGGATCTCACTCTGACACCCAGGCTGCAGTG CAGTGGT ACAATC^CAGCT CACTGT ATCCCCCAACTGTTG 

GGCTTGAGGGATCCTCCTGTCTCAACCTACCAAATATCrrGGGACTACAGGCATGCTACC^ 

TTTTTTTGTAGAGATGATGTCTTGCC^TCTTGCTCAGGCTCG 

AAAATG CTGGG ATT AT AGGCATGATC CAT ATT ACTATCATCATTTT AT ACTTT C CTT ATTG AAATTT AG T ACTT AG CTTC CAT 

4 0 CTATAAAAAAGAAAAATCATAAATATTTAACTTCCATAAAACAGTACTATTTTTAAGACTCATGCAG^ 

TACTGTAGAGTTATGGGCAAGGTTCTGTTAGATCGTTrc 
CATAAGACAGTGCCCCACAGGTCCTTTAATGAAAATGTTTT^ 
GGA.CCTAACTATGATAACACCCTAAAAGGGTGTTCTAGGAGTCATCTTCT 
CCCAATGGCAACTGATTAAATGATCAGTTTAATCATTT 

4 5 TACAAGTAATTGAAGAGTATTCTCCTATTTACAAGTAATTATCTGGATTGGAAACAATC 

AAATTATGGTCTGTCCTTTTGATGAAATATCIATGTAGTTATTCAAAAGAATGAGTACT 
TATACAGTGTTAAAATATTTTAAAAGATGGGTGTAGAACAAAGTGTGTGTAGAAATGATT 
GAAGGATACCTAAGAAATTAATAACAGTATTTACCACTGGCTGGTACACTGTAGGAACT 
GAGAATACCACTGTATGCTTTTTTATATTTTATAAGATCTTTATCCCTGTGAAT 

5 0 TGCTTAAATCCCTAACATACCACCAAGAACTAATGTGTTATGATGCCAATGTAAGCAATG^ 

CTGTAGCCATCTCTCTCTTTCCATACCCACTCTTTCTGATCTTGCTCT 

AAGAAAGTATATGCAGTGAATTTTATAAGCATGATATTTACAACTAGAAGACATTTGAAATAGA 

TAGTACATTAGCATAGCATTGTATTTTGATTTTACAGTAGCAATT^ 

TCTCCAGGACACGGTGTTTTTTACTACACTCTGTCTAAACATTGGGTATA 

5 5 TACTG CAAAAAAAAAAAAAATGTAGTCCTACACC AACATGTGGTTCTT CGT ATCCTG CACTTTGCCATTGTCTTCAAAACTCCAAA 

GT ATAAAG AT ATTAAT ATT ACAAAAC CAG CCT CTGTGTTTGTC CAG CTTCGGAGG AAATCTG ACTTGG AAACTAGTGAAC CAAAAC 
CTTTCCTCTACTATCCTGAAATGAAAGGTAAGTCAGTTGTTTAAAATCTTATGCTCATAT^ 
GTTTTGTTTTGTTTTGTTTTGAGACAGAGTCTGACTCTGTTGCCCAGGCTGGA 
CGAACTCCTGGCCTCAAACAGTTCTCCTGCCCCAGCATCTCAAATAGCT 

6 0 TTTTTTTTTTTTACTTTTAATAGAAATGAGGACTTCCTACA 

CCTTGGCCTCCOVAAGTGCTGGGATTACAGGCTTGAGCTGCCATGTCTT^ 
AGAAAAGATCAGTTAGTCACCTCCTTTGAACAGCTCTCTAGTAACAGGTCCCT 
AGTATATTATTTTGAGGTCATGGAATTACTCTAGCTTTTTAAAATAATGTTATTTCC^ 
GAATTTGTCTTTGAATTTGAGAATCTAACATTTATTGACTAACTCAGAGTTCAAGCAGCrc 

6 5 TAGGAGAATCAGCCAGTTAGTCAACCAGCCTCAGGCGGTGGAACAGAAAACTTGGCCAAC^ 

TAAGGCATCATTCTTTCTCTCACAAAAGATGCTTTAAGAAAGGGC^ 

TTAATAAGTACCGAAAGTGAAGCATCCATAAAAATATAAGAGAATGTTGAAGAGAGAGCCCTTG^ 

TCCAAAAAGAAGGCTTCGTCAAGGAGATATCATTTGGCCCTCAGCATTTC 

GAGAGGGTTCTCTAGCTTTAAGTTACTTGAGGAGCTTAAAGAGAGAGGTAAACAC^ 

7 0 AG CG CTCG GATGGACIATG AAGGCCAATGTGGGT AAGGAAGG C CAGGG CCAGGCT AAAGAGGGCTGTGAATGGCAG CTT AAG AATTT 

TGCATTAATTCTGGAGAGGGATAGCCGGGCCTGCCTAGGrTT^ 

TTTAGGAGGCAGATGATGAGTCAG AAG ACT AATG CAAGAATCCAAGTGAGAAACAGTGCAGTT ACT 
TGAAAGGAGATGGTGGACTTAAGAGAGACTGGAGTGATACTGTGATACACTGATACTGTC^ 
GGAGAGACGGGGAGTCATAGATGGGTCTAAGACTCCCTGACCAGGAAGCnTGTGATGTGGTT^ 
7 5 CAAAAGTAGATTTGGCTGG^AAAAGAAC^AGCT 
CTTAGTAGGCAGATCAAAAGGCAGAACTGGA^ 
ATAAGAGGGACCAACTGAGTCATCATTCIAAAGGTATTTCT^ 
ACTTTGATATCATCAGCTTCTTTTATAATCCTGTTTTGTATm 
CACCAGACTCCAGAAGGGGCTGTGGCACAAACAAAATTAGAAACCCCTCAGTTGGAG^ 
5 AAATGATTGCCACGGGGCACCCAGATCCTTGCTGAGGTTGGAAACCGCAAAGGTGCAG 
TCTCCTGTGAGAGCAA.GATGTGATAGGTAATCTTTGAAGAGGGCTTTCCTTTCCT 
AACGGTTCrACAAGTTCTTGTTTCTTTTCTGAGGTTGTGGCCTCT 

GGTTTG AAAACATTGTCAG CACAAT AACTTTTCTTTCTT CACATG AG ATTT CAT AC AAAATC C CAGTGT ATGGACATGGT AGGCTT 
TGATT AAAAT AGGGGTGAAGGGCCCTGCCAG CTCGGCCTTCCGAGAGGCTGTCCCACCTCTC CTTACTTCCCTATCAGGAGCTGCT 
1 0 GGAGGTCTTTTGAATGCTTCTGCCr ACACAAA^ 

GAAGAAAAATATGGATAATTTTTTTAGACTTTCTGAGGAAAA 

TTTTG CT AACACAGAGAATCATG AAGAAAGAAAACC CAGG TTAC CTGT ATG TG T ATG AAC CT AG AAACATTCAAAGCTCAGCTTGG 
TATCTAGGCTGCCTGTCTCCCTTTCCCAGTGGTATCT 

TAATGCCTCGTTATCATAGTAGGAACTATCCAATGCAGGGATTAGAGAATGGGGTCACCT 
1 5 TTCATGCACCTACCTTCCAGGCTCTCATTGATGAAAAGCITCATACA^ 

TTGAGCTCTGAAGGAAAGAATGGACAACATATATATATATGAGGACAACAAAT^ 
AAACTCAAGATAATAGGACC^AGGAGAAATAATCTGACCAGGCAATATAAC^ 
TAAGTAGATAGAAATGGATCCAAACTGTACAACCCAGGGAAACATTGCAT^ 
CCCCCAGTCTCTGT 


HUMAN SEQUENCE - mRNA 

GGCCACCGGAGCGGCCCGXjCGACGATCGCTGACAGCTTrCCCCTGC^ 

CTGC^CGC^CCACCGGCCCCGCTCCCGGAGCCCAGCGCCGCCG 

CACCGCGGGCCCTGCCXSTTCCCTCCXjCCGCGCTGCGCCATGGCGCGGC^ 

2 5 GCCCCGACCCGCACTTOGGCCCGCCCGGGCTCCGGCCTGCCGCCGCCTCrTCCrTCrCC^ 

GAGAGCCCACCCGCGCCAGGAGGCCGAACGCGGACTCGCCACCCGG 
GAACAAATGTTTCATTTGGATCCTTCTTTGACTC^TACMTATTTAATCCAG 

TGGCCCATACCHTCAAATATTAGAGCAACCTAAACAGAGAGGATTTCGTTTCCGTTATGTATGTGAA 
TACCTGGTGCCTCTAGTGAAAAGAACAAGAAGTCTTACCCTCAGGTC^^ 

3 0 CAG TTGGT CAC AAATGG AAAAAAT AT CCAC CTG CATG CC C ACAG C CTGGTGGG AAAACACTG TG AG G ATGGGAT CTG CACTGT AAC 

TGCTGGACCCAAGGACATGGTGGTCC^CTTCGCAAACCTGGGTATACTTCATG 

CACGAATGACAGAGGCGTGTATAAGGGGCTATAATCCTGGACTCTTGGTGCACCCTGACCTTGCCTATTTGCAAG 
GGGGACCGGCAGCTGGGAGATCGC^AAAAAGAGCTAATCCGCCAAGCAGCT 

GCGGCTCATGTTTACAG CTTTTCT T CCGGATAGCACTGG CAGCTTCACAAGGCGCCTGGAACCCGTGGT ATCAGACG CCATCTATG 

3 5 ACAGTAAAGCCCCCAATGCATCCAACTTGAAAATTGTAAGAATGGACAGGAC^ 

CTTTCTTTGTGACAAAGTTCIAG AAAG ATG ACAT C C^G ATT CG ATTTT ATG AAG AGGAAG AAAATGGTGGAG TCT GGG AAGGATTTGG 
AG ATTTTT C C CCCACAG ATGTTCAT AG ACAATTTG CCATTGTCTTGAAAACTCC AAAGT^ 
CCTCTGTGTTTGTCC^GCITCGGAGGAAATCTGACTTGGAAACT 

AAAGAAGAAGTGCAGAGGAAACGTCAGAAGCTCATGCCCAATTTTTCC^ATAGTTTCGGCGGTGGTA 
40 CGGAGGCATGTTTGGTAGTGGCGGTGGAGGAGGGGGCACTGGAAGT 

CITATGGTGGGATTACTTTCCATCCTGGAACrACTAAATCTAATGCrGGG 
GACCCTGAAGGTTGTGACAAAAGTGATGACAAAAACACTGTAAACCT 

GCCCAGCGAGGCCACCGTTGGGAATGGTGAGGTCACTCTAACGTATGCAACAGGAACAAAAGAAGAGAGTGCT 
ACCTCTTTCTAGAGAAGGCTATGCAGCTTGCAAAGAGGCATGCCAA 

4 5 CTGCTGGCCGTCCAGCGCCATCTCACTGCTGTG CAGGATG AGAATGGGGAC AGTCTCTT ACACTT AGCAATC^T CCACCTTCATTC 

TCAACTTGTGAGGGATCTACTAGAAGTCACATCTGGTTTGAT 
CCTTGCACTTGGGAGTGATCACTAAGCAGGAAGATGTGGTGGAGGATTTG 
TTGGGTAACTCTGTTTTGCACCTAGCTGCCAAAGAAGGACATGATAAAGTTCT 
ACTTCITGACCACCCCAACGGGGACGGTCTGAATGCCATTCATCT 

5 0 TGGCCGCTGGGGCTGACGTCAATGCTCAGGAGGAGAAGTCCX^ 

TTGGCAGGCTGCCTGCTCCTGGAGGGTGATGCCCATGTGGACAGTACTAC^ 
GAGAGGGTCCACCAC^CTGGCAGCTCTTCTC!AAAGCAGC 

ATGACTCTTGGGAAAATGCAGGAGAGGATGAAGGAGTTGTGCCTGGAACCACGCCTCTAGATATGGCCA 
GACATATTAAATGGGAAACCATATGAGCCAGAGTTTACATCTGATGATTTACTAGC^ 

5 5 TGTGAAGCTGCAGCTGTATAAGTTACTAGAAATTCCTGATCCAGACAAAAACT 

TACTTAATAATGCCTTCCGGCTGAGTCCTGCrCCTTCCAAAACACrTATGGACAACT 
CTGGTGGAGGCCCTGAGACAAATGGGCTACACCGAAGCAATTGAAGTGATCCAGGCAGCCTCCAGCCCAG 
GGCCCACTCGCTGCCTCTCTCGCCTGCCTCCACAAGGCAGCAAATAGACGAGCTCCGAGACAGTGAC^ 
TGGAGACATCCTTCCGCAAACTCAGCTTTACCGAGTCTCTGACCAGTC^TGC 

6 0 TATGGGCAGGAAGGACCTCTAGAAGGCAAAATTTAGCCTGCTGACAATTTCCCACA 

GCGITGTCCACAAGACAGAAGCTGAAGTGCATCCAAAC^TGCTCAGAGAG 
CCTTTTCAACTTGGCTTCCTTTCTTGGTTCATAAATGAA 
GCTGAGCGGATGCATCTGGGGATGAGGTTGCITACTAAGCTTTGCCAGCTGCT 
65 GTTGTCCCTCTGC 

HUMAN SEQUENCE - CODING 

ATGGCAGAAG ATGATCCAT ATTTGGG AAGG C CTGAACAAATG TTTCATTTGG AT C CITCrrrGACTCAT ACAAT ATTT AATC CAG A 
AGT ATTTCAACCACAG ATGGCACTG C CAACAGATGG CCCAT AC CTTCAAAT ATT AG AG CAACCTAAACAGAGAGGATTTCGTTTCC 
GTTATGTATGTGAAGGCCCATCCCATGGTGGACTACCTGGTGCCTCTAGTGAAAAGAACAAG 

7 0 TGCAACTATGTGGGACCAGCAAAGGTTATTGTTCAGTTGGTCA 

AAAACACTGTGAGG ATGGG ATCTGCACTGTAACTG CTGGACCCAA^GACATGGTGGTCC^ CTTCGCAAACCTGGCTATACTTCA^ 
TGACAAAGAAAAAAGT ATTTGAAACACTGGAAG CACGAATGACAGAGGCGTGTAT AAGG^GCTAT AATCCTGGACTCTTGGTG CAC 
CCTGACCTTGCCTATTTGCJUIGCAGAAGGTGGAGGGGACCGGCAGCT 

GCAG C!AGACCAAGG AG ATGGAC CTGAGCGTGGTG CG<5 CTCATGTTT^ CAG CTTCACAAGGC 

75 GCCrGGAACCCGTXK5TATCAGACGCCATCTA7GAC^^ 
GCTGG ATCTCTG ACTGG AGGGG AGGAAATTT AT CTTCTTTGTGACAAAG TT CAGAAAG ATGACATCCAG ATT CG ATTTT ATG AAGA 

GGAAGAAAATGGTGGAGTCTGGGAAGGATTTGGAGATTTTTCCCCCACAGA 

CAAAGTATAAAGATATTAATATTACAAAACCAGCCTCTG^ 

AAACCTTTCCTCTACTATCCTGAAATCAAAGATAAAGAAGAAGTGCAGAGGAAACGTCAGAAGCT 
5 TTTCGGCGGTGGTAGTGGTGCCGGAGCTGGAGGCGGAGGCATGTTTGGTAGTGGCGGTG 
CAGGGTATAGCTTCCCACACTATGGATTTCCTACTTATGGTGGGATTACT^ 
AAGCATGGAACCATGGACACTGAATCTAAAAAGGACCCTGAAG^ 

GAAAGTTATTGAAACCACAGAGCAAGATCAGGAGCCCAGCGAGGCCACCGTTGGGAATGGTGAGGT 

GAACAAAAG AAG AGAGTGCTGGAGTTCAC^ATAACCTCTTTCTAGAGAAGG CT ATG CAG CTTGCAAAG AGG CATGCGAATGCCCTT 
1 0 TTCGACTACGCGGTGACAGGAGACGTGAAGATGCTGCTGGCCGTCCAGCGCCATCTCACTGCT 
TGTCTTACACTTAGCAATCATCCACCTTCATTCTCAACTTGTGAGGGATCTACTAGA 
TTATCAACATGAGAAATGATCTGTACCAGACGCCCTTGCACTTGGCAGTGATCACT 
AGGGCTGGGGCCGACCTGAGGCITCTGGACCGCTTGGGTAACTCTGTTTTG 

CAGT ATCTT ACT CAAG CACAAAAAGG CAG CACT ACTTCTTG ACCAC CCCAACGGGG ACGGTCTG AATG CCATTCATCT AG CCATGA 
1 5 TGAGCAATAGCCTGCCATGTTTGCTG CTG CTGX3TGGCCGCTGGGGCTGACGTCAATGCTOIGGAG CAG AAGTCCGGGC G CAC AGCA 

CTGCACCTGG CTGTGG AG CACGACAAC ATCT CATTGG CAGGCTG CCTGCTC CTGGAGGGTG ATG C CCATG TGG ACAGT ACT ACCT A 
CGATGGAACCACACCCCTGCAT AT AG CAG CTGGGAGAGGGTC CACCAGG CTGGC AG CT CTTCTCAAAG CAG CAGG AG CAG ATCCCC 
TGGTGGAGAACTTTGAGCCTCTCTATGACCTGGATGACT 

CCTCTAGATATGGCCACCAGCTGGCAGGTATTTGACATATTAAATGGGAAACCATATGAGCC^ 
2 0 AGCACAAGGAGACATGAAACAGCTGGCTGAAGATGTGAAGCTGCAGCT 
GGGCTACTCTGGrcCAGAAATTAGGTCTGGGGATACTTAATA^ 
AACTATGAGGTCTCTGGGGGTACAGTCAGAGAGCTGGTGGAGGCCCTGAG^ 
GGCAGCCTCCAGCCCAGTGAAGACCACCTCTCAGGCCCACTCGCTGC 

TC CGAGACAGTGACAGTGTCTGCGACACGGGCX5TGG AGACATCCTTCCGCAAACTCAG CTTTACCGAGTCTCTGACCAGTGGTGCC 
2 5 TCACTGCTAACT CTCAACAAAATG C CCCATG ATT ATGGGC AGG AAGG ACCT CTAG AAGG CAAAATTT AG 






WO 03/008583 



PCT/US01/51291 



Table 33 

MOUSE NOMENCLATURE 
ICSGNM Fyn 
Celera mCG13BO 

HUMAN NOMENCLATURE 
HGNC FYN 
Celera hCG34806 



MOUSE SEQUENCE - GENOMIC 

CCAACACGCAGTCCCAGTGGTTt^GAGACAC^ 

CAGATGGCGXSCTGOCAGATTCTGCTGTGGCTGTTTCCACAACC^ 

CTGTCCCAAGCTTGCTCCTTTCGATGCGCTTTTCATGTT^ 

GTGTGCCCGGGCAAAGCTTCCATGCAAC^CGTGGCATGGAAGAGTGGAGGTC^^ 
15 CAGGAATCAAACTCAGACCIX3TGGGCTTTGCAGTAGGCATCCTTACT 

GAGCTGGATCTTCGTGACTAGTGCTACTGAATTAAAGGAAGAGAAAGAACCCCCTC 

CATCTCTCCAGGAACCCCGMTGCTGACTGGG 

TCCrCCTCCTACCTATAGAAAGGACATTGGCGATC 

GTGACACTGTGACTCCTGOCCCACCGCCAGTCCCGCTACAACTGTrACTAAAACTlGAGGA^ 
2 0 AGCCGCAGATGTGACACTGGCTTTCCTGCTTGTGCTAGTTTGCGACT 

AAACAAGCCTCACTGGATATGGGCAGTGAGAAGTGATGTATCCAGCAC^C 

GGGGCTGCTC CCTCCTGTG CT C CAC CGGG ATTCAGGAAAACTAGCT CAAAAGTCAAAGCT AG CTG GGCAGTGGTGGTGCAAGC CTT 
TAATCCCAACACTTGGGAGGCAGAGGTAGGTGGATTTCT 

GAGAAACCCTGTCTCAAAAAAAAAAAAAAAAAAAAAAAGAAAAAGAAAAAAGAAAAGTCAAAGCTGTGTCAGCT 

2 5 AACAACAAGCATAAGGACAAATCCCCCTGCCTCTACAACCAGCAGCATCCAGA 

GCTGGCCAGTACACAGCTCAGCCCAGTCCAAGCCCTCAGTTCCAG^ 
AGGGTCTCCAAGCATATAGGACTTAGAACTGAAGCATATTCATTGCCCCCACCTTGTCT 

TT ACCTGGGACC CCCTG AG TG AACCACGTGAAGCCGGG ATCG ATG CAAAAAG CAAG AGG AATTT ATTG CTC CAGTGCACTGGGG CC 
ATCCCAGACCCGAAGGAGTGGTGGCTACTCCCAATATCCCAGCGAGTTCTTATAl'GGTTTCCA 

3 0 CTAGGCACAATAAGATTGGTGGAATACTGCACCCTTTAA 

AAGGTGG CAACAGTCAGTACTGTGGAATGTGT CCCCACCGGCAGGTCCG AT CCTGCCATGTGGTCTGAGAAATGGTAATTAGCCTC 
TCCCTTCCGGAGGCrrCAGGTGTTTCATGACCTTCCCAAAGTT^ 

CTTG CTG CACAAC CC CT CAG CACCCAGCAAGG AGT AG AAAGCACG CT CACTG CTCTTGACTGGTT AG CT ATTGTG CCTTCCAG AAG 

3 5 TTGCCTCTTAATGTGCTCCCCTCCCATCTCCAGTTTTATGCTTTATTACTATGTGATT 

GAACTCAAACAAGGGAACTTGAACrrCATCAACATGGTGCATTCTCAC 

TCCCCCT AAAGCGCATCGCAG<KJVTCCATTTTGTAGCTGG AATGGCC CAGTGCTGCCTTG ACACAGCG CACTGCTTC AGAGTACTC 

ACCGTGTCTGTTTTCTCCTCTGCTTACCATTCTGTCGTAGACATACCTTATAAATTCACC^ 

TCTAATTTCCAAAGTGATGAACAGCATTTTATAAAACCTCACITAGCACATACT 

4 0 CTTGTTGACATTACCAGTTTGGTTTACCAGAAGTCCCCACAGTACTGTAGGAGCAAACTC^ 

GGGTAGAGGATTCATCTGTGAATTTAGTTACTAATTAAATCACTTACTAATCTAAGTGTT^ 
TTTCCCCTAAAGCTTCAGTGGTGTtSGGGGGAGAGGAAGGTGCTACCAATAGTGTTCCCT 
TCTATATAAATCCATTTTTTAAAATATCTATCTACAAAATAAACTTGTAC^ 
TATCCAGGATATTCATGATTTCTAAATGGATGAGCACCTTATGATAT^^ 

4 5 T AACTTT AAT ACAATG CT AT ATT ACT ACACTG C AG ATGAAT CTT AAATG CG TTGTG CCCTGATG AAAAAT AGAACGAAAC CAAGGT 

TGTGTATCATGTGATTAAATCCATACCAAATATCTGGATTAGGCAAACC^ 
GATAGGGGTCATTGGCAAGTGGCTGCTTAGAGCACAGGGGTTTCT 
GGCTCCTTTATAAAAACCAACTGTGCCACACTCTTCTGTGGTCTGGAAGAGTTGGGG 
TCGTAOVAACAGGGCTAATACTCTCATTTGC^TCAGCACCTTCTCTCT 

5 0 CAAGAAGCATCTCTCCCTCTCCCTGGAGCCCTTCCATGATTCC 

AGTGGCTGGCTGGCTCACAGCAGTGTGAATGCTCACT 

AGGCGCAGTGAGGGGGGTGGGGTG<K5AAAAAGGAAAAGACTGCAGTTTCTCAGCCT^ 
TGATTCCAGGTGCCGGGAATGCGCCTCTTTTCTGCTGTGC 

GGACAGGATCCATCCTCGTCCCCTCAGAGCCrTTX3CAGTGACCACACACAT<^ATGAT 

5 5 CTTTGATTTGATGCGTAGCAC^TCCAGTCTTCTGGCCACTGCATGTGGTCAGTGT 

CTTCTCACCTGAGAGACTGCAGGGTGTCAGAAGGAGCGTTTGGTAGGGGCTTTCCTTCAC^ 

TGCAATAGGAGAGCTCATTTTTATACCAAGGAAACTTTCCGCTACCGCCT 

GGGAAGACGGGGAAATTGAGGGCTOCTTTAGTTTTGTTGGCTCAAACAGTCATTAAGATGCCAAA 

6 0 TGACTGTGGTTCCAGGACCTCCATAGAACCCACTATTCAGAOT 

ACTGTGCTCTCTGTCCCTGGACGTGGTGTCAAGAGGCGTCCTTCCCTCTGAC^ 
GGAC^ACCAGGGCTCCATG^TACAGAAAGCAGCTTTGTT^ 

CCTCTCGTCCCCACCCCGAGCTGAGCTGTG^ACACATTCCCCTACCTCTCrATAACA 

TGGTGGGGTTCCG1TTT CCAGTGCAGTACAGTGCAGTGCCTGG CATACATGAGTATGCATACATAC ACTTCGGATGTTGGCTATGA 
65 CTGCCAGGATGTGGAACCACCACCTCTACAGTGCCCCATCCCGTAACTAGGCATTCATCTTCT 
TATTAGCTCACC^TTCTGTGGGCCGTAAGCCGGCACGTAGCATGACCGGAATCTCTG 
TGAGCTGTCACTGGGGCCTTGCTAGGGAAGAATOK3TTTCCTCACTCACACA 
GTGGTCTTGGAACTGACATGCCAGTTCCCTGCTCTGCCAGCT 
TAAGAGCTGCAGGAGAGAGGATCCTCACTTTGAATGCTTCTTC^ 

7 0 GGGCTTACTTGATGAGGTCAGCTCACGGGGACACTGTCCTTC^^ 

CTGTTTAATTCACACATCCTGGAGACTGTAC^GGAAGTTGGAAGGTAG 
CCAGGGACAAATGGCTCACTTCTTCCATCCATCnXjGC^ 
ACAGGGTGGACCCAGCACTAAGCCTGTCrcCGGCTCAGC 
TTTGGCTTAGGACCTCCACCAGTGCTTATGCAACCAAGAGGC^ 
7 5 CTCTCTGACTGTACCGAAAAGACAC^TAAAGACCCTTAAGCTG^ 
TGGGCAGGATGGAAAGGTGTTGGTGAATCAGGAAA^^ 

GAGCTGCCT AAATGAGG CTCTAG CAGTTATGTGCGGAAAACTCCTAATCAG CTGCCCTGGCTCCTCAGGGGATTCACTGTCTGCCG 
AGAATCTAGTAAAGTTCCTTAAAGTCCTAGTAAAGTTACTCA^ 

CAGGAGCCTCTCTCCTAAGCTCGCCTCACTCTGACTGTGCTCCGCCTCCTCCTGCCCTCT 
5 AGCGGT CCACTTTTG CCTCTG CCC CTG ACACAGTGTG AGGAG CTG CAGGGG ACCT ACAACTCTG CTCTG AATTCCTCATTGCTT AT 

GTGTCGTTtKSTCCGGCT CCCATCCTTTTCTTACTCTCCA 

TGAGGCAAGTCGTGTCAGAGTTCTGTGGCCAGACCTCACTGTCCTT^ 

TGGGGCATGGGAGGGGGATAGGGATGAAGAGCTAAGTCTAGAGCTGTCACIT 

AGCCTACTCTGCTAAGATGGCGGCTACTCATTGGCTCTTTTCCCrrGTA 
1 0 TCATCCAGGCCTTGTTACCCAGCAGCTGTAGAACTTGATCATGGGCATTC 

CATTGTTTGTGTGCCCAGCCTCAGAAGTTGAGAGTGGGTACCCACCTGGGCCCACCTCT 

CTGAGCCCCCCCCCCATACTGCAGGGCCCCCTCTGTCACAGAAGAACTGTGTGGAGGGTTTC 

AAGCGATCATATCTGCTGTGCTCATGGGAGGAGGGTGTGGCTGTGTTTCAGTGTC 

AATAGATTTGTGGGACrrATGTCCATTTAAGGAAACAGG^ 
1 5 CTGGACTGGTCGCACAAGGTCAGGTTAGTGAAGACGCTCrcCTGCCAGCTCTTCT 

GGTGTCTTACGAGTCAGCACTCCAGGGCCCCAGTGTTGTCAGAGTGG 

ACTAGAATAGTTAGTACACITACTCCATGTTGGTGGA 

TGTTrT rTG TX w riTTGTVfTT G TTTATCGAGACAGGGTTrCTCrGAATAGCCCTGGCT 
TGGCCTTGAACTCAGAAATCCACCTGCATCTGCCTCCCAAGTGCT 
2 0 ATCTTAGTCCAAATGCCCAGAAGTTTGCCTCACAGTGAGGAGCCTGATGACACAGAATG 
TGTCTAGGAGTAGTTAGGCTGCCTGGGCCACATGGGAGGGTGAAGACCACAGATTCT 
GTCTGCATTATTTCTGTTTTGTCTTGCTCCCATTTCTCT 

CCGAGTAGAAGCATATAGTTAACTCAGAACATCAAGGGCATGCACTGCGTTCCCTCTGCAAA 
TTGATCATTTTGGTTTAGTCACTTGATGTCTGTAGCATGCTT 

2 5 GGACCAGCAGCCAGGCCCTGACTTCAAGCGGATTATGCCTTTCATACCT^ 

TTAGGGACTTrAGAATGTTGCCTTTrCTTTC 

TGCATTAAAAAGATGGTTTGCAGTGAAAGTTCTCATTGGTTTTAGGTT^ 
ACACTGGGAATTGCTTGTTG<^GAGACTATTCTGTGC 

GQUICTG AAAACATCTG CAAG CATTGCCTACTATGTCTCCTGGTGGACAGGGGTGCCAGGTGTGAGCGTGACCCTCTAATTCT AGG 

3 0 CTCACATCTTCTGCTTAAACAAATCAATGAAATCCCCAAGA 

GCTC^VAAGCAACGTTGTACC^TGGAAGAGTTAAGACAAAGTCAGCGAGGTACATACCCATCTGA 
TGCCTCAGTAGATTATGCCTTGGCGATAGTTTCCTGTTGGAAACTCATTAAAATCCTTCGCT 
GACACAGCATGCTGTAAATTCATCTTGAATGATTTATGAGGAAATGGGTTCCAGATTTGCT 
TGCCTGACAAACATTCTCATTCCCAAGAACTTGAAGTCAACTCACTAAGCAT^ 
35 TCTAATGTGTCCTGGTAAGACGAGCGCAGTGCTTAAGAACTGCCACCAGCATCTCCTCA 

AGTTACCCTTTATGGTCTTGAATCTCTTCCTGGTCTGTGCCAGCCCTGCCTTCCCCAGAGAGTGT^ 

CGTGGTTCTT ATTTGACGCTTTGGT CCCAAAG CGACTTTTCAGAGTCTTGG CAGGG ACACTTCCTGCAAGTTCTGTCTGCCCACAG 
TGCACTTACAGTGGTAACATTCATTACCTTCCCTGTATGTCTGAGTTCTCACTTCT 
AGGACAGTATGGTGATGCACAGCTTCCAGGTCATCCTAGTCTACATGGACAGCCAGTT^ 
40 GGGGATGGGGTCGAGGGAAGCAGGACATATTGTTGATGAAAGTCACTC 
AAATACAGCACAAAACAGACTTTTTCCTCATCCGGTTGTCACTTCTTC^ 
GAAGAAATGGACCTTTGTCCATGGGGCAGAGGGACAGTAGACAGGCCACCTCTTAATGTG^ 
AGCATAGAAGCCAGTGGGCACTGGGACATGGTGGGTTCCTTGCTTGTCCGGAAGAAAGGGCA^ 
GCAGGAAGGTAAGAGCTGACGCAGGAAGAGTGAAGTAAACAGAGCAGCCCTGAGGAGGAGGCCCCTCAGGATT 

4 5 CTGGATCTTTGGGGATTTACCAGATGAGGAGCCCTGGGAAGGCAGG 

ACCACTGCCTCTGGACCATAGCAACACTCTCTCTAAATGAGCATTTAAGAATGTCAAACTCT 

GCAGCGTCAAGTGCTTGTGCTTCTGGAGGCTGTCATGACAGCCTCrGGTCCTG 

AGTTTTAATGCGGACCATCCTTTTGTGAGTAGTA 

TGTCC AGGT AAC CCAG C AT CCTC CTTT AG AGGGCACT GT ATTGGAAGGTTTTG CTCTAGTGTTG AAAAACT CACTTT ATTGCTCAT A 

5 0 TTAGTCCTTGCGGTGACTAAAGAAAAACCTAAGAAATTGGTGGCCAACTTTGGGCAGTGGT^ 

ATCCTTGCCCCTTGGCCCTGCTTTGATCAGCCTGATGCATCTCCTGCCCGTCTTCCGTGTTAGAGC^ 

CACAGCCTCCTTTCTCTTCCATCAGGAGCTTGGATAATGGGCTGTGTGCAATGT 

GAGGGACGGCAGCCrGAACCAGAGCTCTGGGTACCGCTATGGCACAGACCCCACCCCTCAGCAC 

5 5 ACCCTACGCACG AGAGGAGGGACAGGTAAGCCATTGTCAATCAGGATGGGTGAGGG CACAGCrGTGCTCTTTCACAGTCATTG CTG 

TGTGTGGATGGGGTGGGTGCAAGCATGCCACAGGTGNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN>WN^ 

ACATGCTTCAGTTGACTCATTTTAACAGAGAAGTTTTGGTCAACTTTAGGCCCAAGT^ 
CTCAGGAGCAGGAGACCTCTTGTCACTGCrCTGAGAATTCAGCACCAGTTAG 
60 TGTCAGCCAAGGGCTCCCAGTCCTGCCTACCCTGAATAGGGCCAGCTCTGGGGGTACCCAA^ 
TGACTTACGGGGTAGTTTCTGTCCCTCTTCCCACTCTTCGGCCTGCCrGGCCAGTC 
GTCTGCTTGTCAGTATTCCAGCCTGGACAGCCCTCGAGTTTTCATTCATCTCTATATTCT 
TTTGCnTTGTGTGACTTTTGAGACAGGCTGCCATTTre 

CGTTGGGAGTACAGCCATGAAATATCACACCGGCTGTGTGAGTTTCAAGCTGAGCT 

CT CT AT AACGT ATCTGCCAT ACATCCT CACACACATTCC CT CG TGT ATTTGTAG TCTT CCAAAT AC 

TGTCCCAGAGTGTGAACCATGGTAACCAGATATTTTGCATTATTTGTTCCAAA^ 
GTCTCITGGCTCATCTGGGGGCAGATGCCTCGTGTCTG 
GGCTAGGGCTCTTTCTCTGTGGACAGCTTATGCAGTGGAGTTO 
70 ACTGCACTGTCGATGTCTACCCTGTAGAACAAACCCTATGCTCACCCCTGTGCT 
TTGTATGAGGCCCTAACCAAACCrCCATGGCCTGTACCT^ 
GCCCTCTGTCCITAGAAAATACAGAAGAAGAAGGAAAGCC!AAATGAATACT 

TTGAGATTTTTATGTTTG CTGACTCTTAATT AAGCAAACAGAAGCAT CAACCCTGTTCATTCATTCATGAAGGCrGGATGTGAATC 
CC&^TGAACATGTCTCATTCATTAAGGTCACCAGCIATGGGCT 
75 TATTTGAAGTGTAaVCCGCCAGTACCCTGGGGCTGGAGAGGTGCcerG 
CCCAGGTTCTCTTCCCAGAACCTCCATCAATGGCTAATAA 

CCTACAGGCTCTGGCCCACATATGGTGCACATAGGCAAAAGCACATATGTTTAAAGATATAT^ 

TGGGCGGGOSTTCCATCTTTGGGGAGGGGGAGTGGTTGCATl^TGTGTCCT 

GTCGTGAGCAGTCTTCATTGCTTCCTCCTCC^TCCCCAGGGCATAAAACCCCCCAGTGAT 
5 TTCCACGAGTGAAGATGGCTTGGATCTGCTCTTATAAAAGAAGAGACATCTGTGAATGAAGT^ 

TAGAGGCTCCCACAGTGTGTTCTAAGCCAGACTCCCCTAATCTCAGAGAAACOT 

CGGTCAGGACATGCACCTGAGACAGATGAAAGGCAGGTCCTCCCTGGAAAGGATGA 

GAAGCCAGGTGGTGATAAGCCATCACACTGGGAAATGTGCCGAAATTAAAACAAGAAGC 

ATTAGGTATCACATCCTGTGTAACAACCACAAAATTAACAGCTTAAACAGCACATACAT^ 
1 0 CTGTGGGTCTGAATCTGGAAGGGAGAGCCCGGTTAGAGGCnn'AGGCATGAG 

TCTGTCATCATGCTGGCGTGACACCCTCCTTAAGGTAGTGTGGAAGTAGAGAAGTCACCT 

GACATCTGTCCCCTTTGTACACTGTGTTAAGTAAATCAAGTTGTTATGTTCCTAAGGAAG 

CAGGCAGGAGTCACTGTGCTCTGTCT ACTGCATGGGGCAAGCCCTAG ACTG AGTGG 

TCTGGGATGTGGGACAGTCCATGGGAGTCGGCAGGTGAGAGAACCTAAGATGGACGTGAGGATGAGGGA^ 
1 5 AGTAAATGCAAGGCAGAGGAAGAACCAAGGGGGCAGGAAGTAGCAACAGCAAGACC 

GGTGGGGGCCAAGGGCA.CTGCATCTCCCACCTGGGGTGAGCATTGCA 

TCCATTCACTGCTACCTTGTGCTAGGGATGGAAGAGTGATCCATTTACCCAGACCTTAGCTCAACCT^ 
ATAAAGTAGGCCAGTACTTGCCACAGATTGATGTCAGTCAGAGCCTTGTCT 
GCTCTAGGGTTTTGTCATAAAGCCTCCTCTATACAGACCACATTATAAAGCCAGCCATGTG^ 
2 0 CTCAATATGTTTATTAATGCTAAGTGTAGATTTTTAAGGATGAGACTTTA^ 
AATAGTCTCCGCTCTTTTTTTTGGCAGGAGTGACACTC 

TTTCAGAAAGGAGAAAAATTTCAAATATTGAACAGCTCGTAAGTTTGGGGTGG 
CTCACAGCAAATGCATGTGCTATCTGAAAGTAAGGAAAGAATGCCTTCTGCACTCACT 

TGAGG ACAAAGTG CAAT ACT ACATGGT ACT AAGTG CTAAAG AGGGCACCAGAGTC CTTTT AG AGC CT ACTTCTTAAGTG TG CACGC 

2 5 GAGAGAGCAAGCTTCTGTGTGTGTGTGTGTGTGTGTTGCAAGGTCAG 

GCGTCTCTCCTTGATCCTCGAGCACCTAGGTTTCAGCTC^ 
CCCCACACCCCCCTCGGCCAGCACTGCAGTTACAGGCTTGTGAGAGACACX3CT 
ACCACTGAGCATCTCTCTAGCIATCrcrcrAACCCCAGGGTCCGTGCGTTTGC^ 
CCCCGGGAAAAGCCGCATTCTC^ATGACTGGCGCACGGCTTGGGTGCCAGCCACGTGCT 
30 TCAGGTGTGGGCATTGTCAGAGGCCTCGCCCITCAGGCCAGC!AACCAGATCCC 
GTTTCCTGAGGGAAAATATGGGAGTGTCrGTGCTCITAAGAAGAAAGCAGAG^ 
ACAACTGTC^CCAGGGAGCAGGCAAACTTAGGGTAAAGGTTAACACC 

AGTTGTG CCTCTGCTTTCTGTCAGTGCC^yGAGC^TCACCTCCCTCAGCATG C ATAG ATG AGAG AACGGAAGG AAAATCCACTCCCA 
AGGAGATGCTACTGTCGTGCCTCAGCACCTCAAGTGTGTTGATG^ 

3 5 GC ACCTGGGGTGAGGTGGGGTGGGCGCTG CTCTTGCTAAGAAGTCTG ATTGCG ATTTCTCTGG CreAATTTCAAGGCAGTTGAATG 

GCTTGAAGCTTGGCCCITCTGCCCCCCCCCCAAAAAAAAA 

GAGGGTGTGGCIAATGGAGAACCCCCTGTAACCTCAAGAGTGTCTCCATGGG AGGCCTG CCCTTCT 
GCACTGCACGGAGAACATTTTCAAC^GCAAGACCCCATGC 

GCATAAAGCAGGAGGCTTAAAATTAGGAGCAGTTGTCAACACCAGAAGAGAGGTGGCT 

4 0 GCATGGCAAAGGTGTGTATAACCftCGCCTGGTGCTGGTT 

AAAArACAAAAAGTGATACATTGATGTTACATGTATTrTACCACAAATTTCAAGGACA^ 

ATGTCCGTGCAG CCAGAATTACCTCCCCCTCCACCCCCAGAGAGCTCCTGCAGG CIACAGGGGTTTTTAAAATAGTAAATGG CTGCC 
ATTT ATTGAAAGTGCCGTATG CACTG ACCATGT CATTGCTGT CGCTG ATCCTTC1ACAACAGCTGTCTGAAGTATGTGCCGTGACAT 
CTGAGAGCACGGGTTAGGGAGCTGCCCAGTGCACCCTCCTGGCAGGTGGGAAAGC^ 
45 GTGTGGTGTCAGGTGTGACCTGGGAAAAGCCCAGATGCTGCCATCAAAGGCAGCTGGAGTGTCTTA^ 
GAGGTGGCTCCAGCGTTGGAGAGAGGGGCGTTTGCCTTTGATCT 
GAGCGCTGCTTGTCAGAGTCTATTGACTGGGTITTGTGTAAAGCT 
AAGTTTGAAACTCTGTAGGCCCTAGGAAGTAAGAACTAGAGGGGTAGAAAAAG^ 
CGACATAGGCCTGTTTTGAGGTTGCAGCCATGGCCCAGTTAGTCACTGGCAAAGTCAAGTCCCT 

5 0 GAGTCTACACAGTCAACATGGAGGAGTCTACACAGTCAATAAAGCGGATT^ 

GTGCATTTTATTTGACTGC^ATAGCrrTTAACGGTTGCAGAGTTCT 
AGGGACITTTCCTTTAAGGATAAAATTCTTTGTCCrrACATGGTATTTAAGTGCC 
TTTGTACAGCAAAACGATATCCTGGAGTCAAGGGTAATAACAGGGAAGGGAAT^ 
CCTCCCTCCCTCCCTCCACCAC<^CC^CCCTGACAGGAGAGGAAT^ 

5 5 TCCTTCTGACTACCCCTCCCATCAGGACGTGCTCAGAAGGCTCCT 

GGGAGTTCTCAGTAGTAAAAGGCTTGCGTTCACAGCACTCTAAGT^ 
GTGATTTACCCACCTTGAAACAGCATGTCCTTAAAATGCAGTGGGT^ 
TGAAAGCTAGTTTCTGTrCATAATCAGGAAAAAAAAAAATTGGTTTGA 
GTGTTCAAAACAGGGTTTCTCTGTCTAGTCTTGGCTGACCTGGAACTCACT 

6 0 CCTGTCTCTGCCTCCCCAGTGCriTGAGATTAAAGGCATGTGCCACCACCACCCAGCTC 

CATGTGATAAAAGTAGTTTCAAGAGTCTTATCTTTGAGTAAGAAGTAC^GCTGTGGAATGCAGGAAGAATC 
GCCAGACAGCTCAAGGCTGGCTGCTCTTGGCAA 

AAGCCGCAGCCCTGTGAGGGACTTTGAGGATTAAAGACGGGATGGAAGAACAGAGTCTGTTACAAAAACT 

CAG TG TCGGATT AAATCACGGGTTTGGGT CAG ACAAT AAG ACAGT ATCTTACCAGGTACG AG TG AATT AAAAAG CTG AACTGT AAG 

6 5 AACAGTTTAAAGAACTGCAGCTTTATAGGCTGGAGAGACAGCTCAGCA^ 

AGGCCTGGGAATCTAGTCCAGACCCCAGCAGTCAGTCCTAGGAACCTTGTCTAGTCCGC^ 
ACTTGTAACCCATTCCTGGGAAAGCAGACAGACAGGAGGAGTCACT 

TG AG AGTG AG AGTC CCTTT CT C AAAAACAAAG AAGAG AAT AACTG AGAAAG ATG C C CATG T AGTC CTCTGG CCTCTGT AG ACATGC 
AAACACACTGGCAAGAATACCTGAACAGCATGCATCACAC^ 

7 0 ACACACACACACAGGCACAGATTlTCAAATGCrnSATTAACTACAGTAGCT 

CTCAACCGTTAAAGGCTAGGCTCACAACCAAAAATATAAGTGTAAGG 
AGGACTCTGGAGGTGAATGGGTGCTGCCAGTGAATTAAATAGTTTTTGC^ 
AGTCTAGGACCACTGAGAATAGAGTGTGAGAAAAATGCGACAAGATGTGACATTTGC^ 
GTGTCTVCTCTTGTGAGGGGGAAAAATGGCCCAGAAACXSTTAAGTCCAGAGTGGTGCTATCT 
75 AATGTGTG T aAACTT ATTT AG ATGGCT ACAC C^JL* CVlGACAG CTG AT & CT AACTT AAATTGG AAATTG CCTGATT AGTGAGTCA 
ATTCATGATTACTCAGTCACCTGAGT AAGACTGGG CTTGTCCCCGATTGCAGCAAGATCC CCCAACATCCTGACATCACCTGAAAT 
TGGTGCTCGCTCTTTTTC^CTTTGATTTAACTAGAAGTAACTTGA 
CTCTTCATTCTTCTTCACTGTCTGGAGCGGTTCCCAGGCTGGGAGCAGTC 
TATGTTTTTCAATGATGAAATGTCTCCATCTAGTGGAATTTCAGAGC 
5 CTT AAACACATGGTGGTCAGT ATAACTT CCAAT ATG CATTT AGATG CTAGACTT ACAACT ACAT AT ATTTCTGTGGGG CTAGAAAT 

CGCGGCCACAAAGCATCTACGTCATGAGATGTACAAGGCAGAGGCACACATGCACA 
CACTTGACAAAGGACATGGCTCTCCTTGGGGGAGTTTTGTCT 

GACACAATGTGATTATAATTAAACAGTTTTCTGGGTAAGAGAACTACAGTATCACTCT 

AGGCTGGCTTIX5AACTCATTGAGATTCACCTGGCTCTATTTCCAAG 

AATAGGTTTTATAATCCCTCAAAATTAGAGAGATAGCTCAGGAGTTAAGAGTGT 

CATCTGTAGCTCCAGCAGGATAGAGGTCTCTGGTATCCAAGGGCACCTCCACTCCTCTC 

GCCTACACATAATTTTTTAATATAATTTTTTAAAACTTATATTAGCA 
1 5 TTATTTATCTCATAGCCACACCCACTTGGATGAGTTCCCATTTTAin'C'lT 

CTGGTCCTTGGTACCCCTGCTCAGAAGTTTTTTCTGTAGGGAACCTG 

CTACCTAATGTTGACCTGGGGAAGGTCTTCCCJ^GCCCAAAATCAGTACGTTTTCTTCCCT 

AAGACAATTTTAAAAAAAAGAAAAAGAAAAGAAAGGCCTGAGGTITCTC 

GTGATGTGTCTGTAATGGTCAGAGGCTGAGGCGCCTAATATGGTGACCGGCAGGCCTGTTT 
2 0 TTTCATTTGGTGCAAGTCACAGCTAAACTCCACGCACATGATCATT^ 

AACCTTCATGAATCCAAGCGTATGGATAATAGTTCTATATATAGTGACTGT^ 

CTCTGAGAATTTTGCTGCAGTGGCTCAGAGATCTCTTCTGCCCTCCAGCTAC 

GGGGTGGGGGGAGCTATGAATGTTTAATAGAGTAACATCCTGTTGAGAGTGTCTGA^ 

ACCTGCAGAAGGTCATGTCTTCTTCCTACTTCGTCAAGAGCAAGCTTGATAGGTCCACTA^ 

2 5 TCCTACTGCTTCTCCAAAGTAAACTGAGGCTGGTGTGTGTACTCCACACACACACACACACA 

TCAAGATTGCCTCCTACCCAGAAAGATGCTCCTCCCCGAGCCCTAAGTGGGAAAGT^ 

TGAACT CCTCAACCT C CT ACTGGG ATTTCCAC CCTT AGCTT ATGTGGGG CCGGGT ATTGAAC CT AG AG CCTCG TGCACGT AAGAAA 

AGCACTCTGACTTCAGCGCGTGGTGCTTTGGATTTCTTAGGGCCC 

GGCATAGGGTCGGCCTGCAGAATTCCCAGCTGGATGGTGCGTTCAATTCATGTCCCTTC^ 

3 0 GGGAAGCCCGCTCCTTGACAACCGGGGAAACTGGTTACATTCC 

TATGTTGTTGTTGTTGTTGTTGTCTGTCTAAATATCTG CTTTT AATTGTCACAT AAAATG CCAGTGACACAGCACAGG AGCGAG AA 
GACCTTT CCCAGTGGTT AAGGGTTGAGAGGCTACAGTC CTGCTGTGCTGTCG AGTCTGCG C AGT ACACTGTTATTCACTTCGTGGT 
TTCCHTTGAGCCCGTCCGTTTTCTGATTGAGAAGCTCT^ 

CCAATGTGGGGAAACCACAGCTGCAGTGTGTGTTACTGAGAGCCCACCGTGCGCCTGCAGTCTGCTCCCT 
35 GATAATTATGAGAGATTAAGCCCACACACACACACAGGAGTCCTCTGCTTGATAATCAGAAATACTGA 
GCTTTTTGTTCCCCAGGCAGAAGAGCAACTTGGAAACTACTGCAGTTGTGGGCTrC 

GTTGGTGTTGTGGCTGGTCGACCTTGACCAGCAGTTATTTGGCCCCACCCTCCCACTGGAGGGAGGCAGTTTTGG 

AGTCTGGGGCTCTCAAAGGATGGCTGACTGGTCATTGTTCATTCTCTTTCCAGAGAAACCTGTA 

GGAGGTTGATGTCCCAAGCTOVGGAACATGGATGGCCTCTACCCCACCTACAGGGACAGACCTCACCAGAAAGAGAA 

4 0 TGTCGGTTG CTCACGGCCTGGAGCCTGCTGTGGATTTGCGGAGGCACAGCACAG AGG AGT AG ACCT^ CCT AATCCTCA 

TGCGTCCATCATTCTTCTTTAAGTCTTTTGTACTCCT 

ACCCCCTTCGTTACTAGTGTGTTTCATTATCAGAAAATAGGAAATATGTGTATTTGTATTGAATTT^ 

TCTCCTGTGTCTTTTCGGAATTTAAAACTTAACTACAAAGTAACATTCCrrrTG 

TAAGTCATATTTATAATGATAAGGTTTTTTTTTAATATTTTGGGCTTTTT 

4 5 TTTTTACACGTGCAATTAATCGGTATTTTGCACAAGCAGCGTGT 

GTTTT ACACAG CATTTT AATTGAG TT ACTG AT AGAAACCAAAAAAAG ACATTT CTCACAT AG CTGGTGTCGTATC CTTT ACAG CAG 

CTATTGCTACGCTGGGATTGTGGAGAGTTTGGGTACTTCTCAAGTCACTGAAGGTGACGCTCCTT 

AGAAGGATCTGATGGTGAACTrCCTAGCTACATTCAGGCATGGTG 

AATACTGTGATGGTTTGGTCCTGAAAGCTGCTCCTACTCACAGATTCTGGTGGAATGTTCCAGAACT 

5 0 TACTTATCCTCTCATATACAATCCAGTGCTGAAATGTGAACGGATTGCCGGGCGTTGGTTAGGACTCAGCAGCT 

CAAAAGACGCATTCCGGTCCTGAGGAGGCCTGTGAGGCTTCCTGCCACGGCACAAGTGTGCTGTC^ 

TAAAATGTCTGCACCACCGACTCATTTTTATGGACATGCCCATTAACCG 

TGCAGCTGTGTCAGAGAAGCGCAACTGCAGGGCACAGGCCAGGTACTTGGTAAACGCTCCT 

ATCAAACCAGG CTTCCAAGAACAGCAGCCATCTCAGATCAGGGGTAGAG CTGAGTTCCCGTCTTCAG AATTCCTGT AGGTAGAGAC 

ATACTAGAGCTGAGTCTTGGTATTTCCCAGGAGTTGTAATTATGCTCAGAAAT 
CCTGCAGTTGGCATTTCAAACAAGGTGGTGGGTTTCTCTCTGTGCGTGCAG^ 

GGGAAGGAACACAATCCATGATTCAAGATrrrr'l 1' TT'l 1' I 1 AAGAATGGAAGAGATGCAGAGTGGATAG AATCAGGAAAGATTGAA 
AAAGGACTGAATGAAGCTAGAAAAAACTAGTTTGGGGCAGTGGAAGCAACATTAGAGG 

6 0 TACATTGCTCAGTCACAAGTGTATTTATTTTCTTGCCTGTGTC 

TGAGAGACAGCTCCTGTCCTTTGGAAACGCAAGAGGTACCTTTCTTATCCGCGAG 
GGCAAAACTGATGTGCTGTTCAGATCTGAAGGTCTTTGGGATCGGAAGGCGTGGC 
CCTCCTGCTGCTCTCCAGCTCCCCTGAAGACGCTGGAACTGCATTT 

6 5 TTTTTGATTTGGGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTTCAAGACAGTC 
AGAACTCACACTGTAGACCAGGCTGGCCTTGAACTCAGAGATCCACCT 
GCCCTACCACCACCCAGTTTAAATTTGTTTTTCTTTCCTTCCACTCTGTTTTGTTAGT 
AATAACTCACTTCAGCAGGCAGAGGGGCAGAGAACC^TGCTGCCACAC^ 

TTTGGAGCTTCCTGCTAGGGGATAGACTTCCATTrCITTTGTTCCCACCCTCTGTTCCTCCCGCA 

70 CTCTGTCCCCACCCCTGCCTCTCCACCAACCACCTCTC^ 

CCTCACACACACACACAC^CACACCCAC^GCCCCACCACACACACACACACACATATGCCATCCCTGTCT 

CCCTGCCTCTCAGCTAGGCCACAGGAAATGAAAGCAGGAGGCAAGCTCACCTGGA^ 

GTCTGTGCAATATTAGACAACACAAGAAACTGAAAGGAAACm 

GT ATTGCT AACAT CT CTG AACACTGGCAGGATAGCAAAAGTTGT CTTTGTCCTGTT ACAGAGTGG AG AT CTG TCT CTTTCGT 
75 ATGTCTGTGTGTGTGTGTGTGTGTGlVltiC^ 
TGCAGAGCAGACTGGGGCTCCGATGATTGGTGTGGCTAATGCTTGCACGTTTACCAGGTGCCTACTC^ 

GATGAT ATG AAAGGGGACCACGT CAAACATT AT AAAATC CG CAAGCTTGACAATGG TGGAT ACT AT ATCACAACG CGGG C C CAG TT 

TGAAACACTTCAGCAACTGGTACAGCATTACTCAGGTAACCCCAG^ 

AGGCTGTGAGGGAGGAGGAGGAGATACGGGACAGAGTCACTAGGTAGAGCTC 
5 AGATGGAGTCCGGGGTTCTGTTGCTCTTCTGCCC 

GTAG TTT AAATCTTG AAAGCTGT ATTTGT AT ATT AATTTT CAT AAT AAAAATCCTTG CACG TTCTTT AC CTTTG CCTTCATTG CCA 

TGCAGAACTGTGAGAGGAAAGGAGGAAAGGAGCTGTCCCTAGGCAGGCTCAGA 

GCAGCAGAGGGAAGCAC7VTTCTTGCACTCTGTAGAAGAAAATCT 

CCCCCCCCCAGCTTGCCTGCTGAGGCCACAGCrCC^ 
1 0 TTTTTGTTTTTTCAATTTTTAAAGACTGGGTCT 

CTGCTGAGACTGAAGGTGTGCGCCACCATACCCGGCTACTGTGATC 

TCCTGTTCTTCTGCTGTTCTTGGCCATTCAGTTAT^ 

CAGTGTTTAGTXSTGTGCCGCTTGCCCTCCTTGGTTA^ 

CCAAGGTAAGGTCTC^CTTGTAGCCCTAGGGAGCCCAGTATCCACrrTG 
1 5 TAGTGTCAGCATCCCAAATGCTGGGATTACGGACGGAGCCACCACACCCITTT 

CCACACCAAAGAGACTTGTAGCCTTTTAAAAATAACAATA 

ATTAACAGTGGCTGAGCCAGTGATTTGGXSCATTAGCfrCTCTCACT 

AAACCTCCTTGCACTTC^GTGATATACACAAAACGCCT 

GCAAACATCCGCAGGGTCCCCAGG^TTTGGATCATTTGAAAATA^ 
2 0 GA.GGGAGCCX3ATGTGAAGCACCCTAGACGAGTTTTAATCTCAGG 

TTTCCTCTTTCCTCTTCTCTCTTGCACATAGAATGCCT^ 

AGATTTAAAAAAAAAAAAAAAGGAATCAAAAGTAAGTTAAGAATTCCCA^ 

C^CGTTTAGTCAGTCTCCCCA.TACCTTTGCTAGATTCTCAAAGGAA 

2 5 CCAG T AACCTGAATGGAAGCT ACTGTTT C C CTGCTTTTCATGTC1ACTGAGTTT CTGTGAATGTTGTT AATTT CCATT ACTTCTTT A 

AAT CCAGACTCCT AAATT ATG CAT AG AAAG CTGTTTAAAAGG TCGTGTTTTTGTTTTTG TTTGCrriT AG TGGAAAATTG CAAAAA 
AGCAATGTGAGCACACAGGAGCAGTAGAGTCAGAAGGACTGCAGAGAA 
CGGCTAGCTCTAACAGACCAGAGGTCTGTGGGAGGCTCCTATGCC^ 
CAGTCCCTAGATACCGGAAGGGACAGGGACAGTTTTTATTrTTATATGTA 

3 0 CAGTGAAT ACGAGTTCATT AG ACTAAAGT ACACAGAAGCCAAAG CAGG CCGTGCCTGTGAG CAGCGAGCATGCCCG ATGGTACAGC 

GGTGTCCATATCTGACACATCCTCATCTGCCTCCAGCAAATGTTTC CGAATCTCCTGTGGCCTGTGCGTGTTGTC 
GTTACATATTCTGCTCTTCTCTCCCTCACCGCTrGCCCTGCTCCTCCAGAGA 
TGTCACAAAGGGATGCCAAGGCTTACCGATCrGTCrGTC/AAACCAAAGATGTCT 
CAAGAGACTGC^AAATGGGCAGTTTGGGGAAGTATGGATGG^TA 

3 5 AAATCCCAAACATCAAGATGGAAGGTGGAAATAAAGATATTTTATCATCAGGACAA^ 

TGGAAAAATTTGGAATCTCTGAAAGAGAGCTAGAAACAGGCAAGAACT 
CTCAATGTTGCCGTCAGGGAAGTCACCTTCCA.TCITTCCTAGTGC1ATGCATGAT^ 
TCCCTTTCCTGTAGAGAAAGCTCATGGTTTGTGTTTTA^ 
CTAAAGATGCTTGGGAAGTTGCACCTGACTCGTTGTTTCTGGAGAAGAAGCT 

4 0 AAGGCAGAGGGCACATTCTGTTTTGGCTGCATCTCAAAGCCAATGAAAA 

GACAAAGACCCTTAACCTTTGTGACTAGATGGGCTCAAGCC^CCCAGGT^ 
CCTATTTCATGTGGCTCATTAAACACCTGCAGGACAGGAACCCAGTGATGGTGACGCm 
ATGCCATAGAGTAAGGTCCTGGGCTTTCTTCACIACCAAGCCT 
GTTTAACTCCTCTGGCTCCAGTCCTCATCTGTAAACTTGGAATAGTACCTGCTTC 

4 5 GTT AAAATCT C CTGT AG AT CTTG G ATC CATG CACACAT AT CTT CATTG TG TGCACATTCTTCAGT AACGG CAACTGCACTGGGCC C 

TTG AG ATGG CTTTCAT AGCCCT CTGGG TCAG TTG CAACTTT ACTTAC^ C CTCC C C AAAATG TTTG 

TCCATAAAATGATAACAAAGTGTATTGGGGTCTCCCTTCCTTAGAGTCTACAAGAATACAACA 
AATCTTAAGCCATTACATAAGCACCAGGGTGTTTCCCACAGGACATCAGAGCTGG 
AGGAGTGGAGATC^CACTATGCTTAAAGTTACAAAATGTTATCAGTCAGATCCCGTAGACC^ 

5 0 TACACCAAAAGGCAGACACTCTCTGAGATAGAAGAACTCTTTACACCTGCCA 

TGCATGTTGTAATTTTATGTTTTAACAGAATAATAAATGCTTCAGATATAGACCCCTTAGAGTG^ 
GTAAATCTTAGTGTGAACCTGGATTCTCTTTCAAATTGTAAGCAATAGAATCT 
ATCCTTCAAAGCCATTGGTTAGAAATAGAACAAGCCAGAATTCTGTCT 
GTTTTACTGATGAACTGACAGTGGCCATGAGCCAGGTCTGGCCTTT 

5 5 ATGTACGTACTTTTCTTTCCTGGCTGCTCTTTGATTTCCCAGCCCACCT 

GCTGGAGAAATTGACCTTACCTGGGAAAGTATGGGCAAGGAGATGGTGTGCTCCTGCCCTGTGC^ 
TTCTGAAGGACTCAGTCAAGTGTCTCATCCAGTAGACAGTTCAAAGCCT 

CTCTAAG ATGGAATTTAATCIT AATTTCTCrCAGGCTTTTCCTTT A^ CTGTTGGATACTCAGGCCAATTCTAAAT C 

CTAGACTGTCATGACTTAAATAAGAGCCC^GAGCTTTCTC^ 
6 0 TATCAAATGGGAAAAGTTAGTGAATTATTAAACAGCTGTGAGTAAACTTGCTTTTCCTTA 
TCTTTGCTCGTAAGAACTACTTTAAGGTTTCAA 

TCAAAAGAGTACCTTTGGATTCACTTACTGTTGTGGGGTATCTGTAGGTACCT^ 

TTAAGCCAGGCACCATGTCTCCGGAGTCCTTCCTGGAGGAGG CG CAGATCATGAAGAAGCTGAAGCATGACAAGCTGGTGCAGCTC 
TACGCGGTCGTGTCTG AGGAGCCCATTTACATCGTCACGGAGTACATGAG CAAAGGTCAG CCGCACCCCTCC CCCTCCCCTCCTCT 

6 5 CNNNNNNNNNNNNNNNNNNNNTCCCTCCT 

CCACTCTGCTCACTCAGGATAACACTTAGAAAAGGGGAACATGGA^ 

GTTCCACATCAGGAACGTTCCTCTGCACAGTTAAAACATCCATCTTTCTAT 

TGCCATTTTGAATGGGTGGCTTCTGTTCC^jTTTCCCTTTTTAGCGCTAAGT 

7 0 CTCGGCGCATGCATGTGTTGACGCACACGTGAGCK 

TCATTTTGTTTTGG AAAAATAAATGAG CCAAACAAG AAACAG CCTTAAAAGGCTGTGTGCATTGT ACATCAGAAGACACCAGGGT A 
ATTTTTCTTCTCT^TGTACTTT CrT TlAAAATATAATTGGAA rr iTTTC 

TGT AAATAACTTTG TG ATT AAAGCATT AAAAAAAAAAAAAAAAAAAAAAA C CCCTG CAAATG ATTAT 

CTCCGCCTCGGTGTCAGATCaSTGCCTCCrCACTCAGCCTCACACGAGTGG 
7 5 CTGG CTGGGATCTAATTAGGAAGCT OCT GCCCCACCCATATTCCTTCO ATG CTT AAAATCTG AAA AT A AGTCAG CAG TTCTGAAAG 
TCCCTTGTACCTGAGGCACCTAGAGAGCTTCTAAAAATACGGGCATCTGTGTCTGACTCT 

CTGAGGACACGCCCTTCCAGCAAATACTGTCACTACTGTTGCT^ 

ATCAGTCGCCTGTACGTCGTCGGTTACTGGGACATGAAGGAtGGTTTGGTTCCTCCTGG 

CTT CTGGTTTT ACT AACTCCTTT CTCACTG CTT ATTTTCAGAAGCT ACAACT CTTCTT AAAACAT AAACTTGATTTCTTGG TTT 
5 TTGTTCAAAGCTTTCTTTTTCCATTGATGG 

TTGAG AAT CTTTG C CT CTCTGAGT AAACT AAAATTGTTTTGTTTCATTCAT A 
. GAAGAGCTCTGAAGTTCCCAAACCTTGTGGACATGGCGGCACAGGT^ 

ATGAATTATATCCACAGAGATCTTGCGATCAGCAAACATTC^ 
1 0 TCGGTTGATTGAAGACAATGAATACACAGCAAGACAAGGTC^^ 

CCTTCTCCTTACCTTCATTTCCTCCCTCITC 

ATCTAGTTGGCGTGTGCAGTGCTGGTCTAGTAAGTGGGCGTCTC 

GGTGTTCCACTTTTGGAGAAAGAAGGCITTGAGAAGCCAAGTTATAGCT 

ACACTCCAAGTCAGACAAGTTCCTTTACAAAGGAACAG<3CGTG^ 
1 5 GCTCAGAATG CAACAGG AGTAGGTTIGCGXTCAGAGCAAAGACACCTCCTGGTAGAGAGCCTG CACATGGAGGCG ATTCCTGATCTCA 

GTCAGCAGCTGCCTGCCAGGCCCAGCCJU3GACCAGCTGGGACCAG 

GGTCTGGAGGGCCTGGAGGACTGC^GTGCTGGAGAAGCCGTCTCTTCTCCTC 

AGTTTTATGTAAACCACGTACGAGACCAGAAGCC^ 

ATTCCCAAGGTGTTTTCTGCCTTCCTCGCTGGCTGCTTG 
2 0 CCTCTGTGTGGGTTTTAAAGCCGGAACTCAGGCAGGGAGTAGGTTTCCTTGGTC 

CTGAGCAGTAGTGGGGATGGATTACAGGATGGTGCATTCTGCAGTTT^ 

C CTT CCG CCTTT CACTTG CCTCAGAATGG AAGTG CAGTGAATGATGTGAGAAAATG TTAAACTGGGG C C CTGC CTAGGT ACTGACA 

CTATCCAAGTGCTCGGTTCCATGTCGGACAAACCGCCCAGCTATTTGAATTCCCCACrCCC^ 

CCTTISTGGGATCCTGGGGGGTACGGGGGGGGGGGGTATGCAC^CAGAC^ 

2 5 CTCTAAGCAGAGAGGAATACMACGTGACTTGGTGACAGGGCA^ 

AGGGAGAGCCTGTGAGGGTTGATAACCAGCATGTTCCATTTAC^AAATGGCT^ 

GAAAGTCTCCGTGGCAGAGGGAGCAGTGCAAGACTGTTTACTTTTAACTAGACTTTGAACTCr 

TCACCCATCTGTGAATGAGAGGTCTCAGGC^ 

TGCATCATTCATAAGGGTGCCTAAGACTGTGGCCTCAGGCGGCTGGCA 

3 0 CCCTCTCAGCCTC^CCATGCMCCTTCCTTCCCACTTC^ 

C^CAGTCTGAGGCCCTTTCCAAGGACAAGCCTCAGTCCTT^ 
TATTATTTCTTCTTTCTTATCTTTGTCAACTGTCTTCAGATGC^ 
TAGAGTATGACAGGAGAGACGGGCAGTCTGTGCTGTCCTGAAGCTGAGAAC^ 
GCAGGAAATAGACCCATAGTTAGCATATTTGTTCATACAAATCACAAAGCT 

3 5 CCCACCTGACCAGGTGTGCTCTTGAGCTGTGCCCACTGTCTCCCCCCA^ 

AATTAGTCCTAAGTCCCACTTCCCCTTTATTTT7VT 
GAGAGACrrTTCCTTTTGCAGCCTTGGCTAGCCCAGC 

AGATGGAAAGGTCCGGCATGTGTTTGCCAATGTATTCTGCJVGATGGCATCTG 
AGCTCCTCTAGACAGCTGGTCCATGTGCCTAATAATATTTGCAJVGTCGGGTTTTATGTC^ 

4 0 TTGATTATGGGAGTGCAGGCAGTGTTCTGAAACATK3TGCCAAATGTAGCT 

ACTTGTGAGTTTCCCATCTCAAGGTCCTCTCAAATGTCTGTCACCA 
AGGACACTGCAGTATGGAGCTCTGTTACAGGTCTTCCATGTGTCTTTGTCTCCTAC^ 
CTTGTGAGGAGAAGCCCAGTGTCAAAGAAGAATTCAAGACCAGGCCTAGTGCCTAT 
GAAGCAACTAGAATCATTTGGGGTTCCTTGTAATGGCCTGGACAC^ 

4 5 ACTAAATTCTGTCCATCTGGAGTCGCTTAAATAATCCATAGCAAGTGT^ 

TGTTAAGGAGATACCCAGGCCAAAGCAGAGGCCCTGTGTGGTCTCAGCTCCTCTGGGGACTGAAGAGGGAAGATCT 
GGAGTTCGAGACTGGCATGCACAAAACTGTAAGATTCCACGTCGAAGGAAGAAAGGAGGGAG 

GGCACATAAGG ACAGTTGG CTAGCATGAG AAAGAACACIAAGAACATTAAACTCATGCAGC AAAGCCO^ CAAT ACCCACTGACTTG 
GG CCCTGGG CAAGCTGGGTG ATTC GT C CATGGG C AG ATTACTT ATTTTT AACTGCAAATG CAT CTTC C CTGTCTG TG CGTGTTTGC 

5 0 AGCTACTGGAAAGCACTTGTGTGTCTAACCTAGCCCCAGCGAGTAGGAACTGGACAGTGTCGCTTGCT iTiTACTA 

TACAGAGGACCAGGCTGCTAGAGAATAAACAAATGGGTAGGTTG CAGTTGTACTGGAAAGAG C CAATT ACCATTCAGGGTTAAAAA 
AGAAGCTAAAGGCCCTTTCC^GGAGGCAGGTCTAGGCTATGGTTAGAGTGGC^CT 
CATCTTAACATCTCAGCGAGAGAGAATCAGTATGAGTTGGGATTTGTTCTATTTC 
GAAGCTGCTGCAGCATCXSCAGCTGATCCACTGCCACAGCAGGGTCCTCTGACCCTCATCCTT 

5 5 CTAGAACTCTACTGACAGCCACCTCTCTACCCCCAGTCCCACGATCCTTCCAGAAGTTACTGGAGCC 

TC^AGGTGAGGCTGTGCTCTTGAGCCACACCCTCTCCTCAGCATCAGTTC 
TTCTCATGGAAAATGAAAATGGGTGGGAGAACTCTCCACAGACTTGGTC^ 
TGCTGATTTGAACCTGGCACACTCAGCTC^GCTAGACCTC 
CTCTTCTCTTCTCTTTCTCTTATCTTCTCITCTCTTCT 

6 0 GTAAAGTAGACAAATCTGGTATGCAACTC^TATGTCTTTACATAGTTTTG 

ATGAGTGTGACTTCACAGATCCCTTCCTTAGTAAGCCACCACTCTCAGG^ 

CTTCTTGCTTCCTGT AACAGG CCTTGGCTTCTC ACAG CAG ACTC ATATTTCACTC AGCAGAGGT ACCATGTGCCCCATCACTC 
GGAGGAGCAGTAGGGCAAACTCTACTTC^Cn'GGTACAGATGAAGCT 

6 5 CCTGTCAGAAGGGCTGGTACTGGCITGTCAGCAGCAGGTGCT 

TGCCTGGTTTGGAAATCAGAACCCCCAGGTTTAAACCACTTCTGTCACTC 
TTGGAGCATCATGTTTATCCTCTGTAAAATAGATAGTAACAGGGCCT 

AAAAATTTGAGGGTCGTAAAATATGAAGAATTATTTTAAAACTT C AA1"1TTITGTG ATTC ACATG TTTTC CAT AG AGAGGGITTT A 
TGGTATTAAAAATCATTTAAAGAGAAAAAAATGTTAGTAATCAATAGAGGAAAGGTA 

7 0 GGTGGATGATTGATTGATTGACTGATTGATAGATGATAGATAGGTGAGTC 

GTACAAGAGCACACACATATCATTATTTATAGCCCAGACCAGTCTATTTT^ 

CTGAACTCAACGAAGGGAGAGTCAACTAACCCTGCCCCTCCATTGTCTTC 

TCATGGCTGGGTGGGTATGGTGGTACACACCTTTAATCTCAGCACTAGGAAGTC^ 



75 AAGAAAAGAACT ATC CACAG CCCCGAG CTvCATG C AC^TG^TTTTTATTT AT ATTTATT AT AT ATC^TT ATTG TG TCTGAC ATCAC 
ATATGTACATAAGTTCTCACATGAATCCTGGTAGCAATTCTCCTGGATATCACTAACACAGG

GGGGCCTAAAGACTGAATACAGCCCTCATGAGCACGGAACATTTATTGGC^ 

CTGCCCTTCTCCTCGCTCTCTGCAGACACACTTT T C T GCTCTCTGTGCAGTATC 

CTCT AATCCTGCAAG AACCCTTAG AGAAGGTTCTTG CTT ATGTCGG TGTTTG CTG CACACTGTGAG CTGGTCTCTGGGGGAAGTTC 

AGCATAGTCATCTGGGACCCCCTCTCTGGTGGCTGT^ 

ACAGGACTTTCTCCTCCCAGACCTCCTACCTCCTrTCTCTGTATTCCT 

TGAGCCTGGCCX3GAGAAGTCGTGGCATTATCTTAGTATGAGTCCTGGAATCTCTC 

TTTGAAC CTG CCTTCTTCTG CAAACAAAATTGTCACACGGTCTG AG AACAG ACAGG^ CAC CACAGAGGACTT AGTG CAGT 

1 0 TGGCAGTGGCATTAACAAAAGCAGTGATAAGTTGACCCTATAGACACATACAGCCGTCCT 

CACGAGCCATCCTGTACAGGGCCTTCCCACTGCCTCCTGCCAG^ 

TCTCTGTGGGTTTX^TGGGTTGGTTGATTTC 

CTTTGAGTCTGTGGACTATTTGGACAAAAGGAGAACACTCAGAA 

AGAAAAATGTGTCTGTTCAAATACTCCCAGTATTGAGCATTTAAAT^ 
1 5 AGACTACAAAGAATTCATAATTGTTACAGTTTC 

TCTTGGAGAACATATTTTAGAAAGATTTGTTACTTTAATAACT 

AATCAGATTTTGGATTCAAAAAGCACAGTAGCCAT1TAGCACAAAGGAG 

GGGTCCCTGATTCTTTTAGCTTTCCAGTTTCACAATGC^ 

2 0 CAGTGGCAGATCAAGGCTGATCTCACCACGAGGTGCCCTCCCTCTC 
AGCATCGCCATTCCCACACTGTGTGCTGTGACCTGAACAGCAAATC 
AATTTAATTTAATTCTTTCCTTGCTAAA 

GAGCTTTTAAAAGGCTGTAGTGAAGTGACCAGGCCAC^CCACCCCAC^ 
AAGCCATCTGCTGAGCCCCACGTGCAGAACTGAGCCAAGCTGCCT 

2 5 CTGGGCAGAAGTCGGCTTCATTTGTATTCCGCTGCCTCTGTCTCTAGGCT^ 

TCAT CTGCTGTAGCTTCCGCCT AG ATT AGT AGTCAGGGGAGCAGGGCAGATG CTGGGAAATGGAGATGAACTGTTGGGAGCATCAT 
CCCCTCCTAGCAAGGCTCTGCTGACAGCCAGATGAAATAAACGAAGGGGGCTT^ 
GATTAAAAATGCTTGCAGGGTGGCAGGAAACTGATTTCTACCACTTCCATCCT 
GACAGO&CCTGGTGCCAGCACAGGTCCCTGAGCCGACTCTGAGCA 

3 0 GCTCACAGGGGC^TCGTCCTCTCCTCTGTGCTGTTTTGGGCA.GAGCTGCCT 

ACTTTGTACAGCCCACGTTTCTTGGTAATCGTCACTAAGGAAGCAGATGGTCACCCTCT 
TGCAGACTGCCATCAAACATCAGAAACGTCTTCTGGCTTCCT 
GCTGGAGAGTGAGGGACAGGCAGAGCACATGCACACTGCCTGCAGCTACGAGCCI^ 
AACACACTGCTCTCCACAGGGTCCCAGAAACATAGGCACTTTGCCAGAT^ 

3 5 TGCTTGTCTTCTGTACGTCCCCTCCTCTTGGGACGTTTTTTCT 

GTATCCCCACTGTCCACACTGCCTAGCACAGTGTGGCCTGGCGTCGCCCAGGTCCCTCTGGTC 
TGCTACATTCrTAACCGAGTAATGGGCCTCCCAAGAACCCT 
TTTCAGCTTGAACGTTTCTTCCATGGATTTATCCCAAACAGACCTTGGATCTG 
GTTCAGGCCCAACTCCTGTCTCCTGGCACTCAGGTTTCACTCATCT 

4 0 TCCTTGTCACCTTCTCTGTCAGACTCCTTCCTCCACGCCCCATCCCCAGCCCCGTAAGCT 

TC CTTTCG TG CAG CT GGATTTT CCTTT ATATTT ATTCTCAAAACAT G ACG CT AAACTCT ATGAACTT AG AGTGGAAATTT CAG CAG 
AATATGTTACCAGGGGTAACTGGAAACACTCAGTTTATGATTTCAGAAG 
GAGAATTAGGAGTTTTCCTCTGCCAAATTrGACCGGGTTlXiA^ 
AGCTTTGCTTCAGCTCCTAACACAGGATATCCTGTCCTAGACTTCTTGCTGTCT 

4 5 AGTCATGGCTGCTGTTCTTCTCTAGGGTGACTGTACAGTCAAAGTACTACATACTGTT 

CTGTGGCTTTGGAGTTAAAGAGGCTTTGGGGGTGGCTCTTTGGTT^ 
GCTTCCTCTGCTGCCCTGGTTCTTTGGATCGCTTCCTCCGATGCCCTGCTTCTTTAGCT 

G TGG AACTGT CACCGACAGTCCCAGG CTGG ACTGAG CTTGTGG TGTGTG ATGG CCC CAACAGGCTTGTATTG AC C CCAG CTGTG AT 
CCGTTCTGTGTCTGGGGAAGTCATGAGTCACACTGGTCCCTTCTCTGTCCCCCA 

5 0 ATCTTCGAGACAAACCACITTGTAGCTCrGTAGGTTCAGCCCAGTAGCTC 

CCCTCCTCCTGGCGACCCTGGAGTGCCCCACGCTGTGCrrCAGGGACTCGGCGG 
GCTGTCCTCAGGGACTCAGTGGCTCCCrCCTCCTXKXX^CCCT 
GGAATTCTCGGTGGCTGATGGATGAGTGTGGGGATTCTCTGCATCCT 
AGGGCCTTTGTCAGTGCAACTCCATCAAAGCATTTTCTC^ 
55 TGTCTACCTCCrGCCIATCCCAAGTTACAGACAGGAAACTAACACT 

TCCCAAGTGCT<^GATTAAAGGCATGCGCCACCACT<5CCCGGCTGACAA 

CCCAGGTTTGCCTCACTCCAAAACTCCTrTTCTTCATTTATTTTAACGTCATTAAATTCC^ 

CAGAGCCTGATTTCTACATGACCATTCTTGCCCTTT^ 

6 0 ACAGAGAGGTGGTCCTTTTTCCTAGCTCTTCCTTTTACATC^ 

TGAGCATTTCAAAOVITGTCACCATCrrTAACGTATGA^ 
AGGCAGAAGTAGAGGACAGGTTTCCTGGCTGACCATT'GGGCATGCAGCCZAT^ 
CCCATCCTGATGTGACTCTAACCGGACATCGCGTCTTCACTTAAAGCAGAGAGC^ 
CTCTTACATTATTATATCGCTAGTATTTTCTTTCTAGTCAGTTT^ 

6 5 T T T rTT TGGGTTTTrrGTTTTG^ 

TAAGTCACTGCrCAAGTGGGCCrTTGTTtSTTTTGTTGT^ 

TACTGGGACAGTGTAAGT ATAGAAT ACGCCAACCTTGAAGGTCrGCXXTTGTC CAGCATGGAG CACGGGG 

TCTAAGTGCTCTGACGCTGCAGCGAGTCTCCTTCCACACATCCC^ 

AGCTGAAGAACAGAAACGTCTGCGTGTGTCAGATGTGCGGCACACAGAGCTAGT 

7 0 GT AGTGT CCAG CATT AAATTG CCACTTCTG AT CAGGTTC CT CC C^G AAGACTTCAGGAAGAAGTCTTX3 AGGAGACAGACAGAAGGC 

TGCAGCTAGGCAAGTGGAGGGAAAGATGGCATTTCAAACAAC^CC^TA 
CTACAGGAGCGftAACGCAAACTAACCTGGATTAGCTGTATGTTTGCAATAGGAA 
AAACCAGGTGGTGGATTCTGATTTAATTCAGTTGGGACCCTCCC^^ 
GGCACTCCTtSAAGGCTAGGTACCAAGGCAGGGGGACTC^ 
7 5 GGACTGGTGACTAGAATCTGGGTAATTTTGAGAGAC^^ 
TCTGGAAAGAG CAAGGGGAAG AAAAGTCCCAG CTTAAGGTTTCTGCCCTGAAGCCTAGCAACACAGTG ACCAAACCCCAGAAAGGG
ATAGTAAAACACAAACAAATGAAGGGATTTTGTTCCTTT^ 
TAGATACTTTCTTTATTTACATTTCAAGTGTTATCCCCTTTCiCTGGTT^ 
CCCITTCCCCATCCCCTTCTCCCTCCTCAAAAAAAATTAATTTCTAACTCAGTC^ 
5 TCT CTT AT AAT CTTGCACAT AGT AAGATG AT AT ACTGAACAAT AAATT AAGCTT ACAGT^ c CAGGAGA 

CCCTAGAAACCCATAAATATTTGGAAATAAAGAGCCAGGAGTTCAAATGAGAAA 
AGCAGGAAAAATTAGTGACAAGAGAGGACATAGTGGAGAGAAGAGTCGGGAACACT 
GGAAGAAAATTGGACATTGGAGACGAGAGAGACAGGAAGAGGCTATGCGTAGATCAGA 

TCACTT ATGGTGGGAAGG AGAGAG CAAGC CAGCT CAGCAG CGG T CAGGTGGGGT CTTGTTGTTGTT ACTGTTCTT AGGG AC AAACT 

GCC CAGCAAGCCAGAAATGACTACTGCCCACTTCGTCTTGCT ATG ACT CAAGGCCCCCTGTTGCCTGCTCTTTCTGC CACCA 
AAACCACTAACATCCCAGTAGCMCCCAGGAACTGCCCCCTCCCTGGTCCCCTG 
GATTCACTCTGACACTAAGGGGCTCTGAGTACTCTGCATTTGGTAT^ 
GTGGCAGATGCTTTCTCATTGTAGrTAATTAGCAGTTGTTAGACCCTTA 
1 5 GAACAAGGGAAGACACCTCAGAGACAAGAGACCAAATAAGTTACCCATAGGAAACCA^ 
AGCTAACGCACTGCCCAGCTCTGCTCTCATCCAGTTGGTTCTCTCAGGAG 

AGACACIACACAGTTGTCCCAAAGACATGTTCCAACCCTCAGAGACAG TGCTG CCAAG AG AT ACACT AATGAT AACCT ATGC AT CTG 
TCACATTTCAAG CT AT AATAACTCGG AAAAAC AGG CTG CT ATGG CAG CATGG AGTTGGGGAAC CTGTCCAAATG TTATTGATCAGG 
CAAAGTCTTTAAGCACCTAGCACTCAGGCCAAGAGCAGW 
2 0 TCTAAGCTCTCTAATTTGGCCCAAATGAAGCACTTACT 

GACATACCCTTCTCAGTCTCAGGTTTTACCCCTGCACCCTGCATAGGCATATGCAGAG^ 

CCCACTCTCAGAACCACACTGTCAGG CCTGGGACTTTGTACCCTCTGGAGGGAGGAAG CTAAGACAAGAGCAAATAG AAGTCGGAA 
ACTAAGCACTGCTTACTTCTTTTCAGTAATAGCTTGTCATTGCT^ 

TTTTTTTTTT AAAAATCTG AAAAC CAAT ACAAACAAACAAAAACT ACAC CTTT AATCCTGC CAAGGCAG AGGCTGGTGGAT CT CTT 

2 5 GAGTTCAAGACCAGCCTCATGTATATTXn'GACTTCCGGGCACT 

CAAAAGTTGTGTTG AGGCTGG AGAG ATGG CTCACTGTTC^GGCTCCAG AAGCCCTCAGTTCAGTC CCAGTACCCAAGTCAAGTGGT 

riTACAACCTCCTATCACTCAGACTCCAAGTCAGTCCA(^TTCTCCTTCCGCCT 

ACACACAAATAAAAGAAATGTTTTTAAGCTTGTGTTGAAGGTAGCT 

TTTAAATACTTGTTGCTTGGTGTGAAAAATACTITGT 

3 0 ATCACATGC AT ACT CACT AATG CC CTC CTGTCAG AAACCCT CAG ATG ATGCAGACTCT T AAATTTTG CGT AT CTTT 

TTTGTCATTGTTTTGTCTTTGTTTTGTGATTTTTCTGTGTAACAGCCT 

AG AACTC AG AGAT CCACCTCCCTCTGCCT CCCAGGTGCTGAGATT AAAG ACATC 

TACCTTAGAAAATGAGAAAATGAGAAAGCAGATTAGTATAGCTGGCTGATACATTTACTT^ 

TTTACTCAGAAAATCAGAGTGAAAGCCACAGGTTA 

3 5 TTGGTGGG<^CAGCATCTTCAGCTACATGAGTCTGTCTCAAAGAAGGGAGACCAAGATAACTGTCT 

CCAGAACCCCATCCACTGGCCATTGTTGCAGGAGTCTGAGGAGGACCCCAGGATTCTTTTCCT 
ATGTATGCCAGAGAGAAATATCCCATTTTACTTTAGATATTGTCAGATTATAGAGCCATGGATTC 
ATATCTGTCCATCCATTCATTTATTTATGAACAAGAGAATTCCAGCT 
CCTGACCCCCTGACCCCAGTGGCTTGGAGTTGAAGCTGGACGGACAGAATACCAAG^ 

4 0 CGAAATGTTAAAACAAAATGACTGCTCAGAGAATCCCAGATCATGCAGGCGGTCGCCAGCA 

TAAATGGAATGCATAAGAGTGGAAGAGAGTGATAAGCACTTGCCTGCTCCAGTGTTAGGGTGAGATGCC^ 
GATTTTCTGCAATTTTGATTTCATACTC^CATGAAAGAGATGGAAAAGAAAA^ 
AACATTGCTATGGTGTACTGTATTTAAAGCAAAAATGTAATTGCTGTAATTTAT 
AGATTTTAATCATTTGCCACTTTTGCACATAGTGACTTTTCGTAGATGATGCTGTGA 

4 5 CTTTGTGGCAGCTCCTAAGCATCATTAGTGTGATTTCAGCATC 

TTTGGGCTGTTCAAGCAGTGCCCAGAACTGCTTTATAGTTGCTGACCCTAACATG 

GTCCACATCTTTCTAACTCTGAGACAAGCAGGGTGGGCTCTGCCTGTCCTTT^ 

ACTCTTGAAAGTGTCTCTGCCTTTGAAATGAGTGTGTGTGT 

ATTTGAATAAGAATGGTCCCCCACAGGCTCATATATTTAAATGTTAGCCATCAGACTCATAAGCT 

5 0 TAGTAGTGTTATTCACTTGAACCCTCTCTGGCTGGTAGGAGAAAAATAGGAGGAAAGCAGTCACAC^ 

CAGG AATAAAGATGGACAGAATGGGTAGG AGGAAGTG AG AAGTCAGCCIACGGG CAAAATCATTTC CCC ATGGTATGTGTAAG AAAT 
AGAAAATATGGCCCATTTCAGTATAATGTATGGGCTCTATCCATTCACTCCATCCTAGGGGAA^ 

CACCT CAT AAGAC CAATTTGTG ACTTT AT AT AC CT CAGG AC AC AGG AGAAT AGT ATG ATGACCT AATC CTGTGTG T ATTT ATT AAA 

5 5 GATGACCAAGGCAACTCTTATAAAC^AAAGCATTTAA 

GAAGCATGATAGTGTGCAGTC^GATATGGTGCTGGAGCGCTAGCTGAGAGTTCT 
CACTAGCCTX3CCTTGGGCTTTGAAACCCCAGAG 

CTCCTAATCTTTATAAGTAGTGTCACTGCCTGCTGACTAAGCATTCAAATGCAGGACCCTCTG 
ACACACACACACACACATGCACAAATCAGGAATCATACTCCAGACCAAGCACAT^ 

6 0 AAAG CT AGCTAGCTCTTCT CTTTACTGACATAC AAGAGTTTTC AAGTCCAACTCT^ 

GTCACTGAGCCAGGCCCAGGGCATAATTCATCGCCACACCCGCTATTCCTTCACCATCT^ 
CCCCTGCAGATGACTTTCCAATACTTCGTCTTCCCTCTTCACCCCAGTGTAGGA 
TGCAGGTGCGAAGTTTCCCATTAAGTGGACAGCCCCCGAAGCGGCCCTGTATGGAA 
TTGGAATCTTACTCACAGAGCTGGTCACCAAAGGAAGAGTGCCATACCCAGGTAAGG^ 

6 5 GGGTTCTATTTTAACrrCATTTTACAGGAGTCTTXSCCCAGCT 

CAGGGGAATGCAGAACTCAGAArTGCCGTGACCTGTACACCAAATTCAATGTGCT 

AGGTCATGGAATTTGTCCTGTCTCAGTAGGTGCATGTGTCATGGCTCCCACAGCAGAAATC 

GGAGCTTGTAGACACGTCTGGGAAAGGGAAGGAGAAAGGGTCCTGTT^ 

GGCAGGAAGATGCCCGTCCTGTTCTCTCTGAGGCTCCAGCCCCAGATGGCCCCAGGCGGAGGTGCAG^ 

7 0 CCAACCCATTCTCTGCCTGTGGCTGAATGGCCAAGATACACAAAA 

GGAG ACAT AAAGG ACACG CAC CATTTCG C CTCCACACTTT ACTGGTTTCAGGG CAGGGACGGTCTG CTTTGTG AAATGTT AAGGAC 
ATATTTTTCAGTTACTGCTGAGCAAAGAAAGAGCATAGCTGGCT 
AGGCCCACAGTGATGGCACCGATAGTAGAGATTAATGAGGAATGTAGG^ 
7 5 GTCAGAGGAGCCAGCGAGAGCCGATTAATTTGTAATCCCTTTGTTCATTTTGTCTGA 
AAGTGTTCATAAAGAGCCCCTCTGTCTACKSCrAGra

AAGTGCOVGGCTCTGAAGCCCCAGCCCAGCATTGGGAGGTGTGGGTGTGTC 
CTGCGGCTGCTGCAGCGGCTGCTGCTTCCGGCATGCAC^ 
TGCCCAGGAGGGGGTGGGGCCCAGCZAGGGGTCCTTTTGAAT^ 
5 AGAAGGCACTITTTAAAAATTATTATTGTCTATATTTTGAACT 

TAAAAGCAATCACAGGAAAAGGTTAATGTTGTTTTGCATTTCCGTGAA^ 

TTGGAGACCGGGGTTTTGTCAGGTTGGGTTTTAGCACAAAGAAAATAACCTW 

AT CATTTTGTGTGGCCTGGCCT AG CATTTGTTCCCTTGGGAAGTAGCACAAATG AGCCATTCAGCTCTC^ 
1 0 TC CTGTCCC CG AGCAGGTACT ATTCGAG CTGGAGGGAAAAC CATGAGCCACT AAAT AG TC CTG CC CTCACGGAGCGTT ACTG CGGG 

AGAGTGTTTGTGCCX5TGCTTATGGAGGGGAGGTTACTTAGATGGGCAGACTC 

ATGGCAG AAT CGGGGACACAAG ATAGCCCTTCATTGAGAGACC CATGAAG AG AG CAGAGGAG TTAAG T GAG AACTGGGTCCAG CAG 
AGTATGTGTATTTACTATGCCTTCCATGACrcCAGCTGCCTGCAGGGGCC^ 
AGAGCAAGAGGCTCGCTCACGATGAACAAGATGTGTCGTCCTGCCTGTCACCC 
15 GACGATGGATGAGGTGACGTCCTTGATGGCCTTAGAAAGGG^ 

TTGGAGCCTGGCAGGTTGACTTGATGGCCCAGAGAGCrrGCCTG^^ 
CTGCTTAGTTTTATTTTGTAACTCAGCTACT 

TTATAGTTACAGGGTTGCTAGCCTAGCAGAGGGTAGCACTTAGCATT 

2 0 AGGTCTCTGTGTATCAGATGCTTCCATTATGCACTTTGACACACTAATTTCT 
TCTCAAGAATGGAAGGTGGTGATAAACTTGCCTGAGTTCITTCATACCTGA 
GCTCTGGCTCATAAGTGACAGTGAAAGGAGATGGGGGCTGACTTACAAATGAGCT 
ATGTAAGTCAGTGC^TCAGCGTAGAAGGGAGCCTTGGAGAAGAACTGAGTGATGCTAGA 
GCACTGAGTAGGCGCTGCTGGTGTCCTCAJGGCTGTGCT 

2 5 TGAACAAGGCCTTGAAGCCGTAGAGGAATATTGAAGTGTTGTCCTGAGGTTAAAGGGAG 

ATGGGTGACAGAGACGGACACCATTGCAACGCACAGCTCGCCTGCTGCAGCCCTGTGX3GCTAAA 
AGGGCTTCAGGCTGCTGGGTAAGGCAGGACATGGGAAGTCTXSACTTGGTGCACCC 
CCCTGCTCTGGGGGCAGGGTAGCTCATTCTGAGGTCTCTGCGGACCTGCCGGGGTTGGCTTG 
TGAGGACAGCAGGTTCTAGCrcrTGGGTACCACAGAATGCCTCAGAAG 

3 0 GAAAAG AGTGGGAAG AG AGCTCAGGCTGGTTCCCGAGGGGAGG AGAGTCTTTGG CTTG CTCAGTGGTCAACTCAGGTTGGTTAAAA 

GGGTGGCTGAGAATGCAGAGAGGCCTTCTGTGTGAGATTA^ 
AAGAGACAGTCCATGGTCTTAAGGTGTTTAAAGAGAAGGAAGAGA 

GAGCTTGCCAGGTAACGGTGGGGAGGAGCATATCTGGCTGTTGCCAAGTAACCGTGG^ 

3 5 TTGGGGCCCTGTGTGACTGGTGGCCACAGAATTATGGAGCTC 

CTTCTGTCCCTTGCAGGAACACCTGCTGGGTCTCCAGGGTTTAAACCTAACTC 

ACAGCCTATCAGCTGTGCCATCCTTGCTGGAGTCCAGCCTGCTTTCTGTATGCAGCGCTTTAAGT 

ACTGTGTTTAGGCTAAGGGATTGTTGTATGTGCGCAATGGTAATGCTGGCGCCAAAGCTG 

TAGCTTTGC ATTTTAGCATGACATTTCACCTCTAGAAGCTTTCTCA CCTTCCCTGCCCCTG CCTGTATC 

40 AGGCTGTCACCACTGGGTGGTGACCTCCTAATGGAACTTCCTGCCCCCCCC^CCCCGTTCCTCTCCCTT 
CCATTTTGTCATTGTCXjCCAGCAGAAATGTTATCCrrGGAAAA 
CTCTTGCCCTTGCCCTCCAGATAACTCCAGAAGGCCGCACCCTGCACAA 
GTGCTTTTCCAGACCCAAGGATTATAGCAGTGAAGTAACACGAGAGTTCCTGCCCTCA 
AGGCAATACAAACCAACAAACTACACTCACCCrrGGCTAGTAGCATTGAGTGGAGCAA 

4 5 GTGGAAGTCACTAAGAGGTGCTCGCTGACTATAACTCTGATAGGAGCTAGCCTTGG 

CCTGCAGGGAGCCMGGAGCCAGGGAGCCAGAGGCCTGGAGCGATGACTGGTCGCTTG 

TClTlTl^TTGGGGTTTAGTTTTGAGCTTGTATTCTCGTTACAATGTTTCrCCTTGGCU'i n CCTCCTTCTAAACCTTCCTAGTTA 

CCCCTCCCCACAOVCTCTTTCAAACTCATAGCCTCTTTTTTC^ 

TATATATTTCTAAATACAGCCTGCTAAGTCCATACAGGGTTGCT 

5 0 CCCTGGGGAGGGCC^CCTCTCGCTCCAGCTTCACTCAGTTGCCAATAGTTTT^ 

CATTCAGTTG ACAT AACCATTG G TGTC CT CCTT CGTCAGGTCATGTTT AGG CAGT GATGCTGGTG AG ACTTT ACAGG CGT AG CTTC 

TGATGTTACTAGGAGACAAAACCTCACAGAATGCTGGTCCCCTGGCTCTTAGAGTCTTTCTC 

CTTAGGTGTAGCTGTGTATTGTAGGTGTGAGTGTTCTCCTA^ 

GTCTCTGCTGTCTCTCTCCCTTTACATCTTGGACACCCCAAGGAATTTGTCTATGACAGCATCTCAOT 

5 5 CTATGTAGACCAAGCTGACATCAACCTCACAGAGGTCTCTCTGCCTGCCTCTG 

AGGAACGGGGCATCTTCCATGTTCTGATTCCCAGCAGTGTCTGCCCACAGTAGACTCTGT 
TAAAAGGGGAAACATTTGAACTTTTTTTTTAAAAAAAAAAAAAAGGAAGA 

AGCCTCCAGAGCCTTATAACTCTGCCCGGTTATCCTCACATATGGCCAACACGGGGTGAGTGTTCCCAGCCG 
TCCCCAGAGTTCTGAGAAGTTGACCCCACGTTCAAATTTCATATTCCAGTA^ 

6 0 GTGGCCCTCGACTTGAACAC CTGCTTGTGTGCTGGCGGG ATCTGG CAGAAC^GGGCTCTTCTCTTAAGCAAATCTGTTTGTCTGTGC 

ATTGGAAGCCTCCTCAAATCGTTCTTTCAGCAAGTGTCAGCTGCTTAACTTCCTTGAGCA 
AGGAGTAAGGTGAGCAAGCAAAACTGCrTCGCACCTCCTGTAAAAGCCCAAGCGCCG 
NNNNNNNNNNNNNNNNGCATGATCACATGCCATGAAGACAGGTA 
AGGCAAGTGTGGACTGTCCAGGTAGAGGGGCCATGAGAAACAGTCTGAAAACAG 

6 5 CTCCGCAGCCCATG<K3CCCCAGGGCCAAAGCACTC 

AGGTTTCTCTTGGCTGACGTGGCTGATAAGGGGAAGAAAGGATAGGAGGCT 

TCCCAGTACAGCCTGAGCGCAGCGCTCTGCCGCTGTCTGGTCTTAGCTGTGGTCAGGCCCCTGCCT 
CTCTTTTTTCACTCCTCTCTTCAGAGGACAGAGGAGTTCTTTGTAGCACTGTATTATCGAAAGC 

TGGG AGACAGGAGCAGG ATT ATTACTGTT AGTTTCAGG T CAG C CTGGG CT ACG T AGCAAG CC CTTGTCTC CAAAAAT AAAG AAAAA 

7 0 GAAAACAAGAAAGGAAAAGCCCAGAAAGTATAATGCTGCATTATCC^ 

CATTTCTGATTTTATTAATACAGCGGTTTTAATAA^ 

TAATTATTGTCTTCTCAGGGTCAGTCCTTCIATCTCCATCTGTACCCTCrGGTCAGTA 
CATGGAGT(^CTCTCCAAACCTCCTGGTTGTTCCTGT 

TG CAGGCTTCGCTCAGCAAGCTlCCATTAACTCCA CAG CTCACAACAGCAAG C AG ACAGAT 

7 5 GGGTTTTTAAGAGAGCTTATACAGGGAAGCAAGGGGCCTGA 
ATGAAAAGCTCAGTTCC1&AAAAGAAATTTCA

GTTGTCCCGATTCCCACAGCCATGGTGAAGTCCA03CATGACTAGAAAATAAACCAC^ 

CTTGAGAAG CCACACAAGTGGGAAGCAGTAAAGTATTAATACTrTT TTGGTGAGACCTTGTGCTAAGGTGATGGAT AAGGTGTTCT 
GTTAACAAAAGGAGGCGCTGAGGCTGGAGCAAATCGGGTCCCCTGCCAAGAGGTAC^^ 
5 CTGCCTGTTGAAGTGGCTTTTGTTCCTTTCCTCAAGAGC 

GAATAATACTGATGCCTACAAGAAGAGGAGAGGCAAGGACCCAAAGG 

TGG CAACTG CAC AATGTCTG AGTTTGAAGG AAGAGTTGGAG AG CTGACTTCTCGTGT^ CAG CAAAATGACTCAG 

AAAG AAT AATACAGAGATCATGTCAGGAAGTGAG TGCAGT CCACTTGG CCAGAGGCT CAGCTACAATG TTTTAGGCAG AAGG AC 

TGGAG CTG TT ACAGC CATGACCATGTTTGAAAAC ATGAAAG AAGG AGTTGAAGGGACATCT CAG^ 
1 0 TGAACTCAGGCCCATCACCTGGGTCCTCGGAAAGAGTGGAGACATGCrrGTTCTCCTG 

AACTGGGTAACGGAAGGGTATTTTTTAAGAATATTATCATGCATGGTGTACT^ 

TGAGTCCTTGCTGAGCCAGAACATCITGACCATTGCAAAGGTTCACT 

GTTCTGGCCTTTGGTTGGTTGGAAGACAGCTATTCTGTTGCACT 

GCCGTTGGTATGTGTCTGTTTGAGCCAGTGCGTAGAATTC 
1 5 AAAAT AAAAGGAGAAATGGGGTGGGGG TTGGG TG CTGCAGGAAAGGAGG CAG CAAGCTAAG AATG CAT ATC AAAGCTG ACAGTGTG 

GCCTTGCAGTCTAGTCCAGTCCTCACTCTGCATATACTGAGTATAC^ 

ATGTGACATGCTGTTTTGGGTTGTCACATGTACTATGTCCAAGTCTTG^^ 

AGTTAAACTTTCCATTGGCAG&GACTGAATTAA 

TAAACCATTTACTGACAAAAAATTCACGGAATTCTGGGAAAGTAGT^ 
2 0 GGTTAAAGCCTTGAGATGTATTTGGGGAGCGGTTGGAGCGGTTGGAGCTGCT 

CATACCTCTGAGCTTCAGGCCTCTATGGACCTTTCCrCCAGGGG 

AGCTGTGTAGTTCCAGAAGCACCATGAAACTAAGAGGTTTATCTACT^ 

CCTATGTATGGCTGTGTTCTTGTCITGGCCCTTTAGGCTTCCT 

TAACCTCCCATGGGAGCAAATGCACATTG<5AATCAGACATTGT^ 

2 5 GTCTGGGATGGAACATAGGAAGGAAGTGCCATCAGGGCTTC^CAA(^ 

AATCCC^GTCCAATGTTTCTAGCC^CTTTTGACCATT^ 

CAATCAGC^CTAGAGCTGGGCAGATACCTACCATCTATGCTATTAAGCCT 

TTTCATTTGAGCTGCA.CTTAATTCCAATTGGCCAAC 

3 0 GCTATTCX3CTCTTGGCATTTTTATTTA 

CC CACIACACACACACACACACAAT ATT ACAAT AGACAAG C AAACCC CAACAGG TTCjWS CTCTGTT ATT CTG CCG AGTGCCA 

TTTCCTCAGCTTACTGGGGTTTGAACTGTTCCTGAGGAAGTGCT 

TTGGCTCCAGGGAACACCCAACAGAAAAAGGTACTCCC^ 

3 5 TGAAATTATTTGGTCATATTAAAGGTCTACTTAAACACAGTCAAAGGAATGC 

AGAATAAAAGAGCATCCTACAAGGACTTAGACAAACAGATGACTGAAATAGTTGTC^^ 
GCTTCAAAGATGAGAGCAACCCAGAAGCCTGGATGCTCCCTCCAGAG^ 
GACCCIAGTGGTCACCTCCCCCAGAATTAAAGAAGGCTGTGTTGAATATT^ 
GCTGTAAATTACAGCAGATTCTTGCTGCTGTAAACTAACTGTCAAGAG 

4 0 GCAGAGAG AGGGAGAAGGAAGGG AGG AAAGGGAGGG AGGG T AGG CGGGTGGG CGG ACAAGGGAGAG AGTACCTGG AAGCAGCTG AG 

AAGGGAGCACACTGAGTGATGCCTGCTTCAGCTTCCC^TGATGCCTTCTGCT 
TGCAGAGGGGC^GGGAGGTCTCAGATGATAGATGGCCGGATGCTGTCATTCATA^ 
TCACIAGTTCCCAAACCTCAGAGCACTGTGAGGGTCACCAG^ 
CCATGCCTAAGCATGAGGAAAACH^TGACATTCACT 

4 5 GTGCTTTAGTCATACAAACATTTCTTTTTTAATTTCT 

CAGGTCCTCTGGAACTGGAACTACAGGCAGTTGAACTAACrc 
AGTCATCTTTCCATCTCCCGAAGATTCCTTTTTAATTATGTTTCT^ 
TCCTGAAACCAGAGGTGTCAGTTGCTCTGGAACTGGAATGACAGGCCGCCT 
TTTGCAACTCTAAGCAACATTCTAGCAACATTAAGCCCCT 

5 0 ATAACAAAGGTTAATTTAAAAGTATACTCCCATGAGTTTTGACTAGTGAAAT 

ATGG ACTG CAGCCTGAGTT ATTTCT CATGGCAGAGC ATGGCTCCCAACCTGG CTTTGTTTTAAACTGTTTCAGTGACAGAAAT AAG 

ACAGAGCTTGCATGTAGCCCACACAAACAAAATATTT^ 

TCACCTCTCAGCTCCCTCrrcCCCTTTCACCCGTGGCCCCCACCCa^ 

TTCTATC^TGCAGTCTCTGGCCTTTCrAGGGTCT 

5 5 GTGTAGTGCTTTGAGAGTC^CTGTCTGGGCTTTTC 

TATTACAGGCATGGTTCCCCTCCTGAGAGGGAGGACGGCCTCCTTATGCTTTAACCTTA 
GTCTTCGCAGGCATGAACAACOSGGAGGTGCrcGAGCAGGTGGAGAGAGGCT 

CCTGCACGAGCTCATGATCCACTGCTGGAAAAAGGATCCGGAAGAGCGCCCGACCTTCGAGTACTTGCA 
ACTTTACXSGCCACAGAGCCCCAGTATCAGCCCGGTGAAAACCTGTGAGAGCCTGCGCri^ 

6 0 CCCCTCCCCATTAGCTTCCAATTCTGTAGCCAGCTGCCC 

CTTGAACTTCCACGGCCCTCATTAATGACACTTGTCCCCCAGTCCGAACCTCCTCT 
CTCAGACITGGAAATGCATTGTATCGATGTTATGTCAAAGGCCA^ 

C CAGTG CTTTCCTTTTTTAAAAAAGAAAAAG CAAATCCTATGTGATTTTAACrCTGTCTrCACCTG ATTCAACT 
GTATTATTTTCCAAAAGTGGCCrCTTTGTCTAAAACAATAAAATTTTTTTTCATGTTTTAA 

6 5 GGGT1TTTTTTCCCTTTTTTATACATATGATATATATGTTA 

TGGCCAGCGTAGGCCACATAGCTACAC^ACCGGAGTGGGGATTACTGCAGAACCCTGCCAGCA 
CCGGTGGCCTCATTTTTTGACTrCTACGAAGCATGACCTCCT 
ATTGTTCATGTGACTTTTTTCAC4GTCCCAGGGCCTAGTCACGA 
TGAAACTTAGGAGTGAGCATTTTAAGCAAAAAGCAGCCAGCC^ 

7 0 TGTATAAATATGAATGCTGAAGGGTTCAGGTGTTTTCCITTTATTTAATAAATCT^ 

GCATTGTAGTCATGGCAACCTACTAAACTTTCTCIATGCAACT 
TGTAAAATAAGTGTTTTACrGTCCTGTACTOCTAATGAGTGACTCrCCG 
TACAITTCATGGGGGTGCACAAAAAAAAAAAAGTATTAC^ 
ACAAATCJIATTCCTATTTCTATTGAGTTTTTTAATACT 
7 5 TTGTGAGTTTACATTTTTAAATTGTTTAACTTTCCTAATT^ 
TTTATTTGTGTACACCACAGJUVGGTACCCTAGGCGTTCTGTGACATACCT

CGGGCCACTG1GCAGCX3ATTTCCAAACGACAGAAACATGCAGAGCATCTGG 

GCTAACTCAGAGTGGCTAAAGCAAAACTGGGCGTACTTGAGCGCCATAGTGTA^ 

C CT ATTTG T ACAGG AGACAGT AGTCTTTLTTGG T CTCAAGTCATAAAT ACATTT CCAAGT ATCCT ACTTCT AAAATG AAGTTGCT A 
5 AGCATGTGACCATGTGGTATGGGGGAGAGGGGGAGGGGGGACTGGGTAGCCATCCAGGCCTG 
CTCCGAACCTCCCCTGCAGGCCAGTGTGTCTGGAGACAAAGTACAGACAAGATGCT 
CAACTCTAAGTCCACCGTGTCATATGGAGCCATATAAGGAGGAAAATCTTTCA 
ATAGCGTTTTGGGGAGGAAGATGAAACATTTTTTAAATCAAAACT^ 
AACTTACGAGGTGCAGTTTTTATCACATAGGATTAGTCATTAGCAGGTTT^ 
1 0 ACTTTGCATTAGACAATTTGCAAATATTCCCTGG^ 
TTTAATTTCATACAGCTTTTTAAATTTCATC^ 

TGAGAGAG CACG CTTACGAGG TCAGAGTC CTGG AG CACT CT AGAAT AG AAGAACT AC AGTCAGGAAGAAACCAAAG AAATGT ACAG 
GTTTTTCCAAGAGTCCAGTGTGCTTTCrAGGGAGGCrCJUVCTCCCAT 

GGGAAACTGAAGTTG CTCIACCAGAAAGTT CTCTT ATGACAAAGG AAAAAT AG CACTT ACTG TGAAG CAGGTGAAAATGTTTGGGGA 
1 5 GAAAACCTCTTTGTTGTTTCTCTTTAAA 

GGAGTTACAGACAGTTGTGAGCTGTCATGATGGGAACTGAACCTGGGTCCTC^ 

ATCTCTCCAGCCCCTGTTGCTTTCCnTTAGTAACCCAG 

TTTGCACTTGGATACCTTTTTAATAAGCTATTCrrC^ 

GAGT AAACATTGGTTCACTCCACTG T CGTTG CACCACACACTCAGG CTGCTG CGTTG AGGG AC CTG AT AGG ATCC CAGCAGATG CG 
2 0 GGGTGCCACATGTCAGCAGCGACTCAGTGGACTTTAAGGGCCCT 

CCATGGTCCCTCGAAAC CTGCCGTGTGACTC71AGATCTCCAGGCACCTGGCTGTGTTTACTCACC CGGAGACGACAGG AATCCTGT 
GAGTAAAGTCTTGGCAGCAGAGGATTCTCTAGTACCAGCAGAGCTATGAA^ 
CCGCCGTATTAATTGGGATGGTTCTTTCCTGGGAAGGTA 
AGAAAAGATGCGATGTCTATTGTTGGCCCTGACGCAGAACAAGGCGG 

2 5 AATGGTATTTGGTCAGTTTGTTCTTGGAAGATATTTTAATACATTCAGATC 

GAC LIT CT1TCTG TT AGGCTAGTGTCAGAAAT ATACTTTCTGG AAGGGAG CTTTG AT C 
CTGAAAGAAGGGGAAAAACAGCCTGTGGCATCATTATCTAAGTTTAATTl-TriT 
GAATGTAATAAAAATGCCTTTATTATCTTCTAAATATCAAATG 
TGACAAACATCGAAACTATTCCTCATTTTTCCCGATGTATGAAAT^ 

3 0 ATCTTTATTTTAAAATAAAAG CGGGAGAATATAGCT AAGTGGTAAG CC^CTTACTTTGCATGCCTTTGTCTCTrGTCTCTCTCTGA 

CCCTCTCAAACTCTAGGCATTGAAATATAAAGCGGTTTGTGATTTTGTAGGAACCAC 

TTCTTTCAGTGGGTTITCTGTGGCAGACTGAGCACTCCTGGAG^ 
35 TGAACXKSAGATGAGCTTCTGGCAGATGTGGGGCC^^TGCIATGAGTGGATGGTTAGATGGATGG 
GCTTTGCCTTC^GATGACACXSCTTATCTCAGAGCCTGCrGGGTC 

CTT ACAACTCGAAGTCATCAGTCTTCAGGATAACTGTGAAAG CTTTCTCTCCAG CCTG AACCAGGCAC AACCAGTGACCCCACTCCT 

TCCCCCAGGTCTCGGCACAACACCCrCGGTAGAAAGTCAGACTTCTTGGACCGTC 

GAGGTATAGTTCACAATGCATGCAAGTGAGGTGCCATTTGAATTTGATAATTTTATGACAACTGG 

4 0 CCATCTTTGTGTTCATATAAAAATGTATCTCAACTGGGCGTGATGGTACAC 

CGGATTTCTGAGTTCGAGTCCGGCCTGGTCTTACAGAGTGAGTTCCAGGACAGCCIAAGGCT 
TTAAAAAAAAAAAAAAAAAAGTATCTCCATGCCCTCCAGATGTT 
CTCIATGACCCAGGTACTTCT^GCCTACGTGAGAAGCTCTGTATGAAGTGGT 
GTATTCTTGCCCCCTACGCATGCGGCTTTCCrGACTTCCCCACT 

4 5 GGAGAGCTTTATGACAGAAAGGAAAAGATAAGCCAGCTGGAGCCTCTGGAAGCAGAGGTCAGATCAGC^ 

G CAAATCAGAG GAG GAGGG ATGG AAAC CTT AAAT AAAGG AGAAGACTGAGCTGGG AAG AG AG G CG C C CTGGTCATCACT CTCTGGA 
AGAGTGGGAGGAAGGCAGACATGACAGGAGGTACACATGAGGACTAG 
AGCTAGACCACGTGTACCAGAAACTCAAAGGTGTATGTTAATATTCA.CACCCCAGTGAAG 
GGGAAGCCCAGGGCTGAGTCGAAGGGAGGCACCCGTGCAGCATGGGGTCTGCAGGGCkG^ 

5 0 TCACATAATAGGGACAGAACITCTTGCTTTCAAGGTGACACC^ 

TGAACAAAGAAAAAGCAAAGCTACATGCATCACCTCCCCACACACATCTGTCCTTCTAGATGGAGGTGTGG 

ATGCGGTGTCACTGTGCGCCTGTCTCTGCCTCCTGCCITGTGCAAAGAATGT^ 
TACATGGGCACAAGAGACCCAAATTCAGTTCCTCACAAAGGCAAAGCAA^ 

5 5 AGAATAAGGGACCAGACAGGGTGTTGATTCCC^ 

TGCTCTGTCATACAATTTGCAAATTTATAAAAGTTACTGCTCAGAGTCACAAGGTGCTCT 

TTTAGTTAATTATCTAGTGCTTGCTCTGTAATAATCAATGGGTAGCT 

TAGTGTTTTTAATGAAAAGCACACATGAAATACTATAGGGAAAGAGC^ 

6 0 TGCATGTGTTTACAAAGAGATTCACATAAGCTTGTG^ 

CCCCCCCCCAAGGGGGTGAGTGGGCATTGGAGGATACCAGGCGGAGGCCAGGCCTCAGGAGA 
CTAAATGAAGCTGAAGGGCTGTCAGTTCCCAAAGAGTGCATGAGGAAATGGTTAGCA 
ATGAGTAACAATGGTTCCTGGTAAATTAGCCATATTACATCATGCACACAG 
TTGCCCAAATCCTTTAC!ATAAAATGGATTGACTCTTTTTTTAATTTG 

6 5 GAAG CT CCCAG CAGAGCAT CATTG TGAGAGGAGG AGG AGG AGGGGG AGGAGGAGGAGG AGG AGGAAG ATAAAC CATCG AAGG CCTT 

CATAAGCCATTrCACCCCCCTCCTCTCAAeCTGCGAA^ 
GGAAGGCATTAATCAITGTCTCTGCTATCACTAAACC^ 

TATGTCTGCGCTGACTCAGACGGTCTCTGGGTTTTGAGTAGTAGGGGGAGGATTGGA^ 

AGGCAAG AACAAAAGCAGC ATG CGTGCAG CCAAAGGGGACCTTGG AAGTTACACAGATGAAACAAAACGCTATGCTTGCCTGAG^ 

7 0 TCTGGTTTCCAAAGCAAACGCAAAATGGAGAGCCTTGTTCC^ 

GCCCTGTAATCTACTGTGACAGCCAGAAGCTGCACTGGTGCTCTGAATTATGAT^ 
GCAGCGTAGCATTTGTAGCTATAGCATCCTCTGCCCTTrGGCAGA 
CTGTCCCAGCCCCCACGCCTTTCCACCTCTrCCCATC^ 
CrTCATAGGATCCCTGTTAATTTATTAAAATCAGAL T l T^ 
7 5 ACCTCAAACTCAAGACCTGCCTGGCTC^GCCTCCCTCCCAACTGCTAG 
GGCAAACCTGTGGCTGCTGGACTGGATCCTGAGCAGGCATCCTGTAG
AAGTTGAACTTTGTAACCCTOVAATCrCAACACTTCTCTCCTGGGAGCCCA 

CCCTGCCTCGCCCCC^GCCCCCAGCCCCCAGCCCTGCCCCCGACCTATGATTTATCAGACTTTATACCTGA^ 
CTATCAATCAGCTTTATGCAAATAGAAAAGGCGACCCT^ 
5 ATGCAAAAGGATAATGATT CAAATGTGAATG C CACC CTTTCAAATCCTG CAG T AC^AAAAAACGGCACAAACTGGGGTG TCAAT CC 

TGCAGTGTGAAATCACCGGAGCTGTCCTGAAACCCTGCATAAAATCATGGCCCCATGA 
TAAAATCTAAGCCTTTCAACCGTGATCAGAAACTGTGG^ 
AGAACTCAGCCAGAGCGTGGACTCTAGTCTAGGXjCACAC^ 

AGGGCAGAGACCATGCTCAACCAGGGTGGGGCAGGGGGTGCACCACACCCCTCTGCC^ 
1 0 ACTTTCTGGTAATCTACAACACAAATCAACCCCAAAACCTCATGGCAGAAATAGATGCA 
OIGTTCAGAGGGACTTCTAGCTACGACTGGCCTCTGCTCCTCAGGGCC^ 
C^TAGCTCCTAGCTGCTGAGCAAGAACCCAGCTGGGACTGTGGGTGAGGACCTGAGCT 
COaTGGGGATCAACCGTGTTGAGACATCCCAGCTGACrCAT^ 

AGTTTCT AAACAAAAG AAATG ACTGAGG ACTCT AGCT ATGAG GTTTGGGG TTGTTTTAT AGCAG CT CAT CAC CAGAATGTG T AGG C 
1 5 TATCACAGCTCCCTCGTATGCCTCCCCTCATGCTGTGTCTAGGAAAATGAAAACCGAATCT 
ACAACCGTGAATTAGACTATACCGGTCAGATGTAGTTGTGGAAAGTTTGTAAGGCCA^ 
TTGTGCTTGTGTATGCCCTAGGAAGCAAGAACATGTA^^ 

CTCCTGGTTCATGCAGGAGACCACAGTTCCGCTGTCTGAGTGGTTCCCrn-GGTGAACCTCTT 
ATTGTTAOVTTACATCTCTCTTCACTTAAAAGCCATAGGGTGGGGCA 
2 0 CCCCACTGTTCCATCTCCTATCTCTCTGTCCTTGCTTGCTTGCT 

GGATGGAAC CGAAG CCT C CAAT ACAAT ACACTT ATCAAG CACTCTCAGC CCGCTGG TCITl'Cl'l'TT AGT ATTT ATCTAT ACTTG TG 

CCGGTGGGTTATTTCTCTGCCCGTTGAAATGAGCCAGTTCAAGCCAGATTAGATTATATATA 

AAGGGTGGNNNNNNNNKNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN*^ 

2 5 TTTTTTTTTTTTGCTCCTCCAAAAAAAAATC 

GAAAGTTCAGGGTTGGGGGCAGGGTATGCCAC^TAGGAACTGAG 

AAAACATGCACCTAAGTCCCTAAACCCAACAGTGCCAACCCAAGTTTCT 

AAGGAGTGAGGACTAGAAAAGGACACCCTCCGGGGCCTGTTGGGTTGCTlTCTriTGTATGTA 

CCC CATTTG ATTCTGTCCTGTC CAGAG CTGG CTT CACTCAACACTG ACAATGTTGGGTG ACTGGTTG AC CCAAAAGTCTGA 

3 0 cn-AGAATTGTCTCTCTGTGGATTCTGTAGGCACTGTGGGTCAAGTACCGCC 

TG T CATGGGGACTT CCATCTGTCATCACT ATT AGTCTTTTTCTG CTG CTTTGTTTTGTTTCTGTG CAAACAAAAC AAAG ACT ATTC 
TGGAACrCACTCTGTAGACCAGCCAGGCTGTGAACTCAGAGATCTGCCTGCCTCT 
AATAACTTACTTGTCAGTTATTACTGACAAGGAAACCAATTTAGAGAGATCT 
AGCAGGGATACAAGAGCCAAAGGAACAGAAAGGACTCATGGACTCGTGACTGGT^ 

3 5 CCCTCTCATGTCTTGCTACATGGCTAGGAAGGGTTTCAAATCAATACAGCATTGGG 

TGTGTGACTCTGGGCAAGCTATTTGACCTCTGAGCCTCTTGCTTAGCTTCAGGGTCCT 

AAAAAAAAGTCCTATGTGTAGGATATGAAAGC^GAAATGGATAGTGTGTGGAGGTGGGGTGAGGATGN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

4 0 TCAGGACAGGGCAGGGTGGGAGTGGGGTGGGATGGGGACTGGGTGAGGTGAGATAGGGGTGAGGTAGGAATC 

TGGCATGGGGATGAGGTAGGGTGGGGTAAGAGTGAGGATAGGAGTGAGGTGGGGTGGGATGGAGTAGGGTG^ 
TGGGGTGGGAGATCAGCA 

MOUSE SEQUENCE - mRNA 

4 5 CCTGGGCCCCGCCGCGGACGCGCXX3AGCCGCCTGGGCCGCGCCGGAGGAGGGCG 

GAGCGCCAAGCCAAGCAGTGTTTGAAAGGAAGCAGGATGCTGATCTAATCGTC 
CATGTGGTGTATATAAAGTTTGTGATAGTTGGTGGAAATTTGGGAGCTT^ 
AGCGAAACTGACAGAGGAGAGGGAC!GGCAGCCTGAACCAGAGCTCTGGGTACCG^ 
CCAGCTTCGGCGTGACCTCCATCCCGAACTACAACAACTTCCACGCAGCTC 

5 0 TCCTCCTCTCACACTGGGACCCTACGCACCAGAGGAGGGAC^ 

GGAAGATGACCTGAGTTTTCACAAAGGAGAAAAATTTCAAATATTGAAC^ 
CAACCGGGGAAACTGGTTACATTCCCAGCAATTACGTGGCTCCAGTTG^ 

GG CCGCAAAGATGCTGAGAGACAGCTCCTGTCCTTTGGAAACCCAAG AGGTACCITT CTTATCCGCG AGAGCCAAACCACCAAAGG 
TGCCTACTCACTTTCCATCCGTGATTGGGATGATATGAAAG 

5 5 GATACTATATCACAACGCGGGCCCAGTTTGAAACACTTCAGCAACT 

AACTTAACTGTGGTTTCATCAAGTTGTACCCCACAAACTTCTGGATTGGCT 
TCTGGAGAAGAAGCTGGGGCAGGGGTGTTTCGCTGAAGTGTGGCTTC^ 
TTAAGCCAGGCACC^TGTCTCCGGAGTCCTTCCTGGAGGAGGCGCAGATCATGAAGAAGC^ 
TACGCGGTCGTGTCTGAGGAGCCCATTTAC^TCGTCACGGAGTACATGAGCA^ 

6 0 AGGAAGAGCTCTGAAGTTGCCAAACCITGTGGAC1ATGGCGGCACAGGTTG 

TCCACAGAGATCTGCGATCAGCAAACATTCTAGTGGGGAATGGACTAATTTGCAAGATTGCT 

G AAGACAATG AAT AC ACAG CAAGACAAGGTG CGAAG TTTCCCATTAAGTGG ACAG CCCC CGAAG CGG CCCTGT ATGG AAGGTTCAC 

AATCAAGTCTGACGTATGGTCTTTTGGAATCTTACTCACAGAGCTGGTCACCAAAGGAAG 

GGGAGGTGCTGGAGCAGGTGGAGAGAGGCTATAGGATGCCCTGCCCACAGGACre 

6 5 TGCTGGAAAAAGGATCCGGAAGAGCGCCCGACCTTCGAGTACTTGCAGGGCTTCCTGGAGGACT 

GTATCAGCCCGGTGAAAACCTGTGAGAGCCTGCGCTTCAGACGCCrCTTCCCGAGGCCTCCCT 
TCTGTAGCCAGCTGCCCCAGAGCAGGAGAACCGTCCAGGATCAGATTGCATGTGACT 
AATGACACTTGTCCCCCAGTCCGAACCTCCTCTGTGAACCATCTGAGACA 
ATO»TGTTATGTCAAAGGCCAAACCTClX3TTCAGTGTAAATAGCrGCT 

7 0 AAGAAAAAGCAAATCCTATGTGATTTTAACTCTGAT 

TCTTTGTCTAAAACAATAAAATTTTTTTTCATGTTTTAACAAAAA 

T ACAT ATGAT AT AT ATGTT AACATATGTTC CTGT AC AT ACACC AT GTGGGTG CT AC CATGGAGACTGG CCAG CGT AGG C CACAT AG 
CTACAGGACCGGAGTGGGGATTACTGCAGCGTGATCATGCAAGCTCA 
GTGGCTCATTTTTTGACTTCTACGAAGCATGACt^ 
7 5 GTGACTTTTTTCAGGTCCAGGGCCrAGTCACGAGTTTTA^ 
gagtgagcattttagcaaaaagcagccagcca^

gaatgctgaagggttcaggtgttttccttttatttaataaatcttgta^ 

gggcaacctaaacttttcrcatgcaactaaaattatgg^ 

tttt actgtcctgtactg ctaatg actgactctccg tgt caggagtg ctc cagtgaat aact atgcact acttt acatttcatggg 
5 ggtgcacaaaaaaaaaaaggtattacagtttttagttgct 

ctattctattgagttttaatactgactagcaattctgaagtot 
tttaaattgtttaactttcctaatttagtaattaaaaaga 

MOUSE SEQUENCE - CODING
10 atgggctgtgtggaatgtaaggataaagaagcagcgaaactg^ 

ctatggcacacaccccacccctcagcactaccccagcrre 

gccagggactcaccgtctttgggggtgtgaactcctcctct 

tttgtggcgctttatgactatgaagcacggacggaagatgacctgagtttt 

ggaaggagattggtgggaagcccgctccttgacaaccggggaaactggttacattcccagca^ 
1 5 tccaggcagaagagtggtactttggaaaacttggccgcaaagatgctga^ 

tttcttatccgcgagagccaaaccaccaaaggtgcctactrcact^ 

acattataaaatccgcaagcttgacaatggtggatactatatcacaacgcgggcccagtttg 

attactcagagaaagctgatggtttgtgttttaacttaactgtggttt 

2 0 gaatggaaatacaaaagtagccataaagacccttaagccaggcaccatc 

agaagctgaag catg acaag ctggtgcagctct acg cggtcgtgtctg agg ag cc cattt acat cgt cacgg act acatgagc aaa 
ggaagtttgcttgacttcttaaaagatggtgaa 

aggaatggcttaca.tcgagcgcatgaattatatccacagagatctgcgatcagcaaacat 
agattgctgactttggattggctcggttgattgaagacaatgaatac^ 

2 5 gccccogaagcggccctgtatggaaggttcacaatcaagtct 

aggaagagtgccatacccaggcatgaacaaccgggaggtgctggagcaggtggagagaggcta 

gcccx3atciccctgcacgagctcatgatccactgctggaaaaaggatccggaagagcgcc 

ctggaggactactttacggccacagagccccagtatcagcccx5gtgaaaacctgtga 

HUMAN SEQUENCE - GENOMIC

TCTCTCCCTCTTAATTTTTCAGTTEACAAAGTAT^ 

GATAATATTAAAGGGAACTTTCTTATAGAAAAATAAAGGTGATTTATTCTCATTATAAAAAAAG 

CTGAGAAAAAAGACACAGAGAGCAAAGAAAGAACAGTTGCCCATAGTGTTATCATCT 

TTCCAAAGAGTTTTCTAGGCATTTGTTATATAGTTGCAATCATGTGATC 

3 5 ATAATTTCAGTAAATGTAAGTTTTAATTTCTGCTTAACTTGTCTTTATGGG 

G CT AAACAACT AAC AAG TTTT CT AATTTTT CAAATGAT AAAT AAT ACTT C AT AG AACCTCTT CATGT C ATTTTTTT AAAAGG CG TG 
TCTTTAGCACACATTTACAGGTTAAAGGACATGCCACCCCCGACAC 

AGGCTGGAGTGCAGTGGCGCCATCTCAGCTGACTGCAAACCTCTGCTTTCCTGGTTCAAGCGAT^ 
TAGCTAGGTTTACAGGTGCCTGCCACCACACCC^TCTAATTTTTGTATTTTTAGTAG 

4 0 GGTCTTGAACTCCTGGCCTCAAGTGATCCACCCACTTCAGCCTCCCAAAGTGCTGGAATTACAGGTC 

TGTAAGAGAGTCATATTCATTTCAAGGAAACTTCATCACTATT^ 

T AGT AT CT CACT AATCT AAATTG T ATTT CT CTG ATT ACT ACCAAAGG CAAT ATTCTC C CAT ATGTTTCACAAATTGTTTTT CCTTT 

4 5 AGAGAGTTGAAAACATATCTCCACAGAAAAACTTGAGCATGAATGTTT^ 

CAAATGTCCATCAGTTCATGAATAGATAAACAACATCTGGTATATCCAT 
TACTAATACATGTTACATGAATCAACCTGGAGAACATTATACCAAG 
CTGTTGATATGAAATGTCAAGAATAGGCAAATCCATAGAGACAGAAATTA^ 
TGGGTAGCGATTGCTTAATGGGCATGGGTTTCTTTCTGGAGCAATGAAAATGTTCT 

5 0 TAAATGTACTAAAAACCATTGGACTGTGTGCTTCAAGAGGGTGAATTTTATGGTATATGAATTATATCT 

TAAAAATATTAACTACAAGGGGCTTTTCTATGGTCCAAAAAGGCTTGCGGG 
GCTCTAATTCTTTCTGAAACCTTTGGCATAGATGGGGTTAAAACATATCATCCTTGT^ 
CCTGAGATCCAGAGCAAAATGAATTTAAGAAGAAAATGTTCrTACT 
ATTTGGCCATTTAGGTTTGGGACTGGTTCCAGC1ATTCAGTTGACT 

GAGCAGCTGCCTCITAGCCTGGGGGCCATTTGAACA^ 

TGGCTTATTCTTAAAGTAA(2ATGACATTTTTAAGAGTGATTTATATTACTTTG 

TCCTCTTCCCCCTCCAGAGAACCTTTGCAAAGTCATGCGTAAATGATCGAA 

GGGTTTGATTTTTAAGTAGGCTCTGTTCTCCCATGATTGGATGTGGCT 

6 0 CTTCTCACCTCAGAGACTGTAATATTTTAATACTTGCATTTGGCGAGGGGCTCTCC 

TTATTTTGTGCTTGGTGGAGGCATTATATGTGTATCTCAGTTTATGCAGTAAATATGTTCCAGA 
CATATTTTAAGTGCAATAGGAAAGCTGATTTTTATACCAAGGAAACCAGTG 

ATGACT AGGAAAGATG AGG AAATTAAGGG CTG CTTC AGTGTCATCGG CCCAAAT AG TC ATT AAT ATG CCAAACCG CACGAAGG AAG 
ATGGTGACT CACTG CACCT AGTATGGTCCAGCATCTCTACAG AATCC ACTGTCTTGGGGG AATGG AATGTGCATT 

6 5 TGAGACCTAGTCCCTGAAATCAGTGCCTCCGTTAGTAATGTGTACTAGTCCATTCTCACACTGCT 

GGTAATTTATAAAGGAAAGAGGTTTAATTGACTTGCAGTTCCGCATTGCTGGGAAGGCCT 

GCAAAGGAG AAGCAGG CAC CTT CTT CACAGG GCGGCAGG ACGGAG T AAGTG CAAGCAGTGGAAATGC CAGATGCTTATAAAACCAA 

CCATCAGATCTCATGAGAACTCACTCACTATCACG%AGA^ 

TGCCCTTGACACATGGAGATTATAGGGATTACAATTTGAGGT 

7 0 GATATTGAGCTCTGTGGAACCAATTCTTCCCTATAGGATTTTTCCCT 

TGGCCAGGATTCCATGGTCAGGAAATAGCTCTAGTATAGAGACTCTGTGTGTTGGCCATGGAAT 

CATCTCTATGACTGACATAGCTGTGGACAAATTCCCCAACCCCTCTTTTC 

GTTTTAGAGTTGTTACTAGATTTTAATGGGACAGCATTTTGCAG 

TTGGT C71T G ACTGCTGAAATTGG AG CCATCATCCCCATGGTG CCCAT CCCACCAAAACATGT ATT AGTTTTCTC 
7 5 AAAGTACTACAAACTTAGTAGCrrAAAAC^ 
TCTCTGCTCAGCAAAGCCGAAATCAAATGTT AG CTGATT ATGTTGTTGTTTG AAGG CTTG ACT AGGGAGGAATCCATTTCCAGGTT
CATTCAGGTTCTTGACCGAGCTGTTCTCAGCTCTGACCACCCATC 

GAG AATCTCCCCCAC AATG AAGGCCT CT CTCACTTTTAATCTACITTTCCAGGAACTGG CAT AC CGTT CACGGGTTCACCTG ATT A 
GGTCAGGTCC^CCCAGGATGATCTCTO^AAGTCGGCTGTC 
5 TCCTAGGGATTTCC^GGGCACATACACCGGGAGTAGGAGTCCTGGGGGACAG 

AGGCACAT ATTTTCACTCCCTCCAGTTGGCAG CAAAGAGAGG ACTTTCC CACATTGGCAGAGTCCTG CATGCTTTTTTTCAT AGCT 
GGGCTGGGGCCATTGGCTGAACCCAGCATCAGTGCTGAAAGTGTTTC^ 

AGTGACnTTTTCrcnXSCrTaVGTACCTTGGGAAATGC^ CCCTG CTTTCTGTCCAGCCTCC 

CCAGTCCTCCCTGGCCCTrCTGTCCCGTCTGTCCCCAGAAGGC^ 
1 0 GCAGTATGCTACAGAGGAGAAAAGCCCCATTGTAAAGGATTGAAAGGTGCT 
CCCAACCTCATTCCTGCTGATGTGGTACTCAACTGTCTCT 
CITTCCrAGGCTTTCAATGGATTAACTCTCrCCACr 

TGD3AGGATCAAGAGCCCTCTGCCATGAATCTTCTATCCTGAAGTCAAATAAT^ 
CCTAATGCTATTCCTCCrcACTGAAAAAGTGTGTTTGCTT 
1 5 TGCAGTTG AT CTG CAGTT CAGCTTT AAATTCTTTTTTCAACTG CC CCCATCTCAGTTC TT 
CAGTTCCCTTCCTTGTTCTTATACTTCCATTATGA 
CACTCTCCAAAGTCTGCTGGAGGCrGTCAGAAGCAGGGATTATGCCT 

AGTGTATGTGTCTCAAGAGAAAGTAATTTGCTTGTGCAGAAGACT 
2 0 ACACCTGAAATCTTATCACACGTCCTGCAAGCITATA 
AATGTGTCTGTGTTTAATGTGCATCTTGTAGACT^ 
ATCTTGGATCGAAGCACCCTGTGCCTGTGGTAGTAAAGGATTGAA 
TGTGGAGAAGCATATCTCCCATAGCATGGCAGCAGAAAGGGCTGTGATCT 
CGGCGTTCCTTGAATCCTGTGTCCX^CAATTCTTCCCAGGTTGCTT 

2 5 AAATTTTAAATAGATCAAGTTTTATTTGCTTAGTAGAAAAAGGGCCCCT 

CTGTAATCCAGACACCTTGATC^CAGGCACAGGATCCAATTGGTGAAAATGCCC^ 
GAAGACTATAGCCATGAGTTCATGGCGTGTGGTATTTTCCTGATCAGAGTAACT 
GTGTTTCCCGGTGCCCCAGAGATCTATAGCAAGTGAATTAGTACATTCTATGTCAC 
ACAGCACATAACTCTGCATAATAAAGGATACTTTTATTTT^^ 

3 0 ACTGAATACAG CTCATGT AG CGTCTTCTGACTCTTCTGTCACG AAG CAGTTAGTCTGACTG CCCCACGGGGGAGAAAGAAGACCAC 

AAGTTCCTACATAAATATTTTCATTGTGATTAACTTTTAAT^ 

GTC^TTGAAGAGAAAAGAGTAAGCAAGATCTGTATAGAAACATTACTTAACTCAGAATGTCAAATGCGTGAAGTGGTGATGGTGCT 

GTCrCTTCTATAAAAGAGGATTGTTTTCrCCAAAATGATTTTG^ 

TACTGTGTGCCAGGCACTGTTCTGGCCACGCTU^ATGCAGCAGTTAAAAAAA 

3 5 riTCrrCTCTATTACTTCATAAAATTATAGATAAGC^ 

ATTATlTTl w rCrTAGTTTGGTTAAATGTAGGATCCTGTTTTCTTC^ 
AAGAGTGGTTTGGAATGAGTGTTCTGATTACTTCCAGGTTCTGTTTCT 
CTTCATGTCACTCITTAAAACTGCCTGCTTCrACCTAACATGCCT 
TCCATGTAAGGGAAACAGCTGTTAAACTGCTTCAAAGATGTAGATATAAATTC^ 

4 0 GACACTATTAACATTTGGGGCCAGGTAACTCITGTTTGTAGGAGG 

CTCTACCCAGTAGGTATCAGGAGTATCCCTCCCCTCTGAGTGTGACAACCC^ 
CAGGGGAGTGGTAAAGTTACCCCCAGC!TGAGAACCACrcCTGTAAGTGATATG 

CAACITTCCACCACGAAAGAGTTAATATTATATG 

4 5 TTAGTAC^TCATGCCTTGGAAATAGCTTGCATTTCCTATTGGAAACTTGTTA 

GTTGG^CACGGTATACTGTAAGTTAATCTCAAATGATTTAGGAGTAA^ 
CGTCTCTTTTCCAAACATTTTCATTCTCAAGAACTGACATTCAA 
AGAACAGTCTGATACTGCTCCCATCACCCAGTCCAGCAGAAAAGCAGTGCTCAGA 
CACCTCCTGCATTGGTTTGCAAGATAGTATAGAAAAGCATGCCTTCTCCTTTC 

5 0 TGCCCTCCCCAGTGAATAAATGTTATTACACATGATGGCATGTGTTTCTTATGTGATTCTT^ 

TTTTGGCAGGGACCTCCACTGGCCTGTTCTACCrGACCCCT^ 
CAACTTGTATTTCTGAATCTTCTTCACTTCTCACACAC^^ 

GGAGGGGCCTGTCACGGTGGTTCACC^CTGTAATCCC^GTACTTTGGGAGGCCGAGGCGGGCGGATCACCTGAGGTCAGGAGT^ 
AGACCAGCCTGACCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAA^ 

5 5 CAGCTACTTGTGAGGCTGAGGCAGAGGTTGTAGTGAGCCAAGATCGCTCCACTG 

TCTCAAAGTC^CAGGAGGCATCGGCTTTAATACAGAGAGGGTAAAAGAGAAACATAATAGGCTACAGAAT 

GCTATTTATTGTTCCAGGCACIX^CAATGTAGCTGTGATG AAAACAGACAGGGTCCCG CTTT CCTGG AACTTGCAGTCCATTGGGC 
AATGAGACAAAT AAACAGG CCCTGTTTTAATATG^CGCOU^GGGTGCCAGAAGG AGGG AGATGAGGATG CCATGGGAGCACTTCAG 
AAAGCACAACCCAGCCrTGGAAGGGCAQSGCAGTCGGAGAACGCCTTGCAGAGGAAGCGGTGCCTTA 

6 0 TCA(^TATCAAAGAGGAAGGGGAAAAAGTGTTCAAGGTTATGTTCCAAGGCCCGAAGCG^ 

TGAGAAGTTGGGCGTGTTTGAATAGGAGAGCCGAGACAGGAAGAGTGGGAAGTAAACAAGGAGGAGGAAATG 
GATGCTGACTGGAGCAACTGGGACACTGGTGGTCCCATTTACTTAGATGAGG 
GGCAACIAACrCGAGTTTTTAATCCGCCACTAACTCTGTAACATGGGA^ 
AATGGAAATTTGAAAAGTTAATTAGCAGCTTTAAGTGATTTCATTACrrGTAGACTAT 

6 5 (HTAGGTTCTG CCGATTTT ATTTTT CAT CCAGT AACT AGTT CTT AG TTTTGAT ACTT AG CAT CATCTT AGTG CTT AAAATTG AC CA 

GTTTTGTGCCTGAAGATGTGGTACCCATGAGTGAAGCAGTCTAGTGACTTGATTGAAGGTTGTATTCT 
GTGCCCTCTTCCTTAAATTTCCTGTGCCCTGATGGAAGTTTTTATTC^ 

TTTTTTAG ATGGCT AAAGG CAG AATTGGT AAAT ATGATTGGACCG TCTG T AGG CAGG TGT ATG CTGTTACAG AATTTT ACGAGAT A 
CAGGCCTTATTCGAGGAAAGTTGCTGCATGGTGAACATTGTGCT 

7 0 GATTGTGGGGTCAAGGAACTGAACTTCCTCCTAACCATCTCCCCAGCC^CTGCCTTTCCTCA 

CCAACAGTGAGCCTTGTCAATTACTCCTCGTTTACTGTAGAGCAACCCA^ 

TTGTCTTTGGCAGGAATTTAGATAATGGGCTGTGTGCAATGTAAGGAT^ 

CCTGAACCAGAGCTCTGGXSTACCGCTATGGCACAGACCCCJl^ 

ACAACAACTTCCACGCAGCCGGGGGCCAAGGACT C^CCGTCTTTGG AGG TG TG AACTCTTCG T CT CATACGGGGACCTTG CGT ACG 
7 5 AGAGGAGGAACAC^TAAGCCCCCAAAGTCACAGAGTGCAGAGAGGGGGCAGCC^ 
TTC C CATC ACTTCC CTTTG ACT AGGGGTTTCACATGTAAG TTT CT AAT CTGTCT CCTCT ACT AGATGAGGGG TTG TAAG AAAAG CC
AGAGTTTGATGTACAAACAAAATGCTTGGCTACAGGGTTCC^^ 

TGAGCTACAACATTCTACTG AGGCACTGT AGGTCAATGACTCT C CTCACCTGGCTGGTTGG CACAG AAG AACTAGGGCAGCCT 
TTTCAAAACACTGGATGCCTGAGCCATGTCCCAGTCTrTTGAATTAG 
5 TCTAGATATTAATGTACTCCCTCTrCCCACTCACrrrcrrCCTAATCCT 

CTTGT CTTTC AT AC CACAAGG CT AT ATT CTG CTG CCATGTC CTGACTT AG CT AAT ACTTTGT AAAAT ATCTGG CGT ACGT 

TCTCAGACATATGTTTTCCGAAATATTTCTAAGTTATGTCATGCATCCCTA 
TAACTAGATATTTCACATTATTTGTTCCAGAATTATTTTGTTTAT^ 
GCAGCAGATGTCrTATATAAAAGTTTGAAAGAATCCAGTATCTCTCAGCCCA^ 
1 0 GCACTACTAAGGCATTAGCAAGTTTTTTTTCCCTCTGCACCCAACAC^ 

CTGAGCCTTTGGACATTATCTACTTCTCTTCTGCTAGCATAGCACACTCTGCCCAGT^^ 
TGTGGTGACTCTAGCTTTCCTTTTAACCCAGGCCTTGCATGGGT^ 

CAAAGGTG C CTG AAGAG ACACAG ACAGGCAGTCT AAGG C CATCT GT CTT CTGAT GATG CAGGAGAAAAAAAGAGG CAAACACAACC 
CCCATCCTTGTGCCTACCGTGAGAAGCATAAGGGCTTTGCCAGCCCAG 
1 5 TTCATTCACGAGTGCTGGATATGGACCCCTGGGTCTGCAGGTATCATTCATTA^ 
CTGGGTCTCCACCGTAGTGAGGAGGAGTCTTTGTTTAAGAGGCTACCT 
CCAAATGTTGGCAGATTTTTCTGCCTTGAGTTATTTGGTTT^ 

TAAGGCCAAGAACAACGTCTGTGTTTCTCTGTAACCTCCTTTGATTCTCCCCTCAGCTCCTGGAAC^ 

2 0 TCATTTTTAAATTGTTTTTAAAGAAGGGAATCTGCTC^TG 
TGTTTTAAGCCAATCTCTTTTCTTACCTTTCAAAGAAATCT 

AGACCTCTGAATGCTAACTCTCCTGAGGAAAGTCCCACTTGGGATGGGGCAGAAAGAC^ 

ATAGTG CCATG ACACATTTCTTCCCTTGGAACTACCTACACAGCA^ CAGCTTGGTAAAAAAGGTGGACT CTGGC CGGGTGC 

AGTAGCTCATGCCTCTAATCCCAGCACTTTGGAAGGCTGAGGTGGGTGGATC^^ 

2 5 ACATC^CGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCAGGCGT 

GCTGAGGOVGGAGAAGTGCTT AAACCCGGGAGGCGGAGG CTCCAGTG AGCCGAGATC^ CTTGGGCAACAA 
GAGTGAATCTCCATCTCAAAAG AAAAAAAAAAAAGTGGG CTCCCTGTCATTTCTGT AGGTCATAGCAGG CACTCTG AAGTGCAGAG 
AGACCAGAATGAAAAGGCAGACTGCAGGGAACTACTATTAGTTATCAACTGCT 

AAAa^AAACACATTTAArrrcrrrrrrnTTCAATT^ 

3 0 AACAGTCTGATCAAACATACAGCTCAAAAATGTCTTTTGACGTTTAACAA 

ATGGGGTAAAT AAGTT AC CACTG GCAAG TCTG TGG CACT AGTAAAACAAAAATAAAAAATT AAAT CT CGG C CAGG CG CAG TG GCTT 
ACACCTCTAATCCCAGCACTTTGGGAGGCTGAGGCAGGCAGATCACGATGTCAGGAGTC 
ACCCCGTCTCTACTAAAAATACAAAAAATTAGCTGGGC^TGGTGGCAGGCACCTGTAATCT 
AGAATCACTTGAACCCGGGAGGCGGAGGrTGCAGTGAGCTGAGATTATGCCACTGCACTCCAGCCTGGG 

3 5 CATCTCAAAAAAAAAAAAAAAAAATT AACT CT CTTGATCAT AT AG AT AT CT CT AT AT CT CTTTTTTT CAAT CGGG TACAAAAG AT C 

TTTCTTCATAA'riTTTTTTATAATTTAATGGCTATCTACT 

TCCCAAATCTTACCCAGACGAGTATGGCACITAGTTAGACATTTCTGGCAACAGATTTCC 

TTCTGCCTTAACTAATGCAGAAGTATCAACTAGTGGGAGACCCAGGTTTATAACTGTACTAGGTACACAGAGGGTGCT 
CCTGGGCTTTCACAAAGCAGTAATATTCCAC<X5AGCATAGAACCAGTGATACCGTGAC 

4 0 ATCITATGACTATGTAATTTTCACTAGAATCTACACITCTCAGAGC^GCCAGGA 

TCATCAAATACAAGCATAAAATGTAAGAACTAAAAAATACTCAAAAGAAAT 
TGGGCTGTGGAAAC^GGCAGGGCCAGCAACCTGTCCCAGCACAAAAGAA 
GACCAGAC^GTACACAGTTTTGGAAGGCAACTTAATTTACCATTGCTTTTAAGACA 
ATAACACTGTATTATCTTATATGTACTAAAGTAGAAATAACTGTGTATTTAT 

4 5 CTTT ACACTTG AACACTTGTAGTG ATG ACAT AAAGTGTGGCAC C G CTTGG AT CC AAAT CTTCCTT CATGCTTTTTTG TT T ATGTG T

GTCCATACATCAATTAAAATTATAAAACCTAAATTTACAGTTTCTGTGGGTCAGGAATCT 
ATCTCTCACAGGCTGTAATCAAGGTACX3TTGGCTGGGCTATAGTCATCT 

GTAGTGCTGGCAGGATTCGGTGCCTCGCTGGCTGTTGGCCAGAGGC CACCCTCAGTTCTTGGTCACATGGGCCTCAC CAG CATGGC 
GCCTTGCTTCATCAAAGTACACAAGCTGAAGTGGCAATAGAGAGAGTCTGCTAGAAA 

5 0 TCACTTAAGGGACTGCACATTCTGTTGAGTAGAAGCAAGTCG 

AATACTAGGAGGCAGGAGTCACTGGGGACTATGCTAGAATTATGCCTGTTGGGTTGGGGACAGT 

ACTTTGGG AGG CCAAGC CAGGGGG AT CACTTG CACCCAGGAATT CG AG AC CAG C CTGGGC AACAT AG TG AG AC CC CCGTCT CT ATT 
TGAAAAAAAAAAAAGAATTATGCCTAATTATATGGAGGAAATCCTAGGCCAAATGACACTGGATATTCTCTGCTGCT 

5 5 TGTGAGGAGAGGTGAGTTCAAC^TAGGGGGAGGACAAGGGAGAGGGTTCCATTTTCAGCAGTGG 

AATGAACTGCAGATGCCAAGGGCAGTACATCCTCTAATTC^GGTGAATATTACAGAAACGGTCTTCT 
AGGAGGCATTTCCTATTACGTTCTATGTTACTGTTTTACCAGGAGAAAGGGTGGTGCATTTTACCCAGA 

AGTGGAGGAGAAG ATACGACC^GCAAAGGATGTCAGrrAACTGCTGC AGAGCCCTG CAAACCTGAACCCTG ACGCTGACG CTGCCCT 
TTTGTTTGGGATGCAATTTTAATATTTCCCTGCTCCTCAGCCCTCATTAAAAGCTATTACAGAAAGAAGCCCT 

6 0 TCTTTCAGATACAGTAAGGCTTCCCCTACAAAGACCAAGGCATAAACACAGCCCT 

ACTTCATCATGTTATTAGTGCTGGCTAGATGTAGATTGTGAGAAT 

CTTGTGTAATAATCTCCCTTCTTGTGAATTTCGTTTCGGCAGGAGTGACACTCTTTGTGG 

GAAGATGACCTGAGTTTTCACAAAGGAGAAAAATTTCAAATATTGAACAGCT 

CTTT CAGGTTGGGAAGGGGTCTTGCATTTCTGGTCTGCATCTCTT CAG 

6 5 GGTTTACAACAATATACTTTGAGAAAAAGAAGAGTCAGTGACACAGTGCAGTATTTTATGGC^ 

G AG AGTACCAAAAAT ACT AG CACACT CCATGTTT ATTGGTTTTGGTTT AGGTTT AAAG GTTG T AAAATCACCAG CAT CG CTCTC C A 
CAGCCCTGAGTGTTTCTACAAACCTACAGTTTCAAGTGAT^ 

TTTTCTTCACATTTCAGATGTTACACTAGCTAAGCCCTCACTTGAGGCTTGCAACAAGT^ 
TCTTTGGTGAAGCCTCTGGAGGCGTTTCCCTATAACAAAAAT^ 

7 0 CCCACTCTTGAATGAATATG CTGCATGGTTGTTT AATGTTGGCCAGG AGGAACAACAGTTAAACAAG CAGTGGGGAG ATTTCAAGA 

GTGAGGATCACTCACTTTTCAAATCCTACATTCACTACCTCCATAAGAW 
TACTGCCTCTATTAGATTGAGGCAGTGTATGTGGATGAGGCCCTTCAGAAAAAAATCTAGGT^ 
GGTTCCACATCAGCAAGCCAGGCATCTCCAGCGTCCCTATGAGTGCTCCCGCTGGTTGGGAAAGAATGCC 
GCACACCAGGAAGTTCTTCTATGCCACCAAGTCTGGTTGCTTCTCCTCTGCTCTCT 
7 5 ACTTGGAGCC^GGCTCCTa*GCTTTCCCA« 
A£AGGAGTGCTGATTGCCACAGGGACCTCCT^

G CT CACAGAACCATG CAGTG AGAAT ATG CCG CCTTT CAAC AG C AAGAACTG AAT ACTG CGAT AAAGGGC C^UVCGACTCTCTTCG AG 

ACATCAGAAGGCACAGTCTACATAGCCACAGGGTTTTGAAACATATATTAGGATTGTC^ 

TTTTTAACTTTTTGTTGGAACACATGCAGAAAAG 
5 TACTGCCGCTCAACTGTACACTTAAAAATGATTAAAATGATAAATTTTATGTTGCCTATA 

CAGTTTGCTG AATTTTCACAAGCTGAACATGCCTGTGTGGCCACCAG CCAGGTCAAGAAGAGTGTTACCTG CCCCCCAGGAGTCCT 

CCTCCTGTAGCCACAGGAAGTTTTTAAAAAATAATAAACAGCTACCATTCATTGAAGGCT 

TACAAACATTCTCATGCATTCTTCCACTACTGTCTGAGGTATGCATCGACGTCATCGTTTGATATCT 

TTAGGAAACTGCCCCAGAGCACAGAGCTGGCAGGTGGCAGAGCCCAGGGCGA 
1 0 TTGATCTGGGAAAGGCCCAGATACTTTATCAAAGGGGATGGGAACCTTCAAGTGACA 

GGAGGAGGGGATTGGGCTTTGATCTCTCTGGGTATACCTCCTTGCTGTTATT^ 

TGAAATCCTCCGCCCCTTAGCCTCCCCTACATGCCAAGGTTGATTAGAAATTTCAAGACT 

CAAGGAATAGTAAGAGTGCAGACCCAGCTCTCAGGAGGCITTGTCCACACTGGACGTGCGTCTGTGCATTAC 

GAAAGTCAGTCCCTGCCAGTGTCAGTCCCATGGCTTCTTCCTTTGTTAATCT 
1 5 TAGAGAAACACTTAAGGAGGATCCAATCCGTCTCAATCTCTATTTTGTGACTTCCATGCACG^ 

TGAGCCTCCAACCCIGTTCITGTAGAGGAAAATATTACAAAGGTTCT 

ATATCAGTGGAATTTTAAAATTCCCTTTCTCAAGGTTATAGAAAAGAAAAATAAGGTTTGTG 
AGGGTAATAACAGGGAAGAGAAAATCGTGAAGGAAAAACAAAGCAATAATACAGAAGCAGATGGCACACG^ 

2 0 GTGTGTCCTCCCACTCCCTTCCCTGGTGGAGCCCTGGTGGCGCTGGAGGCCACAGG^ 

TGGGATGTGCATGTATGTATGTACCCAGAGGCGGTGTTCCTGCTACCCAGCAAGGGGCAAAGGTGCT^ 

GCTCAGTGTCATGTATTAGGACATTATTAAATTTGAACCCCTTTTGCAAAGTATTTAAGTTCAT^ 

ATTTCAAGACTAGTCGTTTACCATCTACGAACAGTGACTCTTTAAAATGTAATACATGTTTTAC^ 

GCTGAGGGTAGGTTTGTTGrrrrr rr i 1 TCCTATTGTG ATAAAAAGAAATCCTGGTTTAGAGCCCCACCCAAG CATTTG ATAAAAT 

2 5 TGTTACCGACAT AAAATG CTACACT ACTG AG CAG ATCAAGGCTGACTTCTGTTGG CAAAT AACT ACTCC CTGG CTC C AAGG CTCTA 

GGGCTCCCAGCTCCGGGGCCCACAAGCCTACGAGCAACTTTC 

GCACAGAATCTTTTAGCAACTCTTTAAATCAGAGGAACAGAGACAG 

GCTTTAAATCATATATTTGAATTAGAGCAAGATAACAAAGAATATCTCAC 

TTT AAAGAAAT ACAG CTTTAACAATG CT AATTACCCACAGT AACTGAAG AAAT AAAAGAT AACAT AGT AGAG ATCCTATTTT CCTA 

3 0 AATCAACATGAATATAGAGTATATCTATTTATAGGTGAATTGGTGCT 

CGGCTCrrCAATGAATATTTTGAGTCTGTACCAGCCTCTCTAGGCTCAACATGTGAGG 

TCACTTTATGT CT CCTTTGATGAAGAAGATGACCCT AG AAAGCCTAGAAAAAGAGAAATCAAGGATTTTATC^^ 

CTG AAAACCCTTGTG AG ATT AAT AGGGT CAAGTGAG AAAAT CATGAATTCCTGATT C^TGACTT CAGAATGAGCTGATAATTGCAT 

TCTGATCTGTGCTACACTATTTAGGTACCTAGACCAAAATGAGATAGTTAATGCT 

3 5 C AGATCACCTG AATAAGG CTGGGCTTGTCCCy\ACCTXjC AGCAAAATX3TATTGGCTTCCCGCCC CTATTGACAGT CTCAGATACCCT 

AAAATTGATGCTAGCCATTTTTTTCTTAATTTAA 

CTCTGCTACAAGCACTTAATTTTTTCCTCGCCATTTTGGAGCATTTCCC^ 

TATTGTTATTTTTAATGATGAAATGTCTCCATCrA 

TCTCCAAAGCCATTTTCCTTAAACACATACAGTGTTTAAAGAAAA 

4 0 CTGTATGTATTCCTTTTGGAAGGCATAGAAATGAGACC!AGTACAAACT 

AGGGGTGCTTACAATAGGGGAGCAGCACAGTGCAGATCCCACCCAGT^ 

GGGGTTCATATATACTGTACCTATATTCTTCAGTATGTATAAATATTTCCTAACAAAGATCTAGACATGAACT 

TGAATAAATCTAGACATGAACTAAATTTTATGAATAAAGAACT^ 

TAAAAAGGAGAGATTAGGAAACTAAATTAAAGCATAAGGTTTTAAAAA 

4 5 GTT ACTTATGC CACCAAACATTTTCCCACAGCAGCACCCCCAGCACATG CTGGCTGGTCACCITTGTTCTGTTG CTCTATT AAATC 

CTATCTGGAGGCGTTGCTCACTGCTCACCrrGCTCAGAGAGTCCAGCT^ 
TGGAGGCTGATTATTCTCTACCTGATTAAATCTGTCTGGGGAGCATCTTTACCATGGC 
TGGCTGAAAGCTTTGACACAGAACTCAGCCAAAAGAACGATTTTTTT^ 
AAAAAAGGAGGAAAAGAAGGTCCTGAGCCCCAGGAGATCAGATTT 

5 0 CTGCAGGCTGGTGGAAGGTTCTGC^GATACTCAGTGGATGTCAGAGCCGCAGCGCCA 

AGAGGTACACACTCCATTCCTGCTCTTCATCACAGGTAGCCCTGTTTGA^ 
CTGGCTTTAAGATGAGAGTAGCTACAAGATCCCCTTTCACCTCCTCTGAGAGCTCCCCAGA 
CATGGAGCCTAAGGCAGTACAAAGTGGCGTGACTAAAAATCTCACTCCCTTGCCGAG 
C^GCCATTCCC^TCCTC^CCCCTTGCCC^TGAGCCTCAGGCCAGTC 

5 5 TGGGGAAGGTGATCCTACTCTAGCATCATTCTTlCC^ 

GAGCATGCCTTCATCCCTCATCAGACCTCCTAACCTCCCAATGTTAAGATGTTAACATT^ 
AGAGGGGTGCTTACAATAGGGAAGCAGTACTGTGCAGCTCATATTGACCTCAGACCT 
GCAGCTAACATGTTCTGCTTAATTAGGCTGAGGCTGGAT^ 
CCCTACCAAGAGAGACTTCTCATGGCCAGGATTCCACATTCTTAATT^ 

6 0 GCTGACTCGAACTGGAACCTTCTGACCTTCACAGGGGGACTATGTTGG 

CAGGATGATT'AGGTGACAGTTATACACITCAGAC 
CTTTTTTATTTACAGGGAAGGAGATTGGTGGGAAGCCCGCTCCTTG^ 
CTCCAGTTGACTCTATCCAGGCAGAAGAGTATGTTTC^TTTTGTTTTCA 
AAATGCCATCAGCATTGGGCAGGAGGGAGAAGAAATTCACAGCAGTC 

6 5 ACCCTGTGTAATACTTGCACTATTATTAACTTAATGTTTCTCTTGAAGCCT 

CCCTAAATGAAAAGCGAGTAAAACTTGTGATAGATTAA 

AACCCAAGTTTCTAAGCCTOCAGTCCACTCCCTCTGTTGGGAAGAGAGGGGA^ 

TTTTTCTCCTAAGTGTGATCAGAAATATTAAAAGAGTGCTGAAAA 

GTCCCCCCAACTTGGGAAATACTACAGTGATTTGGAACCTCAGTTGGCTTCT^ 

7 0 CAACT ATTCAACTGGT AG CAGTTT AGGAAGGTG TTCAGT AGG CTG AAGGG ATCT AAAAGGG AT AACCTTTCT CC CTT CTTCTG ACC 

CATXjAGGATTAACAGGCACTCCCAAGGGGAAGAACTGCCAGTGGCTGTTTC 
TCCCAAATGCAGGAACTTGGCCTCCTCCTGACACCTTGGGTCTTGCCTCT 
CAACCTTTGGCTCTGCTATTTAATTTCACCAT^ 
CAGTTGGATTGTTCTATTCTTTAATTCCTTTGTATTTCCTCTA^ 
75 TTAGTTGTTTCTITCTCCTAGTGACTGTGTO 
CAAACATCTTTG TGTTCTTTCAGAAT AAAAAGT AC CACCAAAATGT AG CATT AC CCTG ATTT AACTGG GAATTTGTCAG CT AT AT A
TTTAAGTTATTTTTATAATGGTCAACTTTITAAA 
TATGCAATTAACTATCATTATCTTAAAATACTATTGTTTT^ 
ACGGCATTCTAATTGC^TTATTGGTACAAACCCAGAAAGCCATTTCT 
5 CTAGATTAAGAATTGTTGCACATTTTC^TGTTTCTA 

AATTCACAAAAGAATTTGACAGTGAACTCATt^TAC^ 

GCTGCraVTTCAGCACCTAGTGCTGAAATAGTA^ 
AAATAACAAAAGTCTTGAGGTGACTTAATGTGATGCCTCCTAAAGTG^ 
1 0 AAATATCTGCAACACCAAGTCATTTTAAGGGCATGCTTT^ 

TTGTGCCAAAGAAAAGATCTACAGGGTCC^CCXSTGATGGCTCCCTCCT 
CTTGLAGCCGAGGAGTITGAGACCATGCTG 

TCGGCTCACTGCAACCTCCACATCCCTGGTTCAAGCAATTCTCGTGCCCC^ 

CCATGCCCAGCTAATTTTTGTATTTTCACTAGAGT^ 
1 5 ATCTCCCTGCCTCAGCCTCCCIAAAGTCCTGGAATTACAATCATG 

GCAGGTGTGGTGATGTGCACCTTTAGTCCCAGGAGGCT^^ 

TATGATTATGCCACTGCACTCCAGCCTGGGTGACAGAGTC 

GAACACAGATCACAATCCAGGTGCTTGGTAAATAAATGCCT^ 

CAAGTAAGTACTCATGGGCGTTTTTCTCAGAATTCTTATAGGGAGGAAGT^ 
2 0 ATCTAATCTGACITTTITTAATGCTTGATTATCTTC^T^ 

CAGTGGCGCAATCCTGGCTCACTGCAACCTCCACCT^C 
CAGGTGCATGCCACCACACCCAGCTAATTTTTTGTATTTT^ 

CCTG ACTTCATGATC CACCCAC CTTGG CCT CC CAAAGTG CT AGG ATT ACAGG CATG AGC CACCATG C CTGGCG TTTGTTTTi.'TTTG 

2 5 TTTTGTTTTGTTTTGAGACACGGCITTGCTCTGTCACCCAGG 

CCCAGGTCCAAGTt^TTCTCCTGCCTCAGCCTCCCGAGTACCT 
TTTTAGTAGAGCTC-GGGTTTCACCATGTTGGCCAC^ 
AGTGCTGGGATTACAGGTATGAGCCACCATGCCCAGACTAGAGTCTT^ 
CAAATCAAGGTCTTAGATTTCTGGTTTTTTGCATACAGGTGGCAGTTT^ 

3 0 AAACAACCCACAACTCAAATCTGATTTTTTTTAAAGAGTGAAAGAGATGAAT^ 

TGAATTAAGCTAGAAAAAAAATACATCCAGGGAAAGGTTAAGGACTAAATTGGAGGAGA 
TATGCTGTTGCATTTTGTAAATGACTATTTCAAAATATTTTCTT^ 

GAAAAGATG CTG AG CX3ACAGCT ATTGTCCTTTGGAAACCCAAGAGGT ACCTTTCTTATCCG CG AGAGTG AAAC CACCAAAGGT AAG 
ATAACGAGTTGAAGAAGTTGGCTTGTCATTCAAATCTGTGGATTAGTCAGAGGGGATGAA^ 

3 5 TTAAAATGGCACATACCCTrcCTGCCTTGTGAGTGGA 

AAAGAACTACAGAGGCCCCAAATCAAGACCTTCCTCCTGAAACGGGCTGGATGCAGCTACACTGCT 

AATTTTCAGAAGTCCTTTAGAATTTTGATTTTCTGAAATCTTAGTTCCTCAGGTGCCAGTCA 

TGTTTTTTAAATAACTGAGGACTTCACTTCTCTTCCTCCAACT 

GGAGACAGGTCAAGGAAGAAAAGAGTAGTGGCCTCTAGCTTAATTTGCCAGATTATCCATTCCTGTAA 

4 0 AGCTTGAAGCTCACTTCTAGAGAAACAAGCCTTAGATTTCTTTTGCACATGCTGCTGCTACCT 

AGACAAGGGAAGCAAGCTCAGCCATTCATCGTGGTCCTTCTTTACCCTTGGGGGAA^ 
TATGCAACAAGGGGTACCGAACAGTGTGTTTAAGAAAAATGTCCAGG^ 
TATATTCTGACATTTGCAGAACATCAAGAAGCATCHTCTTCCTGTTACTG 
ATCCTTGGTAATTATTAATCAGAGTCAAGAAAATGAGGCCTAAAAAGCAAGCT 

4 5 CCACGTTTACCAGGTGCCTATTCACTTTCTATCCGTGATTGGGATGATATGAAAGGAGAC 

ACTTGACAATGGTGGATACTACATTACGACCCGGGCCCAGTCT 
ATTCTAACTACCTACTAACCATTTC^ATGCTGGAGGGAGTGGATAGGCCCAGAGCAG 
ACAAAATCACTTATAGAGTATGTTTTCTTTGCCAGCCCTTAATGGTAGGGCTTGGCTG 
TTGTTATTATTAGTTCCACATGTTTGTCCGAAGATTCTATAATATT^ 

5 0 GTTGTTTGTTTTTTCATCTTCAAGTTGTCTTTTCCCATTCCC 

GGGACTCCTACAAGGAATATTGTGTTTCCACTATCACriTTACCCTCAGC^ 

ATGCAAGCCATTTATCATGTTCCCCACCAATACTGCTGTTTCCTAACTAGGCTTAGACTTACA 

TATGTCTATTGTGTTAAATTAAACTTTTATAATGTACATCAGTT^ 

G AAAACAT AT AATT ATT AG ATATG CAGTTT AT ACATTTT AT CAAAGACTG GTTTT ATGTG CAATTG TT AAAATG ATTG CTTTCTTT 
55 CCTAGATTAGCAAGTATTATATGCCACTATATAATACATGTTCACATGTGCCTATAGATATAGGTGCAaVTGTATAG^ 
TATATCCrrATTAAGCCTAGAGATATTTTTTTGTGTGACCCATTTCTCCATTTTCTATTC 

AATAGT AT ATG ATTT ACCT CT ACT ATTG CTCTGTG CACAAT CATTGAAATATGT ATG TAT AATTTTTGGCATG AAAAAG CAATCTT 

TCGAAAATGACAATAAGAAATGTCTGTTGATCCCTTGCACTAATGAACAATAGAAAATATTA^ 

TATGGCATTGGCTAACTCGGTTTATGTGGATTCTAATGAAACATTTGGGAGAAAT 

6 0 ATACTCAAAATGCCTGCATGTTGTAAGGATTTTTCAAATGCTG 

TTTGGATCATTTGAAAATACTGTGTTCGAAAGGGC^CCTTAATAACAT^ 
CACTTTTTTTTAATCTCGGGGAAAAAAAATCCATCrrCACGTTAGACT^ 

AC ACAAT AAAATG CCTTTGAAATG CTGACAC AT AAAT AATTT CCT ACCT AAACAT CTCAG AG C CTG ATTCCATCAAAAAG AAAGG A 
GTAAAAAGCAAGTTACAGCCCAGCAGCACATCTGCTTTCCCTGGGTCCGGGG^ 

6 5 ACAAGGCAC^CTCTGTCAGCTTTTGTCACTCCCTGATTTCnTATTCITTC 

TTAAGTTCCCCTTAGCACTGCCAGATTCTAAAAGGTTATATTCTTTTTAAAAAAG^ 

AAGAATAAAAACCTCCGAGTGTTAACTACTTTTTCCTTTCTTCTTTTT^ 

TGAATTTTAC^TTCCATTCTTTCrCCTTTCAAACCAGACACCT 

TAATGGAAAATATCCAAGAAGC^GCCCAGGAGTATCTGACATGGTGGAATGGAATCAGTTAGAAAGCGAAG 

7 0 GTTACTTCTTTTTTTCCCCACCAGTTATAATCTTCAACCTTACT 

AAGTT AT AATTTT AAGT ATAGT CAT ATAG AG AG AT CC CT AAT C C C CTGGG T AAT CT AGAT ACT AAAGG TGGGG AAG AACAGTCAT A 

TTGACATTCTTTAATCCAAAACCACTGTTTGAAATTAGTAAGGATATTTTC^ 

TGAAAACGTTTTTCTTCCTCAGTGAGTACAGAAACCAAAGCAGTCT 

ATGGAGTAGTCTAAATTTAAAACGTCCTCTCTTCTACCTCCAATGAAAATGTTTCCGTGTC 
7 5 GTGGTCGTCTG CTGGTGT AGCGCTGTTTAAGG AGCGCTGTGTGCTG CT AGTGTTCCACGATGTGTGTGGTCGTCTGCTGGTGT AGT 
AGCACTGTTTGAGGAGCACTGTGCGCCCKrrAGTGTGGG
TCTCCTGCCCCTCCCCTGCTCCATCAGAGAGAGCTGC^GGTCT 
CTTACCGATCTGTCTGTCAAAACCAAAGATX3TCTGGGAAATCCCTCGAGAA^ 
GTTTGGGGAAGTATGGATGGGTATGCTGAGACTCAATTACTCT 

GAAGGTGAAAATAAAGACTGCGTGACCGGGAAGAAAGTTTGAATTACTAATAGTGGG^ 
TTCTGGAATTCCTAAAAAAAAAAAAAAAAAAAAAAAAAGATAGTAAGTGGGCAAAATTGGC 
TGACTCTCAATTCATAAACCAGOVAATGGTCCAAGGAATGTTCCAGCrGCCATAA 
ATTTGTTGAGTGTCATCCTAACGGGCAAGTCACCTTCCATCTTTCCT 



CTGGATTGGCTAAAGATGCTTGGGAAGTTGCACGTCGTTCGTTGTGTCTGGAGAAGAAG 
TGGCTTGGTAAGGCAGAGGG<^CATTTTGTTTTGGCTC 

AGCCGTTGGGACAAAGACACTTAACCTGGATAACTAGACGGGCACAGGCCATCCAGATTTTCAGAATATGCT 

CTCATCTGTCCTACTTCATGTGGTTAATTAAGGCACCCGCAGAACAGGAAACCAGTACACT 

AGAAATGGGAAGGTAAGATACTGTATATGGGCAGCCATAAGAAAGACCATGGGCGTTTTGCTC 

ACTAGCTAATTAACCACGAACAAGGGGCATATTTAACTCCTTTAAGCCTCA 

TGTTGTTGTTGAGACGGAGTCTTGCTCTATCACTAGGCTG 

GTTCGAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGCCTGCCACCAC 

GTAGAGACGGAGTTTCACCATGTTGGCCAGAATGGTCTCGATCTCCTGACCrGGTG 

GGATTACAGGCGTGAGCCACCACGCCCGGCTGGAATGATACTTATTTAAAGGATTATTTTTAGGAT^ 

AAGT CCCTTG CAGAT AGT AGAT ATT CAGTAGAG AT ACAT ACTTCT CTITT ATTTG ATTGTC ATTGAT AAGG CTG^ 

TTTGTAATTTCTTTTATAGCCCTTTTGGGTCAGTTACAGCTT^ 

TTTCAGTAAAACAATATTAATACTTATGGAATCCTCTATCTTCCTTCCTCAGAACTCATTAA 

GCTACGTGTGGGTATAGATCTTTAGGTGTGGTCTGCATTAAAAGAG 

CCCAGCACTTTGGGAGGCCAAGGCAGGCAGATCA.CCTGAGGTCAGGAGT^ 

CTACTAAAAATACAAAATTAGCCGGGCATGGTGGTGCATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGG^ 
AACCTGGGAGGTGGAGGTTGCGGTGAGCGGAGATCGCACCATTGCACTCCAGCCTGGGCAACAA 
AAGGAAAAAAAAGG AGTTAAT ACAGTTAAAGCACTTAAAAC AT AG CC CAGCAAATGAT AAG 

TT AGG TGGAT AAAT AGGT AGT AG ACAG ACAT ACACAT AT ATATG TAT G AGTT AG CT AG TTT AATAAATCCACATTATTG TG TCCAT 
AATTACTCACAC7UVTCACAGAGAATTGGACCCTATTAGAAAATTTATTTT
TACTGTTCACATTTTGTTTTCCAATAAACACAA 

TATTACAGAGGAAGATCTTAATGTATTTAGAATTTTGGGTGCTGACTCTTAAATAGCAAAG 
CATGAATGCTTAGTGGGCAGATATTTTATGAAGTTTAAATGAACTTT^ 

GGTTG CATGTTGT AATTGT ATG AACGATGTCTTTAACAGGAT AAT AAG AACT C CTT AAATTC CAAAG TCAG T 

TATATTAAATATAAAAATCTTAATTTAAACCAGTCTTCTTTTTCAAATAGAAAATAATCT 

AAAGTTATTCTTAAGCATGATAGGTCTAATAATGGAATACACAAAGAACCTAGGTATGGT^ 

AGAGACCCAAAAGATAATCCAGTTCCATTCCTATATTTTACAGATGAGGTAACTGAGGACAATCATAATCCAGGC 

ATTTGCTAGATTTTTTTTTAACATGCTAAATTTAAATATTTCTGTTCCCT 

GACAGAAGCCATTGTTCTTGTGGCTAGCCATCAGTAAAGCTGAGGAAATTTGCCTTAC 

GAGCTCCTTGCCAGTGCTTATCAGCATGCTTGGCACACCTGTXSCAAGGACTCA^ 

TCTAGAGGTAATTTCCAGACATCCTCATGTGGTGCCATATTTT^ 

TGCACACTGTACTCCCTAAGGGACAGCTATTAGATGGCCAGGCCTGTTCCAACCCCTGGACTTTCATCACTGTA 

TTTGGTATAAATGTGTAATCATATGACTGAGCCAAAAAAAGAGAGAGTGGTAATATATCAAAATTTAAA 

TAAATCTTAATATAGCC^TCTTAGCCATAACCACAAATAAACTCATTTTTTCT 

ATCAAAAGAAGATCTTTGGTTTGACTTCATATTCTGGGATGTTTGTAGGTACCTGGAATGGAA^ 
CTTAAACCAGGCACAATGTCCCCCGAATCATTCCTTGAGGAAGCGCAGATCATGAAGAAGCTG 

CT ATGCAGTGGTG TCTGAGG AG C CCATCT ACATCGTCACCG AGT AT ATG AACAAAGGTTGGG CAACAC CTCTCCTGTCTCCAG CT C 

ACAGTGCCTTAAGGGGTTGTTTAGGGGAGAGAAATGGAAGTTTCCTACTTGC^ 

CATTAGGAACCACCAGTTATGTGCTCGTTAATGAAAACATTTTTTAAAACAGACT 

TG AAATGC CAAAT ACTTTTT ACATGTT ATGTG TGTC AGAAATTCA CAGG 

GTGGTACGGAGGGTTATGATAGTTTGCTGTTATTTTGAATGTAGTAGCTTTTGT^ 

TATCAGAQlCTGGTCTCTAAGCTAAGTTTAGTCrCCATCATT^ 

ATGAAATTACATGCTTTCAGAGACCACATTTATTCTCAC^TTCTCTCCCCT 

TTAGGGATAAGGCCAGAGATCATTTTGTTTTGGAAAAATAGAT^ 

CAGAGGATACCAGTGTAArri T TCTTTTCCATGTACTTTCTT^ 

TTTGTATCCAGGGTTATTTTGT AAAT AACTTTGTGATTAAAG CATAAAATATCACTGATGAC CATCATAAGCCTCCAGCTT CTGCA 

AATGATTATCCCTGCTTTGGTGTTAGTTCTAAGCCTTCTTACXSCTGTGT^ 

AGATXn'CTGATTTAAGGCTTGGCTGGCTGTGATCTAATTAAGGG 

GATTCAAAATCTGTAAATAAGCTAAGCTAGTGAGTCTCAAACTTTACCTTGTATTAT^ 

CAAGTTTCTCGACCCAACCTT CAAAG TTTCTGAT 

GTGTTATTGTAGCTGCTGGTCCCAGGACCACAGTTTTC 

GGTGATTGGGGAATGGGATTGAAGAGGTGCAGTTCTTCATAGGTTCCTCTTTATGACATAC^ 

AGTTTTATCAAATATATGTTGTTATTCCTAGAAATGGAATTAATGCT 

TACTTTTTTCTACCCTGCTTTTTTATATAAACCITTTTGTCCCCCCA 

TAATAAGAAAACCAAATACAGCTCTCTGTTGGGAATTATCITTGAAA 

TACrGGATTTCTTAAAAGATGGAGAAGGAAGAGCTCTGAAATTACCAAATCTTG 



GGCTGCAGGAATGGCTTACATCGAGCGCATGAATTATATCCATAGAGATCTGCGATCA^ 
TATGCAAGATTGCTGACTTCGGATTGGCCCGATTGATAGAAGACAATGAGTAC^ 
GGCTCGTGATCACAGCTGTGAGAACAGTCACTTACTGCACAGCCCTTCCTTCCT 
AATCTCATTCTTCAGCTTGGTTTGGACACCITGATCT 

GAGGGATGAGATATTGGTTTTTGGTGGGTGGGTGGGTATTGGTATTTGCTGCT 

GCGTCCAGGCCTAGAATACTCTGTTTTTGGAGAAAGAGACC^ 

ACAGGCAGGGTTTCAGAAAuCChACAGGG 
GGGGCAGAGAAGTGGGCACAAGAAGTCAGAATGGGCCACTGCACAGAGAGGGCCTCCT

TGGGGATTCACACCCAACCCCACTCTAGCCCACITCTGGGTCTTCTG CCAGACCCAACCAG AAGT AGAGGTCACAGGAG CTCAGCG 
ATGAATCCrrATGGGTCCAGAGCAGGGAGAGGGAGAGTGGAGAGTGGACTTACACT 
TCTCTACCTC(TITTCATCATCCCCGCCATCCCACCCTCTGCAAAAATTTATAG 
5 AG CAG CCATT CCAGTGGGG C AG CTTTT CAGGTG CT ATT AGAG CACCAGATACT AGAAAGGAGTTTTT AGGAAT ATTTTC CITCTT C 

CCAGATACTGACAAAGTGGTCACCAGGTAAAAGTGTGTGATTAAGTATGTGGTTTC 

T AATGTTT AT AGCTGGGG CAAGGGGGAACTTTCATTGTTGGTGTTT CTAACC AAAGC CT AGTCAT CC CAAG CAAAGCGG AGCAGTG 
ACTGGAATGTATTACAGGGTGATCCTTTCTACTGCATTTCATAATGGCCAG 
TTCTCTTGCCCCTGTTCCTCCTCCTCTTGATTCTTTCATT^ 
1 0 TTCTTAGCAGCCTCAAGTGATGAGGAAAAATGTTAAACr 

GGTCAGACAGCTCAGCTATTTGAATTTCCCCAACACGTGGCAGTTCTTG 

G CGTT ACACAC ACACAT ACAC CCC CGGTGCTGGGCCAGATGG CATG CTG T ATCT CT AATCAAAGG CG CACTG CAG CTTTGATTGG C 
A(HX3GCCGTCCAGAGAGGAATGTGCATGTGAATTTGAGGCAA 
CTGAATGAGGGACTTTATCCTGAGGCAAGTCTCTCAGATCCCAGCTCCTCAGACT 
1 5 TGAATATTTGATTAAGGAAAGGGCTAGAATAGAGCCGTGTGCTCACCCTTCTC 
TTTCAACTCTACAGGGTTTGTGGTTAACGCTCATGAACCATGGGTAAATGAAA 
AGGGGTGCTGTTAATAGGGCCn'GGGGGTTTAGTCCACGCCTTATTCACTGGA 
GCACCCTCGCTGGTTCCCCTCCCATCAGGCTGCTGCCCTGAGCGGGTCCCAGGCTC 
CTGCATTAGCTGTCCTGATTGCCCTTCTGCCGGCTGCCCAGCTGCACCATCATGTCCC 

2 0 GCCAACCTTGTTCTTTTAGCCTTTGTACTGCGTTTCCTTACTTAA 

TGAACTGGGGGGGGGGGAAATGAAGCTTGGCAAGGCAGGTCCCAGGATGTCCCAG^ 
GGGGTAGGGAGTGGCAGAGCCATTTATAAAACACCTTCCAAGTGCCJVGGCCTGAGAGGAA 

AGG AAT AGACT CAGAGGGGTTAAGT AACTTGT ACAAAGT CACACAG CT AGGAAAAT AG CAG AGCCAGGATTTG AAT CCAGAAATGG 
25 CTGACCCTGTTG CATCT ACCATTCTAAGCTATG CTG ACCACATCTCATCAAGTCAGTG ATCAAAAGAACAGAAAATAAAGATAGAG 

AGG ATT AGT CAT ACAT C C CTTTT CCC CTTT ATTTT ATGGG ACTGTTTTT AAGG AGCACCTT CT AGTGTGTCTGT ATG AGGGAGTTG 
GAGAGACCTTCCTTTTCCAGCACTC^GACAACCCAGCTGTTTATAAAATTC 
AGGCAGAAAGGGCCTGGTGTGTCTCTGCTGGTGTACTCTGCAGATGGCATGTG 

3 0 CITGATTATGGGAATGCAGGCGGCATTCGGAAAC^TGCACCAAACACAGATATTT^ 

AGTTGTGAGTTTCCCATCTCAGCGTGCACGGAAATATCTGTCACCCCTTGGGCA^ 
ATTGGAGTGTGGAGCTGGGTTCCACAGTCCTTCCTGTGCrCCTTACTTC 
TGTCAGGGAGGTGGTCAAGGCCAGGCCCTGAGCCTGTGAGGTGGTCTTCCACTGACAG 
CGATTGCTTATTCTCTATTCCCTGGACATCTCACTCAATTACAGCAGATAGTTCGTCT 

3 5 CCCAATCAGAGTCTCTAAATAATGCCAACATCAGTTATTTGAGTGCCnTATCT 

AGAAGCCTAAGACTGCACGACAGTCAGCAAGCATGAGGAGGAAGACGTGGGGGATTGTTGAAGTCAGATGGAACT 

CCAGCGCTACCACCTGATGTTCTGGGGTCTTAGGCAAGTTATTTAACTTTTGCCT 

ACATTTTGTCTGCTGTATGCATTTTTGCAAAGATTAAGTGAAACAATGCATGTAAA^ 

TAAATGTTGCTTGTTACTGTTGGTATATCTGGTACAATACTAAGTTCTTGGGGAATAAACAAATGAGTGAATGGATTGTA 

4 0 TGGAAAGT CTCAAAC ACT ATTCAGGAAAGGG AG AGAAGAAT C AAAGAT AA^ 

GCATTACCAATAACAGAAATGAGTACTTGGGAAGATCITACTTTCAGAGAAAGA 
AGGTGGTGGTCTTGGTCCTCCAAGCCCAGAAAGATGAGTCGGGGTGCCCT 

TCCAGC CCT CCCCTG AGTTCTGGTCACAG CCACCTCCCTAC CCCACC ACTGCACCTTCTTCCCAGAATCATT CCAGAACTCACCAC 
AGGCACCACAAGGATCTGCTGT CCATCTTCTTGTCCATACCACACCTGTCT CCTGTGACCCCCGACATAGGCAATGTCACAAGCCT 

4 5 TGCTTCTGGAAATTCACAGCCCCTCCAGCCCTCACnTCCTTCAGAGATC 

AGACTCTAGTTTTTTCTTCATGAGCTTTCAGTGCCGACTAA 

AGCATAACTTCCTTTTGTGGATACCTTTTTGTTCTTTATGAGCTATAACACACAT^ 
CCCGATGAATTTTTATGTACATATATGCCCCTGAATCCACCCCGAAGATAAAGATA 
TTATTCCCACTCCCAGTCAGTAACCACCCCCACAAATAGCCTTTTTTCCCCTTTTCACCATAGATT^ 

5 0 GAACATCACAGACATGAATCCTATAGTATTACTCTTGTGTCTC^CTTCTTTCCCAT^ 

ATGGAACAGTAGTTCTTTTCATGGCTCAGTAGTATTCCACTGTACAAATATATCA 

TGGGGTTATTTCCAAATTGGGGCCATTATAAATACGGCTGCTGTGGTGATACACACATGCCCT 

AGTGGAATTGGTAATTAGAGGGCAAACCTAGGTTAGCAATTGCAGTTACT 

GA^ATACACACTTAGTGTGTCATTTG CCAGT ACTGTACTTGCCTCT CTGCTAGTGAGGCAGCCTAGTGAG ACTGAGG AACCAGG 

5 5 GGTTGGACATTAGGATCCCTGGCTTTGAATC^GCTCrGTCACTTACAAGCTTGGTGTCC^ 

CCTCATTTTTATCATCTGTAAAATGGTGATAATAAAAATGATCACCTCACATCATTAGTC 

AG TT CTTTG AG AG TTGTG AAATG T AAAAATCATC CTT AT CAC CT CAACTrrTTGT ATTTG AC ATACA'O TT C CTTG CT 

AAAAAGTTTTTAAAGAGAATCACTATTAGTAATAAATAGGGAAAAAATCAATCTTTCAAGAAAAAA 

TAGCCTAGGCCAGTCAGTGGATCTGGGCAGTCACCACTTTGATATACCTGCATGCTCCAAGGGCCCCCTCACCCCG^ 

6 0 GGAGTCACACATCCACTGTCCTCATCTCCGGCCAGCTTGCCTATTTTCTCATATTTC 

TGCGCTGCCTCTCTTGTTTTTAATCTTGTGAAGTCTGAATATCAGATGGGCC^ 
ACATCAAC^TGAGCACACATGAGCACACACACGCTGCATGAGCACACACACG 
ATCTGTGGAGTCGGTCTTACTGCTTCTCTATAGAAGTGATTTAGTTTAAAGACTAGCAG 
GTAGTATTTAATAAATATTTGTTTACAGCAATGGAAATAACCAAGGTCACCA^ 

6 5 CTGTCTCTCATCAATGTGCATGGCTCrCTCTGGGAATCAAATGCTGTTGTGGCTGC^ 

TTGATATTCCTTGCACACTCCAAGATTGTCTTTGGAGTGATGGGAAGCACAA 

CTGACG CCGT ATT CCT AGAG CATAAGTTCTGG CC CCTCTGAG CCT CAC CTTTCT CATCAT AAAAT AGGATTG GTGTTGG TT AC CTT 

TAAGGTGGTTGTAAGAATTCAAGAAGACAATTAGTAGAAGTT 

GAGTTTTTTTGTTCTGTTTTATTTTGGTTTGTTTTTO 

7 0 TACACCTCAGTCCCCTCCCAAAGAGGGCAGGGCCTCGACTGGTTAAGAGCATGAC 

TAGCATTGCTGCCAATTCACTGTGAGACCTTGAACAAGTTGCTTAACCTGTCTC 
ATAATAGTTATCTATATCATAGAATTGTATGGCTTCCCTGTGAATAACCATAAAGCAGTTGGC^ 
GTGGTAGTTGGCAGTAGTAGTAATAGTAATAT'fAT-TATAATATTGATCCA^ 
CTCK^CA(^CAC^TACACACACACACACACACACACA^ 
7 5 CTCCTTCCACATGGAGGCTCTGCCTCTAT CATCCTITCTCCTGATTTC AATCCTCATG AAGG ACAGGAACTGGGGTTC CTCTATGG 
TTTTGGTTGGTTTGTTTTTTAAGTTCAAAATTAATGAAACAAATCT
AATAGGAACATTCGGGTCITTGTATAAGACM 

TGCTCCCAGTATTGAGGGTTTAAATTGAATAGATTATTGTCTGGAGGAAAAT^ 

TGTTG CAGTTTTCATGTT AATT ATTTCATGTT AAAAAATTG^ CAT AG AAAA CAT ATTTT AG 
5 AAAGATTTCTTACTTTAATAACTCTTTGTGTATTA 
AAGCACAGTAGCCATTTAACACAAACAAGCAGGCAG 
TTTTGAATTTCACAATCCAGAACTGTGAATTCAATTCCT 
CCAGCirTXTCAGCrCAAGGGCTCCTGCAG^ 

CTCGCCATGAGATGCCCTCCCTCTGCCCCCGTCAGCACATGAATTTCTTC 
10 TGTCCTGTGACCTGAACAGCAAATATC^VGGAGAGAGAAAAGGAA^ 

TCCCATGCTAGAGAAGCAAACCATCCTTTCATGAAAAGCAAACCCTTTAAA^ 

TCCC^CTCAGTGGTGCCXSCCATAAAATGACCATGCTCAACCCC 

TCTAGGGAGAGACCCCACK3GGACTTGCCCGACAGAATGCCCTGCX!CTCCGAGG^ 

TGCCTGAACCGTGCTGCITGTAATCAAGCTrc 
1 5 T ATTCTGTCGCTTCAAG CT CTGAG CTT CCACCTCGTTTCTGGGAGGG ACGG CTTTGTTT ATCTG CTG AG CCACCGTGGAG CTTCGG 

GGC CCAGATT AAT AGTTAGGGGG AG CAGGTCAAATTGG CAGGGAAATGG AG ATGAATTGT AGAG CGCAT CCATTTT CCTCCCGG AA 

AGGGTX3TGTTGAAR.GCCAAAGATTAAAATTAAAAGAAAAGGGACTCT 

AAACrrcTTTCAGGTC<K3AGATGGGACCAATTTCm 

TGGAGGTAAGCAGTGGTCACCTCAAGCCCAGCre 
2 0 GTGTGAGCATCTCTTTCTCCTTTCTGCTCTTTTG^ 

TGTGCAGCCCACGTTTCTTGGTAATCATCAAGAAGGAAGCTAGATG^ 

GACTGCCATCAAACATCAAAAATGTCAGTTGGCTTCCrTGTCTT^ 

TAATCATGGACACGCAGTACATTTGTAAACTGCCCCAAGTTCTGAAGCCT 

TGCTCTGCCATCCCCAGGGTGGCTCCCCCATTGGCACAGACACTGTACCAGAT^ 

2 5 AAGTGGTGTITGTCrTCTGTATATCCTATTATCCTTTGATATGTTCT 

OVCAGTATCCCCAAACCCAGTGCTGCCCATCCAGCCCAGCT 
CTGGGGGTGGCTGGATTCTTGGCATCATTrCTGTGTTGAGCTAC^^ 
ACCCTGGGCAGCAGTGTGCAGTTCGTTACAAAGCTCTO 
GAAGCTGTGAATCAGAAlTTTTATTCTGlTTG'lTTr^ 

3 0 TTCTG AT ATTAC!TTCCTTT CTGGCTCTTCGG AAA.CTGGAAGG A CAG AG CCT CAGT ATTCT AGAACAGCAAG C 

AGGGCTCCTTTTTTCTTCAGTCCCACCCAAGATAACCACTTAAATA^ 

CG ATGGTTCTCAGATTT CCTT CTT AAAAGGACT AAAT AC CCACAGTTT CAG G AG ATG CC CATT AATGTT AT CCTCT ACTTTTGTGT 
TGCCCCCATAAGTTACCCTGCK3TTTGTCATTGGTCACCGTTG 

CTT CA CTTCCTGG CT CCTCT AGGG AT ATTTGG AAGGAATGT CACAGTTT AC CTIT CTGGT AG CCT CCA.TGTCTGGAT CGAATGCC CT 

3 5 TCCACAATTTATCCTGCCTATCCTTCCCCGACTGCAAAACCCCAGGATTAATACCTAGAGTCCAG^ 

GGTAGCTCrAGCCTACTCCCCTGTGTCCTCTAGTTGTCCTC^ 
TCCAGTCTTXl^CTCTGGGCTCrCTTGGAACTT 
TTTGGATCTGTGCCTACTTAATCAACCTTATCTTTGCAATATGTT 
TAACTTCTCCCTTCTCCACTCACATCCCCGGTGGGACAGCATTCCTCCTTCCT 

4 0 AC ATT ATG AAATTT CAG ATGGAAT AT AAT CTT AACC C ATTGT AACAT AT ACTTTT CAAAAAAACATGATGCT AAAACTTT ATGAAC 

TTGAAGTAGAAGTTTC!AACAGAGTGCGTCACCGGGGATAACTGGAAA 
GAAAGTCATTCCTTTGGCCTAGTGTGAAAATTCTGAGTTT^ 

TAACCACAAAGTTCCAGGTCAGGTCAGCTTTG ATTTTGCTT CTTTCACAGGATATCCTGTCCTGGT ACCG C CTTGTCACCTCCCCC 
TCCTGTAAACCAAATTTCTGTTATTGGCATTGTTTGTGATTT^ 

4 5 TTG CAT ACT ATT AAAT ATTGG AT AT AAT AATTTG CAC^G TT AGTGG CTTT AT AATT AAAAAGGG CT CTGGGTGG CTCTTT GG TT CA 

GAAAAACAGTGTCTATTGTTGAAGTTAATGGGTTTCCTCCTTGAAAATGCT 
TTGGCTTCTTTTATTCAATACTC^CTTCACCTTCT 

ATTGGCCTGGTCACCAAGAAAAATAAAAAAGAGAGAGACTTGGCAGITCAGCT 
GTCCTGGCCCCreCCGTGACCCTCCCC^CCG<^AGGAGAGCCATGGGTC^ 

5 0 GCCCTCATCTCCTGCTTATGTTCTCTTTTCCTGTCTCTGCTGAGCCCT 

TC^GTTTGCCCGTGAAGTCTGTCTCCCTGGCTGCCTAGAAACCCCAAGGGCGGAGGCAA 
TCACAGGGTGCTGGGGACATCATGGACCCTCAGAAGATTCTCAGTGATTGGTTGATGAG^ 
TTCTTTTCCTCTTCTGATGTTTGGAGAGCTTTGGCATTGCTGATGGCA 
CTGAATAAAATAACAGCTGGTATTTATTCAGCTATTACCAG^ 

CCACAACCTTAGGAGGTTATCATCCCCAGTTTACAGAGAAGGAAACT

AGTGTGCAAGCACAGGGATACCTGACTCGGAAGCTCCTTTCCTAAATGATCATAGTGGTCCTTCCACAAG 
GCTCAGCCCACAATCGGACCACTTTGCATTGTCACTGAAGAGCATCCACT 

GGAGGTGATTTGGCAGCAAAAGTATCTGCGGTCTTCATTGGAG CATTTGAAATGCCCATG CCATCATCTTAATGT ATGAAAACACA 
TAAAGAACAGGAAGTAACTGAGTCAAGTAAGCTATTTCTATTTGAGGCAG 

GGGACTTCATTAACTGX5AATGTGGCCTGGGGCTTGGGATGGGCCATCCTGA

AAAAACCGAGAGCATCAGTCAGGCTGTXSCTGTCGTATGAAAGGAAGATTrT 
GTTTATTAC^CCACTGTATTGAC^CrrCTTTG 

GTCAG ATAACTTACTG CCTGGCTGG ACCCTTTAGCT7U3TTAG AGCATT^ CAAGTTGCTATGAA 
GAAATATGTGTCTCCAACCTCAAAGGATTTGCATTCTATTTAAGGAGATTTTTTAAATAA 

CAGAGGAGACATGGACACAGTGGAACAGTCTATAGTGTTTATGCAGTCAGGCT

CCTCATGTGCCCAGTGCTTTCCATGCCTGACTCTTAGCAAGCG 
ATTAAACTGTTGTTTCrAGCACTGCSCGCTAGGT^ 

GGTGGGACTGGAGCTGAGGAGGCTGTAGCTAAGCAGGTGAAGAGAAAGAGAGCATTTCCAAAGCC^ 

AGCATGCTAAGTGCAGGACACAGGTTATAGTAAAGCAAACTGAACTAGAT^

AGG CACGG TGGCT CACATCTG TAG TC CCAG CACTTTGGGAG AC CAAGGCAC^IAGG AC CACTTG AGC C CAGGAATT CG AG ACT AGC C 
TGGGCAACATAGCAAGATCCCATCTCTACAAAACATAAAATTAA^ 
AGGCCGAGGTGGGTCGATCACCrGAGGTCAGGAGTTCAAGAGCA^ 
AAAATTAGCCAGGCATGGTGGCGTGCACCAATAGTCCCAGCTACTCAGGAG^ 
GAGGTTGCAGTAACTCftAGATTGCACCACroCA
TAAATTAAACATT AGCCAGGCATGGTGGTG CACATCT AGCATCCT AG CTACTCAG GAGAGG CAGGAAACTTG CTTG AGT AAG CC CA
AG CAAGAAG AAGAG CCCACAGGGG AGACACACAAAATG CTTGGAAAG AAAGG AAG G ACATGGGGAAG AAT AAACTCAGAAAAAT AA 
TTCATGCATTTCTCATAGAAATGTTTGAGCTTTGCGGGTTGGAA 
ATTCACATGTGAGAAGTGTGCTAAAATAGTACAAATCTACCCAGAGC 
ATCATCTACCCAATTGCTAATTGCTGATAGACACCCATGGGCCTTGAAA.TGTCACGA
CACTTTTCCCATTCTGTCTCATCATAAGTCATTCTCAAG^ 
GACGTGGGAAGTCATCGCACTTCTGCTCACAGCCCAGTGACTTTCTGTTG 
CAGAGTCCCACAAAATCTGTCCTTGTACCCCATCTTATTATTCTTCATAGCA 

TTTATTTTATCTCCCTCAAACTAAGCTTCCAG CTCCATGACTGC CAG AGACTGTGTTCCCTTTCGTCCTCCK^CACCXIAGTGCAG 
TGCCAGGCACTGGGGAATACTCCAGAAATACTTGTGGTAGGAAGAAAGAA

ATTTCTGTGGACAAACTTTGATAACTGAATAAATCCACAGCCCTCTCCT^ 

TTTAAATGAGGAAGACCAGGGCTTGGTGAGGTCAAAATGGTTG 

TCTTAATCCAGTCTATTAAGGATTTTCCACTTTGTTTCACTGTGAACT 

TAACTAACCACCCCCATCCCCAGCTGCAGCCCCAGTGAACrGGGCCCCCTTGTCCr 
AG AT ACATTCACT C CGAC CCAAAGGGGATCTGAACACACTTC CTTTGGCAG TGAAAATG CAGGGTTCTC CAGGTGTTTT AAGTTT C

ATTAGTGTGTGGCTGTTGCTTTTCTATTTCAATTGGCAA 

ACCAAGGACAAGTAGAAGACTCACCAGAGGGCAGGGAACTGATAATACGCTCTGTAAG^ 

AGGG CTGGGAGGG CTG ACCAGCTACTCATTCATTCATT CAGCACCTGTTT ATCAAGTG CTTTCT CTG CAG CAGACACTGT ACTCAG 
TACTGGAGATGTGGCCATCTGGGAAGACATGCTCCAGCACrCAAGC^ 
ATGGATTCATTGCCCTTTAATAATAACAGCTTCGTTAGGCAATATTTAC^

TCTGGCAGCATAGAAGTAGGAAACCTACCGCAGTCTTTAGTGATCAGGCAAAGGTTCCCTC 

AAAAGAGAAGTAAGAATTAGCCATGGGTTAAGGGCTTTTGATAAAGGAAATAGCC^ 

GCAAAGCAGTCAAAGAAGGGCAATGGAGAGGTTGCAGAGGAG^ 

AGCTGGGGAAGTAGGTAGGGACCAGATTATGCATGTAGATTCrrATTAGAAGGCTACTATAGAAACCCAG 

AAGTGAGGACTAGAGGAAGGGTAGTGAATTCAAGATCATGAAATGGTCAGGATGTAATGGACCT

GAGAGCACAGAGATCATTCAGTAAAGATGTATTGACCTCCTTCAATAGG 
AAATAGAGTGTCCAGGTGAGGATGATTCTCAGGATTCTGTTGTCACCAACTGGGTC 
AAATTGTGGGGAAGATAAATTTTGGATGTTGCATTTGAGGGGCACTC 
AAATTGGTTATGCAAGTCAGGGACTCGAGAGAGAGGTCCCAT 

GGAGCCATGGGATTGAATAAGATCCCTGATATTACACAGAGTTTGTGACCA

GATTGAGCTGTATATCAGAAATTATTTAAAACATCGACACATTT^ 

TAtATGTTCCTGGGAGTTAAATGACTAAGGCACAAAGATCCTCTTACATGGAAGTTCTAGGACATGGGTACCAGTG^ 

G TTGTCAAGTTTT CACAATG ACAAGTCG AGGTTTCGGCAATG AG TTT ACCT T AAAATTTGCAAAT AGTAC CTGGA 

AAACATTCAGTGGAAAAGATGCTTCCAAGTACACACACCACACACACA 

GGCCTTCGTGTATTGGCTTATCCTAAAGAATCCTAAACrGTTAAGCT

GAATT CATGT AAACATGGAACAAGTACAGT AAG TGTT AC AG TG AAG CT CAAG ATAT CAGTGACAT AGATGAT CATTACCTG C ATAG 
TTTG CT CTGTTT C CT CT AAAT AAAATGCAAGG AGG AG AAAACTG TTTTCAT AT CT AAATCCTGCTCCCA^ CAGAAG AATTT CAAA 
GTGTG AAT CAGCT AGTT ACAT AACAAT ACCAAG TT AACTGCATTGCG GTT CTC CTG ATGll'U rCTCCGTTGTTCTGTAAGTCCATA 
CT ACTTAGAAGACCACrTTGATTCTCACTTTTTGC CTGTCAGAACTTTACTTTCAAGGG AAAGG CAACGTTTACACCCAGGAG ATC 

CTTCCTTCAAG^TGTCTGAAGTCTCCTAACTTACAGAG

TATGTACTACACAACCTTAGGGATGCTGTTTTCATAGACTGGGATTTTACACTGTC 
GCCAAAAATCCCCAGTTAACTCCACCACATTTAGAGCTTC 

AT AAGGGG AAACAGAAG T AATGAGATGTGT CCAAGAGGAAG CAGGG ATT AG CT CTTTT CAGTAGAAACCTCAGTT AATCATT AAT A 
CCTTATTGCTGCTTTGATTTAACCTTTTGTCTTCCTTC^AAAAGGA 

CTGCTGTTTTAAGACAGAGAAGAAGCAC^AGAAGTTGGACAAAGACATTA

TTGGTGTAAATATATTGCTTGCCTTAGAAAGAAAATTAATACA 
ATATCTACAAATGACTTCCTTTCAAAGATTCCTTACATTAATGCA 

CCTTATTGTAGATTAT AAAATGAG AAAATAATAAAG CTGACT AATG CAACTC^G CTGAT ATTGCCGTTGG T'l' T ri'Tl'TAAT CTGAAA 
TTATGAAATCCCTTGGAAAATCTAACATGACAGTATCCAAATTTTAAATAG 

GAGTGATTGAAAGAAGGGGAAAAACAACTCCTTTC^TT

TTTACTTCAGAGATGGTCAGAGATTTCCAAGCCATTTGGCT 

TCTGAAGTAAGAGAACTCCAGCTGAGGTTTGGTCCAGAGCAAAGTCTGCTCT 

TGGCTCAGAGTCCAAAGCTGCGTGAACAGAATGCTAAAGGATAGGAAGTGTGAGAA 

AC AACTGACT ATTCAG AGAAT ACCAGG T CTTATGTGGAATCATCT AG C ACG T AAGGAAAGAT ATAAATAATT ATGC CTG AG AAACA 

TAACTAATTAAAAATAACTCCATGCATAAAAGGAAAACTATAGAATAACTTTTCT

GAAATAGTAGGATTTTTTTCrriTCATTTC 

GGTAG ATTTACT AAAG ATGTTATCAT ATG CTGT ATTTAAGACAAAAATCTAATTG CTGTAATTTATCATACCTCTGGCT ATTAAAA 

CTGTGCCAATTTAAATATTTTAATCATTCGCCACTTTTGTACATAGT^ 

TCAC^CTTTGTATGATTTTCAGGAGCTCCTTAAGCAGAATGAACATGCTT^ 

TCCTTGTGT'IT' i' T CTT IT AGTTTGTTCAGT AAAAT ACCTGGAACTG eTTTATAATI^CTG AGCTGAATGTrTGAGAGCACAAG CTG

CTGAATCC CTTTT ATTTTATTTTATTTTATTTATTTATTTATTTATTTATTTTTAT^ 
TGCTCTXH'CACCTAGGCTGGAGTATAATGGCACGATCTCC^CTCACTG 
CAGCCTCCCAAGTAGCTGGGACTACAGGCTCGTGCC^CCATGCCTGGCT 
GTTGGCCAGGCTC^TCTTGAACTCCTGACCTCAAATGACCGCCCGCC^^ 

CACACCCGGCCTGAATCCCTTITAATTCCCTGCACACACATCAAGCTTGTAGAAAG

AGAGACAGGATTCTTTCAATGATCAAATAAATTTTTGAAAAAAAGG^ 
AAAG TCAGAGACTCAT AAG ACAGTCATGGAGGCAGAACTGCTCTI^ 

AGAAAAAGTGGTAAGAAAGTGATCTTACAGTATGGAAGAAAAAACTATCCCAGGAGTAAA^ 
GAGGAAAAACAGCAAATGTGGAGAGAGAAGGAGCTGGAGGCAGATTCCT^ 

AACACATTTCAGCAGAACACATGC^CAG^ATCCAGTTG

AGTGAGACCACACTATCTTATATACCCGAGGACTCATGAGAGCAGCATTATGACTAAATCCT 

TTATTTCAAAGTTAAATATATATATAATTTTCCTAAATCATAAGAAACCAGGG 

TTTGAGCTGGATTACTTACGGTCACACAGTCAAGACCAG 

■rGTCTTCCCCTCTAAACCTGCTCTCAGGAAAGTAGCTTGCCAAGTGCCTTTCCCT 
GCCTGATGAATAACCAAGTGTT^TTCTGC*C^TG
T CACAATOUU3 TCTGACGTGTGG TCTTTTGG AATCTT ACT CACAG AG CTGGTCAC CAAAGGAAG AGTG CCAT ACC CAGGT AAG AG C

AGGGCCTCACAGACATGGGCCTGAGATTGGAGGCTTTTTTGACACCA 

CCTCAGCTTTCTGCTGAGCTTGAGTTTGAAGAACATG^ 

AGACAAGCAGCCACCATGAACTGTGTGCCAGGATTAATGTAAGTAGC^ 
ATCTCTCCTCrCTCAGTAGGAGTATGTGTCTGTGGTTC^

AGACATGTCTGGGAAAGAGAAGGAGAAAAG CCCCACTGCTTTTTGATTACACTGTAG CACTG ACC CATGG AG TGAGC CTGGCAGGA 

AGGTGCCAGGACCTGTTCTCTCCACAGGACCCAGCTCCAG 

ACTCACTGCCTGTTGCrGAACAGTCAAAACATGCTATACCAAAG 

TGTTTCTGTATTATCAGATTTATGCTCTCCAAAGAAACCACG 
TGACATAAAGGACACCCATGCCATTTCACCTCCACGCTCT

CATATTTTTCAGTTACTGCTGAGTGAAGAAAGAACATAG 

AAAAAAAAACAATGATAGAAACGATAGTGACGATTAATGAGGAATTTAGGTCT 

CAGTCAGAGGAGCTAGCGAAGGCAGATTAATTTGTAATCCCT 

AGAGAGGGGCACATCAAGATAATCTCGTTCTGGGTTGGTGTAGCC^ 
TGACAAATTCCCAGTCTCTGAAGTGCCGGCACTGGGCGTTTAGAGATGCGGGTGTA

TGCTCCCGGCATGCACACACGGGCAGACACTCCGGGAATCGCCAACCAGC^ 

CGTGGGTGTTTCrcSAATTGCTTCTCAAGGGGGGAA^ 

TTGTCTGAATTTTGAACTGATTTCTTCCTGATCCCTC 

TAATG TTGTTTT GCGTT T CT GT GAAGCCTTCCCTTCCAGTAATACAAAGCAGCIT^ 
GACAAAGGGGTGGAGGTGGGGGCCTTGGAGAAGGAAGGCAGGTGCGTGTTTCTGAGGGA
AAAGAAAATGCCCTCATTTTACCACTTAGACCAGGATTCATTCATTCAGCGTAGCT 
CCTCTATCGGGGAGATAGTGCAAATAAGACCCTC!ATACTCrAAACAA 
ACAGTGGGCAAGTAAATAAATTTAGTCCCTATGTTCATGGATATTACAGTCAGATA 
CCAGCCTTCTTTATTCCTTGTTTATTATCAGAGGGAGTCATCC^ 

TCTTCATTCTGATAATGGGGTCAGGGACACCTGATCCAGGTATCC!ACACAC^

TGTGAGGATGGCAGAGGAGTTTGAAGTGAGAACTAACTCCAGCGGCrCCTG 
GGCTGTGATAACTGCCTCTAGCTGTGGAGGAGTGCTGGAGAGGGGG 
GTAGGTTTCCTCAAGCCCAGAAGGCGGGAAGAGCATCCCAAGGTCTCGC^ 
GGGACCACAGGATGAATCTGAGAAGATGCCATGTGCAGGGGCACACCTGGAGATGTGA 

GGGCCTTGCTGGTTGGCAGTGGCAACCAGGGAACCGTGTCCTCATTGGGGGTGTCGTG

AAAAATTGTTC C CAGG CCTGTTCCT AG CCATGGG TTCTTCCTGGG TTTTG CTT1TT AACAT AG CT AT AATC CT ATTT AATTGGGTT 
TGTGAATCCAACAGAAAACTCTTCTTGAGGGAGAAGCTTTTAC^ 
TAATTCCTAAGGAACAAGAGGTTTCACGAAGCTTCCTTCAGATGTC^ 
CTATTTGTTGGTGTGGCAAGTCAGTCAGTGTGCGCTGAGGGGTCCTTCTCAGCTTCC^ 

ACCTCAGGACAGACTCCC^GTCCCGGTCCATCATGATTTC^TTCTGCATTTT

AGTATATTTTGAAGTAATTCTTTTTTGGAGAA 

TTTCCTTGACACATAAGAAAGCTCCTTTGCCTGGTAGGTGAAGATGGAG^ 
AAAAGCTTTCCAGGCATCTTTGCCCAGCGTGAAAGAAGGTCCAGAAGTGA 

TGTGACCAAATAAAGGGCAAAGGGAGGCGGAGATGTTAGAACATTGGCAAGTTGTGTGAGAGGAG^ 

ACAATCTCAAACTGAATGATAACCTGGAAGCCGAAGGAAAAAATACTGAAATATGATGCrrGAGATTAACG

GTTACATCACCTTACCAGATGAGGAAGGTTGCTTTCGAGATGTGTA^^ 
CCCCGCTAGCAGCCTGAGCTGAACGCArTCTCTGCATTGCAGTCTAGCTTGCTTTT 
CTTCAAGATTAGAACTGATTTCATACTGATGGATTGTTGTATGTGAGAAATAAAAAC^ 
ATGCATAAAATCAGTTATGCATTTTTACCTGTCATTTCACCTTAAT^ 

CTCCCCGCCTCCCCACCACCCACCCACCC^CCATTCTATCATTtSTTGCCAACAGA^

TCTCACCCTG CTAGTT AAAAACGTCCACGG CirCCCTTTGCCTTTCAGAT AAAT CCTAGAAAACX3TAATTCTACATAGTTCT 
AACTCTTTTTCTGTAGTCATTCCACAAATATTTATTGTGCACCTACTATATG 

TG AAGC^AATAGACAAATGCTGCTG CCCTCAT AG ACTT C CATT CTTCTTTGAGG AGACAAACAAT AAACAAAGATAAAT AAG CAAAA 
TATATTGAATGTTTGCTGCTGGTAACAGCTAGGGAGAAAAATAAAACAGG^ 

CACTGGTGAGTAGAAGACCTAAAGGAAGCAAGAATCCAGCCAGGTCGGTATCTG^

TGCAGAGACCCTCCAGAATCAAAGGAGGCCAGCATGG CTGGAGCAG AGGGAGCAGG AGGTCCCAG AGG CCCCAGGGGCAAATCAG A 
TAAGGCCTTTCTAGCTCATGGTTGAAACTTTTCCTTTT^ 

GATTTCTCTCTTGCTGT CTTCCATCCTTCACACGTTACCAAGAC ACATCZAAGAGATGTCTCTT^ 
GCTCCCCTGTCTGCCTCAGAACCCTTCAAAGCCCAACTC 

GTGCTCCTTCTGCTTGCCTCCATGACACCTAGCATATTTCTGT^

GG AGTCCCTTTGATGGATTCCCGAGGTCCAGCACAGGGTCCACCCCATAGTAGACTGTCAGTAAGT 

AT GAAACAGAAAAAAAAGAACT CAC CT ATGAGGTTTTTG AAACAAGGTTTTT AAATTT AG C C CAT AGC CTG CAAAG C CTTTT AACT 

GCATTCCATCATCrTCACACATGGATATATATTTTCCTAGCCACAAGGTAGCCTCTG 

TGTGATCATTTCAAACATCATGTTCCAACACTATTCTAGGCAGAGTGTCCTG^ 

GTTTTGTGGATTTGG CAGG ACCTGXSTTVG AACAGGAACACTTTC'r' r rr r T'l T 'rr'l T'l rTlTTl'TTrTTTTG AGATGGATGGAATCTT

GCTCTGTCAGCCAGACGGGAATGCAGTGGCATGATCTTGGTTC^ 

AG CCTCC CAAGAAG CTGGG ATT ACAGG CACCCGCCAC CACG CCCGG CT AATTTTTG CATTTTT AGT AGAG CCAGGG TTT CGCC AT G 
TTG^CCAGGCTGGTCTCAAACTCCTGACCTCAGGTGATCCACTCT 
TGCACCAGGCCAGGAACACTTTCTTAAGCAAACCACACTTATTC^ 
TTGAATACATCTAGTCGCTTGATAATGACCTGAAGAAATAAGGGAAATTA^

TG CCACTCTGGTCCrCCCTCTGCCTTAG AGCCCTGAACCGTAGATACATTTTCCTTTG 

TTCATXX5GAGAAGAAAGGGTTAGGACTCTCCAACCAGGAGGAGACT 

TGTCTCTCCCTGGCTCGTCCACAGTCCCTGGGGTTTTTCCTGCAG 

C CAG AGGTCATGCTGTCAC CTC CTTCT ACAAG ATTT CTCTTGGCCAG C ATGT CT AG TGAG AGGAGGAGGG AGACCAGGGGAG AATG 

CGTGTGAAGAGGGTGGCCAAAGAGGGACTTTCGGCCCCCAGTCCT

TCCGGGGCCTTCCTCATGGGCACCCTGCCTCTCTTTTTCACTCCTTTTT^ 
TGTATTATCAACTTCAAATACAGAAATACCTGTGTATTCAGTTACATGAGTAGCTGTCT^ 

AAT AAAAACAGCTGT CCTGAGGTT AT CTGG CAAG ATGAGGAAAG AGAAACAAAG CAAGT CATT AT^GTACCTT CTCAAGAT AAGTT 
CTTTATTCTCATCCACACrCTTCCATCAGCATCrCCCre 
CGGTTATTAACCAGTACGATAGCAAGGACACATGAGAATCTGTTAATACAAGCACATGC
ACCATTAACACCCCTCCTCCTTCCCAGCGCCAGCACA^

AGGGGAGCCTCAGACCCAAGGCCCTATGCACAACGTTGTGTGATTCCCTTTGAACGGAAAGCTC^ 
ATCTGATAATCTAAACATCATTCCCATTGTTAAAGTGCTTAATTATTCACATGGCACAA^ 
AATCCAGATGCAAACATGGACGGGTAACZACTTAAAATCCCTGATTGTACAGATATGG 
ACAAATGGAAAGCTATAAAACACAAACACTCTTCTTTCATAAAAG^

ATTTTGTTAACAAATGGAGAAGCCAAGTTCAAGGGAAGTCAAGTCACTTGCT 
CTATTTCGGTGCTCTGCCAATGAAAGTGATTTTTATTCCTTC 

TAATTCCAGTGCTTTGGGAGGCTGAAG CAGGGAGG ATCGCTTGAGG CCAGAAGTTCAAGATCAGCCTGGGCAACATAGCAAGATG C 

TATCTCTACAAAAAAAAAATAAATAAAAATAAAAATCAGCCAGGCAT 
AAGTGTGAGGAGGATCGCTTGAGCCCAGGAGTTCAAGGCTGCAGTGAGCCATGATCATGCCA

GCAAGACCTTGTCCCTTAAAAAATTTTTTTTCAAAGAATCTAAAAGGAAAAAGGGA 

AGTGGAATAGGGAAGTTTAAAAGAAAAGGGAAAAGAAAACACACACACAAGAATAGACACT 

TTGAAAGTGAGAGTTGGAAACTTGACCTCTCAGTTTGCTGGAGGAAG 

GCAGGAAGTGAGCACAGTCTTCTTGGCTGGAGAATCTCA^ 
ACC^TGTTTGAAAGCCTAAAATCTGAGTTGACAAGGAGA

CACAGCGGACT AGTAAGGTCTCG CTCTGTCACCCAGGCTGG AGTGCAGTGG CACCATTTCAGCTCACTACAGCCTCAACCTCCTGG 

GCTCAAGCAATCTTTCCACCTCAGCCTCCTGAACAGCTGGGACCA 

GT ACAGACAGGGTTTT AACATGTTGCC CACG CTGGT CT CGAACTC CTG AGCTCAAGCGGTCTGC CTGC CT CAGCTTCCCG AAGTGC 
TGGAGGTGTTTTATGGATTGAGTACTGTTCAGTTGTACC^ 
GAG AGACAGCAGAAGGGG CTGGTGG CT CCCGTGAGG CT AGAAGGG CAG AACAG CAG AG G AGGAG AGGG TTGGAACAAAATTGGGCA
GTGGCCTCTGTGCTTCCTGGCTCCCCAGCCCCAAGCCGCCTCT 
TCCCACAGTCAGACACCACTGGCCAGCCAGGATCTCCCCTCCTGTTGAAAAATG^ 
AAGGCCAAACTGTTTATGACATTGTTTCCCTAAAATGTGCTCAGACACC^^ 
AAAGGAGAAAAAGGATCCCAGCTACTCAGGAGGCTGAGGCATGAGAATT^ 

ATTGTCCCACTGCACTCCAGCCTGGGCAACAGAGCGAGACrCTGTCT

AACATTrCCAGCACTCTTGTCCACCTCCTCAGTTGGAAGTGTAATAAAAAGATTCT^ 

ATCCCAGCACTTTGGGAGGCCGAGGTGGGCAGATCACTTGAAGCCAA^ 

CTCTACTAAAAATATAAAAAATTAGCCAGGTGTAGTGGCATG 

CTTGAACCTGGGAGGTGGAGGCTGCAGTGAGCCGAGATCACACCACTGCACTCCACCCCAGATC 

AAAAAAATAAATAAATAAAAAATAAAAGGATTCCTCTCTCTAGTGTACCCATCTCTCT

CCAAACTCGTCTTTATTCCCTTCCTATT(^CTATCCIT(nACCTCT 

AAAAGAGGAAATGACAGT TT TCriTCTAACAAAAATATGAAACCATCTAAGAAGAAAAAAATGAA^ 

TGTGCAGAGTAAAAGCCATTTAAAAAAAATTATTTAAAAAAACCATGATT1T 

ATACAGGTTAAGTATCCCTAATCCAAAAATATGAAATTCAAAATGCACCAAAATCT 

AAGGGAATGCTCAACTTACTGGTTGAGCAGCCCTATAATGCAAATATTTCAAAATCCT

AT CAT AAG CATTTCAGAGAAGGG AT ACTCAC CCTGAATT AACACAAT C CATTCAATTTAGCT ATTG TTT AT AAAATAAAAACT AG C 
CCAAGCCTTGCTGAGTGGGCATATCTTGATTGCCACCGTTATTGTTGCAG 
CCGTTGCAGGTCCTGGTTTATGGATGGTTGGAAAAGAGCTAGTCCC* 
TCCCACTTTCCTGGCAAAGCCCTCATTGGTATGTTTCC^ 

GAGGTCTGGGCAGAAGAAAGCAGGAGAAGAAGAGGAGTGGGGAAGAGCGGAGGGGTGC^GACAGGAAAGGGGAGAGCGA

CCTGACCATGTGGGTCTGCCAGGACCCTGCATGGGCCAGCTTATGCCCTGTCCCCTGGCT 
GGAGACGCTACATGTTGAGGTTCACATCACCAAGTGTGCTGTGTCrGAGGCTTGGAC 
ATTAAACTTTCCATTGGCAGACATTGGATGAAAACTTTTTTT^ 
TGAAACATGTACTCACATGCGCTCTGTGGAGCAATAGGAGAGTGATTAATTA 

GAGCCAGAGTCAAGCATTAGGAGGCGTTTCAGGGG

AGTTCCTGTGG ACTCCTGG AAGGGACAGAGACAAGGG AGATTTCTG CTGGT CCAAG CTCTGTCATTGCTGAAATATGAAACCAAGG 
ACAGTACrTAAAAGATGCGGCAAGGCAAATGCTTTGGAGAGTGCC^ 
CAATATGTATGAAGAAATAAAAACAGTAAATAGCCTCGTAATCAGTTCA 
TGTAGTCAAGTGAGTTACAAAAACTTCATGGTGTTCAGAGCTCGTTGGATC 

CGCGTTTATCTAGAGCCGGAGGAGGCTGCCTTTCATTCC

CCCCATTTTCAGCCATTGGAGGAGGATGTAGGCTAATTAGCCACAGCGCT 
GGTAAAGGCATGCATGGCACAGAGGCAGGGACCCTGACTCCTTCACCAAACCTCTCTCCCT 
GAGTGAGGTTATTTTACAATAATTGCATTGAAGGCCTACTTCAGTACAGC^ 
GATAATGGCAGAGAATAAGAAAGCATCCTGAAAAGGTCTCGAGAGACATGTGACTGACAGGCTC 

AGGATACACTCAGCTCCACAAGTAACAACACAACCAAGAGGACrGGCAAC^

CAGGCTGTGCTCTGGCCCAGCGCCTCTGGCTGTGCTGACAAGTTTCTAGCTAACGGGT 

TTCTTGCTCCTGCAAATTAACCGGGTCAGATACTGGGGAGGATTCCAGATGAC^ 
CACAGCTTGACGGGGAGCTGGCAGAAGGTGATACAATGAAAA^

AGGTCAGGACACTGGCCACCCCCTAAGCATGGTAGTTCCKjTAGTGTTAAGGC^

GTTAGCGATCTGTCTTCTGTTAGTGCTATGCAGCCCAGTGTTTCAACAGCTTCCCAAGAGTCAGAAAAAC 
CCTGGCTGAAGTCATCAGTAGAGATAGGCCCCGGACATGGTCTATCTCCCCATCAAAre 
GCAGGGCCAGACAGTGCCCAGCCACGCTGGCATGATGGCAGATCTCCAACAGTGGGGAACT 
ACGAATTTATGAAGTTCATAAATGGAGTTTATAAATGCTATAAACAAGGGGCAGCATGCTTTC 

GCTTTATrCATACATTGTTGTCAGTTTAATTGAAGTATAATCTACACACAA

TGTACGTTTTTAAAGGGTTGTTAAAAAGAAGAAGACAGGGATCATATGTG^ 

TACAGGAAAAGTTTGTTGACCCGTTGACCCTGACAGAACATTCCCATCCCACTTGGAG 

TCCCCTCCCCCAGGCAGCCACTGATCTGCTTTCTATA 

ATGTTGCCTGTTGTGTCTGGCTTCCTTCAC1TATCATGTTTTGAGAGTCATTCT

TTTACTCCTGAGTAGCATGGCAGGGTTTGACTCCTGGGGGGATGGGAGTACT 
AAGAAAGGGGATCTGCAAGTGCGGCGGTCAGGGCCTGGATCCCCGGGGTGCCCACTCTCACrG 
AACCGGGAGGTGCTGGAGCAGGTGGAGCGAGGCTACAGGATGCCCT^ 
CCA(nXSCritySAAAAAGGACC<n^AAGAACGCCCCA Cr 
CCCAAiTACCAACCTGGTGAAAACCTGTAAGG CCCGGGTCTGCGG AG AGAGGCCTTGTCC CAGAGG CTGC CCCACCCCTCCCCATTA
GCTTTCAATTCCGTAGCOUXTGCTCCCCAGCAGC

TGGCCCTCATTAATGACACTTGTCCCCAAATCCGAACCTCCTCTGTGAAGCATrCGAGACAGA 
GAAAATGCATTGTATCGATGTTATGTAAAAGGCCAAACCTCTGTTCAGTG 

TCCTTTTTT AAAAATG CAAAT C CTATGTGATTTTAACTCTGTCTTCACCTGATT CAACT AAAAAAAAAAAAGT ATT ATTTT CCAAA 
AGTCGCCTCITTCTCTAAAACAATAAA^^^

TTTATAAATATGAATATATATAATATATATGTCCCTGTACATATACAATGTGGGTGCTAATGTGGAGA 

ACCAAG CTGCGGGACCCAGAGX3GAGGATTTTACTGCAAGTCAG CATCAAAGCACCGGTGTTATTCTGAAAACACCAGTGGCCTCAT 
TTTTGGCTTTTGCAAAGCATGAATTTTT^CATTTC 
CTCrri T CAGGAACCACCCCCCAAGCTGAATTTACAAGTTCTGTTAGCA 
TTAAAAATAAGCAAGCAAATGGCTGATACTACCAAGAGAACTC

CTGAAATGTTTCAGACATTTTTAATTTAATAAACCTGTAACCACATTTAAGT 
CCGCTAAACrTTCTCATGCAACTAAAATTTCTGGGGGAAAT 

ATGTCCTGTACTGCTAACGAATGACTTTCTATATGTCCAGGAGTTCTCCAGTGGAATAACT 
ATGC^CAAAAACAAAAAAGTATTACATTTTTAGTTGCTGTTTC 
ATCCTATTTCTATTGAG Tl'TlT AATACTGACTAGCAACrCTGAAGTCTT
TTTTrAAATTGTTTAACTTTCTTAATTTAGTAATT 

CATGGAAGGTACTCGTAGTTCTGTGGCACATCTGTCCCACACCAGATGGTATGGAA 
ACTTGCAAATGACAGAAACATTCAGAACATCrGGAGAGTTCTTTTCCCTCT 

ATCATT AAAG CAAAAG T AGG CAT ACCTT ACCAT CTG AATTT ATCAG ACATTTTTTCT AACCGGAATTTACTG TGTC CTG CTTGT AG 
GAGAAATAGACAATCACAAGAAGAATCTTTCTTG

GAGCT AACGG AGCGACAAC CTGGTGTGGTAGG AAGGGCAGTGGG TGG ACG CAGG CT ACCTGT CCACG CCTGGCTCTG TG ATC ATTC 
AGCTTCAGGACCCTGAGCCACCTCCTGGCATCTGTCATTTTCAGGGCCTG 
TTAGATCTCTACAGCTCTAAGATCCTGTAATTCTAAGATCACTGATGCCATGCGGGGTC^ 
AAATAACCTTTACATTTAGGGAGGGCTTCCAACATTAAAAA 

TTGTTTTTAGCATTAATAGACTTGCTAAACTGATAAAATTTATTT

GCTGGAGTGCAGTGGTGCCATCTCGGCTCACTGCAACCTCTGCCT^ 

CTGGGATTACAGGTGTGCGCCATCATGCCTGGCTAATATTTGTATTTTTAGTAGA 

CTCAAACTCCTGACCTCATGATCCGCCCGC 

AATTTT AAAG ATCTT AT ACAT AT ATT ATT AT CACATT ATTAGG T AGTTG ATT AACAT ATTCT AT AT CTTTTT CAAGAT AACTTTGC 

TTTTAATCCTCACACCCCACTAAGTTGGATAATTTGCAAATATTTCTGGAGGATTT

AAAAAAAAGAACCTG TTTT AAAT AGAAATT ATTTTT AAAAAAAAGT CTCT T ATAG C CTTTAATTT ATCAGTTGAGTTCAT 

GGTGGACTGTCTTATATGAAAGAAAGCTAAGGACTTGGGTCACCTCAAGGGCACT 

GCAGAGTGAAAGTAAGAGAACATCTGTAAATACAAAGAAGTCAAAGACTTCC^ 

AGAGATATTCCTGAGGTAGACCATTCCAG AGAGTCCCCACACCT CCCAAGAAGCCTCTCTTCCTCAT AAGTTGT ACAAG CTTAGGG 

TGAGGAAACGGAATTTGCTCACCAGAAACTTCTCCTGTTAAGAAAGAAAAGTATCGCCTT

TTTGTCTTCAGTAACCTGGCAGTGAATAAGTATTTGCTTTGTTGCCT 

AGCCTTC CTAATAAACTATTACTTTTTACTTTTTTTTTAAGACAGGGTCTTGCTCTGT C C AGGCTGG AG TGT AG TAG TG CAAA 

CACAGTTCACTGCAGCCTCAACCTCCTGGCCTCAAGCX5ATCCTCCCAGCCACACTAGCTAATTT^ 

AGATAGGGCCTTGCCATGTTGCCGCAAGCTCGTCTCGAACTCCTGAGTTCAAGCAATCCT 

TTACAGACGTGAGCCACTGGGCCAACCTCTTTTTACTTCTTTCAAACAACGGGCCAG

ACTTTGGGAGGCCAAGGCGGGTGGATCACGAGGTCAAGAGATCAAGACC^TCCCGGCCAACATAGTGAAA 

AAT ACAAAAATTATCCGGGCGTAGTC^CATG CGCCTGT AATCCCAGCT ACT CCAG 
GAGG CAGAAGTTGCAATGAGCCAAG ATCGCG CCACTG CACTCCAGTCTGGCGACAGAGCGAG ACTCTCTCTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAATCAAATAAACCTATCGGAGAAGACCATTTGC 

CACTCCATTCTCTGTAGCTCCTGATCTCCTCTGATGATCAGTTCACGGGCTGGGAGGAT^

ATGCTTTAGAACATCAGGGTAGATAAAGACAAATGATGCATTTGGCTCTGCCGTAGTCGTGACACAGAAAG 
GGCATTTCCGTGAGAAAAGTTAAGAGTGTATATAATCCCTCAAAC 

TGTTTACCTGCTTTGAGAAAGATAGGAATCCTCCTGTGGCTAAGTGTTGGCACCAGAGGATTTTATAGTAGCATTAGTGTGGC^ 
ATGTTATCTGCAATTTCCCCGGGAGTCTTAAGAATGTTTACTATATTAATTGGAATGGTTCTT 

TTTT CAACAACAGG CAGAAG AGAATTG T ACTAAAGCC CACAG CAAAAGGGATGT CATCTGTT CATT AGTCATG ACTCAGGACAAAG

AATGTCACATGATAGGCATTTCCTAATAAGTTGCATTGGAATAAAATGATCTTTGGTCAGTTTGTTCCTG 

G TTT AAATG ACTTTTG TTG ACCTT CTT ACCACT ATAT ATTATACAAT ACTG TTGG AATT CAATT ATGCT AT CAAAAAT ATTTGTT A 
GAAGGGAGGGAGGAGACTGGTTTTGGTTTGGGGGCAGGGGAAGG 
ATAGATTACTAAAGCAATTGCCAAATTTTAAATATATTTT^ 
ATAGTACCTTTACTCTATTTGAAATATCAAATAGACATTCATAATTTATGTACACCT
AACTCCTAAATCATACCT CATT CTTTCCCAACATGCAAACT ATT C^ 

T AATT CTG AG C ACTGAAAAT AAAAATAAATGGTTTT C CAGTGTTG CAGGG ACCATT CT AACAAAACGT AAC CAG TGACAAACGAC G 

GCTTCCTCAACAGCCAGAACACTGTGCTCCrCTCCATACGGCTTACT 

TACTGCTCTCCTCTAATCTGTTTGCCCTGCCAGACTGTGAGCCCGAGAACAGAAA 

CACCTAGAACAGTGATGAGCTTCTGGGAGATGCTGGCTGG ACTG ATGGATGG ACTG TT AGG TGGATGAG^

GCCAGTGGTTTACCTTCGAATGACACATCTACCTAAGAACCTGCTAAGCACATGGGACT 

CTGAGCATTATGAATGTGTGGCCACCAGTGTTCCAGATATGATTGAAGTCCCAATCT 

CCCAACTCTTTCCCCAGAACTGTCACAATGCCAGTGAGGATAAACTTGTAC 

CACCAACTGAT ACAT ACT CTTATTTTGATTCCGTTTGGTTTTGAGATGGGGTCTCACTATGTTG 

GGGCTCAAG CAATCCTCC CACTT CAGT CTCCCAAGT AG CTGGGACT AT AAACATGTG CCAC CAC CC CTGGCCACAGT CTTTGTTTT

TTGGGGATGCGGGGAGGCGACAGGGTCTCACTCTGTTATCCAGGCTAAAGTGCAATG 
ATTTTGGGCTCAAGG CAT CCTCC CAC CC CAGG CTCCCAAGTAGCTAGGA 
TTTTTTTTTTTTTTTTTT C^ 

TAGCTTTGTGATTTTATGAAAAGTTGATATTCTACT 

TCAAGTTGTGCCATGTTACCCAGCCTGTGGCTTGTAG

GAGAAG CACAG CGTGAAGATGTTGTCCATAATATTGTACATGGAAAAGGTAACGTTAGCCCCCATTAAT^ 
ATTTCCTCTTTTCACCCAGCAAAGTGCCCAGTGACTGAGA^ 

GTCA.TCAG AG CCACCTGGAAGTAGAGGT CTGGTCAGG TTGG C CTGCTT AACCAAGATGG C AGCCAGGGGAGT AGGAAGGATGAAGA 
CATTGCACAC^TTTGGAGGAAGAAACCAAAGCTGGCATGAGAGGA^ 
TATGACTGT AG ATGGG TGT AAAG ACTT ACCATT CACTC CTCC AAACTTT CGTTGTATACTT ATT ATG GG CT AGGCC CTGGG CCT AG



WO 03/008583 



PCT/US01/51291 



C C CCAGAAATT CAAAGACAG ACAATT AAT ATCT ACAGT CT AGT AAGG ATAAAGACC CAT AAAAATCGTTG CAAT AAGCTTTCAG AT 
GGAAGCAC^TTGTTAGCAAAGTAGTAACATCTAGCTCAGAATGGGGGGGTACA 

CT CTCTT AAAGACAAATGG AAGT AGGTT ACAT AGTGTTT AG CGGGG CCTGGTGCTCCTTG CCTTTGG AT T It y TTTCCTTCTGATTT 
CTAGCCCCCATTAGGGATTACCTG&TGCITTGAATATC 
CAAGCATCTCAATCACTGTCTCTCTGTTTCTCTCCTCT

GGTACACCATTTATTCTGATTAGTGGCCACTAAATAGAGGGGGT^ 

TTGTTGGCT CCAGTGT CTCGATT AGG AT ATGGT CTGAAAT AT ATTCACATTCTCAT AAAG T ATGAGAAT AAAA CT AG AAA 

GAAGCITGGAAT AAT AATAACT CTTCACT ACTT C M 1' C'l'i AATAACAGCTGGTGGG AGGCTTT CT C CT ATTTACAAAT C CATTG TCAT 
TCTTCCACTTACTAGCCAGGCnXaTTTCGCTTGAATC^ 
GGTGTTCTC^TTGTTTTGTTCTGGGTAGTTTTCCTT^

G AAGCAAGAGAGAATGAAATGGGTATT ATT CAGATAGTGTTTTT AATGAAAAGCAT AG ACGTAGT ACTGTGGGGAAAT AT CACTAA 

TCATAAACTAAATCCAGGTATGTGAATCrri-l'Cr'I r i' rilT'UAGACTCCTCCCAATGGGTAAACATTGGAAGA^CCAC^^GGAA 
GCCAAGCCTCTAGAGATTGCTCACX3AACCCAAACCACCCCAAATG 
TAAGCAAATGGCAGCAGTTGCTTAGAAAATTAC^AATGAATAACAATGCTTTCT

AAAGCGAT AGGAAAACCCTGTT AAAAATTGAATAAT AATTTGC CC ACAT CATTT ATG CAAAATGCATAATTTTC CT ACTT CAG TGG 
ACTTTAGTTTGATTT AAAAG ATG AG C!AG CTCTAGG ACT ATTCACT CAT AATTTGTAAGATAATTGGGAAGGGGAAGAGAAAT 
CAGACTGTAATCAGCACAATCTTTCGTAAGTCACTT 

CCTGAAGTT CTCT AAATGGAAGTG CAAAGCATT AAT CATTG TCATTG CT AC CACT AAAC CATG ACCTC ACCATG AG AGGGGGG AAA 
GCCATGTTTCATTGTATTCAGTATTTTTCTGCT

CAAAGCTATAGTTAAAACAACAAGCCGAGAAGAAAAGCAACA 

aaaagctgtgcttggctgggattctgtttcagaatcaaatc 

aatcagtcagaatgaaaagatatcatgtaatccaccttaacagtcagaagct 

actaatgtagattttcagacccagaatggttgaaggtgggcac^aatttgt 

acccatgtcagctgatgtttatatctcttcctctcct^

aaatttttacttatatattcattaattctcttcatccccatct 
acagtgtctcatc^ctgtaatctcagctattcggaaggatgagg^ 
caacatagccagatgtcatctctaaaaaaattaaaattaaaaatt^ 
aggaggctgaggtgagaggggatcacrtgaaccctggaggtca^ 

gcaataaaagaagaccctgtctctgaaaaaaaaaaatttttt^

aacctgtttcctttgttggactaattcctgtctcttctacaaacttg 
aacactt:cttatcggttc<;gtgcccattctggttaagcaaaaggaccttccct 

tatccccaaccg ctgct actggctg aag atttatt agatactagttttgtacctatg cgtcctgccacttgctataaatcgccttt 
atgcaaatcgaaaaagctaccctgcctcaagacactttgctgccatctgccaggaaat^ 

g actagaatgtgaatggtg ccccttagaattgtg cagtacg c aactggcac aact at atgtg acaaccctgttx3tttgaaatcacc

tcaactctcctggaaccctgtgtaaaaatcacaaccaccataggtcagtt^ 
aatcttccaccatgatcagaaattgctgaggggcagtttctaggagagacaga 
gctcttccagcattcagcccaagctttgacctgctacctgggcagat^ 
taagttagaaagctggtaaggctgaatgttggggtgcac<k5ggaggtagacctagct 

ggcrttggcggcitcatttcttgttgat

ctctcactacataacaaaccactccaaaacttagtggcagaaatagaaacagt 

HUMAN SEQUENCE - mRNA

CTrcCTTCTCCCAGrccyiCCGAGGACaSCCC^ 

CAGGGAGGGATTCGGCCGCCGGGCCGGGGACACCCCGGCGCCGCCCCC

CGGGGACCCCCCGGAGCroCCTCGGCCXSCGCCGGAGGAGGGCGGGGAGAGGACCATGTGA^ 

CAGTTTTTTTGAAGAAGCAGGATGCTGATCTAAACGTGGAAAAAGACCAGTCC^ 

AAGTTTGTGATCGTTGGCC^ACATTTT«3AATTTAGATA^ 

GGAGAGGGACGGCAGCCTGAACCAGAGCTCTGGGTACCGCTATGGCAC^GACCCCACCCC^ 

CCTCCATCCCCAACTACAACAACTTCCACGCAGCCGGGGGCCAAGGACTCACC

GGGACCTTGCGTACGAGAGGAGGAACAGGAGTGACACTCTTTGTGGCCCTTTATG^ 
TTTTCACAAAGGAGAAAAATTTCAAATATTGAACAGCTCGGAAGGAG 
GTTACATTCCCAGCAATTATGTGGCTCCAGTTGACTCTATCCAGGCAGA^ 
GAGCGACAGCTATTGTCCTTTGGAAACCCAAGAGGTACCTTTCTTATCCGCGAGAGTGAAACCA 

TATCCCTGATTGGGATGATATGAAAGGAGACCATGTCAAACATTATAAAATTCGCAAACTTGACAATC^

CCCC^CCC^GTTTGAAACACTTCAGCAGCTTGTACAACATTACTCAGAGAG 
TGTCACAAAGGGATGCC^GGCTTACCGATCTGTCTGTCAAAACCAAA 
CAAGAGACTGGGAAATGGGCAGTTTGGGGAAGTATGGATGGGTACCTGGAATGG 
CAGGCACAATCTCCCCCGAATCATTCCTTGAGGAAGCGCAGATCATGA 

GTGGTGTCTGAGGAGCCCATCTACATCGTCACCGAGTATATGAACAAAGGAAGTTTACTGGATTTCTTAAAA

AGCTCTGAAATTACCAAATCTTGTGGACATGGCAGCACAGGTGGCT 
GAGATCTGCGATCAGCAAACATTCTAGTGGGGAATGGACTCATATGCAAGAT^ 
AATGAGTACACAGCAAGACAAGGTGCAAAGTTCCCCATCAAGTGGACGGCCCCCGAGGCAGC 
GTCTCACGTGTGGTCTTTTGGAATCTTACTCACAGAGCT 

TGCTGGAGCAGGTGGAGCGAGGCTACAGGATGCCCTGCCCGCAC^SACT^

AAAAAGGACCCTGAAGAACGCCCCACTTTTGAGTACTTGCAGAGCTTCCTGGAA 
ACCTGGTGAAAACCTGTAAGGCCCGGGTCTGCGGAGAGAGGCCTTX3TCCCAGAGGCT 
CCGTAGCCAGCTGCTCCCCAGCAGCGGAACCGCCCAGGATCAGATTGCATGTGACTCT 
TAATGACACTTGTCCCGRAATCCGAACCTCCTCTGTGAAGCATTCG 

TGTATCGATGTTATGTAAAAGGCCAAACCTCTGTTCAGTGTAAATAGTTACTCCAGTGCCAA

AAAATGCAAATCCTATGTGATTTTAACTCTGTCTTCACCTGATTCAA 
TTGTCTAAAACAATAAAATTTTTTTTCATGTTTTAACAAAAACC 

HUMAN SEQUENCE - CODING 
ATGGGCTGTGTGCAA'raTAAGGATAAAGAAGC^



CTATGGCACAGACCCCACCCCTCAGCACTACCCCAGCITCGGTGTGACCT 

GCCAAGGACTCACCGTCTTTGGAGGTGTGAACTCTTCGTCTCATACGGG^ 

TTTGTGGCCCTTTATGACTATGAAGCACGGACAGAAGATGACCTGAG 

GGAAGGAGATTGGTGGGAAGCCCtSCTCCTTGACAACTGGAGAGACAGGT^ 

TCCAGGCAGAAGAGTGGTACTTTGGAAAACTTGGCCGAAAAGATGCTGAGCGACAG 

TTTCTTATCCGCGAGAGTGAAACCACCAAAGGTGCCTATTCACTTTCTATCCX5 

ACATTATAAAATTCGCAAACTTGACAATGGTGGATACTACATTACC^ 

ATTACrCAGAGAGAGCTCCAGGTCTCTGCrGCCGCCT 

AAAACCAAAGATGTCTGGGAAATCCCTCGAGAATCCCTGCAGTTGATCAAGAGACT 
GGGTACCTGGAATGGAAACACAAAAGTAGCCATAAAGACTCTTAAACCAGGC^ 

AG ATCATGAAG AAGCTGAAG CACGACAAGCTGGTCCAG CTCTATG C AGTGGTGTCTGAGGAGCCCATCTACATCGTCACCGAGTAT 
ATGAACAAAGGAAGTTTACTGGATTTCTTAAAAGATGGAGAAGGAAGAGCT 

GGTGG CTG CAGGAATGG CTT AC ATCGAG CG CATGAATT AT ATC CAT AGAG ATCTG CGATCAGCAAACATT CT AGTGGGG AATGG AC 
TCATATGCAAGATTGCTGACTTCC^ATTGGCCCGATTG 

AAGTGGACGGCCCCCGAGGCAGCCCTGTACGGGAGGTTCACAATCAAGTCTGACGTGTG^^ 

GG T CAC CAAAGGAAGAGTG CCAT AC CCAG^j CATCAACAAC CGGGAGGTGCTGGAGCAGGT GGAGCG AGG CT ACAGGATG CCCTGCC 
CGCAGGACTGCCCCATCTCTCTGCATGAGCrCATGATCCACTGCT 

CAG AGCTTCCTGGAAGACTACTTTACCGCGJVCAGAGCC CCAGTACCAACCTGGTGAAAACCTGTAA 
Table 34

MOUSE NOMENCLATURE 
ICSGNM Nfkbil1
Celera mCG15940

HUMAN NOMENCLATURE 
HGNC NFKBIL1 
Celera hCG22180 

MOUSE SEQUENCE - GENOMIC

CCTCCCAGGCCTTGGGAATACAGGAATGAGCCACTG^ 

CCAAGGAACTGAAAGTCTATTAGTTITTTGGCAAGGTATAATGACATACAGCCAG 
CCAGATTACAATCTATTTTCAAAATCCTAGGAAA^ 

AC AGGTTAGG AAAACAATGTCT AATG CTTGGTT ATGCCG AGCAATGGAAAC CTCAGAGATCACAAGATCT CA 
TTTTG CACATACTGCCTTTC C TTTT CTT C TTCTAGAATT CCCCCTITT AAATCTGTCATGCTGGGCGGTTGTGGAG ATGCTCGCCT

TTAATCCCAGTAGAJSGCIAGAGGAAGGTGGATTTGA 

G AAT AGG AAGT CTG CACACATCT CT CT ACT CACAGGGCCAG CTT ACAAGGC CTTTCAAACC CAG CAGGCC ACT CAC CCAAG CATGG 
AAAT AT AT AAT CTCAAAAGTTC CAAAGG ATG CCAACTGTGAGG CCACAC CACG TT C CTT CAC AG AGG ACACAGG CCACTTGTT ATG 
ACCCCCGGAAAGCTTrAACCACTCTAGAGTTAAA^TCCACACCAGGGCTG 

CCCAACTCTCAAACAATCTACCAAGCAATTCAGGAGAACAGAACA^

ACTCAGGAACCTGTGGCAGGAGCCTCAGGTTCAAGGCCTACTATCTCTAGACT 
TGGTTAAGAGACAGTCGTGAACATAAAAACAAAGGACCCAGGGGT^ 
ATCTGGAATACCAAAACCAACTAACCACAAAGTGTTGCGGTGGTATC^ 
CACAGTATGGAACCTGGAAGGGCAGCCACAGAAGGGTGTGGCACTAACrGAAGTCTCT 

TTCGGCATGTACTTAGAGAAGCGCTCATATTCCnTGCTGATCTGAAAAG 
CTTTGAGGGAGCAAATGGAGACAGACACAGTAAAGTCAGCCCCT 
CATTCCCCTTGAACTGCAGAGGACACCAGCTGAGAGCAC^ 
CAGCACTGCCCCCAACATACCTGCCCAGTAACGGGCTCCAGCTGC^^ 

TGACTTGGCCTGGCACAGGACATCCATCCCCAGAATGGOT

TTTGCACCATGCCAAACCCCTTTCTCCCCAGCCGACTCATCATATGTCACCTTCTCACCT 
TCATCACTCTTACACTCAAGTCAAACCCTCCTACCCCAGCTCCCCAGTTCCGCT 
CACCTTCCACGTTCCTGGCTAAACTTACTTTTCTTACACCGTAACAATCT 
GGTTGCTCTGCITTTGGGTCATGTAACCCTGCAACCTTGAA 

CATCCTGATCGTTAAGGTTTCCGGAAACGACACTTGGCCT

GTTCGAACAGTCAGAATGACAAAAACCAAGCAAAAAAGCCAGGGACA 

CTCTTCTACAGCTGCCGTCTTGGGTACATTCTCAGAGCTTGGCCTATCCCAAATAAATTCAAATCCT 
TCCCTTTTCTTGCACCAGAGGTGCTAAATAAGGTCrCTAC^ 

CAACGATGGCC CGGAGCAGCTCTGGCTTGAGT AGAAAATCTCGGAAGCCGG AG CTATGG ATGG AGACGT AGGAGCCCTTGACGTCT 

TTCTTGWCGGGAGCCTCG^TCCCATCTGCCCCAGCGGCTGTCT

GTTCTCTX3CCATAATTG<^CCGGCAGGGGGAGAAGGGAGGTCTCTGAGTTAGGTGTCCCAG 
GAGAGGGGTTGAAGAACAGCAAGGGGAAAGACACCAGTTTCCGACAGAAGAGCT 
TCATCAGTCACGAATGGACTCGTCACTTCTCGTCGATCCCTACACCCGTCCTATTCTTCC 
AGAAGAGTTAACAAAGAGTCTCATGACATAAGAAACGGGAACACCAGACGGGAGAGACAGA 

TGTAACACACACTTCGACAGGCAGGAGCCAACTGGGAGACCGCAGTAAAATAAGCG

CAAAGGACAAGGGAAGACAGAGACTGCGCTTTCTCCTG 

AG AT AAGG GTCAG C ACTGCG C AGAG CGCT C ACTTCCCAAATGCAAACATCATGTGG CTG CCG CT ATT CT CTTG CTCAGAAAC C CC C 

GAGTGGCCGTCCCAGCGGCGCACACCTGGACCGGGGAACAGAACCAGGAAGGAGGGTGGTGCGGGGC 

AATGGCTCCATTTCAAGAAAAAAAAATTTTTTTTTTGTTTTG 

ACGCGAGAGGCCGCAAGGTTCGGCTCAGCGCTGCCAAAGTC

ATGCAGGATTAAGAGTGTGAGCTGGGTGGCCTTAGAAAGGAACGCCTC 

ACCATCGGCCACGCACAGAGGCAGCCATTCACAGAGAGCrCCAAATAAGGCAGC 

GAGCGAGATCTTCCC^TCCCTGGGCGAGCAAGAAAAGGCCCXXS^ 

CGGCCGCCGCGGCCCCGGGTGGGCAGAGGACCCACTC^AACACCCAGGAAGCGCGGAGACACAATG 

GGATCGTTCCCCCCTTGTCTCCCAGCn'CCCCGCCAAGATCTGCTCTCAGAAAGGGGTGAAGACA^

GCGCCGGCGGCCACGGCGCCGACCCGGCACCGGGTATCAGGGACCCGCGCGGCGGCX5CGAGCCGCGGGAGAAGCA 
ATGGTGCACAGGGCCAAAGCTTACCTAAGCACTGCGAGCGTGTATGGCGGCAGCAAC 
TCCGGTTCCAGGCTCGCCCCGGCCTCCa3CCrTCCCX3GCCGCCAGGGCG(^CGCGCAAGGAG 
CGCTCCGCGGCTGGCTTTGCAGAACGCCCACTTGAAATGAAGCTTCTGACGACGCCT 

TTTTCTTrGCCACGCCAGGCTTCTCCCTGGGCACCT CGTTTCTCACGG 
GGTCCTTAGCCTCGTAGTCAAGGTGACCCGAGAGCGGCTAACTCAGGCCTCTG 
TACAGTCGTGCAGAGAATAGGCAGGATTTACCCAAAATTATATCGAAAGATGGAGGCTTAC^ 
GTGGGGTrTt^GGTTATGCACCrTTCCATAATTCTTGCTAACTA 

CCCCCTCCCCCCATGTGGTATTCCCTTTGAGATGTTACATATAG

GGGCTTTCCCTATTAAGGACACTTAATATTAATACAAAGTAACACT 
TCGCATAAGCTCTCTGGGCTGGGGCTTGGTAGCTTTTGATCCCT 

TAAATCAGAGAGGAGTGGGAATTCCTCGGCTCTGGTAACTTATTACGAACTGTAGCTCCAGAAAGTGCGGTA 
TTTGAAGCGTTTCCAGGAACTCTCCTAGTGACCrAGTTGGCGGG 

CATTGTGGAGGGTAGCATAGTAAGGTTTGTCCCAGCTGACCCAAGTGAGCCA

CACCAAACACAGCAAACAGCAAACTCGTGCAAATGAAGACTTGTGTTTGC^ 
ACACATCTTCCTGCTACTGGACTATGGGAAGTTCCACCCT 

TT AGC7GTGTGG TTT C CT CLVLAGA.CTTT AT AG AAGATTCAAAATGG CCATG CTGACAGT CCTTtSTT AAGGT AGC CAGGGCAGGAAG 
GGATTGTCTTAACAGCTGGCTGTGATTTTAACAATGCTTAAAACCT 
TCTTGCCATTTTTTTCTTCCTTTCTCTGTTACTGAGTITGAGC^
GATTTGAATGCAGACCCATGCCCCGCTCCACTCTTCCT

AGCTAGCAGGGAAACACCAGGGTGTAAGCACCCCAATTTCCTATCACATGTGTCCCTCCT 

AG CT ATTTCC C CCGAAACATTT CCTC CTGATG AGAGTGG AGAAGGCTCAGGAAATTCAGAAAAAAAGTTC CAAGACACGG AACACT 
TCTCCCAGATTGTGTCACCAATAAACAGCTCCTGGTGGCAGGACTCTTO 
ATGGGAAGCTACTCCAGTCTCACTCCTATAAGTAACCTATCACCCACCATACTGCCAA^

AGTGAGACTTGGGTGGTGAG AT AT CTTTGGTG TTTCATGACTG CCTTATTTGGCAT AGATTTTTTTTTTTTCCTGAGACAGGG CTT 
CrCTGTGTAGCCCTGGCTGTCCTGGAATTCACrCTGTAGACCAGGCT 
GTGCTGGGATTAAAGGCATGCGCCACCACACCTGGCI^ATAGAT^ 
GAATTACCTCATTGATTCAAGTTCAGGGTGAAATGAGCAGTGAAGAGAACT 
CACCCCCTCACCCCCTACCJVCCACCACCCCAAC^

AAGTGTGTGCCAAGTGCTATCTAGGCTCTGGGGATACAACGACCTTTATCA 
TGTCAAGCCAGGCGTGGTGGTGCACGTCTTTAATTCCAG 

CCTGGCCTAGCAAATTCCAGGGTAGTCAGAGATACTTAGTAAGATAGTAAGTCT 
CTGTCTCTCTCTCTCTCTCTCTCTCTCTCrCGTGTGTGTGTGTGTGTGTGTCTGTCT 
TCTGTCTGTCTCTGTCTCTGGTGCTGATGAGCAGTCTAAAACCAGGAA^
CTATCTATGCAACTCTCTGTTCAAGAGGGGAC^CAGTACTGTC 
TGTTTTGGAAACTATATAAATGCATAACTCrGAGGACAGCGAA 
GACAATTGACTCATCCACCAGTATTTTATTGACACTCC^ 

GCATGTGAATGTCATAGATATCTTCAGCTCCCGGAAGCAGCAAGATTTGGGAAAACAGAGAA^ 
AC CAGCTCAGGAG AAACAG AGAGGTG CT AAGAG ATTTGAT AT AT ATTTGGGACT CGGACAG AGGGG CT AAAT AACATT ATCATTT A

ACTGTTCXXSGGGCATGATGTGCCAAGGGAGAGCCAGAGAAACAAAT^ 
TGGAACACX3AGGAATAAAAAAGTCAGGAGAGGGGGTTAGAATTCTGGCAGG 
ATCTG^AACTATGCTGAATTCTCACAGATAATTCACAGC^ 

ATGAGGACACTGAGGCTTCCAGGTTAAATAGTTGTCCCAGTTGTGTAACCAGC^ 

CCTGCTCTGAAGTCAGGGTGCrTTCATAACTGTCACCAG^

TTAGTAAACTTCCAGTGGGTGTC^TGGAACAGAGAGATTCGGGTTTGTCGGGT^G 

TATTG<3ATCCTGTC!A£^TAGAAGCCTC CCCTCCAGCAAC AGGTTATTATGTGAAG CGATTTGGTGGGTGGGGGGGATGATATTATG 
GATTGAACCCAAGGCTCTGCGCATGCGACACAAGTGGTTTTCiA 

TCTAGGGATGTGTCCCTGAGCX3ATCGTTCTAGACAGTAACCCGATAGTTGGGGTGGACCTGGGG 

AGAAGCTGAGCCAGGACGCGCTCCCGATTCCTCTGCTGGGAACTCTGCATGCCCT

AGCAGACAGGTTCCCCT^AGAGCCCATGGCCTGGGGGATGGG<K3CGGAGGAGAGAAAGA 
CCCAGC^TGTAGGAGGAAG«3TTCAGAGGAAGGCCTG<MAAGCCT^ 
AGCAGGTGGGCACAATCCTTACTTCTGAGACTGACCATAAAACT^ 
TCTACCCTTTAAAACCCTCAGACCTTACAGATCCCGTCAAAGCTTT^ 
ACACCCTAATTTCTTAGTCACCTGTATACACCTGGCAAGAGGCACCCACCGATTTGC^
AACTCACCGCCTGCTGCTTGCTCTGAAACTCCTGCTrc 
TGCTTCAGTCGCCGGGCCTTCCCTAGAGGCAGAAGGACAGTGAGTGTACAC^ 

GCCCTCTAGAAAGTTATCCAGTTTCCTGTCAACCAACCAAATGCTAGACTGTAAAACCCACCCGTTTCTTCTACCC 
AGTCCCCCATCTGCCCTCATTGTCCTTG<3GCriTTTTTTTCCT 

CTGGATACCCTGGGTCTGACTGGCCATTTTTATCACTAGAGAGGATGCTGTTCCC^ 
CTCCAGCGCCTACTCCTCCGGGCCCTTGATGCGGTCCTCCACTCCCACTGGG 

CCGGG AGGGTGT CTGTCAT CCTCACACTGATG ACTGGT CCTCCTCTCGAACCTCACTG AGGTAGTACCCG AAACGGCCGC CCGAGG 

AAGCGGAGGGCAGAGTGTTTGGGTGTGAGATCTTGATTGGTGGTCTCGAAGGAGACACAAAGGCGATGA

AAGGATGCGCTTGGACACAGTACGCTGGGTTTCACTTACACT 
TCAGCCTCCATCTTCTCTTCAGACITTCCCCCCAAATCCGAACGTCT 
GTGTCTAACCATGTGTTTTTCTTTTrrGATCTACCCATATATTTGGCCTCATTGTGG 
GTGTGTGAGATTGTGTGTGTGTGTTTCGAGACAGG^ATTCT 

5 0 CTGCCTCTGCCTCTGCCTCCCITCX3AAAGCTCGCCTCTCATCTTCCG 

CCTCCGGAGACCTAATCACAAGTCACCCAGTCCGAGCGTraSGTACGATTTCCGGGATGGAACAC^ 
CTTGCGCtSTGTGCTTGCGCCATTTCCTGCATCCTGGCGCGCCTC 

GCCrCTACGTCTGTCTGTCC^TACATTATATTCCATTTCTAGATCGCTCTTTGAATTTGT 

5 5 TGAAAATTGCCCCCATCTGGCCTTACTTGATT^ 

TCTGGAACCTTAAAACAACTATGTATCATTTTCTCAAAAATTACATCAAAAATTAACT 
CAACTCCAACCCCATCCCCACCCTGCXSCCCCAAACTTGCCATCTGCTAAGGGATGAGA^ 
AGATGCAAGGGGGAGGAAATGGGAGGTAAAGGGAGAAACGGGGAAGAGCGGAGCTGGGACCTCACCGCT 
CCTCAGCCCC7VGAGTTGTTCCATGGCCTCCGCITCCCGCCGGCACCGCCGAGAGCGACGCTTCCGGCGGTACTTGT 
60 GCTGGTGCGAGCCCAGGCTCTCCTCCAGCGACATCCCGGGCTTGATGTGGACGCTGGGCAGCCCCCGCCTCT 
CCCGCCACGATGCCCCTGCCCTGTGCCTGCTGCTGCGTCTCGGCGCT^ 
CACGCTGCTGCCCGCCAGGGCCCTC-ATGGTGAGTGGGTCCCTGGGAACT 
TGCGTGGGAAATAAAGGCCTGGTGTGGTGGCGCAGGTCTTTTGACTCTGGGCAAGC^ 
CCTACACAGTGAGCTCCAAGACTACAGAGAGACCCTGTATCACACAACAAAATAGT^^ 

6 5 GAACTCTTGAAGATCCCTGGTTGGGATGTGGCCATACCTCTCTAGTAACAGAAAAGCT^ 

GGTCTCTGCTAGCXSTGGTGTGATTGTTTGTTTGAAGATTTATT^ 
GTGAGTCTTCATCTGGTTGTTCGGAATTGAATTTTAGGGCAACCGCACTCTCT 

TG T AGCTG ACTT CAGGCGTG CCAG AAGTGGACGTCAG ATCTCAAG ACAGATGGTTG TGAGCCACCATG TGGTTGCTGGGATTTGAA 
CTCIAGGACCTTCAGAAGGGCAGTCAGTGCTCTTACCCACTGAGCCATCTCAC^ 

7 0 TACCGAGTCTCTCAGGCTGGCCTCTGCATCCCAAGTGCTGAGG 

TCTTTCTTTACATTTCCTAGCTTTCAGCTCAGTGACTACTTTGT^ 
GATCCC^GCTCC^CTCACCCCACCCCCACCCCCTTAGTrrroCTTCTCT 

ATGCTTTGTCACATGATCAGCTGGG CCGG CCAG AGG ACTTGAAACTTTT ACTTGGTGTT'ATGCCGGAGATCTGCTGCTCT 
GGGATGGACTC^TGTCCCTTGACACTTACCAGGCGCCAGACCA 
7 5 TAAGCTGGTGGGTGCGGT CTGG AGG CGGACCCACTCCGGGCTCTGCAAACTCAAGGCCAGTCTG CTCT ACAT AGTGAGTTCCAAGC 
TAG C CCACGGT AG AG AGACGTC AAC7VAAAC ATCTCCAGT C AGCATCGGT AGTGG CT CCAC AG TG C AG ACAG CCTCTTT CT A

GTTC ATTTGTG TT ACTT ACTGG CAG AT AG CCAGC CCAGAAACAG CCTCC CCGC CT AG AAAGACAGGAAGTTC ACAACACACTC CTG 
TAATCCCAGCCCTCGGGAACACCTCAAGGCCAGTCTGGGCAGGGCCTAAGAGTCTCAA^ 
AGATCTCACCTAACCTACATGGCGGCTATGGTGTX^CTAGTTTTATCCCGCT 
5 TCCACTCTCCTGCCCCTGCTCCCCCAGCCT 

CTGGAAACAGCGTAGGTGTTGCCTTTGCAGTGACAGCAGAAGCAGTTCCCT 
CCCGTCTGTTAAGAGACGGACGGCCTTCTGCCCACAGATCAACAACCCTGCTCACTC^ 

CAAAACCAAAACCAAACCCAAAATCCT CT AACAC1AAATAAG A CTTT AAGCTT AAAG AG AGGAAGG 

CTGCAGCTGCCTGGCTGAGTTTATTATGGGATACCTCAAGAGAAGAAGTCCCC^ 
1 0 TACCACCAGTAGGGCTGTTCC^VAGCAGAGTAGCAAGCCGGAAGCAGCCTCACTTC 

AATTGTGG CGT ATACAG AAAGGCAGCTCTGGACAGCTGCAGCAAG CAAG CTTTGCTTACAGAAGCAGAGAGGTAG CGGTTAG ATGA 

TCAGGCAGTTAACCTTCAGCTCAGAGTCACACTGTACAGGAATTT^ 

CX3TTCAGTTCACCCAAGGTTGTTATAGCACTTAAAG 

CCGGCTACAGTTTAAGAGGCAGCTCCTGGGTGGTAGC^^ 
1 5 ACCTCGGGAGCTTGTTCGCCTCCX3AAACGGAGGTTGTGATAAC^ 

GTTCGTGCGGTTCAGTTAACTTGGCTGAGGATCTCCTAATGGTGGAATA 

TGAAACATTTAACACACCGTGGCGAAAGTGAGGCCTCCCTGGGGAGTTTCGGG^ 

ATGGTCCAAAGAGGAAACCATCCCTCGACACTTCTCCCAAC^^ 

GAGGTGGCGCrrCTAATCCCAACTTCATCTCATTCAGCGGCTGCCCGCTGCTT^ 
2 0 GAT/TGAGCGATTGAACCTCCAGGAGTCTCCCCTCCACTCTCCTGTT 

GCGGGGGGTACCACACACAGTAGGGGGCCAGAAGAGGCTCACATTTTAGGCTT^ 

CATGGAAGATCAGTTTCAGCTGTCAGAGGGACCCATCAGTAAAGCATGCGTCAAACAGT 

TGGACACTGGCAGGCAGTATAGTTCTCTCCGTTTTTCGGGGGAGGGAAAA^ 

AGGTTTGCTTTGGTCTT CI U 1T AGGGTT1 GG CTTG CTGGTTTG CITGGTTrTTrGTTTTlTGGGGGGCAGGTTGTTTGTTTTG TTTT 

2 5 TGTTTGTTTGTTTGTTTGTTTTT ACAT AG ATTCTTTCTG TAGG CTTGTCTCTTCTGGAACTCAGT ATGT AGAT CAGGTT ATC CACA 

AACTCAGAGACTGATCCGGTCCCTGAGCCTAAGTCCAGGGTTGCAGGTGTGAGCCCGCACCAAGCCTC^ 
TCTCTCrrrAGTGGAGACACCACCATGTAGCCCTGGCAGCTCTTGACTGATATTCCTGACC 
TTGTTTTCTGGTTATGTGACACTCACTGTCTGCTGATCCGTC^ 
GGAGCCAGAGACGGCCTTGTTTCAGAATGACTGGGCACAGT^ 

3 0 GACCTGTCACCTCGGCCGCAGTTGTCTTCAGTGACCCTAGCACAGGTTTCACAAGCCTTACCC 

AGTTT ACCAGAGCCAGACATGGTGGCAT AGGCCTG C AATCCCAGCACCCAGG AGG CAGTGACG AGGATT ACAAGTTCAAGGCCAGC 

CTTGGCTACAAGGCCATCCTGAGCTACAAGAAACCCTGC^ 

AGCTGTCTGC^CTTGGCATCAGGAAGTGTTTTCACACCATGGTACCCT 

TCATGTCAGATGCTCAGCTGGATATTCCTCTACTGTGGGATACTTGTCACAGTGTAAGTTAT^ 

3 5 AGTTACCAGGGGTCAGACCCTGATGGGCTGAGCCTGCTTGTTAGAGAGAA 

GGCTCATTGAGCTTTGTTCTTCTAATCTTAAAATTTAGTTTTGTTTT^ 

CCTCAACTCTCATTGTACITGAGACAGAATTTTTTTTAAGAl^ 

ACACCAGAAGAGGGCATCAGATCTCATTAGAGATGGTTGTGAGC 

AGCAGTCAGTGCTCTTACCCGCTGAGCCATCTCTCCAGACCATCTTGTTTGTTTTCCAACCATGTCCTCCAGGC^ 

4 0 GAGTCTCTATGGACTCCCAACTCCACCGCTTATCTTGCAGTGGGAGTATTGGGATTGC^ 

AGATGTGTCCTGGTATGTGGGATTTCAGGTGTGATTGCTGGGGATGCGAAC 

AACCAGCCATCACAGAACMCACC03CACACCCCTACA 

CTGAGTGCGGCCCTTGCCCACAGTGCTGATACGTCAGAGTCATGAGACCACAGGACATCA^ 

4 5 TACTGTTGAAGGGGACGAGAGGTCAGAGGTCAGGCTGTGACACCAGAGC^ 

ACGTAGTGGCTGCTGTTCCTGAGGCAGCACCATCTGGAACT^ 

TAAGGAAGGGCCAAAGACACACAGCAGACTTAAAACAGAGGTTCCTGAGAGCTGAGGG 

AGCTGCC CCACGGAGGCTGGTGAAAC CC CCG AGTTAG ACAAAGCAACAG CTGCTTTGTTTCTCACTGG AATAAAGAG ATGAG AAAC 
ATCTG AACACC AGGG AGG CAGCACAG ACT ACG ATCATG CAGTTCTGAGACTTG CAAATATTTGTAAGTTAGAACAGGAAAGCTGTG 

5 0 AACATGGAAGAGCTGTCCCTGGACAGGACCGTGTGTCACCTTGTGTGTCATAGCCAT 

CAGACCCAACCTTAAGACAATAGAATAGAAGCAAAATAATGGTC 
GGAAATGGAGGGGATTAGGGTGTGTCTGTGTGTGTGTGTGTGTGTGTTC^ 

CCCTCTCTCTGTCT AGGGAG CCTGAAATGTTGGGGTAAAGGTGGG ACTACC CAGATAGTTTAAG AGGT CTCCTTAGGG AAAG CCAG 
AAAG AATCCCTTGAAATG AG CCCCTCGGTGC CTGGAGGGGG TAGG AAG ACA^ CAGC 

5 5 ACTGTCXaCTCTGGCAGAAGACCCAGGTTCZAGTTCCCAGC^CCCAGG 

CCACGTGGTGTGTATACATAAACGTTAGGAAAATGCrGAGACAC^ 
AAATTCTATCATTAGAGCTGAGAAAGGAAGAGAAATTACCTCAGCTC 
TGGTCCTCCCCAAAAGAAGAGAGGGCTCTCCAAGCAGTGTGCAGG 
ACACAGAACAGAGGCCTTACAGCATGGCTGCAGTCTGAGAAGAAAG 

6 0 TTGTCCTGAGAGAGGGTTGGTrTCAGGTTGTTTGTTTATATGGCTGTT^ 

AGCTGTCCTGAAACTCACrrCTGTAGACCAGGCTGGCCTTGAACTCAAAGATCT 
GGCAGGAGGAAGGATTTTTATGGTAAATCCAGTTCCCAGGTTGTGTGAATGTGAAGTGCT 

CCCTCGCACCCAGTGGCATAGCGCACAGGGACAGCACAACCATGAAGGG CTTtKSAAGGGCCGAGTTTGGAGTGCAG CCTGGTGAGG 
TCCCTTCCTTGTTAGATTTGTCCCCACCCCCrTTCT^ 

6 5 TTX3CTTGTTCTGAGTGACACTGGGAATGAGGACCGGGTCCCTGACAGACTGAAGGGA 

CGAGACAGTGGAACCCrGGGATGTTGTTTGTTTTTACCACCCAGGATCTCCT 
CTGCTTCTAAGTTCTCCCTGCCGT^TCCTACGCTGCAGACTCT 
ATGTGTATCAAAGGCAGGGCACAGAGCAGGCGGTGCAGGCTGAGC 
AGACAJGCGTCTTGGGGACATTTITATGGCAGCTTTGGGAAGGTI^^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

7 5 NNNNNNNNNNNNWNnKnnN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^
NNKNNNNNNNNIWNNIWNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNI^^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 CGGGAGGAGGTGGCTGATTTT*GTTTTATG CT AAT ATTT A CTC CTTCATCAT CATCAT CTCTT ATTTTTGTTGC 

TGTCTTCTX1AGACAAGGTCTCTACATAGTCTCGGCTGTTCT 
CTCACCTCTGCCTCTCCAATGCTGGGATTAAAGATTTGTGCTCCCACAC^ 
TAAAACTCAGTTTCAAAGTAACCAAAAGCCCGACAAACTTGGCAGAAAAC^ 
AGGAGCTTTTGGTCAATGCTCCAGGCAGCA^ 
1 0 CACTGCACCGGGTTTCACCGGGTTTCGGTGCCCCAACCAGCT 

GTGTCAGTTTTACTCTCGTCITAGCTTTTCCGTTTTGGAAAAGTCCAGT 

ACCTTCTTCTTAACTAAAAATACCCTTTTCCTTTCCAC 

GTACACTGTCTCATTTTACTCTTTACTTCCTGACCAGAGCCTTGTCT 

tg cagacatgatag ctgactagggggcgcttotctgtcgagagtctttataactgc^ 
1 5 ttaaaaaaaaaatgttgagatagagaattacagagtattccattcctct 

attgtctcagtttaagccccagactggagacatggcccacggtgttgcaacaa 

gattctagatgcagttgttcctgcccctcatcccctcacccccccccaaaaaaaaacaaaac^ 

tttccttggaggaccagggggagaggagaaaaagagagcgggatggagcagaag^ 

agattctgttgggtccagtgatggcagactccgtctagagcagttgt^ 
2 0 gatagtgtttcggcctatgtgagctgccttgtccagggcagagatgtc^ 

tctttgaatgtgtgtggatgagagggagtgccx;ttggagggaagagtcagaggggaaggtggggtgc^ 

aggtgctggtctcggagaagctggtggttgagtgggacct 

cggtatgg^aggcacctgttcttgtccostcctgaggccataaagg^ 

caagtgtaggacaggagtgaaatggtgtgttgtgctgtggccagcctagtctggccagatt^ 

2 5 gttggctgtgtcttttgagaggttgaacatagagtttaaaga^ 

ctcacccagaaggcctcggag^gcagcaggcctctttccagtactgacttgactg 

CAGAGACCTCCTGGTCC^TGGGGTGAAATCCAGTTGGTCAGGCCAGACTGACCTCAAAGA 

AGGGGGCTCACTGCCCCTGAGAGAGTGAAATTTGGGCTACCGACGGGTCT 

AGCAGAGTTTCTTATATGGTTGTCTATGGTGCTGAGGTTCCAGCCCCCAAACCTGTCCC n ' T 

3 0 GAGTGGAAGTGCTGTGCTGTTAAAGGTGAGGAGAGACTTACACnTTTGTCATCAAGGA 

TATTCGCTTCCCTGTGTGAAAGGGTGCTCCGGCAAACATGATGGCACAAACCTCGCGGGAAGGTC 

TGAACATTAAACGCATTGATTTAGGGGCTGGAGAGGCAGCTCAGGGGTCAAGAGACC^ 

GTTCCCAGCACCCAGGTTGGGCAGTTCGCAACAGCCTGTACCTGCTCCCCGTGAGAGAGTACCAGGAC^ 

CCTTGTATn-rr CTGGCT ACTAGTTATA iti Ti 'CT GGT ACTTGTTATATTCAAGT ACACTATAGCTG IT ri'CAG ACACCCCAGAAG 

3 5 AGGGCGTCAGATCTCATTACAGATGGTTGTATTATTTTGTGAGTAACCTGTTCATGTOT 

ATATTTGTACATATATGAAAATACTTTTTAAAAAGTATTTAATCCTTAGGGCTGGAGAG 

TGCTCTT CCAG AG CTC CTG AGTT CAATTT CCAG CAACC AC ATG GT GG CTC ACAAC CAT CTG TAATGGGATCTG ATG C CCT CTT CTG 

GTGTGTCTCAAGAGAGTGACAGTGTGCTCACATACATAAAGTAAATAAATACA 

ACTTCACTCCAGCTITAGGAAGGTAGGTCCTGGCAGAGGCCAGGCGAG 

4 0 CTCTCTGTTCAGAAAAGTCTGGCTGGGACAGGGAGATGGGACT 

CGCCTG C CT CTG CCT CTCT G CCT CCTC CT AC CAATT ACTGG AATT AAAGG C CTGTG CC ACT ATGT CCGC C CT AAT AAAACCATTTT 
T AAAC^T AT ATTT ATTT ATG TATCATTT ATTTT ATG CATACGAGTACACAGT AG CTGT ACAGATGG TTGT AAG C CAT CATG TGG TT 
GCTCGGAATTGAACTCAGGACCTCTGTTCACTCTGGCCC^ 
ACACCCCAGAAGAGGGCGTGAGATCTCATTACAGATC<3TTGTGAGCCACCAG^ 

4 5 AG AG CAGTCAGTGCTCTT AACCACTGAG CCAT CTCT CCATG AAGGGTTGTCATG C ATG CATGCACAG T AAAGC CCAG CAGG CTTGC 

AC^TGTTCCGACTTAGAACACCCACCCTCGGCAGATATC 

CACAGTCTGACAC^CAC^GTAAAGAGAGGTCTGTGTCTCAGCTCACAGTGGAACACT 

GG CAGGTTC CTGGGGG AAAAT AAAAG AT AAAAAGCCATG AGGAGGG AG AG CTGTT AGTT CAC CCCAG C CTGA CTCCACAG TG AAAC 
CTGTCTCATAAAAACACCAAGATGAACAGCTAGCTGCTGCTGTCAGCCTGTGTGG^ 

5 0 CAGTTGGATTGTGGTGACCCATAACAAGGTAGGAGAGTCACGAGGTTGAGACCTAGTCTGC^ 

CCTGGGCTGTGAGACCCTCTCACAACACACACA 
TCC^GTATCTCAAAAGAAAACAAAAACAAAATGAAGCCAGTTGCC^ 
CACAGCTCCAATAACTTGCCGTCTGCACAATTX5TATGGA 
TAAGCTGGATATGTTGGTATAC^CACATACACACACAAAATAAA^ 

5 5 AACAT AT AAACACACAC AT AAG CACACACT ATG CACTC AGACACT CIAAAC ACACT CTCT ACTCTCACAT AAAT ACACTG CATTCTC 

ACACATAAACTCACTGCCTCACACTCCCATAAATACACACTATGTATTGACACACT 

ATAAACATACATTGTG C ACT CTCAC ACACAAACTCTGCACTAT C!AC AC AT AAACAT ACACT GTG CACACAT AAACACTG C ACTCAC 
AT AAACACATACACTTGTT AACAT AAACATACTGCATG CTT AT ACACACATAAAC ACTCAG CACTCACACACACAT AAACAC CACA 
CTCACATACACACAT AAACACT ATG C ATGCTCACACTCACAT AAACAC AC AG TGTG CTCT CACACG CT CCAT ACACAGT CACAC CC 

6 0 TGCAGTGAGCCCTAGTC<5GAAGAAGTTGAAGGACGGAAGCCAGAAGCAG 

GATCACAGCAGGAAGCCTGCCTTGCCCTGATGTGGTTTCAGAGTGG^ 
TC<5CAGCTA(IAGGTCAGCCCTGCCTCTGTGCCAGCCTCCTGGTAAGTGAGCT 

TCCTTCTACTCTAGGAGAGGAAGTAGGGCCCCGGCCCAAATGCTGCCCCAGCCGCTCGCCTCCCTCTGGCT 
TGGTGGTCCCAGCTGCTGGGCTT CCTTCGGTAGGGTGAAGCGATGGCATCTG 

6 5 AGCAGGCACCCAAGGTGCTTGCTCCTTCCCTGCCCTTGACCACCATCAT^ 

CTCACTGCC^CCCTTCCCTCCCCTGGGGAAGGCTGGAACACTCTCCTGAAAACTTTTAT 
GTAAATGTACTACTATGTTTGTGTGTAGTGGTGCAGAACACAGTGTTTTGATATTCT 
TTTTTGAGACAGGATTAATCCTGTAGCCCTGGCTGACCTGCAACT 
AGCACTGGAATTAAAGTGCCACCATGCCTAATGTGATAATCT^ 

7 0 TACTGATAAT AGCTTCAAAATAAAAAATAAAAATAATCAAAAG AGGGCTGAAGAGATGG CT CAGTGGTTAAGAGCACAGATTGCCC 

CTCCTGAGGTTCTGAGTTCAATTCTCACCAACGACATGGTTGCTC^ 
CATG AAG ACAGCC^CAGTGT ACT CACATACAT71AAATAAATAAAATCTTTCT 
GTGCCAAGGGCAAGAAAAC^CGGTTGTAACACTTAGAGOUVACTGAGATTGTCT 
CACTCXTTGCCTTATX3TGATTGTTAGTGGCTX5TCGAG1TAAGA 
75 GGTCTCTGCCCCTAGCTTACACGGACTTCTT 






WO 03/008583 



PCT/US01/51291



GAGACTCCTGGGCAGATTCTGGGCTGGGGACCCCCCT 

ATGGC^GCAGAAGCTACAAGGCGAGCTGGAGGATGAGTGGCAGGAGGTCATCGG^AGGT^ 
CC^GCCGCCCTCCCTCCCGCCCCTGCCCTGTGTC 
AAGTCCGCTAATCATAGCCTCTCTCCGACAACCTCATCCCACCT 
5 CCTTCTCAGCCTGGTCAGAGCGCCTGGCTCGGGAGCATGCCCAGAAGCAGCGGC^ 
CCAAGGGCTGAGGGCTCCAGTCACAGCTGGCGACAGCACGAGGA^ 
GGAACTGTGTGAGAGCCGAGCCAGGAGGGCTCAAGAGGCTCAACGGGACAAAGG^ 
AACACCCACGAGGGGCAGAGAGGGG<1AGCCTCTGGCGCTITGGCGATC 
ATGGCTGCAGCCITGGTGGCCAGGGGCCCCCCTCTGGAGGAACAGGGGGCTCTG 
1 0 GCATCCTGACCGCTTCCTGCAGCGATTCCGAAGCCAGATTGAGACCTGGG^ 
CTCAGGCCCTGAACCGCCACGCAGAAGCCCTrCAAGTGACCCT 

CGGAAGG AAG CAAGGTCGTGGATGGGGATGG ACAAGAAGGGGCAG CTGCGG CACAG AGAGG AG AGGAT ATGGG ACGGCT AACG T AA 
TGGTTGTGGGGGTGGGGTGGGGGAGGCTCTACCACTGCTCTGGACTGCCATTTCCT 
AGTCTCCTCTGTTTTTTCCATTGCTATGGTTTTCATCACCCACGA^ 
1 5 CACACCTGCGGTACAGTCACAGTGCGCTCTCAATGGAGACCAG AAGC CCCTGCCCG CCCTCCTTCAGAAGGCCTGGTTCATCGTCC 

AGACCCTGTCAGCTCCCCACCCCACCCCCGCCGGGGTCCTGACAAGGTTGCAGTTGAAGAAAAATGTC 
GAACAACCCCTCCCTCCATCTCCAACCACATCCTCACCCCTGCTCAGCT 

GTTGGGGGCAGCAGGGCTAGGGGACCACGAGGACCTGGGAGGATAGAGGATAGAACCAGAGGGG^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

IINNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNin^^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

tTONNNNNNNNNNNNNNNlWNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNIWNNNNNI^^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN 
35 tJNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNIJNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN1JNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNGAAGGAAGGAAGGAAGGAAGGAGTGGG 

ACCCACATAGTGGGGGAAGGGAACTGATTCCTACAAACTGTCCTCTGACCTCCATAGTGCCCTGT 
5 0 TAAAAGTGTAATTAAACTTTTTTAGCCAGAAA 

TGAGTTCGAGGCCAGCCTGGTCTACAGATCAAGTTCCAGGACAACCAGGGCTACACAGAGAAACCCTGT 

GG AGG AAT AGTGG ATGG TT CCTG AG AAAGAAC CAAG ATTGG ATT CT AGGTT CTACACACACATG CACACGGACAGGTG T ACG CCTG 

CACAGATGAACATACATACATTTACACACCTGCATATACACACCCGCATATACACACCTACCAAGA 

AAACTTCGTACAAAGGGACATGAAGATGTGCCACATGGTTATTGGTAGJ^CTGG 

5 5 CCTAAAGGGTTGGATGTGGCCTAGAGCAGGAAGGTTCATGGCTGAGGAGAAAGACAGGGGTTCCGTTTCAG 

CAGAAAAGACACTCCTGCACAGTGAGTAAACATGAGGCCACTGTCAGAACCCT 

AGCTAAATCTACAGAGAACTTGGAAGCAGCCTGCCCAAGGTACTACAGTGCTTGGCTCAGAACAG 

TCCTGCAATCGGGTGTGGAACCTAAGAGCTTGGGCATCTCCTTCT 

GCCTGGGTGGGGGTGAGGTCAGCCAGAGAAGGAGCCTCAAGTCAAGTCGTGATGGGGCAGCTACGACTTAGGCCTCAAAGAG^ 

6 0 GCTGGGCGAGAATTGAAAGGACCTCGTAGGTTCACAAACTAGACGATTACGAATTTTAAG^ 

GAGAGTTGCTTTTCACACTGAGGAGTGAGTGGGCGGTGAGCTGAGTGAGGGGCGGTC 
CATCATCTCTACCCACGAGTCTTGATGATGGGGAAAATCTTACC^ 

CAGAGGGAAGGCGAGAGGGAG ACAG ATTC CAGGGAGGGAATTT AAAACTTAT CTTCAAG AAATG ACACCAAATATC CGTGCGTT 
AAACAACATACGATGGAAAAGACACTAAGGACGGGGATGAGGCGGAGGGGGAA 

6 5 AAAGGTTTTTCCAAGAAAGCTGTAAAGGGGAAGCTTTAGTTAACAGAAGGGC^ 

GGCCTTAAGCTCAGTGGAGGGTGGGGGAGGTCGAGGGTTGTGGAGAACATTGX5A 
GCCTGGGGGCCCACCAGCAGCCCCTCGTGACCGATTCTGC^ 
AGATCTTCAAGGTCTTCTTGGTTAAACATCGGGTGGGAACATTTTTCAAAACT 
TTGTGGTAAACAAAAACCAGCAAGAATTGCTGGGGGCTG^ 

7 0 CTGGGATG AG GGAGCTGGGGAGGGGCTGAG ATGG CTGGGATGGGG ATGG TGCATTT^ 

GCCCAG CAATCCT ACCTTGAGATAG AACATGAGGAGGGAAGGAGAATCC CCAGGCACTTGTGGGCTAGCCTGGGCTGTGCAGCCAC 
AAAACCTGAACCTGCCTCAAACAAGGGAGAAGGTAACACAAGCCTCCrcAGGCTGCCC^ 

GG ACACAAG AATTTTT AAAATCAG AAGT AG CCCGGTGTGGTGGATTATG CCTG CAGTCCAGCCTTT AG ACGTGAAGGCAAAAGCAC 
CGGGGGTTCAAAGTCAACTTCAGCTATATATCAAAACCAAGACCAACCTATGCTAT*^ 
7 5 ATAAACACCAAGAAAGAATTCAGGGGGTTGGAAAGATGGCTCAGTCATTAAGAGTGCT 
AATCCQVGCAACCACATGGTGGCTCACAACCATCTTCT

AATATTTAAAGGAAGGAAGGAAGGAAGGJUVGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAA^ 
TCAGAC^TTAAGAGCACTGATGCTCTTCCAGAGGTCCTGA^ 
GGATCTGATGCTCTCTTCTGGCATGTAGGCATACCrACAGGTAGAACACTCATAT 
5 AAGAAAGGAAGGAAGGAAGGGAGGGAGGGAGGGAGGGAGGGAGGNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNIJNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNIWNNNNNNNNNKNNNNN^ 

NNNNNNNNNNNNIJNNNNNNNNN^ 

TTACTCTGTAGCCTTGTCTATGTCTAGTCTGAGCTTCCTATGTAGGCCAGG^ 
1 5 CCTCCCCAGGTCTGGGATTAATGGCGTACACCACCACACCTGGCACCAGTGA 
AGAGAAACTGAACTGATTCCCGTAAGTTGTTCTCTAACCTGCACACATC 
ATTTTTAAAATGTAATTTTTTTTAATGAAAGAGW 

AGAAGAAACAGGTTATTATTCCTAGGGAAAATTCAGAGATCAAAGTCTTAAAAAAAAAAAW 
GTGTGTGTGTGAGTTGTGTGTTGTGTGGTTGTCCCCAGAGAGACCAAAAAAAAAGGATCAGAGCC 
2 0 CACCTGACATG AGTATTGGAAACTGAATTCCTCCTGTCCTCTG CAG AGG CAGTACATGCTCCTAACCAGTGAGCAGTCTCTCCAGC 

GCTGGCCACCTTAAGG CTGGAGAGCTG AGCCCAG AGCCGAGGACAG ACATGTG CACTCTGTGTGGGTTTGAAGATGGTGCCAGGTG 
ATGAAAGATGAAGAGAAACTGAACCAGATGTCCAAGTTGTTAGGCATACAGTACAGATGTTC 
GACGAGTACGGTTCGTGCAGAGTCAGGAGCCTCTAAAGAAGAGAATCCTGTTGGTAGCATGGCCACAC^ 
TAATGCTGGTGTCAAAGGAAGAGCTACCTTCTCCAGAGAGGGCACAGGCGTTGTCTGGTC 
AACAGGGGATGTGCGTAGATATTCC^CAGCAAGAGGACACTGTGAACCAGATTCACAGAAGAGTCCTTC 
GAGGGGACCAAGGAGAAGGACATCCTGACAGGCTCXCTTTTAGAATCCAT^ 

ATGCATTCTATCAACACCTGGCATTTATTGTGGGGAGGCGTGGGACACTAACAGGCCCCGTGGTCCAGG 
CACCATTATGCTGGCAGCTCCTTGTTTCCCTCTAACCCCATCITGC 
TCTCTCTCTCTCTCTCTCTCTCTCTTTCTCTCTCT 

CTGTACAGATGGTTGTGAGCCATCATGTGGTTGCTGGGAATTGAACTCAGGACTTCTGCTC^ 
GCTTGCTCTGGCCCAAAGATTTATTTAATATTATATGTAAGTACAGTGTAGCTGTCTTTAGACAC^ 

TCATTACG AATGGTTGTGAG CCACCATGTGGGTG CTGGG ATTTG AACTCTGGACCTTCGG AAGAG CAGTCAGTGCTCTTTCCTGCT 
GAGCCATCTCTCCAGCCCCAGACTTAACAACTCTTGAGAG^ 

GCCCACCTACCTAAGGACCCTACCTTAGCTCCCATCCAGCTCCATCACCCAGACCCCTGTTTCT^ 
GCATAGACCAGGGAGCCACCTTGTAGACCAGGCTGGCTTCAGACTCAC 
CGA.CCCTGTTCTATCACACACACTCACACCTTCCAAAAGGTCCCACT^ 
TTGTGTCTTCCAAAAGATCACAGTTGAATAAAATGGAGATACTTTGTGTCTTCT 
GCATCTGACCTCTTCAAGTGTCTCTGATGTGGTCAAGTTCAGTTTCTTGCTGTCCATCAGCATC^ 
ACCTATCTGTCACCTGGGCCTTTTCTTCAGTGCTACCCTTGCTGAGAGTCACATTACT 

CACTCCCCACGC CAC C CTG CCTTGTT AT C C CTGT AGG ACTGTTTT ACAG ATT C ATTTT ATTTT ATGG CGATG CTGGGGATGGT C CC 
TGGGTCTCACTC^CGCTAGGCAAGTGCTCTATCT 

AC ATGATCCCT AAGAGTAAAGGATTT ACT CTT AGGATT CTTTGG CTCAACTTTGTG CCACAGTAT ACTTCCTGCCTTGAACAAACT 
TCCTGGTTGGTACTCAGTAAACATGTATTAAAAGCTGTTGGAGGGTG^ 

AGGTAGAAGTTCCATCTAGTATCATGCTTTGTTTGGGGTCAGTAGCTGGATAACACCAGAGACAGG 
GAGTTTATTTGAGAGTCCTITGAGGGGTATGTGCITAGATTTGAGGT^ 
TAAAATCTGAGGCTAAAAAAACAGAGACTCACAGTAGTCTTTGA 
GTGGCCCTTCCTGAGAGTGGTTATAGACGATGCACAAGACTCCrrAGT 

TGGCCTCCCCAAGAGAGAAACTGAATTTCACACAACCTCTAAAACTGAGATTGAAACCAAAATTGGC 
CCAGCGTGCTGCCAGCTCCCGTGACACAGCTTAGCCAAGGCTTACCC 
CCTTTCCTCTGCCCCTCCAGGTCCCAGCCTTCTGCTCTGACTCAAATCTAATCCTCTCTCT 

TCTGTCTCTGTCTTTCTCTGTCTCCGTCTCTCTGT CTCTCTCTG CCTCTCT CTCTGCC^TTTTCATTTTAATACAGCTTAGCCAG 
CGTCTCGGCTACCATCAGTTAAGAGACAAAAGCAACAGGAAGCATAAAGACTC^ 
GATGACACAATGACAGAATTAAATTCTGTCTGAGAGAACTCAGTGAAGAGAGCC^ 
CCAGTTCTGCCAAA 

MOUSE SEQUENCE - mRNA

CCGGGTCCATCCTGGCGCGCCTCCTCCCTCCCCACCTCCCTTCrCCGAGCGTCTTAAGCCCAG^ 
GTCCTTGGGGGGGGCGGGGAGCAGGTCTGATGAGTAACCCCTCCCCCCAGGCCCCA 
6 0 CCCCAGAGTTGTTCCATGGCCTCCGCTTCCCGCCGGCACCGCCTGAGAGC 

GCGAGCCCAGGCTCTCCTCCAGCGACATCCCGGGCTTGATGTGGACGCTGGGCAGCCCCCGCCTCTCCACAGGGCC 

ACGATGCCCCrGCCCJTGTGCCTGCTGCTGCGTCTCGGCGCTGA 

GCTCCCCGCCAGGGCCCTGATGCTTACACGGACTTCTTCCTGCCGCTGCTGAGT 

GGATGG AG AGACT C CTGGGCAG ATTCTGGG CTGGGG AC C C CC CTGGG ATTC TG C CGAAG AGG AAGAAG ATG AGGAGGTTTC CAAGG 

6 5 AGCGGX3AATGGCGGCAGAAGCTACAAGGCGAGCn3GAG 

GACACGCAGGAGCCCGAGTCCTTCTCAGCCTC<5TCAGAGCGCCTGGCTCGGGAGCATGCCC^ 
AGAGGGATCCTGCCGACCTCCAAGGGCTC^GGGCTCCAGTCACAGCTGGrc 
GAGCCCC^GCCAAC^AGAAGGAACTGTGTGAGAGCCGAGCCAC<5AGGGCTCAAGAGGCT 
AGGGCTGGGCCCAGGGCAGAACACCCACGAGGGGCAGAGAGGGGCAGCCTCTGGCGCTT^ 

7 0 TGGAGGGGACCCAGAGGCCATGGCTGCAGCCTTC^TGGCCAGGGGCCCCCCTCT 

GAGTCCAACAGGTCCC^TGGCATCCTGACCGCTTCCTGCAGCGATTCCGAAGCC^ 

GGAGCTGTGACAGCCCTTTCTCAGG CCCTGAACCG CCACGCAG AAGCCCTCAAGTGACCCTAG^^ AATAAGCAAG AACCTGTGGGG 
ATGTAGCCTCAAGAATAGACGGAAGGAAGCAAGGTCGTGGATGGGG 

TATGGGACG<5CTAACGTAATGGTTGTGGGGGTGGGGTGGGGGAGGCTCTACCACTGCTCTGGACTC 
75 GTTCCACACCTCAAAAAAAAAAAAAAAAAAAAAAA 
MOUSE SEQUENCE - CODING
ATGAGTAACCCCTCCCCCCAGGCCCCAC1W3GA^ 

CCGCCGGCACCGCCGAGAGCGACGCTTCCGGCGGTACITGTCTGCAGGACG 
5 CCGGGCTTGATGTGGACGCTGGGCAGCCCCCGCCTCTCCACAGGGCCTGCG 

CGTCTCGGCGCTGACCCTGCCCACCAGGACCGCCATGGCX1ACACGGCGCTGCACGCTGCT 

GGACTTCTTCCTGCCGCTGCTGAGTCGTTGCCCATCTGCCATGGGGATAAAAA^ 

GCIX3GGGACCCCCCTGGGATTCTGCCGAAGAGGAAGAAGATGAGGAGGTTTC 

GAGCTGGAGGATGAGTGGCAGGAGGTCATCGGGAGGTTTGAAGATGA 
1 0 CTGGTCAGAGCGCCTGGCTCGGGAGCATG CC CAG AAGCAGCGGCAGCAGCTGGAGG CAGAGGG ATC CTGCCGACCTCCAAGGGCTG 

AGGGCTCCAGTCACAGCrrGGCGACAGCACGAGGAGGAACAGCGGCTT^ 

GAGAGCCGAGCCAGGAGGGCTCAAGAGGCTCAACGGGACAAAGGGCCAGAGC 

AGGGGOVGAGAGGGGCAGCCTCTGGCGCTTTGGCGATCTGCCC 

CCTTGGTGGCCAGGGGCCCCCCTCTGGAG<3AACAGGGGGCT 
1 5 OSCTTCCTGCAGCGATTCCGAAGCCAGATTGAGACCTGGGAGCTGG^ 

GAACCGCCACGCAGAAGCCCTCAAGTGA 

HUMAN SEQUENCE - GENOMIC 
TAGTKrrrity3GGATCTGGGCCrTGGTA 
2 0 GGAATCAGACTGTCACGACGCT AATC CT ATGG T AAATCT AAAATCATT AATGAGG C CAGG TG CAG TGGCTCACTCCTG T AATCCCA 

GCACTTTGGGAGGCCGAGGTGGGTGGATCACnTGAGGTCAGGAGTTCGAGACCAGCCT 
TAAAAAAAAACAAAAATTAGCCAGGCATGATGGCACACTGTAGTCCCAGCTACTCGG 
TGGGAGGCGGAGGTTGCAGTGAGCTGAGATCGCGCCACT 

AAATAAAAAT AAAAT AAAATAACT AACAT AAG TCGACCAG ATTTG TGG C AT AACAGG AG ATACAGCAT CAC CT ATGAAGG ATTCTT 

2 5 GCCAAAAATGCTTAACTTCAATCAGATT T I TTCTTT T ^ ^ 

ACAATCTCAGCTCACTACAACCTCTGCTTCCTGGGTTCAAGCGATTCCC 

GTGCCACCATG CACGG CTAATTTTTGCA r ITTT AGTAGAGAGAGGGTrrCATCCTGTTGGCCACATTGGTCTTAAACTCCTGACCT 
CAAATAATCCACACGCCTTGGCCrCCCAAACTGCnG AGATTACA^ CCATTGTG CACTTGGCCAGAATCCTCAATATTCA 

CACACCACTGCAGCTGTTTTAAAGTTTCCGGCTTTCTCTGCCACAT^ 

3 0 AAAGTAGTAAGAAAAGGGTGGTCTTGTGTTAAGCATCATCCATAGCCCAATTAC^ 

TGITACACCACAGCAAACTAAAGCTTCTCCAAAATTAAAGAGACTATTGGCCTACAAG 

TCACTCTCAGGTCTCTTT AC CTTGGCTTACCTTGACATTGGGCATGTATTTAGAGAAG CG CT CAT ATT C CTTG CTGATCTGAAAAG 
CC AACTCCCGAGTG TG ACACAT CACCAG CACAGACAC CIT AGG C AC^IAAGT AT ACGG AG ACAT AT GG TAAATGTAG CTCTTCATT A 
TCCCCTCTAGGGAAGTGACTGTCACAAAAACACACCTGGGCCGATAATAAATGACTTC^ 

3 5 CGCTTGCGACAGAACATCCCCCACAGCTGTCAGGTTGTCAAGGGTAACA 

TTAGGACAAAAACIACCCrrrCCCTTATAGTCCTAACCAAAATCCCCrCCCCA 
TCCAGCTGTTGCAGTGTGGCCAAGACAAACACTGCTGTCTTTC 

GGCCTGAGGGATGCACTCATGCTC^ACTAAAAGTTGGGGGGGGAGGAAGATAAATTAGA 
TTCACCATGCCAAGCCCATTTCTTACCACTCAATTCTCAAAGTCTAGTATTTAC 

4 0 CTCTCTCTGCTCAATTACATTCACCTCAAAATCAGACTCT 

GTCCTAGTAACAAACCCCTTGCATCTACCAACCGCTCACCTTCAATGATCCTAGCTCTGTCCT 
TCATGACATT-TGAATACCTGCCACAGACCACTTCTCCTGC^ 

AAATGAAGGACTTGGTACCTGACCCAGAAGCCAGTCATCTCTAAACCAGTCATAGAGG 

aacagagggagactacaggtccaagcaggacctttctggaaatttaa^ 

4 5 agaaaagccagtcacaagaatgaatggc^c>acctc<;aagtcacttttggatcattagcact 

AAGCCTGTATAAGCCTCCTCTATCCAAAATTGTTTTTGATACTTATCCCGATTTTTTCT 
AGTGGTGCAATCACGGCTCACTGAAGCCTCAACCTTCACCTGC^ 
GCCCAGGCTTCTCTTGCACITTTGAGCTCAAGTGGCCACCCTCCTGCCrCG 
CTGCACCTGGCCCTGATCTAGCCTTAAGTATAAACCCTTACC^CCACCTGAGCAACGA 

5 0 AGAG CCCAATGAGCACT ACATG CC CAAGAG AAAATTT ACCTTCT CCACAG T CGACAATGG CC CGGAGCAAC 

TCTGGCTTGAGCAG^AAGTCACGAAAGCCAGAGCTGTGGATGGAGACATAGGA 
CCCATCTCCCCCAGCTGCTGTCTCCACCTCATCATCTTCATAGTCCAAGAGC^ 
CGGC^GGGGAAGAAGGGAAGGGGGATCTGGATGGGTTCTCGCAAAATAGGTGAA^ 
AGCAAAGGAAAACAAAGATACTATTTCTAACAGAAGAGCTGGAGGGGGGAAA^ 

5 5 ACCATTTTTCCATCCCCAGTTCCCACTTTCCCT 

AAAGCAAATATAACAGAACAGAAAAAGCAGTACCAGGGAAAGTGGTTAGGACAGAGGT^ 

GGAAACCAGAAAAGTTGGAAGGGGAAGACCAACTTATAAATTCTTGATCTGA 

GGTAAAATATGACTAATAACTTAGTAAAGTGGAAAATGGAGATGA 

TCCCTATTATAATCCCACCGTTATGGATGCCTAACTCAGCAGCCATCAGTCAAGGGTGATAGATGAGGG 

6 0 GCTCACCTTTCG AAAAGAAAACATCATATGGCCGCCGTCCACCTCCCATAG CTCT 

GCTGTTTAAGCAAGCCrTGTGTAATTAGCATGGGGGGGAGGGGCGG 
GTCTCCACCCTACAATAAGAAAGCTAATAGGTGACAGAGAAAGGCAATCCCCGC^ 
TCACATCACTGTTACGCTACGAAGGTGAGACTCCTTTTGGAGAAACATAC^ 
CTCCAAATTAAGTTGGGCAAGTCAAGGTGAGAAAAATCCAACTGG^ 

6 5 AAAAAGAGCTCAAGAAAGAACAAGGAAGGTGAGAAGAGCCCCGCCCTCCGC^^ 

TCATTC^TGCTGAGGCCTCCAATATGAGAAGAACCCATTGGAAGAAGGGAGC^^ 

GGATTGGGTCCCCCCTTAGCTTCCCTTCCTTCCCCCAGGAGCTCTTTGCT 

ACTAGGCCCCAGCGACCAGACCATCGCCTCTGAAAAGGGTATCAGGAACCCATGTGACG 

AACGGATTG TAG (XJAAGG CCAAAG CTT ACCT AAACAGGGAG AG CGCGT ATGG CG G CAG CAACAG CG ACGAAGGAGGG AAAT CTG CC 

7 0 TTCACTTCCGGTTGCAGGCTTCCCTCTACTCCAGCCTCCCGCCTTCTTtKS 

CCTTCCGCGGTTATACAGATCCGTGCGCTCCAC^CTTGCCTTTGGAAAATG 
TCACTGCTACCTCCAGTAACAAAAGATATAAAGGAAACTAACGTCTCCCCCCCACTGTTAT 
CCATGCCCCTC^TCrcX-reGrTTAGGT^ 
GGCGAAGGCTGCGAGTTCCCACGGGGTCCITGGCCCC^TAGTGA 
7 5 GCCCTAGCTGAGGGGAGGGAGGAGGGAAGTCTCAGGGAACTGGATTGCTCGGGGGT^ 
GGTGGGCGGTATTTTCCCAAAAGGATGCrrcTCCGAGGTAGCTGATGCCCTAGGGCCAGTGAG

CGGGAAGACGGGGTCTGGATTCGGCCCCAAGTGTTAATAGTAGGGCTTGAGGGTTATACTACATTC 

TTGTTTTGAGACAGAGTCTCGCCCTGTCGCCCAGGCGGGAGl^CAATGTCCTGATCTa 

TCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCT^ 
5 AGAGACGGGGCTTCACCCATGTATGACCTCAGGTGATCCACCCACTTCGGCCTCCCAGAGTC 

GCGCCCGGCCGATT AATACTGTT AATT CGAGCAGAATGTTCTTGG C C CCG CCC CAACAG CCCCATTGTTCAACCTGG ATll-lTTT C 

CTGAATGAAACATTTGCTATCCCCGT CTTTGAGATGGGGAGCCACAAAAGTAAGACCTGATGTC CTGCTGTGTAATAAAACAACAA 

ACGTTTGGCCCTCTCCCTGTTAACATACTTAATCATTTAATACTAAGG 

AGCAAAGCAACATATCTCAAGCAGTACGGCTGGTGAGGTTACAGCCAGGATC 
1 0 CCTG CTCAAAGAAT CTGGTTAGT AAAT ACACTG CAGGTT AC CTTATTGGTT CAAATTCTTGGTGAAG T AAG CTTGTCTTCAGTG AC 

AAATGAAGTAACTAATTCAAGAATGGTCTCATAGAAGGTATTT^ 

CTCCCTTGTGCCACATGTTGTCCTAGGATCTGGGGACACAACGGTG^ 

AACACATAAACACAAATATAATGTCAAGTATCGATAAGTGGTCAGGGTGCAGTGGCT 

CGAGCCCGAAGGATTGCTTGAGCCCAGGAGTTTCAGACCAGCCT 
1 5 OVAGGCATAGTGGCACTCGCCCGTAATCCCMCTACTCAGGAGGCr 

CGTCAGCTC^AACTCCAGCCTGGGC^IACACAGCA^ 

GTAAAGGGGCAAAGAGCACrirGCrCCTACACrCCAGCTTTTCTCTA 

CATGTTTCrCACTTTAGAGAAATAATAAAGTACTAC^ 

AGGGAATATGTAAACAAAAAACAAGAAAGTGACTGAAGGCCA 
2 0 AGGCAGGTGGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGACCAAC^ 

AGCCAGGCX3TGGTGGCACACACCCATAATTCCAGCAACTTX3GGAGGCT 

GCAATGAGCTGAGATCACGCCATTGTATTCCAGCCTGGGCAACAAGAG 

GAAAATGAGAAATGATGAGGCAAAAGGAGGCTCCTTCAACrCACCAATT^ 

TGTTACAATGGGAAAGTAAGGTGTCAAGGATATAGAAAGGAAGGGCATGCA^ 

2 5 AGCAATGAGCATCTGAT AAG T CTTTGGGG AAAT AGG AAAGG AGG AAAAT CT AAT AAAGACAAAGATCAG CAAAAGAAAAACAAAG A 

GAGGCT ACAAAATGCAG TT ATCT ACCTGGAATT AT AAG AG AGGGG CTAAATGT AGT CAT CT CCTCTTTTTGGAG AT CAG AAGG TCT 
C1XXX3AAAAG AGAAGAACCAATTTTT CAGAAAAT AACT AGGGT CA CAG AATG AACAAGTGG AATT AG AGAG CCAGTG ATGGACGTG 
ACX1AAACAGCTGTGTAGGTTTTGACCAGTGAGCAGGTGGTGG 
ATGCAAGGAACTGTGCTAGACITTACAGAATGATTCCT 

ACATCGGCCAGACACTGAAGTCAGGATGTTTTCCACIAT^ 
ACTTGGAAGTTTACAAAGTCCCAGTGAGGGTTAAAGAACAACAA 
AAATTATTGGATCCTGCCTCCCAGGATTTCTAGGGGATGGAAAG 
TCTTTGAGGGAGGGAACTC^CAGAAGGAGTCAGGCCCTACGGTGGCCCTAGGCAGA 

3 5 GACGTCGCAGACCATGCCAAGAAGCTGGGCCAGGACACGCTCTCGGTTTCTCTGCTGGGAGCT 

TTGTAGCCTGCTCCACCTCAGCAGACAGGTTCCCCTGGGAGCCCATGGCCTC^C^ 
AAAGCAGAAACAGACAAC^GTCC!AC^CATATGAGGGGAAAGATCCT 

CCATCCCACAGAAATATCCCAACACCAAAGAGATCAACACAGTCCCCTTTCCCCTTAGACCT 
GACCGTAATATCCCCACCACCTCACCATCCATGACCATAAAACTCTACCCTC^ 

4 0 TATACTTTACACAGACTGTGGCATTCCGCCCACAAGCrCTATC 

TTTCACCAATGTTGTGGTCCCTGCCAGGGTCCCCTOVGCCTCAGCCCT 
CTCACTAGATGCACCCACCAACTTGCAGTGGGGTCTCATCCCGACTCT 
GCTCTCGCTCTCTGCGGTATTGCTCCACCTCCATCTGTGCCTCCT 
AGAAAGGACAGTGAGTGGGGATGGACCCACACACACACAATGTAA 

4 5 CTCAGCCAACCAGGTGCCAGATTCTAATATCCATCCATTTCTTCC 

CTTGGGTTTTCCCAAAATGTTTGCTCTCCCCCACCCCCAATTT^ 
CTGGCATCTGCCACCTTCTCAGCrcCCCGCTTCrrC^ 
TATGGCCGATGCTCTTTTCAATGCTGTCAAAGTACCAGATGGCTCCCRCC 
GCAGTCCTCCACTACCCCTGGGTCTTAGTCCTCCCCTCCTCACTCAGCCTCCTG 

5 0 TTGATGACTGGTCCTCCTCTCCAGCACTTCTTGCTCAGGCAGTACCCAAAC^ 

CTCCGGGCTTACAACAGGTTAGCCATCCCAGTCGGAAAGGTCT 

GAAGGAGAACTTG ATTGGTGGT AACAGAG GAAG CAT AAAGG G TTGTG AATG CGG TG AAAAGGT AAGGATGT C ATCATG CAAC CTG T 
GTTGGC AAAAGAXXIATT'CTGGGCTT AATTCTAAACTAACTCT CTACCTTTCTCTCT CTCTCCACCATCCCG CC CCCTCCCCTGCCT 
CCCGTTGTTAACATCTCCATCTTTTTCTACATATTTCTCAAGTCCAAATTTTTC 

5 5 TTCCGTCTTTTGTCTGTATTTTTTCTTTTTTTTGATCTGTCCCTGTTGTTGTCCCACTG 

AATGTGATTTTTATCCTCTCTTTATCTCCTCTATTTTCTCTGT^ 
CCAAGTTAGAGGAGGCGGGGAAAAACCTCCAAATAACTCTCTTTTCT 
CAGTCTTCTGGTTTCAGACGGCCCCTTTAATTTAAGTTCCCTAGTTTCCCCTGG 
GAACGACATCCGGTAACGCCCCTCACAGTTCACTTCCGTCCTCCAC 

6 0 CTCCGCCCTTCCCGCCTCCCGTCTCCGAGCTTCTTAAACACAGGCCTTTGGGCCTA 

AGGTCTGATGAGTAACCCCTCCCCCCAGGTTCCAGAGGAAGAAGCCTCCACATC^ 
ATCATTATTGGAGATTATCrTGTGACTTTTTAAAACTCAGATTTCTGCTGATAAAAAT^ 

TAAAGTTCTTTG TT AAAAATT AAAAATTT ACCTGGGCTCCTG AG C CTT AAACCAATT ATTT AC C CTTTT CTCGAATTTT ACATTAA 
AAAAATTAAACCTCTGATCCTATCACCCCCCTCAAAAAAAATTTTTTT^ 

6 5 GGCCTCATCTCTTGCTGAAGATATTAAAAAAAGACGGAACCAAAGGC^GAAACAAC^ 

CAGCCAAGGCTGAAGTCCTGACTGCTGCCTTTTTTCCTTC 

ATGCTGGGCAGCCCCCACCACTGCACCGG^CCTGTGCCCGCCACGATGCCCCTGCCC^ 

C CTG CCCACCAGG ACCG CCATGGGGACACGGCACTGC ATG CTG CTG C C CG CCAGGG CCCAG ATGGTGAGTCTGCTCAG TGGGGAAC 

7 0 AAGGTCATAAGCAGCTGACCAGACCTGAAATGAAAGCCAACCAATAGTTG^ 

GAAGGAGTTAACCAAGTTGGCATGTGGCTGTCATTTGTCCCTT^ 
AAAGATCAACTGGTCTTCAAATTTCACATCTGTTTAGATT^ 

GTAGCATAGAAGGTAGGCTCTGGAGTTAGATTGCCTGGATT CAAACCCCAGCTCCAAATCCCAGCTCCACACTTCATAG CTACGT A 
TTCTTGGACAGGTTACTTGAGGCTTAGTTTGCCCATGTGT AAAAATT 
7 5 ATAAAGTAACACATTTAATCCTC n .CAA 
TAGAGAAGTCATTTGCCAGAGTTACAGTTATTCACTGGTAGAGC^
TTAACAGCTACATTCTCCTGTCTTATACATTATAGC^ 

TGTTTTGAGAATCAACTGACATAGTACATAATTAGAGTACC1WCACACA03A 

CTGTTTCTCTG TTTrr i GT1TTTGTTT 1"IGT Tl' TTl' ATGAAGTTT CACTCTT CTTG CC CAGG CTGGAATG CAATGGTG CGAT C CTG 
5 GCTCACTGCAACCTCTACCTCCCAGGTTCAAGTGATTCTCATCCGTCAGCCT 
CGCCCAGCTAATTTTTATATTTTTAGAAGAGATGGGTTTTCGCCATGTT^ 
CACCAACCTCAGCCTCTCAAAGTGCTGGGATTACAGGTC 
CATACTCCCTTAACCACAATTCTAA^ 

ACAGAAACCTAATCTGAACTAAAGTAAGACTATTATTTATTTTCAT C CT ACTGATG T CAAT ATTCAT AC ATTTCCCTG CAG AAACA 
1 0 CTCATGTGTTTGGTTCTTGGGCTGCCTAGGCCCTCCTGGGCTACCTAATAT^ 
CCCAAAAACATTTGAATTCTGCAAAACCTTTGGCACTGAAGGAT^ 
AGGTGCGGTGGGTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCAGTCAGATC^ 
ACCAACATAGTGAAACCCCCATCTGTACTAAAAATACAAAAATTAGCCAGGCATGGTGGCAC^ 

G AGGCTG AGGCAGG AG AATCACTTG AAC CCXK3 AAG ACAGAGGTTGTGGTG AG CCGAG ATT G CACT ACTG CATTCCAACCTGGG CAA 
1 5 CACAGCAAGACTCCGTCTGAAAAAAAAAAAAAAGAGTGGTGGAAGC^ 

TG AAAC CTT CTTTCC C CATCT AGAG ATTTCCTT CTGGAG TAAG AACATT ACAGGAAAACCTCT AG AT CCAG ATGAACAAC CCT AAC 
ATCCCCCAGCTCAAGTATAGACAGAAGGCCCCTCCCCCAAAACTCCCCCAAATGGTCAAA 

AACGTAC CTG AGAT AGG CT AGCATATTCAG ATTTGTTTCTTG TTGTT^ TTT ACTG AGTG CAGG CAT C 
TAACTTGACAGCTC!ATATTGTAAGAGGCAGGACCCTGGA 
2 0 GCTCTGCCATCAACAGGGACATGCAGCTTACCTCTGTGAGCCCAATTTGCCTC^ 
TGTTTTTTl'GAGATGGAGTTTCCCTGTTGTTGCCCAGGCrAGAGTGCAATGGCGC 
GGTTCAAGAGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGCACC^ 
GTAGAGACG^TTTCTCCGTGTTGGTCAGGCTGGTCTCAAACTCCCGACCT 
GATTACAGGCGTGJU3CCACCGOCTCAGCCAAAATGCCCCTC 

2 5 TTTGAGACAGGGTCTCATTCTGTCTCCCAGCCTGGAGTGCAGTGGTGCAATCATG 

GCAGTCCTCCO^CCTCAGCCTCCCTAGTAGCTC^GGACTACAAGC^ 
GACTGTTTTGCTATGTTATCCAGGCTCTTCTC 

AC^GGCGTGAGCCACCACACCCAACTCCATTGTATTGAATTTTAAGAAG 

AAG CAGGG C CCAAAAGGTTCGGGAACTTGTCTG AAATCT CACAG CT CTCAGGTCATTG TCTTC CAAAGGGGGAC CCAAG CTCAGTG 

3 0 CCTTCACTCCCAGACCCTC^TGTCCTCTCTGGCCT^ 

CTTGGTTGAAAGGCCCATCTCATTCAGCrTC^ 
GAGTGCAGTGGCATGATCTCGGCTCACTATAACTTCTGCTTCCT 
GATTACAGGCACACACCACCATGCCCAGCTA Al " !^ 
AGACGGAGTCITCCCTCTGTTCCCCAGGCTGGJ^TGCAGTGGCAGTAT 

3 5 TTCTCCTTCCT CAGCCTCCTGAGTAG CCGGGACTACAGGAGCCTGCCACC^ 

GGTTTCACCATGTTGGCCAGACTGCTCTCGAACTCCTGACCTTAGATGATCCACCTGCC^ 
GGCATGAGCCACCATGCCCGACCTAATTTTTGTGTTTT^ 

TGACCTCAAGTGATCCACCCACCTCAGCCTCCCAAAATGTTGGGATTATAGGCATGAGCCACCGTGCCCATC 
. TTGGTTTTGTTTTTGTTTTCTGTTTTGTTTTGTTTTGTTTGAA 

4 0 ATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGAGATTCTCCTGC 

GACTGTAGGCGTATGCCACCACGCCTGGCTAATTTITTGTATTTrT 
GATCTCTTGACCTTGTGATCTGCTCACCTCAGCCTCCCAAAGTGTTGGGATTACAGG 
TTTATGTTTATTTTTCTTAGTATGAAACTCCAGCGTATTAAAGAGCATTGAGAACGGTTGA 
ATTTCTTTrrrrTTTTT^^ 

4 5 GATCTCGGCTCACTGCAAGCTCCGCTTCCCAGGTTCAAGCCATTCTCCTGCCT 

GCCACC ACG CC CAG CT AATTTTTTGT ATTTTTAG TAG AG AC AGGGTTT CACTG TG TTGGCCAGG CTGGTCTCGAACT CCTG AC CTC 

ATGATCCGCCCGCCTTGGCCTCCCAAAATGCTGGGATTAGAGGCGTGAGCC^ 

GGAGAAAGGTGAACTTTTTTTTTTTTTTTTTCCG 

CTTGCTACAACCTCCGCCTCCCGGGTTCAAGCAATTCTCCTGCCT 

5 0 AATTTTTGTAT'rTTTGTAGAGATGGGGGTTTCACCGTG 

CAC CT CAGCTT CCCAAAG TGCTGGG ATT ACAGG C ATG AG CC ACTG TG C CTGACC AC^TGAACT ATTTT ATAAAT AAT ATTGG AACA 

TTTGGCTCATCTAGGGAAAAACAAGAGTCCCACCTCACACCTAATCAAAAACT 

C^AAGGTACACACCAAATTCATGGGACAGATGGCCTGGGGAGGAATGAAACTTC^ 

GTGATGATTTTATTTCCTTAAAATAAATTATACTACAAACATGACAGTACATTAACAAACCCTGTGGTAG^ 

5 5 TGTATTATACTTTGTATTTTTGAGAGTTTTTTAA 

GTCCAGTCGCGCAATCTTGGCTCACTGCAACCTCTGCCT^ 

T ACAGG (ZAT CC GCCACCACGTCAOTCT AATTTTTGT ATTTTT AGTGG AG ACGGGG TTT CAC CAT CTTGG C CAG ACTGGTCTTGAAC 
TCCTGACCTGATGATCCACCCACCTTGGGCTCCCAGAGTGCTGGGAT^ 
TTCTAAAAATAAAAATGAGTATACATCTAAAGTAGTAGAAGA 
60 GAGTGAAGTGAAATAAAAAGTCAGAAAATAATATATTGATATAGATTTAACTACATGAAAATTT^ 
ACCTGTAATCCCACACTTTGGGATGCCAAC^TAGGCAGATCACTTGAGCCAGGAGT^ 

CTGGTCTGTACAAAAAATACAAAATTAGCCAGGCGTGGTGGTACGTGGCTGTCGTCCTGTAGTCCCAGTTACT 

TGGGAGGAT(XCTTGAGCCCAGGTGGGGCAGAGTTTGCAGTGAGCAAGATCATGCCA 

ACCTTGTCTCAAAAAGAAAAAAAAAGCTTTTTTAAAGATAGATAGAAGAAAA 

6 5 AACCCTCATATCAATATATATAATTTCCAAAGAAAAAGTGGATAAAGAATATGAG 

AAGACATGAAAAAGAAAAAAACATTGTTAATAAAATAA 

ATTTTGGTAAAGGAAGACATAGGCACACTTGTATATTGGCACAAACTTATTGAGA 

CTCAGAAATTCCACTTATGAAAATCTACCCCCACAGAAAGACTGTAAGATACTCATGAGGCT 

TCGATATTGGGGTAAGGAGATTTGTTTTGTACACACAGTGGAACACTAATTATAAAGTC^ 

7 0 TATTCCTGTGGAAAAATATAAATGAGATGTTAAATGAAAGAAGCAAGTTGCCAAGCA^ 

ATAAATGTGTCATATGTATGCTTTTAGGTACACCAAGGTGGGACT 
TCGTCCTTCTTTAAAAAGAAAATTTCAACTTAATTATATCCCTTTCAC^ 
rAQCCCCACC^CCCATAGGCTAGAA lT^ ^ ^ 
AGTCTCCCTCTGTTGCCCAGGCTGGAGTGCACT^CATG 
7 5 TGACTC AGC CT CCCAAGT AG CTGGGATT ACAGG C^CATGCTACCACG CCTGG CTAATTTTTGT ATTTTTAG TAG AGACAGGGTTT C 
ACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGA
GAGCCACCACGCCCGGCCTCTTTTTTTCCAACC^^ 
CTCTTTTTTTTTTTTTTTTTTTTTTT^ 
GCAACCTCCGCCTCCCAGGTTCAAGCAGTTCTCTGCCTC^ 
5 TAATTTTATATATAATATAAAATATAATTATATATATATTTTGTTTGTTTGTTTTAGTA 
GCTGGTATTGAACTCCTGACCTTGTGATCCATTGTCCCCCCCCCCAGCCTCCCAAAGTG 

C CTGG CCACATCTG ATCCTCTCT ACAGTTTTGT AG C CAG CTTTTTTCAAACACATG C CT CAGTTG TAGTGGCTGTTT AAT ATT ATG 
TTTTTATAGTTGGAGGCCCTACTCCTTAAAACCTATGAATGTAAACCT 
AGCTCAGGTGGGGAACAAGCGAATGAGAGACAGGACCAGGTACTTTCTGGGTGGGA 
1 0 CAGCACTAGGGAGAAGTGCCCCCCCCACCAACXCTGATGTGATT^ 

TCTGTL 1 1 Cri'GCCTTCCT^CCTTGCCTTCCCCTATTCCAGe l l'i CTGCCAGGTAAACAGTACTTTCCAGCACTACCAAATAAAG 
ATTT AAGGACTGCTAG C CCATTTCCTCTr AC C CCGGGGAAAG AAAGT AGGTCCACAGG AAGGAAGG CTGC CT C CCTCCCC1TCTT C 
TATCCCCCAAGTGAAAGAGGTGGTTGGTGGCCACAGCAGGTGGGCCTGGCCAGGATC 

GCACTCAGCTAACCTTGGG CCAGATGCAGCAAGGTTGGGACTGAAG AAAAGGGGAGTTCAGG AACGTCGGTT 
1 5 TCTCAGCCCATGGGTGAGACCCTCTGTCACACTCCACT CCCTTTCCCCTG CCCTAGGTOU3CAGTCATTTGGAAAGAGCTTG CTCT 

GCCGCCGGCCATAGTTGTTGCCGGTTCACCTCCCCACCCCTCTCCCATCATCCCCnXX^TAAG^ 

AAACATTT ATTTT ACTCAGTTT ATTGATAAGTGATAAT AAAAGATA^ CC CAG AG TAG 

AGCATTCAGATATTTTCCCAGTCTCATATATGGTTTCAGGTGAGAAAATAAG 

GTGG AAT ACAT AGT CAAAGAATTTGGG CACTG C CTTGGTGGCTGG AGG CAGGT CAG AAAATGTGAGTTGG CATGGT AGAACTTCT A 
20 GGATCCTGAAATAGTTTGTGCAGACACAAGTAAGAA l " ^ 

ACTGCAGGCX3CCCGCCACCAGGCCTGGCTAATTTTTTGT 

GATCTCCTGACCTCGTGATCCGCCCGCCTCGGACTCCCAAAGTGCT^ 

TTTAAACTACCACACTTACCATGCATGTGGTAAACTATCAGTCrGTATTT 

2 5 CCATGTGAGCCACCCTTCCATTGCTTTAAGACCAAC^GGAGCGAGTGAC 

AGTCCCAGCTAACTTCTGCTCCCrcCTCTCCCACCAACAGCCTACACCGATTT 
CATGGGAATAAAGAATAAGGATGGGGAGACCCCTGGCCAAATTTTGGGCTGGGGACCCCCCT 

AAGATGATGCCTCCAAGGAG CGGG AATGGAGACAGAAG CT CCAGGGTGAGCTGG AGGACG AGTGG CAGG AAGT CATGGGG AGGTTT 
GAAGGTGAGAAGTCCACTGCTATCCACAGCTGCCCTTCCCCACTC*G 
30 TGCCCCCTCCTCTGTGCTTCCCTCCTTCrrTGGGGCC * 

ACAGGTGATGCCTCCCATCAAACCCAGGAACCTGAGTCCTTCTCAGCCTG^ 

CCAG CAG CAG CAG CG AG AAGC AGAGGG ATCCTGTCG AC CC C CACGTG CTG AGGG CTCCAG CCAG AG CTGGCGACAG CAGGAGGAGG 
AGCAGCGGCTCTTCAGGGAGCGAGCCCGGGCCAAGGAGGAAGAGCTGC<3TC 

GACCGAG AACCCAAGCCAACCAGGG CCGGGCCC AGGG AAGAGCACCCCIAGAGGAGCGGGGAGGGGCAG CCTCTGGCGATTTGGTG^ 

3 5 TGTGCCCTGGCCCTGCCCTGGGGGAGGGGACCCAGAGGCCATGGCTGCAGCCCTGGTG 

GGGCTCTGAGGAGGTACTTG AGGGTCCAGCAGGTC CGCTGG CACCCTG ACCG CTTCCTGCAGCGATTCCG AAGCCAGATTGAGACC 
TGGGAGCTGGGCCGTGTGATGGGAGCAGTGACAGCCCTTTCTCAGGCCCTGAATCGCCATGCAGAGGCCCT 
AAGAAGCAAGAAACTTCGGGGCTGCAGCCTC^GATGAGGCAGAAGGAAGGGTA^ 
GGTGCTGCTCAGCAGAGGATATGGGTGGGAGCGAAAGTTGTAACAAGTGGG 

4 0 CTCTGCCGTTTCCTAATAAGACCTGGTTCCACATCTCACTCCCAGTGTCTCCT 

CATGACATCTCCTTTCCCGCCCCGCCTGCTGAAACCCACA^ 
GAGCTGGAGGCAAGAAGCCTCTGCATGCCCCCTCAGTTCAGCCCTAAGAA^ 

CACTGGGGTCTTG TCCACC CTG CAAT CTGTGG CTGG AG AAAT AG ATG CGAACAG AGG CAAAAAGGGGAACAAAACCAG TTT C C CTC 
CCCTCCCTC^CTCCCCAAGCTGAACCACATCCTCCTCCCCACTTAACACCCCCT^ 

4 5 CACTGAATGAG CG AGTTGGGGGTAGCCGG CG CTGGGGGGCC ATG AGGAGGCTGGGGGAGGATGGGGAATACAAGCAGAATGG CTGG 

AGGAAGAGCCCTGTGGGGGAGTGGAATTTCIAGTTGCTAAAATTAGGAG 
GGGTTGTCTGTTCrTGGGCAACTGGAGCTCCACACCCAAAGCCAGCCAG^ 
AGTTGTATCTCTCTrCCTCCAGGGCCCCAATCCTCATCTCCGCCATTCAGCTGCTGCCC 
CAGCCCAGGCATC^CCTTCCCTATAAAGATTTCCTTTTCCAAGAA 

5 0 TTCCCACGGAGGAGAGAGGGGGGCTCCCTGGAAACTAAGATAGGTAG^ 

CAAAAAGAAAGGACAGGGCTGCAAGGAGAGTACAGACATGTGCTGGTGAGTGCACTGTCT 

AATCAGAAACTTATCTTTCAC^TTTTGAGCCCAGTTCTCTACAGGAGAATCC 

AGGGC^CGCCTCTGGGAACACGGGAACATGGGTGGGCATGAGATCCTTGAATAAGACAGCCT^ 

CTGAAGG ACCAGG CAG ATGTTCAGGGTGCAGGAGGGGGAAGGGCTGGTGAGAAAGATCCTGTG AGAGGAAG CTGCTGTGATTCAGA 

5 5 GAAGAGACTTCAAGCTGTGTGTGACCCTGGCGTCCGGTTCCTCTCA 

TCCTCCACCCAGGAGATTTCTCACTGGTTTTTAAGCACATCATTTCCC 

CCTGAAACCTGGG CCCACCGGACCACAGT CTTTTCAACGTCCCTTCCTGGTGT CTGGCCCCCAGCCCTGGTGGGGGTTCCCCTG AG 
AT AAGGG CTGTT CACTTTCT CTG AC CACATGGTTTCCG CTTCTG TG T CTCTTG TTTC CT AGG CTGAT AAAAAT ACTG AG CCCT AG A 

6 0 GGCCCTGGCTTCCTCTGACCCCTTGGGGCAGGCAGCAGGCA^ 

AAGGGGTGACTCAGTGCCTGCCATCAAACACTGGGTAGGTGGAAGTGT 
C^GTGGAGGATGCATTGGGATTCAATTGGAGGAACAGGCCTGG 
TTGGGGCAGGTTTGGATTAAGTTAGGAAAAGGATCTGGGAGGG 
ATAAATTTAAACCCTGCTTCAAGCTTACAATCTAGTGC^GAGGCAGACCCATACCT 
6 5 CTTTAAGATGATTACAGAAAGAGGGGAGATTAAGTCCATTTGAAAGCATCCAGGGAGCCT 

CTAATGAAT GGGTTCACT AGGTGG AGGTGGATGGAAAG GTT CCAT AG G CGAAT AAC^CGTCTTGAGCG AGCCTGACAAGTCAGAAA 

ATGCAGTATGTTCTGGGAAAGGAGCGTCCTGAGGGAGAAC^AAGCACAGGTGT 

GTCAAGGTCAGAGATTGGACTGCATCCTGTGGGCAGTGGTCCAGGTGACGATGAAAAAGAGGGA 

AGACAAGG AGT AAG AAAACGG C CAT CAT CTTG TG AATTAAT AC CT ACTG TGTG TTAACCAGC CLTTTT C CT AACACCACAAATCCT 
CTTTGGGAGGATCACTTGAGGCCAGGAGTTCGAGACCAGCCTGGCCAACA



GCAGTG AGCTGAG ATCATGCCTCTG CTCTCCAGTCTGGGTG ACAG AG CAAG ACT CTGTATCCAAAAAAAAAAAAAATT ATT AAGCA 
CCT ATT AGGAOCAGGG C A CTG CTTGACAATAGG T AT AAAT AATAAAGTCACTG C CTT CAT AG AACTTG CAGTCTAATGAGACAGT A 
75 TAOU^TAATAACTATACATTATAGTTATAATGTATAGGTATGCCTGCTTAGGGTAAT 
AGCATGACTCrCAGGTGGAGATTTCCAGAAAGCAGGTCTGGAGCT
CCCreCCCAGTAGAGGTTGAGCCTTTGAGTGGACAGGA^ 

AAGG AAGAGAACCAATAAAGG AGAGGTTGGGGAGCAGTCAGAGAAG CAGGGCACGGGGGGCGGGGGATG CCAAGCAGAGACAGGGC 
AGCCATGTTTGAGGCTTCAAAGAGGTCTAGTAAATGAGGGTTGAAAAGATTGT^ 
5 G AAAAC CGTTTC CATTT AAT AGAGGGG CAGAAATC^CTTT ACAT AGG TTG AGGAATG AG TGGGAGGTG AGGAAAAGG AGGTGGTGG 
GCATGAGGTCAGAATGGTGAACACAGAATAACTGAGAATCATCT 
CAGTTCCTGAAAAAGTGGTTCGTTAAGGGGGCTAGTCTTTTGAGACATCACACGA 
GGGAGAAATATTACCAAGGAAGGATCACTACTTC 

HUMAN SEQUENCE - mRNA

CCGAGCTTCTTAAACACAGGCCTTGGGCTACGGCTCTGGGGGTACTTGGG 

CAGGTTCCAGAGGAAGAAGCCTCCACATCTGTCTGCCGGCCCAAGAGTTCCATGGCCTC 

TCGCTTTCGTCGTTACTTGTCTGCAGGACGGCTGGTCCGGGCCCAGGCCCTCCT 

GGCAGCCCCCACCACTGCACCGGGCCTGTGCCCGCCACGATGCCCCTGCCCTGTGCCTGCTGCTTC^ 
1 5 CACCAGGACCGCCATGGGGACACGGCACTGCATCCTGCTGCCCGCCAGGGCCCAGATGCCT 

AAGCCGCTGTCCCTCTGCCATGGGAATAAAGAATAAGGATGGGGAGACCCCTGGCCAAATTTTGGGCT 

CTGCTGAAGAGGAGGAAGAAGATGATGCCTCCAAGGAGCGGGAATGGAGACAGA^ 

GAAGTCATGGGGAGGTTTGAAGGTGATGCCTCCCATGAAACCC AGGAACCTGAGT CCTTCTCAGCCTGGTCAGAT CGCCTGGCCCG 
GGAACATGCCCAGAAGTGCCAGCAGCAGCAGCGAGAAGCAGAGGGATCCTGTCGACCCCCAC 
2 0 GG CG ACACGAGG AGGAGGAGCAGCGG CTCTT CAGGGAG CGAGCCCGGGC CAAGGAGG AAG AG CTG CGTGAG AGCCG AGCCAGG AGG 

GCGCAGGAGGCTCTAGGGGACCGAGAACCCAAGCCAACCAGGGCCGGGCCCAGGGAAGAGCACCCC^ 
CCTCTGGCGATTTGGTGATGTGCCCTGGCCCTGCCCTGGGGGAGGGGACCCAGAGGCCATGGCT 
CCCCTTTGGAGGAACAGGGGGCTCTGAGGAGGTACTTGAGGGTCCAGCAGGT 
CGAAGC CAGATTGAGACCTGGG AGCTGGGCCGTGTGATGGGAGCAGTGACAGCCCTTTC^ 

2 5 CCTCAAGTGACCCTAGGGAAGAAGCAAGAAACTTCGGGGCTGCAGCCTCAGGATGAG^ 

GACCACAAGGAAGAGCC!AGGTGCTGCTCAGCAGAGGATATGGGTGGGAGCGAAA 
GCCACCACIGCTCCTTGACTCrrGCCGTTTCCTAATAAGACCTGGTTCCACATCT 

HUMAN SEQUENCE - CODING 

3 0 ATGAGTAACCCCTCCCCCCAGGTTCCAGAGGAAGAAGCCTCCACATCT 

GCCTCGATGTAGATGCK^CAGCCCCCACCACTGCACCGCXSCCTGTGCCC 
CTOGGGGCTGACCCTGCCCACCAGGACCGCCATGGGGACACGGCA 

3 5 GGGGACCCCCCTGGGATTCTGCTGAAGAGGAGGAAGAAGATGATGCCTCCAAGGAGCGGGAATGGAGACAGAAGCT 

CTGGAGGACGAGTGGCAGGAAGTCATGGGGAGGTTTGAAGGTGATGCCTCCCA^ 

GTCAGATCGCCTGGCCCGGGAACATGCCCAGAAGTGCCAGCAGCAGCAGCGAGAAGCAGAGGGATCCrGTCGAC 

AGGGCTCCAGCCAGAGCTGGCGACACGAGGAGGAGGAGC^GCGGCTCTTCAGGGAGCGAGCCCGGGCCAAGGAGG 

G AG AGCCGAGCCAGG AGGGCG CAGG AGGCTCTAGGGG ACCGAG AACCCAAGC C^ CCCAGGG AAGAGCACCCCAG 

4 0 AGGAGCGGGGAGGGGCAGCCTCTGGCGATTTGGTGATGTGCCCTGGC^ 

CCCTGGTGGCCAGGGGCCCCCCTTTGGAGGAACAGGGGGCTCTGAGGAGGTACTTGAGGGTCCAGCAG 

CGCTTCCTGCAGCGATTCCGAAGCCAGATTGAGACCTGGGAGCTC 

GAATCGCCATGCAGAGGCCCTCAAGTGA 
Table 35

MOUSE NOMENCLATURE 
ICSGNM Flt3 
Celera mCG12743 

HUMAN NOMENCLATURE 
HGNC FLT3 
Celera hCG96642 



MOUSE SEQUENCE - GENOMIC

GCTCTGT ACAAC^TCATTTT CCAGACAAGGCCAG CC AC CTGTG TT CAGGG ACACTCT CCAG ACAG AG AC AGGG AAGCCCCACT AAG 

CATATACTCAGATGTCACCGGCTTTTCrCAGACCACATTGCAAGG 
GCTGCCTATGGATGGGGCAGTCCTCACGTCTATCCTTGGAGTCCCAGAACC^ 
1 5 GGGAATGTCCTTAGTGAGTCATGACAGTGGGCACTGTACTCA^ 

AACAGCCACACATAGGTGATGCCACATAGTGTGCAGGCTTTCGTCACGTGAGCAAGC 

GGGG AGG CCTTGCTCATGT AT AT CACTCAC CACATX3G CTCTTTT ACTTG T ACCTGGAGGAG ATT AT CTT ACAG AGAG CCTGTCC CT 
GGGTCCTCAGCAAGAGGAGTATGTCCTCTCAGTTTCTGAGTCATCTCTCT 
ACAAAAGATGATGGGCTCTGTCTGAGTTCrrCTTCTGTTTCTTTTT^ 
2 0 CTTTCATTAAACTGACAAATTTTTAAAAAAATCIATGTATGGTAGG 
TTAGATGGGGCTAGAACCATATATATATGTATATATATGTC 
TATATATCATAACATGATATATGTGATATATGTATATATGAT^^ 
ATATCTAATTTGTAAATTTCCTTTTTACAGAAAAAGGGTTTATAAACGCTA 
GAAAAGTTCTGCTTCTCAGTCAGGTTTAAAGCGTACCCACGAATCCGATGCANNNN^ 

2 5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CCGGTATCGAGCACCTCTGTCCCCGAGCTATAGACTTAGCCCCCTCAGTCT 
ACTGAAAGCAAACCX3CACCTGTGAGGAGAGCCACTTTTCCTTCT 
AGTTTTCTGTCAAGAGACTGTGGTTTCCTGTCTAAGTGCTGCGCTCT 
GGAGTCGGACCTCCTTGCCAACTGAAAACATCTGCCCrAGGTGTACTGATTTATGT^ 

3 0 CATAAGTCACAAAAGAGGATCTCCTTCTCCTGGGGGTTTGTTCAACTCT 

CTTTGAAAAGCAAAAGCCTGCTTTC 

GGAGTCACAGTGGCTGTGAGCTGTCCCACGTGGGTACTGGGAACCCAGCTCAA 
TTCGCTGCACCAAATCTGCAGCACAGCTCATGGAGAAACACTGTCTCAACAAC^ 

3 5 AAAACAGGAATGTAGAACATCTCATTAATAATTTTCACACTGAGGCT 

TCAACGAATGCACTCAGCTTTCCCCTCTCTTTACTT CAT ATACAT ATCTGCAC^GG ACTTAAAAGGAAAACGAGGCAGCTGT CTGT 
TGCTCTGGTCACTGTGACAAGAG CTGACCTCTTTCTACCCTCTACG CTGCTCCT AGTGTTTCTTTCATG ATAGTATGCTTTITTTT 
AAAAAATTCCCATAAGTGATTTATAATTGGTCTTTGAGAAAGTGT^ 
TGTATGTCTGTGTTTATTCTCTCTCTCTCTCTCTCTCTCTCTCrrCTCTGTGTC 

4 0 GTGTGTGTGTAGGATGGATTCCCTGGATGGAACTGGACTTATAGGCAGTTGTGAACCACCT 

AGGTCTTCTGGAAGAGCAATGCTTGCTTTAAACCCC^ 

TGCGGTGTATATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCGCGCGCGCGCGC^ 
TCAACATCCTCTGTTGTGCTCCACT G TTTTTTAAAACAATACTTCATGTAATGCTTATGAGTGT 

CACGTG TGTCTG CTGGGG CAT CAGG TCACTTGGAAATGG AGTT ATGGGTGGTTGT AAGC C ACCATGTGG AT CCT ACGAAT CATG CT 

4 5 TTTAACCTCTGACTCATCTTTCTAACCCTCTC^CTATTTTTTTTT^ 

TTTGG CT AGACTGG CTG AC CT CCAT ACCC CAG CACT AAGG CT ACAG AG CACG CACT AAGGCT ACTG C ACT CAACTTTT ACATGGTG 
GGGATCTGAATTCAGATrCTCCGGCTATACTACAAGCTTTATATACAGAGCCACCCTCT 
TTTATTTATTTATTTATTTATTTATTTATTTATTTATTGGTGGGGAGAAG 
CCCTGGCTGTCCTGGAACTTATTATGTAGACCCGACTGGACCTTACACTCACTGA 

5 0 AAGCTATGCACCACCACCACACCAGGCTCGCATAAGCTCTTATCCAG^ 

GACCATGGAGCCTGTTTTCAGGATAGCTGGCTTGCTGGCCC^ 

GCATTCCAGXSATGGTTGTGAACCTGACTCAGGTGAAGGAGGGGATAAACCTGGTTTA^ 

CATTTAATCCTAGCAGTCCAGGGGCAGAGX3TGGCTCATCTTTGAGTTCT 

CTGGACTATACAGCAAAG*GGCCATGTGAATGGGTAAGAAGCTG^^ 

5 5 ACCTGGTGACACACAGCAGACCCTCrTGGGGTATCACGGAGCTGTGTCT 

TTTCATCCTGAGGCTCCAACCTACAAGCCTGAACTCCAGACACGTCCCATCCCAGCTAGAGCC 

GTCACCTGAG AG CAGCTTCC CAGATGTGACATCT CAG CT CT GAG TTT AG AAC CTCC AT AAAGAT GG C CAGG CTGTCATCT CT CT AG 
CTCCX3TCCCGTGCCTCATACAGGAAATGTGTGTGTGAACCAGGAAACGGACTCACT 

6 0 CATGG CAGCAAGG AAG CCTGAGGT AAAG CAG TCAG CG CTG AAC AATGTTTTT CCGTGTTCT ATGAT C CTTCTCT CTG CTG TG T AGG 

CTCTC^TTGAGGTGAGAGGTAACTCAAGCTCTATTCAACTTTAATGGTGGAAAGTTTCAAAT 
CATCCACTCACTCTCACTCCAGTCCAAAGATTTACTTATTACTATATGTAAGTACACTATAC 
GGCATCAGATCTCATTAGGGATGGTTTTGAACCACCATGTGGTTGCT 
TCTTAACCGCGGAGCCACCTCTCCAGCCCAC^AGCTGGCATCCTTAGCTCCCCACT 

6 5 GATGGTAGAGGATCGAGAT(^TGTTTTGAGAGCTGACCTGAGTGGCGGGAGAGTTGCTGGTTO 

TCTGTGTAGTAGCTGAACTACTGCAGAAGCCCCGATGCGGTCGTGX3CACACGCCAGAAGTCTCT 
TCATTTCTGCAATTTACTTCCTCAAATCAGACCCTGAGGCT^ 
TGGAGCCCTTTTACTTCTGAAAAAGAGACTGAAGGAGATTCACCCCTT^ 
TGAATCCGCATTAGTGTCGTGGTAGAAAAAGTATGAGCATGGCATAACTTT^ 

7 0 TCAGTCCATTCTGTCCCCTCrGLAGCTCCCACT^ 

CTGCTAGGGGCCTGCTCAC^ACAAC^GTGGCCACATAGAAGCTCCTGAAC 



GAACCCAACTAAATCTGTTTTCCCTTTAl~rcAiijCTGCCTGTAA 

7 5 TTATATTGTTATATCTTAGTTACCATTTTTTAAAGATTTATGTATTTTATTTATATGACT 
CAGAGCCCACTCCAGATGGTTGTGAGCCACCATGGGGTTGC^

TTTTTTTTTTTTTTTTI rTA ACATCTGCCACr CTTTTTTTTTTTTTI 1 1 r T AG ATTTATTTATTT ATT ATATGTAAGTACACTGTA 

G CTGT CTTCAGACACT CCAG AAGAGGGCGTCAG ATCTTGTT ACAG CTGGG ATTTG AACT CT 

GGACCTTCGGAAGAGCAGTCGGGTGCTCTTACCCACTGAGCCATCTCACC^ 
5 CTCCAGCCCCTTAGTTATAATTTTGGTAGCTATATAGTGTTAAAGGTm 

GTAGCCTGGAACTGGAGACATAGATCAGACTGGCCTCAAAXTrCACAGAAATGACCACCT 

TGAATCATAAGCTGATTCCAGAGCCCTTAGCCAAGGTTGGCCTTTTT^ 

AGTGACCTGATGGTAGAGAGGCCCTCTTCCTAATTGTAATGACACTC 

AATTTCCTGATGCCAAGTGGGAAAGGGGAGGTGGGGGAACCTGAAAGAAAGGAGG 
1 0 TCCTTGCTGCTX3GGGTCATATTCATCGGAG AAAGCGAGGTGTGGTGG CACAGACCTGTCACTCTAG AACACTGGAGGCTG AAGG CA 

GGGCAAATGCAGGTCTGAGGCCTCCCAAGGTCAGACAGACAAAAGAAAGTG 

CAAGTTATGTGATCGATATCCTGACCTCAGGGGGCTCTGTCTGTCTGAGGCGAGGC^ 

AGAAAAGACAAATCATCATGCAGTCCTGGGCTTTGTTCCACACAACAC^ 

ACACACACACACCTATTGTGTGGTCATTAGTTTCTCTCCCAAGTT 
1 5 CACCCAGTCCCTCTCCCCTCCCGGTCTCCATCACTCTCTCT 

GAGCCACATTGGCCTTGCACTCTCTTTTTAAAGGATTTATCTATTT^ 

CTGTGTCCATGTGCACACCCGGTGCCCATGGAGGCCAGAAGAGGGCATTGGATGCCC^ 

GTGGGTGCTGGGAACTGAACCTGGATCCTCTAGAAGAGCAGCTAGCACTCT 

CCTCCCATTTCTGCCTGCAGAGCACTGAGGTTAGAGGCATAC^ 
2 0 CTGGTTGGCCTTTGGGGTTCTCCCACAATCTTTCCACCATTTCATCCCTCCCCCCCAACCC 

AGGCCAGGATAGAAAATGGGGAATAGGGCTGGAGAAATGGCT 

AATT CC CAG CAACCACATGATGG CT CACAACCAT CTGT AATGGG ATCTGATG C CCTCTTCTCGTGTGT CTG AAGACAGCGACAATG 

TATTCATATATGTGTGTGTGTATATATATGTATGTATATATATATATATATATATATATATATATATATATATATATATATCTT^ 

AAAAAGAATAATTTATGATATTGAGCCGGGCATGGTGGCGCIATGTCITTAATCC 

2 5 TGAGTTCGAGACCAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGG 

AAAAAAAAAGAAAGAAAGAAAATGGGGAGTAGTGATGTAGGT<^GATGGTAAGGTGCTTGCCTCA 
CCCTGCTTCCACAAAACCAGATCACTCAGAAGGCAGAAGC^ 
GCCAGAGACTCTCTCTTTTACAAAAGATACTGGGCATAGTGGCACAT^ 
CAGAGCTACATTGAGAAACTCTGTCTCGAAAAACTCTTCCTTCTCCT 

3 0 T CCC CTAG CCG CAC CCCCTG CAG AAAAAGAAAGAAAAGAAAAGGGGTAAG AACT GGAAAAAAAAAT GTATG CAATT CTCAG CCTTT 

AATTGCCGAAAGCACTTTATTTTCCTTTGGGTGTTCCTG 
ATOCTTCACCTGCAGACCAATC^CTGGAGGGCTCCCTC^^ 

TCTGACTAAGAGATGGGCCGCTTAGAGCGCTGGTGCCCAGGCACCCCTGGCTCAGAGCTCCAT 
AACAGGGAGCCTGCAGCTGCCTTATTGCATATGCACACCAGCACCrGCAAGTCTG 

3 5 CCTTG CCTTT CCCTCTGTAGGAAC ATT CTCTGAAACCTCAAATTCCAACTTTAATTCTTTAGGG 

TATCTGCTCTGTTTCCCGTCCCT^ACGTGCTTTAGAGACGGTAGCTACT 

CCG CCTCAACCT CG CAAG CGCTG AG ACG CAGGCGT ACATC^CCATGCCCACTTCCCATAACTGTGTCTG CTTCT AACATC ACG CTG 

TGAATTTTAGGATCAAGAAAACACTCTGTCTTCAGATCTC 

ATGAACTCATGGGCTCCTAGACTGTCACTACCATCTCTAGACTTCCGGCCAAT 

4 0 GTTGTACATGCTATGGATTTTCTTTTCAAACGTC^ 

CCTAACCAATGGACCCCATCCTACCTCTTAGCCCTCTTTGCCCAGACACCCCAGCCCCTC 
TTCTCTTCCATGCCCTGTGACCrTCCCGTCCTCAGGATTCTGGTGGCATCTCCTTAGCGCT 
GGACTCCCGCCTTGCCTCTGAGAAATCCACAGAACTTTCTACGCCGCCTCTGCCTCT 
GCTTTAACTGCTTCTTTTGTGGGGTTCCACAGCTCACTTG^X; 

4 5 ACTG AC AC CTCGTTG CTG CGTGG AGCCAGG AG TG ACTG CAAG CCTG TCTG GCAAGG CCTTGTGGG AAGT CAGTGG ATGGATG AATG 

TGCCAGGGGATTCTX3GGATGCCCCACTTCCTCTCTTTCG 

GCCCCAGAGCAACAGGGACAACCAACAGTGGGTTGAAGTTGTGTGCTCCAATC^^ 

CCCTCTGTTATtjGTGACAGAAGGCATCAGGGGAGGGGGACAAATTA 

GCCCATTAATTTGTACTTTTAAAAATAAGATTTATTTATTTCATATATGTGAATAC^ 

5 0 GGTTGTCMTCCCATTACAGATGGTCGTGAGCCAC 

TCTTAACCrCTGAGTCCTAATTTGTACrTTTTTT 

AGTTClTCTTTAATTCTrTCTCAAACAGCATATCTAAATTTTGCGATCATAAGAACAAGCCAG 

AATGATGACGCCCAGTTCACCAAAATGTTCACGCTGAATATAAGAAGTAAGTTGAATCTC^ 

ATTTTTAATTAAGAAGAGCCAAACACAGCTTAGGAGGGACC^ 

5 5 ATATGAGTGTTTTGTAAGTGGT ATGT ATGTCTCTGGCCCACATG CATGCIAGTGC CCACAAAGGCCACAAG AGGGCGTCTGATCCCC 

TTGGAACTTGAGTTACAGATGGTTCTGAGCCATGATGTGGGTGTTGGGAAC 
CTCTGAGCCATCTCTCCAGCCTCTGAAACTGCTCTTGTAAGCGCCATGCTTCT^ 

TATACTTGTTCT CTGAG CT AGTACATGTT CCAT CTC CG ATGTGCTG TG ACTTT ACTG C CTT GG CG CT ATG AAAGG AACACT AAGG A 
AAGG CT AT ACAG TCACCTG CT ACTT ATTTCTTTT AATG ATTTTCGTCTTGGT CT ATG CAAAAT AAG AAAAAGACACT AT ATTT CAT 

6 0 T AAGGTTTTTG CAGATTTTTTTT AC CTG CATTT'ITTTTITTTACTTG CATGTATGTCTGTGTGAAGGTGCCAG ATATTGG AGTTAC 

AGACAGCTATAAGCTGCCATGTCGATGCTGGGAATTGAACCCAGGTCCTCTGGGTG^ 
TTTGCAGATAGATGGAGGTTCTACACTTCAGCTCCCTCTAGGAGGTGAGT 
TTGTGACAGAGGAGGAAGGGACCAAACAAACTTTGAGGCATCCTAAATAAGGGG 
GAAGCAGCCTAAGTTGAAGCTAGCTCTGAACTGATTCTAGAAAAAAAAAAAAAAATCT 

6 5 TTCTAC^AGAGAAACCTCAAGTGCTAGCAAATGCCTCAGCCAGCCA 

CCTGGAAGAAGTGTTCX3GACAAATCTCCCAAGTAATGAGAACCCTGA 
CGCATGCCTGTGATCCCAGCACTCGGGAGGCTGAGGCAGGAGGATCIT 
TTTCTCAAAAAACAAGAAGGAAAAAAAGGAAAGAAGGGAAGGAAGGAGGGAG 
AGAAAAAGAAAGGAAGAAGAAAAGTGAGAATAAGGCATAAGATTCTATCACCTGl^ 

CCCAGAAGGAGTTTGGAATAAAAAGGCTAAOVGAAAAGTGTTTGGC 

GGAAAGGGCTTCTGGTCAAATGCTGTGCGTACAATTCT^ 

CCATTTGCTGTTGTGATAAACACACCCTGTCCCCAGAAATT 

7 5 TCTTATG AGATTGCCCAGTTCTGAGTGAGCATAAGGTCTTTTT CAG CAGACTGT AATTTGACGTTTCA 
GGACCCAGTTATCCTTAGGCTGTGTGACTCACAGGTCACGGAG CTGTGGAGTCTGGGAAATCCATGGTGTG AAGATT AGCCT CCAG

AGGTGCAGGAAAGCAGACTTAGTGATCrGGAATTTCTGTCCTrTTCCAl^ 

TAGATTTTGGAGTTGTGTTTAGAGTCTTTCTAA 

AAAGCAAGAGGCTCCTAAGTTCACXrTTCAGCCTGGGCTCCATAGTAAGACGCTGTCTCA 
5 AAGTCAAAATTGTGGCTTTGGTCTCAGTAGTTAAGAACACAGGCre 

GGTGGCTC^TAGCCATCTGTAGCTTTAGCTCCAGCTCTGGGGGGTGGGG 
AGGCATGTACATGGTACACAAACACACATGCAGGCTAGCAGTC^ 
AAATACACAGA CTrCVrri^ ACCTGACACGGGCTCTGTAAAAGGTT^ 
AAGATGraTTCAGAATCGTrTCTGATTTTC 
1 0 ACACACAGATTGCTCATAAGGCCTGAAGATGGTGGTTACTTG 
CTGTTTTATCTCGCCCCCGCCTAGTGATTGCT^ 
ATTGGGCTCIGTCTCCCCTTCATTGTTGTTCT 

CATCCCTGTCGTGACTCTGCAGCGCTGCTGACCCGCrGTTCATCTCTCAAGCAATTTA 
GGTGACTGGCCCCCTGGATAACGAGTACTTCTACGTTGACTTCAGGGACTATGAATATC 
1 5 ACTTAGAGTTTGGTAAGGATTGACTGTTCAGAGCTTCGGTGGCATTGC 

C^TCTTTACAGGGAAGGTCCTGGGGTCTGGCGCTTTCGGGAGGGTGATGAACGCC^ 
CAATTCAGGTGGCGGTGAAGATGCTAAAAGGTACCAAAAGGTGGGGTTATCTCAGCAAA 

AAGACTGCAAGCAGGGAAGGCAAGCCCCCACCCCTrCCTCTTTCTCATTGTT 
2 0 CAGATTCTGGGTAGAAAACACACCGGGGACTTCTCACTTCCCACTCTGTCCCATCCTCT 
GCCACCCTCAGCCGATTCTTAACCCAGCGCAGAGGATTGAATCCAGGATTTTGTGA^ 
CATCCCCAGCCTGTCATTCTATAACACGATGTCTTGCT^ 

GTATCATCCCCTCGGGACTGATGGCCC^TTCTTTTGCTGAGGTGTTCCCTGTAAGGATA 
CACACACACACACACAC^CACACACACACAC^C^CACACAC^CACACT 

2 5 ACAGAGAGAGAGGCAGAGAGAGACAGAGACAGAGAGTCACTGAAGTGCATTGCAGTACGC^ 

GACATCATGGACAGAAATCTCACAGTCACCTCCTAGCCAGGTGACTGATGATGCT 
CTGGAACTGTTAGAGGATAGCTTTAGGATACATACATTTGCCGGGCGGTGGTGG^ 
GAGGCAGGTGGATTTCTGAGTTCXSAGGCGkG^ 

TCAAAAAAACAAAACAAAACAAAACAAAACAGGATACATACATTTTCTGTAGAT 

3 0 CCCTCTGATATCTGCAAGGTACAAAACCTTGIAAGCAATGT^ 

CTTTTTACACTGTTTTTCTTC^CCCGTACATCCTAGTTCCACAG 

TGGTCCAGTTCACTAGCACCGTTCCCCCTCCCCTCGTCAATTCTCTTCCTGTCGGTC 

CTACTTCCGTGCCTTTTCCTTTTAATCTC^ 

TAACATCCCAGTGTCTGGTTCCATCCATCTTCCTGCAGGTGGCCCATCCCGTTCTTCGCA^ 

3 5 GTATTTCCAACATCCATCCATCCATCTGTGGCAGGCCCCT AGGCTGGTTAGAT AG CATGTTG ATTGT AATAGTGC CAACATCG ACT 

TCCTGTGTGGAGTTCCTTATGCCTCCACrCGCrCCACGCACTCACCAAGGACC 
ATAGCTCTCATAACTGGTTCCCTAAGCCTTGGCTTTGTGTGCCCAAATAC^ 

C CAACCCTGG CCTCCATCTCTACTCCAGCCn^GT AATT ACTGGAACGTTTCAGTGAGACTACACAT 
AGGAAGCGGGTTCCATAGCTGGCTCTCTCTCAGCCTTCCCCAACAGGTTTCC^ 

4 0 CGTGGTCTCTGTGGTGGTCCAGTTCTGTCTCTCCTAGAGACT 

GCTATCTGCTGAGGTGGGGCATGGCCACGATGCAGAACACCTTCTTC 
CTCTGAAAATGGTTCCCACAGGGGCTGGAGAGATGGCTC^ 

AAATCCCAGCAACCACATGGTGGCTCACAACCATCTATAATGAGATCTGAATCCCTCTCCTGGTGC 

TACTCACAT AT AAT AAAT AAAT AAATCTTAAAAG AAG AAAGAAAAAAGAAAG AAAAG AAAATGG TT C C CACAACTTTG AGCAG TG C 

4 5 AGCCGTGGCTCAGCCTTTCCGCCTCATTCT^ 

GGTGAATCCTTCCACCTGCTCTTCCCCTTTTGCTGTAGGAAGCCACTTACCT 
ACACCTCCTTTCACGCGATGGACAAGATCAAGTCCCTGGGGACCGTG 
GGCCCTGAGACGAATTCTAACATTAAAAATATAGTACGTCCTAAGAACACT 
AGTTGCGTTTAGGGATTCAGTTAGGTTCTAAGCCCTCTGAAA^ 

5 0 ATTCTGCTGCCTCCGCCTGCGGTGGTCTTGTGTGACTCCTTTCG^ 

CTGTCTGGTGTCCCCCCTAGAGTTCCCCCACAGG^ 
AAGTTGGTTGCACAGTCCCCTTTGCAGACGCCAGGGTCCCAGACGATC^ 
AGGAGTCCCTGGAACAGAGACCCCAAAGAAAGGTGTTACCCTTGTCCCCGAGGCCC 
CAGCTATGCTAGCCTCAGTGACCAGAGCCCTGGGAGTAAC^ 

5 5 CTGGAGAGATTGCATTGGTTCATGTGTTGTCTTCTTTAACTTTCCCC 

GAGCAACTTCATGTTGGAGACATCTGTGCTTGCTGACCCTC 
ACTAGGAAAATCACGTTCTGTTTATCGTGTTGTCTACCTGCTTTGAGAGCT 

ATTCGAGGCCAGCCTAATCTGCAGGGTGAGTTCCAGGCCAGCCAGCGCTATATAGTGAGA 

6 0 T ACATTCAT AT AG TT AATATCTATAACT CACGTTTT AAAACG CTGTCTT AAAACACAG AAC ATT ATTGTCT AAATTT A TT AT 

TTTTGTGTCAGTTTTTCTCTTCCAGTTTTGAA 

CAGTTCCCTGAGGTCAATCCTGTTTCTGCTCCTCCACCAACAGAGAAAGCTGA 

CAAAATGATGACCCACCTGGGACACCATG ACAACATCGTG AATCTG CTGGG CAGG T AATGTCTACACAT CTG 

CATATCGAAGCCCTGTTCCACACTACTGCAGCCCAAGGGAGCGAGGCCTGCGC^ 

6 5 CTGTTTCTTCCTTCCAGCCCCATGCTCCCAGAAATAAGACTCAGACTC 

TCTCCTTTGACTAGATAAAAACTTAATATAACCCATTTATTCTAATC^ 

TCTTATTTCATTCCTAAACCATAGGCCATCAGACTTATATGGGCATAAGCT^ 
TOTCCCCrCTGCAGAGCACATGACCCCGTCTTTCCCCTACTCTATCT 

CACACACATACTGATATGTTGATGATGGTGAGGTCAGACGTTTAGAACT 
GCTGAGATCXjAGTGTAGTCTAATGTACGAGCTCATTTAAAGCACCTTCT 
CGCTACAGTGTGATTTTGGAGGAAAGACCTGACCTGTGTCT 
GTCTGTC^AACCTTGTAACTAC^AT^^ 

7 5 AACTACCTAAGAAGTAAAAGAGAGAAGTTTCACAGGACATGGACAGAGATTT^ 
CCAGGCACATTCAAATTCCAGGTAAGAGCTC

CCT ACCTGCAACCCCACCC&GGCAACATTCGGA G T T G CT G TGTGTGGCCTGAG ACCTAAGGCAGCT A1 "1'T G TTT ATGCTGGGAATT 

TCTGTCC CATT ATTT CTT ATTTT ACTT AAT AAAGTTT CAATf AC CCACCAG AAG ACATGG TT CAAAGATC CTAAATAG AAAAG T CC 

AGAAATAAACAATCCTTAAGTTTTAAGTACCTTTACAAACAGCTTAT^ 

TCGTGTGTTACTATGCCCAATTTATTTGTACTTTACTTACTATTATA 

AGGGGGCAAACCCAGGGCTTTATGAATGCTTGACGCATCCTCCCrCTCTG^ 

AACTATCCTAACACAGACGTGCGTAGCCAATAATTAGATCCTTTGGTTCATT^ 

TTAAGCCITTTrTCAAAGGGAAGGGGAGTTGAAAACTGGG 

GGTGCGGGCCTCTCTTTGATGCGCTTGAAAAGGTAATCTGGCCAGACAGTG 

ACTCGGGAGGCAGAGACAGATCTCrGTGAGTTOAGGCCAGCCTGGTCT 

ACCCTOTCTCTAAAAACCAGACCAAAACAAAACAAAAACAAACAAACGAAAGAA 

AGAAAGAAAG AACAAGCAAAAGGAAAGACAGAATCTGTTGCAGCT CTT ATCCCTCAGAGGAGGGG AACGTGGTTCACAGAC 

GCTGGCTTGAAAAGATGGCGKTTGGAAATGTACTVCCTTCCCACGAGGCAGAAACCT 

AAAACTCTAAATTTGTAAGCTCTGTGTTGCATGAACTCT 

TGGTTCCTCTCTTTATAAACTTGAGGACATAGTAATCATTTACTGCCCCAGTTT 

TGACTGAATGGCAGCTCTTATCCAGGTTGGCTCGGCAAATTCCAA 

CCAGCATCX3TGCTTTCTGTTT AAAAG CCCATTTGTAACTGAAATCT 

CTCAGAGGAACAATGTAAATAACAAAGGAACAAATTTATTTTTAATA^ 

TGGATCAGCTCn'CAGGGTTCAATGGGAATTCAATTCATTCTGAAG^ 

TGAATCCAAGAGTCCCATGCCAGCTATTTTTAAACTTTTTA 

CATGG TGTGT ATATGTGTGTG CTGTGTGT ATGTGTGTGTTTT ATGTGTAT ATG TTT ATGTGTG CTGTGTGTG T ATGTGTGCATG CT 
ATATATACATATACATACATGTATATATATGTATATATATGTATTGTATGCATGTATGTGTGCTGTGAATGTTTCT^ 
ATGTGTGTGTGGTATGTGTGTGTATGTGTGTCX5AGTGTGTGTATATATATATGATGAATGTGTATGTAAGGTTTGTC 
ATATGTGTTTTTGTGTAGGGTGTCTGTTTATATGTA 

ATGTGTGTATATATGTGTGGAATATGTGTATGTATGTGTGTGTTTC 
GTGGTATGTATATATGTATATGTGTATGTTTGTAGGTGTATATATC 
GATATATATATACACACATATACATATATATGTATATGTGGGT^ 
TATTTGTATGTGTGTGGTGTATATATATATGTATATATGTTTTGTGATATC^ 

TTTGTATGTGTGTGATATGTGTACATATGTATGAATGTGTGTGGTATATATATGTA 

TCTGTGTGTGTGTTTGTATGTGTtSTGGGGTATAAGTATATATGTGTGT^ 

TGTGTATTTGTATATGTATGGTATATGTGTCTGTATATATATGTGTGTC 

GTGTATATGTXSTGTGTGCTTGTGTGTGGGCATGATCGTGCCGTGGCACCCATGAA^ 

AGTACCAGGGTGTCAGCCTGATGGTAGCITTTCTCCAGTGAGCCATGGCAGCAGAGTGK3AAGCAGTGTGT 

GGC!AGACTCTGTCAGTGCACACTTTCTCGGAAAGTCAAAAGCTGATAGTTGTGAACT 

AACTGTTTTGTTTGTTCGGAATATTCTTTATAAAGTC^TAAGCTAATTC 

ATCAGAGTTTGGTTATGTTCCAACTCTTTTCCTTTCGAAACAGT^ 

TCCAATTCTACTTTCCTCATAGGACTGTTAGGAGGCATACACAGA^ 

CAGTTGGTCAGTCTTTGCTTTTTTCn'ACrGACAGATTTACCTAGCA 



CCATATGAGTTCAGGGAACAGAACCCAGGTCCTCTGGGAGAGGAGCCAGT^ 

TTAGTATTCCITGTCriTAGCTCAGATGGCTTTAACAGATTATTAA^ 

AGGCTACTGCTTTCAAAACrGAGGTGCAGGAGACACTTGCTTCT 

4 5 AGAGGCAAGAGTACACGCTTTCrrCTCTTTAAAAAACTTTAA 

GAATCCATTCTCTCCTTCCACCATGTGGGTTCCAGGGATTAAACTCAGGTCTTTCGCCTCT 

CATCTTGCCCATTCTCCTCTTCCTATTTTATAGAGTTCATGGTCC 

ATACCATTGCTTGAGAGTTCGTrTGCCTGTATGTGTGTATCCATC 

ATGTATGTATGTATGTACGCACGCATATATGTATGCATGCATGCATGCATGTATGTATGTATATGTGTATACATGC^ 

5 0 ATATGTATGTGCACTGTATGTGCCTGGTACCTTTGGAGGCCAAAAGATAGAGTCACATTC 

TTGTGAGCCATCATATGGGTGCTGGGAACCAAGCCTGGGTTCT 

AGCCCCTGGGGATTTGAACAGGGCTTCATTCCAGAAAACAGAATGAGTGTTCAAGTAGACT 

AAAAAATGACT AAGGG CAGG AAAAAAAGGG AGGCAG AG ATC AGG ACAGG AATG ACTGTG CA CAACTTT AACCCAAG CATC CAGG CG 
GC^AGTGTTAGTCTGTGAGTTCAAGGTCAGCTAAGTCTAC^^ 

5 5 CAGGCTGATCATTCCAATCATTTCCCCGGGTTTGGAGTCAGGAAATC 

GAAGCTCTTCTGTAAGGTTTGAAGCAGAGGCCATTCCATCTT 
CGGATTTCTCCGAAGGTCAAATAATCCrmCTCmGTA^ 

TCTAGAGCAGGAGATCAGATCAAACGCCTCTTATAAATATCTCCACACTGAGGTCGCCTTAGCTGCCTCT 
GGTACTGCCTAGTGTCTGGGGAAAGTGACAAGTGGACGTCTTTTTACAGATGAGAT 

6 0 AGAAGAGGAGGAAGATTTGAACGTGCTGACGTTTGAAGACCrCCTTTGCTT^ 

AGTTCIAAGTCGGTAAGTACCTTTTAAAGAGAAAACCG 

TTGAATGAGCCGTGTGTCTACGTTTGTTCC^TGGTCATITTAACATGCCTCCCT^ 
ATACCTGCAGCACGAAGGCCCAGAGTCACTGGTTAGAGCCAACGTCCCGTGGGATGTCT 



6 5 GCCO^GTTCACC^CTCTGCCACCCCAGCCAGCCAGCCTGAACAGATTC^GCCTA^ 
CTTCTCAGTGGCrrcGCCCTGTATTCACAAGTGCTGCAGCAGGAAGACT 

CT AGG CAATG CTG CTTCTC CAC CCGCTGTCCC CTGTCATTC CACACT AT AAG CCAC CAAAGCCTCAAAT CTTT ATGTCTG CTGAAG 
TAACATGGTAGTAAACAGATTAGCGGGGGTGGGGTGGGGGGGGGGAGACAACATCTT^ 



7 0 ATTAGGGCTCCCTAAAGAAACAGAACTGATAGAATTAAAAATAATACATATATATC 
GGCTGGAGAGATGGCTCAGTAGTTAAGAGCACTGACTGCTCTTCCAGAGGTCCTGAG 
ACAACCATCTGTAATGGGATCTGATGACCACTTCTGCTGTTTCAAGAGA 
AAATTTAAAAAGTTATATATATATAAGTATATATACATAtAAA^ 
AGACATACACACATACACACACGCATATATGGGGGATTTGTTAGAGC^ 

7 5 CGAGACTGGATGTCTCAGAAATCCCCATCTGGAGTCCCGGGGAAGTCTTAGAAA^ 
AAGAAGTCAGTGAGAAAGCAACCACAGCAGCAGAG

TCCTCCrrCATACCCTCTATGGCTGCCACCAGAAGGCGTGGCCCAGGTGTTAGGCAGGTC^ 
CTGTCCCCCGTGCTGCAGAGCCGCAATGACCAGTTTAGATAATCCAGC^ 

T ATTTTT ACT CTT AGGTG TGTTATG ACATGGACTTTTG AAG AG CAT CCJ^TTTTTTTTTTTT AAAGATTT ATTT ATT ATT ATAT CT 
5 AAGTACACTCTAACTGTCTTCAGACGCACCAGAAGAGGGCATCAGATCT 
TTTGAACTCAGGATCTTTGGAAGAGCAGTCAGTGCTC^ 
TTIXH'CTTCACTGCAGCCTTCCGCrrCCAATCATGCC^ 
TCCATGCGACACTGTGCTGGGGCAGAAAACCAATGAGGATGTGACTTC^ 
CAGACCTATACATCAGATCAGTGTCTTGTGATTGATACTGTCACrrAAAGT^ 
1 0 AGGCAAGAGAGCAAGAGAGCCTCCCnTGGATGGAATCCAGAGGGAGA 
GCTGGTCAGAAGGAGCCTTCCCAGAGC^GCTGGGAAGAATACGGGCAG 
ACATCTAGGAGATGCCACATGAAGAAAGCCAGACTTGGAGAAGGAGGGCAGGAG 
TTGTTCTTTTAAAGGCTGAGGAGAGCGTAGTGGACCCCCATGGGACACCATGTGCT 

CTGGG CTCT ACTCTCATGTACT C CTC G AAACT C ATCGTT AT CTT CT GT CTG TCTG TCTG CCCC CTGT AC CCTGT ATGTG CATGTGA 
TGTGCAGAAGTCAGAGGAOU^CTGTGGGAGTTAAT^ 

TATTC^TTTAGAGACACGACCTCTATGTGTAGTCCAGGCTGGCCTCAAACTTAGAAAT^ 

G ATT ATGGATCTG C ACCACTAGG C CTGTCAG ATATGATG CACTTTT ATT CTGAAAGC CAGTT CT/IT ATCCAAAAGGG ATTCTGTTC 
2 0 TCGCACGCACACACG CACGCACAGGAAG CTGG CTG TT AG AAGCGT CTCTGGAGCT AGG CGAAGG CCC C CrTTGGCACTT CTTTG GC 

TCTCTGTCCTCTGTAACrGTTGTGAGTCTCTCTCTGAAGGTGT 

CCACAGAGACCTGGCAGCCAGGAATGTGTTGGTCACCCACGGGAAGGTGGTG 

TGAGCGACTCCAGCTACGTCGTCAGGGGCAA.CGTGAGCCGTTTGCTTTTTGTCACG^ 

2 5 GCGTTTTTGATTTATGGTGACACCAAGGGGAGAAAA 

TCACTAAGAGCTAAGCAAAATAGCCACGCATCTCTGGGGGGCTGGCCACT 
ACCTGAAAAAAAAAATGCCCTTACTCAGACCrGTTTCGGTTTCC^ 
ATGAGAC^GCTAGCATCTATAAAAAG CT ' ri ' rT GTTT rrrmTrr CT 
GGTGCTGTAGTTAACTGGTAGCGTTTGTTACCCTGTTACTATATGAAGAGTTCAGGA 

3 0 GTTCGTTCAAAGAGAATGTGAGGGGGCCAGAGAGATGGCTCAGTGGTTAAGAGCATAGAGTC 

TACCCAGCACCCACGGCAGGATGCTCGCAAGCATCTATAACTCCAGCTCCAAGGAATCT 
TGCTCATGTGCACATACTTACACATAAGTGAAAATCAATTCAAAGTATG 

AGAGAGAGAAAGAAAGATAGACAGAAAATGATAGATAGATAAACAGACAAATGATTGATAGATAAACA 
AATGATAGACAGACAGATAGACAGTGAGATGTCTCAGCAGGTAAAGGTAGTTGCTACCAAGCCT 

3 5 AAACACATGGTGGAAAAACTGGAATGGTATAAGTTGACCTCTGACCTTCACCnXa 

CACACATGCACAGGAATAAAAACATAATTATTCTTCTTTTC!ATTTTTC^ 
ACTCACrCTGTAGACCAGGCTGGCCTCGGNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNinWNW 
NNNITONNNNNNNNNNNNNNNNNNM^ 
40 NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNGGAGTCT 

TCTCTGCTCGACACTCTCAGTACTCACTTCGGTCTGAAAATACCCACTCTGTGAAG 

CCGCGGTTGCTAACTTGCCCCGTGTGGATGCGAGCGTGCAGAG 

CCTACAGGATTCTGATTTTACTTCCTAGGTTCTCAGAAACCAAC 

4 5 TCCAATG AATG CTCATCT CAGGTGG CTTAGAGCCGGTTCCCAGGG ATGGTCGCTGGCTGTGGTTACAGT ACACTGAC CTCTCTGTG 

TGCITATGGGGTTTTCACCCCAC(IAGTGGGCTCrCCTGTGGGAAGGTGAATTC 
CCAGCCTGGACCACAGATTGGGCCAGTGACTCCATTTGCATAGGATTTAAA^ 

C CAATT AAAC ACATGAGGTTCCT ACTGG AAC CAAAT CAG AAC AAAA CT C CGTG CCC CAGG AGCT AACCAGC CACTGTG TCCTG ATG 
CAAATCAAAGCCTGTGCAAACAGAGGCCAACCCCCTGTCCCGCCTCTGGAAGTTA 

5 0 TAGGCATGATCACGTAAGTCCTTGAGGGCGATGCTTCTCAGTTCTGAGGAAAGCTCCCGTA 

TTGTCGACCTCTGAGCAAATACAGAGCTCTTCCTTAACGAACGAAC^ 

TGT CCAAGATG ACITTTAAAGAAGTGTATTrTG CCAGG AGTGGTGCCCT ACGCCTTTAAACCCAGCTCTCAGGAGGCAAAGGCAG 

TTGATTIOTGTGTTTGAGGCCCTATTCrGCrrATGTCACCAGAGCT^ 

CTGGACGGTGGTGGCGCACGTCTTTAATCCCAGCACTTGGGAGGCA^ 

5 5 CAGAGTGAGTTCCAGGAC^GCCAGGGCTACACAGAGAAACCCTGTCTCAAAAAAGAAA^ 

ATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGAGAGAGAGAGAGAGAGATCGCTATAGGTAGGTAGGTAG 

ATAGGTAGGTATAGACAGGTAGATAGATAAACAGATACTTAGATAGATGTTAAATATAGATAATAGATGATAATG 

GGATAGATAGATAAGAAACX3TATTTCTAAAAAGCTTCTAC 

TTCAACAGACAGAATAATTATTTTAAGATATATTTTGTTGTTATGTCTCCTTTTCA 

6 0 CTGTGCCCTCTGGTTAGTCAGGCTAAGGGTCTATCrATCTTGTTTTTT 

GAGTTCTTTTTGTTTCTATTTGGCTGATTTCAGCCCTGAGTTTGACTATTTCCT 

TTTTGTTCTAG AGCTTTCATGTGTGCTCTCAAGCTGCT CTTTTTGGAAGCACTCAGAG CTATG AG 

TTTTCCTCTTAGCACTGCTTTCATTGTGTCCCATAAGCTTTGATATGATG 

TTACTTTCTTTTTTTATTTCTTCCTTGACCAAGTTATCATTGAGCAGAGCAT^ 

6 5 GTTTTTGTTCrGTATTTAAGACCAGCCTTAGTCTGTGGTGATCTC^TAGGGTATATGGGATTATTTC^ 

GG C CTGTTTTG TG ACCAATTAT ATC^TCAG TTTTG AAG AAGGT ACTGTG AGGTG CTG AG AAG AAGGT AT ATTCTTTTG CTTT AGG A 
TGAAATGTTCTATAAATACCTGTTAGATCCATTTGGTTCATAATTTCTG 

TGATCTGTCC^TTGCTGAGAGTGGGGTGTTGAAGTCTCCCGCTATTATTGTGTGGGGTGCAATG 
TCTCTTTTATGACTGTGGATGCCCTTGCATTTGGAGCATAGAT 

7 0 AGTATAAAGTGTCCTTCCTTATCTTTTTTGATAACTTTO 

TTTCCTGAAACAATTTGCTTGGAAAATTGTTTTCCAGACTTTTACT 
GTGTGCAGCAAAATGCTGGGTCCTG1TTATGTATCCAGTCTGTTAGTCT 
ATAGATATTAAGGAAAAGTGATTGTTACTTCCTGTTATTTTTGTTGTTAGAGGTC 
GTTTGTTGAAAGATTACTTTCTTGTTTTTTCTAGGATGTAGTTTCTCTTCTTG 
7 5 CTGGATTTATGGAAAG AT ACTGTGTAAATTTGGTTTTGTCATGG AAT atcttggttt ctctgtctatggcaatttag agttttgct 
GGGTATAATAGTCTGGGCTGGCATTTGTGTTCTC

TGAGAAGTCTGGTGTAATTCTGATAGGTCTGTCTTTATATGTTACITGACCTTTTTC 

ATGTTCATGGG CATCTCTTTCTTTAGGTT AGGGAAATTTTCITCT CCCTTTAAGTTGGGA 
5 ATCTTCACTCrCTTCTATAACTACTATCCTTAGGTTTGGTCT^ 
TTTTTGCTTTTTGCGTTTTCTTTAACTGC^ 

GGTGATGCTTGCATCTATGACTCCTGATCTCTTCCCTAGGTTTTCT 

CTATTTCC ATTTTAGATCTTGGAT AG l'l 1 iGTTCATTTCCTTCGCCTGTT^ 

TTATGTCCTTCTTAAAGTCCTCTATCATCATCATCAGGAGAAGTCATTTTAGA 
1 0 CTATGGTGGAAGAATTGGGTTCTGATGATGCCAAGTAAC 

TTATCTCTAGXX3CTACCTGCCCTGGCTATGTCTGACTGWAGCCTGTCCTTC 

CAGCTGTCTCTGGGATCCTGTGGTTCTGGGATTCTGAGATCCTGGGGTGTTCCT 

ACTTCTGGTGGTGGAACAGGTGTTGCTCTCCTCCTCCACCTGATCATATCCT 

GTTTAAATTTTATCATCTAATGCTGGGCAATGGTGGCATACACCTTTAATCCCA^ 
1 5 TGAGTTTGAGGCCAGCCTGGTCTACAGAGCCAATTCCTGGACAGCCA 

AGTTGAGT AAAT AAAT AAAT AAGT AG ATATTGT CAT CCAATGG AAAACAAAAACT AAAACTTGGAAAG GCTGGAGAGACA 

TGGTTAAGAGCACTGGCTGTTCTTCCAGCCGATCCGGGTTCAATTCCCA 

GTTCCAGGAAACACAGCACTTTTACAGACAGACATGAAGGCIA^ 

TAAATTTTATCATCTAACAACAACAAAGAGGGGCTGGAGAGATTGCTCAGTGGT^ 
2 0 GTTCAGTTCTCAGCACCCATATAGTGGCTCACATCCATCTAGAATGTCAGTTTCA 

GG CACTGTATG CAT AAGG TG CACTT ACAT ACATG CAG G CAAAACACC CATG CACAT AAAATT AAAAACAAAACAAAACAAAACAAA 

AAAAAACTrrAAAAAAATCTCCAAGTCTGCATGTGGGGGAACTTC^ 

AAGTGGATGGCACCCGAGAGCTTATTTGAAGGGATCTAC^ 

ATTTTCACTGGGTATGTCTGGCACTCTGCAGGCCTCCTTGGGCT 

2 5 TGCATGCCAAGGGCGGTTAACACACCACTCAAGACCCTAGCTTCTAGTC^ 

ATCTTTCTTTCCCACACTCTCTAATTGATTAAGAGGCT^ 
CACTGGGGAATTCTCITTGCTTGTTTTCAGAC^^ 

AGCGCGTGTTTTTCTCAGGTGTGAACCCITACCCTGGCAITCCTGTCGACGCT 
ATGGAGCAGCCATTCTATGCCACAGAAGGGATGTGAGTACAGCCAGACCG^ 

3 0 CCTCCAGATGCATTGCAAAAACAAAAGCTTCATGGCTITC 

TCCTGAGAGCTGTTATTAAAAATGGGCACCGCTGGGCTGGTGAGATGGCT 
CCTAGTTCAAATCCCAGCAACTACATGGTGGCTCACAACCAAACGTAATGAAAACTGA 
CTACAGTGTACTTACATATAATAAATAAATACATTTTTAAAAATGTGCACTGCCACC^ 
35 CATTG CTGAG AT CTT ATTT ATTT AT ATTTATTT ATT C AT CT ATCT AT CT ATCT AT CT AT CT AT CT AT CT ATCTATCT AT CT AATGT 

ATGTGAGTATACTGTCACTATCTTCAGAGATACCAGAAAAGGGCATTG 

CTGGG AATTGAACTCAGGACCTCTGGAAGAGCAGT CTT AACCACTTAACCGCTG AG CCATCTCTCCAG CC CTTGTTC 

AAGCAGAGACATCITTGGTACAGTGGCACCCACCTTTAATTCCCACAGCX3AGGAAGCAGAG 

GCCAGCCTGGTCGACAGCCAAAGCTGTGTAGTGAGATCCTGTCTTGAAAAAAACAATCGC 

4 0 GGGCAGGAAGGGCAAGGAGACACTTGCCTATAATCCCAGTACTCCAGAGGTAAGGTATGA 

CAAGACAAGAAAGGAAGGAAGGAAMNNNNHNN^^ 

AGAGAGAGAGAGAGAGAGAGAGAAAGAGAGAGAGAGAGAAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGA^ 
GAAGG AAGG AAGGAAAAG AAAG AAAGAAAG CTT CAGGT AGCTTG CGC C CTCTTGGG CAG C CT C CAAG G CTTCATT CTT CTG CTTGT 
GGTACCTGAGGATTTGACTTCACCTCACTCTCTGGTGTGGTACCAATTCT 
45 CCCCCACCCCCCACCCCCATGCTGCCCTGGGTGTGAACTGTGGGTGGAGCCTTGCCAAGTTCATTCACACCCCC 
TGTTTTGCrCTCTCACAGATACTTTGTAATGCAATCCTGCTGGG 
CATTTTTAGCATGTCAGCTGGCAGAGGCAGAAGAAGCGGTACGTGGA 
GGTGTCACACCGGGCACTAATGGCCTCAGATCCGGTTTAGGCCGACCATCTTTCTCACCT 
GTCCH'GTGCAGAAGATGTACTATCTGGTGTGGGCTTGGGGGCGGGGCTTTCCTGT 

5 0 CTCTTAAGC^TCCAGTGCAAAGGGCTAGACAACTAAATAAACCTTAACTCCT 

TGTG AGGGTGGGT ATGGATGGG AAGGG CTTT AG C CCAAAGC CCTG AGG AC CCATT C AG T CTCTG CTG CTCATT CCCCACTGAT CT G 
AACGATGGCAAGTTTCTTCTTTGGCCTCGGTGTTCCCATCTGTAAAATGG 
TGGTATTACGTCATTAGCCCCTCCTCCTTAGCACTCTTAGCACAGGAGAGGTCTGCTCCA^ 
GAGCCCAGGTTCAGATCCTGCCTCCX5TCACTAGTGTCTCACATAGTGACTCACATAATGTCT 

5 5 AATGTCTCACX3TAGCATTTGCAACTTTAGACTTTAATGTGGACA 

TCTGCCCTTTTCCTACrTTTTATTAAAAATCAGTGGCGGGCT 
AGGTCC!AGAGTTCAAATCCC!AGCAACCAC!ATGGTGGCTCACAACCATCCGT 

GACAGCTACAGTGTACTTACATATAATTAATAAATAAATAAACAAATAAATAAATATAAATAAATAAATA 
TAAAAAAAATGAGTOKTTGCTGTTTCAGAAAAAAACAAAACCCATGGAGGGGACT 

6 0 TGCTCTTGCAGAGGATCAGGGTTC^CTTCCCAGTAAACAG 

CCTCITCTGGCCATTGCAGGCACTGCATTCAAATACTC^ 
GCCTGGAAACTACCCAAATTCATAGTTTTATATTTTTAAGATTATTTTA^ 
CAGAAATCAGAAGAGGGTGTCGAATCCCCIXXKjGCTG^ 
TCAC^TCCTCTTCAAGAGCAGCAC^TGTTCTAACCACTGAGCCATCTCT 

6 5 GTAAGGTGTATTTAAGTACTGTG CTTTCTAGGCACACAGGCG AGTCCTGCTCAG CCGGTCATTCTrCTCCCrTTCCAGATGT ATCA 

GAACATC^TGGCAACGTCCCAGAACATCCATCC^TCTACCAAAA 
CGCCACAGGCCCAC^TGAAGATTCACAGAGAAAGAAGTTAGCGAGGA^ 
NNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNMJl^^ 

TATTCTTGTAAATATOTGAAACAAACCAAACCCGTTrTTTGCTA^ 
TACTATGTAACTTTTTCATCTATTTAGTGATATATTTTATGGATGGAAATAAACTTTCT 
TTGCCTGAACTATAAGCTCATGGCCTCATCTGTATGTGAGATTTCCTGTTTT^ 
TGAGCAGGTGCATTCTTGGGGGATGCTTGGTGTTGTTTGGTTT^ 

GAGATCTTCCTGCCTCTGCCTCAACTCTTGGGGrlAAC
CCTC^CCGACCAGGCTGAGGACCGCGGGCACCACAT

TGAG CAAGCTC CAG CC CAC AGGTTT AT AAACTCG CCTTGCTTTGTC CCTTTGT ACG CTGGACACCACTGAGTT CTG T C CTTT ATTT 
CTTTGTTTTGTCCTATGACAGGGTCTCATGTGTCCCAGGTTG^ 

TCCTCGTGTCCCCACCTCCCGAGTGCTGGGATCCCGGTGTGTGCCACCATGCCCGGCTTCGCC^ 
CCATCTAAGTCAGAGGTTCTCAACCTTCCTAATGCTGCTACCCTTTAATACAGTTCCrrCATGCT
TTATTTCATTGCTACTTCATAACTGTAAGTTTGCTACTGTTATGAA 

ACG ATCCACAGGTTGAGAATG CTGCC CTACTGTG TCTCTT AGCTG CTGCAAGTTC CGTCT C CCTAAGCATTGACTTGG TGTGTGGT 
ATGGTATGGTGTCTTGTAAGCACCAGGTCTCTCCATTGAT^ 

CT AG CTGCATAATGTTTTCTCTCT AG C CAAAAC CG C CTGGATGGT C CTGGATG TGTCTGGGCGTTT C C CAGC C CCCTCTTCTG CCT 
CAGTCTCTGTTTCCATGCCCATCAAGAAT7VGCTGCCTCCGTGGCAACCTGTG

CACAGACAGCCTACCTATGTGATGCTGTAATTAGCTTTTGTGATAATGTCCTCTCTCTTCT 

TACTTACTTACTTTAATGTATTTGAGTACAGTCATTGTCrTCAGACACACCAGA^ 

AGTCACCATGTGGATGCTGGGAACTGAACTCAGGACCTCTGGAA^ 

ATGTCCTCTCCTTTAAGCTGGAGTTTGAGCCGTTTGAAGGCGGATG 
TAGGTATAAACAGACATTTGGTTACTTTTTAGCCTACTCATGTAAAAGCATAAGATA

G ri'T TT 1' ATTT AGGTGAGTGGTGGTT AGC CATGT AACAAGCGTTGG AAC AC CT CT C CAG CTGAG CTCCGGTTCAGGTGT ATCTCT C 

CTACCTTCCTGTCrrGAATGTATTCCTACCACACACAC^ 

TGAGCAT ACGCAG ACGCACACACATG AG CATG CGCG CG CGCACACACACACACACACACACACACACACAC^ 
AAGGCGGGGGTGCAC^ATGCAAATGGGAAAGAAGTCTTTGTATTAG 
TGCAGGCTCTTGTTTACTGTTAATATTG CTGCTGAGAACATGGT ATGG AAATTGGAAAT ACCCCCC CATCTCTCT CTGTCTCTGTC

TGTCTGTCTGTCTGTCTGTCCGTCrCTCTCTCTC^L^ 
GTTTTTTTGTTTTGTTTTGTTTTTGAGACAGAGTTTCT 
CTCCCTCCCGAGTGCTGAGATTAAAGGTGTGCCCCACCACCACAAGGGT 

CTCCTCCTTGG CCTGCTGAGGTCCTGC^CTG CTTCTTCAC AT^ CACAGCTGTCTCTGTCG CTTACCCCTGAG AACCTAAT 

GIXKiAACCCGAGGCCAACTCTCAGCCTCCCA.GGCACAGCCTAGTGGTTTGCCGA

CTTTCTCGCTTTCTCTGTCrrTCrCT 

TTGTCG TG AGATGTGTGTTGAT AG AGCAAGG CAGAAT AGTGATG TG AG CT ACC CAAG C CT CTT CCIT C1TTCAAAGTG ATTTTG C C 

CAGGGCATGTGCACATTGTGTCTGTGACATCCCTCTGGATGGAGCCA 

CGTTAAAACCCTCAAAGGACITCATTTCCACGGAAATGGATATTCTCT^ 

ACCAAATGCTTCTTTTAATAG CAT AATATTGGGGCTGGAAAGAC AGTTAATAGTCCATACTG CTATTGTGGAGGACCTGAGTTCAA

TTCCTAGCACTCACAACTC^AACATCACAGCCTCTGGCAACTCCAGTTCT 

CTG CATGC ACAT AGTG CATACACACACT CAGG C ACAT ACACAT AGACAT AAAAT AGAT AAAT CCTT CTTTT AAAA^ 
GACTTGTTCTTCAAAGATGTATTCATGTTATGCAGTGAtCATCCCTAATCTGAAATACT 

ATCCGCAT AAT ACGTCACACAC AC ACACACACAC^ CAATGAACAG ACG 

TTAGAATGAAGTGAGTTAAAATTATCACATCAAGTTACCACCAGGCTGTC

GAATTATATGTGCAAAATTATTCCAGTCATAGCAAGTTTAGTCTGGTTCCT 
CCCAAGCTGACATTGTCCATATTGCTGATGTCCTATGAAAACACACT 

CITCTGGGCTGTGTCGGGTAG AGGTTTGGAGCATCTCCTTCATAGAAAGTG AAGTCAG CAGAGGTGGAAACTTAAGGGCT CGTTGG 
AAAGTCAGTGG TGCGGGAAGG TGTTTGCT AAGG CAGACACGTGAAGG AAT ATTTAG CTGAAG C AG ACACAGGAGAAAAGGATGTTC 

TGCT AAGG CAAGCATGTGATG AAGG ACT CTTTG CT AACAACACG CATG CG TTAGTCCATCTT ACG TTGTGT AG CTCT ATTTGTC CG

GAATCTATAGAGAGAAATGCACCAAAACAAAACAAAACAAAAAACAACTT 

ACTCCGATTGGCAGAGTAATGTCAGCTGAGACAGTCGCACAGGATGAGGCAAG 

ATAAGTAGAACTCAACAGACAGTC^TGGAGACTGAGCTTGGCTTGT^ 

TCACCAGTCTTCGATTCCCCGAGAGAGGCACAGCTAAGAACTTCTCCTGTGTCCCTCTGGGTCCT 

GGTTGTGCCAC CG CTGCTGATTCTGTTTGCTGTCCCGGCTCTTCCAAACTGGACTG CTGATGITTCTGTGAAGTGTTTGTGCGTGG

ATCG AG CTG CTGACCTGTGAACTG AACTGCTAATTTCCAG ACAG CACAGACCGG AGTTGCTCTAAAG AACCTTTCTAAACAGGTCC 

ACTTTCCCCGTATCCTCCCCCCCCCCCCACTTCCTCrGGTGGGTGGTGGACTAAAAGGGAGGTT 

AAAAATAAGGATTGGCAGAGCCGCACTCAGGCACTGGAGCGCAGCT 

ATGG CTGTC CTTT CT AAGG AAT ATGG ATTTGTG CTT CTCACTC^TG CTGCC AG CTTTG A 

GGGCAAAGCCTGCAAGAAGGACAAGGTAGAGTACCCTGTCACCTACAGCAT

AGCGCGCCCACCAGAACGTGTTGGAGGTACACCCTCCCTCCCTGTTTTTCCT 
TCTGGCCTGGGCTTGGCCTTGATTTCTTGGACGAGTTCTT^ 

TCTCTTGCCCTCTTTACTCTGATGGG CACCG CCATGTGCTCTGCTT CCCAG CATCCTCTGCTTCCCAGCATCTTGGCTGG ATCAGA 
CCGGGCTTAAGCTGCGGGTCCAGATCCTGCCACTGAGGAAAGGGGGGCCGGGGGGGGGCTTCCT^ 

T CAT CCTG ATTTCCAGTT CCACTGTTT AGTTTTTGTC CAG CCT CAT ACCT AA

AAATGAAGAAATAAAAATAAAAATAAGGATTGAAAAAGTTAAGGTTAGAGT^ 

TGGGCCAAGCTGTGTCCCCTGAGGGGAGGGTACCCCTCTTTTTCTTTCCCTTTCTTCCT 

TCCTTTCTCTTTTAAAAATGTATTTATTTTATTTGTATGGGTGCTCTATCAG 

TACAGATAGTTGTGAACCACCATGTGGTTGCTGGGAATTGAACTCGGGGCCTCTGGAAGAGCAGTGTACTG^ 

ACTTGATACAGCTGGAGTTATCACAGAGAAAGGAGCTTCAGTTGAGGAAATGCCTCCATGAGATCCA

ATTAGTGATCAAGGTGGGGAGGGCCCTTTGTGGGTGGTGCCATCCCTGGGCTGGTGGTCTTC 
AGCCGGGCGTGGTGGCGCACGCCTTTAATCCCAGCACTTGGGANNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^^

NNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNm*NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNTOWNNNNNN^^

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNinnOJNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNOTJNNNNNNNNNNNNNNNN^ 
NNNNNNOTINNNNNNNNNNNNNNN^^
TAGGGTGTGT(n"GTGTGTGTGTGTGTGTGTCrr(TTC
GAGTTG TGGG AAGCAT AGGGGGAGTTAG AAGXMAAAGA^ 

GACATGGCAAGTAGTACCCTAGTTATGGCCGAGCAGGATGCCTCTCCATCCCCCACCCCACCCCT 

GT AAC!AG CCTT AG AT AAACTGCTGCTGTG AT AAAA<IAC^ CAAAGG TTTGTTT CAGTTT AT AG T 

TTATAGGCTGTTGTTGAAAGAATCCAAGCCAAGAACTCAAGGTGAGAACCTG^
ACACACACACACACACACACACACACACACACACACACGCTTGTACACATA 
CGAGAGAAGATAGCACACTCCACCAGAGGCACCTTACCCTCTTCTtJTTTACACTAAT^ 
TTCAAAGAAAGATTAAAACGCAGTGCCTGTGCTCTAGAGCAACAGCAAGGAC 

G<K^ ACAG AACCAGGGTTGTCCCT<1AGG ACACTGTCCrn^T AAT C CAG<5C AG<5TCACAGGG CCA.CCTG TTTGG TCTG CATG TGGT A 
CACAGATTACATGCAGGTAAAAGCATACGTGCACACAGACAAGATGCT
GCCX5CCACCTCTTGTTATG<3CAATGCCTCGGGTGGAAT»TTCTT^ 
AGGTGGCGGATAGCTCAGTATTCGGGTGCTCCTCGAGCATGCXaTCAGACCCTGGGT^ 
ACCAACAAAACAGTTAAACCTGTGTTTCTCATCGQVGGACCGTTCTGGATAAG 

G C CCTTGCACAGGG CCAG CATGTGTGTTCAGCAAACACAACCCATGAXXIAGGAATT A^ CGGACCACA 
GGCAG<K3CAGAACACTGCCTCCAGTCCAG<3ATCAGCCATGGACATX3CT
TCTTTGGGAACATTGTTGAGAAATGCCCTCTGATTGCCGC^ 
AATCACTTTTTTGCCTTTATTGATGCTCTCCCGAGATCAC^ 
ATAAAAATATTrTAGATTACTTTAGCCTCCTTACTCAGTTATTTAG 
CTAGTCTCTTGAAGGTGGACACAAGAACATXKKX5G03GGATG<K5CA^ 

GACAATCTCTGAGGTCGAGACCAGCCTGATCTAGAGCGAGTTCCAGGACGGCCAGGGCT

AAAGGCCTTCGTGACTCAGAAGCTGAAGGCCCACTTnKSCA 

CAGAGAGG AACGGGAAG CGCTGAGCGATTTTCAAAAAAAAAGGGCCGGTTTGG ACCAGACAAGG ATGCA CCCAAG 
ACTCTTTGCAGAGAGAAGTAAAATGGGAGC^AAGCAATGTTT^ 

GTTTATTCAGGATGGCACAGCGGCCGAAGGTCCAAAGGTACATAGTCCCTGTTCATAGGCT 
ACATGCCCTTGGGGAACAAAGTCAGAGGGTAGGAGTCACCTGCCCTTGAGACTGT^
GACAGAGCAGGCACTCGCAGCTGCGTTCAGTATG<3CTGTGACACrCTGTC 
CTTACAGAGGAAGAGCACAG43TCCAGGTTATAAAATCAAGCT 

TTCCAGTCGACTGGAGATCAATTTTAAGCACACATTCCCCACCCCCCAAAGATGCTCGC^^ 

T CCTG CTCTGGCCTCCG T AAACACC AAGG ACACATGTG ATG TG CAG ACAT ACATG GAAACAAAACACTC AAAT ATGTG AAATTT AA 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTATCTGTCTGTCTATCTGTA

AGGGTCrTTCTTGCTTATCCTGGGCACTCTACCACTAAGATGTACT

AGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGG^ 
GAGGAGAGGAGAGGAGAGTCTCAATGTATTTGAAGGGTCATTTAAAAATCCTTAATGGGAAGTG^ 

MOUSE SEQUENCE - mRNA

GCGGCCTGGCTACCGCGCGCTCCXSGAGGCCATGCGGGCGTTGCCGCAGCGCAGCGACCGGrc 
AGTAATGATTCTTGAGACCGTTACAAACCAAGACCTGCCT 
CGGGAAAGCCATCATCGTACCGAATGGTGCGAGGATCCCCAGAAGAC 
TATGAAGCGGCCACCGTC^AGGTGGCCGAGTCTGGGTCCATGACCCTGCAAGTGCLA^ 

CTGGGT CTTT AAG CACAG CT CCCTGGG CTGC CAGC CG CACTTTGATTT ACAAAACAGAGG AATCGTTT CCATGG CC AT CTTG AACG

TGACAGAGACCCAGGCAGGAGAATACCTACTCCATATTCAGAGCGAACGCGCCAACTACACAGTACTGTTC^ 
GATACACAGCTGTATGTGCn'AAGGAGACCTTACTTTAGGAAGATGGAAAA 

GGAGCCCACTGTGGAGTGGGTGCTCTG CAGCTC CCACAG<3G AAAG CTGTAAAG AAGAAGG CCCTGCTGTTGTCAG AAAGGAGGAT^A 
AGGTACTTCATGAGTTGTTCGGAACAGACATCAGATGCTGTGCTAGAAATGCACT 

G ATCT AAACCAGG CTCCTCAG AG CACACTG CCC CAGTT ATTC CTGAAAGTGGGGG AACC CTTGTGGAT CAGGTG T AAGG CCATCCA

TGTGAACCATGGATTCGGGCTCACCTGGGAGCTGGAAGACAAAGCCCTGGAGGAGGGC^ 
CAAAGAGGACCATGATTCGGATTCTCTTGGCCTTTGTGTC^ 

AAGCACCCCAGCCAGTCAGCGTTGGTGACCATCCTAGAAAAAG<^TTTATAAACGCTACCAGCTCGCAAG 
CCCGTACGAAAAGTTCTGCTTCTCAGTCAGGTTTAAAGCCT 

CTTGTGAACAGAGAGGCCTGGAGGATGGGTACAGCATATCTAAATTTTGCGATCATAAGAACAAGCCAGGAGAG

GCAGAAAATGATGACGCCCAGTTCACCAAAATGTTCACGCTGAATATAAGAAAGAAACCTC 
CCAGGCGTCCTGTTCCTCTGATGGCTACCCGCTACCCTCTTG^ 

AAATCCCAGAAGG AGTTTGGAAT AAAAAGG CTAACAGAAAAGTGTTTGG CCAGTGGGTGTCGAGCAG T ACTCT AAATATGAGTGAG 
GCCGGGAAAGGGCrrCTGGTCAAATGCTGTGCGTACAATTCTATGGGCACGTCTT 

CTTCCCTTTCATCCAAGACAACATCTCCTTCTATGCGACCATTGGGCTCTGTCTCCCCTTCAT^

GCCACAAATACAAAAAGCAATTTAGGTACGAGAGTCAGCTGCAGATGATC CAGGTG ACTG^ 
GTTGACTTCAGGGACTATGAATATCACCITAAGTGGGAGTTCCCGAGAGAGA 
TTTCGGGAGGGTGATGAACGCCACGGCCTATGGCATTAGTAAAACGG 
AAGCTGACAGCTGTGAAAAAGAAGCTCTCATGTCGGAGCTCAAAATGATGACCCA 

CTGGGGGCATGCACACTGTCAGGGCCAGTGTACTTGAT^^

AAGAGAGAAGTTTCACAGGACATGGACAGAGATTTTTAAGGAACATAATTTCAGT 
CCAGCATGCCTGGTTCACGAGAAGTTCAGTTACACCCGCCCTTGGATC^^ 
GATGAGATTGAATATGAAAACCAGAAGAGGCTGGC^GAAGAAGAGGAGGAAGATTTGAACGTG 
CTTTGCGTACCAAGTGGCCAAAGGCATGGAATTCCrGGAGTTGAAGTC^ 

TCACCCACGGGAAGGTGGTGAAGATCTGTGACTTTGGACTGGCCCGAGACATCCTGAGCG^

GCACGG CTG CCGGTGAAGTGGATGGCACCCGAGAGCrTATTTGAAG<5GATCTAC^ 
CCTTCTCTGGGAGATATTTTCACTGGGTGTGAA^ 

GATTTAAAATGGAGCAGCCATTCTATCCCACAGAAGGGATATACTTTCT 
CCATCCTTCCCCAACCTGACTTC AT l Tl'r AGGATGTCAGCT 

AAAACAGGCGGCCCCTCAGCAGAGAGGCGGGCTCAGAGCCC^

GAGGAGGCCTTCteACCCCGCCACCCTAGCAGGC^ 
CGTTGCTTCGCTGGACTXrrCTCTAGATCCTC 
GGGAGAGCCAATAATGAGACTTGTTGGTGAGCCCGC^ 
ATATAGTATATTCTTGTAAATACGTGAAACAAACCAAACCCGTTTTTTCCT 
TTTT AAAAT ACT ATGT AACTTTTT CAT CT ATTTAGTG ATAT ATTTT ATCG ATGG AAAT AAACTTTCT ACT GT
MOUSE SEQUENCE - CODING

ATGCGGGCGTTGGCGCAGCGCAGCGACCGGCGGCTGCTGCTGCTTGTTGTTTTCT 
AGACCTGCCTGTGATCAAGTGTGTTTTAATCAGTCATGAGAACAATGGCTCA 
GAGGATCCCCAGAAGACCTCCAGTGTACCCCGAGGCGCCAGA

CCAGCCGCACTTTGATTTACAAAACAGAGGAATCGTTTCCATGG 
TCCATATTCAGAGCGAACGCXXTCAACTACACAGTACTGCT 
TACTTTAGGAAGATGGAAAACCAGGATGCACTGCT^ 
CTCCCACAGGGAAAGCTGTAAAGAAGAAGGCCCTGCTGTTGTCAGAAAGGA

TCAGATG CTGTGCT AGAAATG CACTGGG CCG CGAATG CAC CAAG CTGTTCACCAT AGAT CTAAACCAGG CT CCT CAG AGCACACTG 

CCCCAGTTATTCCTGAAAGTGGGGGAACCCITGTGGATCAGGTGTAAGGCCA 

GCTGGAAGACAAAGCCCTGGAGGAGGGCAGCTACTTTGAG 

CCTirGTGTCTTCCGTCGGAAGGAACXiACAC CGG AT ATTACACCT CT CTTCCTC AAAG CAC CC CAG C CAG T CAGCG TTGGTGAC C 
ATCCTAGAAAAAGGGTTTATAAACGCTACCAGCTCGCAAGAAGAGTATGAA^

GTTTAAAGCGTACCCACGAATCCGATGCACGTGGATCTTCTCTCAAGCCTCATT^ 

acagcatatctaaattttgcgatcataagaacaagccaggagagtacatattctatgcagaa 

atxsttcacgctgaatataagaaagaaaccn'caagtccragcaaatg cctcagccagccaggcgtcctgttcctctg atgg ct accc 
gctaccctcttggacctc^aagaagtgttcggacaaatctcccaattgc^ 
ctaacagaaaagtgtttggccagtt^tgtcgagcagt^

gcgtacaattctatgc^cacgtcttgcgaaaccatctttttaa 
ctatgcgaccattgggctctctctccccttcattgttgt^ 
agactcagctccagatgatccaggtgactggccccct 
aagtgggagttcccgagagagaacttagagtttgggaaggtc 

tggcattagtaaaacgggagtctcaattcaggtcgcg^

tgt cgg ag ctcaaaatg atg acc cacctgggacaccatgacaacat cgtg aat ctg ctggggg catg cacactgtcaggg ccagtg 

tacttgatttttgaatattgttgcratggt 

g arrm 'aaggaacataatttcagttcttaccctactttccagg 

tacacccgcccttggatcagctctcagggttcaatgggaattc^ 

CTGGCAGAAGAAGAGGAGGAAGATTTGAACGTGCTGACGTTTGAA

ACTTTGGACTGGCCOSAGACATCCTGAGCC^CTCCAGCTAC^ 

GAGAGCTTATTTGAAGGGATCTACACAATCAAGAGTGACGTCTGGTCCTACGGCATCCTTCT 
GAACCCTTACCCrGGCATTCCTGTCGACGCTAACTTCT 
CAGAAGGGATAT ACTTTGTAATG CAATCCTGCTGGGCTTTTGACTCAAGG AAG CGGCCATCCTTCC CCAACCTG ACTTCATTTTTA

GGATGTCAGCTGGCAGAGGCAGAAGAAGCATCTATCAGAACATCCATCC^TCTACCA 
G CT CAGAGC CCAG T CG C CACAGCGC CAGGTGAAGATTCACAGAG AAAG AAG TT AG 



HUMAN SEQUENCE - GENOMIC 
CACTGAGACGCACATCTTATCTTTCAAACAAGTCAATGGACTAGAATAA^
TGTAAGAATCATCTGAGCAAATGTTTAAAATGTGGATTCCT^ 
CAGACTCTGCTTTTTTAATGAGAACCTTTGGTGATTCCTATCACAGCT 
ATGTGGCTATTGAGAGCTTGAAGTGTGTCTG<3TCCAAATTGAGATGTACTCT 
GTATGAAAAAAAGAATGTAAAATATATTATTAATAATTTTTATATTGGCCGGGCrCG 

ATGAAGCCAAGGCGGGTGGATCGCTTGATCCTAGGAGTTTGAGACCAGCCT

ACAAAAATGAGTTGAGTGTGGTGGTGTACACCTTTAGTCCCAGCTACTTGGGAGGCT 

GTCGAGGCTGCAGTAGCCATGATCACGTCACCGCACTCCAGCCTGGATGCAGAGCAAGAACCTGCC 

TAAAATTAAAACAATTTTTACATTGATTATATATTAAAATG 

TT AATTT CAC CTG CTTC CTTTT ACTTT AT ACAGCCAACAG AATTTGT AAATGATG T ACAAGGCTTGAAG TTGTGACT CACATTCTG 

TTTCATTGGCTAGCCCTGTTCTAG^TGAAGAGTAGGAGTTCTTTTCACCTTGATTCCG

(TTCrTTATACCTAAAAGGGATTTGTACTAAAAGTCCCTTGTTC 

CTGTCTCTTACTGAAACCCCCCAATGTTCAGAGCATTCATTTTTAAAGTAATTAAGAATGCAGA 
CACACTCCCCCAAACTCAGGGTTTGTCAACAGTGGCGCTGCTGGCATGTTGA 

T G CACTG T AGG AT ACTTTTCAG CAT C C CGGGTCTC CTCCCACTGGATGC CAGGAG CACATTC CCTTTTCC CAGTGTGACCAAAAAT 

AT CTCCAG ACACTG C CAAATTG AG AAAAAT AAC CC CTGG CA rri'l'TriTTTlTTTTTTTTTTTG AGAAC CATTGCCTT AGAGG CAT

GAACCCACAGTCATCGTAAGCTGTTATTGAGAAATATCAAGCTTTT^^ 

TG AAGTGCAGTC^3CACGATCTTGG CTCACTG CAAGCTC CTGGGTT CAAG CAATTCT CCTG CCTCAG C CTCCT AAGT AG CTGGG ATT 
AT AGGTGCATGCCACCATGCCCAGCTA A ' l"r T "r T T GTATTTTTAAT AGAGACGGGGTTTCrTCATGTTGGCCAGGCI^TTCTCGAAC 
TCCTGACCTCAAGTGATCCAACCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCC^ 

TTAAATGTAGAAGGTTGGAGAGTCAGCCCCTCTTTACCCTAGTTGACTT

TATCJVCTGTCCCCTCTCTCCTTTGTATCCTATCCCCATGTTAGATAAGCTCTTAGCATTATGG 
AAC^GAGAAGGGGTCGJ^TTCAATTTAGTTCAACAAATATTACTTT 
AGTGAATAAGGTAATGTTGCTTTCCCTCTCCCGCAAGAATGTACACAGATTGCT 
AGATTTTACAGAATCGCCAGTTGGGGACC^TGTCACTGACATATGGGTC^ 

TCCATGGCTACAATCCATACCCATGACCCCAGATTGTTATCCT

TATAGCGTTTTGTACCTCACTCATTCCTACTTATCTG^ 
TAAGAATAGGTATGCTGCCATATTTCTCTATCTTTGTACAATGCCTTG^ 
CATATCTGGAATGCAACTCACAGTGCAGTCTGGGCCAAAGATGAGTCGACAGCATCCTCT^ 
CITAGCACAACACCCAGTACTCAGTGTTTATTGTTATT^ 

TGGTACTGATAATGTACCAGGTGTTATGGGCCACTGACGACACACAGCAAAGGAAAGATGGGCC

GCTCAAAGTACTGAACAAGAAACGAACAACACAAAATTTGAACAC^ 

CTCTATGTTGAGATGGGGATGGAGGACTAGATTATAGTCAG 

AGACAGAGTCTCACTCTGTCGCCCAGGCTC<3AGTGCAGTGGCATC 

TTCTCCTGCCTCAG C CT CCCT AGT AGCTGAGATT ACA_a^ CATGTGC CACC CTGCC CAGCT A1~1TTTTTGT ATTTTT AGTAGAGACG 
GGGTTTTGCCATGTTGG CCAGG CTGG T CT CGAACTC CTGAC CT CAAATGATACG C C CACCT CGGCCTGCCAAAGTG CTGGGATT AC
AGGTATGAACCACCGCGCCTGGCCCTTCTTTTT^

GCGCAATCTTGGCTTACTGCAACCTCCTCCTCCTGGCGGAGGTGATTAAGTGATTCT 

CCTG C CTCAGCCTCT CG AGT AG CTGGGATTACAGGCATG CACCACCACACT CGG CT AATTTTTGT ATTTTT AGT AG AG ACGGGGTT 
TCACCATGTTGGCCAGGCTGGTCrCAAACTCCTGACCT 
AACCACCTTTCCTGGCTTCTTTTTGCTTTTTAGAGTCAGGGTCT^

GATCCTCCCGCCTCAGCCTCCTGAGTAGCTGCAACTTGGCGTGTGCCACTC 

GTTTCAAATTTACTAATAAGTTGAAATAATAGTGCATG^ 

TTGGTTTGTtlAAGTACAATATTTTAGT 

GCCCCTGTAATAACACTACCCCTGAAAAAACTTTGGTGTGGTCATGGTATGTG 
GCTTCATTTCTCCAATTTACTTCCTCAAATATTACGCCAAAGTTTACT

AGCCGTCCTGGTGTCCTGGTCCTGATAAAGGTTTCCAGGATATTTCACCCATTAGTTTG 
TTACAAATTCCAAATCTACCTTAGCITTTTCCACTTTTATGGGAAAAAAA 

TTCTGAAACCTTCTCCTTAGCCTTGCCTAAGCCTGATGTTTTC Cr rr i 1 1 ' C CTG AAAT ATTT AATTTTCCAACT CTGTGAATCT AAG 

TTTTGCCCACAGTAGGCACTAACCGAGTGCTTACGGGGTAAACAGCTGTGTGGTGGTCGTTTG 
CCTGCTCAGAAGAACCACAAACTGAAAAGCAGGCTGTCAC^

AGGAGTTGTTTTTTTTTTTTTCCTATTTTCTGAAGGAGTTGAGC 

GAGTCTAGAGAAATACCAGC^VGCTCCTACCrAAATGCCGTTCCCATTCA 

TTCCCAAGACCCCACTCAGTTGTAAGTTATTAACCACATCTTAAGAAGCCATTCTGT 

ACCATGTGCAAAATCCTATAGGTGTATTACCTCATTTAACTCTCATAATCCCATTTTATGA^ 
TTGCCCAATCACAGAAGGCAAAGCCAACACCCTTTATTCCT^

CAGAG ATAACATTG CAG ACTTTTTCT ATG T AAG CACCATGATAG AAACAT ACAAT ATGG ATCTTTG T AT AAACATG AAAT AAGGT A 

CCATTTGTAATCTGATTTTTCACTTAAAATATATC^ 

GTAATGGCTGCATAGTATTCCACTTTTCAGAAATTITTATTTTAACTAACTC 
TCTCTTTCTTrGACATTTTTATTTATTATTATTATTATTATTATTATTA 

CTCTCACCCATGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCATC

TAGCTGGGATTACAGGCGTGTGCGACCGTGCCCAGCTAATTTTT^ 

TGATCTCGAACTCCTTACCTCAAGTGATCCTCCTCCTTTGGCCTC^ 

CAAGAAACCCCTTTTCTTAGTTTTAATGAGAACGCATCTTCTTAGACACCGAGG 

TCCCCTTGGAATTCCCATGGGGTCAGGACACGGTTGTCATTGCAGTTGTCATTGTTTC

GAT^AAATGTTTCAAGTGCTATCTCTTCTAAACCATGGCAAACCACACTCT 

TTACX5AAACATTTTGCTTTGGTCTGCCTCATGATAAGTAATTGTACT 

CATGCTCTCTGGTGAGCCACACTGGCTTCATTACCT 

CTGGCATGGAGCTCTGTTTCCrCITCTCCTCCCTAGGGTTGTAATCTGGCTTTCCCCCAATCTTTCT 
TGATTTCCTCCTTGTTGTCCAGAAATAGGTCAGGATAAGAAATTCCCCTTTTTGACTTCTTCAGCCT 
GAAGGTTAAACACCTCATTCAATGCTCAGCATTTTTTT^ 

GTGGCTCCAGCAATTCCCCTACCTCGGCCTCCCAAGTAGCTGGGACCACAGGTGTGTGCCATCGC^ 

AAATTCTGTAGAGACAAGGTCTTGCCATGTTGCCCAGGCCG

AGAGTGCTGGGATTACAGATGTGAACCCCTGCACGCACCCAAAGGTCAACTTTTAACTGCCT 

TGAATTCTCTTGTTCACAATACACAGTCAGAATTACCTTGGCTTACGAGTTCAGG 

CTCAAAAGCTGCAATTACXSCAGTGGCGTTTTGTGGATTTTTTTTG 

CTTCTGGCCTCAAGCAGTCCACCCATCTCAGCCTCCCAAAGTGTTGGGATTACAGGTGTGTTTTG 

TTCTCTTCTCCCCTTTATCTTCTGTATCCAATCTAGCCCCTGCCATTC

CCTTTCCCATTCTCCTCCATTAAACAACGCTCCCCTTGTCATGCT 

AACTTCCCACACTCCAACTGCTTATTACCCCTCTACCTTCACCACCAAATTGTAAATTCTGTGAG 

ATTTGCTGTTGTGTCCTCAATTTCCGTCACAAACAGGTTCTAACATTGACAGTTGAATGAAGGAATGA^ 

TGCTACAACTGCCTTTAAACTTCCAGTAAACTTTTGCT 

GACTCATACCTGTTATCCCAGCACTTTGGGAGGCCGAGG

ATAGTGAGACCCCATCTGTACAGAAAAATTACATAAAATAAAATTAAGGCTAGGTGCAGTGGCTCATGCCTGTAAT 

TTGGAGGCCAAGGAGAGAGGATCACATGAGGTCAGGAATTCAAGACGAGTCTGGCCAACAT 

ATAAAAATTAGCTGGCTGTGGTGGCGTGTGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCT 

GCAGAGGTTG CAGTG AG CCAAGAT CACGCCACTG AACT C CAGC CTGGG CG ACAGAGTG AG ACT C CATCT C AAAAAAAAAAAAAAAA 

GTTTTATTGCAATGCTCAGGGCAGTAAGCATTGTTGTAGAAAATAAGTAATTTGTAl 'Tl'l

TTGGGAGGCTGAGGTGGGTGGATCACTAGGTCAAGAGATGAAGATCATCCTGGCCAACATG^ 

ACAAAAAATTAGCCAGGCGTGGTGGCGGGCACCTGTAATCT 

GGCAGAGGTTGCAATAAGCCGAGATAGTGCCACTGCACTCCAACCT 

TAAGT AAAT AAAATACTGTTCATGAT ATT CTG CTC CCATGAGAAGCTTG CCGTTC CT C CT CATTG CAGC C CT ATGACTTCCCG TCT 

GGCGCCTTCAGTCGCTGCTTTGTCTCCTCCTTGGATGCCT^

CTTC^GCTCTACCTCCTCTATGAAGCCCCCGAACCTTTCCCA^ 
TTTAATGCTTTAATGATTCCACITGTATGGGTCCTGTCTATCCACCCC^ 

AGGTCTTTTGCATCCTCCACACTCCTC^TCTAAAGCTGCGTCCACTCTGACTCACCTTCCTAAC 
TTTTTAAACTTTAAGTAGGCTAGGCCGGGCGCAGTGGCTCATGTATGTAATCCCAGC 

AAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAACCCCTGTCTCT^

CAGGAGCCTGTAGTCCCAGCTACTCCGGAGGCTGAGGC^GAGAGTGGC^^ 

AGTGCCACTGC^CTCCAGCCTGGGAGACAGAGCGAGA 

TCGAGTTAGATATCTrCGGTACAAGCCTTAGATTTCTTTrTCTGTTT^ 

GAGTT CACX3 AAGGCATATTCAACAGCTCATTG CTGCCCTGCCT CTCCTGC^CCATCCAAC ATCTCACTGCAGTATCCAGCAACCCA 

CAGGCCTGCTGGGCGTGGTGGACAGTCCCCAAGATCAGGTTATCaSTGTCAAAATAT^

ACTAGAGCCTGCTGAGAGCACTCAGAGGTTTCnX5CACAAGCTCAGTGGATTTGAGCC 

AG<5GTTGGTCATAGGTCCTGCCTTGGACTGTCATTTCAGGAAGCA 

GATCCATTTTGTGGAAGACAATTTTTCCATAGATTGGGGGTAGGGGGATGGTTTCG^ 

GGATT AGATT CT CTT AAGGAACGC a .CAACCT AG A CCTGG CGTGTG CAATTCACAGTGGGGTTT CCGCT CCTGTGGGAATCT AAT 
CCCTCTGCTGATCTGACAGGAAGCGGTAAAGCTCTCTTGCCT^
GACAGTTACCAGTCTAGAGGCTGGGGGGCTGGGGACCTCTGAACTACAGGCTTCTCCACCT^

TCCAGAGCAAGAGGGGCAGGAACTTCCTGGTCrCACTGAGAATCTTCGGCCAATCT 
TGATCATCTTAAAATGCAGATTCTGAGTCAGTGATTTGAAAG^GACCT 

ATGGCGATGTTGATGCTGCCTGTCTGTGGACCACACTTT^ 
TAGCAAGTGAGGAGCATTTCAAGGAAGACACCTCCCATGCACACrrGTCAT^
TGTCTTTAATTCTTTTTCAAACAGCATATCCAAGTTTTGCAATC^ 

ATGATGCCCAATTTACCAAAATGTTCACGCTGAATATAAGAAGTAAGTTAAATTGCCGAGAATTTC 

T AATTTTTT ATT AGTCTTAACT AGGAAAAGTTAAGT AATGG CTTAG AAGG G CT ACT ACAAAACTGTTTTTGTACG CTGAAG TGTTT 

TGAAACTTT ACTCT CAT AT CT ACTGG CATTGATCAT AT ATCTCTTT A'lTTTIT ATGTTA1 
GGTCTCACTCTACCACGCAGGCTGGGGTGCCATGGC^C^

TAAGCCrCCCGAGTAGCTGGGACTACAGGCACGCGCCACCATGCrrCAGCTAAT^ 

TTTGTTGCCGLAGGTTGGTCrCAAACrCCTGGCCTTAAG 

CCACTGCACCTGGCCATATATCTCnTTCTGTGTGTATCAT 

TGTTATGACTTTATTGATTTTAGTATTTTGAAAGGTCATGT^ 
AAT AATTTT CATTT CTTTGG CTT AT AAAACAATGAGAAG TCCAACTTCTCCTGAAGCT AAG AAT CATTTT AAGGCATG CAAACAGG

TGGAGGTCTGTACCTCAGCTCTCTCCAGGGAGAGGCTG^ 

GCATGGCAAGGAGGAAAGAAGAAATTACCAGAGGTCTGGAGGCATCCTAAATGAGA^ 

GT AAATGGGT AGC CTGT ACTGT AGG CAACTTTG AAACT ATTTCCAG AAAG T AC CAT ACT C CATGAG CTT ATTTTATTTT ATT ATTT 
TCAGAGACAGGGTCTTGCTGTGTTGCCCAGGCTGGTCTTGA^ 
TGGTATTATAGGAGTGAACCACTATGCCCAGCCAGTGAGCT

AAG CATCGG CAAG T CAGGCGT C CTGTTT CTCGGATGG AT ACCC ATT ACCAT CTTGGACCTGGAAGAAGTC CTC C C 

AAGTAATAAGGACATTGATGCCTCTTATAATGGAAGAACATAAAACTCTCTGACCT 
GAAATGATGAAATATTTAAAATGCTAACCTTCTCAATCAACCATAACTCT^^ 
ATGAATCTCTTTTAGAACTCGCTGCAGAAATCCTAACTTCCAATCTGATG 

CTCAGCTGCACAGAAGAGATCACAGAAGGAGTCTGGAATAGAAAG^

TCTAAACATGAGTGAAGCCATAAAAGGGTTCCTGGTCAAGTGCTGTG 
TAAACTCTCCAGGTACAACAGTTGTTCTTGTGATATTCGACC^ 
ACHTGGGGAATTTTACAGGAGAGTATAGGGTCAGTGTGTGAGAAAAG^ 
CACAGGrTCGGGCGGAGTCCTCCCTGTGCCCTTGTGACTC^TGTGTCA 

TCTCCATGGAAAGATCTAACrCTAGAGAAACAGATTTTGTCACTTGGA

GCCCrCAGCTTGCrrGTTTTATAATTTTTGGTTTGCTTTTTAGAGCCCT 
AGTAGTTAGGATCACCAACCTCACAAAACTGACTTCTCC1ACITTCA 

CTGACTTG T ACAT C CTG T AGG TTT AAATC CATGCATTTTTTTTTTTTTG AGATGGAGTCT CT CTG TCACC C AGG CTG C AGTG CA 
GTGGCGTGATCTTGGCTCACCTCTGCCAGGGTTCAAGCGATTCTCCTGCCTCAGCCT 

CACCACGCCCAGCTAATTTTTGAATTTTTTTAGAGACGGGATTTCACTGTGTTGCCTGGGCT

GATCCTCCCATCTTGGCCTCCCAAAGTGCrrGGGACTACAC^ 

CCTAGTAAATGGTTTTAAAGAATGGATTGAGAATAATCCGCAATTTTCTAGGGAGG 

TTGTCTCTGGGGAGTAGAGTATATGTAGAGTGGTTGTTAC^CTGAAAATGATTATTACTGAAACAGGATGTGAGAGATTATAATG 
AGTTGTCCACTATTTATAATGTCACACAGGAATTCTGTTTCATrc 

C^GACAACATCTCATTCTATGCAACAATTGGTGTTTGTCTCCTCT^

AAAGG TAAAAGCAAAGGT AAAAATTCATT ATT CTTT CCT CT ATCTGCAGAACTG CCTATT CCTAACTG ACT CATCATTTCATCT CT 
GAAGCAATTTAGGTATGAAAGCCAGCTAC^GATGGTACAGGTGACCGGCTCCTCAGATAATGAG 
AATATGAATATGATCTCAAATGGGAGTTTCCAAGAGAAAATTTAGAGTT^ 
TTCTTTTCCATTGGAAAATCTTTAAAATGCACGTACTCACCATTTGTCTTTGC^ 

AGTGATGAACGCAACAGCTTATGGAATT AG CAAAACAGGAGTCTCAAT CCAGGTTG CCGTCAAAATGCTGAAAGGT ACAGTAT AGT

GGAAGGACAGCAACAAAGATGCCCAAAAATGGGAGGCAGAGTTTCCC^ 
CTGTTTGCCITGTTrCAAGGCTAAAAATGGAGTGGATTGGGGTC 
ACTCCACCTCTTCTTCCACTTCTCTCTTTGTTTATTTTTAUT 
TGGAGTACAGTAGTGTGATCATGGCTCACTGCAGCCTCCACCCCCTGGGC^ 

GACCACGCTCAGCTAAGTTTTTTATTTTTGGTAGAG

CTATCTGCCCGCCTCGGCCTCCCAAAGTGTTGGGATTACAGGC^ 

T ATTTGGGG AT ACAGTTG C CACAGGCAGAAGTTCTTTCTTACATTTGTGTTGTTTTATT^ CTT 
GCTCTGTTACCCAGGCTAGAGTGCTGTGATC^TAGCTCACTGCAGCCTCGACCTCCT^ 

CACAGGAG TG TGCCAAG ACAATTGGCT AAl'ri'lTTTTTTTTTTGGT AGAT ACGG TGT CT CAC CAT ATTG CC CAG G ATGG TTG CAAA 

CTCCTGGCTCAAGTGAGGCTGCCTTTGGCCTCCTAAAGTGCrGGGATTACAGGTGTGGGTTG

ACTGGGGCCCTTTGT ATCCTCACTTCGTTGCTCCATCCATTCCG ATCTGG CGTCCAGCCCCATTGCTGG CCTCACCGTTACCACCG 
CTGACCCAGCAAAAGCATCTCCAGCTGGCTCTCTTGCTAAGGTCAACAGTGACT 
AGTCCTCATTACTTGATTGTTCAGCATGAATTGATGCTGTTGATTCTTCTCT 
CAC CAT ATTCTGGTTTTCCTCCCACCCCTCTGCC CTT CTCAGTCCT 

GCTGATTTTGAAACTCAGTCCTTTGTGGGCTCATCTTCTTCCA

TCTGTTGCCCAGGCTC^AGTGCAGTGGTGTGATCTCAGCTCACTGCAACCTCCACTTCC 

GCCCCTGAGTAGGGGGGACTACAGGCATGCGCCACCATGCCTGGCT AATTrrrGTATTTrTGGTAGAXSATGGGGCTTCGC^TGTT 
GCCCAC^CTCCATCCAGCTTTTTATGGGGCATCCCTGTGGGCTTGCTCCTGGACCGCCTTG 

GTTAAATGTTTGGATGTCTCAAAGGTCCCTTTAACACAGTGTA^

TCTCTGTGACTGGTAATGAATGCCAGTGAATAAATAACCACATTCTATATAG^ 
GACCTTCCTCTTTCTAACACCCTGTGAGCTAATCCTATCAGGTTTCCGAT^ 
TATGACCACC^CTCTAGTTCAAC^CTCTCTCCTC^ACAAAAGCTTCCAAAGCAGGTTCCT^ 
TCTCCATACTGAGGTCCAAGTGACAGTGGCGAAAGGCTAAGCAGATCAAATGGCTTTCCTACTT 

TGGCTGGGCCACCGCTGCCTCCCCACCTCTTGCTCTCC^C^CCCCAGAGCGCT

CACCCACTGTGGGGCCTTTGCACAAGTGGATCCTTCrGCTTCCCCTTCCC 
TCAGCTTAAACACCACTTCCTCTGTCTCACACCACAGACAAT^ 
CCCCCGTAGAATCCATCACCTTTGTCATGACATTTTTTAACATCCAATTG 
GGG^GGGTCTGTOTCTOTGGTGCTGTGCT 
CCTTCCACCATGAGTCACCTGGCCAGCTCCCATTTTTCTTCAGGACT






WO 03/008583 



PCT/US01/51291 



GGCTAAGGTAGCTATCCCTT'l l'GACCCTACAGTACCCCTGCACTCCT'l'I'GCCACGTTGCAr 111 ITCTTTTTTTTTTGAGACGGAG 

TCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCACCATCTCAGCTCACT 

CCTCAGCCTCCCTAGCAGCrGGGACTACAGGC^CCCACCACCACGCCCGG 

CACCGTGTTAGCCAGGATGGTCTTGATCTCCTGACCTCGTGATCCGCCTGT 
GCCACCGTGCCCAGCCCAACGTTGCATTT TITCI^

TGGAGTGCAGTGAAGTCATCATAGCTCACGGCAAGCTTCAAGTCCTCGGCTCAA 

CTACATGCAGGCACCACCAAGCCCAGCTAATTTTTATTTTATTTTATTT^ 

GTAGAGACAAGCTCTTGCTATTTTGCCCGGGCCTTTCTTGAACTTCT 

ATGTTGGGATTAGACGCGCAGGGACCCACGTCTTGCCTGCCGTATTGCATTTA 
TGCCTTCTCCAGGAGACTCTAAGATCCAGGAAGACAAGGGTTGTGTCTTGl'l M rrrC

CTTACAG AG TAGG CACTCAATAAAT ACrTGTTGAATGACTG AATGAAAG AGGT AATGG T ACATG AAAATGCTTAGAAGAAGCTAAG 

CATGCTAAAAAATGTAAGATATTATTTTAGCCATGAGGAATAGGACATGAACAAGTAAAGCCCCGAA 

TGAAGACATGATGTATTAGGGGX3TCCCAGTGCCCACAGGAAATCCCACCCCCAAGCTA 

AGCTTGCCOVAACCAAAATAGAGGATGTCAGCACCGGAGAAGCCACT 
CTGGCAATCGAACCAAATCTTCACTGAATGTAAAGAAGAGTTACTATATCTGGAAGAAAAAGTCTG^

GGAGCCTGGAGAGGGAAGTCCAGAGACCCTGCCCTTGCCCCTGTGGCTGCCCGGGCTC 

GACGGGAGCCCTTGGATTCCATAGCGACTTTCCTCACACTGTGCTTCTAAG 

ATTTTCCCTAGAGAGAGAATX3CATTGGCCAACCAATATGTTCTAGC 

T CTTGAG CACCTGATAGTGGGAATT AGTCCTCTTTG AC CTTCT C C CATTGGACTTGCTGG C AAATCT AATGTTCT AGAC CTG AGGG 
CCACAGGCTAGAGTCAACCATGCTCACGACACTGTTATCTCTCTCTGACTA

GGACGTTTCTCCCTTCTGTTAACTTGTACTGAGGAGGAAAAGCATGTGTTATTATTTTG 

GG ATTTTT ATTTTGGT AGG T AGAGG AAAGTC CTGTTGAAAGGTGCAT T AC CAG ATT CAACATTTAT ACTAT AGTCTTTG TTTTG TT 
TTGTTTTGTTTTGTTTTTGAGACAGAGTCTCGC^ 

TCTGCCTCCCCAGTTCAAGGGATTCTTTTGCCTCATCCTCCTGAGTAGCTG 

TTTTCTATGTTTAGTAGAGACAGGGTTTCAC

CCTCCCAAAGTGTTGGGATTACAGGTGTGAGCCACT^ 
GGCTCTlTGCCTGTATTCCCAGCACTTTGGGAGGCGGAGTTGGGAGGA 

TGGTGAGACCCCATCTCTACAAAAAATATAAAAATTAGCTGGGCATGATGGCCTGTGCTTGT 
GGTAGGAGGATCACTTGAGCCCAGGAGGC1AGAGGCTACAGTAAGCTGAGATCATGTCACT 

GTTTGTTTGTTTATTTATTTATTTATTTTAAGATGGAGTTTCACT 
ATTGCAACCTCTGCCTCCCACATTCAAGCGATTCT^ 

TGGCTAATTTTTGTATTTTTAGTAGAGACGAGGTTTCACCATATTGGCCAGGCCAGTCI^ 
CACCTCGGCCTCCCAAAGTGCTGGGATTACAGGC!ATGACCCACCATGCCTGGCT 

CAGATTGACTCTGAGCTGAGAAAAATTTTATTTAAT

TTTGACAGAAAAAGCAGACIAGCTCTGAAAGAGAGGCACTCATGTCAGAACT 
TTGTGAACCTGCTGGGGGCGTGCACACTGTCAGGTAACCC&CTTCCACGA 
AAGAGGATCTTTGCTCTCTCTCTCTCTCTCrCTTTTTAATTCT 
AATTAGACACTGAGCACTCTATAAATCAGAGGGGTCAAAATAAAGAATGCCTCATTC^

TCTCACCTGCTTAAAAACAACAACAACAACAAC!ATTGCAAGTACA

GCACAGGCGTGGTGAGAGTGGTTACAGAGCCTGGCCACTGCCCCAGTTGGGCTGCCGT 
CAGACTTTACTTTC CCAG AATTCTCTGGGGCATGCC CTTCTC ATCT 

TTCATT CTTTCTGCAGCAGCAGGCCTGGGG CTGGTACT AAAAGG AAAAAGAGG AAAGTGAATCTGAG AGAGGACTGGGAAT ATATT 
TCTCCCTCCTCAAGCAAACAGCAAATCAGCATATAGTTAC^ 

CAACTGGAGAATCCAGGAGGCCAGTGGTATTGGTGAGGCCACACATACAAAAAAGGGCCACAGA^

TCCAAAGAAATTTGAGCAAAAGCTGAAACTGAATTTAAAGTATAAGAAGAGCTAG 
GCAGAGAACCAAGCCCTCCTAAGAGTATGTTGTCTGCTACATAGACTT 
AACTGTAACTATTTCAGGACCAATTTACTTGATTTTTGAATACT 
AAAAATTTCIACAGGACTTGGACAGAGATTTTCAAGGAA 

TAAG AGG CT GGG TC AGGG TTTCGT AATT ACACAT CAT AG AACGT AG G C AAAGTGTGCTGT TCAAACACACTT CGTTG AC CAGTTT C

TAAAGGCACCGTGAAGTCTGTGGGTCCTGGAACCTAGAGATTTCCTGAATCT^ 
CTGTCCCTTTCCTTCAGTACAGGTACTATAGCTGAGCTAGTCTCTCTCCACCCATGCCTG 
TTGGGGAAAGGTGCATTATTCXX3TCTTTCAGGACACCTGGTTGCCTC 
CCTTAACATATGCAAAGAGAGCAGCTCGGGTCCTGGGACTATTG05CAAAA 
GATGGATTGATCGACACGTGATGGXSGACACAGTAAAGTAGCAGTAACCT^
CCTGAGAGATGCTTGGTCCTGCAGGGATTTATTGCT^ 

CCTTGAACTTTCTTGCTAATTACTTGAGCATCATGGCTTTGTAGCACCTTTCTCT 
TTCGTTGAGCCTroGAGCATT LXmiTl M lTrr i" l ' lU 'TTTT U"l 'C 

GGTGTGATCTCGGCTCACTG CAAC CT CCACAT CC CAGGTTCAAG CGATTCTCTTG CCT C AG C CT CCCG AGT AG CTG AG ATT ACAGG 

CATGTGCCACCACGCCCAGCTAATTTTGTATTTTTAGTGG

CTCAGTTGATCCACCC^CCTAGGCCCCGCAAAGTGCTGG^ 

TATTTCCCTCCCTGTATGGAAG 11 L"I CCCTTATTCTGAAGGGTATTTGTTTATTGGAG 
GGATACATrTGAGAGTCCCCGCTTTGTTTTCTTAGAATCTAAGTTTTAAGT 

CCAGTGTCTTAAAGCCACCAGGCCCCACGGAGATCCCCATGCACGTCCGTGTGCAGCTGCCCTTAACCT 

CGTCCTCTCCTTTACATTCATTTCCGAGCrCATCGTCTAGAAATGAGAACT

ATCCTACTGAGTCCGATGTGTGTCACCAGTATCCACTGGTCAGTACTTAGCCCCTTTGAT^ 
ACACTGAAATTATTCCTTTTGGAATATTTGTGGTCTGAAAGC!ATGAAGCCATTrCCTAAAGATATGT 
ACTCGTATATTAGAATAAATTGAATCGTGATCTACTTATAATTCCTACAGAGCTCCT 
AGATTTGTATTATTGCTATTTATTAATTTCACAGTAAAAAGAAATGAGCT^ 

ACGCTTACGGGACTCTCGGGAAGATCTGTATTATGTGAGGG

CCTGGTTCC^CTCXSAATCATATTTATCTCACTGTGCCTAATTTCrCTTT 

TTGAGTGAGGTCCTTTTTGAACTGTAAGATTCTATATGGAAAACAGGGAT^ 

ATGTTTTTAATGTACACAAAAAATC^GACAAATGGATGGTGTGCTTTTAAT^ 

ATGAATT ACATTTTTAAAAT AAAATG TCTG CTG AG AAG AAT AATGTAAAT AATG AAAAAACAAAATTTTTTAAT AGC^ 
TCAAG AGAAGTT C AGAT ACACCCGG ACTCGGAT CAAATCT CAGGG UTT C^TGGG AATTCATTTCACT CTGAAGGT AATATTTTATT
CTAAGTAGTAGCACITTAAAATGGAAATGAATGCAAAAGTTTTATGTTAGCTATl ITrrAAAAGTGTGTAAACACAAAAGCATTTT
AAAATTTG CT AATTG T CAG ATG AT AGCT AAAAG C CAGTTTGT AT ACTGT AG ATTTT ATG TCAACATT C AACT AAT AAACACTG AGG 
ACTATTTTATTTC^TCAGATTATTCATTTAAAGTCAT^ 
TTCCGAAGTTACATAGAATTTGGATTAACTTCTGC^ 
ATCACAATTTTCTTGTTTATAAAATAGGCCTAACACTACCTTCCTCAC^
TGGCTCACGCCTGTAATCACGCCTTGGGAGGCCGAGG 
GGAGAAACCTTGTCTCTACTAAAAATACAAAAATTAGCTGGGTAT^ 

GCAGGAGGATTGCTTGAAC CCAAG AGGCAGAGGTTGCAGTG AG CTG AG ATTG C ACCACTG CATT CCAACCTGGG CAACAGAGTG AG 
ACTTCAACTCAAAAAAAAAAAAGAAAGAAAGAAAAAAAAAGGAAGGAGTAAATGGAAGAAT 
AATAGGATAGATG<^CCGGGCACAGTGGCTCACGCCTGTAATCCCACCACTT^
AGATCAAGACCZATCCTGGCTAACACGGTGAAACCCTTCTCTACT 

CGGGAAGCTGAGG CAGGAGAATC CAGTTGCAGCAGGCCAAGATCGTG T CACTGCACTC CAGC CTGGGCG ACAG AG CG AG ACT C CTT 

CTCAAAAAAAAAAGGGATAGACAATTAATATTAATTCCTTTCCCACT 

CTCCTGACTAATCAATCAGGAGGTCATCGGTCATCAAAGCAGTTCCAAAGTTTATCTCTAA 
CATAGGCAGCATAAACAGGTCATCTCTAATGCTGCCGGGAAGATAATGCGAT^

TTGAAT AAC CTTC CCATAGAG ATTTAATAAGGCAACTTCTT AG AG CAAAAGTGAAGACTGG CTCT ATTTT ACAGATGAAA 

TTGAATATGAAAACCAAAAAAGGCTGGAAGAAGAGGAGGACTTGAATGTGCTTACATTTGAAGA 

GTTGCCAAAGGAATGGAATTTCTGGAATTTAAGTCGGTATGCTCCTT^ 

GTGCTTATATATG1"!' riCTTGTTT ATTTT AACTTGTATCCCTTAAAGT 
CGTGTTACCTTGGCTCTTGX5CTAATTGTAX3TAGCATAAAGGC

TGCACAAAGTCCCCTGAAACTTCCAAGGCTGAAGGTCGAGATAGGAAGTCCTCAGAAATCCTTCCT 

TCAGTCCCITTCAGAACGGCTGTTCAGGTTGTGCCCTGCATGTGGGCATT 

GCCACCCTGAGTGTCTTTGTCTAATTCTCACCCTGCGTGCTGTGGGCTGGCAGC^ 

AACCCCAGATCCATTCGTTGGT AAGGCC CTGGAGCCTGC CGTG AACCATGTCTCTACCCCAATTTTGCACTCTTTTTGAACTGCCT 

TTCCACCCGCTGTTCTCTGTGATTrCAAATGTCTCCTGCCTATAAGCAGATCAT

TGCTTGCCTTTCATTACCTTCTCTTTATTTCCCAAAATTCTT 

TCCTCCACTTCCTTCCCCTTGCCTClTCrCTCCTGTTTTCT CT CTGAATTTGGAAAGACT ACTTT C AATTGT ATT'rT'riTGTrXTG 
TTTTGTTTTGTTTTTGGAGACAGTCTCGCTCT^ 

CCTGGGTTCAAGCAATTCTCATGCCTCGGCCTTCCGTGTAGCTGGGACC^ 

TTTTAATAGAGATGGGAATTCACCATGTTGCCCAGGCTGGT

AGTGCTGGGATTACAGGCGTGAGCCACCACACTCGGCCTTTTTTCITTAAAAAAT^ 
T CTCCAGTTCCAGCTACTTGGG AGGCCAAGGTGGGAGGATTGCTTGAGCCCAG 
ACTGCACTCCATCCTGGGTGACAGAGCAAGTCTCTTTCTCTAA 

ATGCCAACACTTTGGGAGGCCGAGGTGGGTGAATCACGAGGTCAGGAGATCGAGACCATCC^ 

CTATACAAAAAAAATTAGCCGAGTGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTTGGGAGG

CTGGGAGGCGGAG CTTGCAGTG AG CCAAG ATGGCGCCACTGCACTC CAGCCTGGGCGACAGAGTG AG ACTCTGTCT CAAAAAAAAA 
AAAATTAATTAAAAATAAATAAATAGGCTGGGCACAGTGGTGTACGTCTGTAATCCCAGTGCTTTTC 

CAGG CTG CAATG AGCT ATG ACT AT ACCACTG C C CT CCAGTCTG TCT ATT CAAAAT AAT AATGAAATAAG CTT AAAAG CAAT AAT AA 
TACAAGTTTTAAAAAATTACTGTCACAGTAATATTTCC^CA 

CTGTTTC<3ATTCTTCTTGCrCACGCATCTTGCCCACCTGAGTCTTTCTCTATTTCACTT^

CATTTTGTT TTTGT T I'lTGAGACAATGTCrTGCTCTGTTGCCCAGG CTGGAGTGCAGGGGTGTGATCTCAGCTCACTGCATCCTCT 
GCCTCCCAGGTTCAAGCGATCCTCCTGCCTTGGCTTCCCA^ 

TTGTATTnTGTAGGGATAAGGTTTTGCTATGTTGGTCAGACn'GGTCTCAAACrCC 
CCAAAGTGCTGGGATTACAGGCGTGAGCCACTGTACCTGGCCCACGAACATC^ 

CH'GGCCTGGTTTGCTGCCCATAGCTAAGTAAATACTCAAACTACTCCCAACT

AGGCITGTAATAATATGAATTTTTGAAGAACATCCAGTTTCT 

ATCATGCGTGTCACTGTCCCCTTCTTGTCTCrCACCCCCTCCCTGATATTCCTCTGGGTGCCCCT 
GACTTCATCAAACATTTGTTGAGCCCCAGCTACATGCCAAGC^ 

CCTGGGCTCGAAGATCTTATGAATGATAGGGGAGAOIGAGACAAGCAGACATGCAAGTCGA^ 
TGGCTCAGGCCTGTAGTCCCAGCACTTTG{X3AGGTTGAGACGGT^

TGGCAAAACCCCATCTCTACTAAGAATAGAAAAACAGCCAGGCACGGTGGTGTACGCCT 
GCATGAGAATCGCTTGAATTGC<3AG>GCAGAGGTTGCAGTGAGCAGAGATTGTACCGCTGCACTCCAGCCT 
CTTTCTCAAAAAATAAAAATAAAAAATTTTAAAAAGTTTAAAAAAGACAACACTX3TAGCATG 
TCTGGGAACCCAGX3GGAGGGATCTGCACAGCAGACCTGGTTGGGGGAAGGGGAGA 

TGCTTCATGCITGGACTAATGGAGATTTTTGAAGGAAAAGGAAAACCAGC^

GAGGAGTTTTTTTTGGAGAAAGGTAATGAGTTGGACATACTGAATGTGAGGTG 

ACACTTGG ACACTT CTC CTGTCTG AAG CTC AAG AG AAATGTT TGGG TTGG AAAT AGGG ATGTGGG CAATGGTGGTTG CATC C CT AG 
GTTTGGATGAAATGGCITGGGTGAGGGTGTGGGTTGTGA 

AGGGAGACAGGAGCCGGGAGAGCAGTGGC^AGAAAGCAGGAGTGAACCATGCGCAC^ 

GGGCTGGCCAACAGCAGCAGACACCACAGGGAGGTCCCGTCAGATCGAGACTGTC

CCAGAGCAGCTTGGGCAGACAGGCAGGAGGAGGAGGCAGGITGCAGACCT 

AAAAAACTGGGTATGG AGGAGGG AG CT CT AGAGGGG CTG TGG CACAGGG CTG CCAG TGTTCATGG TG CTG CTTTTT ATTTTTT AAA 
G ATGAAAGAGTGT ACATTGGACCTTTGCTACTAGTG CCCTGTTCCCCTT CAGATCTTCCTTTGTAAAGTTCATTTTCTTGTC^ 
CCCTTTGATTAACATCACGACATCTCTCAGCTCAGGGACTCCTTCTAATTCCAGTTCAGTAAACCT 

TTTTGTTTTTCCGATTATTTCCCTCGATATTACTCC

CGGTGGCTCACGCCTGCAATCCCAGCATTTTGGGAGGCCGAGGTGGGTGGATCACCT 
AACATGGCGAAACCCCGTCTCTACTAAAAACACAAAATTAGCKX3GCCT 

TGAGG CAGGAGAATCGCTTGAACCTGGGACGCAGAGGTTG CAGTG AG CCG AG ATC ACG CCATTGCACT CCAGC CTGGGGAACAAGA 
GTGAAACTCCGTCTCAAAAAAAAAAAAAAAAAGAAAA 

AGATTCCCTCTGAAGGCTCAGGGGAGGGCCCCTGTTGGGGAGCCTCTCACTTTTGCTCC^

GGCCTCTGGGGATGCTTCAGAAGCCGCACAAAGAACTGC^GCCACCATAGCTGCAGAATTAGGTCT 

TG CACAT CATCATGGCCG CTCACGGCACAGC CCAGT AAAGAT AAGAG^ CCTTC CATCAC CGGT AC CTCCT ACTG AAGTTGAGT CTA 
GAAGAAAGATTGCACTCCAGGATAATACACATCACAGTAAATAACACTCTCGTGT 
C03CC3M3GAACGTGCTTGTCACCCACGGGAAAGTGOTGAAGAT 
TATGTTGTCAGC^CAATGTGAGGCTGCTATTTCCTACTTATTTTTATACGGCT
CTG CACTCACTGTGGTGCATTTTTG ATTT ATGGTAACATCAAAAAACCCTCACAG CAGT CTGCTTACTTATGCTTAAAAGGTTTTT

CTGCAGCTTCAGGGAATCTTCCATCTATAATAAAAGCTGAGC^^ 

TCAGTTAGGGGCCTTTGAGTGCAATCCTCACrTTA^ 

AGTGATGATTGAGTTATGTTTTCAATTATGTGAAAAGCTAATAGTCACAAT 
AAATGTTAAAGCTAAATACTTAAATATGAATTAATGAATATTTATCATCCTTTTACT
GAATTGGCTGGGTGCAGTGGCTCATGCCTGTAATCCC^CACTTTGG 
ACCAGCCTGACTAATATGGTGAAACTCTGTCTCTACTGAAAATACAAAAAATTAGCT 
ACTATTCGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCA^ 
CCTGGGCAACAAGAGCAAAGCTCTGTCrCAAAAAAAAAAAAAA 
1 0 AACTAAGGACAGATGTGTTTACAAAAAGAACTATTAGAAGTAA 

ATTACAATACCTGGCTGGGTGCTGGTGGCTCACGCCTTTAATCCAAGGCAC^ 

GGGTTCAAGAC CAGC CT ATGGAACATGGTG AG ACACTCAACTG T CTCTAT AAAAAAT AAAAAACT AG CTGAGTG TGGTGGCACACG 
CCTGTGGTCTCAGCTACTTGGGAGGCTGAAGTGGGAGGATCACTTG^ 

ACTGCACTCCAGCCTGGGCAACAGAATGAGACCL 1 T 11T1T1GAGACCCTGTCTCAAAAAAGAATTCCCTACTTGTGAAATAATAA 
1 5 T AATATTAAATGCTGACTT AG TGCTT ACT ATGTG CT AAACATCT AAGTG CTTT ATG CATATTAATTCATTT ATTC CTCAGAG TACC 

CACTTGGAAACTGAGGCACAGAAAAGTAACTCGCCCATGGTTACACAGC^ 

TGTGCTAACCACTATGTTGTAGTGTTTCCCATGAGGGTTATGCACTTACCTC^^ 

ACGTX5AAAGCACCTTGTCCAGGGTGAACATCCAGTAAATGTGAAAAGTCAAATT 

CTTAGCTGATTTACGATCTACTCCTACTCATCATAGACTGCT 
2 0 TATGTGGGAAAGTGAATTCCATTTGGAACCACAATTA A TTTCTTTAATTAATCA 

CTCATCAGTTTGCATACATT/rAAAAGAGGGTTTA 

ACCCATGTAAAAGAAAATCAGTACAGCCAGGTGTGGTGGCTCA 

ACGAGGTCAGGAGTTCAAGACCAG CCTGGCCAACATGGTGAAATCCCGTCTCTACG^ 

TGCGTACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAACC^CTTAAACC^GGGAGTCT 

2 5 AG CGC CATTGCACTCCAG CCTGGGCAACAAG AG GAAA CAGT ACAAAG CT CTG CC 

CCAGTAACCTACCCATCTTGTCTCTTGATTCTAGTCrGAGACTATACTAAGTC^ 
AGCCACAGAAGCCAGACTGGTATCATTTCTGGAAGTGATGGCGTGGGCT^^ 
AGGTCTTCCCAGAAATGCTCTAACAGAAGATCrCACTAATTCACT 
ATGGAAAGTACCTCTTTCCAGTG^jAAAGATGGCAAGTTA 

3 0 GAAATATGCCTTATGTTTTAGGCrAGTATTATTAGAATGT^ 

ATAAGGCATTAAGAGTGGTCTTAGGAAGATGATGCATTTC^TCTTAATACAAAAG 
ACCATCTTATCTCCTCTGGGTTTTCCAC^GGCCCGTCT 

CATTAAGAGTGATGTCTGGTCATATGGAAT ATTACTTGTGGGAAATCT CAGCTGCTGGCCT CCCTTGG 

CTCCTGGCTTTGCATCCTCTGCTCTCTCGGTGCCCCCCCACCCCTCCCCCQAA 

3 5 ACTTTATGTTTATTTGCTGGTGTCCTTTAGCCTGTCT^ 

TCCCAGCCGCAGCCCGTGTGCCTGClAACAGTGTCrGGCACATGGCAGATOT 
TGAAGAAATGGATGAATGGCTGCAATTGTCTAATGTTTTCCTC^ 

TACAAACTGATTCAAAATGGATTTAAAATC^GATCAGCCATTTTATGCTACAGAAGAAATGT 
TTGAAATCAGAAAGGTACAGTCTGTCTACTTCCAAATGCATGACCAAAAAAAGTCAAACTCCTTAGTG 

4 0 TTTTTTAAAAAATAGAAGTGAGGTCTTGCTGTGTT^ 

TCTCAAAGTGCTC^ATTACAGGTGTGAGCCACCAC^CCTGGCT^ 
GGAGTTTCTGA 

HUMAN SEQUENCE - mRNA 

4 5 CGAGGCGGCATCCGAGGGCTGGGCCGGCGCCCTGGGGGACCCCGGGCTCCGGAG^ 

CGTGCCX3CTGCTCGTTGTTTTTTCTGCAATGATATTTGGGACTATTACAAA 

ATCATAAGAACAATGATTCATCAGTGGGGAAGTCATCATCATATCCCATGGTATCAGAATCCCCGGAAG 

AGACCCCAGAGCTCAGGGACAGTCTACGAAGCTGCCGCTCT 

TGCCCCAGGGAACATTTCCTGTCTCrGGGTCTTTAAGCACAGCTCCCTGA^ 

5 0 TTGTTTCC^TGGTCATTTTGAAAATGACAGAAACCCAAGCTGGAGAATACCT^ 

ATATTGTTTACAGTGAGTATAAGAAATACCCTGCTTTACACATTAAGAAGACCTTTACTTTAGAA 
GGTCTGCATATCTGAGAGCGTTCC^GAGCCGATCGTGGAATGGGTGCTTTGCGATT 

CAG CTGTTGTTAAAAAGG AGGAAAAAGTGCTTCATGAATT ATTTGG^ CTGTGCCAG AAATGAACTGGGCAGG 

GAATGCACCAGGCTGTTCAC!AATAGATCTAAATCAAACT 

5 5 ATGGATAAGGTGCAAAGCTGTTCATGTC^CCATGGATTCGGGCTCACCTC^ 

ACTTTGAG ATGAGT AC ATTCAACAAACAG AACT ATG AT ACGG ATTCTGTTTG CTTTTGT AT CA CAGTGG CAAG AAACG ACACC 
C^IATACTACACTTGTTCCTCTTCAAAGCATCCCAGTCAATCAGCTTT 
TTCAAGTGAAGATTATGAAATTGACCAATATGAAGAGTTTTGTTTTTCT 
GGACCTTCTCTCGAAAATCATTTCCTTGTGAGGAAAAG^ 

6 0 CAGCCAGGAGAATATATATTCCATGCAG AAAATGATGATGCCCAATTT AC CAAAATGTT CACGC TGAAT ATAAG AAGGAAAC CTCA 

AGTGCTCG CAG AAG CATCX5GCAAGTCAGGCGTCCTGTTTCTCGG ATGGATACCCATTAC C ATCTTGGACCTGGAAGAAGTGTTCAG 

ACAAGTCTCCCAACTGCACAGAAGAGATCACAGAAGGAGTCTGGAATAGAAAGG 

AGCAGTACTCTAAACATC^GTGAAGCCATAAAAGGGTTCCTGG^ 

G AT CCTTTT AAACT CT CCAGGCC CCTTC C CTTTCATCCAAG ACAACATCTCAlTCr ATG CAACAATTGG TG TTTGTCTC CTCTTCA 

6 5 TTGTCGTTTTAACCCTGCTAATTTGTCACAAGTACAAAAA^ 

TCCTCAGATAATGAGTACTTCTACGTTGATTTCAGAGAATATGA^ 

TGGGAAGGTACTAGGATCAGGTGCTTTTGGAAAAGTGATGAACGCAACAGCITA 

TTGCCGTCAAAATGCTGAAAGAAAAAGCAGACAGCTCTGAAAGAGAGGCACT 

AG CCACGAGAAT ATTG TGAAC CTGCTGGGGG CGTGCACACTGT CAGGAC CAATTT ACTTG A 'l'T lT'l GAATACTGTTG CTATGG TGA 

7 0 TCTTCTCAACTATCTAAGAAGTAAAAGAGAAAAATTTCAC^ 

CCACTTTCCAATCACATCCAAATTCCAGCATGCCrGGTTCAAGAGA^ 

CATGGG AATT CATTT CACTCTG AAGATGAAATTGAATATGAAAACCAAAAAAGG CTGGAAGAAGAGGAGG ACTTG AATGTG CTTAC 
ATTTGAAGATCTTCTTTGCTTTGCATATCAAGTTGCCAAAGG 

CC<?CCAGGAACGTGCTTGTCACC(ZACGGGAAAGTGGTGAAGATATGTGACTTTGGAT^ 
7 5 T ATGTTGTCAGGGG CAATG CCCGTCTG CCTGTAAAATGGATGG C CCCCGAAAG CCf GX I^GAAGG CAT CTACACCATT AAC ACTCA 
TGTCTGGT CAT ATGGAAT ATT ACTG TGGGAAATCTTCT CACTTGGTGTGAATCCTT AC C CTGG CATTC CGGTTGATG CTAACTT CT 
ACAAACTGATTCZAAAATGGATTTAAAATt^ATCAGCCATTTTAT 
TTTGACTCAAGGAAACGGCCATCCTTCCCTAATTTGACTr 
GAATGTGGATGGCCGTGTTTCC^AATGTCCTCACACCT^ 
5 CTCCGCAGGCTCAGGTCGAAGATTCGTAGAGGAACAATTTAGTT^ 
TTACCAAAACAAGATTAATTTCATCACTAAAAGAAAA^ 
GTTTACTCTTGTTTTO^GGGACTTTTGTAAAATCA^ 
CATTGATCTCCATCCAAGGCCTrCTCAGGCCGGCTTGAGTC 
AAAAGCATTTTCCTAAGGAGAAGCTAATATGATTTITTAA^ 
10 AT ATTTT ATGGG TGGGAAT AAAATTT CT ACT ACAG 

HUMAN SEQUENCE - CODING 

ATGCCGGCGTTGGCGCGCGACGCGGGCACCGTGCCGCTC 
TCTGCCTGTGATCAAGTGTGTTTTAATCAATCATAAGAACAATGATC 
1 5 AATCCCCGGAAGACCTCGGGTGTGCGTTGAGACCCO^GCTCAGGGACAGTGTACGAAGCTGCCG 
GCTTCCATCACACTGCAAGTGCTGGTCGATGCCCCAGGGAACATTTCCT 
GCCACATTTTGATTTACAAAACAGAGGAGTTGTTTCCATGGTCATT^ 

TT ATTCAG AGTG AAGCTAC CAATT ACACAAT ATTG TTTACAGTGAGT ATAAGAAAT ACC CTGCTTT ACACATTAAG CTT AC 

2 0 ACAGGGGGAAAGCTGTAAAGAAGAAAGTCCAGCTGTTGTTAAAAAG^ 
GGTGCTGTGCCAGAAATGAACTGGGCAGGGAATGCACCAGGCTC 
CAATTATTTCTTAAAGTAGGGGAACCCTTATGGATAAGGTGCAAAGCTGTTCATGTC 
AGAAAACAAAGCACTCGAGGAGGGCAACTACTTTGAGATG 
TTGTATCATCAGTGGCAAGAAACGACACCGGATACTACACr^ 

2 5 GTAGGAAAGGGATTTATAAATGCTACCAATTCAAGTGAAGATTATGAAATTGACCAAT^ 

TAAAGCCTACCCACAAATCAGATGTACGTGGACCTTCTCTCGAAAATCATTTCCT 

GCATATCCAAGTTTTGCAATCATAAGCACCAGCCAGGAGAATATATATTCCATGCAGAAAATGATGATGC 

TTCACGCTGAATATAAGAAGGAAACCTCAAGTGCTCGCAGAAGCATCC^C^ 

ACCATCTTGGACCTGGAAGAAGTGTTCAGACAAGTCTCCCAACT 

3 0 ACAGAAAAGTGTTTGGACAGTGGGTGTCGAGCAGTACTCTAAACATGAGTGAA 

TACAATTCCCITGGCACATCTTGTGAGACGATCCTTTTAAACTCT 
TGCAACAATTGGTGTTTGTCTCCTCTTCATTGTCGTTTT 

GCCAGCTACAGATGGTACAGGTGACCGGCTCCTCAGATAATGAGTACTTCTACGTTGATTTCA.G 
TGGGAGTTTCCAAGAG AAAATTT AGAGTTTGGGAAGGTACTAGGATC^ 

3 5 AATTAGCAAAACAGGAGTCTCAATCCAGGTTGCCGTCAAAATGCTGAA^ 

CAGAACTCAAGATGATGACCCAGCTGGGAAGCCACGAGAATATTGTGAACCTGCTGGGGGCG 
TTGATTrrTGAATACTGTTGCTATGGTGATCTTCT 

TTT CAAG GAACACAATTTC AG TTTTT ACC CCACTTTC CAATCACATC CAAATTC CAG CATG C CTGGTTCAAG AG AAGTTCAG AT AC 
ACCCX^ACTCGGATCAAATCTCAGGGCTTCATGGGAATTCATTTCACTCT 

4 0 GAAGAAGAGGAGGACTTGAATGTGCTTACATTTGAAGATCTTCTTTG 

ATTTAAGTCGTGTGTTCACACLAGACCTGGCCX3CCAGGAA 

TGGCTCGAGATATCATGAGTGATTCCAACTATGTTGTCAGGGGCAATGCCCGTCT 
TTTGAAGGCATCTACACCATTAAGAGTGATGTCTGGTCATATGGAATATTA 
CCCTGGCA.TTCCGGTTGATGCTAACTTCTACAAACTGATTCAAA^ 
4 5 T ATACATTATAATG CAATC CTG CTGGG CTTTTG ACTCAAGG AAACGG C CAT C CTTC C CT AATTTG ACTT CGTTTTT AGG ATG TC AG 

CTGGCAGATGCAGAAG AAGCG ATGTATCAGAATGTGG ATGGCCGTGTTTC<3G AA CGAC CTTT 

CAGCAG AG AG ATGGATTTGGG GCTACTCTCTCCG CAGG CT CAGGT CG AAGATT CG T AG 
Table 36 

MOUSE NOMENCLATURE 
ICSGNM Dntt 
Celera mCG2378 

HUMAN NOMENCLATURE 
HGNC DNTT 
Celera hCG24648 

MOUSE SEQUENCE - GENOMIC 

GATGCTCAGCCCTCCAGGGCAGGAGAGGGCTATACTCTAAGTGGAGAGATAAGCT 

AAGACTGGTCAGGAGGAATCACGGTAGAGTATATCAACACAGG(2AGTTOT 
CAACAACCTCTCCCAACCATTGATTGGGAATTGGTGTGGGTTTCCCCT 
1 5 GAGCCATCAGCCTCCTGTCTGAGGACCTGGGGAGGCTTTGTCAG 

GCTGGAGAGAAGAGAGAGAGGGTCTGAGGCTAAGGGGCTGGACGTGATC^ 

CCGTTTTACCCCCACTT AAGACAGCGTAAACTCCCAGAGCT ACA CAGGAAA 
GAAAACCAAAACCATTTAAAAGATCTCTTCAAGCCACGCAGGTCA 
TGTTGTTGTTTTGTTTTGTTTTGTTTTTTTGA^ 
2 0 GTTGGTCTCGAACrCGAAATCTGCCTGCCTCTACCTCCTAAGTG 
CTTTTTTTCTTTCTTTCTTTTTTTT^ 

AGTCACTTTTGAAATTGCTTCTTTCTGACTTGACTCCTTAGCC 

CAGAC CCCTGCTT AAATTTT AC CTT CT ACTT AATTG GACTGAAAACAAGT AAAG CCTTCTCAGAG CAACTTGGTT AAAATCGGGTC 

2 5 TTTATCTAATAAATGGTAAGCTCTGAGAGGGCAX3GGACCAGCTGTTTGAGCCTGGAAG 

ACTGTAGGTTGAACTCATGGCrGGACTCTCTGCTTCCATTAGCTCC^ 

TCTGAGTCTCCATCCTAATCCTCTGGGT AGATCCTGTT AAG CCAGCATGCTCACCTCCAGCGATAG CTGAAGGTCAGG ACAAATGT 

TAAAGTGTTTCCTAATACAGACCATCTCTATAGATGTCCACGATTACTAATATCTCAGCGTATT 

ATGTGCTCTGAAAGCTGTGACCAGAACTGAGTGAGATTTTCCAGGTTATGGG 

3 0 GCCACCTGAGGACAGCCACCTGAAGAGCGTGAGTGGCTGAGAGTTCAAGAGCT 

GACCCGGTACTCACGGCTTATCAGCAATCCAGAAGGAGAGGGTATTTAGCGGGTTGCCT 

TGGAAAAGATTTCAGAGATAGTGGGGAGTGGTGGTGAATAACCCTGTCT^ 

TCCCGGAAGTGAAGACAGAGATGCAGACCAGGCATTGGGGCTCCCACTTGTAACTTCCACACT 

GAG CGTTCAAAGTCATCCTG AG ATCAAGGCAGTGCAG CCTGGCTACAGGAGACCCT ACITCAAGAAG AAAAGCAAAGAGCAA 

3 5 GGGAATGGGCITAG AGATACAAATGCAATAGAAG CCAGGAATGGCAGGGCTGGAGGGCTCTGCAGG CT ACCAGGGGCCATGG ATGAG 

AAGCCTTTCT T CCAGCCAGCGACCAACAAGCAGTTCAGCCTGTGGTCTGGGGCTC 
GTCTGGTCATAGTTCACATCTCACAACTATCATCTTCTCATTGGTTCCCTCT 
CAGGAGGTGCCATATTGCTCAGTGGGTCTCCATGCAAACTCATATCAACGGCTCCCA 
TCTACACCTTTCCTGTGGGCAACTGCCTGTCCCTCCTCACACCCAC!AGGGGATGCTCAGGATGTAGC 

4 0 CTGGAGTTTAATAAATATGGTTGGATGAAAACTAGAGGCCACCAGTACCATGATATGTCCATCGACT 

CACITGG T AGAT AACAAAAT AAAG TGG AAAT ACAGAATGTT CTTGT^ 

TCAATGACAAGAAACTTTGAGAAACTCTATAGTTTTCCCCAGTGGGGCAGTTCCAGAA 

CTGTGCAGGCAATAGAGGAAGCTGGCACTCACTCAGTAAAAAAACCAAACC^ 

TC C CAG ATGGGCAGCTTG ACAT AAAG AGG CG CTGTG AGGTGT TTG AT AT AAAAG AAGATAAAACCGCCGGAAACAGACATCTC CT C 

4 5 TC CTAAACTGACTCCCTCAATGTGTG CAAAG AACAACCCGGCTGAGGGTAAAAAT AGTGTTTGTGGATGAAGGGGGA 

CCACCCAGGGAAAGACTGCAGCTGGCCAGCCTCITTCCTGCACAGT 

GAGAGCAGGGTGGAGATACAGATGTGGATCCATATTAAACCCAGAGGAGAAGCAACCGCTCAGGAATG^ 
GTGGCCTTTGTCTCCTCATCCCTGTGACATCATATCCAGGCAGGAGCACCAGTCATGCC^ 

G AGGGCAGGGAG AATGGT GAGTGGTG CC CAAG CAGG C CAG G CAG GCAGAGG AAGTTTTGT CTT AGGAGGCCGTGAAGGTCAGT ACT 

5 0 CTGTTGGAAGCTGAGGAGTCCACCGGTGACAGAAATTGCCCAGGCAGACACACCCAAA 

TCCGGGGTCATGATACTCCTTACTAAGTCAATAGATTGTCTAAGTACT^ 

CAGACTCAACAAACCAAGCAGGATT CTTT ATTG CAGGG CCCC ATG AGGTGTTGG ACATTTT AATACT CGTTG CAT AG AGTTTGTCT 
GATCCCTCTGACCTAACTCATTACAGAATCATCATCATCATCAACTTTATCAGCAGTCAT 
GCTGGCCTCAAACTCATGACCCTGCTGCCTCAATTTCCCCGTTTC^ 
5 5 TGTCTTGTTTGGCTTGGAGCATCTCCCTGAGACCCTGGG 

AGAGTCTACCTTGAGTGGCACTGGCAGAAATATTGGTGAAGTGCAAAGAGACACAGGGATOTGAAGG^ 

AGCCTTTCAGTTAACAGTATTGTCCTGAGAAAGCACTGCATGCCAATATTCCTGTATTTAGT 

AACCACTAGAACCTCACTGTGCATATTAGGGAACTGAGGCACAGGAATGTCTGGCAACAC^ 



6 0 TTGTTCAC^GGGCTCCCACCAGATCAAGTTTGGAAGTATTAGTTGATGTGGTTCAGGG 
AACAACTGAGAACTCTAGCCrGAGCACAG»GTGGGTGTTG 

CCTTTCCTGGCAT CACG CAC ATTCC CCATCATCC CCTTGACATCAG CCAG CAT ATAATCCTGACATGT CAG GAACATG T ATTCT CC 
CACATCCCACATGCTCTCCTGTGGCCCTGCCATAGAGAGGTAGA 

TGTGTATTACTTGGCATTGAGTGTGGGTACT AGGTAATCCTG CTGTG ATCTCTGTCCGACTGCAG CAGTGGCTGAGACGGTCCAGA 

6 5 TG TGG CAGG T ACAAACATGT CAAGCTT CCATTTGGC CTGG ATTT AGAGT AGAG CAGAAC CCT CATG AAGGAAAAA 

AATAGACTTTGATTTGTTTAGCCAACACAAAGGATTTGGAACTAA 

CAATTAGCACCAGAAAATTATGGTATAGAAGGAAGAAATTAACTCCTGCCGCTATCAGGAATG 

GACrTTTGTATAGTGTTTCATAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

CCACCC^CTTGTTTTGAGACAATGTCTCTCACT 

7 0 TCACCCTAGCACTGGATTACAAGTTCATGCTGTCACAACCAGCTTTTAAT^ 

TTGGGTGGCTAACACTTTACCAACTGAGATAGCTCCCCAGACCCATC 

TAATTCCAAAAGTGA CCACCATG CTTCCATGATT CTG AG CAG TTTTC CAC CT CCTTTCATT AAG CCGTG ACTG TGT AGTCCATTTT 
CAGCTGTTAGAGCAAACCATCTGAAACTGGGTAATTTATAA 
ATAGATAAGGTTGCATTCTTCTTCCCATGGTGGGAACAGAAAGGCAAGAGGGG 
75 TTTATGTTTCAACAACCCTGCATCCAACCTGTGACAAAGGCACTAATCCTT^ 
AACAATGCTC CAATGG CAAT AAAGTTC C C AT AT AGG TTTCG AGGGT AAAAACT ATTTT CAAAT CAT AG T ACAAAGG CTTCAGG CT A 
CATGATGCTATTGTGAAGGACCCCGAGAGTCAGAAAACTGGGATTGGAAGCCC 

TT CCT AAACGGTTCT AACTCCAACCCCTG CTTT CCC CACCATCAGGG CTCTGGATCAGTAGAG T ATTG CCTG ACAG TCAGG CAGG A 
C CTCACACAGGATGAAG CAGAG ACATGTCCACC CTTT ATTCAG CTCACTT AAAAT AAT CT CACCAC CTTTGAGTCACCAAG ATG T A 
5 AAC!ACAAGAG AAAAAAATT AAAAGT AATTT ACTGTTGTGG CT AAAAT ACAATT C CTGT CT ATTG T AGG AGTCTGTTAAACAACAGC 
CAAACATAGTGTGTAGTACAAGTTGCTCCAAGAGGTGX3CGTTGGG 
ATGCTTGACTCCATCTCAAAGAAAAATAAATAAGGAAAGGAGAGGA^ 
AAAGCTAAAAGTCCTGCCTAAACCCCCTGAGTGGCAGGATTCTCTGTTCT^ 
TATGAGAATTGGTCACCACGTATTATGAGCTGGT^ 
1 0 CATGACTTACCTGTTACCCrTTATCCAACAGCTGGCTAG 

ATGAGCTGCCTGGTATGTGCGTCTTCCTGAGTCrCTCTTCAGATGG^ 

TCTGCTCTCATTGTATTAAAGATTCAGTTGGGTGTTTC^ 

TTTTTTGTTTGTTTGTTTITT^ 

TGAACTTGGAAATCTGCCTGCCTCTGCCTCCCGAGTGCTGC>GATTAAA^ 
1 5 AATGGGCACTGGAAATCTAGGAATCAGAAAAGGACACrrGAG 

GCAGTAGTTAACACTGGTCAATGTGATACTTGGATAACTTAACAA 
ACCCTCTGAACAAAATAAAAGTTACTATTGACrCTTACTTAAGGTTCTCCT 
TATCTCAGTTCTTGATTTAGATCTGGGGAAAACAGGCTTAAGGAAGTG 
ATGGGAG^TTCGTG CATCCCTGGCTG CATCCrCTCTCTGTGTGCTATT 

TACTATGGGGATGATACCACCCCGGGGCCTAGCTAGAGGACTCC 

T ATCATGT AGCTCTCTG AAACCAAGCATCT AACAGGGGGCC AAAGTGTGTG^ CCACTCATrGGTCTCTAT 

OU^GCAAACATTGCACCCTAGTCTCAGAGAGCTGCACAA 

ACCGTCAAGGTCTCTACCACACAGGGAGCCCACTCTGCACITTTGACA 

2 5 CAGCTCTCTGTCJVITCTTGCATGAAACAGCCTTGTGGATG 

CTCATGCGTTCATACGCTCTTGAAGGCAGTAAAAGTCCCAAATGCTGTGGTGGA^ 
TAAGGAGGCATGACAGAAGGAGGCTGTTTGGTATGTGTATGTGTGTAAATACATGTGTGTGTGTGTAC^ 

Gcn T i-it rnTrrrGTrrrirr 

CCTGGAACnSACTCTATAGACCAGGCTGACCTTGAACTC^ 

3 0 TTGTACTACCAGTCTGGTTAAGAGCATAC^TTTTTAATTTTGATTTTTTAGTTTATC 

GGGTGAGCCTCACCACCCATTGAATGTTCTCTCAGACAGGCCCTGACAC^ 
GCTAGTAACCTTTGTTAGACCaVGTTCTACCCACAATAGTTTACATAAAAATa 
TTTGAAGCACAAATGTAAAO^TCTGGTTTCAGCACCCT 
CACTGGCAAGGTCACATTCCTGGAATTACTCCTTAGACTGTAG^ 

3 5 TAAACCCTTTCTTCCTCTCTGACTCCACAGCCTTCTCT 

GCTGGTCCTCCCATAGCACTITATCTGTGATCCACTGTGGTTAACTGACATTTTG^ 
ATCAGCCCCCCCCCCCCCGTATTATATGGGTATCTAGCTGGCAGTATGGTC^ 
TGCCTAGGTCCTGTAGTAGGTGTTCCAGAAGTGAGACCTACTAAGTAATATT^ 
GTGAAAATGTCCCTAACTCCATGCCCATGACrrcrTCTCCAAAACTCATCrCG 

4 0 TCTCTGGAAACTTAGTGCACTGAAAAAATAGAAACCAAAAAGAAACT 

GATCTCATTTGCCCCATACTCTGTGATCTGCAGTGTCTGAACC^ 
CTGTGCCCACTGCAAAACGATAGTAACTAAAGCAGCAGAGTAGACAG 
GTGACCATGGGGCCAGGAGAGATAGCTCAGTGATTAGGAGCACTAACTGCTCTTCCAGC^ 
TGTGGCAGCTTACAACTGTCrTGCAATTCCAGTTCCAGGGAATCT^ 

4 5 AAAAAAAAAAAGCAACCAGCAGAGTGAGAAATTCAGGTGGCATTGACGTGAA^ 

TTACAGACATCCIAGTGTACCATAACGTAGGTCCGAGAGAAAGAATAGAAGTGAGCTAGAGTAAAGC^ 
AGCAGGAGCAGTTAGATGACTAAGTGGTGATGATAGAAGCTTC 

CGGTGGAGGGGGAAAGATGC CTCTG ACTGCCACACTGATGTCATGG CATGCACACAC ATGCCCCCTTGCGCAGTAAGCAAATGTAA 
TACATTTTTTAAAAAATGGCTTTTAAAATGCAACCTAAAGGGTTTGAACATGTCT 
50 AGGT CC C CCITTCTTCAT AAGTTT ATTCTG AGGT ACTGCCGAG AT TT CACTGTT CTTTCTGT CAAG 

AGCTCCATTTTGGGAGAAAGGACCTGATCAGAGTGAGACATTCCrre 
TGGTGAGAGGACATCAGAGCCCTCATTCTCGAGACACCACCTGATXK3CACAGA 
GCAAGCAGTCC^CCTGGGTCCTCGGAAGAAAAGACCCAGGCAGCTGGGCA 
GAGACTTGGTCCTCTTC^TTTrGGAAAAGAAGATGGGAACAACTC^ 

5 5 AGGGTGGAAAATGAGCTCAGGTAGGACAGCCCTGGTCACITTGCAAGCATG 

GGCCAGTTGGCrAAGGGAGATGAGTGGGGACTAAATGAGATTATATAGAAACTAATTGGTACCAA 
AATATGGCTTTCCATGACTGACTCTGAGATTAAGTATCCAGGAGTGATA^ 
CGATTCCATTCTGTACCAAACTGAACCAACACCATTGGGTTCAATTTTTTAC 
GTCTGGCTCTGACTTATACACACCTGCTCTGCTAAGGGTCTATGGTGGGCTAGGCT 

6 0 CTTGGTGATGGTTTTCCAAAGGGTG<3TCCAAGTGGGCTTCAGGAA 

CACCCCAGTGTGCTATGATCAGAGTCTAGGCAGGGAGGTGATTGAAGCTACCATCTGCA 
GTTTTGCTGACTATTCTGTACGTTCACTCTCTTGACCCAATGAAAAG 
TGTGCCTGACGGCTGCAGACAGAAGTGTTGGGGTTACAGTGGGAGTGGGGGGGGCACT 
GCGAGGGGTGCTGAACATACCGTGTCAGAGATGATCTGACITT^ 

6 5 GCCAAACGTTTCTTGCTGATGAAATGCTGGCTTTGGAGACATTGAGGAGAC^ 

CCACATGCCCAGCTGAGGGGATCTTTTTAGATT^ 
TTCTGAGTCITrGTCACATGGTCrCCTA T ' l lTTTGTTTTAA^ 
ACAATCCATTATTTCAACTCGTTTTTGAGTATAACTAATCrc 
ATAGGTAAAGGGGGAGGGATAAAACAGAAAAAACATCTTO^ 

7 0 GATGCTTTTCATGGTAAAGGAACTGAAC7U3GGACCA 

TCTGATAAAAGGTCCATAGArCTGTTGGAGGCCCATGGACTGAGTC^ 

TGATGATCACGAAG CAAGT ATCrTTGCTGGAGAGGTGAATCAG AGTGGTGCCCCAGATGCCAG CAGCCAGAGCT ATGCCTGCTGCT 
GGGGAGACAGGAAATGGGTGAGOVAGCXXjCTGCTGTGCTCAG 



TGTGCCCAGATAAAGAGTCATTAATTACTGTAGGGA^^ 
NNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNIWNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNIOJNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNKNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNKNNNNNNNNNNNN^ 

AGAGGTGATTTCTGACCTG CACATGTGCACCATG T CAT ATG CT AATGTGCAT ATACAAT CATTGT AATT ACTGCT AGTG ATG ATTA 
1 0 ATTAAAAGGAAAAATGTC^TGGTACTGATTAACAGTAATGGCTACACTT CTTTCTTAACriTGTAAAATTOUUGGC^AAGAGAAT 

GCTAAGCTCTGGAACAGCCTAATGTCCACATCTCCTGTATACTT^ 

GATCCTCTATGTACTGAGTTCTCTGGATTGAGTCTGTTCTCTC 

TGTGCCAATAGAAAGCTGGTTCCTAAAACATGAW 

ATTGAATTTATTCTGTCACCAGCATCCTAGACTTTAGAGCAGGTTGCT 
1 5 AAAATCTCCTGTTTTCCATGCCCTGATTGCATCCTGAGTTCCCATAGAGTGAGAT 

CTCTATAAATACTTCTGCTACCTTCCCCTGTGTCTTGAAGCAAATCAC^ 

GCGTTCCATCTTTCTAACCATCCAGGCGCTTCATGCTAGCTTGCTA^ 

TGTCCTGCAATGGAAGATtSCTGTCTTAGCTGAAAGCATCACAAAAGC^ 

GAAAAACAGAAGAAGACATGAGCCTTGCAAGCAAAGAGCTTGCT 
2 0 TCTTAATTACTTTTCTTTCTTTTAAAAAAAT 

GTATATATGCATGTATGTATGTCCATATGTATGTATGTGTC^ 

AGCAGAAGAGGGTGTTGGATGGTCTGGAACCGGGTTTACAGATGGTO 

GATCCTCTGCAAGAGCAGGATGTTCTCrTAACCACGGAACCAACTCT 

CTGGCAAAAGCAAATTAAGGAAGACAGGGATTAGAGCTCACAGTTTAC^ 

2 5 GGGAGAGCTGGTCACCTCACACAGGTGACCAGGTACAGAGAGCAGTAAAT 

GATCAAAAACTAGCTTTAGCCTCCTCAACAACTCATTCCTTTC^ 
TGGCCCCCACCCCCCATCTGTTC^CTtSTTTGACCTGT^ 

CAAAG CCCACGGG<K>ACCTTG TGGT AAT AAAAC CCAC ATTTTCT ATTT CAAAAATT ATG AG CTTCAAGTT AATGCAAAG TTTT AT A 
ATACCATTAAAGGTTGTTAAAGAATCAGGCAAGATCCTGACCAAG<3CTGGAAGAGAAAGCT 

TGGGTTGAGCCAGCTTCTTTAAGCAGCTCAGTTTAAGTATAAATAT 

CCCAGGTGACAAGACCATCCATGAGAGAGATAACACTCACIACTGGAAAGGGTG^ 

AAGGCATCTTTAATTATATGCAGTGAGGCACTGCCAGCCAGG 

AAAAGCCAGGAGTGGTGGTGCACACCTTTAATCCCAGCACTTGGGAGGCAGAGACAGGTGAACT^ 

3 5 TCTACAGAGTGAGTTCCAGGACAGCCAAGGCTATACAGAGAAACCCTGTCT 

GATTTGTG AGTT AAT C CCAT AAT ACAT AGTTTC CGAACACCTGG AG C CCT ACCTG ACAATC CT AG CAAT ATTTTG AACATT ACATG 
AAATTATACTOSTCATATATATTCTACTTAACATATAATAA^ 

AT AG CATG T AAGGAT CAAAG TGACT CTGTCTCTTT AT AG CT CTAT CATT AT AG C CACCTGT CGT AGTCAGGGTTT CT ATTTTCT AC 
AAACATCAGGACCAAGAAGCAAGTTGGGGAGGAAAGGGTTTATTCAGC^^ 

4 0 AGGACTGGAACTCAAGCAGGTCAGGAAGCAGGAGCTGATGCAGAGGCCATGGAGGGATGTTACTTA 

TCTTATAG AACCCAAGAC CAC CAGG C CAGGGAT AGCACCACC CACAAGGATC C CTT C C C CT C ATTG ATC ACT AATTGAT AAAATG C 

CTTACAGCTGGATCTCATGGAGGCATTTCCTCAAGGAAGGCT 

AACCAGCACACACACCTCCTTCCTAC^TCTCCATGATATAAACAAAGGGC^ 

GAACATT ACAACTT ATTT CTTTT AAATGT ATGTTGG TT G CCC AC CTCATATGG CTG ATGTG AAGACT AAGTTTTAT ACACAT CCC C 

4 5 CACACACATATCCCACAGATGGTAGTGTGTCATATTTGCAAGTGGTGTAGCATC^ 

TGTTGTTTGTTGCATCCCAGGAGCTCCTCCATCCCTGAGCATCCCCAGA 
CAGACTTAGAGAAGGAGATGCAGGGCAAAGCCAGGCAGTGAGAATGTGTG<^GGAGG 

TTCTGCACTCATCCTGGAGGGGGGCAGCAAAACCCAG AGAGCCCAG AG CAGTTGGTGGCCTCCCTG CAGATCTTCCGCCTTGGAGT 
GCATCTATAGCATCTATTTTTAC^TTCAGGATAGCTCT^ 

5 0 CAAAAATTATACCCAAGGGGGCTTCCTGGGCAGATATGTTACAGAGAGTTTCTTC^ 

TGCCTTTAAGTATGGCGCCCGTTCAGATAACCCTrCTGTGAGGAGGGTGGAAGGGAGCT 
GAACAAGCACTGAGTCACAGATTGCAAACTCATGGCTACTCAAGCT 

ACCATGCAGTTATTAG CAGGGAAjGGGTTTTAATGTCTCCCAGGG CTATCTCCTCATTGCTATGTGGGT CTGAT AATCCCTGTCTCA 
CAGCATTACTAATAATTAAATGTATGACATGTACATTTAATGTATTTAATATTTAATACTAGTACC 

5 5 ATACTCTTC^TATTTTGAATACTGAAGAAAC^ 

TGCATCGATAAGGAACTGTCAGTGGACAGCAGTGGAGGAAGCAAGTTGCCCT^ 
GAAGTGTTTAGTCCAAGAGAATCATCTGCAGGAGGCCAGTGACAGCCACAAACCAGAAGAA^ 
AGGGAGACAGGATCGAGTAGACTCAGTACGAGAGATGAGAACAGAGATAGGACCAC^ATC^IACGTAGCCTCT 
TAAATTATTTGTAGCrAAGGACCAGTTTAACAAAAAATAAAATCATATTTATCAGTC 

6 0 GTTTGGCAATGTCAAGTCGAGGAGAGTGGGTCTCATTTTGGACACTCCTTTTACTGAT^ 

GTCACCCACATTGTGGCAGAGAACAACTC^GGGTCAGACGTCCT 

CGAACT CTTGGACATCT CCTG G CT CATCX5 AATGCATGGG AG AAGC CAGTGGAG ATGATGGGGAG ACAT CAG CTTGTTG T AA 

GTGTCTCCTTTGCTGGGTGGCCACTC<5CATGGGCATCAGAATCCACAGAC^ 

TAGCTCTGGTGG AC CT CCATTT CATC CTCTTT AT AT AACAAT AATGC C CACCTCCTGAGATG CTGTGGAAATCG AAGT AACATGCA 

6 5 TACATC^TGCACACT<3CTGACAT ATT AAT AGATGGG ACT ATTTTCT 

GTGTGTGTGTCTGTGTGTGTATGTGTGTGCACATGCGTGCAGGTGTCCACAGAGGCCAGAAGA^ 
GTTGTGAGCCACTCAGGTGTTTGGCAGGAGTAGCACTCAGTCTTCACCACTC 
TTATGAACACATTCACACACCGATATTTOWSAATCTCAGCCT 
GATAATTCATGAAAAAAAAAAGAAAOAAAGAATATGACITCITAAATCT 

7 0 GGGTCTGGATCATCTGTAATAAAGGAAACAAAGGAAATAAAATTTAATAGCTGGT^ 

TTATTGACATTTCAAATGTTATCCCCCTTTCCAGTT^ 
ACTCCTGCCTCAGTGCCTTAGCATTATCCCACCCTGGGT^ 
ATAAGGCCATCCTCTGCTCCATTTGCAGCT<X3^^ 
GCITTAGGGG<KKiTTGGTTGGTTGATATT<;CCGTTCTT 
7 5 CCCCATTGGGGTCCCCCTGCTCAGTCCGTTGTTTGGCTG 
GGACAGTTTTAACATG CTTCTG T CAGCAAG C ACTT CTTGACAT CAGCAAT AATG T CTG AG TTTGGTGTCTGT AGATGGG ATGG ATT 

CCCAGGTGGGGCAGTCTCTGGATGGCCTTTCCTTOVGTCTCATTTTTGTGA 

TCACAATGAAATTCATTATCTGGTGTAATTAATATGTACTAATAA^ 

TATGCAACACATITTTGAGTCCTCTATGATTTTGTCCTCTTTCAGAGAC^ 
5 TGAGTAAATGTTTATAGTTATATGTACACT^ACTATTGGCTAATTAACTGTCATTC 

AATAGAAACTCCTCCCCGAGTCCTGTCCCAGGCTCCCAGAATGTTCCAGCACCTGCTGTGAAGA 

GCGAAGAACCACATTAAACAATTACAACCAACTATTCACGGTAACGAGGTTTT 

TTCTTTCCCTGAGTCGACAAGTTAAATTATGATTTTCTGGAG^ 

GGTTCTTCGTCAAGTGGGCCTCCTTGTTCATAGACTCCCAGCT 
1 0 TGTCAAGCATGGGACTTCTCGTCTCTAACTTTCGAGCTGCATCTC 

G AAGCACAGCTGCGTTCTCTGCT AGG AAGG CAACATGC CTCAC CTT AAAT AACCCCGGCTCTGGGTC TGAGTGATGAACACAGCCT 

TCATCCC^CAAACACCAGCTGTGTTTTCTGACCACACTAAGAGACCTGAAAGTCTCAGGAG 

GGTCCAGAAGATAGGAAAGGGTTATACCCAACCCAGTCATGACTGCTGGGACACCATC 

TTTCTAAATGTCTAATGATTGAACTCAGTTTCCTCATAAATGCATTAGATACATATCTTTTCAG 
15 GGAGGATGCAAAGGTACAGGAGCTTGATCCTGACTCCAAAGGCGCTCCATTAGTAAAGGGGACAGGAGGG 

CACGAGCAG ACCTGCTGACCTGG TTTT ATTGTTTT AGGATG CC CTTGAT ATC CTGG CTGAAAATGATG AG CTTAGAGAGAATG AAG 

GTTCTTGCCTGGCATTCATGCGAGCATCCTCTGTACTGAAATCTCrc 

CCTTG CCT AGGGGACAAGGTGAAG AG TAT CATAG AGGT AAGGG AG AAGTGG AATTTGCT ACATCT C CAAAGAGGTG AACGGGAG AC 
AGAACAGATCGCATAAGTATGGGCTCTCTCCTCTATTTATCT^ 
2 0 ATATAAAATCCATACCCAACACGATCCCAGTAGGAAGACITTCT^ 

TTAGATCTCCAGAGCAGCAGAATATTCTGCTTTCCACCCCAGTCTGGGTGGGAGACTCGGT 
CAGGGTGATGTTCAATGGGCTGATTGTGCATCTCTCTGTTCTCAGGAGAGACTG<K3CAGTAG 
ACTTCCCAACGAGGCACC^TGTCATCTAGTAGCAAGTTTCAATCT 
CTCCAGACAACAGAGACCAGACCTCATAAGGACTTGATAATGGCCAGCTTTTC^ 

2 5 GCCCTCTTATTTCCACAACTCGAATCAACTGAAAAGAAAATGACCAGGGTGATTTO 

AGAGTCTCTAGGTTCAGATCTTTATCTTTCTGTATCCTCGTCAGAGGTTATGAAGGTCrA^ 
TTCAAGATTACAAGGAAATTCTITGCTTTCTCCAGATCAGCCAGGGAAAAAAC 
TGCGCGCGCTCGACTG<5CCAGGAAGAACGACGCTGCAACAGGATCCTTCTG(^ 
AAAAGACTTCGAGCCCAGAACTGGTGCTGCTTTTATAGGCCTAGGAGGGGCGTGTCTC^ 
3 0 CATTTGCATGTTCCTCATCTGATTGGCTACTCTCTCTCAGTACCTTAC^ 

CTGATTGGTTATACTCTCAGTACCTTACAGAACCTCATTATCATGCCTGGGCCAGGCAGTGTCT 

TGTACACATTGGTTGTTTGTCCAATCTTATGCGTGGTGGCCAGCAGTAGTCAGTC 

AGTTGGGGCTGGGCTGTCGGTTTGAGACTGGCACTTTGATTATGCAGAGACATAACTCAAAA 

GG AATT ATTGAAGATGG AG AAAGTT CTG AAGCT AAAG CTGTGTT AAATGATGAG CGAT AT AAAT C CTTCAAAGT AAG CG ACTGT AC 

3 5 ATTTGG AT ACTGGGG CGTGTTTTGTTTT ATTT ATCCTGTTG TTTGCTG TCG ATGGTT AT ATAACT CAGGCTG CT AT ACCAAATG CC 

ACAGAGTGAACTGATTAGTCATAGAAATTTATTTTCTCTCAGTTCT 

TGTGGAGGACTTTGTGGTTTTCAGTGGCCAGCCACCCTCTTGCTGCTGCCCTGACATAGCAGAGA 

TCTCAGCTAGCTTTCTTTATTCTTACATTATTGTGTGAAGCACGACT 

ACATTAATTAACAGACATGCCCACAC^JCCAAGGTAGTATGGATAATTCCT 

4 0 TTAAAAATTAAACTAACCAGCACATCCCACAAAGACACTACATGCCAAGAOT 

GTCACTCATGGTTACAAATGTCAAAGGGACACAGGATTCAAATAGAGCAAGT^ 

ctcattgcaccrcaatttctgggacttagggttgaccaccct 

actatggccagtccattc^ttcataagtgatacagggcctaacatacggccagaacaggagctagg 
agggttgaagtttgtactttgctgttatccacttgtagatggggtgaatttgct 

4 5 ctccttcgaccaaac^tgtggttcaggacttcatgtaaaa 

gatccttcagaaacaaaaagcctctcaacgttaccatccccaggctcaagagagaactcccta^ 
gtcccagaaccaacaaaaaagctgggtgtggtagcgcacct^ 

aagg ctcactggccaatcgtcccagcctg aggttgacitctggtttc caaatgcacagttatgcatgtctgtg caagt acatacat 

5 0 gcg<:gcgcacacacacacacacacacac^cacacacacacacacaca 

aactgggccaaaggtttacrratacttcttgttcctggcct 

tggataacagcctacctttccctccctttcagctctttacctctgtgtttggtgtgggactgaa 
atgggtttcagaactctcagcaaaatacaatcagacaaaagcctgaggtttacacaa^ 

AGTOVCGGGTCATTCrGGGCTTGTCAGGGTCCAGTGACATGGAAGCCAGT^ 

5 5 GGCTCCAGCACTTGTCACTGAGTGATCTTGAGTAAGCTAGTTAAAACCTTG 

TT ACAGCAG AGTGGGCACTGTC C CG ATTCCTACAG ACACACCTT AGGTCTGGCTGTCTCCCTG 

CGAGGACAGAAGGGTAAGCTGACGTGCAAGAAGGACAGAGGGGAGTACTGAACAAAGTACACTCTCGACC^ 

GAATGTGCGCTGGGGGCTGGAAAACGGAAGAGAAAGGGGAGAAAGTCTAGCCACAGCCTTGATCTG 

6 0 TAGGAAGAGACAAGAGTCCAGAGCAGAGACGGAAGAGCTCTCATTATCATCTGAACGTCTGGCAAG 

CCAGTTCACTGCCTTCCATGGACAGTGCTGGTACCTTTTCCAGA 
ATTTTTTAAAGAATTATTTATTTATGTATATGAGTACA.CACTGTAGCT 
TTGAATTTAGTACCTCTGCTCCCTCAGGTCATCCGGGTGGTCCACCATATGGTTGCT 
CAGCCAGTCCTCTTACCCACTGAGCCACCTCACCAGGCCAACATGTGAAATTTTTAAATGTT^ 

6 5 GAGTAAAATTGTCCTTCAAGAGGGAGTCACTGCTATAGTTCC 

TGGAAGTGATAGAGTCTTCTACCAGAGGATTCTGGGTATGCGCAATTATTCA 

CAGACAGACAGAATG AAC AAAAATGATAT CAT AAG AAGTGGT AGATG AGGTGAAT CATAT CTTGG GGGGGT C CCATCAATGACACT 
AT AG AAAG CC C CT ATTAATCTT AT AAG C C CACT AAGTT CTTT AT AAAAAC ATTTTT ATTT ATTTTTAGTTTATGT ATATG GGTGTT 
GTCTGTATGTTTGTGTATGTACCACATTTGTCCAGTGTCCAAGGAC^^ 

7 0 GATGGTTGTGTGTTGCTGTTGGTGCTGGGAATCAAACCTC^GCCCT 

TCAAGTTTGATGATTTCTGAGTGACTATTAATAAGCTTTATATAAT^ 

ATATTTCTACTCTGGATGCCAGAAAGTGTAGTCrAGTAAGGGTGTTrTAC^^ 

TGACTATCAAACC~TAAG<iGCATGuAG 

TATTTTCTCAGAAAGCCTTGAACTTGTGATGTCCTGTTTCAGCATCT 
7 5 CACAGAGTCTTAAAAAGTCATAACCTCACATTCTTGCACATCAT 
AACTTAAGTGCCAGTGATGATGGCAATGTCTATATATATTAGCACCTAAAAG^ 
ACACACACACACACACACACACACACACACTCACTCACAGCAGCCATGTCCCAATCAAAAOT 
AAGTTTATGT03TGTCAGTTATCGATATGAAGATGGCAGATCCCCCAAGGAACT 
CATCACCATCTTTTCCTGGTCCTGAAAACACAGTTCTGCAACTGT 
5 TTACAGTGTGCTCAAGCCAAATGACTTTC1X5GTCGTGTCTCATCCTGG 
TACATCCCCCTCCTGTTTATCGAGTGGCAT^ 
CTCTGACTCTCCCAGACATGCCCAGCTTCCCACCAGTACTTTAC^ 

AGGAAGCAGCACGTCAGCCCCTCGAAGGCACTTTACACTTt^ 
1 0 ATGAAGCCATCACTGCCAAGTCCTGAATTTTCCTACCCC C1T T T AAATAGTATGG 
GTGCTACATACATGGAATCTCAGCACTTGGGAGGTA 

CAGCAGAAAG AGG AGGAGGAGGGGGAAGAGGG GAGG AGGAAG AGG AGGAGGAGGAGG AGG CAGTGTTGG ATTTTCATT AT AAT AT C 

AGATAGGATCAATTTAGATTCCAGAAGGATCTGAATACTTCCCCCTT^ 

CTCCTTTGAAACCATGAAAAAAACCCAAGCAAGTGGGCAAGGCAGTG 
15 CTAAGAGCACTGTATGAGACAGCA.TCCTCTCCTATCAACAGCTGGTCCT 

TCCTCTACTACGAAGACCTCGT^AGCTGTGTGAACAGGCCAGAA 

TTTCTTCCAGATGCCTTGGTCACCATGACTGGGGGGTTCCGCAGGTAAGTGATC^ 

CTGAAATGGTGCAAAACGCCTTTTCTGACAACAGAACCAACCTCACT 

GTCTCAGATAGCTTGACZACCCCGCACACAGAGCTGTTTCTTACCTCTG 
2 0 AATTTGGTACTAATCTGACTTCACAATTCCTTCAGACTCAA 

GGAGACCAGGGTGTGAGCCTGACCACTAAGTAGATGCTCAAGAAACTIGGTTG 

ATGAAATCTTCAGTTGGATACAAATGCCTCCATTTCAGAAAGAGGCTTG 

TTGCATCAAAGAGGTTTTCTGGGCTGGGTCTGTCACTT^ 

2 5 ATCCAGTAAATCAGGTATGGTGATGTAGACCTATAATCCCAGCACTTGGGAAGTACAGGC^ 

CTCAGCTTCAAAACGTGCTCAAAGCCAGTCTGCATCATATAAGACT 

AGAGCAATCTCTTAAGCCATACCATAGTTCCAAGAAGGCTACCTTGTATGGTGT^ 

TACATGTTTCTCTATGTATGTGATG^TATAGAAGTAATATTTATTACTTG 

G AAGTr r rrr'l'i'TTTTCTGT CACAAATTTGTTTTG CAAGAAG TGAGTTGAACAACCAGG AAT AG CTGGTTTTAAAATGTTGTG AGT 

3 0 ATTTTGAACTACAAACCTCAGACTTGAGACACAATAACAA^ 

CAACTGATTTTCCTTTTGTGCTTTTTTCCCCCCAAACTAGGGGTAAGATGACT 
GAAGCCACAGAGGATGAAGAGCAGCAGCTCTTGCATAAAGTGA 
CCATTTAAGGCTGGCTTAGATTGATTTCTGCTTGTTTCTTTTCCGTGTGTGTG 
ATGTGTGTGTGAGTGTGTGTCTGTGGTATGTATGTTATGTTTGATATGTATGTGTGTC 

3 5 ATGTGTGTGTGATGTGTTTGTTATGCATCGTGTGrrATGTGACACGCACATGTGTGCATA 

GCGGACATGTGGAGCCTGAGTTTGATGTTGAAAATCATCTTCCGCTACTTCTCCACCTTA 
AATCCAGAGCTCATCGATCTGGCTAGTCCTACTAGCCAGTTTGTTCTGGGAATCC 
AGGCAGGCTGTCATGCGTACCAAGCATTTGTATGGGTTCTGGGGATCTGAACCTTGGCCCCCTGTC 
CCACCGAGCCATCTCCCTAGCCCCTTGCTAATCTTAGGTGTTACATGATCAGTCTACAGTGACTACT 

4 0 ACCTCATCGCAGCACCATGTGGCTGGCATGAGA 

GAAGGAAGATAAGGAGGGCAAGCTGGGCCCTGCAGAGGTCATGGCCAATGGCTCA 
GCCAGTGGTTAATGGTCAAAGCTGAGTTAAGTTAACCCCACAGCCAAAAACCT 

CTTCCTGGGCTGAGCTCACTGATGCACCAGAGAGATAAGACC 

4 5 TTAAGATCACTGACTGCTCTACCAGAGGTCCTGAGTTCAATT^ 

GATGCCCTCTTACTGGTGTATCTGACAGCTATAGTGTACTCACATAAAATAAGTAAATAAATCTTTGGGC^ 
TTTAATCCCAGCACTTGGGAGGCAGAGGCAGGAGAATTrCTGAGTTCGAGGCCAGCCTGCT 
CTAGCTATACAGAGAAACCCTGTCTTGAAAAAACAAAAACAAAAAAAAACAA 
TGAGATTTATAAATGAAAGCTAGATTTCAGATGCAGATCCAAATTCATCTGTGAGACCATT 

5 0 TG AGAAG TCAGGAGAAGGTG CACTGGAGAAC CCCTTCAGTGAT AG CAAGAT AGGCATGACAAGAGT CAG AATACTTTTCCAATTCT 

CGATTCTCTGCTCCAGGGGTTGGGAGTGGGGGACGTTCGGGGGGTTGGGTAGGGGAGAATA 
GACCAAAGAATACCTCCCCACCCCCAGGCCTGTTGCCA 

AGACATCTGCTGGAGTTGAGAGGTGACAGAGGAGCCCTCTTTGCATCGCTAGT^ 
GATAGTCATTAGAAGCCCAGGCCCTGCATTCGGAGCCAATAGTAAAACAGGCCT 

5 5 GGGTGGTGCTTAACAGTTCATGGCCCATTTTCTGTCTGTGCTAACCCGGGATTCAATGAAAGTCT 

CTGGTGTCTGACAAATGGTGGCTAAGATCAGCAGCAGCAGGATTCCT 
GGACCCAGGAACCAGCAGGCCAGTTCCTTCCCTTCCCTC^ 

AG CCCTGGACCCATCCCCAAT AACTGAATTGTGAAATCAGATGAGTT CTTCCCATCT 
TGCCAGCCTAACCCTGAGCTTCACTTTACAGATGGCCGTGAGCACA 

6 0 CAGCATGCCCACAAGATGCTTAAATGTCTGCGAAGTAGGTTGAACAGCCTTTAGTGCTT^ 

GACAACCCCTGGGTCAGAGGTAGGAGAAAGGAGGTGAAGAGCTTAGAAATCTCATCTCCAGGGA 
TGTTCTGCCTGAATTCCAAGCAGACTGCTGGTAGCTCAC 

TGGTAAATGGGACCGGGGTACAGGGACCCTAAGTGCTTCAGGTGGTCTGTCACCCCTGGAGT^ 
TGTTGAGTCTACAGAGTAAGACTGTAAGTCAGAACCAACCATGGTGGCTCACACT 

6 5 TGAGTTCCATTACAGGATACCTAGAACCAAAAGGAGTTTAAACTGGCCATGA 

CAGAT AGG AAAAGAT ACTTT AAGAAGAAAG AAAGAT AGGAAAAAAGAAAGG AAGGA^ 

AAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGGAAGAAAGACCCACACCTATACTACCT 

CCATAGGCACCTCCTATCGTCAGTCTCTAAATACTACACTGTGACAACACTGC^ 

GAGTGATTGAAAGTCTGTGGGGATGIGTGTGGCTCATATATAAACATTCTTACCACTT^ 

7 0 ACCGTGATGACTGCTAGACGACTGAGGGATAACTATGGGACAACTGAGGGACAAC 

ATTGAGGGATGATTGAAGGATGACTGAGGGATGACTGAGGGATGACTGAGGGATGACTGAGGGATGAT^ 
TGACTGAGGGATGACTGAGGGATGACTGAGGGATGACTGA« 

G ATGG CTGTGGGATGACTGAGGGATGATTG AGGG ATG ACTGAGGG ATGACT GAGGXIATGACTGTGGGATG ACTGAGGGATG ATTG A 
AGaATaACTGAGGGATGACTGAGGGATGACTCT 
7 5 GGGATGACTGAGGGATCACTGTGGGATGACAGAGGGATGATTGAGGGATGATT 
GAGGGATC^TTAGGOfcATATTGAGGGATGAT^ 

TGAAGGATTAtrTCAGGGATTACTAATGGATGACTATGGAATCACTGAGGGATGACTC 
GACTAAAGTTGATGACTGAGAGATCACTGAGGAATGACAGAGGG 

AGGATCACTGAGGGATGACTGACTGAAGAATAATTAAGGAATATTGAGAGATGATTAAGG 
5 ACTGAGGGATCACTGAGGGATCAATCAGGGATCA^ 
TGACTCAGAGATGACTTCCCCCACITGAGTTTGAA 

TCAAGAATCCTGGCTGGAGGACTCCTCATAGAGTCCTGACAACACTTCCTG 

TTGGTCCACTAAGTTGGAAGAGGTAGATCCTCTTTCTTGTAAGAAGCAAAGCTGAACTGCCCTAACC 
CCCCATACCCTGGGAGCAGAGAGAAGGAATGGCAGGGAGAGCTCCTTCCTAGCCrrAG 
1 0 TGGGCAGCCATGGCCGAGCCTGAGCCCAACCTCAGAGCAGTTCCAGAGGAGATAAATCACACTGAGG^ 
AAACTGCAAACAGTAGGGGAAACCCAACAGAAAGCTGGCCTGGAAAGCTAGGAC 
ACAGCACCCAGAATCAGATGGAACGCCCCAGTCAAAGCCTCGCCAAGTTCAGGATCAATTC^ 
AGCCAAGTGCTACCCCCCACCCCCACCCCGTGTGCAGCACAGGAACCCCTAGGGGCTGGTGCTTGC 

TT ATT AACTTT CTGG AT AGAAACTTG AG AT AAAG CAGG CTTCGTGGG CCTT CTTT CA CTGCCTC CGAG ATGG AGAGAG AGTTCTTT 
1 5 CAGGGCTGGGGTGTACTCAGTGGTAGAGTGCATAGCTAGCAGGTTCAACCCTG 
CAGGTCACCATGCCCCAC^CTGGGTCTGGAATTTCAGACACAACCTI^ 

GAATTTCTGTTCACCGTGTCT'ri'GC'rTT T CAAGGAGCAAAGTTGT CTCTCAG AAGCAAACAAGGAGAG ATGGAGGCTCTGAGGTTT 

A^CATCCCCATTTGTTCAGCTCTGAAGAGGCTCTCT^ 
2 0 ACCCrTTCTAAACCCTCCCATTGCTTTTCAGGGGTTGCTTTTGTACTGO 

CCAGCAGGAAGGTGGATGCTCTCGACCATTTCCAGAAATGCTTCCTGATTCTGAAGCTG 
AGCGGCCAGCAGGAAGGAAAGGGCTGGAAGGCCATCCGTGTAGATCTGGTCATGTGCCCCTATGATCGCC^ 
CGGATGGACCGGCTCCAGAGTAAGAGCTGTCTGCTACTAGCTTTCCTGGCTCAGGGGAAGAAAG 
TGAGAGAACTAACXXIACCATGCCCCCAAGCCTCTAAACTGGGG 

2 5 TCCCAATGTATGGTTCACCAAGCCAG CAG CAGTGGCCTT ACCTGGGG ACAGGTTAGATTTATAAATTCTTCACTGTATACCTACCA 

CTACATCAGAGGCTCTAGGCAGTTAAGCTCTGACAGGGATGTTTGTAAAACACCCTAGGTAA 

TACTGAAACCTAAGAAGTGTCAGGGTAGTGAGTAACTCAGATGGATTCGGGTCCTAGACCTACAGAAGGACCCAACCCTGCCTCT^ 
GGACACCTT CCTAAAGCCTATAAGAT AGATGGGAGTGG CGCCCTCTGGGGGCAGCAGTGCACAG CTGCAGGGTGGTCGACTCAAAT 
GCTGCrTCTAAGAAACTAGTGAGGGTTAAAAAAAAAAAAAACAACTTTCTTTTTCAA^ 

3 0 GGTTGCGCCGCTTTTCAGGTAAATAACACTCCTCCAGCCTT 

CTGGGACAGCAGTCCATCCACTCCTCTTAGATGTCATGCCAGGCTCCTGTTAAGCGTCTGTTGCCCT^ 
AAAAGACTC^GATTCCTTAGTTAGGGTTATGATAGCTGTGGCAAAATACCACGA 

ACTGACATTTCTTTATCACTGTTCATCATCAAAGGAAGTCAGGACAGGAACTCAGGTAGGGCAAGGACCTG 
GCAGAGGCCTTGGAGGAGTGTGGCTTACTGGCTTGCTCTCCGTGGCTTGCTCAGCCTGCTT^ 

3 5 AGGTACCATCCACAATAGACTGGGCCTTCTCTCATTAATCACTAATTAAGAAAAGATCCTACAGATGGATCT^ 

CTCAATTTAGGTTTTCCCCTTTCAGATAACTCAAGCATGTGTCAAG 
ATTCTGCAGCCATTTTGGTTTTTAATCACCTCCTTCGAGCCCTCGT^ 
CAGCACTGGCTTCCTATGGGCCCATAGTTTTCTTATTTCTTGTTATTATAAAC 
GAACTGGCACTCTGCAGGTATTTGGCAGGTGAACTGAATTTCTCTCTGCACCAC^ 

4 0 GGAATGTTT CT CACTCCCAAAT CACACAGGG C AGGG AAC CGGG AGT AG CCC ATCATTG A CTG CAT AT AACCAAC C CCCATT AC AC C 

TCCAGCCTCTCCAGTTGCTGGGAACCAGTGTT AG CTGTGGG AAGACACTGC C AGTC AAGAGCTGGGGTG AGCTAAGCAGGACCCAC 
AACTTCCCAGACCAGCGCTAAGCAAATCTGCTTCCTCAGCAATTG 
GCTTCTGGAGGCAATGTCTCACCCAGTCTCAGAAACTTTCCAGCAGAGAGGAAA 
CAATGTTTTGCCTAATCCCAGGGGGAAAGAGAAAGTTCTGGACCTGGGAAGT^ 

4 5 TTCTGATTCACTGTCCTTGTTTCTAGTGTTTGCTGGTCACTTGTCCT 

TTGATCATGATAGATGTCATTCCGGGGCAGGTGACTCTTGAC^GAATCTCAGATTGAGTC 

GTTCACTCCCTCCACCTTGGGACAAAGGAAGGTTCAGAGAAGCT^ 

ATCTCCCCACTCCTCACCCTCrATTATAATAATG 

ATTCTAGAAACAACGCTCCTACTGTCCATTTATCCACTCAACAGATATTCACACATCACCT 

5 0 AGGGTCCAAGCCAATCAGCTTCTTGTTCACATTGAGTTTCT 

CCTGAGGAAGAAAGTCACCCCAAGAATTTATGTTAGAGACAGCAGTTTCAAAC^ 

TACCAGAAATAATCT^GACTGG AGGGT AATTTGT1TT AATTCT CAGTTTG AGAGAGACTTG CGGCGCTATGCCACACA 

CGAG AGG AAGATG ATG CTGG ACAAC CACG CCCTGT ATG ACAGG ACT AAGGT AGTGTT C CCTT CTGGGG ATGG GTG ACTG ACCTTT A 
CCTGTAAGCCCTCCCTTTGAAC!AGATTTTCTAAAACAATATTCATTACACTTATTTATTTT 

5 5 TGTAGAGGTCAGGGGAGCCAGTTCTTTCCTATCAC^TAGGTTCTAC^GATGGAACTCCGG 

CACCTG CTGGTCCCACTT ATC C C ATC C CT ACC C C CAAC C CAAACTG ATTTT AGTGT CT AGGTTTGC rl CTi'n TGCTG CAAATGTT 
CTGTTCTCTGTGCATCACAGTTGAAAGGAGCrrGGAAACATTGCTTCTA 
TCAGAGAGTGAAGTAGCAGGGCTCTAGAACCAGCACATAGGTGAGCAGAGCT 
ACTCCAAAGAAAATCAGGGATTCCTGTAAATCATAAATGTCACAGGAATAACCATCCAGTG 

6 0 TCATCAATGGGTGTCTCCCACAGTAGAGCGACACAGGTGCCCAGAGGTGGATGGGAAGCCATCAA 

GGACTTGAGTGTCAGCAGCATCTCATTAAAGCCACGCCTCAGAAAAGTGGCAAGTTACCATGTGGCTGGCTGATG 

ATTCTATCGTGAAATCCATGAACAAAAAGGrrCTCCTGTTCTTCAAGTGGT 

TATAAAATCACCAAAAGCAAGAACTGATCTCCrTGCCrCCTAGGGGTTTATTAT^ 

TAACACTTAACCTAGAACTTTAAATAGACTGACAGCAAGGCCTCrrCTGGGACTTTTGACTTCAT 
6 5 TTCAATATATTTTATTGCGGGGGGGGGGTGGGGAGGGAGACAGAGACAGAAAC^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNOTINNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNITONNNNNNNNNNIW^^ 

NNNNMNNNNNNNNNNNNNNNNNNN^^ 

NNHNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNCCCAGGAAATTGTAAGTGGCTCAGTGAG 
75 G CAT CCACTG7TG ACTT CT ACC CT CCATATCT AACACG CTCATG CAC AT ACC CCCACAC AT ATG CGT AC ACATG CAT AAACATG 
CACTCACATACACACACAAAGGAGAATAGAGACTCAGGGAAAAAATATCC^
TTCACCTTTGGATGGGAAAGTGTCAAAACTACAGAAAGCCTTGGTCA 
NNNNNNNIWNNNNNNNNNNNNNNNNNN^ 
NNNNNNIWNNNNNNNNNNNNIWNNNNNN^ 
5 tWNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNCTGGCTGACTGGCTGGCTG^ 

CCAAGTACTGGGATTCCAGGTCACCACACCTTGACTTCTGACCAAAGCAG 
CAGTTCAACAAGTCTGCACTGAGGTGCCAAAGGAAAATGCCTGA 
1 0 CAGAAAGTGAAGAAGAGATCTTTGCCCATCTGGGACTGGACTACATTGAACC^ 
ATTTTTCTTTrCGGTTTAGACAAGCTGTGCTCTGTGTTAG 
GCTGGCAGCAAGTGGATCATAAGTATACTATGCAGAGAGAATATC 
TGGTTCAGATTCTTGGGCCATCrcTCCrrACTGTTGCAiTA 

CATTGCAGCATCATCTAGAGAAGCGCATCAATGCCCACCTGACTCCTGTGTGTCTG 
1 5 AACAGAAGCATGAAGTTTTCTTTGCTTTGTTCAGTTAT^^ 

TGATTCTGAAGGGATCCATGGGTGTAATTAGTACATCAGAGGTTCCCAAGAGGAC 
GGTG CTGATTCCTAGCTGTATC CACrCTGCTCTGAGCTGGTTCCiTTCGCACCCAGC 

TTTCATCTCAGGGAACCCATGGGAGGAACTCAGTCTCTGCAACAGCCCTC^ 
2 0 GCACAGGACAGATTCTATGCCTTGAAGGGACCTAAAGGCCACCTATTGAAGCAATATCA 

GGCAAACTGATGAGTGTCAGCTGTTCCCAGGGGATGAGGGCTGGAAGGAAGTTGTGCCGAG^ 
GCGAGGTCCCTCACCACAGGGACCCTCGTAGGATTCTCCTAGGAGGACAAGAGG 

CATAGAGGGTT AACCAG CCTGACCCAG CTGCCTCTT CTCCAAGAAACG CTTTGCCATTT CTTAAGGCCTGGTCAAAAAACCAGGGT 
GGCCTTGGGTTGCAACAGGACAATCCGGTCTGCCCCAAAAGATTCGATTGTGTCATACTACT 

2 5 TCTGGGACTGTCAGTCACTTGGCCAAGAGGCCACCCCCCAGTAAGGTCCTTTCACCTAAAACTA^ 

CTGTCCTTCTAAGAAGATGGTGTACTTCCCCCGAGCCCCGAAAAAACCACCCCT 
AAGCTTTGCCAAGCCCACCTGCrrTTGTGAAATTTACTCAAATAAGCTTAAC 

ACITTT GG AT ATAGG C CAG AG CCT AGG CTGTCTCTG ATG CTCTGGG L 1 1 1 AGTG CTTTTTT CTCTGAAG C CAT ATC CTT CCACTT A 
CCAAGTGAAGTGACTGCTTGTCTGACATCTGACTGCAGCCCTCCATGCTTAAGCAAAAAGCCCCGTT 

3 0 TT CCTCTTCTTCCCTCTCTTCCATG CATCTGGCAAAATTTAAAC CTTTTTTTCTTTCTTCCTTTTCTTTT CTTTGCTT AAATGCTA 

AT AAAGCTATCCCAAAGCTGCTATCTGAG CTATTXTTTTTTAACCTTTCATTATTAGGG CCAAG AATTTG AAG AGGAT AACCC CAA 
ATTCCCTAGTAACATCAACATTGCAGCAGAACACCCCAAGGCTGGGCTCGGGCAATACTGGGGGAGATC 
ACAAAAAAGCAGCCCACGCCCTTGGGAAGTGAGATTGGCAGCATCAGTT^ 
GCCCAAAGTCTGGAAGCACATGGGACCCTGAAGTGAACGTGTACATGGGGCTC 

3 5 GTCTAAGAACAAAACCCTCAGGGAACCCAAGGGTTGTCTAGCAGGGTAATAGGACATTCTGCCAACCCT^ 

CCACTGCTGGTGATTGACAGATGCCTCTCCAGCCACTTTAGCTCTATCCATTACACCT^^ 
GGGGATCGCTCAGCACCTCTCCATGCAGGTATGTGCCCCAGTCATGGG 
GTCGAGTGGTTGAATGGCAACCAACTACATGGCATTGGCTAAACCAGTGCTCCCAGATTC 
GAATCTAAAGGTGATAAGTCTTCGTACCAACAGAAGTCCTGAACCTTGCAAGTTTAG 

4 0 CAATTGATTGTAAGGAGAATTAAAATGAAGAACACAAAAACAAAAA^ 

AGGCTTGCTGATCTGTATATTTCAGTCAGGCCTCACAGCTGTCCT 

GCCAACATTCTAGGCTGGAGCTGAAACCCATATATGTCAAAGCCATAGTCTCTTCAAGTTCTCAGGG 
TCATGCAAGAGAAATGAGGCTTCATCCCTGGGCACAGCTGATTGAAGATGTTCAAATC 

TAACCCAAAATTG AATTTCAG AGGGG CTTTGAAAGG CAGCTG TT AACATGTG ATGGTCATTG GG TTT ACAGAAC CATTCCCTAT AG 

4 5 CTCAAGAAATCCAACTATTTCTAAACAACCAGAGGCCCCATGCAAGAAGAGTTGGTTCTCT 

CCATGGATGTTTCTAGAGATGCCCTGAGGCCTCAGCTCTCCCACTGTGAAGCATGGTGTATTTCAAGGC^ 
AGGTCACAAGGAAACTGTCTGGGGAAGTGAAGGCAAAGATCAGGCCCGAACTGGTGAAGTTGTCCCT 
GGTGTAGCTATCCCAACCCATCCCTTTGCTCCTGTGTTGATCTTTGGTCCTCT^ 
TGGTACTTCAAAGCAAATGGCTCATTACAAAGAAAAGTCATTAAAACATGAATATC^TTTTAT 

5 0 TATGATAAAGTCAACTTAAGGGTGGAGGTATTTAJTrTGGTTCAGTCCAAGAAAGGTGT 

AAACACTTT ATCTGTCGT CAGGAAGG ATAGTGATGAGTTCTG CATGCTCAACTTGAATTCCCAG1AACCCCAGGCTATGG AATGATG 
CTGGGCCACATTCAGGTGGATCTTCTTACTCAGTTAAATCTTTCTGGAGGTGTGT 

ACAGTGAAGATTAACCATCCAGGACTCGAGCATCCTAAAGGTTAATATGTAATGTGTAAC1ACAGATCTCTG 
ACTTGGTTGACACCCAAGGGGAAAAAAATGGTGTGCAGAGTCGAAGTGAGGAGACTGCCAGTGACCAC^ 

5 5 TCAAAAATTCAGATTACAGGCCACAGAG AAGGTTTAGATTTCCTTGTC^ 

AGATGTGGGAAGAACTGGGTGAAGAGGATTGATATCTCTAGTGTCTACTAAGTGGAAATTGCTTC 
TCCAGTGGCCAAAGGAAGCACAGAACCCAAAAC!AGGCrCGGGGAGGGTAGATGAGCTCATCT 

CGT CTG ACT AGGG AGTGC CT AG CTCATT CT CAG CTGT ATGACAC AAATCAG C CACAT ACTTC CTTGTG T CGGGGCATGGACGG CTC 
CATG CG AG CTTT AT CCTGAG ATATT CAAAAGGAAG AAATGGCTT CTGTCACATG T CTTTGGTT AGGTTTTATG CCTG TCCT AACTT 

6 0 GTGCCAAGCTGGCCTTGGTCAATGCACCCCCCAAAAAAGCCAACTTCT 

GGTAGAGGAGAAGGCAGGACACTCC^GAGCCAAGCCGTTTCCCAAAGGCAGACTTCTCT 
CATCTGGACACATAGTCGACTCACCTGGGTCGCCTTGCTGATTACCCT 
CCCTAAAGGGGCAAGGCAGCCAGATGTGTCTTGGATCTATAAATGAAATTGATGTTGAGTC 
CAAGCATGAACCCTGAGCACAGATTCCAGGTTTTCAGCCCTGATCGAGCTCT 
65 GCTCAGGCTGGGTACAAGCCACCAAAGTCCTCTTCTCCTTTCTAGT^ 
TTCCCGAGACAGTCTTCACATATATGTGCCCTTCTTGGC^ 

AGAACACAACAGATTTTTAAAATG AT ACAGT AG AG ACT C CAGCTTCCCG CT AAGATGAACTCAAACAG CCAT ATTGACCCTTGAAT 

GGGAACAGGAAAATTCAGACCAAACATTTGAAACAATGGTTGGC^ 

GGGCAGGAAATGAGTGACAGTTGCCTGAGCTTGTTGTCATCAAO 

7 0 G<^GACCGTCTCCAGCTTTTAA aTT TAATCTATC 

TTGG CT ATTG CTTTT ACAGCCAAC CAG ATT CTCT CTGTCTTGT AAGTGAAG CT ACTT AT ATCT AAT ATGATTGTTG ATCT 

GCTCAAACCCACTCTCTTGCTTCrCTTTTTCAClTGTCCT 

CTTTAGAACCTTACAATGTTTACTGAAAATGTGTT 

TAGATGTTTTGCCTAAATGCACACGCCTGGTATCCGAGGAGGCCAGAAGTCTGCAGATGACCTAGAATTC 
75 TGAGCTG TTATC AGCG CCTGGAAACTG AAT C CAGCTCTTCA<? A CAA &G C ACTTATG ACTG CTGAGCCG TT CTCT AG CG CCACAATT 
ACACGATGATAGGGATGCTAAGAGTACCAGGAA^

ATCATAACTTAG CT ATTGTTGAATGT CTTCT ACT AAGTTTTATT ATTT CCATGAT AGGATTAG ACATTCGATCAGATTTATTCTTT 

AGCATTCTATGTATTTTATTACC^CTATAAAATAGGGTTTTTATAAAATTCCT 

ACATATTGCTTTTGAATTTTTTTTCAGTGCTGG 
5 AGTCCTGTGTATTGTTTTTATATACAATAAACATATTTGACTATTTT 

CTATGTTGACAGTCATATATC^TCTACAAATTATGATAATTTTGCTTC^ 

ATTTGTGJATATATATTTATATATATGCTATATTTGTTTATATTTCCTTTCAATC 

GCCCTAAAGTTCAGTGCrGAATGTAAGTGATGAACACTCTTGATTCACT 

TCAG^TTACCATCCTGTTCTAGCITGGCTTrGAAACT 
10 CTGAGTTTGAATCCCCAGCCCACATCAAAG 

CCCCCGCCCCAGCCCCAGCCCCAGCCCCGCCCCAGCCCCGCCCCAGCCCCGCCCCAGCCCC^GC^ 

CCCCAGCCCCAGCCCCGCCCCAGCCCC^GCCCCAGCCCCAGCCCCAGCCCC^GCCCCAGCCCCAGCGCT 

GGCAGACTCAGGAGAATCCCCGGAAGCTGGTGGGCCAGGACGA^ 

GGGAAGGGCTGAC^CTGAAGTTATCCTCTGACGTCCTCTCACATACATTC 
1 5 AT ACATTGTG CATACATT AAACAAACAAGT AAAACTG CTTT AT CACAAGTGACITATCTTTTAACAAAACC CCATTTTTTTCTT AT 

TAAGATAGCCACATATCTGTTTGTTTTGACCATTAAATAAACTATATTAAGCTG 

GACCGGGCACAGAACCCGATGACTTCTACATGTTAGGTCAGCACTTTACCACT 

AGGTCTCACCTTTGCACAAGCGCACACTrA^ 
2 0 CCTCAGATATTTTGGGATTGTCATAGAGTTCTGAAATTTCACATGGTATCT 

TAAATAATAAATAAGTGTATGTATGTATGTATGTATGTATGTATGTATGTATGTGAAGGGCCATA^ 

GTGTTTTAAGCCACTGTGCAGATCAGAATTGCCAGAAGCCTTTAAAAC^ 

TGAACAGATAGATTTAAGAAGCATCAGTCCTCGGCTCGCAT 

2 5 GTGGTTTCAAGTGCCAGCCACCAATGGGGATCTTAAGATGTTAAT^ 

AGCGTGACAAAGCCTTGAGCCACCCTACCTGATCAGAGCCTGGATTAGA^ 
ATAGAACAGGCATGGTTGTTCATCAATAGAGAATGAAATGAGAAC^ 

GATGCCAGGAGTGTATACACATCCTTTTGTATGTAAGTC^TG CTCA CTATGGT CTTAT CAT AT 

CCCTCCCATAGACACTTCATCCCTCTTTGT 

3 0 GTAGACGACACCCCACTGCACAGAGACATGTTTGCATGGCATCAGTCATCTCCCT 

TCTT ATTGCCAG AGATTTG CT CAAGCAACCAGGGAGTTATGTGAAAATCAGCTCTGCCGGAATCTACTG ACA^ 
ACAGAAATATTTACTGTCTTGTC 



CAAGG TAAACAG CAAAGCCACCAGCAGGGCTGTGGCCAGCAGGG ATGGG ATTC CGG CTGCCAGTCCAATAGAGGG ACCACAGTCCT 

3 5 GTCAGAGAGAAAACTGAGTCAGAAAAGCCCAGATTCTTCAAGAATAGAGGTTACAGCGGGTGTTG 

GCATTCAG^AG^CAGAGGCAAGCAAATCTCTGTGAGTTTG AGGCT AG CCTGGT CT ACAAAGCAAGTCC AGGACAACC AGGGCTGGT 

TACACTGAGAAACCCrGTCTCAACAAAACAAAACAAAATGTAGGAt^TATTCAGTGATTA 

GGCTCTGACTGCATTGGAAAAATGAAATATTTCAAAAGATATCTGTACTGTATATGA 

AT AT AGTTT AAG CCCATTT GG ACTG CTG ATCATG TGTT AAAAC AAG GGACAACT GATTTGAAT AGT AG C CTT ACTG AACAGAAG CA 

4 0 CAAACTACCCTTCAGAAGAAGAAAAGCCAAGGTACAGTTTGGAGGCACAGTTAAAT 

AGCCTGAAAAAGTCZAAAG L ' TITTTT AAATGTTTATTTT^ 
AACTAGCGAAAGTGCAGAGAACACATCTCGGTGGAGTGCTCCGTTGTGA 

AAGGAATGTTTCAGAAGAGTGGGGAGAAAAGGTAGAAGGGCCAGAAGTTGAGAGACACTAGGGA 

4 5 GGGCACATGGGCTCCCIACAGCTGGCTGCCGAGATGCTGCAGTTAAAGGCTTTCA^ 

GTCCCTGGTAGGTCAGCCATACTCCAGTGGTTGACACCAAAACTATG 
AAAATTTAAAAGGAGAAGAAGTTTCGAGAGGTGTGGAGGGGAATTTAAGGAGAGCTGA 
AATATGTTGTGTACATGCACAAAACTCTAAAAGAACAATTAAAAAGTTATATTAA 
CCXSCGTTGTTAGC^CATCCTATGCCATCTGTTGTTATTTTGAAAACATTCA^ 

5 0 CTGGTCTGAGTGGTAACAGACATGCTTGATAGCACCACAAGGTTTC^ 

AAATTCCTCTAAACACCCTACAAATGCAACACTAGCGGGGTGTGATTTTTACAA 
ATGAGGACnSTTTGCAGAATGTCATTTCACACATAAAACCAAACATT^ 



CATAGCGATTGACTCTTCTATCCCTGGCTGCTTTCCTCCATC 

5 5 GCATTTTATTTATTTATTCATTCATTTATTTATTCAACCTGCATGTATGTGCA 

GAGAG CAT AAG ACTCTCAAGAGCTAGG 

MOUSE SEQUENCE - mRNA 

ATTCTGGAGACACCACCTGATGGTACAGACAGAGCTAGACTGTCTGCTTCCATGGATCCACTGCAAGCAGTCCACCT 

6 0 GAAGAAAAGA.CCCAGGCAGCTGGGCACCCCAGTGGCTTCCACGCCT 

AAAAGAAGATGGGAACAACTCGAAGAGCCTTCCTCATGGAGCTGGCCCGAAGGAAAGGGTTCAGGGTGGAAAATGAGCTC^ 
TCGGTC^CCCACATTCTGGCAGAGAACAACTCAGGGTCAGACGTCCTGGAGTGGCT 
GCTCGAACTCTTGGACATCTCCTGGCTCATCGAATGCATGGGAGCTGGGAAGCCAGTGGAGATG 
TGAATAGAAACTCCTCCCCGAGTCCTGTCCCAGGCTCCCAGAATGTTCCAGCACCTGCTC^ 

6 5 C!AGCGAAGAACCACATTAAACAATTACAACCAACTATTCACGGATGCCCTTGAT 

TGAAGGTTCTTG CCTGG CATTCATGGG AG CGT CCTCTGT ACTG AAATCTCTG C CATTCCC CATCAC CAG CATG AAAGACACAG AGG 
GGATTCCTTGCCTAGGGGACAAGGTGAAGAGTATCATAGAGGGAATTATTGAAGATGGAGAAAGTTCTGAAGCTAAAG 
AATGATGAGCGATATAAATCCTTCAAACTCTTTACCTCTGTGTTTGGTGTGGGACT 
TTTCAGAACTCTCAGCAAAATACAATCAGACAAAAGCCTGAGGTTC^ 

7 0 TCGTTAGCTGTGTGAACAGGCCAGAAGCCGAGGCTGTCAGGATGCTAGTC 

TGAAGAG CAGCAGCTCTTG C^T AAAGTGACACACTTTTGG AAGCAG CAGGGGTTGCT^TTGT ACTGCG ACATCTTAGAGTCAACCT 
TTGAAAAGTTCAAGCAGCCCAGCAGGAAGGTGGATGCTCTCGACCATTTCCAG 
AGAGTGCACAG03AGAAGAGCGGCCAGCAGGAA 
7 5 CGCCTGTGCCCTGCTTGGATGGACCGGCTCCAGACAGTTTGAGAGAGACCT^ 
TGGACAACCACGCCCTGTATGACAGGACTAAGGGGAAGACTGTCACGATTTCACCTTTGGATC

GCCTTGAGGGTGTTTCTTGAAGCAGAAAGTGAAGAAGAGATCTTTGCCCA 

TGCCTAAGACGGAGTTGTTGACATTTTTCTTTTCGGTTTAGA 

GG ATTCTTTGACCTCAGAGG ATG CTGGCAG CAAGTGGATCATAAGTATACTATGCAGAGAGAATATT CTAAGGCAACTTTGAAGTC 
5 TTGATAAACCAGGATGCAGCCATGGTTCACAGGTCTTGGGCCATCTGGCCTACTGTTC 
TGTTTGAATAAAAATGGTGTTGTCATTGCAGCATCATCTAGAGAA^ 
GTGT ATTGC CAAT AAAAAT AGGAAAC 

MOUSE SEQUENCE - CODING 

1 0 ATGGATCCACTGCAAGCAGTCCACCIGGGTCCTCGGAAGAAAAGACCCAGGCAG CTGGGCACCCCAGTGGCTTCCACG CCTTATGA 

CATCCGATTTCGAGACITGGTCCTCTTCATTTTGGAAAAGAAGATGGGAACAAOT 

GGAAAGGGTTCAGGGTGGAAAATGAGCTCAGTGATTCXK3TCACCCACATTGT 

TGGCTCCAGCTGCAGAACATCAAAGCCAGCTCCGAGCTCGAACTCTTGGACATCTCCT 

GCCAGTGGAGATGATGGGGAGACATOVGCTTGTTGTGAATAGAAACTCCTCCCCGAGTCCTGTCCCAGGCTC 
1 5 CACCTGCTGTGAAG AAGATCT CCCAAT ATGCTTG CCAGCGAAGAAC CACATT AAACAATT ACAACCAACT ATT CACGG ATG CC CTT 

GATATCCTGGCTGAAAATGATGAGCTTAGAGAGAATGAAGGTTCTTGCCT 

GCCATTCCCCATCACGAGCATGAAAGACACAGAGGGGATTCCTTGCC^ 

GGACTGAAGACAGCTGAGAAATC^TTCAGGATG 
2 0 AATGCAGAAAGCAGGATTCCTCT ACT ACGAAGACCTCGTTAGCTGTGTG AACAGGCCAGAAG CCCAGGCTGTCAGCATC 

AGGAGGCGGTTGTGACATTTCTTCCAGATGCCTTGGTCACCATGACTG^^ 

GACTTTCTAATTACCAGCCCAGAAGCCACAGAGGATGAAGAGCAGCAGCTCTT 

GTTGCTTTTGTACTGCGACATCTTAGAGTCAACCTTTGAAAAGT^ 

AGAAATGCTTCCTGATTCTAAAACTGGACCACGGGAGAGTGCACAGCGAGAAGAG 
2 5 ATCCGTGTAGATCTGGTCATGTGCCCCTATGAGTGCGCCTGTGCCCTGCTTGGATGGACCGGCT 

TCGGCGCTATGCCACACACGAGAGGAAGATGATGCTGGACAACCACGCCCTGTATGACAGGACTAAGG^ 

CACCTTTGGATGGGAAAGTGTCAAAACTACAGAAAGCCTTGAGGGTC 

CTGGGACTGGACTACATTGAACCATGGGAAAGAAATGCCTAA 

HUMAN SEQUENCE - GENOMIC

TTTGGGATTCAGATTCTATTAGTGTGCAATGGAGCCTGTTCACT 

GTGAACCAAAGTTCTACACAGAGAACAAATATTTATGTAACAATGTATTCATGTAACAG^ 
TTGCITTGTATCCATTGCTCTGGGCTGAATGTTTGTTTCATCCACAAATTCACATGT 

GTG T ATTTGG AG TT AT AC CTTTGGG AGT AATT AGG ACATG AG TGTGG AGCCCTC CTG AACGG G ATT AGTG C C CTT AAAGG AGGAAA 
35 CACGAGAGAGATGATCTCGTTCCCGACCATGTGAGGATAGAGCAAGAAG 
GAAACTTTTCAGCTGGCACCTTGATGTTGGATTTCCCAGACTCTAGAAT 

ATCCTGAAGTGACTAAGACACCCTGGTGATCTGGGAGACTCCAATGTGTGGAATTTACCTAGTTTTCCTCT 
CCrCTTGGATTTTCCTGGCACTCTTCCCCTTCTCTCCCTGTGCCCT 
AGTTCTTTCTGTTGGACACTCTTCTGCCCTTTCTAATAGAGCTGAGATTTACTAGTAG 
4 0 GTAATCTGCTCTCTCAGAAGCTTTGGAAAGGAGCTAGGCCCTTTT^ 

TTAGCACCTAATTATAAACTATTACCTCCACAATTTAAAAATATTGAATATGAGATTTTGTTATT 

CACAGAG CTTT AT AAAAAAAGATT ACT ATG TTG AAAGTGAGTC CCT ACTT C CCCCT AC CCCGATCAAATT ATT CTCT CTGAG AAAC 

ACACGTTTAAAATAAATCTCAAATTCGTTGCATTGAGAGCTGAGTTTGGTACCAAATTCACGATGTG^ 

GTTGAACCCCTTTTTCCTCCCAGGTGGTGATGAAAGACTT^ 

4 5 CAAAACCAGCTTTAGATAAATGGTTTGGCACAAACACTTGTGGGCCTGAGCT 

AGAATTCTCACTGGTCTCTCTGGATGGAAGAGGAATGGTGTTCACAGC 
CTAAGCCCTCTAAGACATACAAAGAAGGTTACACTCTAAATCGGAAGACGAGCCATACTTG 
GCAGAAGGTAATTGAAACAGGGTAAGTATCAACAATGTGTTATTAAGGTGAGTTCCCAG 
GTGCAGAAGTCTATTGGGGATGCCCTTTGGGATCATCATCTATGGGAGCATGAAGGAGGAGATTGGGCA 

5 0 TATTTGATTGTAGTACAGATCTCAGCCAATCTAATAG^ATGCTCTAG 

AGATGACCTGACCTTTGTAACCITTCCCCCGATCATTGACCAATAAGT^ 
TGAGCCATCAGCTCTGCCATTGTCAGTAGCAAAAGGAATGGTGACTCCATCCTGA^ 

ACTACAGAGGGAGATCTGTGACCTTGAAGATAAATGTG rm ^^i'GGTTGTTGCTGTTTTGCTTTTTTTTCTTTGAGATGGAGTCTTG 
CTCTGTTGCTCAGGCTGGAGTACAGCAGTGTGATCTCGGCTCAGTGCAAGGTCT 

5 5 GCCTCCTGAGTAGCTGAGATTACAGGTGTGTGCCACTATGCTCAGCTAATTTT^ 

AGGTCAGGCTGGTCTCAAACTCCTGACCTCAGTTGATCCTCCTGCCTCGGCCTCCCA 
GTGCCTGGGCCAAGGATAAATGTTTATAGCACGAGGGAAAAAGGTACATTAGTAAGGCACTGG 
TATTGTTATGATCAGTCTCTACTTGCTATTCATTTCACTCTGATAAGACCATCT 
TACTATAGCATAAAATGTCCTCCATAAGATCCAGTAAAGAATAGCTGAAATATTTAGCTGACTCTTT^ 

6 0 CACATTTCAAGTTTAGTTTAATTCTATTTTACAGTTTTGGT^ 

TGTGAGATTCTGTCTGTCTCTCTCTTTATTTATTTATTTA 

AGTCATGGCTCATTATAGCCTCAACCTTCTGGGTTCAAGCAAT CCTCC C 
ACCACCACACTCAACTAATTTTTTTACTTTT^ 

TCTCAAATGATCCTCCTACCTCAGCCTCTCAAAATGCTGGGATTATAGGCATGAGCCTGGCC^ 

6 5 TTTTTTTTCCAGTAGTTTITGGGGAACAAGTGGTGTTrGGTTACATGGATAAGTTC 

CCCATCCTCTCTCCTTTTACAATGATTTCTTTATGATGTGCCT 

TTAGTTTCTTTCAGATCTTTGCTAAAATGTCACCTCCTCAATAAAGCCTAATTGGATAATCTT 

TACGCTCTCCCCTTTATTTCTCTCTACAGCATTTATCACCATCTGGCCTC 

TCCTTCTAGTAACAGAGAAGTATGTAAATCCCATGAGGCAGG 

7 0 GCACATAGTAGGCACACAGTCAATATGCATTGAATGAATGA 

ATAGTGACCCAGCTGGAGTAGCAGCTTCAGCTCTCTGCACCTTCCAGGAATCCCGTTTTG^ 
GGGCTGCCrTCTGTGGCCCTGCCTCGGGCCTGCCACTGTCCTGCTCAGAAGCTCCCA 
GTGGTCTCTGAGGTCTGCrCCAAACCTAACTTCTCTCCTTTrCT 
GCTCCCTCTCATGTGAATGCCCTTCCTGCCACCCTTTCCTGCCCTCCT 
75 AAACATCTTC CAGGG CCTT CT G AAG AATCTGAUG AT AAATG GT AAAG CATTT CACAAAG C CTT AAC AT CTTTAGAG ATG ACT ATT AT 
tgc cagt atg t catg atagt att attgt at cat aaacttctcaatt cagt aat ag atgtctggaaggttg aag ct aaag catc aag
ccacattctccagtttaccagtaataaagctgatcttgtgactctc 
agaacccatggctaaatgttctggaactctgcgagccgactc'ct 
tttgcagtcagttcataagctgcaggagagaaagatgtgtttaact 
5 aaggatctggtgacagaagtgaaggaacagtttggggagtgatgatgagtagt^ 
tggaatgtaaacgggcctggtgctgaacouiagagaaatgggctagagataaaga 
ggattttacccccggatacccaggagccatggccccaaccactgatccacagc^ 

tcattattgcctgattaatttccctaaggcaccattttgatcacct^ 
1 0 tgcaaataaaatcccaactgtcctattcaaatttgt^ 

tgtcacccataattttgtatacagttaagtttaataaacattttggaata 

cagttccattgttctaactatcacrtctcatgttgtagatgaaggtgaacc 

tgttaggcacccaacatatatagctaccaaatgcitcttaatggcaaga 

tctacacatgctcctggcaag^cagttctaagctccagaagacact^ 
1 5 aagctggtcctcggtgaaaaaccaaaccaaaacaaaaccctgcccgcacatt^ 

aacgaatagaagatagaagatatttgatataaaagaaaatgaaagtgccggaaacagaagt^ 

aa.cgtgcacagagaataacaacaggactcctaaggataaaaatag 

ggaaaacctaaaatcctccccattgtttccrcccccataczacrtgta 

ATATTACCCAAAATGGAAAAAATTTAAAAACAATCCCGAACTAAGGGGAGAAGTGGCTGCCT 
2 0 GAGCAGCTGTGTTCITTGTTTCCTCATAGCCATCATCACCT^ 
GCAGCATGAAAGGGAGGACAGAACAGTGGCCCAGCTCCCATC 
AATGTGAAGAAGTTGGCACTATGACTTGCTATTTTACATC 
TACATCTGGTAAGTAGCAGGTGGTTCAGGCTTGTGTAGCTCCTGAGGC^ 
TGTTCTACAAAACAAACAAAGAAAAGGTAAGCATGTTGAGAATACAGTGGGCGAA 

2 5 CACACATTGT AT AGAGTTT CT CT GGTTT C CCT ACACTT AATTCACTGCAGAATTTTTGTT1TG ^ CATGGGAC CAG 

TGTTCCCATAAATCATTGCAGGACACCCCrCrGGTCTC^ 

TTTTGAGACAG^GTCTCACTCTGTTACCCAGGCTGGAGAGCAGTC 

AGTGATCCTCCTGTGTCAGCTCTTCCTAAGCGGGCTGGC^ 

TTGTGACAGGATTACCTACAAAGTATATGGGAACACAGGGAAGGGAAGGGGGAA 

3 0 GG AC AG TTCTTGGAACTTG CAT GCCAGTGGTCTTTGAAGTGCTTT 

CCATTGCAGAGATAAGGGAAATAAAGCAAAGAAAGGTTAGGTTT^ 

TTCATATTGAAGAGTCTTAAATGGAGCAGAATTCTCCACCAAGTAGCATAAACAAATAAACT 
GATTATTGACCAATCTAGTAGATCAAGGAGAGTGCTCAACATCTGATGC^ 

TCAATTCCTGCTATTGGCAGCAATGCCAAAGAGCATAAGAGAACTCTGTATAGTGTTTTATTATTTTCAAGGC^ 

3 5 GATTTAATTTAGTTCCTGAGAAGGCGATGAGACATCTGTGCTCTCAGACCTTATT^ 

TAAGAGACATTAATTGCTGAAAAAGCAAAAAGAGGACATTGAGGTAACTCAGAGAAAC^ 
GGCTGGAGCAACAAAGGGATGGAGCTGGGATTGTTGAAAACTGGAAGTTT^ 
TAATAAGGAAGTGCTGCCTGTCGTCCCTGGTGCCATAGAATTTCATAGGGAAACCTCTX3CA 
GATCTGACCTAAGTGGTATTGATACCTCTGAAGGGTACAGTAAGACT 

4 0 AAGCTAAAGACTAGAACCAATCACTGCTGTCTGAGTAGAGAACCATTACTGCGAAGA 

AAAAGGTCACCTCTTTCCTCITATCTAGACTTTCAGTCT^ 

AATGAGAAAGAGAATTTGCTCAGTCCCAGCCTCAACCTCATGGAATCCATCAGAAAAAAGGTAGAT 

CCTGATAACCAGACAGGCATCAGTACATTTGAAATTTTTGTTGGTAATGTTT^ 

ACATTGTTTAATATGATATTTTGATGATTTAGTAGTTTCACAAAACATATATATCCGATTTCA 

4 5 AACAAAACAT AAATT ACTACG AACTG ACATTT ATTG AAAGTGCTACCT^ CAGGTTATGTGGG CT A 

T CTCAGTC CTCATTTT ACAG ATGGGG T AAAG CTGAGG CTT AAAGAAGTTAAG C AA CTTGCTTTG AAGCTT CAG AG CAATAGAGACA 
GAATTTTTATGGGCAGCCTGACGAAGCCTGAGCTGTTAGCCACATCT 
CACCTTTACTATTAGGTGGCTATACCTTGGGAACGAACACAACAATCCCAGACTCTTGC 
CAGCACCATTTGGTAGCTAAC^CTGCATTCTAACATGAAGCATTCAGCT 

5 0 AGACTCTCATCTTCCAGTTGATCTCTAGCAAACAGTCATAGCATCCGGGAATTTCAGGGCTGCATT 

TG AATT CC CTT CTGTT ACAG CTCAATAAACAGTTATCCAG CAT CTGTTTG AAT ACTTTTC ATGACTTG T AACACACAACTGT AT AA 

AGCTGCTCTCTGTATATTTTTAGACAGCTGAGTTACTAAACAGGTCTTACTTGCGGAGAACTGGAATCT 

ACCTGAAGATTTCTGATTTTGCTTTCTTGAGCAAC 

AATACGGTTGTGAGGTGCTCACGTGTTTTGGAGATAGCAAAAGTCTCAAATAATATGGCAC^ 
55 CCTGTAATCCCAGCACTCTGAGAGGCCGAGGTGGGCCGATCACCTGAGGTCAGGAGTTCAAGACC^ 
CCCCATCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGCGTGCGCCT 

GATCGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGATTGTGTCACTGCACTCTAGCCTGGGC 

TCTCCAAAAAAAAAAAAAAGGACAAAGGGTGAGCAGAGCTCAGATTGGTTATTGGA^ 

GTTAAGAGTAGTCTTTTAAAATCAGACAGATCTGAGTCCAAATCCTAGCTCAATCATTTC 

6 0 TGACCTTG^GCAAATTGTTTAGCCAAATGAGAAAGGTAGGACCACCTGAGAGTG 

AGGTGCTTGCCITGGCCGTGTAGTAGCTGCTCAGGAAATGAGATCTATAGAGTGTATTGG 
AAAGTTATGAAAATGTCTCCTTATGGCAGTGGTTATCATCTAAAAGTCATACCATCCCACTGAAGAAGA 
AACAGACTACTCTCTX5GACAACCTTAATCCACTTAAAAAACTAATA 
ATATGGCTTGGCCCCrTATTCTGTGATCITGAATAACrGAGCCTTGGrr 

6 5 TGGTAATACCCACCATGAAATACTATGGAGCAGTTAGAAGCAACAGAGCAATTAGAATACGGATCT 

AAATAAAAAGGTGATTGGCAAAATGAGAAATGTAACAATGCCATTTATGTAAAGTCAAAACAACC^ 
TGCAAATAACAGACIATCCAGTACAGTGTAATGATTGTCTACAAAAGGAGGAGAATGCAAGTC 
ATAAATAAAGACCAC^GAAGGGCCTCAGTACATTTAGAAATAAATAACATGCAAACAATGATGC^ 
TACTCTGAGGAGCTAATGAGATTGTGCGTATGAAAACAAAAGTTAATTGACTGTCT^ 

7 0 CTGAACAGAGATAAAAATTCCCCATCACACITTTGGCAGGAAGCTGTTGCCAGG 

GTCTGCTGGTGAGATGACATCAAAACCCTTCGTGTAGGAGGGTG 
CAGAGGCAGCAGC^GCCTCTrTCCCATGGATCCACCACGAGCG^ 
CrrGATGGCCTCCrcrrCCTCAAGACATCAAATTrCAAGATTTGG 
CGTTCCTCATGGAGCTGGCCCGC^GGAAAGGGTTCAGGGTTGAAAAT^ 
7 5 AGCAGAGGCTTTGTGAACAGCTTCTTGGGAACCCAAGGA^ 
GAACAGCAGATTTGAAATCAAAGAACAACTAGTTAAGG^
CACCAAAAGATGAGATAAATTTTTTCTTT^ 

GGAAAATTTAATGCTGJUVCCACTCAATTCTATTCTTGACCAATTTC 

TTTGATGCAGAAAGATGATTTGGTTGCTTGGTTCTTACCTATAC^^ 
5 ATGACTAGCC^CTTACCACAGGCACCCAGATCTGGCCr^ 

GAGATGCTAGGTAAAGATGTATTCACTGTTAAGACAGCTCTGTCCAGGGATGACA 

CCAGGCAAGGAGGCGATAGAAGCATTTGGACCACCTAGATTGGTTCACTCACT 

TGTTTTTCTAACTTCTGTGTAAGCTCTTCACTGATCTCAA^ 

AACCTTGACATAAGTGATGGTTCTGGATTAAACTGTTAAGGGGCTTGCAATG^ 
1 0 CTG TGCAAG AAAT AAATTG ACCACAAATTTT CAGG CAAAACT AT AAAAATTTT ATGTGAAAGGACATTTTG ATTGT AT AAAT AGTC 

TTTGATATTTGAAAATAAAACTGACAAGCATGTCATGCAGAAGAAATGTC^ 

AGTTTTCAGATATGTTAATGTTCGCAACACCCTGATTCATATC^ 

AAATTCCTGAATTTCCTATGGCCCTTCTTCCACCTGATTTCT 

CCTACAGCCACTCTGAATCTCATTTTCTAACATCCTTTCCAAGAA 
1 5 CAG ACTTACATAG AAGG CT AG AGTTGGAG AAG ACAGGAAG ATT AT AT AGGT AAAGAAG CCAAAAAAAT AGA 1 i 1 11 11 TTTTTl IT 

TTTTGAGACAGAGTCTTGCTCTGTTGCCCGGGTTGGAGTGCAGTGGCACAATCTCGGCT 

AGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACGGACACGCGCCACCA 

GACGGGGTTTCACTATGTTGACCAGGATGGTCTCTATCTCCT 

ACAGGCATCAGCCACCACGTCTGGCCAAAAATAGATTTTTTGGAAAAGA 
2 0 GGTAAAGAAATTTGACAAGGGCTAGGCAGGAAGGAAAGAACCAGAA 

TTCCAGCAAGGACCCGCTGATTTATTATAGACACAGCAATCAATACATACAA^ 

AGGTAAGTTTCTTCTGTTGGCAGGACAGCTGAAGAGGCAGCTACATGCCCCAG^ 

AGGCGGCGGAGGAAGCTCACTGGTGCATCGTGAGCAAACTGCCGCTGTC 

GAAGACACATTAAATACAAATCTGCTCTATGGCGCAAAAATA 

2 5 GCATAGATATATAGACTTTAATTGGATAGAGAAATAAAAATTATTGTAGGGGACAAATCTCT 

ATCCTXK5CTCTGAGCCTCCCCAGCCATGGGGTATGGGACAAGGTCCCCTCTCTCCCGACT 
ATCTGATGTGATCACCAAATGAGATAATGCATGTCAATGTGCTTTGATAAATTAAAACAGTT 

TT CTTGTGAATTGCTGGATC CATTG AC CTTT ACAG CAAAACCCT AAGGAATGGG CT CT CCCTC CAC AAT ATGAACAAGGTCAAATG 
ATATCAACAAATAGAACAAAGACAGGAAAGAAACCCGTCAACATGTTATAACTG GT^ 

3 0 TTATGGCTGTTTTTATTTTCTGGGTTTTCCAGTTTTTT . 

AGGCTTGGAGTGC^AGTGGCACGATCTCGGCTCACTC 

AG CTGGGACTACAGG CACGCGTCAGCACGCCCAG AT AATCTTTTTT ATTTTTAATAGAGT^ CC AGG AT 

GGTCTCGATCTCTTGACCTCGTGATCCGCCCGCCTTGGCCTCCCAAAATGCTGGGATTACAAGCGTGAGCA 

3 5 TATGTGTCTGTGTGTGCATATAT ATAT AT ATATATATTTTTTTTTTTTC CAGGCTGGAGTG CA 

GTCCTGCCATCTCGGCTCACTCCAACCTCCGTCTCCCAGACTCAACGGATCCTCT 
GATGCXSGGCCATGCCCGGCTAATTTTTGTAGAGATGGGGTTCCGCCATGT^ 
CACCTCCCAAAGGATTACAGGTGTGAGCCACCGTGCCCAGGCTACAATAAGCATGTACm 
GTTTTTTCAGCTGATAGGCAACAGTTTTCATTGTTCCTAAC^ 

4 0 TT CCTGCTTTACTK3GGCCTGAACACACAAATTAG AAATGGTG CTGG AGCTATTTCATTCTGCAA 

CATCCATGCAACATAATTGGGTGGAGCTTTGCTAACATT^ 
GGTCTTAAAACGTAGATAAGGGAAAGTATTAAAACCCnTGAGGCATTm 
GGCTAACTTCCTAAATTTTAGAGCARTTTGCCACCCGGGTCAAG 
CATCCTGAGTTCCAATACGTGGAGCTATTTTAAAATTTCATTCTGTTAT^ 

4 5 CC ATG TG AAG AG A CCACCAAACAGG CATTGTG TG AG CAACAAGG CTGTTT ATTTCACCTGGGTG CAAG CG AGCTGAGT CTGAAAAG 

AGAGTCAGCAAAGGGTGGTGGGATTATCATTAGTTCTTAC^GGTTT^ 

GGGGG CGG AT CTC AACAAAG T ACATTCTT AAAGG T GGGG AG AATT ACAAAG AAC CTT CTT AAGGGTGGGGGAG ATTACAAAATACA 
TTCGTTAGGGTGAGGCAGAAACAAATCACAATGGTGGAATGTCATCAGT^ 

CTTCAGGCCAT CTGG ATGTAT ACGTGCAGG T CACAGGGGAT ATG ATGG CCTGG CCTGGG CTCAGAGG CCTG ACAATT AGTT AAGCT 

5 0 TACTCTCTGTGTCTTGACCCAAATCATTGATGCTGAGGAAGAAAATGCACGACCA 

TAGCCGTTTAACACTTGCCAAGTTCCATTGTTCAAGCAGCTTGACACCCAC 

TTTATCTAC^GGATGCCACTGGAAGGCAITGTCTTCAGTGAAAGCAAGATGCAGC 

AGTACCTACTCTGTGCIAGTATTTGTCAGGTTGTCGAGACACAAA^ 

AGACAGGAAAGTCAC^AATTGTACCTTTGTCATCACCTGGAATTAAAAAAGTAGCATAGCATATAT^ 

5 5 CTTTAAAAAAAAAAAAAAGTTAGCATTTGCCTCCTCATT^ 

CTGGCTGCTTGCCCTTTCCATTCAGTCATTATATAATGTTTGACATGTGGATTCTGT 
CAAAAGTCCCTG^CAATCTCATGGTAGTAAAATGCATTATTTTTTTCCA 

GTAGCATGGCTTCTAGCAGTTAAGGCCATGTCCCTAGGATGGCCCCCTTTAAAATGCCTGCCTGAGA^ 
GAATCTCCTGTTTATTCTAGCCAAAGGCAGGCAACAGGGAGCTGGCCCGTGAACCCTCT 

6 0 AATGATAAATCCTTTCTCTGTCCCTTTGAGATGTAAATCTTCTAACCATCCAGAACT 

TTCAAAGAGGTAACTCCGACTTTCAGATTCTGTGGGAGGGAGGGCCTAACTTCAGAG^ 
TTGCTTCTCCTCCAGGAAAGCTCCAATTAACAAACCCAGATGGCCT 
ACCCTTCCCCTAGCAGCACATCCCAGTTTTTTAAAAGTCTCATGCCTTTTGT^ 
TCATTATG ACTGAATATAACCATCCTTCCCACTTTAACT AG 

6 5 AATTTATGGTGTCTGAAATCAAAATATTCAAAGTCATTGTTTGCAAAATAAA 

CTAG AAAAT AAAAG TTGAACTGG CCACT CTC CAGG C CAGGC C CAG AAT CCAAACACAG ATTCC CrGGl"ril'Cri"l ACAGTTTTTCT 
TGCTCTTTCCCTACCACCTTTGAGTGCCTTCTCCCTTTACCTCTCrc 

CTTTC CTC CCT AGAATG CTGGTGGG AG TCACTG C CT AGGGTGGG CT CCCAG CTTTTCCTTTTG ATGAG CAAAGG CT CTT AAT CT CA 
TCCAATTACCCCTCTCTGACTCCACAAGCCACGTGTAGACCT 

7 0 GAAATATTTGCTCAGAGAATGATTTATCTGGTGAATAT 

AATTTTTTTT CTG CAATG CATGAG T ATT AATTTTG T AATC AG CAACAT AAT AAAT ATTTT ATTTTT AT AAATTCAAGAG CATGAGC 

ATGAGATATAAGCxrrrrrcrrGcrcrGATcavAATTAcrT^ 

TCCCATTTAAGGTGGGCCAAAAAAAATCGAGGCAAGATCGTCTCAGTCATAAOT 
GTGGTCCTAGGTGCTAGTAAGAGTTTGCAATAAAACCAAACAAAAC^^ 
7 5 TTG CT AAGTGAAATTGG TAGAAGGG TTTTT AAACTGTAAACAC^ TCAGTTT AACTGT AAATGACTATTT AA TCACTG TTTACAAA 
ATAAGTCCTATATCAGATGGGAAGAATCAGGAATGACCTCTCC

GTCAGACAGAGGGTGGGAAGGAGTCTATACCCAGGCAAATAAAAATAGCTGCCACT^CCT 
TCATGAGAGGCAGGATCAACTAGGAAAAGAAAGACAGGAAGAACAGGGAAA1TATTGCCTGAGGG 
TGTGAGTTAATTCCATAACCCAGGGCTTTTCAfiACATCTGAGTC 
5 TTATAATATATACGTAATCTGTTTATTTTCTTTGTTTATTTGTT^ 

CAGTGGCACGATCTCAGCTCACTACAATCTCCACCTCCTGGGTTCAAGCGATTC^ 
CAGGCACGTGCCACCACACCCAGCTAATTTTTGTATTTT^ 
TTCACCTCAAGTCATTCGCCCACCTGGGCCTCCCAGAGTGCTGGGATTATAGG 
TTAATTGTCCCTAAACTGACTCATTTAAAATAAAATTAGCC^ 
1 0 T AATTT AAAATACAGATATAAAT AAACACAT AAG CCTTTT AAATGTTCAATTGTGT AC CT C CT AAAATT ATGTCACG TAC CTGC AG 
AGAAACGAGTACCATGTTGGAAAACAC^GCTGTCATTGAGGCCAAAGGCTCGAGTGGGAGTCAAATCT 
AGAAAAGATCTITCTCTACAGTGGTGTTTCTCAAAGTGGA^ 

GAGGGAGGGGCAATTTCCCCGTCCTTTGCATCTGAGAGTGCTGGTGACTGATTCAACCAGTA^ 

TGACTTTCCCTGGGACACAATCTCTGGGACCCCAGAGCCTCATGTAAGACATTGGCGCCTCCTCTTAGACC^ 
1 5 GGTCCCCAGTCCCTAGTTGCACTGAGCCCAGCTTTC 

ACCTCAGCAGAAGAATACAGGGCAGAGAGATCACCCAGCCAAGCCTGCCTGAA^ 

ATAGTTGCCTTTTTAAGTCATTACATTTTGGGATCACTGGTTATGCAACAGTAGATA^ 

CCAATGGAGCTTGTCAAAATGCATATTCCTGGGCCTCAGGTGCAATTCTCCAGGAA 

CTGAATTTTAACCTAGCTTCCTCGATGTCTCTGGTAGA 
2 0 ATCTATCCAGTCTACCGTCATCTTTTTACCATTACTATAAATCTGGCC^TGTGCTGGGCTCCAGGCATAA 

CTGTCCCTGCTCTCAGGGACACCAAGGACCAAGCTGK 

C CCTG T AGG AGACTGAGGT ACCAATG C CAGTTCTCTCAG AATTG CT AT AATTTGTTCAAG ACCT CCCAAACTT ACT C CAAAGCACT 

AACAATCGGTCrATGGCCAAGAAATTAGGAGAAGATTGAATTCACTGATAT^ 

CTGAGGTTGAGGTTGGCTGCCGGAGGGTTTCTGTCACCTTTTAT^ 

2 5 ACTTCAGCCTCAATCAaVGATTCATTTGCCCTTGAAAATG 

TTCATTACAGAAAATGAGAAAATAAAGAAAAACATTTCAAACACACAAACAAAGCTCC 
AACGTTAAGAGGTTCCTTGAGTC^GGCACACCTGGGTTTGAACCT 
TGATTACATTGCCTACTGTGCTAGGCTGTTGTGAGGATTAAGTAAGAATATGTGC^ 
CTAATAGTCACAAGTATTTCTTTCTAGTCTGATTCCCTTTATATC 

3 0 ATCCCATCTGTACTCCCCAGCATTCTAATCTCCCCATCAGAGG 

ATGGAGCAGCAGGGTAGGAGTGGATGGTGACAGGCCAGAATTTGGACGATGAGTCCAGGGCTCCCC 

TCTTTCAAGTTTTCTGGGGCAGTTCTACATGTCCTGTAGTCTAGTTTAGT 
TTCTATGTGGCAGCAGCAAGGCAATACGTTTACTGCTGTTATATAGGAGGTGCT 

3 5 TCGATTTTCTTATTATTATGATACCCATCCAGATCTTACTCCCT 

ACACAAAGTCATCGATGGCGAAGCACATAGACTTTGATGAGAACAGCCATAG 
TGGGTGAAATTTTTAACATTrCCAAGGCTATCTCTTCCCTTTTTTT^ 

CGCCCAGGCTGGAGTGTAGTGGCACAATCTCAACTCACrGCAACCTCCGCCTCCCAGGTTCAA^ 
CGAGTAGCTGGGACTACAGGCGCGTTCCACCACGCCCAGCTAATTTTTTGTATTT^ 

GCCTCTCTTCCCATTTTTAGATGGGGATGATCATATCTATCTCATGGCATTATTGATAATTAAATGAA 

CTACCAG AG TGT CT AG CAT AT AGT AAT ATTCT AAT ATTGTT AACRTTTC CAGT ATTGG AG TGTTCAG AG CAG AG AG AAAAC CACTG 

TACCAGCAAATAGACAAACAGCTGCCACAAGATAGTGATAAGAAAGTATCAGCAGAGCACACT 

ATACTGCTCGGTAAGGGCTCGGAGCTGAAAGTGAACAACACACAGAGACTGTGAATTGCCTGTGAGGGG 

4 5 ACGAGGAAGCACATCACAGCCCAGATCTAGCTGGCCACATGCAGAGGATCAGCCGAACTC^ 

AGTCTGGGCCACAACA-GAGATTCTCrrCAACAAAATTTCTTAAAT^ 

CCAATG CTTCITTTTTTTTTTTTT AAGC CACAATAAAACT ATTTTGG TGATGTCAAATT ACAATT AAAG TTTCT AAATG CTGACTT 

ACAATTTCTGAACTTACCTTGTTGGGAACTAGTGGCAACGAATACTGAGCT 

TTTTGGTTTITGTGTGTGCTTAAGATTATTATTGCn'CTTTTGTTT^ 

5 0 TCAGAGCCTTCCTTCATATGACGTACAGCCTGTCATCCACAAATT^ 

AATGTGAGCGCAGACATTCTCACACTGTCTCAGAGAATCCCAGGAGAAGAGGCAGTAGGGGGAGTGG 
GGGTCCCAGGGTTCCTCTTTAACATACCCTTCTCCCTCTCAGGGTTAGGTAAAACrCTA 

ATAATGTATC^TAAGTATAATAATATAATAATAATTTTTCCCTCCAAAAGGCAGAAATATAGATGGAAAATATAG 
CTTCCTATTAATCCTTGATCCAAAGAATCCnTCAGGAATTITATTATAAA 

5 5 TGGTTATCCIATCTrCCTTTAATTCTTGTAGAATGTTGCTTTGCCATGACTTATTTCTG 

CTCAGCTCTGACCCCAGCCCACACCTCTCTCTGAGCACCAACCTC^GATCCCAACT 

GCTGTCCTTCAAATGCGGCATGCCTGAAATAAACTTCAGTTCCACCGCTC CACCCCTTCCCCCA CAACTTCCTAAATC 
TGATTT CT AC CAAAAGTG C CATT ACTTT ACTGTG ATT AAT ATG CTGG AG T AATATTTG ACTCC CTC CCTG AATT CAT CTTTCAT AG 
GTGAGCTGATGCAGATGTCTACCATATTTCTCCTCTGCTTTT 

6 0 TTCATCTTCCTAAGTAGTTCCGCAACCCTCCCTACCCCATTACGCTCCCCTCCAAATCACT^ 

ATAACACTGTAGTGCTATCAATGTTCTGCTCACAAAAGCCrTCACTG^ 

AGGCTTCTAAGACCTTCTAGAATGTAGTTGTCCCTACXSAATTC 

ACCCAGGCCATTCTCCTCCCTGTCCTTCTGGCC^ 

TCAGGACTAGACACAGTCCCAGCTCTGCCACCAACCAGCTGCCTCTCCCAGCTTACACCATC 

6 5 AGATCTT CATC CCTCATG TCTTGTCAT AATTCACAT CAT AAGC CAT CTGTTT CATGT AAC AT ACT AGTGTCTAG T AACTT CCTTTT 

TCTTCTTTGGTACCCAACCrCCTCCTACTACCATGACCGCCACCACT 
TAAAAATACCTCATGAATCAGATTCTATCAAAAAAAGAAGATGGA^ 

AAAT ATTG CT ATGTTTT CT CTT CG ATGCTTGGGT ATG CTGTGTG CT CATG CT ATTG CTCAG ATTT AT AG ACTATTTG AACTG AAAG 
GATCTTAAAAATCACTGGGTCCAATAATATAATCCTGAATCTCAA^^ 

7 0 AACAATAAAAGAAAGATGAGTATCATTTACCAGCTCTGTGACTTT^ 

TTTGAAGTGTGAATGATCTAACACTTATTGAGGAGCTAAATGAGATGATC 
AGTGCTTAGTACCTCTTAAGAAAAAGATCCCTCTCCATCCTTCGAAAACAT^ 
ATTAATTTATTGTTAATAATAAACTGTGAGATCATTTAATGGGGTGTTTCT 
AATTGAAACCTAATCCCCAAAGCGATGATATTAGAAGGTGGGG 
75 G AG AAATT AATGT CCTT AAAAAAGAGGCCCGAGAGAGCKi CC"1 TC CCCTTCFG C7 ATGTGAGG ACACACAGTGC GTC C CATCTATG 
aagaatgggccttcactggacaccatatctgccgacccct

gttgtttataaattactcagtctaaggtattttgttatagcagcctaaatggattaagac 
tacatctgtacctatttctatgtctactttggatcataatataaaacgtatttctaa 
gccattggattattaaagtcctccactagtaaaaatggagcagcatgccaaaacct 
5 cagttttaccttccatcttttctggaccttgcttttcc^ 

acatgtaacccggacc^ctgttgccatactaattaatgtac^ 
taggtctagcatgtgtctttctccatcagat^ 
cagctggtaactctgtcttgcattttgcagtgattctgtcacccacat 
ggcttcaagcacagaaagtacaagtcagctcacaaccagagctcctcgatgtctc 
1 0 ccggtggaaatgacag<3aaaacaccagcttgttgtaagtgtcatgggtgtgatt^ 
aggcitacxjatcaacggagctgc^gtaaaccctaaatt 
agcttcagt ct cttc cttt at aat atgacat aacgat aatg 
aagcgctgaacacaatgtgttgactatagcaggcatctaacgagtggtcgctat^ 

aaatattt aag ccttg at attct agcag c cagaaaagcagg ag attcctttttttctct cactgtg atcatt cagaaaacaaaaca 
15 aaacaaaacaaacaaaacaaaaagttaagcttctcaattccaaat^ 

ccataactacagaagttcatagaaaacgaagatgatatttaa 

gtgttgataatagcaagttcaaatattagagatgtatataacataaa1 o 

actacttccaaatttittccgtacatatctgttattactatta^ 

agagactattcagatagcaccaacccaggccccccgaagactcc^ccaattgct 
2 0 aag aaccacttt aaacaactgt aaccagat att cacggt aacgggacttt acatcaacac ccagggcaa^ cag ct 

tctttctgtggacoscaaattaaattataaattt^ 

tctctgatccttctatccttcttctttctatcgagactgggtctcct^ 

tttag agacag cgg attact attt acagat agttgt c att aatgggcg caacac^ caaat att ctg aaaaaacaaat 

actg<3aattagtccaaaacatagtaagagcaaattacatag 

2 5 TTGAGCAAACCTAGATTCCAGCCTTTCTGATATTCACTAGTTTG<K5CAAG 

CAAAAAGAAATTGGATGGATCATATCTAAAGCTCCACTCAGCCCTCAGATTGGTAAT^ 
CTG<riTGTCTGCTAGCCTAGAGTCAGTGGTTAGCTG^ 



tcccctgctagaaatacaaao3tgccctaccttaaataatcctg<;ctctgg<;tctgtgtg^ 
3 0 caagcctgtgttaagtgctgtgtttcatggggctgctcagtgatcaaggcagaat^ 
gaaggagggattatgctgagtcatgactagcaa^ 
agcagtgtactccaagtttccgcataatccactctggtacctgtcttt^ 
aagggtatgggagaccccatcctgacctagaaggaaac^cgcaagtggtattccaatat^ 
gtgaccactggctcaggggcitg^aagcaaatctcctgcttatct 

3 5 ctgtgagtttagagaaaatgaagactcctgtgtgacatttatgagagcagcttctgtattgaaatct 

gtatgaaggacacagaaggaattccctgcctggggtccaaggtgaagggtatcatagaggtaagggtgaaatggg^ 
ttcccaataggtaggtgggaacgggggagagagggatgtagctaacagattttcctgtcaactatgtag^ 
aattttcctttatacatattcctctctctctcctctgtctctctctct 
atacacatatatatatgttaggggttgtgaatatcctagtcac 

4 0 g at agtg aatc cgttcgggtctg cagcaaccccaatt attgctt ccgcag aag aaag aattcgactgggggg cataagg cagaaaa 

agagactgaggcaagtttcagagcaggagtggaagtttattaaaaagcttt 
acccaagtgggcaacttggaggacaagtgcagcgtttgaccrrri'gacttggggtat 

ACTTrTCCCATGATTCTTCCCTTAGGGTGAGCCACCCGCATGCACAGTGCCC 
TCTACTGGAGTTGTATGCATGCTCACCTGAGGCATTCTTCCCTTTTCCGGTG 

4 5 CTCTT AATGTG CATGCT CATGCC CACTCG CC CAATTC CTG AGATTTT AATGG AAG CTGCTG ATT ACCAATTTCAAGTGTTTTTAT C 

TATTGGGAAATTGCCTCTCCCTGGCGCCAGTTGTGACTAATCATTTTATAGGGTGA 
CTCTAGCACTGGCTGCAACCAATTATCATTTTAGAGAGACAGTGTGATAACTCCTGGACCATCACCT 
TGGTTTGGAGGGGGGAGCCCTCTCCTGCCCTGCTCATGCCTGA 
TTATAATCCATACCCTAATATTATCACAGCTGGAGCTACATGGAACCAGCTAGTAG 

5 0 ACAGTGAAAGACTCAAGTGGCCTAAATCTTGACAGCAGGGAAATGGTCTGATTTCCCTCAGTTCC^ 

GCAGTGGTGCGTTTGACATGAAGTGGAGGTGGTTCT ACACCT CAGAGGTTGTGCTTATTTACTG AACGGTG 

TTTAGGGATTCCCCAGCAAAGCAAGTTGGTCATCTGGTAGTAAGTTTCTGTATAG 
TATCCCAACTCTACTCAGAGGAAGGGGGCTCTGAGCTAGACATCACAAGGG 
TTAACCTTGTCAATGGAAAAGTCTTCTTATTTCAGTAAGGTCATATTCATT 

TTGGATATATGATGAGCTAGCATTTTCAAGATTGTAG 

GATGCATGCACATTTCATGAGAAACCAAACTACTAGGTCCATGAATTTGAGAGTCTTA 

AAAATATTTTTCAACTGTCCATTAGGAGATTATTGAAGATGGAGAAAGTTCTGAAG 

AATCCTTGAAAGTAAGTGATTTTACATATATTTATTGA^ 

6 0 GTCAGCTTGGGCTGCAATAACAAAATACAGAGATGGCXaTGGCTTAAA 

TCAGCAGGGTCAGTTTCTGGTGAGGGTTCTCTTTCTGGCTTCTT 
GACAGAGAGAACAAGCAATCTTTCTGGTGTCACAAAGACACTAATCCTG 
TTTGGGAGGCO^AGGCGGGTGCACTTGAAGCCAGGAATTCGACCAGC^ 
CAAAAATTAGGTGTGGTGAGGCATGCCTGTAATTCCAGCTACTTGGGAGATTGAGACATGA 

6 5 GGTTGCAGTGAGCTGAGATCACGCCACTGTACTCCAGCCTGGGTGATAGAGTGAGACTC 

ACTAATCCCATCATGAGAGCCCCACTCTCATGACCTCAAAGGCCCrGTCTTAAATACGGTCATA 
ACAAATGGGTTGGGAGGGGGAATTCCGTCCATAGCTTTGGTCTAGGAAGAAAATGCCTAA^ 
CATGCTGTCGTGATGTACAAATTGCCTTACCCTCCAAGAGTCTGTCACTCATGGG^ 
GGGGTTGGTGGGGACAGGACGGAACTGCTCTTAAAGAGCA 

7 0 GCAC^TTTATTACCTCCCAATTTTCCCTACCrrAGAGTTGCAAGTGGA 

T ACAGGCCT AAAAT ATGG CCATG AGCAGGAG CACAGG CTTGGAGTGTAAACAGACCAGGGTTTAAGTTTGGGCTGTGCCAC^ 
ACCTGTATGATGGTGGAGAAACTATATAACCTACATGAGC^TCT 
GTGAGAGACCCAGCAAAAGGGAGAGCCCTATAGAGCTAGCGTATCACTCATG 
TTAGCCCTGCACAGGTGCCAGAACCTGCTTGTTACTrCTGTT^ 
7 5 AAAAATGC CCTTAAAAATAACCATCAC CCAC CTGCT CGTT AT AIT C^tiG AG AaaCT AGGTCAAAGATTTG CTTAT ACTTTTT CTTC 
ATTGCCTGAGACCTAGAAAGGGTC^TGACTCGGATGTTATAATGATTATCTTAGAGACT
CTCTTTACTTCTGTATTTGGAGTGGGGCTGAAGACTTCTGAGAAGTG 

GTGGTGGAG CTCT AAGT CAGTT ACACTGCAACTC CAAT CTGGGAG AG CTGGGACACTTCTTTGATG T CCATTGATG CTTGGATCT A 
5 AATCCTAGCTCCAGCAGTTTCCAGCTGTGTGATCTTGAGCAAGT^ 

GAGAT AACATTGAT ACTT AGCTGAGAAGT CAATCAGTT AATG CAGG CAATG CCTCAGCACTGTGTCTGACACAGAG CAAAGGCGCC 
CTCCACTGTGGCTGTTrGTGGTCCTTCACCCAAATGTCAAGCCAGGTTGTGra 
TGGGTCCTGGCTGTCTCCCTGGGTAAGAGTTGTGGGTtlAAGAGTCAAAGAC 
TGTTGCCTGGCACAAATGTGAAGATGCCCTAGAAACTGTAAAACACCAGGAAAATAGA^ 
1 0 GTTGGTATACCCAACCAGGACTC1TTAGCCTAAGACAAG 

AGGGCCTCTCAGAGGTTGAAGGTAGCCTGGTCACTTCCATTTATGAGGCTCCCAGTAAG1 
CAGTGTGAAAAAAACTGTGTGTTTAAAAAAATGTTTACTTTTAACAAGCAA 

CACAGCCAT AG TCCCT ACT CATnTTTGT AAAGTTG CT AAT CITTGAAAG ATTGCTAAT AATACAATG CCACAT AGGAAAT AAAAG 
AG TCT AAG AAAAAAAG ATCCAGT ATTTAT ATT CCAGG ACCCTG AAAGCACACG TTTAATAAGGATG ATTTACACC AGTT CT CAAGA 
1 5 TGCCATTCAAAAGAAATTT AAAT AAGTATT AAACATAAGTAGCAAG TAAATGGTTTGG AATG ATGAGTAAAGACTCT CAATG CTT A 
GAGAGAAATTGGGAGGGAATATAGTGTTT'GATTTGGATGAAAC^ 

CAATTC CCT AG CTl'lGT AAACTGG A i n i iGi T^ rrCCrGTTGAGCTTTTC CAACC CTTTGAATG AAACAG CTTT ATTCATTTAGT 
CAATCAGTGATTTCCATCTAACTGTTTAATGCTACCAGTTCTATG 
CAAGTAGTTATAGGCACCCCnTGGAGCAGCACT^ 
2 0 TAATTGAACATATTGAAAGGACAGTTCATGGTCAGATGGATAACCTGAT^ 
TGAGGCTTGAGCGTCCACATGCCACGTTTGCCTTGAAAACT 
ATAAGTTTCCTGTAATAAGTTAGTCGCCTTGCCATACCAAAAAGGAA^ 
ATAGTCTTACATCTTCATTCCTGATAGCTGCACATCrTCTTAAAGAAATAG 
GCATAAGCACAGGTGTTTGTGGTTACCCAGTGAGTTGTACT^ 

2 5 AGCATTGCTTCTTAGATCAAACTCCTCATGCTCA 

TAAAAAATAAGTATCATATTGAAATAAAGAAAGTAGTGTTCCACAAGTGGCGTGGTTTGGTATTGGA 
CATATGAATACCATTTTCTCTT'GGTCTTGCCAGCATGTTTCCGTGACTGTCC 
TGCAAGCCAAATGGCCACCCTGCTGCCTCCCTT'CCACAAT^ 
GAGGGCCAGTGGCAGGTGAAATCGCAAAGAAGTAGACTTAGGCTTTTC^ 

3 0 AACAGGCTCCTCTGGAGGTATCAGAGTTCCCCATTATTAGAGTCAAGCAAAGTTTG 

AGAGC!AGCGATGCTGAAGCATGATTTGATGTCCCTCAGGCTCrATC 
CC^TCCrTGGAG^TAAGCCTCCCTTCTCCrrrCCTCTTT 

GCTTTTCATCATAACAACTCTGAACTTTCATTAATTCATGCTGGTCAGGCAAAGAAGCA^ 
TGCCTTGCTTGAGACACACATCTACTCTGCTCCTATTCT 

3 5 CTCATCCCAAATCITTATGCGTTACCTTTTAAAAAACAGTAGGACACA 

CAGATCTGAACAGTTTGGATTCCAGAGGGATGTGAAATATTGTCTCCTTCT 

AACCACTGTTACCCTGAAAAATTCACAAC CAGGAGAACCACATGG ACTATTCTGTCTCTGTGGG AATGG AGTTTGGTGTTGATG CC 
TGTAGTCATGTTAGCTCTATCGTGGGGACAGAATCCCCT^ 

CCTGTCAAATGGCCTCTCAGG ATTTCTGTATTATGAAGACC CAG AGG CCG TCAGTGTGC 

4 0 TGGTTAAAGAGGCTGTCTG<JGCATTTCTTCCGGATGCTTTCGTCACCATGAC^ 

TTGCCTCCTCTCCCCGAATACTTCAGTGGCTCCCTGACTTGGACAGGCAT 
TTCTTrCTACAGCTTTATCTCCTGCCTCTCACATTTCATTTCA 

TTCTTG CCTGTTGCTTTGCTTACTC AGACTTTTCTG CTTTGACTGCCATTCCTCAAATTGTCGCTTGGCTCACTTCCCCTCATTCT 
TTAAC ACTCAATTT AGATGG CACCTTCTCCAGAAAG CCTTCCTT'GACTCCCTCTTTCTCCATGCTGGATT AGTGCCCCTCTT CCAT 

4 5 GCTCCCTGAGTACCTGCAAAGCCCTTGATCCACATAGTTACAGTGATTTGTT^ 

TTAGTTCTCAGCAAGACrAGTTCATGATGCCTAGCACAAAGTAGGTGCTCAAGAAA 

ATGAATTCATGAATGAGTGAACGCTTCACAAGGATGTATGTTGTTGTTGGATATTGAG 

CTCCCACAGCCTCCTCTACCTCTTTCTCCAGCCCTCTC^^ 

GAAAGTAAGAGAAGATCAAATCTAAGATAATGCCTTTACTCCATGTCATAATTCCAAGAGCGCT 

5 0 TTCTTAGTTTGCnTGTTCTCATGCTTCTCTCCATGTGTCTATG 

TTTTTTGTTATTTTTTGCCTTGATATACATAATCAGATACATGCTATGTATG 

ACAGCGATGTGTAACC^GGATATTTTGTTTCnTAAAAAAAAGTCAAGAACCTTCr 

AACAAAGGTGATTTTTTAAAAATGAAGACTAATA^ 

GGGCATGATGTAGATTTTTTAATTACCAGCCCAGGATCAACAGAGGATGAAGAGCAA 

5 5 AAAGAAGGTGAGAAGAAAGATGAAAAATACATGCACACGCAAACATTATGTTAACACT CACATT ACTTG CCTTATAA 

TTGTGTAATGACCATCTAATCAATTCTCAArTATTTGTGGCAGCTTA 

GGGTCCTAGAGTACATGTTATGAATATGTCCTGTATCITGTTTATTATACTTACCCrrATTGC 
ATTTGTGTTTGACATGCAGAGGCATTAAAGCACCCAATCAAAGTGAAAAGTA 
AGCTGCAAG CTTGGGTGAGGGGGCAGGACTGCAAAGACTGGGATAAAATCA 

6 0 AGGCACTTCGTGCTGAGATGGAATGAAGGAAACrAATGAGGGCAAGCTGAGCCCTG 

TT'GCAGAGATCAGACTT'GTGACAGGCCTATTGCCTGGTGGAGGAAACCAAAGTAAGCTAA 
CAAGCCCTGACTCCTGTCCAACTCTAGTGAGAGCCTGACTT^ 
CTGGCTGGGCTGGTGTTTCTCCTCACAGCTTCCTCCTTC 
CAGAAAAGGAAAACCTCAGATAAAGTGAAAATAAAAGCTAGAAACACAACCCT 

6 5 TTAGAACC!AGATCCAAACTGAGACGAGTTTGAATACrrGGGC^ 

CAGGGAAGCGATCCTXK3CTAGAGAGCCTCACACTCAGCC 

ATGGGAAGGATGGAGAGGTGGGGAAATGCCTCCTCCTAGTCAGG 

TCTCAATATTCAGTCAACCAGTGX3GT ATTGATGGGG AAjGCCAG AGC CCAGAGG CCCTGTAGATAJG CCGGGGCTGGAG ATCTA 

7 0 GAGGTGGCATCAGGGGTTGTGTGCATGTTTXIACAACTGTGGCCTGCTGC^ 

TCAAGG ATGCATTCT CAAACAC CTTGCAAAATTCAAGACAAAACCCAATAAAGAATGG 

TGTTTACTGTTAGGAATCCAGCTTTGTGGGCTTTTTT^ 

ACTTGTTCATATGTTTTATATCrrCC T TTTTCAAATTTACATAATTGAAA 

AAGTlTCCTTAAAAAGTGGAATTGTATTTTCCXSGAGCrGGTGATTCGAGGTTAACTACGGCAACA 
7 5 TCTTCCTAACATTAGTGCTrCCCCCTTCAAATG^ 
CAAAATAAAAAGTATCT1TTTGAATTGAGATAATGT 

AATTTACCAATGATGCTCAAAGTGTGGTCCCTGGTCCGGGAACACCAGCATTGCCTG 
CTCCTGCATCACTCTGGGGACAGAGCCCTGCAAACT^ 
CTAACCTAAAGAGTACATGGTAACAAATCACACATCCCAGT^ 
5 GATTTAGTGGTTGCTAGCACXnTTCTCTCTCCTGTCCCCACATCTGCCCT 

ATTGGGCACAT CTT AGAATCACTTGT CCACTTT CTGGATAGGCAACTTTAAGTGAAACATGTGTCATTAACTTTAT CCCACTTTCT 

CCAAGGCAGOTGGACTAACACTCTC^TGAGAAAGAGTCTCCCAAGTGCTACCT 

CTCAGCCTCCAGCCACAACGCTTTTGGCTGGT^ 

GCTATAAAGGAACACCTGAGGCTCGCTAATTTATAAAGAAAAGA^ 

CAGGGTGAGAGAGAAAGCAAGACAGAGTGGGGAGGGGCCAGGCTC^ 

CTCACTCATTACCATGAGGATGAGGACt^CACAAGTCATTCATGAGAGATCTTCCCCTATC 

ACCTCCAACATTGGGGATCAAATTTCTACGTGAGAT^ 

CCCAGAAACAATATGATTTTTTAAGTTTTATTATTT 
1 5 GGTGTGATTACAGCTTACTGAAGCCTCAACCTCCCAGGCTCCAGGGATCCTTCCACCTCA^ 

CACGTGCCATCATACCTGGCTAATTTTAAAATTTTATGTAGAA^ 

AGCGTGGCTTTTTX3TTTATGGTTGGTTTGTTTTTAAAG<3ACT 

AGATTTTCTTGCAGTTTTGCCATTAAGTGGCTITGCTTC^ 

AACCACTTTCTTAGTGTTTTCTCCTAGTGTTTGCAGCTGGGCCT^ 
2 0 GAAAAGATTTCATTAAGAATCCAGTGTTAAATCATAGAT 

GATTACTTTTATATTATGACCTTGTGGAGTCAACATTTGAAAAGCTCAGOTTGCCT 

CAAAAGTGCTTTCTGATTTTCAAATTGCCTCGTCAAAGAGTGGACA^ 

CATCCGTGTGGATTTAGTTCTGTGCCCCTACGAGaSTCGTGCCTTTGCCCTC 

GGACCCATGGGATGJVTGTTAGCTTTCTGAAAGACGTAGGCCGAGTCTA 

2 5 GTGACAGGGGAGGGGCCAGTACCCAGAAACCTCTAACTCA 

CAGTTATTCCCAGTGTGTGGTCCCCAACACCAGCAGCJVGCAACCTTACCCAGGAACA 

CCAAACCTATGCAATCAAATCAGCATCTCTGGGGATTTCAGCCCAGAAATTTGTTTGGAAAA 

CTCAAGTACX3AGAACCACTGAAGTACAGAAACCAGCAAG 

ATTGGGAAAAAATATCTGCACATCATACATCACATAAGGGCCATTATCCAGCATATACAAGGAAACC^ 
30 AAACAAAT AATC CTATTTAAAAATGGGCAAAGG ACTTGAAT AGAT ATTTCT(1AAAAG AAG ACATAT AAATGGCAAG CAGATATTGC 

CATCTTGCTTAACATCTCTAATCATCAGAGAAATGCAAATTAA 

ATCAAAAACATGAAAGACAGCAGGTGTTAGTGAGGCTGTGGAGAAAAGGGAACCCTCTTGTATTC 

CATTTTGGAAAATAGTATGGAAGTTCCTCAAAAAACTAAAAACAGGGCTACCATATGATCC 

CCAAAGGAAATGAACTCAGTATGTCAAAGAGAGGTCTGCACTCCCGT^ 

3 5 TAAAAACAACCTAAGCATCCACAGCAGATGAATGGATTTTTAAAATGTAGTATATGCACAG 

ACAGGACATTCTGTCATTTGAGACAACATGGTTGAACCTAGAGGACATTATGTTAAGTC 
ACTGTATGATCTCGTTTCTGTATATGTAGAATCTAAAACAGTCAAACT 
AGAGGCTGGGGGATGAGGACAGAAAATGGACAGGGAAAAGGGTCAACAAGTACAA^ 
TTCTATTGCACAGTATGGTGACTATAGTTAATAAAAGTGTATATTTCAAAGTAGCT^ 

4 0 AAAGAAATGGTAGATGTTTGAGGTGATGGACATGCTAATTAGTCTGATTTTATCATTCT 

CATATTATAACCCACAAATGCATAGAATTATTATATGTTA 
TTGAGTTCATCCCAGACCTGGTTGCAAGCTCCTGCrrCTGCTTCTTGCTGT^ 
ACTCAATTTTCCCGTCAGGAAAATGAGGACGATGACCTTAGCTACTTCCCTGAGTTGCAG 
AAGCCTCAGCACCAAGCCAGCACGTGCCCAGTACTGGATAAGTAGTAGCTTCCATATCCTGTCACT 

4 5 ATTTGAGTCCCATTCGCATGTGTTCCACCCTCCCAACTAAACACCTCACCCTGTTT 

TTTGAGTGAAATGGGTTCGTAGAGGGTTTTTTATACAGATGAATGACACAACACAAA 
CAGGTGTCTTTGCTTTGGACTGGAATTAATTCAGCACAGCATCTTAGCAC^ 

AACT AAATG CTT T TTT CATT C CT AAAAAAAGG GGAAG AAAAG AAAAAAT AT AC C AT AAATGC AACTTGTCTGAT AG ACAT AAAT CT 
TTGGGAACACATAGTTGATTGAGAAAGGGGAACGTTCCAGAAAGAAG 

5 0 GCTGAAGTGGTTGACATTTGCCCATTTTATTTTTAAGATGGA 

AATTGTTGCTGTTGGTTCTGCAAGGCACTGGCAGAGCACTAAGCTCT^ 
TTCTGATTGCAAAGTCCATCTCCTCAACCATCCCGGGTGAGCT 
TATCCCAGGCTTGCTCCCTACTCCCAC^TGAAGAGGACAAAATTTCrrAGA 
TTCGTTGCCCTCCTTTCCCAAGGTGTTCCTGTCCTCTGTCTTAGGCTTCATG 

5 5 CTGTGGGAGATGTCATCCAGGAGCAGTTGACTTAGACGATCAGTTTCATGGCATCT 

GTCTTAATCATGTCCTTCCTCCATCCTTTCATTCACTCAACCAAGACTCAGGGAG 
GGTAGGGACAAAGTCCAGGCTGGAACGGGTTCCCTGACTCCTCCATCCTCTTTTA 
ACCAAACTAGTATTTATCCAGCATTTACTCTGGACCAGGCATTATCCTGGTAGAACG 
AACATTTGCTATACATCTTCTACATGCGAGGCACTATT^ 

6 0 GAGCTTACATTCTAGTCTGTTCATGGGAGCACGGACTTCTC 

GTTTAGTGGAGTTACATATTTCACTAGAGGGATGTAGCCACCTAATACTGAAGACATTCTAGACG 
ATCTCCTATCCAGCAGTTTGJ^AGAGACCTCCGGCGCTATGCCACACATGAGCGGAAGATC 
ACAAGACCAAGCTACAGTTCTCTTCCTAAAAGGGGCTACrTTGATCCT 
GATTTTAGTATCCAGGTATG<rrTTCCTTrGTCATG 

6 5 CAGACTCGCAGTTTCCACC05GCCCAGAATGCAATTCGAGAGATTTTG 

TCCAGCAGTGAACATTAACTGGGTTTGGTATATCCTCGGTGTTGGCAATGAGTAACTCCAAAGTA 
ACAAATACTACTTGTGTACCCCTCAAATGCAGGGGGCAGTGGCATTTTGA^ 
GAAAGAGAGCAACAGAGCTGCCCTGAGACATAAATGAGCATCCAGCTAGTCTCATTGACT^ 
GTCTTACTGTGGTCATGATCCAATTCATCCCTCAGCAAAACGAT^ 

7 0 AATCCACAGTTCTAAGGGCTACTCTTCTTACAGATATCAAGGA 

AAGTAATGGTAGTGGCCAGGCGCAGTGGCTC^aSTCTGTAATCCCAGCACTT^ 
GGAGTTCAAGATCAGCCAACATGGTGAAACCCCATCTCTA^ 
GCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCCGGGAG^ 
CCTGGGCAACAGAGTGAGACrei^T^TCAGGA 
7 5 ATGAGATGTATTCAGTTAGAGTTGTCTCAGTTACAAATGTTAGAAATCAAATCT 
TAT ATACTGCAATGTCGAAGCGTTCAGGGGCAG CCGAATATGGGTGCTCAGG CAGG CTTCCCTCAGT ATTG CCTTTATTCTCAGGC 
AGGTlTCCCCATAAGTCCCACTCTrCCAGCTACTTCAACAGAGAAGAGCCTC 
GCTCATTGGCCAGGCCTGGGTCATGTGCCTATCCCTGAAC^ 
CTTGGAATTGGGAGCAGCTGCACCTAACCCTCCTGGACTGAGAG 
5 ACAGAAGAGGGGGCTGTGGATGCTGGCAGGAAAGCCTCGGATGCTAAGCATGTC 
AACACTTACAAATTAATACATTCTAAATAGTTCAATGTAAGCAAG 
ATGGAGGAGAAGAGCTGCTTGGTCTTAAAGAGATCACAGA^ 

ACCCAACACAT ACAAGTATCCTATAATT ACCAG CAGT ACAAAATT AT AACTG CATG TTAG AATT ACC CAGTG CGTT CAGAT AGATT 

GATTATAAGATCTCTCCrrGAGCTTCTGATTTCACTGGACTGGGGTATGGCT 
1 0 TTGTTCTAATAAACAGCTGTGGTTCAGAACTACAGCCCGTAGGAAAAATATCCAAATG^ 

TGAITTTACTTTAATTCATTTAGTACCATGTTTTGG^ 

GGCTGTAATTCATTCTGACAAGGCAATTTTATAAGTAACA 

CACATATTTCTTGTTATGTTTTCAGAGGATATTCCTCAAAGCAG 

TTGAACCGTGGGAAAGAAATGCCTAGGAAAGTGTTGTCAACATTTTTTTCCT 
1 5 TAGTAAAAGATGCCATAGGAGAGTTTGGGGTTATTTAGGTCrrTATTGAAATGCAGA 

ATGGG AAGGTG CCACTGGT AATGGGT AAGGTT CTAATAGG CCATGTTTATGACTGTTG CAT AG AATT CACAATG CATTTTT CAAGA 

GAAATGATGTTGTCACTGGTGGCTCATTCAGGGAAGCTCAT^ 

TAATAAAAACAGGAGGAAACAAGATGATGGAAT CTGTTTT rC"l"l CACTCAAAT ATT AT AAACT ATTTTAAAACACAAGTGT AACCA 
AGTGGAAGAATAAAGAGCTTACAGTTTGTCTTAAGGGGA 
2 0 TCCGTTCTGGAG<rnGTGGAGGAGGGGTGCTGCCTCTGGACrGGCT 

GCTGGTTCCCTAAGCATCCCAGCACAGGACCACATAAAGGATGTATTCTCCCTGCCATC^ 
CCATCAAAGAAATATCTCCCTCTTTTTTCCATCTTGGAGCTTCT 
TCTTGGCTTCTTATCTGAGGGGACCCA^ 
TCTTTCCTCCTGCCTCCTCCAGGAGGGATAGACCCTTTC 

2 5 T CAGC CTG CT AAAG AG CTGGG AGGC CCATGACG AC CGTTTTCCCTTC CTT C CAGTC CCTCTGGGAAG AGG AT ATTGGGAG AT ACAA 

TACCTTTTGGCTATTGTGACTAATGCrrGCTGTGAACATTGGCCT 
GGACCTAGTAATAAAATTGCTAGATTCAATGGCTATTCTATACTATGTTTAACT 
AACTGC^CCGTATTACATTCTTAACCAACAGTGCGCAAGCGTTCCAATTTCT 
CTGTTTTATAATAGCTGTCTTAATGGGTGTGAAGTGGCGGGT^ 

3 0 7UVCT AACACACTGAAAC^^CCT CAG ATG GTGTG TGTGC CTGT CT AAAAAAAG CTGGGGACAACAAGAG 

GGTGGACAGCAGCCACCCCCACTGGCCTCAAATCCCAAAATCCACAAAGTAT^ 
ATGACATTCAAGATTATTCTAGGTAGTAAGTAGATGAATATTTTCATACTG 
GTCAGTCATTTTTCTAGGCACAGGGCATAGAAAAGATAAOUAACn'GTCAA 
ATGTTCTAGAAAGGTGTTTATTTGTGTTCGAAAGGTGTTAAGAAA 

3 5 GGAATAGGTTTATTTATATAAGAAAAGCGACTCAAGGTCAGGGAAATCATTTAGGAAATATTA 

GAATGAGGAG AGAGGG ACG CAGGTGGCTTG ACATTTGGGAAATGTGCAGGTGG CCTTGTCGCCTTCACTGAG CAACTCCCCAGCCCT 
GTTCTCTGGAGGTCCCCACCACACAATAGCCCCAGX5AGGTCCATGCCAGAGGAGTTT 
TGCTACTTTCCTTATTTGAAAAGGCCTGTTGGAGATTCCACA 
TATCAGTTCTGGACATG1 T 1 X 1T1 TTTI T I ^ 

4 0 TACCAGAACTCAGCCTTCGCCTGTGCCTCCTACTTTGGCA 

ACAGTATGATATAGCAGATCAAGTAGGAAGCAGATTATCTGTTAAAAAGATTGGAACT 
CTCrATTTGTTCACTTTTTTCCCAGCCTTTT 

CTT T TCTTCTGCAATGTCTAGTTTCATTAAACCTCTCCAGTATTTTATCTCACACAT 

AGTTTTT AAAAACTAT CTT C CATG CTT CTATTT AACATATT CAATTTTTC CT CT AG CT T ATT AAAT ACATGGAATAT AGTGT AACT 

4 5 GTTTCAATGTCCTTGTCTACTAATGATATCATCTGTGTCAAT^ 

CACATTTTCCTCCCTCTTTGCAAGCTTGGTAATTTTATATT^ 
TTGTATTCCTGTTAATATTCTTAAACTTTGTTCCAGGATG 
TGAGCTTTGTTAGGCAGGACTCATCTTACTTrTGAGCm 
CACTACTGAGGCAATACTTTTCrrCAGTGTTCreCC^ 

5 0 ACGGTTTGTGGCCTCCCACTGAGAGGCCTGGCCCATTAGAGCTTAGGGT^ 

AGCCTTGGGTAGTTTTCTCACATGCATATGCTGATCAATGGAATAATCAA 
TACTCrGCCCTGCAAATTCTATTCACCTTGGCCT^ 

GCAAGGAC CATGGTGTCAG AGAAAAGG CCCACGTGGCAGGAGGG CCATGG CATAAAATG AG CAG AGAG AAAGCTCTTGCAG CTGGA 
GGTTCCCACGTGTCTGAGrTGCAGAGAGAAAGCCTATGTTTGTGGACAGACCATGGCATC^ 

5 5 GTTGGGGAATAGAAACTAGAAGAGGTGGGACAC^GACrAGGCT 

TTTCATAAGTGGGTAATCCCTGGCCAGGTGAGTCAACAGTAATCACTGTTTGACT 
GAATTTTGTCAACATGGTTTGTTCTCGCTCACAATCAAATCTTTCAAATCATTTTAGCCTC 
TTCTCTTTCATTATAAATCAGATTTTAAGCCCCAGGTTACACTCACCCTC 
TACAGCCTATTGGACAAACTCGGCCAACTTCCAAAATGTTCCCCAGTCTCACCrrGTC 

6 0 ATTGGAGTTCAGGTAGCCTCTCTATAGGATGGAGGAGAGAGGTTTTTTTCT 

CCCTGACATAACCAGACATTTCAGCCATTTGGAATTCTCAG 

GTAAAGGAAAAAGAAGAAGAGTATAGATTCTTATCTTATTTGAGC^ 

CACCTGTATAAAAGCTTTCCTGATATTATTTCCTTGGATGGAGATTCACATTCAAT^ 

GTTT AGCATTTTCATT ATG CC C CCAA CATTGTCTT AGG AC CTGG CAGATTG TCTG ACATTGG AAGGAAG C CAGTGCACATTTGTT A 

6 5 AAT CAATT CACGCATT AAGCAATAATT CT CATC AAG ATGGTGGGAGGG CAAAGGTT AT ATGACTTCACAATCTTTCAAAAATT AGT 

TATC^CTTTTrGGTCTTCAATTAACATCGCAATGTTC 

AT AAAG AAGAAGTG AG TAT CTG TCT CCACCAGCAAAGT CAGG AACTC^GAAGAATCAT AGAGGGT AT AG AAGG CTG TG CTG ATGGG 

AGTGAGGAACAGGCAGGCAGAAGGAGTGCTTTGCAGGGTT^ 

ATAGATAAACCAATTCAAACACCAGATGATACAGGAGGAAATG^ 

7 0 ACCAGCAGCCCTGGGTTAGGGAAGCAGAGCTTCTGACriTTGGGTCT 

ACTGAATGTCTTCTAGCCrGGCAGAGAAAGTGCCAATATC 

TCTCTAATATAGGCAAAGAAAAGCCACAAACATTTTGTAGGTTGATAAGAATGGC 
TTCTTGTGTATCATAGCAGAAGTAAATGACAGAAATGCTGAACCTTTCTAGTTTAAT 
GGATTATAAGTAGAATTAAAATAAGTGATCCAAAACACAAGTCATCCCT 
75 C^TCCAGCCTTAeAACCACCCCATGGGCTAA 
CTGGATGATCAACAATCGGCCCAAGGGCTGACACCT 
CTTCATGACATGTAGGAGGAAGTACAAGCACCAGGATGACTGGGAA 

TT AG AT CCAGAGG ACTTGGCACAAAACTX5 AGTTT CAAAAGGGG CTTTGAAGAG CAGGT ATTGACATGTGGCAGAGATTGGGTTCAT 
AGAACATGTTCTTGGTTATTCAGGGAAGAGGGGCATGTTTCTGA^ 
5 ATCTGTGTGGCTCATGGAGCTGCTGGATGAACITAAGATGCCCTGGAC 

CAATGGCTATACATGGGGGAACCAGGCTGCCTGGAAGCCTCCTCATGGGAGT13 

CCCCTAGATATCCAAGGACTTGGTTCTTTTCTCCAAG 

TCTTTTGGAAAAACTCTTGCCTTCCCCGCGGCCCCCCATCTTTACTTTC^ 
1 0 AATACTCTCTGATTACAGACACAAGTAGGAGGAATGAATGATGTGTCT 

AGAAGGAAGTCTGCCCAAAGACACAGAGGAAATGAAACTCATATTGAAGGCAAAAGAAGAGTACAGA 

GGGGATGGATTGGAAAATGGGAAGAGGAGTTACTGGGATAAGAGAAAGCTG 

AAGGCAACCTGGTTATAGTTTGCCCAATAGGCTCAATGGGCCAACACCACT^ 

GAGACAGTCCTCAGCAAGCTGAAGCGGCAATGCTATGCGTGG 
1 5 AAAAGAAGTTGTAACTTTAACAACTATCCCTAAAGCACAAAAGTTAAAGT^ 

TTACTTTAAAAATTAAGCAAAATGTTTCATTGTAAAGACAAAGTGTT 

CTTTAAGGTTAAATGTAACACCACCAGTCTCTGTATTTGGAATTC» 

CACGATCACACCTAGATTGGAAGATTCACATACTACCAGCAACACCT 
GAGAAAGTTTGTCCTAAATGTTCCCTTGTGTGGCCCTGTGACTATGG^ 
2 0 AATATGGGAAGAGGTAGGAAGAAGGTGGTAGAGAGAGAGAAC 

TCTTATTTAATCITTTCAATTATGACATGAGGCAAATATT^ 

AACTTCTCCAAAGTCGTACAGCTGCTCAGTGGTAGK5TGTGACTTAAGAGTC 

CAGGTGCTTCCCTGTGGTCAGGGAATGTGTGGCTCCACATGGT^CCTACCCTGA 

CACATCCTGCATAAGTTTCATTTGTATCCCAACTTGCACCAGGCTGACCT 

2 5 TTTTGAGAAAAAGGATATGAAAAAAAAAGATGAGAAGGCAAGACACTCAGAAAC 

AAATTCCCACTGGGAGCAGGAATAGGATCTAAGGACCCCATCTGGTGAG 

GAATTAACCATGTTGGGGATGTCCCCTAAAGAGACAAATCAGCCCCAAAGTGTTCC^ 

GAAGCAGCTTGGCATCTTTCCTCrCCAAGTACAAAACCCTGGGTTT^ 

3 0 TGCACAGGTTCCTCGCTTCCTGCTACAGTCTTCACATATATGTGGGCCT 

TCTCCTAGGATTCTTAAGGAAACAAATAGGTTTTTTAAAAACT 
CATTGACCCTCCCACCTGAAATAACCAAAATCAGACACAACATTTGAAG 

GACAATGGTCCTGGAGAGACAGAAAACAAATGAGGAGAGACCCACAATTACCTGATCTTACTGCT 
TCAACTATCTTTTGTTTATTGTTAACAAAGTTTCTCATTTTTTGTCTT^ 

3 5 AGTTTTGTAGGCATCATATGATTGGACCTTGTTTTTATAGCCAATC 

CATTT AATGTG ATCATTG AT ATGGTTGGGTTTAAATCT ACCATCTTG CT ATTTGTTTTCTGTTTGTTCT AC CTGTTTGTTGTGTCT 
TTTTAACTCTTTTTCTACTTTATTTCATATAGATTTTAGAAGAACCT 
GAAATTACATCTGTTATATAGATCAACATTGTTTGACATCGTTATGTTATTAACACT^ 
CATTATTTAGGTATTATTTAATGGGTTTTTTAACCAAAGTTTTATTATTTTCC^ 

4 0 CCATAGCATTCTACATATTTTAATACCAATTACCATCAC^ 

AAGT AT AAATT ACTTTT AT ATATTG ATTTTTGT AT CCAGTAAACTTG T^ CTTGTT AATTCAAG T AATTTTT CTG T AGAT 
TTAGCTT T TCTTrTCrTTCTTT C TTTCTTCCTTT 

CTGGAGTGTGATCGTGTGATCTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGTGATTCTC 
AGCTGGGATTACAGGCACCTGCCACCACACCTGGCTAACTT^ 

4 5 TCTAGAACTCCTGACCTCAGGTGATCCACCTGCCTTGGC 

ATTTAGCTTTTCTATGTCGACAATCATACCATCTGCAAATAATGA 

I'TTTTTTTGAGATGGAGTCTTGCTCTTGTCACCCGGCTGGAGTGCAGTGGTGCGATCTCA 

TTCAAGCGATTCTCTTCCCTCAGCCTCCCGAGTAGCTGGGATTACAGACGACCGC 

GTAGACACAGGGTTTCACCATGTTGGCCAGGCTGGTCTCAAACTCCTGACCT 

5 0 TGGGATTACAGGCGTGAGCCACTGCACCGGACCTATACTTTCT AATTTTT AAAAATCCTACTTCATTG^ 

ATATTGAATAGAAGTGGTGTTAGTGAAGATCCTTGTTTTATTTCTGATCT 

ATGTTAGCTGTAATTTTTTGAAAAAGACATCTTTCAAGTCAAGGAAGTTCC 

TGTTAAGTTTTTTTGAACTTTCTTCCCCTATTGAGATGGTCACATGAGT^ 

AATTG ATTGTCT AAT ATT AAAGGGAGCTTG AATTT CTGTT CAG CCAAT AAAAATG CACTGAGAACCT C CTTTGTG CTTG AG CTT CC 

5 5 TCTTGGTCATCTCTTTTCTACATGTGCTCATACTGTGCrrCITTTGGTGA 

HUMAN SEQUENCE - mRNA 

ATGGATCCACCACGAGCGTCCCACTTGAGCCCTCC^JAAGAAGAGACCCCGG 
CATCAAATTTCAAGATTTGGTCGTCTTCATTTTGGAGAAGAAAATGGGAACCACC 

6 0 GGAAAGGGTTCAGGGTTGAAAATGAGCTCAGTGATTCTGTCACCCACATTGTA 

TGGCTTCAAGCACAGAAAGTACAAGTCAGCTCACAACCAGAGCTCCT 

AC CGGTGG AAATGACAGGAAAAC AC C AG CTTGTTGTG AG AAG AG ACT ATT C AGAT AGCACCAACCCAGGCCCCCCG AAG ACT CCAC 

CAATTGCTGTACAAAAGATCTCCCAGTATGCGTGTCAGAGAAGAACCACTTTAAACAACTGTAACC^ 

C^TATACTGGCTGAAAACTGTGAGTTTAGAGAAAATGAAGACTCCTGTGTG 

6 5 GCCATTCACAATCATCAGTATGAAGGACACAGAAGGAATTCCCTGCCTGGG 

AAGATGGAGAAAGTTCTGAAGTTAAAGCTGTGTTAAATGATGAACGATATCAATCCT^ 
GGGCTGAAGACTTCTGAGAAGTGGTTCAGGATGGGTTTCAGAACT 

AAGAGGCTGTCTGGGCATTTCTTCCGGATGCTTTCGTCACC^^ 

7 0 GATTTTTTAATTACCAGCCCAGGATCAACAGAGGATGAAGAGCAA 

ACTTTTATATTATGACCTTGTGGAGTCAACATTTGAAAAGCTCAGGTTGCCTAGCAC^ 

AG TG CTTTCTG ATTTT CAAATTG CCTCGTCAAAGAG TGGACAGTGACCAG TCC AG CTC^ GAGG AAGGAAAGACCTGGAAGG CCAT C 
CGTGTGGATTTAGTTCTCTGCCCCTACGAGCGTCGTGCCTTTGCCCrc 
CCTCCXXSCGCTATGCCRCAC^TGAGCGGAAGATGATTCTGGATAACCA 
7 S CAGAAAGTU AAGAAGAAATTTT7G CG CATCTGG G ATTG GATT^T ATTG AACCGTC^ GAAAGAA_ATGCCT AG 
HUMAN SEQUENCE - CODING 
ATGGATCCACGACGAGCGTCCCACTTGAGCCCTCGG 
C^TCAAATTTCAAGATTTGGTCGTCTTCATT^ 
5 GGAAAGGGTTCAGGGTTG AAAATGAG CTCAG TGATTCTG TCACCCACATTGT AG CAGAG AACAACTCGGGTTCGGATG TT CTGGAG 
TGGCITCAAGCACAGAAAGTACAAGTCAGCrCACAACCAGAG 
ACCGGTGGAAATGACAGGAAAAC^CCAGCTTGT^ 
CAATTGCTGTAC^AAAGATCTCCCAGTATGCGTGTCAGAG 

GATATACTGGCTGAAAACrTGTGAGTTTAGAGAAAATGAAGACTCCTGTGTGACATT^ 
1 0 GCCATTCACAATCATC^TATGAAGGACACAGAAGGAATTCCCT^ 

AAGATGGAGAAAGTTCTGAAGTTAAAGCTGTGTTAAATGATGAACGATATCAATCCT^ 

GGGCTGAAGACTTCTGAGAAGTGGTTCAGGATGGGTTTCAGAA 

AATGCAGAAAGCAGGATTTCTGTATTATGAAGACCTTGTCAGCTGTC 

AAGAGGCTGTCTGGGCATTTCTTCCGGATGCTTTCGT 
1 5 GATTTTTTAATTACCAGCCCAGGATCAACAGAGGATGAAGAGCAAC^ 

ACTTTTATATTATGACCTTGTGGAGTCAACATTTGAAAAGCTCAGGTTG 

AGTGCTTTCTGATTTTCAAATTGCCTCGTCAAAGAGTGGACAGTGACCA^ 

CGTGTGGATTTAGTTCTGTGCCCCTACGAGCGTCGTGCCTTTGCCCTGTTGGGATGGACTGGCTCCCCT 
CCTCCGGCGCTATG CCACACATGAG CGGAAG ATGATTCTGGATAACCATGCTTTATATGACAAGACCAAGAGG ATATTCCT 
2 0 CAGAAAGTGAAGAAGAAATTTTTGCGCIATCTGGGATTGGATTA 
Table 37 

MOUSE NOMENCLATURE 
ICSGNM Znfn1a1 
Celera mCG3994 

HUMAN NOMENCLATURE 
HGNC ZNFN1A1 
Celera hCG1736622 

MOUSE SEQUENCE - GENOMIC 

TTTAGATAATTTTCTGTTGTATCACATTTTCCACTGCT 

GGGGGGAAAAATGTTTAATTrATCATCCTTTAAGAGACATGGAAATGAGT 

AAAGTGCCCCAGTGTTCTTAATGGAAACTAGAGGAAGCCATCG^ 

GGCCTTTTCTCCCTTGTTTGTGCTGAGTTTTG^TAGAGTA^ 
1 5 GTCrCTCGCTTCATCAAATGAAAGCC^GCGGCTACGTATCrCAGGTCCCGGAAG 

TTGGAAAGTTCAAGTTCTTCTCCAACCGAGAGTGCGCC 

TTTAAACGATTTTCACITGAATTAAGAACATCCAG^ 

ACTCATTGTTGCTGTCTGTTGGGAGACTGCAGAACAGGCATTAAAACCG^ 

GCCCTATTATAAATAGACACGCCGATTTTTGTAACTGTACAAATGAGGAAGTTGT^ 
2 0 GTCTCCTGGGTTTGCATGAGGACCrTGTAGGCTGAAGGTGAAGGATTATAAAGCCGTC 

TTACACATTAGCCTCAATCTTGTGCATGGTGTGTTGTATTGGCATGACT 

CATACCATAGAATAAAACTTTTTAGAAATGTAGACATTCCTT^ 

TGTGATTGTTGAGTTTATTGTCZTGTAGTTGTTATCAT^ 

TAGAAGTTGGGTGAGCAAGGGTAACTTTCCAAAGTACTAAAGCCATTAAAA 

2 5 TCTAAGTTATAGTACTGTAAACATAGTTTTGAAAAAATAAGAGAACATATAACAAATT^ 

T ATTT CATTTTAAGTG ACT ATGGTTGATTTTTATAGGTGC CGT CAAT AACAC CAACAT CAGTTTCAGT ACTTTCTGGAAAATGGCC 
ACAGAGTCCIATGTGGCCTACCCTrAAGTGACAGCGAGTGACAATGAGCTGGCAATCAGTAAAT 
TAGATGCTCCTTAATAAGGCTGGGGTGTGCCAAAGTGGCCTTTT^ 
CGGTGATGGTGCTGGCCTGGAGCGTGAGCAGCATCCCATCT 

3 0 GGAAGTTCTTCTCACTATCACIAGTGAACAGTCAAGTCAGAGC^ 

AACAATAGGATACCTCTATGAAGCTCTGTGTGGCTTAACrTCTT^ 
AAACCCTTTTAGCTAAAGTAGAATAGCGTTGTAGAAACACCCGC^^ 

T CCCT CCACCACAT ACAGCTTT AAGG1TG T ATTTTCAAGT ATC CACTTG AC C 
GAAAGTCACAATGAGATATTGTNNNNNNNNNNNNNNNNN^ 

3 5 AATTCTACCGTCTCTTCTAACAATTACAAACTATCAG 

ATTTTAAAGATAGAAATCTCATTTGTGCTGCATCTCCCTTCCATCACACTC 
GGGCAGAAGACAATGCTTCTCAACTTCGGTTTTCCGCCTTAAAT 
AGAAGTGGCCAGTGTTGATGGTCCAAATCAGAAAGCATGCTAATT 
AGTATGGAGGTTAGAGAGGAAACACAGCCTTGTTGTGTGATCATGGCTTACCTGGG 

4 0 CATGTCAGCACACATGTGAGCCTCTTGCCTCCAATTCCATTGTCTrT 

AGCCTAGGGATTCACTCTGATGCCCAGCCTAGCCTGAAACTCATAAACTCATATGTAGCCTAG^ 
GGTGAGAGCTACAGTGCCTGGTTCCACCATTITCTATTAC^TGTCACA^ 
TCTTTATTTTATTCTCTCTCTCTTTCTTrCTCCCCGCCCCGTGTGTGTG 
AGGCCAGAGGTCAAAATGGAGAATCTCCCITATAGTTCCAGGCC^^ 

4 5 TCACTGCTCTTTTGTGCCCACCTCCCCAGCACTGGAACTACA^ 

AGGT C CTCACATTTG CCTGG CAAG CG CTTT ACTGACTGAGGT ATTCCTGTAG C CC CTTGGCTG CATTTTT AACAG ACTT C CT CATTT 
CCCTTTGACCCTCCTTCATATCAGTTT CTGCCCACAGAGGGGAAAATGTGAAGCCAGGG CTGCACATCTG AGTGATCT 
TGGCITAGCTGACTGGGGGAGGAGGGGCGTGGGGAACACTAAATTGGATTCCTAT 

GTTGT ACCAG CAAAGCT CTGG GAG AT AAT CATG AAAATGT GT AG ATTGG CAG CT AATTTTTG AAAAAATG AAAAG AATC AG AG ATG 

5 0 AAATAGGAGCGCTCAGAAGTTTTTATGTTCTCCCAACCTGTTT^ 

AAAGACGTCGCATATTGCTGCTXnTTCTCGTC^SAGCATCCCTTTTGAAAACT 
AAAAAAAAAATGGAATCTTCAGTGTATAGATGAAAAGACATGTCTT^ 

GGCATGTTTAATAGGTGATAGTCCCGAGCTCAACCCTGTCTTCCACATGCACATTTACTTCCT 
ATTAAGTGGCTAGTCCCAAGTCTCCCTGGGACAGCTTCACC!AGC1AAGA 

5 5 TTTAGTTGCTGTACAAGTTCTGTTATGAGAAGTTTGTa^ 

AGGGGCACCTTCCTCTGTCTGACATTTGGATCTTGTCTCCACT 
CACTGTCATCTGATGACTGGAAGTTCAGAGCTCAGCTCTGCTATTTGT 

ATGCTC^ACTCTGAGGCAGGCTGCCTGAGAAAGTGCTGGGCAAGGCAGGTTC 

6 0 TCTCAGATGCCTCGCATGTAATGGGCTCCCTCAGAGGTGCCTGT 

TATAAACACAAAAAGTTAATGTACACTGTTTTTAAATCTCAGGACATTTT^ 
AGACTAATGTCTACTGACCTCCACTCCrGCCTACAGCAAGCTTTAT^ 
TGGGTAACTTGAAAAGAAAGATGTTTGTGGTTTCTGCCTGGCCTCGA^ 
TCAAAATTCTGGGCCAGGCCCTCGGTTATGACATAAAGCACCACAGGAA 

6 5 CCTTGAGGACTTGTTGAGGTTCTGGGCTCTACTAAGTCCTCCTCrc 

TGGCCTGGTGGCTTACCTGGAGACAGCAACTGCCa^GCTG.TTGGATGCITCT 
AATGGCCTCTTCGTGGGAAGAGGAAAGGCTCCTGGCCTCCCAGGAAGACT 
CAGTCCCCAGAGCAGTGAGGTCCTTCCTGTGCAGTCTTGAAACCTGGCTCCCAGCCT 
TGATGTGTTGAATCCTTCTGTCTGAATCAAGGGAATGAAACCTGTTTGTCCCTGT^ 

7 0 GGAAGAGGGTC^GGAGGGGAGAGAGGGAGACACAGAAATAAAGAGATGAGA^ 

AAAGAGATCCCGAGGGATGGTGGGGTTATAAGGAAGGCCTCACTGCTCACTGTCC^ 
GCTGGCCTGCACTTATCTCTGCATCTATTTCCCCTTGTGCAAGATTCAGA^ 
TTTAGAAGAACTG CCCAGGTGAGAGCTAGTTTACTGACATTGTGGCATC 

T CTTCTCATG CAAAGTGG ACCAC CACAG CATGTTTCTGTGACCGGCTT ATT C CAT CCAAT AC CCCTGCCCAG ACCTGAAGATGGTA 
7 5 ATAATTTGC"I^"GGTGG It AAl _ rCCCT AT AGTt^AATGAAATCTC C T T u i 1 i i ATTTAAGGCTGAGACTGTAACATAACTCAG CTCA 
AACTATTTGTCTTTTTAAGTTTTTAAAATGTATTCGATTTCTCTGACTCT 
AGCACACATGGGGGAGGCAGTCTATGCAAGTGTATCTATGTATGAGTGTC 
TGAGAATCAGTTCTCTTCTTCTACCACGTGGCTCGTAGGGAfCAAACTCAG^ 

AGACAT CT CT CTGGC CCTTT AAGTCCT ACCCTTCCC CCT CAG ATT C CCTGT ATTGCTTT CACATGG C AG CAG ATGT ACTTTGT AGG 
5 TGTTTTGTAAGC^TGCCTGGTGGTCAGCAGAGTACCCAGGATGGCCTAGACT 
TTTGCCTTAAAAGAGGCCAAATGCATTTATCCCCCACTCT 
CTTTGGTTTTGGGACATTTGAGGGGCTTAAGAAGTCTCrc 
AAAATTATCAAGTACCTGAAACTGTTGTTCTGAAACAAGGGACCCT 

TTGAAAGTGTG CATG CAAATCT AAGGG AGGT AGACTGTGTCAC AAGGG TC CAT ATG CCCTGG CTGG CAGGGAAGGGAGTCTGTCG C 
1 0 TAAGTGACCCACAGAGCAGCCATGTACAGAGGTGAGCTACTGGGATCCTGAC^ 

TGCTCCAAGCACTGTTGGTGACCTCACTTGTCTGTCCCCCTGAGGCATGACAGCTGGCTC 
GAAGGGCAGGCACTCATACTCATGGGCCTGATAAAACATTTGTTTCT 

T CACTG CT ATCATCT AGTGAAC CAAG CCTTCAATGCAT AGCCATT AAAAT ACAAT AATGG CACTG TAT AG T AGTT AAATT AAACAT 

TTTAGCAGTACNNNNNNNNNNNNNNNKNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNM^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNOTJNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNCGftTCCACGTATACCATAAAATATATTAATGATAAAAAAGATAT^ 
TCACAAAATTACTTATATAGCTATATGGCATGTGTATGTATACATATATAAAATAAATACTITCATATAT^ 
25 GTAGATATACCATGGCTTACATGTGTTCCGTGTTATATAAAACTATGTATATAG^ 
GGAATCCCCCATCAATTTCCAGGTTTCITTTAAACGTAGGCCrCAACCCAT 
GAAAAAAAAACAAACAAAGGAACCTGACTTAGATTCAGCAGATAACCT^ 
TGAG CACTTTCAATG AAG CCACCIAG AG T ATTCT ACT 

CATCTAAGGGTACCCTAGTGCCCCGAGGTGTGGCCCAGCTAAGTCCACTGCATC 
3 0 CCTG AT ATTTGTG GT AG T CT G TG T ACAGTTT ATGG AGCACTCAC CTGGCCCACT AT AAGG CAG C CTTTTC CT ATGGGTGTTTTCAA 

TTCAAACCAAGATGAGGAGCCAGCCACAGGCCTGCAAAGCAGGAGGTTTG^ 

GAGAGAACG CTGGTGG AGTCTG CAG CTAGAGCTGAAG AGTGGGGGACAAGGACAG AG CGTGTATTG CCCATCTGCCAGGCTGGAGG 
TGTATTCAITATTGATGGAGGTAGTGCGTTGCTGTTCAGATATGCAGCCTCCCTGGGTAAATGAGAC 
CGTTTTTTGATTGCTGATTGTACGCGTGTCAC CAAG CTGACTCAAGGTTCATCG ATG CATGCTC^ 

3 5 TATGGAT(^GCCAAGAGAATGAATTCTGTGCCTACAATGACCCAGGGC 

CATTTTGGG1TATTATGCAGTAGGAAATTAACAATAAATAACAAATTTGGTCCTCCTGTG 
TGTAATGCTGTTTTTTAAAGGATCAAGGTCTGTGCTAGX3CT 

AGATAGCCITGTTGTTAAATAGCATAAGCGTCTCTCTCTCTCTCTCTCTCTCTCTCT 
NNNNNNNNNNNNNNNNNNNNNNNNTCCCTCC^ 

4 0 GAAGACAATGGATGTCGATGAGGGTCAAGACATGTCCCAAGTT^ 

CT AG CTGTCCTTGGGG AAGCTAGCAAGGCTGCCATAAAAGAA71AGGCTCAGAG CGACGTGTGTCTGTGTTCAGAG AACTTGAACAT 
AAAAACCTCCCAAGTGAGATTAGGTAGTGTGAACTCTAACAGCAATTCA 
TGTAATAGATTTGTCTACCAAAGATACAGAACACATX3CCIAAAG 
CAGCAGCAATGAAAAGAAAGCCTAGCCAACGACGGAGCCCAGTTCAGAGTCA 

4 5 TC AT CTT AC CAG ACTG AGAAAG CG AAACACTCTGG CTAATAG AAAGGTGG AG CTGGGTTCCAGGCTGTCCAGTGGGCAGCACTGAG 

GGTGGCCCATCCAGTAGATCCTGCCCTGCTCCTGCACGCATCAGTGTATGATGTCCGGTT 

TCCTCTCTATGGCACTGGCCAGCCTGCCATCTGTCCCTTGGGCAAGCreCCTCCT 
AAGTTTGCATTTTGGTCCAGGTGCCCCTGACAGGACCC 

5 0 GG AG ATGGG CTTG TGG CAAGTC CTGTG CTCTGT (^G CTCTTTCTTTCC ATTG T AAGGCTTC AG ACTT AAACTTC C C ACAATG CAAA 

GAATAATGAATGGCTACTTTTAAAAGGTGTTCGTCTTATGCTCTCCTA^ 

AAACCTTGTTCTTGTTATACAATCTACCCACTTCAGTTTAAGGA^ 

AAGCACATGCTCACTGTGGTGTTGTCATTAGTACTTAAAAAACTATC 

GGAGG ATATG AAT ATG CAGGT AT GG AAAG AAATCTG AAGTGT AT ATCAAAATGGG AAAG AT ACAAAGT ATG ATT C CATG TGG AAAT 
55 AT AAATTG CACACACG TG TGC ACACATT CTG CATG CTT C CATG AAC AGTT ACAGTG ATT CT AG AG TGT AG ATTG ACCTGGACTTCC 
CCCAAAGCTGAAGGTTAGAAGAAATCCTGG CGTGCTGAGGGGGG AGGGGAAG ACATTGTTTTT T T CTTGTACTCTGTATAGTTATT 
TAACTTCGTTTTCTTTTGTGTTTCTTAAGCAAATCTGTTA 
TCGCCCAAAGCCCAGGAAAATGGTGCTA7UVCATAATTAAAATAGGGTX3ATGAT 
CTCACCCCTTACAGCAGGATTGTTTGGAATATCACAGAAACAGGGATGTGGA 

6 0 GTCTTTGCTGCTTTGACCTCAGATGAGTTCTGAGTTTTT 

CCTACTGTCTGACGTCCTAATACCAAAACATTAGGCTGGAGATGAGACCAACCC^GATGACTATCAGTGAAAA 
TGGTATGTGTCTATAACAGAATATTATTTAGTTATGAATGTTGGAATTGATCATTAAC^ACAACATGGATAGA 
GAGTTAAGTGAAAGAGCCCAGGTTCTAAAAACCAAAGGCAGGCACACACTCTGCT 
CTGGGTTGGTGGATTTGCCACCCACACATATTCAAGGGTTGCAC7VCTATATGTATGACCGA 

6 5 ATAAACATGAGCAATTATTGGACGTCAATAACAAAATTTAAAACA^ 

AATGTTAACTATGATGGGTAGTCAGTATTTTGATTTTTCCATTTGAT 
TTCTTCCCCTTAACATTATGCTGGATGGTGTCTATTGTAG 
CCTGAAAAAACCCTTCACTTTCATTTTTTGTCTGAAAATATTTGCTCT 
CCCTCACCATGATACAGCCGCTGCATTCTGTTCAGCCITGAACAT^ 

7 0 GGAAAGAATTACATATTGCTTTTGCACTCCTTTAAATTTG 

CTX5GGAAGAAAGTAGGAAGGGTTCATGTAGGAAGGTTGACTT 
CTCTTGCATCCCTGGGGATGGAACTCAGGACTTTATTCATGCAAGACAGGCCACT 
CTTCATAAGAAATTGAGGGATATGTTACTGACTCATTTCTITT^ 
AACATGTGCAATACCTTAGAAACACATTCCTTACTAGACCCCTCTCT^ 
75 AT ATGGACI"GtG CTCTaATT GG TGTCACCT ACAG CATCAAT CCT AG AGGTTGATGGT CATTC CAC ATC ATG CT C B .CTGACTTC-AGT 
CTACGAATTGTGGAAAGCTTTCTGATAGAAACTCGCAGCTGTGTCCAGCCTTATTACACTGTCATCTA 

GCTTCCTTTGGAGGTCCTTAACTCACTTTCTGATGGAAGCTCATTTCCT 

GTTTTTGTTTGCCTTCTAAAGAAAGAGGCAGTCTATGGGAAGJ^CCTTCT 

5 CTGAGGCTCCTCTGGCTCATATCACCTGAACTCTGTATTTTATATTTTCTTAATC 

ACAAGGAGTACACCCTGTAGACATACATAATCTGTGTAGGGAAGAGGAGCCTCATATATATTAAATA^ 

TTAATCATAGTAAATCACATTGATTGAGCACTTGGAACTACCCAAATAACATCCT 

CCACTGAAGGAGGATGTAGTTACCTCCAAGTTGCAGAAAGCTA 

CAGACAGTTATGAGCCTACITCAG CC CTAGGCTAGAAG CAACTT ACTCTCCTCAAGTGG TGG CT CTGTAGTG AAT ACATCT ATT AC 
1 0 CCTGGTATTACTCTCCCTTAAGCCCCTCTGTATCTTACAGATGCATG 

GC AAAT CAT AAT ACACTT AAG AACT CATTT ACTTTT AGTTTC CG ATT AAGGG CAC AATTTCTTTAATTTG ACCACAAACTT CTCAA 

AGACATACTTGACACACAGGCAACAGGCAAGTTAAGTATAAATTGGTGTGATTATAATT^ 

GTGAAATTTTGCTTGCCTCTTCAGAGTCATTCCATTTGAC^ 

TACCTGGTAAAGGTACCACATAGCCTATGCCTCTTGTGGTGTACTAC^ 
1 5 TGTGATTAATTTTCATCATGACCreCTGAACTAT 

GT AG TGG C CA.TCGCTCCTT CCCATCAG CT CTC CAATTGCTTT AATCAGACCTAT AAC CCAGGTT AAT CATTG TATT CAT AATTGGG 

GTGTGTTGGCrTTACCTAGCACITGGGGGTTTTTT^ 

G C CAAT ATG CACAAT AACTTGGTCT AT CAG AAAGGTTTT CATGAG ATC^ T AAATTTTCTGCAGG AAGACCTCC C^G<1A.GT AAAAGA 
ATGCAAAGATCCTTGGTCATCAAGCCCCCCTGCCCCCAGAGTGGTGGATGTCT 
2 0 AAATCAGAGGACCTCTTAAAAACCATCAGTGAGCCGTGG 

CCATGTTCCTTCAGGTCTGATTTTGTACTCrATACCCCTATACACA 

TCATCTGCTCTGAGAGTTCCTATTGATCrrcTCTGACCCACAGCGTTTTAGAAT 

TTCATGTAGGAAAAGTACCAGGATATAAGATGGGAATAGAGGTGAGATGGCCTTTTAA 

AAGAAGTC CAGCTGTT ACT ATGAACATCATCCITAAGATAG AGTTGAGTTTTGTCCTT^ CCACACTTTTAAA 

2 5 ACTGAGTAACCACCATGTCTCirTTTGCATGCAATTAACT 

CTAGATCTTTATCACACATACCCCTCCCTCAAGCrrcrAGTGTGGCATG^ 
TCAAAGTTATTATATGGAAGACCATCACAACAAAACAAGTCAAAGGAGT^ 
GCCAGCTXjGTGAATCACGGGTCCCCATGGCTCGACTCTCCGT 
CCACTCCCAATCTCATGGAGGTTCTTTTTATAATTATTTCTTTTTT^ 

3 0 GCTCTTTGGGGGTTATTTACTGGCCITTCAAAAGGCAAGTGGTGTG^ 

CTCTACATTATAAAGAATATCTGTTTAACCAGATTTCTGTGTGCTTATTGTCTT^ 
AACCrTCATTCTGGGCCATGACTCTGTATGACTCAAGCTGTCAAAT^ 
CTTGCTTATTGCAAAGATGAGAAGCAGAGCCACAAGTCCTGCCAC 
GCTGACTACTGCTCCAAGATTCTAGCACAATGTGAACACTTTT^ 

3 5 TAATCTTACCCCATCTTTAACTATTTATCTCGTATC^^ 

TCTAATCAAGCTAACCTGGGATAGTACATTGTGATGTCTCTTTCTTAGCGATTGT^ 
AGATTCTATTTTAAAGATTCCAGTAAAGCTTGTCTCATGTACrTATTTTAAT^ 
TTCTTGTGAGTTGATCCAATTTTCATACCATGTCTGTTCGTGTTCTC^ 
CACAAAAGTCTGGGTTAGTGTTTGTTGGGAGGTCATTTCGCCTATTCGGCCAA^ 

4 0 TACCTTCAGGATCTTGAGCCAGACTTTCTTTTGTAAGT 

CTCCTATTCAGTTTGAGCCTGTCAATCCCTTTAAATCAGCTCCAGAAAGGTGGACA 
AGCGTTTTCTACCAGCGACAGCTTCACTGTGTCGATCTGAATTTCT 

TGATTGCTAGGGAGATGGGATTAGGAGGGGAGTGAGGGGGTAACTGGAAGTGAAAGTGGCC^ 

ATTATCAAATAAT AACACAGAAGGGAAGGAGG CAG AGGGAAGGACC AAAAAAAG CTGTATATGCATGTGTGAGGTTTTCAAATAAT 

4 5 AGCAAAGAAGGAAGAACGGGAGAGGAAGGGAGAATGGGAACCCTACTTTATGATGTGTAA 

AGTTTGTCAAAGCACGAGAAGAGCCTCAGAAATACAGCCACCATTAGCCCAGCAGTAAC 
CTTTAAGGATATGTCCAAAATTGACTGAAAGGAGGTTCAGTGCAAAATT^ 
AAGCAAAACAATGAACCTAATTCCTTATTTCCCCTATAAATAAAATTGTAGAAATAGGTTTGT^ 
NNNNNNNimNNNNNNNNNTATATCAATATTTATGTAGTAAT^ 

5 0 GTTGTTTTCCTTTTGTAAATCTTTGTTCTGTCCTCT 

TGTATTTGTCCTAACAC^AAGCATTAATTCAGAGCAGGTACCCCTTCAGTCTGCCCTAG 

GAGGTTTTCCCAGCAGCTGACAACGCCTGCCAGAAACTCAAGCT 

CACTCCAGATGAAGGGGATGAGCCCATGCCTGTCCCTGAGGACCrcTCCACT 

GAGG CATGGGT AAGTGGCTCACCCTG CTGTCCTGCAGGCTGTAG CAT ATGCACAGTTAGGAAATGG CTG CTGTGT AGGATAAGGTA 
55 GGTGGGCTCATTCCCTCTTGTCCTCTGAGGTCAGTCACATAACAGGGTTTCCTCAAGTTGG^ 

CATTTGTATTAATTTAAAGGTCACTTTAAATCAAGGAGTTCGGCCAGTGTTGGAGCATATAGATCT 

CACCCTGTATGCTAGGGGTTCTTTTC!AAAACATAAACAGATGGTCACAACCT 

TCTGGGAAACAGGAGAAATTGTCCTTTGACCATGCCACACAGGGAGAGCC^ 

6 0 GGAGAAAGAGGTGGG CTTGCCCAG AATTTCCACAG CTG CTGTGGAACCTCrGCAGAATCATCATGCTGTAGGTGTGATTTAAGG AA 

CCGAAAACCAGGCTGGTTCCTGGAGAGCG^3GAAAGATGTGTGTATTTCACGTGTC^ 
CCTTGTCCCTTTGGCCGTGTAGTGTGAGCAGGGAGCAGGCTTCX3CTGG 
AATCAAGTGCTCAGTTCCAGGTGCCCTTGGTATXSCGCACTCTGTAGAAC^ 
ATGTACAATTTTCAGTTCAATTTCCTTTCAAC^TGAAGTTTAGAAAGTAATCATTTCA 

6 5 ACGTTTTGCCTGATTTGGCAGTAGCAGTTGACTTGGCCCT^ 

GCAAAGTTTTTGGTGGACAACTAGAAGCCACAACATCACCTGCTAGAAAAACT 
GGAAAGGGATGAGAAAACCCACCATTGGAACACCCCCAGCCACACTAC^ 

T CCACCCATTTCCACATGTAGGCTATATATAGG CTGTAAGTTTGATAATCTGTAG AATAT AACATT AAATAGAATGGGGT AATATC 
TAGGAGGTATTCACAAATTACAGGAAATAAAGTTGAAGCAATACTTCA 

7 0 TTAAAAAC^CAGGGGACCAGGAATCAGGAGAGTTTGGAA 

GGGAGGCAGX*AGGGC!AAGTCrCAGAAGGGACAC!ATCCAAGATTCTGOT 
CTGAGCCTCCATTTTCCTCATGTGAGCGAGAGCACTACCGCATGCC^ 
AGAATGGCAGACAGACTAAAAGAGAGAGAAAGTGTGTGGGTGCATCG 
GGATCCATGCGGCCTCAGTAGTAAGATGAGGACAGACAACAGAGCATA^ 
7 5 AAACACCCAAAATTGAAAACAGCCAGGAGAC^GAGAG^ 






WO 03/008583 PCT/US01/51291 



GAGCTCAAGCCTGTGGAGTGACCAGCAGTGAGGACGCAACAACCTGGGTCACTCAGTGGAGAGAAATGGGC 
GTGACCATGCCAGGAGACTCTGTACCAGACAAGAACAGACAGAGTTAGAAA^ 

GG CT AGTGGG TCATT CC CAGGATT AG AAAAGTTGGGGTTT CACACACTG CAT AGAT 
CTATGTATAAAAATTGGGACAATGGCAGTCAGAGGCAGAGTAATACAAGCACCAT^ 
5 TAAGAAGAAAAAAAGATT^GAAGAGCTGCTTGAAGAATTTAAGTCAGACCCG 
TCGGAACGGTCTACTTAATGGC^CTGTGGCTTGTTCTTCTGAAGTTATCT 
TGTGTCTGAAACCATTTGAAGCTGACGAAGGACTCAATTG^ 
GAAGAAGGGCAGAGCCGATTGCACTGACACAGAACTCTA 

AGAGAGCAGATGGGCAACTATAAAGAAAAGTGATGGCTCAATACTCTATTCCT^ 
1 0 AGTTTCCCCCTGGTTTGGGTTTGCTCTTGTCACTTGTGGCCT 

GGATTGTGGCCAAACTGTGATAAAATCCAGAGCACCACTCTGTTTG 

CCCAGAATGAAAGGTGGAATTCCAGGCTAAATGCACAAACACCTAGAAAAA 

AAAAGCCAATAGAAGGATAGGCCTGCTCAAGGCTGAGCTC 

ACTGGAGAATCTGTGCAGAACCCAGACAGAAGGAAATGGGATTG 
1 5 CCCITTGAGTGTATCTCAATCTGATATTCATCTGCCAGATCTGAGAGAGGCTGAC 

TAACTTTTACTCTTCITAGCTTAGATGAGAAGAGAGA^ 

AAATAGTGAAGTGTATAAAAATATTGATCTTGGTTTTTATTTT^ 




2 0 CTCAGCCATGAGGGCAGGGCTGAAGCAGGAACTATAAGCAATCATTTT 

GTCACATGAGAACATTTAGAAATATATTGTCACATATTGACAACTAATTG 

GGGT AG CAAT AAGTTTTGGTTGGCATACAAAAT CTGGGATTCACTGT AT CATTTTT AAACAT ACT ATGG CAC CAGG CTT ATTCG CC 
CCATCACTACCATCMCTTTAGGCAGTCAAGGATGGGTCCT 

GG CTGGAGGGAGGTTT TT AGG ACTG AAAGAG AG ACTG CAAG CCTGG CTGGCG AGTGGG AAG CTGGCTTCAAGTACGGATG TG AG CC 

2 5 CAGTTATCTAATCAGCAGCCAAAGGAAGTTGTTTTTATCAGTGAAATGCA 

GCAGCATTATCTGGCAGCATCTGAGCAGCrGCAGCAAGTGCAGGGACTG 
GCCACCAGCACTGAGTTCCTCTTTTCTGGAGGTGACTGAAGTATGTAGGTTGGCTCCGTGTC 
CCTrcCCCCGCTTCAGAGCTGAACGTAGCAGAGCAC^TCrGTCAAATT 
AAATGCTTGTAAAAGCCACTGGTACrCCACCATAGGCATTTCTAAGTCAGCT 

3 0 ATGAGACAGTAAGCAGTGGTATAAAAACTATT*TTCCCACCCX3CCCTCTCTTrC 

AGGTCirTGGCCTrCTGCATGCATCTCCCTTACTCTGTCAACCAGTAGG 
TTATTCCTGAACGTTGAAC^GACTCTGCATATCACCTGTCATCATG^ATTGGGCTTGGGTC 
ATTTAAATCCATTCCTTGGAGGAAACAGAAACTCAAATGTGTCAGTCAGC^ 
ATTATGTGTTGTGTGTATGATGGATGAATGCATTTATGTGCATGCTAGATC 
35 TGTGCATATATGTGTATATTTGTGCACATGTATAGTATATGTGCATATAAATATGTAATATTrTATGCA 
AATATGTGTAGTGTATTAGGGTATGAATGTGTGAGGTATGTAAACAATGTATATGCATATC 
TGTACATGTAATATATACGTATGGTATATGGTATGTGCTATGTGTAGTATATATGTATATCTGT^ 
TATGTGTGT^TATATCTATGATATTTTATGTATCIACAGAGTATATGGGTATGGCAAGTTTATAGTGTG 

AGTGTTTGTATGATGTGTAGCAGTGTTTGAGTAGTGTGTCTGGTATATATTTGGTATGTGTGACTTGTGITATATGTATGTTTGTA 
40 CCATGTGTGTGGTATGTATATATGTGGCATGTATCTGTGTGGCTTGTATGTGGTATGTGTGATTTGTGTAATATA 
ATATATGTCTGTGTCTreTGTGTAGTGCTTATGTATGGAGTATATATGTGGTGTATATATGTTTGGT 
TGTGCAGGG AG CTTGTATGTG CTGTGTATGT ATGTGGTATTCATGTGTGT AGTTGAACAATTTGTGTAGTACT 
TGTGACATGTGTGGTGATATGTATATGTGTTATGTGTTATGAGATAAACGTG 
GTTAGGGTATATGAAAGTACACGCTGCATGCCGTGGGAGCATGTATAGACATTAAAATACAACTC^ 

4 5 CCACCAGTGGGTTCCATGGACTGAAGTCTGGCCATTAAGCTTTGCAGAAAGCA 

TGTGTGGATCTTGACAGTATGCC^TGGCTATTTTCT^ 

ACATGCACATTATACTCTGACATGGAGTTCACCCCAGCCAATATCCA 

TGTTGTGCCAATGACAAAAATTACATATCATCTCAGTTGC^CAGCCCTTTGCCTGAGATTCCATT^ 

5 0 GCCTCGACACTAAGCCTGACATGAAATCACACCCTGGGTCCCA 

TGTCTGAACCATCAACTAGAACTCCTGGGCATAAGAGGCCTAAGCCAGCTCTCCCAGGCCCT^^ 
AAGCTAATGTATGGTTGGGAAGAACATGGCTTTCC^CTTTGGGCTAGC 

GTGCCTTGTATTTGTAGACTGCTGACCATGAAATCCTGAAATTTCTTTAGTATCTC^CCCAAAACAGAAC 
ACAACCCTGCCCCAACCCGTGCTGAATTCATTTTAGTATTTGTGTACATGTTGTTTGGTCAGCT 

5 5 GGACCTCAGAGATAGAAACTCATCCCAGAAACCTTTAGTGGATAAGATGGACTCCCA 

AGCTGTAGTGCCTGTGTTACAACGACACAACGATAAACTCACAAGAATATGAGGTTTTTGTAGAG 
GTGTGTCCTAACTGGGCTGAGGATTGGGGTTCACGAGGGGTTCAGATT 

CCAGG C CT ATTTTAACTGT AG AATG AG TTG AGCAAAG AGGG AAG AATG CTT AAAAG CAAAAG AGTAGAGG CACAAG AGG ACCCCAG 
CATCTTG ATCTGGGGTG TGAG CAGTGGTG CAGG CAAGGGG C CAGGTG AT AG CT C C AG CTG CTTTACAT AG T AGAG TTTG TTCT CAC 

6 0 GATAGCAACTGTTAGGAGTCTTGGTCTCAGAGAAGGGTATTTGAACCTCTGCCAAGCA 

TGGGGTTTCTCTACTGTCTTGGTATAGGTCCATGGGTATAGTGACCAACTACAGTGATTGTGG 

TGATTAGCATAGAATCATGATAATTCTGATCCAGGGAATTGGCTGGAATGTAGGGATGGAAGAG 

GAGTCAGCTAGGTCAGACAAGAGAGGGAACCCTTGGAGGAACACT^ 

GTCCTGCAGTTCAGATGAAATTTCTCAGAAGAATTATAGGATGGAAGCCACAGAAAAC^CATGGG 

6 5 GAGAACTTAGTGAGTGCCCTTAGTGAGGAGAATrrGGGCCAGTTGTGCATT^ 

CAGACTGCCACGGAAACCAGACTGATGATCAGGGAATGG^ 
TGAGCTGAGAGTTGCTGGCTGAAGGGGAGATTTCCAGGCTGATAClTrAGG 
GTGGTGACACATGCCTTTAACTGCAGAAGCTGGAGGATCTCTCAGTTGGAGGCCAGCCTC^CCT 
CGGGGCTACACAGAGAAACCCTGTCAAAAACTAAAATTACGGGAAATAGGAAGGACT 
70 CCTGGGAAG CAAGCAGGGNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NHNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GAATGATGGGTGTTGCCTTATTGC<;GATGAGAAAGGGAGAGGAGCTATC!AAGCA^ 

TGATGCTCAGCACCTCAACrGTTTrAGGATACTGATTTCACAGT 

GAGGCTGTCCCTGCCACACTC%ACC^GTAGTGCTACCCC a -C 

7 5 GAAGCTGACTCAGATGTCAGCCACACTTCCCTCAGCGTTCCCACACTC!GAGA 
TAAGACTTTTATTGTGCTCAGTAATGTTCOIGAACCC

CCCAAGCCCCACTCTCAAATGTGCAGCCCTTCTTCCGGCl'TTTTTTCTCTTTGTGTCT 
GGTTTGTTTCTGTAAATAAAACCCAGCATTAATCTTATCTT(!ACTGTAOT 
TTCATTTACTGAGAGAAACAAAGAGGAGTCCAAGATTAAAAGGCCCTGGAACAGATAGAC^ 
5 GGTCCCAGTGTTATTTTCTGCTTTGTTCTCTGTC 

GTTAACTTTCCTTCTGCATGGGGTTTCAGAAAGAAATGGAAAAAAAAA 

TTATGCAAAAGTAGAAAGGTTTAAGTAGAGACACCAGGCTCT 

C AG ATGCCTGCTGGGGGAGGGGAAGTGAGAAGCGTC CCTCCTCAGG C CCTGAGTGGTTGAGTTCCAG CTTCTGAC CT CT ACCAGAG 
1 0 GAGCTCAACAGAGCACAGATCTGCTCAGAATCTCAAACcrrf l'GTTGG^ 

T CT ATTT ATTT ATTTTGAT ATGAGAAACTT AGAAAT AATTCA^ 

CCATTCATGCTGCCAGACTGTTAATGAACAGCTTGAAGACCACCATGGTTCGAA^ 

CAAGTTAGGGCTTTTTTCGTTTTGTTTTAATTTGGTTT^ 

TGTGGGGGGGTATTTTTTGGATAGCTATTTGTGCAGTAAAGGCC^ 
1 5 TGAGGCCTCCTGCAGCAACTGCCCTCTCCCTTGT 

GCAATAGACATCATGAGGTGACTCCCCTCGCTGCTCCCTGAACCCATGGTG^ 

TGGAAGAGAATGATCAAAAGTTCACCTTACCATGGAGGGACCCCAGACTCTTG^ 

CTTCTGTGG TAAAAAAG CCATTCATGCT AC CTCTG TT AGGAG CAAGGT CAAGG CAGG CATCT CTGG CACAGG ACTGGGG TGTG TCA 
TTGGAAGGCTGGCTCCCAGGAGAGTACTTCCCCTACAGGGGATCTGG 
2 0 ATTCCIAGTGAAGCACTGAACGTCCAGCATCTTCCCCATCCAGCAGGGT^ 
CTGACTCCTGCTGCCTX^CAGGGCCCAGGCAATTGTGGACATTGACT 
TGCTGAAATGAGAAAAAGGCTTTTATTACAGCCCTGCAGGCAGCCATATC 
CTGAGTGCTTTCACAGTGTCTCCACACTCCACrTGAAC^ 

TTGGCTTCTGTTGGCACCCCAATGTGTAATAATGACAATTAAGTATTGCCTGGTGTAG 

2 5 GTTGGAACTGCCTTCAGTGAGATCAGGGCTGGGGAGAGAAGGCrGACrT 

GATGATGCTGGCTTGCCGTAAAGCCATCTGTGTTTCCTTACAGTTTTGGAA 
TTGCACCTCTTGCGGGTTTGCGTTTCrCTGTGTGTTTCrTC 

TTCACCCTT AG CTT AACCCX3CACTGTGCCT ACAT ACCACACGCATCC CC CTGG AG AG ACAG ACAG ACAG 

ACAGGCTCACCTTTAGCTCAAGACITTGAAGCATGTG^ 

3 0 CATTGTTT CTG GAAT AT AGCT ATGCCATG CATG ATTG C CTGTTTT AGTGG AG CAC AGT CAG CTTG AAATGT AGT AATCT AGG TG CT 

AGATGCTTCTGAATTCCCTAGAGAAAGGAATTATTTTTTTAAAGGAAAAAA 

ACTGAGTGTTTGAGTCCAGTGTTTGATGCTTATATCACATCT 
TCCAGTGCTGCACTCAAACACTGAGCTGTGGGCCAAGCCTTTGGG 

3 5 TGTAGAG AAT ATAATGTG AATG CTTG CTG AGTTGTATCATCTAGTT CTTCCATATTTT CCTCAGTCT CAGTCCTGTCCCGTTTGTA 

CTGTTGAGTTCCACTTGGCATCAGCATTTTAGCAG^ 

AGGAACCTGGAAAGCCCATCCTCTTCCCTGAGCCCAACTGACATGCGTGG^ 

CCGTCTATTCCTATGCTCTCCCGTGTATCACTGTGAGCCTATTCATTAA^ 

CAAGGCACATTACAGAAGAATACCACAGGCATCTGCTCCCACCAAACCCAGCCTC^ 

4 0 GGCCACTGAGTCCTGTTGCTGTTCCCTAAAAAAATAACACTGCTGOWVCCTGATGA 

ACTTTTAAGGGAGAAATAGAGGCC^GCTTCCCTCCTGCAAGCTTAAAATGAAGCCTGGGATTTGCCT 
GAAGGCCOUXSCATAGATGGCTCTGGCATTCCCGCCTCTCC^ 
GACGTCTGCACGCCACTGCCCACTGCACITCTTACAGTTCAGGAG^ 
GCACGCCAGG ATCCACAGAGAT AGG ATCTGTCATTCTGGCTTTC 
45 CACCCTTCCC^CCCTTCCACCCTAGTCTTTTCATTTTCTCT 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CCTCATCTTCCTCTTCCTCZATCTTCTTrCCTCCATTCCTTCAA 
CATGGATCAAAGGACCAATAGATGGGTTGATTATATTGTACT 

GGATAGCCTGGCTGTCATAAGTAACCTGAAGCATCTTTAGGAGCCTATTGTCAGTAACTA 

5 0 GCTTTrGTGTTTTGTGTTTTTTGTGTTTTGCTTCCTGTT 

CT ATGT CT AG CTGGTTGT AT CAC^G AAAGCTTAC CAC CCAG TGT CG ATGGGT ATG C AAG CAGG ACTTCTGAG CG CATG AG AG T CTT 
GACTTTGTTGACTGCTCCAGGCTATAAACCAAGCTTTACrc 
AACGAAAAGAAAATCTG CATG ATT ACTTGACATAGTCCTCAAAGCTGT^ 
CTGGAGTGTCATGAGCATCACAGGTGGACATGAACCTGTTCCTTTAAAGA 

5 5 TCAGATTTGGAGCTAAGGGGAGATGTGTTCTGAGTGGCTCGAGCATTCTAGAGC 

ACACTTGTCCAGGCACCCTGTTAGGCAGAAGCCACCAAGTAAGATGAGATCACAGAATTT^ 
TATACAGGGATAGAGAGTCAAGAAAATAACAAGAGAATGGTGAGCAATAAAATATGGAAGAATATAGT 
ACCATTCTGTTGGTTTTAGACACTTAAAGTTCAATATTTATGAAATAGTCA 
AATTCTCTGATGGTAGTATGACTATTCTCCTGGGAAAGGCAAAAAATTGTCATG 

6 0 CCACATGTCTGACAGTCCCATTGATGGTATCTGCTAGTCTATC^ 

GGACTGAGACCACCTCAGGGTGCA'riTC.'T'rC l"!' CAAGAATATCTATGTTTTTAAGTC ATGCATTAAAGTAAGAGCTTCTC CCAA 
CATGGATGGAACAGACAGTGAATATACATCATTGGCTTTAATGTTAACG 
A(^CAAGCACAGCTTGTGAGCAATATCATGTTGCTGCTGTGTCAAAAGGTCCCT 
CCAGACCTCATGTGGACTCAGCAGTCCATTTAAAACCATCAGCAGGCAG^ 

6 5 ACATCAAGCACTGAACTAAGTCTATGCATCTGCAGACCATTGCTCCTC 

GGAGATGAGACAAACACTAAATACCCTTCCTTCTAATAATTCCTT^ 

TAGCTAGCCATGAATCCAGAGTGTTCACAGATAATGATCTATGGGATCATGGCTTCT 
TAGAGTTCTGAGGACGGAAGGCTGAGTAGGAAGGCCACCTCCCTCAC^ 

7 0 CACATCCCTTGTTGCCTGCTGACCTGCA 

CCTTTGACATTTGGCTGAACAATGGGATCTCCT 

TGCAATTGCCAGACACATAGACATTGTAGAATAAGACAGGACTCCTGC^ 
CAAATATCCCTTGTTGCTGTTTTGATACCGTGATACTCTGTO 
GGGTTCTGCATTGATCCCTCATGCCCTAGACAACACAGACACCT 
7 5 ATGTTTTCAAG C (^TTCAAAAT CCAGTTCTTTAAAGGG ATCCAAAACAACT AATA CAG TT AAAGG AGT AAAAACATCCAAC 
TG C C C CTGCACTGTTCAGTTCTCTT ATTCCAGTGGAGGTTATTTT ACAAGG AGATTTCT CAGGT C TTGT ATTGAATTTCATT

TTTACrTTGGCAAATTAATTTTCTCCTGAAATCCCT 

TCAAGAGCAGATGGTTTACTATCAATGCACAAAGAGTTCTTA^ 

ATAATGGTAAGTGGAGAGGTGGCCATGGCTGTTGGTACAGCCCTCCTGGTTTGTGT^ 
5 TCCTTACC(^TTCTTCAG<5CTGGGGAATCTCTCTTGAAAGAAATAG^ 

TATAAATTTTAGCTTTGGGTTTAGAAAGAATGGCATA^ 

TACCCTTGCTTGATTrCTCATCTGTCACTCACACCTCCACACT 

AGGTCTGAAACTGTTGCATTCAGAGGTACCACGCAATCTCTTCAG 

GAGATGCCCCCATAATTGGGCTGTTAGTAATAATCTCACTTTCAGGAAA^ 
1 0 TGGTACATATTCTTTGAGGTAAAGCATCTTTAAAAGGm 

TTTTGAAGAG T ATGGGT ATCACCCCACAACAAG CAGGGACC CTACTT ACTGG AACAAT AAG 

AAACCAACTAGACCACTATAATAGATACGGATGCCGTTGGCAATC 

GGAGCAGCTTCAACGAGCGGTTGTCATGTTTAACAGCCATTA^ 

ATGTGCTGCCAAGTACAGGGAGAGGTAGCACATGGGCCATGTGGGATAGGGTTGCT 
1 5 GGCAGATTCCAAGCGCATGCCCAGGGACCTGGCACAGGCTGAAAGAGTGGTCACTGTA 

CCTGGCTAGTTAGCTTAAGGGCTTCATTTTCTGATCTTTGG CAGAAAAT CT CTCACAG G ACT ATGG ATC CAAGTTTTCAG ATT CTT 

TAGGCTGTCACTGACTTTTTCTTTGTTGCTATCCAAAG 

GT CT AATTTTG TTGGGGACAT ACTTG CCAGG CATATAATGAGGAAAATGGGGGTGAATGAGATGAGCTTGAGTTTTTCTGAGGGGG 
CAGG AT ATTTTAAAAAAAAAAACCTG CAATG CAT ACAACAAGCAAGGAG AGAAATCAGTTTTC AGG AG ACATATGT AGG 
2 0 ATGACATATTATGTGTCTTCAAATATGGTGGAAACTGGCT 

TGAGTCAGGGATAGTCITCAACAGGTATGAACAGGCACCCAGGATA 

AGTGATCCCTAGTCTAAAGCATTTGTTCTGTCACAGCCTTTTCCAAGAAAGCTGAGAGCATGACT 

TGGCTAGGCTGAGGCAGGAGGATTGTGATTTCAAAGCAGCrG^ 

GTCACACTGAAATTCCTGATTCTTACATGTTAACATCTTGAATGGATAA^ 

2 5 GCAAGGCC CTGGGTTCAAT CC CCACT CCTGTG AAAACAATGAAGAG TGGGT ATGTT A^ CAAGGG ACA 

TTTGAGGACCATCCAGGTACAATTTGAGTCAAGACTCTCAAAGCAAGAAGGC 
CCATAGGGTTGGAAGAATGAATATGAAATAAAAAAGCCAATTTGAAAAGGCA 
ACAGGGAGCATGTCATGTAGAGCATGGCTGCCCTGGGGTGGACTTGG 
GAACTCAGGGCTGACACCCTGGGATACTTGGCCAGGTGGAGA 

3 0 CCCTGCTACCGGTAGTGCAAAGACCTACGAACTTTCCTCATCACC^ 

TATGGTTAGCACCCTCCTAGAAATCrAGTGTGCCAAAG 

TG AGGGGC CCTTCCAGCTGCACTACAG ACCCTCAGCAACTG AATG CTGCT 

GCTCTGCTCACTCCTGACCGTTAGAAATGTGTCACCGAAGTGTGAAAATACTTATGCTC 

CAATCCCTCTCATTTGCATGGGTTACTCATAAATGTAGATCTTGGGTATGATTTG 

3 5 GCCTTGCTGGATGGGCTTGTCTCTGGCCTTCATCTACAAGTGCTGGGTGC^^ 

GGAGCCGGGGGTGCAAAACACAGAGTGATAGTCGTTTGACACAATTTCAGCT 

TGGACTAAGTCATTCTCTCTG ATT CCCAATTGTTCAATTAAT CT ATGTGTGTT AACACCTCTCTAT AACATCATAG ATAGTG CCTA 
GTAATGAACT CTGCGAGTTCAGGCTGAGGAACCCTGAATG ATCCGCCACTCTG CAG CTGTTCAGGAGTTGT ACCCTTTTTGTG ATG 
TGGTGAACTCTTTGAGGACTTCAGCACTGTATTC^ 

4 0 TTTAAAAGGATGTCCCTGCATACAGCTTAGCAACCrTG ATT*GAACT C AGCCrTTCATGGTTACACCTT AAGGTTGAAACC CATTTC 

ATTTCATCATTTCCCAGAAGGGGCACTCAGTTCACCCATTTCCT 

CTC CCAGT AG CATGG CT ATGG AAG CCAGCGTCACACT CTTGCCCCAC CTGG CAT CT ATGCATTTTG CAC AT AG CTG CCAT C CG TTC 

TAAGCTATCTGTGCACAGATCTGTGGACAGGGCTGGCTGTTCCCCTGGTGTCT 

AGCACTGTTGTGCCATCTCTGACACACAAGCTCrCGTCTCCCTATGT^ 

4 5 GTGGTTAAGACCAAAGCTTCTAAACTTGCrCCATCCTATTTCCAGTGG^ 

GCTTGATACCTCCTGCCCCGAO^TCITrAAAGATGTGTGTGTGGGGGTGGGAGGTGGGG 
GTGTGTGTGTGTGTACTTGCACACGTTGTTGTTGX3TAATGATGTTGAGTGAGACCAGCACATGA 
GCCATGCCCACAC^TCTACATTCATCTACCCACCCCTTCAGCGTGCCCITCCCTGGCCT 
CCTTTCAATTCTGCTCAGGTTTCAGATGGTTAGGACCCTAGGGACAGGAGCC^ 

5 0 CCACCAAAAATAAGTTGCCTTGCCTGAGTAATCCTGGCTCTAAGCTGTC^ 

TTAGGCTGCTGGAGATTAGATAAGAAACCGTCITTCCCCCCACCCCCACTACAGTCrc 
GTGTCTCAGGGATTTGCCAGGAAATGCATGAGTTTCACITCAGAACTT^ 

TGGG ACCCTTTTCACTGCGAGGTGCAG TTGG ATG TG AATT ACTG AGTG ATGGGG TCGGGC CAG CT ACAG AG CT ACTG CTCTTCTT A 
AGAGTGTACAAGTTCTTAGCATCGTTAGGCCACAAGACTGTCACTTTCAGCCTCTGGTG 

5 5 TGTTGTGTTTCTGTCGCTTAAAAGTGACAGAAGC CTCCAAGTTGATGTCCAAATATGTTTCTGTATTTGTAAAGGATGTGTTTTAC 

AGGGGCTGGAGAGATGGCTCAGTAGTTAGGACACTGACTGCTTTTCCAGAGGTCCT 
TCACAACCATCTGTAATGGGATCTGATGCCCTCTTCTGGTGTGAGTCTGAAGAGAGCAATAGTGTTCTC^ 
TAAATCTTTTAAAAAGAGAGAGAGAGAGGAAAGGGGAAAAGAAGGATGTGTTT^ 
ATCAGTAATGGAGACATGAATGTTTCATCGCTGCAGAGTGCACCTTTGCAGATTTCCTGCAGTGCCCG 

6 0 GGTTGTTTAAGCTTTGTTGTTGGAGAATGAGATGGTGTGGTGCTCACTGTGCTGTACAACCTCGC 

CTTCATAAGGTTGAGAGACACAAAGAGGATGAGAGTCCTGGAGGCTGCAGCAGGACTC 
CCAGCAGGTTTCCATTCACGATTTCCCAGGAGCTITGGACTGGCAGGCCCTAATCTTC 

AATG ACAAG CTGTTTCAG CTTTGCCAC CTCTC CCAGG C CACTG TGG ATGTCAC C GC AT AG CATGG CTG ACTATCT AGTGCAGT CTC 
TGGCTGACACAGGTTAAGTGGCCATC^GT(^CAAGTGATGCTATCCTGGTGATATTGGTTAAGCTATCT 

6 5 TAGGACAGCCATATGTCACGTGACTACATTCAGTGATACAAGTGCG^ 

CACAGAT AAG CACAGGC AAC CAGAACCAGCACCTCCACCTAAG CTTGAGTGATCAGTGGGGTCCCAGAGGCTGTG CTGGCAGAGGG 
ATGGCCAAGGGCATGATTGATAAGGCTTTGCTCCAATGTTAGCTC 
TTCCATAAATACCTAAATTGTTAGTGAGATACTAACATTTCATTGATTACAGAG 
GACATTCTGTGTTCAAAAATATGATGGTTTAT 

7 0 CAGAAGCCTTGAAGACAGCATGACAGGTGTAG^ 

CATAGTCCTGCCCACCAGCCTCCACTAGGGGAATCTTCCCCAGCTTCCATTTTCC^ 
CACAAAGAAACACACCTGGCCCATACAGGTGGACAGGTACCTTGTGCTG 
AGAAAGGTCTTGTGTAAAGAATGCrCAGAGCACACAOSAGCACAGAAGAGCCTTC^ 
GACATAAAAACACACAGATGTGTGTACACCACACAGTACTCTAA^ 
75 TAGATCGC^ACACAGTAAAC\AATAGGAAC^ 
GCAACAGTGGCTCTGAGGTGGCTGATAATTCTCTGGAGCAGTGAGACGGTTGGGACAGAGG
GTGAGGGTGCCTTCCATTTACACCAGGCCGCTCTCTCGCCTCCTCAGTGG 
GATAAGGACTriTTCTTCTTGGTGTCCACACTCTACTAGTGGCCTGCCri^ 
CAGAGTGCCACAGACTAACACGTTAGTTTGGAATTTTCCT^ 
5 AACTGAG TTGT AGAGGTTTCTAAATGTG CCCTTTTT AAGTATTAAT AT AT ATAAAT AT ATT CT CTGTTTCAGTAGTCT ACTTTTG T 

TGCTTGTTTG CTTG AT1TT ATCT CAGT C CT AACACT AG ATl'ITT ATT AAAATGT AAGGAAATCTT ATTGTGG ITT" IT ATT CT CGCT 
GTTTAAATACAAGAGAAAAAGAACTGAGTGGCCTACCAGAAGTATCAA 
AAATATGCAGGGCACAGTTCAGCAGCCCCTCTCATCCCACAAGACT 

GAG CTCAAACTCTCATTTTATGTTCAAGAAAAG CTGTGT AAATACATGT ATACATGCCTGTGTGTG AGAAGACACATGTGTGCATG 
1 0 TGTCTCTTGTATTTTTGCATCTGTATAGGTCTAATGGCATGTT^ 

TATCTGTGCATGGGCCTATATTTGTGTGTCTGTGTGCATGTGTGTATTGTGTGTGACCCAAGCTCTCT 

TTCCCTAATCTTACTTCCCTTTCTGTCAGTTCTCCATAACT 

CTGCATTCCTATATTTCAGTGTTCACCCTCGTTCCTCTC 

ATATGTCACACAACACAGTGTCACAATCACCAAG 
1 5 AGGG ACATCKIAGAGAAT AG AGAAGTTTGG CTCTGGGAAGCCCAG^ CTG CT ATG C CAGATG CATGGTGAG C 

AAAGTAGACCATTCCATCATTTTCTACAATGGCAGTTG^ 

CTCCAGCTTCCATAGGGCCATAGGATGGCTCC 
2 0 TGAGTAGGCAGAGGATCACCTACATGCCCAATACAGGTCTCTTCCCACATACGTGACTTCT 
CCCATGGTCCATTTCTGCTGGTACTCCATGATGGCTCTTGGGCAGTGGGTATC 
ATGCTCCTGAGCCAGAGACATCTTTCTCCTACAATGTTCTCCAGTGACACCCT 
TTCCTCCAATGATGTGTTTGGCrTGTTGGAAAGGCCTTGTTTCAAAATCTA 

CAGGATTT AGGGAATT AT CTGGGAGGAACTGGCTATGAGGTC CCATAAACGTG<3GCTGCATCCATAGCTTGCTGAGTCTCACTAGT 

2 5 AACCCTGGGCAGCTCTCTTGTCTGTGAGTCTTACTTCTGTCTCT^ 

AATTAAACAAGTAAATGCATGGTAGATGTTCAACACAGCTCCCTCACCT 

AACCAGTAGCCTTGACCTAGATGCTAAGTGCTTAATACGTCTG^ 

TACATTTCCCAATAAAGTTCTATTGAATCATACAATGAAAAAAAACAGGA 

AATAAGGACAGGACTTTTTGCAATGGCCTGGTGGTAAGAAATGGCTGGTCCCTACTAGGAAAAACAT 

3 0 CAGGAGATGTGACATTTTGATGAAAGTGCCAGTAAGCTAGGGGTTTCTGAGTAATGAGCT 

GCGCAAAGGGTGCCAGCATTTCCTGAGTGCTGGGTGTCTTATCTGGCCACT 
GGATTTCCTTGGGAGGAAGCTGAAACTTGGTGATGTCATGTGACTCTTCAGTCCCA 

TG AAT AGAACTTG CTTG CCTCTG TGGG C C CATTTTCCATGG CAGTTTCCAAAG CTTG G AGGCC AG ATGTT AGG ACC ACAGTC ATG A 
CTAACACCTGAGAGCTTTGTAATGATGGCCAAACAATATAGAATGTGGTCAGCA 

3 5 AGAACAGAAAGGAAGAAGAAGAATAAAAGGGAGAATGACTCITGAAGACCTCTCT 

TTGTGCTGAAGTTAGCAGTAAGTTTSCCCCTAGACAGTGAAGGG 
CAACCCCCAAATCTAATTCnTGCCACCAGCTGCCATTCCTTGGCCAGAC^ 

CCGTTTAATACAGTTCCTCGTGTTGTGGTGACCCCACCCATAAAATTATTTTTTATTGCCACTGCA 

TT ATGAATCACAATGTAAATATCTATGTTTTCAATGGT CTTAGGTGG CC C ACATTAAAAGGGTTGCTGG ACACCTCCACAAAGGGG 

4 0 TAACAAAACACAGGTTGAGAACCACTGCTCTATAGAAATGTGTTTCT

ATCAGGTTCCTGCCCCCAGAGCCAGAATTCTGGAGAATGTGTCTC 

TCGGTCCCTATCTGTACrGTTCTCATCCTCCATGAA^^ 

CTCTTGTCATGACTCTCTATACAC^CAACCTTGTGCTC^ 

ATCC CAGCCACTGACCCAAGCTTGTTCTCCTGTACACATAGCCAGCTCT^ CTCTGCTC CTCCC CATCTGAGCTGG 

4 5 CACCTGAGAAACAGGTGTTCCTGGGAGGAGCACATGGAGAAATAACCA 

TTGTCAAAAGCCTACTGACCACTCCAATCCTGATTTCTCAGCCTG 
CCAGCCAACACCTGGACCrGTGGCTGCAAAAGCTGCAGGAAAACCGACATCC^ 
TTTGCTTGGAATTCTGACAGGTGCCCGTTGGGCITCTGGAAGGGA 
TTACATTATCTTCAGCCACCCTGAAGCTCGGCCTAGGCTTCT 

5 0 TGT ACGGAT AG CCACAAGGAGCTAAGCCATCTGATGTTCTCTTCACCT 

ATGTGTTCCACTCATTTCAAGKSACCAGCATACCCTACACCTCACC^ 
AGGTCAAGTACAATCCATGTAGTGTTTTCAGCCCTACATTTTGCCAATTGCT 
TTGGAAGTATGTTGCTGGGAAGGTCAGTCTACCAGCCATGTGT 
CATTAGCTGGCrCATACCAATTTACTCTGAT^ 

5 5 CTCTGGTGAACACGGCCACCTCTTCACTTTGTGGTAG^ 

G ATAAACIACAG AAGCCTCCTGCTGTCCCTCCTG ACATCTGCAGT^ CT AATT ATCCAG CAAG ATTCCAG 

TGTATACTCTATCCTCAGTATACTGGCTCATGTGATCTAGTAAAGATGTCTCTCACTG 
AGTATCTCAACTTITAACrCATACCC^AGACCrCAAATGATGTCA 
TCCCCTCCTCTGCCCAC^CCTTGTTCACCTTACTCATCAGAATACTCC 

6 0 TTCT CATGCTT AT GTGG TTCTTCTTG AAT CC CAGGTCTT ACAGGTG ATGTTT AGG AG CTTT CTC ATGG CAATTTTG CAT ACTTT GT 

ACCACACACATG AAGAAGTAATG CTTTTGATGAAAATGTAG CACTTCTTTT AAAAAAAAATCAAAAAAGCAAG AAACTTAAAAC AT 
TC^TAAAATATCTCT AAGCTTTTTCCCCCAGACAACCAG ATTTAAC CT CCCAGCTGTGGGTCTGTAAGACCCAGAGCCTGACCTAG 
TTGACATGJUVCTGTGTGTAAGAGGGCCAGGAGAGAACAG CTCAGTGTCAG AACCTGGGGG AATTACAAATGG CACCATAGTG ATGA 
CTTGTGGCrTCTCCTGTAAATCTGAATAGAAACATTATTGTTGCTCTCT 

6 5 GTATTTCTAAGTCATACATTGCTTTTTCTAACCTGATTT 

CAG<1ACAGGGTTGGCGGTGTGATTCAGTGGGAGAGCACATGCCTCATG 

AAAACTCAGGTGTTCTACAACAACCAGAAAAAGGGGGACATTGTGTCC^ 

ACTGATGCTGGGAACGACCCCAACGTCCATGTCCCCCAACAGCCTAAAGCACAG^ 

TG AGT CCCCAAC!AAAATCCTGAAACAGAAGAACACTG TT AGG G^ CT ACC CTGG CTC CA 

7 0 TTGTCCrGGGTGAGCTGCCAAGCCCTCTTCCTGCrGCTGTT 

(^TAGGCCCATAGGATTCTTTGTGGACAAGAAGATAAAGCGTTATCT^ 
CITGTCCTCTCAGGCCCGTGGAGTATTATCATGATTCGGCACACAAACCTGACAGAACAGA 
AGCAAAGTGAGCTCAGATAGTCAGCGCATGATACAGTTTCCACCAAGGAGGAAAAGCC 
CATGSTCAGCTGCTTCTTCCTTCCATGAGAT^ 
7 5 GCCAAGGGATTCATTTGATTC^AGAGCCAGTAGGCATAGACTGTAGTGATGTA 
rrcccc^ctgttgactttgctctaaatatgcttgcgt^

tt ctcaatt actggacatgg attt ctg aaact atccaactatt^ ctgaactgtg tcacag 

at aacctgt ac ctcctgtg catgct accaag tg acactg agtg ctag t cag c catggcctcttt ctgagcatgg ct atggt caggt 
cacatgagcattcagagctgggcatttatctggaataggcttat 
5 cgtctccaggctatgatcatcgggcaggccctccaactccataata 

ctggttctaggggagagatgatctc^ggtgtctccatctcagacccaga^ 

TGTCGGGTGGAGGGTX5GGGCATTGCTCTGGAACAAGAATGTGCAT 

gtgagctttggttattgttttagcattttcctaaggtccatccctttatt^ 

gcagttgccagaagtagaggaattttgacacaagattttcagttctct 
1 0 gatgtgatcatagagactcaaggaaggtgacagcctcagtctg atagacacgtctgctgggatgggagcccaaaggccacagccca 

cagtgcacaaaaaaacctaggcaggtggacaatggtgagcaggaagc^ 

gtggcaggggtatgattggagcacatcagggaaactgaggctccttctc^ 

catggaggagctctggtggattcattggctagaaccttgtttactg^ 

gagggtgggtttcgagatgggcagctgtctggtgcct^ 
1 5 tctgggccttggcacagttctgctgcctgctctttct 

acctctgatgtcccaacaggagatggttgaaagagttgggacagcctcacagct 

gtcacttgagtaggtgtaagactccacaccaatgcacctctccactgtgt 

tgtctctttgcaaggctgaaaccacaacgtcttctgtcagtgatgtcatct 

agcccaaccaacagggaggtttaagaagaaatgtatctgtctataaaatatactggttctc^ 
20 cct ac actc ctgt ag aat at ac ct at annnnnnnnnnnnnnnnnnnnnnn^ 

nnnnnnnnnnnnnnniwnnnnnnnnnnnn^ 

nnknnnnnnnnnnnnnnnnnnnnnnnnnnnnn^ 

nnnnnnnnnnnnknnnnnnminnnnnnnnnnnn^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNl 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNKNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
3 0 CTGCCAAGTTTACATAGCAAGTTCTAAGACAGCCAGAGCT^ 

TATTTCATAAAACAGTGGCTTAAAGCTX5CAGGTAGACTGTCTGAGAAAA 

CCCATTGTGGGGTGAGCTGCTTCTGATGGGATAGTGTATAGACAGGG 

AAAGAGCAAGCAGCCAGGAACTGTGTCCTGATTCAGAGCTTCTATA^ 

CTGAGCTTCCCGCCTCTGTGTGT ACATGAAG AACACCAGG CTT AG CTGTG AGG ACCACAG AGCAGGCACAACTAGGCAGTTGT ATT 

3 5 TGCAACACCCTGCAAGTTATTAGCTGGCCACGTGTTTGTCATTCte 

TCAC^CAAGTAAACACATATGCATGGGGGTGCACACTTAAGTGTGTGAATGCAGTGG 
AACC^CCCAAAAGAAGGCTGCTTGCAAAACTCTGTCTTTTGAA 

T CTGTGCT AAACAC CAAAAACTTT ACTTTT CACATTT ACATG CT AAACACT ATG CTT AATG C AAAGT ATAG AC CTTCCTGTG AG C C 
CAAGATCCCTTACTGCCAAAACAGTGATGGTGTCTACCTCAAAGACCCACAGTGCAGCTTAGA 

4 0 TGTGCACCTGACAGACCCACTGAGTTGCTGCTGTGAGAGACTC^ 

ACCCACITAGAACACCATAGAATGTTTTTTCTAAGTCCAGCCATTGTA 
AGCAACTGGTGCTATATTTTGCTTTTAAGCTTTTGGTAGAT^ 

GACAAATGATTAATTCTTTTGAAAGGGACTCAACATTTCATGACCCAGCATCATGGTCT 

CCAGGCATGGCAATG AACACCTGTAATCCCCACAGT AGCAG AGGG AG C AACAGG CGGCCCAC CCCTGAGCTTTCTGGCT AGCCAGT 
45 GCAGTTGGATTGGCAATGTACAAATTCAGTGAGAAGCCCTATCTCAAAAGATGATGATGGGGGCTGGAGAGATC 
AGATCACTGACTGCTCTTCCAGGGGTCCTGAGTTCAACTCCC 

GTCrTCTTCTGGTGTGCCTAAAGACAGCTACAGTTTACTCATATACATTGGATAGATAGATAGATAG 

GATAGATAGATAGATATTTTTTAAAAAAAAAGATTTGATGGTGATGATGGTGATGGTG 

CAGACGTTGATGTCTCCAC^TATGTATCCACACAACTGCACATGT^ 

5 0 GGTTCIXSTGGCCAAACAAATTTGAGAAACRGTAACrTAAG 

TTTCCTGCTGAGTCAGTAGTTGATCCCAGAAGTTTAATTTAGCCAGCTACT^ 

AAGTCATGTGAG CGTGCACCCGGTGTGCATG CATGTACATATAC AC ACACAG ACACGTGTACAGAGTAT AAC AGGTCTT AGG AATT 

GGAACTCACTAGGAACGGCCTGCGAAAGACTACAAGCACGGGGCTTGGTGC^ 

CTGGTAGCTCCTTTTGGAGCAGCAATTCCCTATCCAACAGGCAGAT^ 

5 5 ACAAAAAAAAAAAAGCAAATGAAGAACGTTTGAGGGGCAGACTACCCATGAGGAAGCACACT 

GTG TG AG ACTGTG ATGTGGCCAAG AGCACACTCAG C CAGT ACT ACTGAATGG TTG CT ATTGGTTCT AGGATG TT CT AGG CTGTTG A 

GGCCGTCGGAGCTTGCTCACAGTTCAGAGGTCCTGCTCTGAGGAGTGACTTTTCT 

AGCAGCTAACTGGTGGTTTTAGATAATAACATTGCCTTTTGAAAGCAGC^ 

AACCTGGCTTTCTTTTTG CCTTCTCTATTGCAGTTG CATATGGGGCTCATGGCTrTAGGGATTTTCATGCAATAATTCCCAAATCT 

6 0 TTCTCTCGTAAGTATATGCTTTGCTTCTAGAAAACAC^ 

TTCTCTCATCAACACCTTCCTTTATAACTTGAAAGTGACATGGACTAAATGTTTGCGAT^ 
GTGTCATTGATACAATAAAGTGCAAAGACAGAGGGTGACX3GTC 
ATTCCTGGCTGGCCTAGACTGCTCTGGGATGCCCAGACTCATCTTAAATTTC 
CCTACCCCCTCCCTCTGTCCCTGCCCCCTCCCCTCCCTCTGCCTCT^ 

6 5 G ACTTTTGGG AT AG CATTTG AAATGT AAATG AAGAAAAT AC CTAAT AAAAAATTGG AAAAAAG AAAG AGAG AG AG AAAGAGAGAG A 

GAGAGAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAA^ 
GGTGAAGAAAGAACTAATGTAGCCACAGTGGGTGTCTCACTGGGAGGCTGCT 
TTTGTGGTCCATGCTTCCTGTCAGCATCAAACAAACCAACATCTTTT 
AAAGGTGCACCCTTTTTCCCTTGCTCrrGGAGTGCT 

7 0 GGTGGCTCGCCACGTAAGC^TGCACCATGTTAGCGTGCATGCAGCCTACCAGCTCTCCTCG 

GTAATTTCATCTCITAATACATTGAAAGTTGAAAATGATAACTATAT^ 
CTTTTTTTTAATGCAAATAAAATCAATGCAAAAGAGTTACAC^TTGTTA 
TAAAATGTTTAATCTTATGAAGCAGCAGCAGAAAGCAGAGTGTATTG^ 
CAGATCAAACCAAAGCACACTCTACCACCAAATTACACCCTAAGACCAGCT^ 
7 5 ATGTCAGCCGCGTTAGGTTTATGATGTCCTGTTATCAC^TATTACTTGGGAAGTACA 
CCTGGTCAAAGGCCAGATAGACTTCCCTCTCC^GAATATTAGAAGTTTGCTAAGATATTTCT

ATGTGAAGGTAAGAATTTGAGATTTCCTTCTTCATGTA^ 

CCAGGGCAGGAAGTCATGTTAATTTTGAGCCTGCCAATCACAGTTTTGAAATC 

AGCACATTCTAAGACACATTTTCATTAAGCCTGAGGAATGGGCAAAGCCC^ 

TCAGGATGGGAAAAAATGACAGTGGGAGATGTTCTCAAACATGATAAATGTT^ 

TAGTTCATAGTAATACTATCTCTTTTTCrrTCTCTCTTCCCCTCAGTTGTATC^ 
G AAG TT AAAACAAACTTTTGG CCT CCTCTG C CATCTTT ATC CCATTTTGAGTCC CT AG AT ACCC C C CAAAAATT AAAT ATG C CTT A
AAATTCTTCTTTCTTAATAAGTTTTCATTTTCTGATCT 

ATTATTT CTAAAAT AGG CTTGATT AGGT AGAAACCCT ATTATTTTAACT ATTTTT AAATG AT ATT AAGTCATG CAATTTGTTCTAA 

ATAATGCITACAATTTGTTCTGGATTTGAGACAAGGTGCC^ 

TGGCCTGGTTCCTCCTAACCCCTTGTCTCTGCCTCCCACGA.CACTGCTGGCT 

T ACATATGTGT CT CAGG TCTTCAC CT ATT AC CACAAAT AAATG AAAATG TACAAAT AT AC CTT AT AT ATTTT AAT AACTTTG CTT A 

TTAGCTACAGTTATTATAAAGATCTTTACACTT 

ACCAGGCTGGAATGACAACrTCAGAATTTTTTTTTCCA^ 

ATTAAGATGCAAGGGAAATAACZAATGTAGACATCAAATCTGTGACCCAGAGGCT 
CTCAGTTGCTCACAAGGGCACACTTGGCMCCAAGCCAGGCCTCCCA 
TTC^TTCTAACCTTTTCCTAGATGCTAGAGGCCTTGCC^ 
CTGGTATACCTCTAAAATACATAATGAGAGGGTTTTTTGTTTTCTCCA 
GGGGTTTGAAAATACCATGTTAACATAAGCCATGTTTTGTAAACA.CT 
GTAGGTTCAACCCAGAGCTTTGKSGGGGACAGGGGGAATATTCACAA^ 
ACTTCATCAAGCTTCCCTGTTGTTCTCAGTAGTAGACTAGC^ACAGAGC^ 
TTTCACTGAAGGAATACCACAGCTAAACAGTTCTAGAAAGAGT 
TCAGATTCCTTATTTAGCAAGCACCCACTTTGCATTT^ 
CATAGCAGGAATCrTTAAGTTCCCCATGCTGACAGTATAACCTGTTTTTTC 
GACATCACAGACCTTCGAAGTCAAAACATGGCTGGCAGTGT(^ 
CCTGCTGTGTCCCCACTACTTACGGTCCrTGGCCAC^ 
TTCAGTCTTTAACTTTCTGAT 



TCTATGTCCTTC^CAGCTATCCACCTATTTTTCCACATACATTTCTTACCCCCA 



AATGCCAAAGCCCTGCCTTCAGGTGCCCTAGTCACCCTAGTGATAG^ 

G CACCCATCACCTG ATTCATAACCTGCTTCCTCCTG ACTCTCC^ CTAGAGGAGAGCTGCT 

CCTGGTATGAGAAGCCTGAATGTTACrrGCCCATGCACCCTGCTGTCCTGGCCTTCTCT 

TTGTGCCAAGAAATCCCCCCAGAGGATTGCACATGAACTCTGCTGT^ 

TCTCTGCCCTGCATGCAGTGTGCTCAGGAAAGCAGGATCACAGCCTCCTTTACCTC^ 

ATCATCCCAACACATGTTCrTCCACAAGGTGATTTTTACCCAGCTGTGGAT 

CTTAGCTGTCATGGCTAAAAACTTCCCACATGAGCTATGATCCT 

CAGAGAGAGAGAGAGAGAGACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 

ATAACCAGAGGTTGAGATTTTTGTAGTTTTACTGTGCTGACGTAGTCT 

AGCATATAAACAATGTAAGTTCGTTTAGCTTGTAATTGTGGAGGATAAGAAGCTC 

AGTX^CG AGATGGGACATTGCTCTAGCACAGG CCTGT CTTC CTCT^ATTCAGCTTTGCGTTG CT AAGTG AAAACCCAAAGCAAGGG 

GTTCCCAGAGGAACGTGTATCTGGTTAGGCTCTAGAGGTTCCA^ 

GTGATGGGAGCATGTGTCAGAGCCAATCACACATCAAAAGTTGGGAAGTAGAA^ 

TTTTTGAGGGGACATCACTAGCCATCTAAGGACTTCCCACCAGGTCCCACCTCAGAACAGACT 

AAGTGTTTTTCCAAACCATAGCCATGTGAGTGCAGATACTGGTGAATTTGAATCTACCCAAAGATAAC 

GAGTCCAGTGTCATTAATCACraCTCTGTGAACCXSCAGACCTTTCCCCrCT^ 

AAGGACTGCTGATTTGCAAAGTTTCXX1ATCAGAGACTTGTAG 

TGTCCTGCACATGTGCGGAATTCAGCCTCAGAGTTCIATTCTCACTrC 

CCCATTACTAAGCCAGTAAGTCrTTTCTGOU^GAGCrAACAGGATAATATTGG 

GACCTCCCAGATCTAGAGCTAGCAGGTTGCTAGCTCTGTGACCATAGCAAGTTCTTTGACTGAA 

GATGGTAATGGTGTGGCCCrTTTTGAGTTGTTGTC^GGTTAAATGGCAAGTGCCATT'AAAGTA 

TACAGTAAGCACrGTTGGGATCAGGCAGAGCCTTGGTC 

CIAGCTGCCCCAGGCTGGCCATGGCTATGAAAGCTCCTGTATTTACAGATGA 

AAATCACTGCGCCTTTAAATAAAGGTGCCACCCCTCCAGGCCAGCCATTTTT 

GAGAAATTTGGGAAATAAGATTCTCTCTCAGGCTGTGAAGCAGGT^ 

AGGTTCCCCTCCAGGGTCGATTTAAACACACAGCTGAATTCACCCCIAAAATTTGGGAACCAGA 
CACACTTCAGACATCCrCTATGGCCATGTCCTGGCTGCTGCTC^ 
TATTATGTCAAATAGAGGCTGGAGATCCAGATCAATGGTTAATAGTAATTGACT^ 
ACCCACATTTGGCGCITCACAAAAGCCTGTAGCTCTAGTCCCACAAAGC^ 

CAT ACAC ACAAACG AAAAG CAAATG CCTG CTT CTT AATT ATG T CACAGT GACCT CACTGGTTATTTT ACTT AGAG CAG CATACCTC 

CCTAACTCTTATGCACCrAGAAAAAACACTGAG AAATTATGG CTACAGTTTCCACTG CCATTG CATGTGGG ATTCACCACTGT ACC 

ATTAGTGTTTTGGGAGCATGTTTATGCAATGTGTTGTATTTTCTTT^ 

TGGGACTACAGCAGAGTCAATAGCGGTGTTTGTTTGTTAACCTACAAGCATCTTCTTCT 

TAAACTATTCCAGCTCACCTCTTAGCAAATGAAGGAGATGTGAGGACACCTC 

AGATAAAAAAGAAAGCCAATTGCTAAGAAAACGGAACTTGGACAAGGGA 
AAATTTGTTGTGCATATGCGTAAGAAAATACTCACCCGTT 

CATCCACACTCTCTGGAAGCTTGKn'GCATGCACCAGTOUSACTGCCCGC 
TTTCAGCCAGTAATGTTAAAGTAGAGACTCAGAGTGATGAAGAGAATGGGCGTGCCrG 

G ATTT ACG AATGCITGATG CCT CC'GGAGAGAAAATGAATGG CT GACCAAGG CAG CT CGG CTTTGTC AGG AG TTGG AGG 

C ATTCG ACTTCCT AACGGAAAACT AAAGTG TGAT ATCTG TGGG ATCGTTTGCAT CGGG CC CAATGTG CT CATGGTTCACAAAAG AA 
GTCATACTGGTAAOGCCTGGGAGTTTTCCCT^ 

CTGGGGATGCAAGTTAGTATTGTAGAGCACCTACCTAGCACACACAGTCCCrrAA 
ATGTGGTCTTGAAGCATGTTCCCCATAGACTAC^^GGGCGGCACTTGCTTATAAA 
CTTATTCTAGGG CAGTGTCTGT^AAGGTCTTCTCAGCATGT ACCT AATCAACCCTGTGAACG AAAGAGAAGG CT AACAGCATGTGTG

TGTCAGCITCTGAGACTCTGATAGTATACTCGCTAGAAAAAA 

CCTTCACAAGACGAGATCAAAAGCTAGAGACACACCAAAACTGACTTCACAAATT^ 

TTAATCTTGTTAACACTTGGGATTCTCTATTCTGAGGCATTAGTG 
5 TCGGAAAGGGTGTAGGTCAGCATAAAGTGGTTCACCCTCCAGTCTTGAGTCT 

CACAGAGCAGTAGGGGACTTTCTGTCAAACTCCTATTACGGTAGAAGGTTTC 

CTATTCAAAACTAAGAAAATGTTCAAGAGCCATGCCTACCOT 

TCTACTCCTG1TCCACATGTATATGAACAGTCAACTCCTTGTTCAACCTAACTC 

CTTCCTAGAAACATCAGTGGTGTTCAGGACAAGTGTTTGCATGGTATAAGGTTTCTTTG 
1 0 C1T CTTTC CAT AGAAG1TTC1TGAAAGGT AG AT ATTAT AAC CTGG C AAATTCACCT AACT CTT ATAGCCACAGAAAGCT CACATG T 

AAAACGTATAGAGCCTCGGCTATCACTACCCATACACTGGCTT CTTTGGGGCCTG AG AAGG CTGGCATCTGCTAGTGCAGTGTCCA 

GG AGGGAG AACAACACG CACAGATGAATG ATAGCGAG TTTT ACT AG ACT ATCTCAAGAAG CTGAGGAAGATGG CTTGGAAAGG AC A 

ACCTTGTCTGTTTCTTACACGGGACTCAACAAGCGTGCGTT^ 

TCATCACCAAGGAATGAAGGCGTTTCCAAGGTTGGGTTTTAT^ 
1 5 TAT ATGTACAAATG ATTTATAAGATG ATACTT AAAT AAATGTTTGTCT^ CT CT AAAGTTAGG C AAATTTGG C 

TTTTCTGAAAG^AGCCTTTAGTTCTCAAGTGACTCGTAGAAA 

ACCTTGCTCTTGCCTTCTGTGAGGCAGTGGAAATTAACATC^AG 

ACCATTGCTGATAGAGAGCAAACCTCTGACTGTACAGTAAAAATCCC 

AATTGCTAACATATGACTGCCTGCCTTC^CTTAAAAAAAAAGAAGAAA 
2 0 GGGAGGGTACCTGATATTTGCTACATACTTTCCTCAACAAATAGTGACTAAGTATCATACCTAC 

TTATGGAACAGATAACTTACAGAAAGAG&AAACAGGAAGAG 

GCCAACTCCACAGTATATCTCCGTATATGTGAATATACAAATCCCCAGGTAGAAGTAAAAACCCCATTT 

TAGCTAGACAGAGCAGATGGGCTTTCTCTTCCAGGTCTGGGTG^ 

AGCAGGTTATCAGAACACTGCCCACCCCTGTGAAAGGC^ 

2 5 TCTC<K3AGAATCTGGAGAGCCAG(2ATTCCTATGAAGTCCTCGGAACX3 

ATGCTGCTGAGGACAGGACAATAAGCAGAGCCATCTCTGTGGCCTATGA 
AGGATGTGCACCAGAGCCTCTCTCTCTTCTTCCCATTGCCCATGACCCAGT 

AACAAAG ATCT CAGGGG CAT AG CCCAG CTT CT ACTCCTCTCACTGGGTGG CATT CCACTG C CG CCATTTT CT AAATGGCTT GTGGG 
CACCGGTGTGTCACTTCCCCTTGGGTGCTCAGTTGGTTTCATGT^ 

3 0 AGGGTGCATCTGAGGATCACCCTGGGGTGGTAGGGAGCT 

AAAGACTAG^IAAGTGGCCTGAGCAGAACAAAGAGGCTC^CTTCCTGTGG 
TACACATCTATTCTTGTATGCC!AGGCTITTTTTAAAATCT 
TTTTTAAAGAAGATATTCTTTATTACCCAAGTAACAGGTCGCA 
ATTAAATTCAAAATTTGCAGTATCTAATGTCTCATCCCAGTTAA^ 

3 5 GGCrcTTGTCCTCCAG^AAACAGGTCATGGCCACTGTCAACAGAA 

TTATCTCTAACACCTGCACTGACTGAAATACCATATG AT CCATTTT ACrAAATGTGG T CTTT AAGC CT C CCCAC CATGGG CTGT ACA 
TTCATGGCCCACACAATCAATTGGAAATCAGTAGATTTACAGT^ CAAGGCTTGAG ACACCAG 

GCAAGCAGTCCCCATGGAGAGATTTTGACTCAAGTGAGA^ 
AGAACAATGTGCTTTGAATTATCAAATTCAAAGAAATTGAAAGAAATTG 

4 0 ATGGC GAG AACACT AGCT CG AAT CACATCAGGG AG ACAC CTGGAGCGC CT CCCAG CT CAGAAAGGGGT CACT C AG CTG CT AAG CAC 

GTG ACITGGCCCACTTGGATCACC AGG AGTGTCTTCTGAAG CT^ CAGCCTGAGGGC CCAG CCCTCTC 

TG AGGAGCAG CT ACAG TGTTAGGC ACGTTC CACTTCCCCAG CCGG AT CTT CCT G CAT ACACT G CTT ACrTTTCTTGTTT C CG CTCC 
TCC^GTTGGAGCTG AA<!JVCTTTGTGAACAC!AGTGTCCGCTGAGTGTG C ACTTGGGTTTCCCAGGCGAGGTG ATGGTTAGTGAGG CT 
TCTAAATGCTAACACCTACAGTTATCGTTC1CCATCCTCCTTGACGTTAAACAGTGTCGTCA 

4 5 AGGAATTTGGGGAAACGTTGAATACTTACTGGTTAGTATCAAGAGCCCT^ 

ACTTTTGTtnTTG CAATG CC GACTGAAAAAG AC CT AG AG G T AATT CGGAAATTCAG G CAC CAG T ATG C ACATT AAAG CTT AACCTG 
GCACAGCTTTTAACAGTACCACCTOTGATAAAAAAAACTCTCTGCTGCCCCT 
TTTTAACACATTGTTCACCCTTAAGAGAAATGTGTCTCGGGACATGGGT^ 
TTCAAAGGCAGCGTTTATAAGGCACIATAATGGACACACCATTCCTGAGAGTTAAA 

5 0 CACACACTCTTTACTCACAAGTAGCTCCTGCAACCATCTCCAAA 

CCCTGGGCTGTCTCAGTCACAAACCAAGAAACCTCCTTGGAAACGCCCTCAATC^ 
CTACTGTGGACACAAAAATGAATTCGAAGGATCTCTGAGTTGTC 

T CT CACGCTCACACTTTCT CTGT CTG TCTCTCT CCTG ATTTG CCTGCTT AGGGACAGACAGG T AATGGAGAAGAAATTGAAAGGG A 

5 5 AAATACAGGGAAAATGTAAATGAAAAGAAGTTATTATAACTGTTATAATGAAGTGCCTT^ 

AAATGAACTTGTGATCCTGAAATGCATTT1TAATGAGTTTCC1TT 

AAAAAGCGACATTAAAAACCCCACCAAATGCCACAACACGCA 

GACTCTTCATCACAAAATGTTGCAGTCACrTTATTTAAA 

AAATTAGATTTTCAG ATCAAATTG ACCCAG CC AGTX3AAGCGTTAAGCAGCTGGCAGGTTT AGTCTGAAAGC CTCA 
60 ACCTGCAGCCGGTGGAAGTCACCAAGCCCATGGGAAACAACTTTCTCGAGGCCGCG 

ATGCTACCCTCTTCCCTTCTTGCCCAGGTGAACGGCCTTTCCAGTGCAACCAGTGTGO 
CTGCGGCACATCAAGCTGCACTCGGGTGAGAAGCCCTTCAAATGCCATCTTTGCAACT 
CGGCCACCTCAGGACGCACTCCGGTATGTTCACTGAGCATGGC 
AGCCACTGCCCTTCCAATCCACCCAGTCC^CAGCCCTCrcC^^TGT^ 

6 5 AGGAAATGGGGGGGGGGGGTGCACAGATGGAAACCCTC^CTCCTTGGTGTC 

agtgggtatttttctx3 aagt at actg ctcatgcccttaaccttccatac^catccttttacacctattg ctgtgtcactcatatca 
cixk;cttatgtggaatatattgactagcagtaataaacgtaattattttcgt^ 
tcagcccagggcacactgcccc^ccagagagtaacttccaggctggaaaccagcagaggct 
ttgtcagcaaactgcagacaggaaaatttccctgtaaggatg^ 

7 0 ggacagttgtcagaatgttcccaacctgtcacacagactcatcttta^ 

cagtagccctataag cctccaggtctgag atatagg aaataaaaact atatgctatgtttg cttgcaatct agtatttgtaacatg 
catatgtcgcttccattctgcactgtcgtgtggtgtcacx5tgct 

CCCTCCTAGAGTGCCCTGTCATAACAAGCTGTCACTCT 

AAGATCTCAAGTTATCCATCTCTTCGCCCCATATTTCCTAAGCCCCTCAACAGCGTCCT 
7 5 AGGGTCCTGCAGGAGCAGAAAGAAAAGACAGTCreTCCTCCCTGG 
GGGCCAGCTCAAGCTAAGTTTGCTTTAGAGGAACTCT
TCCAGAGCTGGGCTTTCTGGTTAGTGGTC^CCrc 

CTGTACTCTACTTCCCTACCTGACTGGCAGTAAGCAGTCAGGAGACTCCTAA 
TTCTTGGCACTGCCAAGAAAGGTGGTGCCCTCCGTTGAGGAGCTGTCCT 
5 CAGACACCTCCCnGTCTTAGGTAGGACACTACCTCAGATCTGGGA^ 
GGATGGAGGTAACTCTGTCTGTGTGTAGCTGCrCC^ 

GGCAGGTGACTTAGGAGAAAGTCACAGAGACACAGTAGGCAGGTACCATGCACTCA 

ATGGAAAATTCCAAGAC CCACTTTT AGG CTGAT AC CAG TCACAG CTlCTAT CACCCTTT AATT ACTGGTTT ACAGT AT AAATCTTT C 

CTGCATGCCCATCCTTCAACTTAAGATAGCAAGATAACATCTTGTGA 
1 0 CATTGCATTTAAGGCATGGTGGTATCCTCCTCTCGGTGGGGCTAATGATTT 

AGTTATTTCTG CCACAGAGTCACCi^TCTCIXKSTAAAGATGTCGACCAG AG ACCTGGGG CATTGGTGACTCGATTGTCTGAAAATG 

TACTTAACAAACACAGCTCTGGGAGAAAAGAAACACTGTGTTTCCT^ 

TTGTCTGCAAAGTCCAGGGTGTTATATTTTCTTAATGAATGGCC^ 

CCTAGCATCCTGTGGTGGTCTGACGTTCTAGACTCTACTGTGATGATGGAATTTC 
1 5 ACTACAAACITTGTTAGAGTCAGTTTGGTTCTCGTGTTCT 

GAATATAAATGTGCAACCATATGTAATATTCGTTCCATATTTCATCATTTGTGGTACA 

AACTGGAGAGACGGCCCAG CAGTTGGAGCACTGGCTGCTCTTCCAG AGG AG CCACATTTG ATTCT AGCACCCACGTGGTGATTCAT 
AACAATCTGTAAAGCCAGTTCCAGGGACTCCAACTCCCTCrTACA^ 
2 0 CATACAGGCAGGAAAAAAACCCATACCOSTGAAATTAAGTAAAATAAAAATCTGTTCTG^ 
TTGAGAGCATCITCATAGCATCTGGTCACTTAGGAAAGAGACTTAA^ 
TGGCTTTATATCATACTTAGCATGAGAAATGCAATGTGATGTTGAGGTGAAA 
TTCTCTAAAAAAGGGTCTTTATTGAAGTCTTTAAAAACCGATATTATATGGAC 
AACCAGGAAGGACTAAACTCTTGTAGAACCACATTGCC^ 

2 5 AACAAAATTAAAATGTTAGTCCTAACATTAATCTAGCCAAACnTAGAAACTCCATGTAACT 

GTGAATGCCAAGCTGAACAGATATGCAAAGAACACTTAGAGTGCTCAAGTCGTTA^ 

GAGTTT CAT CAT CT CATATTCITTGGGGATAGAATAG AGG CTTTTCAT ATG CTCTT CCT ATGGAAAACACATTGT AT ATC CTCCAG 

GTATATGAAAATGAATTTGGCTTATCCATATCCCTTTTCATTTTGTGTTTATATAT^ 

GTATGTACTATAGTGTAAAATATATTTGTATTATATAGAAAGATACAATTO 

3 0 TCATTTCTTAGAGACTTGCGCAACCTCATTTTGCCT 

TGGGGAGAGGCCCIACCCrGACAGTGAATCrGTACTCrCCTCATGTTTCT 

ATTAAAGGAGACCCTTGAGTCCGTGTCATAGGTTAAGATGAAATTCATGAATTC^ 

CA' X ' l ' l ' l " f AAAATTCATTAATTTTGAC!TAAGTCCATAATAGTCATAGCTCA 

GTT ATAAATGT ATTGATTG CATCCCCATT AC C CAGAAGG CC AAT ATTT AATTGG AG TCTT AAACT CAATTGTGTTTT CTG CAGTT G 

3 5 GT AAGCCTCACAAATGTGGATATTGTGGC CGGAGCTATAAACAGCGAAGCTCnTTAGAGG AGCATAAAGAGCGATG CCACAACTAC • 

TTGGAAAGCATGGGCCTTCCGGGCATGTACCCAGGTAAGCACTGAGGCCCTC 
CCAGGATGGGGCTCTGTGGCCTGTTTCAGCCACAGGAGGAACTTT^ 
TTCTCTACTTCCCAAGTTCTCAGTTGACCAGTCATACATAACCTGTCT 
GTGTTGTGGGGTATTTTGTCGTTGTTGTTTGCGTTGTTTGGTGTGTTTTTTTTCCC 

4 0 AATTGGCTAAAGCTATAGCTAAGTGATGAATGAGCITG 

AAAACACTAATTACTCATGAAGGTCTTCAAAAAAAGAACTC 

AGAGTTGGAGAGAGCAGAACTTGTTAGGAGACCTGGTTGGCACTG^ 

TACATTGTGGTGAGAATCAGTTTCTAATTTGTCCTGCCTGGTGTCT 

4 5 TCTTTCTCAAGAAGTGAGAAATTACTCTTGCAAGAA 

TCTTCCTGCCTCTGCCTCAGGGCACIATTAAGATACACTAGATGATTATAGTTGC 
GTAATGGGTGTCTGAGACAACCGGTAATAATAGGAAGGAAATTCTGTTTT^ 
CAATTGCTTTGGCCAGGTCATGCTGGGAGCACAGGTACACTGAAATCTCCTGCCAG 
GAAAGAAAAGGTTAGGGACCCATGCTCCTTGTTGAGAGCATGTCCCAATGATTTAA 

5 0 TTCCCAGCATCTTTTCCTTGGTTCATCCTAGAGCCAAGGCCTCCCACA 

TTGCCTAGGTCATGTCTTACTTCACTATTTCACCCCTGAAA 
GCATTGAGGCTCCATCCTGCCAATAAACATGTACATTCTC^ 
TACTCCCACGGACCTAGTTAGGTTGGGTCCATTAGTTGAAAACTGCCTAACT 

GTTTGTTTTT CTGAAT ATGATTTTCTTT AG T AAAAAGG TT AAGGAT ACCTG TT ACATTTCACCTAT AT CTG CAT AGGAT AAGG AAC 
55 TATCTCCTGAGCCTCACATGCACCAAGGACAGAGGAAAAAGTACTGGGCTCAGAGGTGTGGTATGAT 
ACCTCAGAGAAGGGAAGATATGTAAATTAGCTTGATAAATCACATGT^ 
• TTTCCT CCT ATT AAT CAGAGTG GG CTGAATGTTG AGCT ATTT AAC AAG ACACCCT CAT AATTCACTGTT AA 
GTAGTATTTGACTAGAAATGCAGGTTTTAAAGTGACX3ATA 
CAAGTGGCAGAAACTACCAGGTCATCCTG'GGCAAGGGGGAGATGTAG 

6 0 TCCATTTATTGTAAAAAAAAATCTATC^GAAAAAAATATATTTAAATATTGA 

TCAAGGAACTCTTTGAGGAGCATGAAGATAAACTAGATAGAACCAGATAAACCTAT 
ACAAGGCAGGAAC^TATGGGAAGAGCTGTTTAAAAAGTAGAAGCTGATCACTTTAC^ 
ATATATTTTAGAAGTCCCTGCCTTGCAAACTAGGTGTCAGGATAGAAGAGACT^ 
TGGGCAGGGAGAGTTGGTATAGAAAAGCAGATGGAATGAGCTAACCTGTATGTCTGTGTTTACCTTA^ 
65 CCCTGTGTGATTGCTGTGTGGTCATTAGACATGTGCTGTGTGGTTGGGGACT 
TCATTTCTC^CCAGATGATATTTGGTTCTCATTGTTGTTGTTAAATTTGG 
ACAGTTAATTCACTGTGGTCCAACCTTGTGTGGTAGACAGCAGCTTGTTATTTT 

TGGCACITGACAAATGTGCAGTTAACACTGAATACAGCA^ 

7 0 TGACTTTAGTCATTAAGGAAGAAACTAACCACAACGA 

GACAGGCTGGCAAGCAATGTCGCCAAACGTAAGAGCTCTATGCCTCAGAAATT^ 
ACCTAGAATTATGTGGGTGTTCTAGACAAAAATTATTTGAAAGTAGAAGAAAACAAA 
AGCTTATT A TTTCTTAAATACTGTAAA TT TTTTCTAATAATATTATTCAAA 
CATAGCATTTGGAGCCTTTGGTATGATGTTTCTCTGGCCGCTCTGACAT 
7 5 GGGCTAATAAGAACCAAACAGAGATGAGCCCCCTCAGAAGACAGCGGAAGGGTACAGCT 
TTAC CTG TC CTTGATACT ATG CGGAAT AATG GAGAGGGAATGAGTTGTTCATGAGG CAC TGTTGCCTGC CAG CT AAGGCGGACACT 
GT AAGGACAAGGAAGT AG CTG AGACAG ATG CCTG AATGGCATCACT AG CCTCAAAGG CAAG CTTT AGGTCAGAACAGAGATG CTCG 
TAAACTCTATGGGGGCAACATC>GGGGAAATGACCCAACCT^ 

AAG CAT AAAATGCCCC CACAGATAT AAAAAAAATTT AGAAG CC CTCAGT AGGAAGAT CAG CT ACT ATCATTGTCAAGAAAAG AAAC 
5 CTTATGTCCAAGAAAAAAAAAATAAGGTGTTTG AGAAG CTG AGGTGGACTTTGAGAAAGGGGCAG GACCT CAGCAG CATG C CAT AC 
AGTGAAGGGGTCATGGAGTCTAGGCAGACAAGGAAAATTACTTTC 
GCTTTGCC^IAAGTACTCACTGGAAAGCAGTGAGTAAAGCTGTCCCATGATC 

ACTGGG CAG CATCATCTCATTTGGAATCAC AGAGAAAG CAGGGGT ATGCTCCCAATGGG AG CAGAAG CTGAGCACATAAAG ATG AC 
ATGGGT CAAAATT ATCTGGACATGC^AGATGGCAGG AGAATC CTTTGG TGT AGAAGGAATGGAGG T C 

1 0 ACGGT AGGG CTC^TGAAGGAAACTGTGACTTC CCTTG CTT AGTGTGCCAT^ CAG AAAAGACAAAG CCACGT CTGGAAAAT AAGA 

GTTCAGGGTTTGATGTAATTGCTTTTAAGAATCCTACCCTGGTAAGGCTACC 
AAACGTTGGTGACACACACACACCTTTACAGAGCACCTAC^ 
GAGGGGAAGAAAGACCACTAGAAAACAAAACAGAAACCACTTATTAGAAATAT^ 

TTG AACCIAAAAGAATGAAGGAGACTTCAT ATC CTGATCAAAG CAATCTTGGT CAAGGAGCCAG CTCTTGGATGG CTGGG 

1 5 GCTTTAGAAAGTATGAGGTGCAGGGCCCTATCTTCTCCCTGAGGA^ 

TC^CAGAGATGAAGCAGCAAATGTGTTCAAGGGTTGTGAAGAGTAACTGACCT 
AGGAACCTGAAGAGTGCCACATGCCTrAGTAATGAGCATCGG 

GCAATGCTGAGAGGGGAAAGCGATCTCCAGGGGAGAG 

AGTGCCTCTTCCTGCTCTCGTGTCTGCCTTCCTCTCAGTGCT^ 
TCTCCCTAAAATGAAGAGAATTGGACCTCAAAGGCTGCrCCCCACATT^ 

AATAGATAGTAAGAGTTTTGATAGAACTATTTCATGT l"rT ' l ' T CAG AAAACCAAATG CAAC ATCCAGAAATTACCAGTATTAAATT A 
GAAGAACATCATTGTAAATATCTTCCTGCCATATATATTTCAGTGGACAC^TTAATAAT 

2 5 GGAAAGGCCTTTAAGTTCCTGAATGCGCCACCTCAGAAAGATTGCAGCTGCTCTTCCCT 

TCTTCCCGTGTATTGATTGAAAATCAGTTCACAGTGGCCACAGCATATTCAAC^ 

GTCATTCCCAACAGGACTTGACITTTTTAAAAGGATGACT^ 

TTTTATTGCTTTTATTTTTACATTGTACATTTCT 

CTATATATTTTAACAGGTTACAGAATCAGACCCTGGTCTCTTAGGCTCATTTTCGT^ 

3 0 CTTTACATTCAAGCACCCTTGTAGACATTCCATCTCCATCTCT 

CCCAGGAGTACGTAGCCTCTGGCCACTC!TTCCTTCTCTGACATTGGC^ 

CCTG ACTTTCTTTT CITTG AT ATT ACTTTTCTCCCAAAAT ATGT A CAG T AAGA 

AGCTCTTTGGTGTTTTACAGTCTACATGAAATGTAGCCTTTA^ 

CCTTTACATAAGTACCAATTGTTCTATCTAACATGACTTCTATTCA^ 

3 5 TGTCTGTT AG CATTTTG CT ATTTT AATTTCTTT ATTTTTTT AAGTTT ACTIT ACTGTT AT TTT AT ATTTTT C CTG ATTT AATTG AGT 

TCCTAGCAATGTTTATTCTAGTGTTTGCTATTTTTAACTCATGTATTGAGTATGTATTG 
TCrAAAATCrCCCrTTATTCTGGGTTG ^^ ^ 

TTCAAAATCTGACTGCTTGC!AT ATT CT AAAAACT AAAAT AGCT^^ CATG TCATG AAGT AATT A 

TGCTTCATTGCTCCTCTTCCTC^AATTAAGTTTCTTTTTCC^ 

4 0 AAAAT ATTTTCCCTGTAACCATGGCATTTCTTTT CATCCTG CTCATT CT ACTTG G T CT AATG AGG CTTG ACTG ACATTTCAAC ATT 

CTTCCCATTGTACCTGAGACTTTTCCrrATGAGAACTATAAGTTTTT^ 
CAGGTGAGGCCAAGTACATACGGCrrCTGTTTAAAGTC 
AAACCAGCAGATGATGCCTTCTGTCTCCACAAGGGACATCTCAGGGCCTACT 
GTGTGCAGGGGAGCCAGCCCACATGCTAAGGTCTCTGGTGTC^TC 

4 5 AGGAGGGCATTTGCCCATTGGAACTCTGTCAGTGTGAGGGTTTGGAAGGAATGCTGAGAG 

CC AAAGCTTTGT CCAGTCT CAGG CAAG AATCTTCTGTAC CTTT CCTTGGTCAT ATG T AGTTG AATGT AAAGT AGAAAT AAG AAG AT 
GGATGGATGGATGGATGGATGGAATAGATAGAGATAGATAGATAGATAG^^ 
ATAGAAGATAGATAGGAGGGAGACAGGGAAAGAAAGGGATGGATGAGTA^ 
GATAGATAGATAGATAGAC^GACAGACAGACAGACAGACAGACAGACA 

5 0 GAGG CTTTT AT AGCT TT T AATTTTTCACATTTGGGT AG AG AAATTCT ACAGG T AG ATTTGTCACGCCATG CT CATCTGGAATG TAT 

ATTGCTTTCTTTGCAAGACCCACrTCITGAAGC^ 
ACAGCAAAGTACCACCTCTGGTGAGAGAACACAGACITTTACAGCT 
CTAGGGTTTGGTAGTAAATGAGATGCAAGTACAAGAATGAGTTCCTCATGTC^ 
AAACAAACAAACAAACAAACAAACAAATAAACAAAACTCCCCCTTG 

TGTCG TT ACCAG AG T ATGCGTTGGTTCTGG C ATG AGT AG ATG AAAG CAAATG AAAT CAAG CC ATTGTGTGGTG CAC AC CCAGG AAG 
ATGAAGAAAATGCAATCAGGAGGATACrTCAATCAG^AGGAATATATTCCAGGGCTACTATAGTC^ 
ATGGTTAGATTTATGAATTGTGGAATACTTTTCCTACTGTAGATAAAGAGTTTGGGGG 
GACACTGGGATCATCTTCTACCACCACTCCATCAGCTGCCTTCAGGTACCACCTGATTGAGCAGCCTGTCAC 

6 0 CTGTTGCCCCACTGTGGCTGGGTAGTTCA CTTfC^ 

AAGGTTCCAGTGTATGATCCCAGCTGTACTTGGACATTCCCTATGCCCCCAAGAGCTCTGTC 
AAGCCCCAGGCTCTCTGTCCCCTCC^ATCTTCACTGCTTCTGTTTC^ 

GATG AG AAACCT ATGCAACACTCTGGT AG CAAG CCTTCCTGAGCTCG TGGG ATTT CAT AG AT AGGGATG G AGT CTGCTGT CAT CAT 
TTAGTGTGTATCTCTCCTGCTAGCTGTGTATATCCAAGGATACATATCCATTCTCACT^ 

6 5 CAGACATATTGAAGAGAGGAAAAAGGCATTGTGGGAGGTGAGGTATAAAAGTGC^ 

CTCGCTGAGAGGGCACTCAGTGACTACCTCTGCCAACAGGAATATAAACT 
ATTTCACATCAATGTATTTGGAAAATAAATGTTTAAGGAAATTC^ 
GAAGCCAAATCTGGTCTCATCTTCTTCCATTGCCAGGGAAGATTTGTGT^ 
ATGTCCAATTTGTAAAAAAGAAGAGGTTTGACTTAAGCCTTGGGGT 

7 0 GTGGCAGGAATGTTGTCTTGTGGCTGTGAAGCAAAAGGGAAATATATG 

CCCCAAGTCrAAAATCTTCTCATTAGGCCCn'CrirCCTAAAGAGTTGAGAACCTT 
AAGCTTCTGGGACATTGGGTTCAGCCACAATCATAGCAC^ 

CCAGACAGTAGCAGAGTTCATGTAGTCCTACAAGCTAACCATACC^GACCATAAGTTTCTTGAGT 

ATTCTT AG TT CTG CT CATCTTT AGT ATT AGCl^'T\iTTTGT AAACAGTAAC TGAATGG TG C CCA C CT CCAAG CCT ACTCAAGT AGAT 
7 5 CATCTCTGTTTCCTGCTTCCTGTTTCTGCTGAGCTAATTCACTACTGCTC 
ATTCCTTCAACTTCACCATGCATTGGAACCCA^ 

TCATGTTTTGAGAACATTTTTAGATTATTAAAATGATAAATGTAATATAGTATC^ 
GTGTATTTTTATTAAATAATTTTGGGATAAATTAAACAGTATACAT^ 
TATGAACTTTATCATACTTAATGAAAGATGGGATTTACCCCrGTC 
5 CTTTTTCATTAATGGGTAAATAGGACTTCCTGAAGAACCT^ 

CCTGGCAGATGTGTCCTGTCTGCTGTGACACTAi^CACCATTCAACCC 

CAGGAGACAAGTGCCTGTCAGACATGCCCTATGACAGTGCCAACTATGAGA 

GCCATCAACAATGCCATCAACTACCTGGGGGCTGAGTCCCTGCGCCCATTG 

AGTCATCAGCTCCATGTACCAGCTGCACAAGCCCCCCTCAGATGG CCCCCCACGG TCCAACCATTCAGCACAGG ACGC CGTGGAT a 
1 0 ACTTGCTGCTGCTGTCCAAGGCCAAGTCTGTGTC^ 

GAGAGCAACGCGGAGGAACAGCGO^CGGCCTTATCTACCTAACCAACCACAT CAACCOT CATGCACGCAATGGGCTGGCTCTCAA 
GGAGGAGCAGCGCGCCTACGAGGTGCTGAGGGOXSCCTCAG^ 

AGCTGAAGGTGTACAAGTGCGAACACTGCCGCGTGCTCTTCCTGGATCACGTCATGTATACCA 

CGGGATCCCTTTGAGTGT AACATGTGTGG TT ATCACAGCCAGGACAGGTACG AGTTCT CATC CCAT ATCACG CGGGGGG AG CATCG 
1 5 TTACCACCTGAGCTAAACCCAGCCAGGCCCCACTGAAGCACAAAGATAGCT 

TTAATGTTAGTGACAGGATTGCATTGCATCAGGAACATTCACM 
TTCCCGGATATGTGGCTTCCTAAC^CTCTCCCCACCCACCCCACC^ 

ATTATCCAAGAAGGAG TAAGG TGGATCTTG C CCTAAGCAGAGTTT ATG C CACAAAGATTCT C CAAAT CCCC CAAGACAG CACAGCC 
2 0 ACTGGGGTTGAGCCATCTCAGGGAGCTCTGCAGGTGAGC 

• TGTGCAGAGACCAAGGCCAAAGGTTGAACirTGAAAGACTATTAAGTCA 

CAATGTG CTGAAGGGAAGGGATTGGAGAGCCTTGAAGACTCT^ C 

CAGTAAGACGAGCTCAGTGCTCTTGTGATCATTTTTCTCTT 

TAGAACCCAATCCTGAAATTGAGACCTTTTTTTAAACTTCTGATACT^ 

2 5 ATCCraVGTTATCCTGGTTCCTGTTCTTCCCCGAGCrcATAGTACCAGG 

CAATTOIATCCCTGTTCAGATTATAGATATGGACTTTGCTATCI^ 

AAAACCATT ATT AAGAATG AC CATTTTT AGATGG AAGAAATGAG C CC AG CAT CT CAGTGG CT AAAACACAAAAT AT C CATG CTTTT 
AAACAAAATTGTTAAAT ATTC CGAAGCTCTCT AGT AT AAACACC AAGT AG CATGTG TTTT C ACAT AAAG AAG ACAGGGG C CATG CA 
ACCTTTATCAAGTGGAGGTATTAGAATGTTGTAATGTTTGGAGACACAGTGTG^ 

3 0 ATATCACAGAAAGGTAGAGGTGGGATCTGGTATAGCCAGACCAA 

TCTCAACAGGAAGGCCTCTCITAAGAGTCAGTAATAGGGTTGTTACCATCCACC^ 

ATGTTTACAGCACTGTCTTCGCAGAAACTTTCTGAGGTGATTCCAAAGAACT 

AACGAGGTTCTTAGTCTCAGCTCTCCTGACATATAGGGCTTGATTATT^ 

CCTGGTCrCTCCCCAGTAGATGCCACX5AACATTCTAGCTGATACCTAACTGTCTTCCCAGGTGTTC 

3 5 TAAACrCTAAACGCrGAAGTACGCAGGTTTTCTAAAAATGAC^ 

GTGTCTTTGGAATG CTGAT GAATT CT AT AG AT C AGCATTG CAAAT ACACTT CAAAT ACGTCTG AG TTCAAGTG CAGGG ACTG AGTT 

CACCAAGGTGTGAACTGTGCTCAAAAAGTTCAAAAGTGTGTGTTTCTTTG 

TAAAACTTTTTT1TTAGAAACAAATGAAGCACTTGGAAAGT 

ATTTT AGGAACT AT AG CTGTG AAAAGATTTTGTTCAAAAGGCAGG CT AGCCITACCCAAATTCATATATGGCAGGTGTCAACCTCC 

4 0 CAAGCTTACAGTTAGCAC^CAGCTTTTGCTC^CT 

CCCTACCCCCC^CCCACCAGTGGTATTCAGAGATTAAAGCAGAATTGTAAATAGTGGCT^ 

CCCCTTCCTCTCACTGCC'l'rri'l'lAGCC^VATATAAATGTa^TTTGCACACC TTTTGTTGTGGTTTTATATTGTAACAGCATTTTT 
TTGAAACT ATT GT ATTT AAGAT AAGGTTT CAT ATT ATG TCCACAAG T AATT AAATTATGTTTG AAGGT GG CTAT ATG CTGTATCAG 
AAGTTGATG AT GTTTTT CTTT AG CCGGTAAAGGAGGGTTTTG CATG ACCTCACTGTTTG TT CTGTGGTTT GTTCTGTTG T ATGATG 

4 5 TGTGTCTTGAGTTITGCTGTGTGATGAAGTGCGCTGAG^ 

GGTTCCTAAGAGCAGGTTGACCTGTTGGCGACAGATGGCAATCACCATTTCTCATTCCT 
TCCCAGGTCCCTTCTGTGATTCTTACCTTAGTGTGCATGTGTGTCTGTCCTGGTGAGAG 
ATTATCACCAAGGGCACGCACAGCCTAGCACCTGTTGCTTCAGATACCGTCACACTCrG 
AAAATGTTAGAGTCTTCAATGGGAAGCAGAGGTGCTTGTTATAAAG 

5 0 TCTTTTGTTTGG TTTT AACTTAATTGTGACCCTTGTAACAT CAT CTTGCC^ 

ACATGTCAAGACGTACTATCTATTTCTTTGTCAGTTATAGCAATAAGA^ 
ATCeXTGTTCTTTCTATGA'lTTT'rCl'lT'CriTTATGGTGA 

ATGTG CATCTGGTGCTATGGCACTCACTGTGAATAAGTGTG AC C AG ATATTAATATGC!AAT ATTGTTTCCAATCCTTTCTAATACA 

5 5 TGTGAGTAAGAAAATGAAGCCATCTCTGCAGCCCTGACTCTAGCATGATATTAACT 

CCACTGGGAG<1ATAGAGGAGCTATCCTTTTGTACACATTTTGCCTTAGATCACTACA 

CAAATATTCTGTCTGCCCCATC C^GTGAAGTCCAAACACAAGCATAAAACIAAGCCCCAGTCATTGTGGGAGTCTG CTGTAGCCTCT 

CTGAAACCAGGAGGTGAGGGAGGAGCCCAGGCCTTGACACAAGAAGTGTGAGCAGATA 

AGCTGGCCAGCTAGGGAGTCACTGCTAAGCCTCTGGACCTCAGCC^ 

6 0 GAATGACAAACAGAATGACAGCTTTCGTGTCCTGCCTTTTATTCCTTTCCATTTTAAATAA 

G ACAC CGTGG AG CAAGGCTTTCTCATT CATTT ATCTG CTT CACTCTGG AAG AG ATTG CCAG AGAAAATG^TGCT AACTTCACTT CT 
AAGGAGTATGTCGATGTAAACACTGGGCTCTGTTTTCCATGTG AG C CTGGAATGCGCACTG AGGGAT AGGAGGG AGGC CCAAGGTC 
CCTGACCCCATTTCCAGTGTTTCTCAGTCATATTTTGATCAGCAGAATTCAGATTTC 
TGACTCCTTAGTTAAACTTCCTTGCCrrCCTTCTTTGAA 

GTATTCTGAGGAAGAGGCGGGAGTTGAAGGTCTTTAGCTCCACAGG 

TAAAAAGAAAGAAACAAAACGTTGG CTGTTTCTGAAAGTTGCCTTTTGTCATGG CTTCAACT AAGACACAG AG CT AGGAG CAAAAG 

AGATAGAAAATTCAGCAATTAGTGAAAGGCTCAGAGGAAGTTCT 

GAAACTTCTGGCTCTTTCTCTGCACCAGTACCCAAAGCAGCAACCAACTG^ 

7 0 GCTATTGCCACACAGCATCCACCTCTTGTGTCTTTTCCCAGGCTAC 

AAGGGATTCCCATGGCCCCTAGAATTACACTGCTTTTAGCAGCCCCAAACCCT^ 

TGAGCAAACCCACTCTAGGATTAGCACACAAGAAGCCCAGCCTCCA^ 

AACrGGTATCTTAGAAGCCTCTCAAACTCTACCCTATCTT^ 

TCTGGAGTGGAAAGTATTCATGGGACAACCCGCGGCCCCAGCCCTGAGGTGTGTGCGTGGCCAG 
75 CATCAGTCAGTTCACTGAGGACCAGTTAGATCC^G<^C^ 
CCCACAAATGGTCTGAAACAGCTTAGATGTAGACAAAC^ 

AGCTTCCACGTGTAACCACCCAGCAAAAAGCAACCACATTGAAAGATGCACATAGTCTAACC 
GTGGGCAGCTTTGACATCCAACCTGGCTTCTGTACACTACAAAGACAGAAGGCC^ 
TCCATCATAGATGATGTGATTTTCTGTGACTGTCCTTGCAGGCCTCCTGCCTC 
5 GCTTTCAAAAGTGATTTGTGGAAATGGGGACCTGATCCGA 

CAGAGCAGAGCAGAGTTTCCTCAGCTGGGAACATGGACGTTGAGGTTGCATTC 
AAAGTATGGTCTGGTGCTTAAGAACAGAATTCTGGAGACCACCAAGTCTGTC 

GGCAAAACGTGAGCACTGGGCCTGGGTCATTCAAATATTCTGTTCTTTCT 
1 0 TCrCTCTGACAACCGGAaUVGCCAGCAGTTC^TTCGTATTCCAA 

TCCTCCCTTGTAGCACCAGCCACACTGAAAAGTCCCCAGCCTTTGCAATTT 
ATGAAAGACAGCACCTCGGAGGCCAAGCGTCACAAAAAAGATGGTGCTGTGCAGA 

CCCTT CTCCACTTGG CTCCCG CTCCATT CTGGAATGTTC CTTGCAAAGAAAAGTG AAT AT AATATG ATCCTCTTG AGAT ACAGTGT 
CTCTCrCAGAAAGGGAGGAAAATACCTACACCTATA^ 
1 5 GGTTGCATGTCATATCAGTTTCTCACAAAGAGTCACTATCCCCAAGACTTGCC 
ATCCAGAGCCAAATAGGAAAAAATAGTGACATGGGTAGAGCCAGGCACAGG^ 

ATCTCT AGAGCCCAGPrcCATGGTAGTTGGACGTGGGC CTGT ACTG CAGCT 

GCTCTGGAGGGTAGAAAACACTGCTCTGGGAGAGGGGAGAGGAACAGGAAGAGC^ 

GGAAGGAAAG CCAGTTGAG C AGG CATCC CCAATTTTCAT CTT CAAAAT AAG AG ACCAATT ACAAATC CAAAAAAAAAG AAAGGCCA 
2 0 AATGAGGACAGCCACTGAGGTGGCCTCTGAGCCCAATATATAGTTTGCTCCAAATGACCCT 
GTCATACAGCAGAGGTAGGCTCTGACCCCAGGCCTAGCTAACAAGAGAAGACTTCAAAAACA 
AGTTTCTTGGCCAGATTTTACAAAACAAGGAAGAAGACCCT 

AGCAGG AGC^AAGTGGACATCCTTTG ACCTCTG ACTCCTCCG ATGACT ACAG CATGTCTAGGGAGAGGG CTACTTATTGC 

AGGATGCCCTAAGCTCCAGCCCAAAGAACAGTCTGGTGGCTCCTTGTACAGCAATGAGCCCCT 

2 5 GCACTTG ACCTGCGGCAGAGCAGCTGAG CCCTGGAGACTCAGTGAATCGG CACCAAGGG AAGCAGATGTGCTCGCCATGGTCCTCA 

GGCACAACCAGGAACCAGCCACTAGGGGAGGAAATTTGCATA^ 

ATGGATCTCXX3T AT AGTTCAACAAATTT CACTT CTTCITTCC CATGTTTT CTCACAC CAACT AGATATGAG GT AG CATGTG CAACT 

CGTCCAAAGGTTCGGGGGTGTTGTTTGATTTTTATTATTGT 

CACTCCCTTGTGCCCAAAGTGAGTTTCTGAGCTCCCTCAGAGACCTCTCCCAA 

3 0 TACCCTGCAACACAAGCAGATTCACAAACAGCAGTGGTGGTCATCTGCCTCT 

GCCGCTTCCrAGAGCCTGCATGAGCTTCCrCATTGACATTATCCCTGTGGCTT^ 
CAAAGGGTGTGTGGTTTCAGGGCTCCCCAGTTTCATTCCCCTTTCAGGCCT 
AGAGGCACAGCCCTTCTCTAACCCTTCGAGGGACTCCCTATTGCCCCCATAAATTCCCACT^ 
ACCATTAGTTTTATAACACCTCACAGGAAGGCTGT^ 

3 5 CTAAAATAAACCAAAGGAAAGAATAGCCTGAGAAAAAATCCCTAAGACACCAGCAAGATG 

CACACACACACACACACACACACACACACACACA(^CACACACAC^ 

GGG AG AAAAG AATT AT AAACCTAC ACTCT CTAC AG AT AAC AGGG ACAG AACAATGTG AGG T AGG CAT AGCCTGCGACAGGAGG TTA 

GGCCTAGGGCAGGCTTCCTCTTCTTCCTGGTTGGCTGGTCCCACACGCAG^ 

AGATGCTTAGATTCTAAACTGAAGCCTTTCGTrTGTTGCAGCTGCTGACTTG 

4 0 TGTCTTTCCTCACTCCATTTTTTAAACATTTAATTTAATTCA^ 

TTCAACCTC ACGG CCT CGGGGAC CATAAAGTG CAGG AATCT ATGGGT AAAAAG CATTCTCC CCATCGTGACAG CCCTGTT C CCAAC 
CCATCTGTTCAGACATCTT CCAAATCACAT CTAATTCCCACACC ATCCTACTC C AAG AACCCTG CTTACCCTCAAAGTCTTCCC AC 

CTTGATCATTCCTGGTCCrGGCTGTT AAGTGGTAGTCTCTCCACATCAGACT AAAAG CT^ 
45 CCCIATAATAATGGTATCCTGTGAGATCAGTCCTCCCTGTATCCTGGGCTCCCTGGCT 
CTGGACCCTAGCCCTTCrTTGAGGAGCTCCITCCCATCTTTGCACT 
TCACCTTGGAATCTCCAGGAGGGGCCACTCCTTGCCCCAGTATTC!AGGACAAAGTATCCT 
AAAAGCAAAGAGAGGGGTC^ATGAGGCAGAGAAGGTGAGTGCAGCCCCTTGAAAACAGGGAAACTA 
GGGCAGAGGGTGTCATTCACAGATTGGGACITTTCTAACTTAAGTAATTGT^ 

5 0 TATAAGCATGGTCAAGTTCCATCCCACTGACTCTAATCAGGCCAGAAGTTGACCT 

ACAGGACAACAGTTC^CTGTTAAGGGGACACCAAAGTTTTGC 

AGACAGGACCT AG AAACCAGTC AGGATC CACAGACCAGGGG CAT CT CAAC CAACT CTTG AAAAACAG ACAGCAG AAAGAATGAAT A 

GTGGAGACATGGAAGAGAATTAAAACCAGAGATGCCAAGTAGAGGAGGAAACAAAAAGAAACAAAGA^ 

TTAAAAACAGTGACATACGTAGTAGGTGCTAATCAATCTTTATC 

5 5 AAGGCAC^CACTGGCAGAATGGACCCCAGCTAAGACTTGTCTACAG 

CCCAGCrGTGGGCCCrGTGTAGTACAAAACCAACCTACCGGGCAAGATGTGCCCGCTATCATAACATAATAG 

AACCACTTTCTGATTGGATTTTAAGCCTGCTACACTAGAGGGAAATCATACGTAGTCAAAAGCCC^ 

CCTGGTGAGAGCAAGGGAAGAACTTAGCTTTTACTTAAGGATGTGTTTATATCCAC^ 

MOUSE SEQUENCE - mRNA 

CCTTGGGGAGGCACAAGTCTGTTGATAACCTGAAGACAATGGATGTCGATGAGGG 

AGCCCCCCAGTCAGTG ACACTCCAGATC AAGGGG ATGAGCCCATGCCTGTCCCTGAGG AC CTGTCCACTACCTCTGG AG CACAGCA 

GAACrCCAAGAGTGATCGAGGGATGGGTGAACGGCCTTTCCAGTGCAACC^ 

TGCGGCACATC!AAGCTGCACTCGGGTGAGAAGCCCTTCZAAATGCCATCT^ 

6 5 GGCCACCTGAGGACGCACrCCGTTGGTAAGCCTCACAAATGTGGATATTGTGGCCGGAGCTATAAACAGCG 

GCAT AAAG AGCGATG CCACAACT ACTTGGAAAG CATGGGCCTTC CGGGCGTGTGCCCAGTCATTAAGGAAG AAACT AACCACAACG 

TCTATGCCTCAGAAATTTCTTGGAGACAAGTGCCTGTCAGACATGCCCTATGA 
ATCCCACGTGATGGACCAGGCCATCAACAATGCCATCAACTACCT 

7 0 GTAGCTCCGAGGTGGTGCCAGTCATCAGCTCCATGTACCAGCrGCACAAGCCCCCCTCAG^ 

GCACAGGACGCCGTGGATAACTTGCTGCTGCTGTCCAAGGCCAAGTCTGTGTCATC 
CCAAGACTCCACAGATACAGAGAGCAACGCGGAGGAACAGCGCAGCGGCCTTAT 
GCAATGGGCTGGCTCTCAAGGAGGAGCAGCGCGCCTACGAGGTGCTGAGGGCGGC 
GTCAGCACGAGTGGCGAGCAGCTGAAGGTGTACAAGTGCGAACACTGCCGCGT^ 
75 CATGGGCTGCCATGGCTGCCATGGCTTTCGGGATCCCTTrGAGT^ 
CAT CC CAT ATCACGCGGGGGGAG CATCG TTAC CACCTG AGCT AAACC CAG CCAGGC CC GACTG AAGC AC AAAGAT AG CTGGTT ATG 

CCTCCTTCCCGGCAGCTGGJ^CCACAGCGGACAATGTG 

GTTTGATTTGCGTTGGAAGATAAGTTTTTAATGCT^ 

CC 

MOUSE SEQUENCE - CODING 

ATGGATGTCGATGAGGGTCAAGACATGTCXCAAGTTTCAGG 
GCCCATGCCTGTCCCTGAGGACCTGTCC^CTACCrCTGGA^ 
TCCAGTGCAACCAGTGTGGGGCCTCCTITACCCAGAAAGGC^ 
1 0 AAATGCCATCITTCCAACTATGCCTGCCGC 

ATGTGGATATTGTGGCCGGAGCTATAAACAGCGAAGCTCTTTAGAGGAGCATA 

GCCTTCCGGGCGTGTGCCCAGTCATTAAGGAAGAAACTAACCAC^ 

TCC CTTGTCCTGGACAGGCTGGCAAGCAATGT CtSCCAAA 

AGACATGCCCTATGACAGTGCCAACTATGAGAAGGAGGATATGATGACATCCC^ 
1 5 ACTACCTGGGGGCTGAGTCCCTGCGCCCATTGGTGC^GACACCCCCCG 

CAGCTGCACAAGCCCCCCTCAGATGGCCCCCCACGGTCCJVACC^^ 

GGCCAAGTCTGTGTCATCGGAGCGAGAGGCCTCCCCGAGCAACAGCTGCC^ 

AGCGCAGCGGCCTTATCTACCTAACCAACCACATCAACCCGCATGCACGC^^ 

GAGGTGCTGAGGGCGGCCTCAGAGAACTCGCAGGATGCCTTCC 
2 0 CGAACACTGCCGCGTGCTCTTCCTGGATCACGTC 

TTGAGTGTAACATGTGTGGTTATCACAGCCAGGACAGGTACGAGTC 

AGCTAA 

HUMAN SEQUENCE - GENOMIC 

2 5 AAAGTAGCATACAGGTTCTGACCTTCAAAGGATCAATAA 

ATGGTTCTACCCTCAGTTTAGTCCCTCGGCTTAATCATCTTT^ 

CAAACAT AT AT ACATCCTGT AT AT CT ATTTGT ACAAAG AATTGG AT CTTCATC CTG CAT ATT CT CTTTTCAAAATT AG CCTTTGTT 
TACCATCTATAAATCAAGACAACATTATTCCTTCTCCGAAAATG 

CTGAAAATG TTT ATGAG CTATAAT CAT AGAAAAT AAAG GG AT ATT AT AATGAGAAGTCAATCTTTCAGTTT AATTGC AGA 

3 0 TTTAAGCTACAGAAAATTGTGAATTTGAGCTCTGAAGTGGAAATAAAGTTGCCAGTTTC^ 

TTTTTCTTCATTAGAAAAACATATCTGCAAGTAATAAGATATTATGAGTAAATATTACAAG 
GCTACAGAATGCATTATATCTTTAAATTATTTCTAAATCACAAATT^ 
GTCTTCTTATCCTACAGAAATAAAGAGGTATCAACT 
TTATGGTATTGCAGTGTTGTATTTTAAAAGTATTCT 

3 5 CAAGGAAAGTGTGAAATAAATTGTCAACTTTTAGTTGAAGGAAGGAAATTTGGAATATTC 

AGCATTAAATAGGTGAAGAAGCAACACTTCTAGTCCATGATCGCAGAGCCCCAG 
TTTCAGCTGTCAGGAGAGGAAAACACAGTCCCTCTGC^ 

ATATTTTTCCTGTAAGTAAGATAAACCCTTCAAAAGTAGCACCATGAAATGGAACT 
AACTCTGAGGAAACAAAATAGAAAAAAAACCAAATCrCTCCTTTCAATGA^ 

4 0 GAAGGCAGGAGAAAAGCTAACGCTCGGGTGGGTCAAGGTCGTTCCCTTTCATGGAGAACAGACAATG 

AATGGACCAAACACCTGAAGCCAGTAAGGGGGGTGGGCAGGCAGA^ 

GGCAGTGAGCTGGT CTGGAGCACTTGAGGACTATAAGGTAGGGCCAAGTGGCCAGAGACAGG CAAGTAAGTGGGAG AGTGGAGTCT 
GGCCTGCATTClAACACAGAAGCACCCCGATCrGACTCTGCTCTTCATCAAG 
TAGAGAGACCCCTGAGGAGGACACTGCTATTTGCAGATGAGAGGTGACCAGCCTGTCTGC^ 
45 GGACACCTTGATACCTGCAGAGCrrAGTGGGATTCCCCAATGGACTGGATGGTAAGTG 
GTTTTGGCCTCAGCAATTGCCAGCTGGTATTCAATGAGATGAAGGGGTCT 
GCTGGGAGACAGTAAATATAAGTTGCCTATTAAGCCTCCAAGTAAAGGTGTTAAGGAGCT 
AGCAGATATTAATTTTTGGAGTAAGGGTACAGAGATGACATTGCAAACTGCTTCT 
GAGACATGTCTGGAGAGCTGCAGCATTTTAGGAAAGAGGGGAGAAAACAGCCAATC 

5 0 GGTGTCCTGAAGATCTACCTCACTGATATCGGGGTTCAATAAAATGTTCGTGCAT^ 

GCTG ACTT A CT ACCTTTGGGAC CAG ATGG ACTTTGCTC CATC CATTG AG AT CATTCTTT C CAG CT CTTCTT AGATT AAAAAAAAAA 
TCCACTTTTAACTTAGTCAAGGTTAGTCCCAACTGAGAATCTGGAGACACATTCCTG 
TGTTAGATAGCACTTTCTGAGAAATCTTACAGCCACCCATAAACAGG 
TTACAATTG^AAGACTTATGTTCATGCTTTTGTTTAATCCAAACCACAACT 
* 55 GACAGTGTGATCCTGAGAGAGGTGACGTAACTCAGCTGAGTTCACGCAGTGCCCTCGCCATGCACCTGTGGT 
AAGCTAAGGTGGAAGGATCCCTTGAGCCCAGGAGGTGG71GGCTGCAATGAGCCAAGAT 
GGAGTGAGACCCTGTCTOVAAAAAATAAATAAATAAAAATAAAGAAAAGAAAAGAGAGTTC 

CT AAATTTCAGTCCTACTTGATAGGCTG^AAAGGCTGTTCACA CCTGGACAGGTCCATTCCAT CTG ACCTCTC 

TACCAAGACTGTTGCTTCTAAGACTGACAATTCGGCOT 

6 0 AAATTGGAGAAAGGTGTCTTAAAAC^GCAGTCCCCAACCTTTTTAGTGCCAGGAACTC^ 

GGTCAGGAGGGGATGGTTTGGGGATGAGTCAACrAGCATTACATTTATTGTGCACTTGAGCTCTATTATTACT 
TATAATGAAATAACAACTCACGATAATGTAGAATCAGTGGGAGCCCT 

TGATGGGAGACAGTGACAGATCATCAGCCATTAGATTCTCATAAGGATCATGCAACCTAGATCCCTCACATGTA 
GGATTTGCAATCCTAAGAGAATCTAATGCCACCATTGATCTGATGGGAGGTGGGGCTCAGGAGGT 

6 5 CTATAAATACAGATGACACTTTGCTTGCTCACCCTCCCTATACTGGTC^ 

CAGTTCTTGGAGAGATCTC^TTTCTGTGTTGGGAAATGTG^ 
CAGCATGATTCGCAAGGTAATTTGAAAAGAGAGGGCAGACGCATC 
GTCACATAAAAACTGAATGCTTCAAAGTGCCGAACACTTACCCAGTGCTTGAT^ 
TGCACTAACTCATTGAGACCTCAAACATTTCCGAGGATGTTTGTGCT 

7 0 ATGGAGAAGTCTAGTAACTTGCAGAACTAACTCCAATA^ 

TCAACCCCCAGCGAGTCCCCTTCCCTGAGCTTCAGTGTACTCATGTGAA^ 
TGGX^GATAATGCCTGCATGCCTGTCACAGAGCTGGGCT 
CTCrCTTTTGAGACGGATGTTTCTAAGCTCCCCTGCCAGCCAAGTGTGGA 
CrrCCTa a .CCACCrCCCTCCTT!TGTCC^TOAAG 
7 5 CATGTCAGAATTTATTATGACACATTTTGCATCAGAATGAAACTATT 
GGACAGATCTTCCTTTCTGCriTCTTC 
CTGAATAAATAAATGAATACATGCCACCAGAGTGTTTATC 
TGGAGTGACACACAGCTCTtjTTCTTCTTTGTTCTC 
CTTCAGAAACTTTTC^TCAGAAAGATTTTTTGTGT 
5 GTC T CTT AT AACCAAATGAT ATTG AT AGAAATTTGTTG CTTTTGAGAAC CT AAG TGT ATTT CTCTT ACAAATTTCAAAGGGAAT AA 

AGTGAAAGTATTATCAXSGAAGTGGGAANNNNNNNNN^ 

TCGAAACCTACAGGCAGAGATGTGAATGGTCAGTAGTCCTAAAGGATTTCTC 
AAGACAGGGCCTTACAGATtTITTCAGGAAACCAAGCA^ 
TGCCAGCAGGAGAGTCCTAGCTCTGCACCA^ 
1 0 AGTGGACTATGCTTAGAAATAGCCAAACCAACGAGGGAGCCCAAGAAAAAGGATCGAGGACCTCT 
TTTTCAAAAACAGCAGAGGCAGAAGCTGGAAGTCATGAAAA 
TCATCCAAGTCCTGGGTATATAAATAAAATAAAATTTTATTTTTTAGCAAAAAA 
TGTTTTCCAACCATCAGAGGAGACATGCACTTTTCATTC 

TTCCATAGATTATTTCATTGAATTTCACGTAATTCTATCAAGACCATTGTGAAGTA 
1 5 CAGGCTTTAAACAACTTGCCAAACACAGTTACTAAGAAACAAAACTAT 
CTTTCAAAGTAAAAAACTCACTCTCTTTTGAAAATCCCATCCAGGCT 
GTGAGCTCTGAATAAGTAGCCTTTGATTAATCC^^ 

TGATTTAGCAATTATTGCCTGAATAAATTAAAGTGTTTTATTTAATACTTTAACAT^ 
TAAATAAAATAAAATGACTTTAGACATAATATGA 
2 0 TGTGCATTTTACTGTTGTCCCTAAGGGTTTGAACATTTGATTTATATCACCT 
GTTATAATAATATAAGCTTGACTTCCTTCAACTCCAC^ 
TTGTTITGTTTTGTGACAGGGTCTTGCTGT^ 

CTGGGCTCAAGGG AT CCTC CTG CCT CAG CCT CCT AAGT AGGTTGTTGGG ACTG CAGGTGCAT AC CAT AACACCTAATTTTT AAATT 
TTTAATACAAATGGGGTTTGATTTTTAAAGTTTAAAATTTTTTTGTACAA^ 

2 5 CTCTTGGCTTGAAGTGATCTTCCCACCTTGGGCTCCAAACT 

GTGCACTCCTCATCTCATTGGGACATTGGTTGGGCTAGCAGGGAATAACTGT^ 
GTGGCAAGGTCTTCCATATGTGATCTCCTTAACC^ 
GGAAATTCAGGCTTTGAGAGATTAAGGACCTCAGTCAG 
TCAGTGGGCTACACCITGGTGAGATGCCTATGTTTCGGTC 
30 CCAGCGGCTAAAGGTACTTCTG CATCCAAG AACTAGAGGTGTCCCGTGT CCCCTTT ACAACATTGTTGGCCTCCCTCTGGGCCTTG 

TACCTTATGTTTATTGCCTCATTTAATCCAGGTAAGGAGTCTTTAGGATTGGT^ 
TGCTCAGGAGTTTGGATAACCTGCCCAAGGTCACGCAGTGG^ 

CCTTCTCTGGGCTGTAGCAGTXSAGTTAACTGGCTCAGACTTGCCCAGCGGAATGCCCT^ 
GCTCCATATGCGCTCAGGGAGAATGTGGATAGACGCTGTTTC 

3 5 ATCCTGTCTCTGGTGTGCCCTGGCCCCCGACTTGGAGGCCTCCTGGGCCAGGCCAAGA 

ACACTCAACTGCCCTGAAGGGACZATTTCCTGCTTATTCCCTTGCCCGGCTC 
TGCATCTCCTACCCTGAGGAGGCTCCCrrTGGTCTTTCATCG 

CAGGAGCACGTTCTCGGCCCTCTTCACAGGGCGCTCCTCCTCACAGGGGTGCCCGG 
CTACAAGTCTGGAAGGGCAGGAGGCGCATCTCACTCCTCrGGGTCCCCTCCC 

4 0 GAATTCATGACTTTTTCTCTCCTGCTCAAGCTGAACA^ 

TCCCCGTAGGGCGAAGGGGTGCGCTGTCAGATGTGGCATT 

TCAGACGCAAAGAGCCGTCGAGCGGGCTGCGGGATTGCTTCGCTGTCACCTCCGCCT 

TGCACCCAGGCCAACATGGAAGGCGCCATCCTAACTTCTGCCGTGAGCAGGTGGGAGGGAAGAGA 

TGTCTGGGAAAATGAATTGCACCTTCCCCTCCCTTGCGGAGGATCAACTTT^ 
45 CCGGAGACTGAACCCGGGAGCCAAGGGGCCCCAGTTCCAGGG^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNJ^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNTOJNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNIWNNNNNNNNNNNNNI^ 

NNNNNNNNNNNIWNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN*^^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

hHWNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNKNNNNNNNNNNNNNN^ 

nitonnnnnnnnnnnnnnnnnnn^ 
gcgtcacccgaaagtttgcggggtggcagggcgcgcgct 
cggcgggggcggccggcgcggagggggccaggtacggggcccgcg^ 
cgaaagcctggtctgagccggctgggggcggggagtgtggcggagaaatggg^ 

6 5 gtgaaagaaagctgggaagagctccgcggccaagttagcaggacactctaacaagtgactgcg 

tgcggcaagccccctgggtccccgcgcggcgcatccc^ 

cagagggagccccggctccgaggttgctcttcgcacccgaggatcagtcttggcccca 

ttrtcaaattgaatttcaataggaaaacttggggtaactggtgaat^ 

7 0 actgtgccagcaggtgcttggaccaggccgacatgggac^ 

ccatatttaaatttatgtagacctacatatacccacggaagtcagcct^ 
gccgggagctttctttttatactataaatatgatc 
aaagttagtccttttgctgaccgcaggtttgac^ 
gttacttgcggttatatttgaaatgtattttaaatatttc 

75 TTCCrTCTAATTTGGAGGCATCT^ 
CCTCT CT CT AGGGACTGCAG CC CAC C CAG ACTGAATTT CAATGCGGTG CG CTTTG CTT AGGTT ACCCACT CACAATTT CCCACTGC 
GCCGCAGGCAGTATATTTCAGCTTTGAGATACCTTGTTTTAAAATTCCAGACAAAAT 

CCATCAACTTCTGTT AAAAGAGGAAAATTT ATGGAATTTG AAAATACTGCG T ATG AT ATTT AAACTTTCAT AGACATTCAAATG CT 

TTTAAGGCCAGGTTCAATTTGGTTATGAGTCGAGGGGTCGGGGG<iACCCAC^ 
5 TTGTTTGAAGATGTTTGTTCMTCAGTTTTATTXn'ACT 

ATTAGAATGTGTCAGAATTATTTCCGTGACAAATCAGATCATTTG^ 

AGAAGATTCACCTTCTITACGCTGGCATCTTGTAAAATGC^ 

CATTAAAAATACTACACTATTCTTGCTTGATGGAATGTATCTGATT^ 

GAAAAAAAACGTGACTCTTAAGTCTCACAGTCAATCAGAGCTGGTC 
1 0 CCCTTTTGTTTCACG<STGAGAGTTGAAGGAAGGAGTTTAGAAACT 

GTACCrCTTGGAAAACTACACACCCCCCTAATGCAGAAACATCATAGCAATAATCACCC^ 

GGGCTGCAGATAAAAGTCTGGATCTGTTAGGTTTGACCCTT^ 

CGTTTCTAGCCATTTCCTTTGTCGAAAAACTAAGATCGCAGTGAATGTAT^ 

CACTGTCTTCCTTGTTTGACTAGAGACTTGGAGTTTtl^^ 
1 5 GTGTGTGTTTCATGCGTGGTTAATATAGCATATTTTCAATATA 

ATCTACTTATACACACACTTCAAGATTATGCATTTATTAACAGATAC^ 

GAG CGTGCAC ACAC ACAG CATGCACACAG CG TGGAGTG AGAGGCATGGGG CAGTG TGG AAGAGTTTT AACATCAAACAG ACCTGAA 
ATG AGT ATT AAAGG C C CCCTTT AlT'ITTAAACTT'iT ACT AAAACAAGATGGATTT C C CT ATGTT AT AT AATGGTG AATTTT AGGCA 
TAAAT AACG ITTTl'T G AGTG TTGCAT AATTG T ACGT ATT AATGT AATGT AACTGTGGTTAACGAAG AATTCATCAAGG ATATCACT 
2 0 GTTTTCTGGCATTTTTTTTTTCCTCCTCTAAT 

TCTAAGGGAGATTGATGTAAGTGGCTCCACTCCAGCTTACAGAAGGTAAACCACGAC C 
TCCGATGCCTCTGTTTCTAAGACTGACAAGCACTCT 
CCTGGCTGTGAACATTGTTCTCTCTCTG<3TGTCTCTATGTT 
GAATAATCATTCCATTTTATGGGAGAAAACACAAGATGTAAAAGCAACAA 

2 5 TTAATCTATTTCACGTCTTGAAAACAGATCAGTTTrATTTTGCrCAAA 

GTAGAGTTTCAGAGGATCAGCATTATACACACTGTCACACAC 

TTTGTTAGGGACGCAGAGCACCTAAAACCAAAGGATATCGACAGTAACAAAGCTGr 

CTGGTGTCTTCATGTGGACCATGGCTTTCTTGTATTTCTTTGCA 

tcctggaatatatgtttatgtccttaaagtttcagtgtcx;tcactt^ 

3 0 ccaacaaatattttttaagacattatgaaaccttatgaagtg^ 

gcagaaggttm-aagtggcgaagtgcggccctgcccattttggtgtct 

gcaactgaggcctaatctacaggcaactgccaaattgtttcagttgacgtttttccctct 

ttgccagtggagccttcagccaaccacctggtaataaactgttaaaaatggtgcaaaccct 

TCTTGCCTGTTAACAGGCCTGGCCTTAATTCTTTTCTCCCATGGCCATT^ 

3 5 AGAGCACCCTTTTCCACCACAAGCCTGACAAATCAGACGTCCACATAATT^ 

TCCCTTCCTGTCTGTGTTTTCCTAAGAGAAAACGGTCTTCCCTCCTTTTTTG 

CCTGGTGGAGTGTGAAGGCAGCAGAGGAACCTTTTGGAGGAGGAAGAGGACACAG 

CAGGTGGCTGGGTCrTGAGGGGAACTrCCGAGCAGCCCTAGGTCCT 

TTGGCGAGGCCTCGGTTGGTTGGTGAGTGCCACACGGTTTCTTTGTGTGCTTGCATGGATT^ 

4 0 TT CC CTG CTGGTGTTT C CACAGTGGGTGGCCTG AG CCCAG AG CAG CT C C C C AT AT CC CTGTG CAGG C CACCTG TCTCGGGTGATGG 

AGAGCATCATTATCCTCCGTCrGAACGCTCTGCTTTCGGATG^ 

CTTAACGAATGGCTGAAAATGGGTCCTAATTAGTGGAAAAGTGCTTTGTTCATATTTT 

TCTTCTGCAATCAGCTCACTGCTAAAGTAAATCTGACTCTCTTCC 

TAGGTTTGTTGAGAGAGCAATGCAGTAAAAAGAGGGGATCCAATGTGGTCTTGTCT 

ATTGGTGGGGGGTTTGC^CAGACTGCCAAAACACTAAGAACTGTGTAAAGTGTTTTTGA^ 

TTTATGCTCCATTTTTGAGATCACACACTAAAATCTATACCTTTAAAGC^ 

TGTGGTTTAGATTAGAGTCCTGTTCTGAAGCTAGGAGTTCCACTATGAATATTGATT^ 

TAC CAGATTTT CACTG<3CAAACCT AG AGCAAAT AAAATT CCACAT AAG AT ACTT C CCT AGACCT AATGGG AAAAATGTTT7UV7*TTA 

5 0 GAGTCTTTAGGAGAAATGAGAATGAGGAATTGACCTTTTGTAAGCTTACTTCTGAGGCACTCT 

T<K5AAACTAGAGAGAGCC^CAACCCCCTAGTGTGAGCCCCACTTTTAACCGGAAAAA 
GTTTTCCGTAGGGCAGAAAATTAAGCTGATATTCAAAGAGATTCACT 

AATTAAAACCATACTGCT AATTATCTCAGGTTGTTAAACATAAGG CAATT AATT ATCAT T i TAAAAGTTGGTAGGAAGTTGTGAGT 
ACITTTGCAGTATGAGTGTTTTCCCGCTTTAGTATGAGGTTC 

5 5 CAATGTTTTGTTAAAGAAATGAAATTTCCTAAAA^ 

GCTGCAGAAAAATCTGATT<5CTGTGTTTAAAATGAATTTT 

TAAATAAAGAC^TGCCCATTTTTTGTACCCCCACAAATGAGGAAGTTGTAAGCTCTCT 

TTCG CATGG AC CTT ACGATG TGATGAATGGTTTTGG C ATGAG TGT CTT AAG AATG CTT C CAGATT CGGGTT ACAGG ACAG CCAG CG 

6 0 CTGAGCTCCCrrATTGCAGAACAAAGTAGGAATCrAGAA<rTTTCTTGCT 

TCiATGTTCTTTTCTGTCATTATTTGATTGTTGTCAGT^^ 
TTCTGAGACACAGCTCATTTTGACTTCAGTGGAACCCCTCGAAGGTC 
AGAGACCAAACTACAATTTAAGGAACCTGATTTTTGAATCAAA 
TAGTTATTATGAAAAAAATCTCATCTTGATGATAGCTGATAATTTTGTGG 

6 5 TTGGGGAAAATTTCCAATTAGTAGGAAACCAAGTGGCCTACCTTACT 

CTCTC^ AC^ AAAGACT AAATACCC ATTATTGTATAT AGGTCAGG CCAAAG CAG C CTTTT AT ATTG CAAGGAATAAGAGGT AAATAG 
ATATATGTGCAACAATGAATCCCCTAATGTGTTTACTCTAGAACACATGTTCTTTCT 
TCTTACCACCTGCTAATGGTAGATACTGTATCrAAATAAGTTGAGGAAAA 
TCAATCAATCATGACTTCAGGTTATTTTTAATAAATATACACGTATGGCTTCATAAACAA 

7 0 AATTTTACTTCTTTGC^ACTAAATAAAATATAGCTTTTGCCAAA 

TTGCAGAAACTTCAGTTTTACAGGTGCATTGTTTGAAATGT^ 
TGAGTTTCAAAATGTCCACCTGAAATGATTTACTTGTACGTTA^ 
GACACCATATCGCCTTGCTGAC^GAAAAACTTACTGTTATAAGTTTGTGTTTATCT 
GTTTAAAATAATGGC^JlC^CAjGCA^ 
7 5 GGGCATGTCTGTGTGCACCCTTCTTTTC^C^CAGTTTCTG 
CTGCACAGCTCCTCACCATGCATGAAGGTCCTCTGAAATCGGTAA^ 
TTAAGTGATGAAGTGTGAAACGCCATTTATATTTGAGGAAGCTACC^ 
CCTGTAAAAGCTTCTATCAATTTTGACTGTGTATGCrTCTACCATGGCGGCTCAATA 
TACAGTAGTATAGACGGGAAGCCTCTCCTCrCCGTGTGAACCGTGC^CCC 
5 CGTCAGGACAGACAGTCATGGCCAGClTGAACTCCAGCCCTGGGCTrCTTGCA^ 
CrCCACTTTCTCTATTTTTAAAACAACTTTTO 

CTGTGGCATTTAAAATGTTCTGTTTCTTTTCCCTCTGTATCAAATACCT 

CTGTATTTTGAAAGGTTTCCTTAGTTCCCTTTGACCCTrCCTC^^ 

ACAGAATTGCACATCTGAGCTGATTTGCCCTCAAAAAGTITCACA^ 
1 0 ATGTCGGGGGAGGGTGGTGTGAAAGCCAAATTGGATTCCTG 

TACTCTGTGGAATAATCATGJUUVATGTGTAGATTGGCAGCTAATTTTTGA 

GGAAGTTTTTATGTTCTCTCAACCTGTTTTGTCAAATTGTTACGAAAACCT 

CATTGCCTTAGCTTTCTCTTGAAGCATTTCCTTTTTTA 

CTACCAGTCTGAAGATGAAAGGACTTGTCT 
1 5 AAATT ACAGGCTTGGTCCCTGTCTCC CC CATGT AT CT ACTCCCTTCAAT AT AAG CATCATTGAGT ATTT AAGG AAAT AAC C C CAAA 

TGTAACTCTAGTGTAGCTTCACTTGTCAGGGAGGAAAAAGTAAATAGCATACATT^ 

TTTTATGATGAAATTTGCCTTT AG TG C AGAG T ATTACAAAGATCATGTTT AG TTTCT AGCAGT AT AT AAGT AG CATC CAT C CTT AT 
• CTGTCATGCATTTGGAGTGTGCGACCCCTGCACT 



2 0 CATCTATAAAATGGAGATAATGATGCAGATGCCTTGGGTTTAATTGGGAGAGTTAA 
TTGAArTCTAGCTCTACATTGGAC^CTATGCCAGGTGCTCAAATAA^ 
CTTAACCATTTCCCTCTTCAATGCCAGAAACTrAGTAGGTO 
TTTTCCTGGCAGACTAAAAATACTCAGTACATATGAAATATC^ 
CATTTGCTTAAAAGCAAACCAACAGAAAGACATTGTAAGGCAGTTC 

2 5 TTTTTCATCTCGCTCrGTCXaCCCAGGCTGGAGTGCAGTGGCACA 

TCTTCTGCCTCAGCCTCCCAAGTAGCAGAGACTACAGGCGCCCGCCACCACACCTG^ 
GGTTTrcCCGTGTTAGCCAGGATGGTCTTGATCTCCTGAC 

CATGAG CCACCACG CCCGG CAACT ACAATTGTTCTT AAAG CTTG T AG AATT ACTGTGTG CTAC CAACAG ACAGGCT AATTTTG AGT 
GACCCTCAGTACTTTGTACAGTTAATTTGGCACGCTGTGTACTTAGTGGCTTTTTAACAGCTATA 

3 0 AGTAAAGnTGTGATTCTTGACAGGCATCTATCTGCATTTTCATTTTTACT 

GAGACTCCTTCTGTCAGGGCAAGCACTGTGAAGAGAGG 

GAGACCCGCCTCCACAGGAGCCCCAG CAGGTTCGGTGG AGCTCTGG CCACACTCTCCTTTGGG ATGCTGAAGT CAGAATG AGTTCA 

CTTCCCAGCCAGTCTTGCCAAGGCTCCTCACCTGGAAGCAGCAACTGCCCAGGGCTGTTGGATG 

GTCCCAGTCCCGCCTCGGTGTGGAAGGAGGAAAGGCAGGGTCCAGGAAGCTtSTTTCAGGA 

3 5 ' TTTCAGGGTCAGCt^SAGGCTGTAAATCAGCAGGGCCCACACG^ 

GCTCCTAGTCCGGCTTCTGCTCCTCCTTTGTAAAGTTATGG^ 

TGACACTGTTTGGTCZAGAAAGAGGGAGAGAAAGGAGAAGGACAGAGAGAGACTGAGAGAGAGACAGTCT 

AGGGAGGGAGGGAGAGACAGAGAAAGAGATGGGAGGTAGGTGTGGGAGGAGGGAGAGATGCAGAAGGCAGAGGAAAGACAGACAGA 
GATTTAGACCTCCCAAGTC^GTGAGCAGTCCAGAGTTGGAGTGGAGGGTGCCTGGTG^ 

4 0 CCrAGAGGCACAGCCATGGACAGCTTCTGTCACGTTGGC CCTGCA(TTT A 

TGCATGCTCCAAAAAAACAATAAAAGCATTCATGTTCATAAGAATTGC^ 
TACTATCGCTTCITTTAGCTCTTGCCTGAAATTGTTTGGGTTT 
TATTCCTTCTGGCATCTCCCTTTACAGACCTACT^ 
CTTTGTGATTAACTCCTTTTACTGAATGTGAGCTCCTTTTAAATTGAGGC 

4 5 CTGCATTTCTTCCATGTGGGAGAGGAGGGGCTCAGTAAGTGCTTTC^ 

GTGTGACAGGACTTAAGTGCCCTTCTAGACACTTCAGGCTCCCCTTTGTAAG 

AACAGTTT CT CATTGTTTG ATT ATTCTTTT AG CCITTCTCIGG A^ CAAAG C C ACTTTT ACGAG AAAG TC ACTGCTTTTT C ATCT C 

AAGAGATGCAAGTTTGGAGTTTGGGGAAGTTTTC^GGTGCCCGTCAA^ 

TOVCAGGGCTCAGTGCAGACCGAAAACTTGAGGCCTCTTGTTCAGAAATTATTAAAAA 

5 0 GATCCCCTAAGCACCAGCCCCCAAGCAACTGCACTCATAAGCCCATGAAGCCCCCTGCTGTC^ 

TATGCACTTGGAAGTGAGATGGATTGCAAAACACAGGTCTCCATGCTGGGG 
AGCAGGCCCACGTGCGAAAATGCAGAGCrCTCTGGCTCTTGCAGACTTGGCTC 
TGTGGTGTGATCTG(nX3CCCACCCCTAGCTCCCTCCAGGAGACTG^ 
TCCCACAGAGCAGAGCTTGATTTGGGGACCAGCCGAGGGCAGTTTGTCAGGATTC 

5 5 TGAAAGGACGAGTGTGTCCAGAGGACITAGCATTGATCACCrCTGTCTCCATGCAGCAA^ 

C^AAGGGCGTTTGCCAGTGGTGTTGGTTGGAAGAGCCTTGACTTTGCCTT 
AGTATGCAACAGTAGGGCATTTGTTATATAAATTAGTTGACTAGTGTGTAGCC^ 
ATAAAAAGATATGTGTGGTATT AAATT AAAAAATATTTTAAAAC!AACATATTTGTAATC^^ 
TACAGAAATAAATATACAGAAAAAATAGTAGTCCTAAGTGGTAGAAATTATGAGCA'ITTTCTTG 

6 0 ATTGT AT CATTT ATGT AG CAAAAAGTTTT AAGT CAGCATTCT AAAAATTTCGTGTTGTT AT AGTTG CTGTGACAAG ATTT AACTT C 

TGTATGCTTCACCAATCAATACAGAGGTATTTAAGACCCGGTGTGTGATAGGCC^CGCTAAAATACTATACACATCT 

TAGAGAACTAACTTCTAACTTCCTATATTAGTGTGGGACGGCTGTTACAAAG^ 

CATTGTTTTGACAGTTTG^AAGAATCGTGGCTTTTCCCCTTT 

GAAATGCTTGTGCTGGTCACATGCTTGCATCTGGGCTAGTGTGTCTGGCTTCCGTGTGCTGX3TGGATC 

6 5 ATACTTTTTCTGTACAGTGGCC^CTAGCTGTACrCCTAAGCCACAC^ 

AAGCCCCTCCCATCCAGAAAAAGTGACTATCATGTATATCCTCATGATGACT 

CCATATGTACCATAAAAGGTATTAAAGATATATGGAGTGATAGATATATTATATATAACACTTCTACCCT 

ATTGAGAGGGTAAGATCCCTGAATCATCCATCAGTTTTTCAGGTCT 

CCAGAGACAGAGGTGGCCAAAAATAAAAAGGGGGACAGGGGGACAACCTGGT^ 

7 0 GTGAAGGAGCTCTCCTGAAAGTCATATACCAGAGCATAAATGAGCAG^ 

ATCTGTGGAGCTGCGAGCTGTCATCCACTTTGGGGGACCTGAGACT 
ACCTTTTTGTTTCTACTCCACTGATGCTGGGGTTCTCTCCCTGGT^ 
TCATCGAAGXSGCAGCATCTTCCCAGTTGl' rrc ri n ' C lT I 'l'C'l'l 
CAGCAAAACAGGAAGGCTCGGGCTCTCTCCTGGGCTCGTT^ 
7 5 GAAG&CTGAGGCAAGGGCCCAGAAAGATTGAGGX3ACAAAGAC^ 
TTATTGATGGAGGTAGTGCAGTTG CTGCTCAGATATGCACCCCTGCCTGGGTAAATGAGAC ATTCTTCAG CAAATTGCTTCGTTTT 
TTGATTGCTGATTGTACGCGTGTCACCAAGCTGACTCAAGGTTCATCG 
TCAGCCAAGAGAATGAATTCTGTGCCrACAATGACCCAGGGC'CATTTA^ 
GGGTTATTATGCAGTAGGAAATTAACAATAAATAACAAATTTGGTCCTCCTGTC 
5 G CTGTTTTT AAAAGGATCAAGGT CTG TG CCAGT CTGATACTCCAG CAAGT ATGTG AGGAGG AAAATG CATT ATTCT TG CT AG AT AA 

CCTTGTTGTTAAATAGCATAGGGGTTCTTTATCTCTCTCTCTTTCT 

CCTCTCTTT CTCAGAT AACCTGAGGACCATGGATG CTGATGAGGGTCAAG ACATGT CCCAAGTTT CAGGTGAGACCTTATGAG ATA 
GCTGTXjTGGGAAGTTCATGAGAAAAGCTTCCCTGGGGCCGGAAGTCA 
TCTGCAAAAGAAAAGACCATGATGGAATTTGCCTGAGTGTT^ 
1 0 CCTCCAAGATGAAATTAGTTTTACTGTTAAACTTCAGGAAAAAAAAAC 

AAATGGCTGATTGGTTTAAAACCCAAACACATGCCAATGATGGTACTTAAAGAGAG 

TTGGGCACAGTCATGACTGTTTGTTCATTAAGCA 

ATAGCTCTTCTCATCCAGGAGACCCAGGAAGTGKSAGAAG 

GGCAGCACTGAGGGIGGACGGC^GCCAGTCCAACAAAACCGCACAGTTCCCCAG 
1 5 GG CC CTGT ATCCAGAGG CGTCAGCCC CAGT AGCAG CTTT CATGGACTTTGGGGTTTTCGGT ATTTCAT AT 1"1T1"G AGCCTCACAGA 

CTCACAGCCAGCCCCAGAGGCTGACTTATATTTGA^^ 

TTTCTATGGGATTCTACAAGrTGGTGCTCCTGATGAAGACCAGGACCTATGTGTG^ 
AAGGTGCAGGAGAGGATCTTCTGAGTTCCATGGAACTGGAGATAGA 
GCTCTATTCATGTTTCAGATTTTGGACTTAAACTCTC^ 
2 0 AGGCTTCGTATTTATTACAATATAATTGCCACATTCTTC 
ATTATATAATTCrCTTTTTGTATATCATTGAATATTTT 

GACACAAAATAATTGGGTACATAGTGATGTTGTGATATATACAATCTATAGTAATCGGATCAGGT 

CAAACATTTATCGTTTCTTTGTATTAGGAACATTCGACATCTTCCTTCT 

ATCCTGCAATGGTGTAGAACACTAGAACTTATTCTTCCTACCTAGCTGTAATTTTGTCT 

2 5 ACTCCCCCGACCTTTCCAGCCTCTATTAGCCTCTGTCCTACTTTCTACTTAT 

TGAGTGAGAACATGTGGCITTTTAACTTTTAGAATGTGGTATT 

AAAGTITGTGATTATCTATATAAACAAAAATGGTATAAATACAAATATCAATriTGCATTGAAG 

TCACAAGTGCACAGAGCATTTAAGCATTTGTTCACTGCAGCATTGTTATC^ 

CATACAGCTCTTAAAAATAAGGAGGATATGAATGAACTAGTATGAAA^ 

3 0 CTGGATGGTATGCAAAGTATGATCTCGTTCTTTTAAAAGAAAATAAATT^ 

CGTTTAGGAAAATTCT1GGGTGATATTTATTAACCTGGACTTCCTCTTGGA 

AGAGGAGGAGAAGGAAAACTTTGCriTTCATCTTCTAC 

GTAAACAAAAAAGCACAAAAACAAAAAACGAGTTAAATGGGAAAAAAAGCA^ 

AGCCTCTCACTGCAGCCTCAGAGAGCTGGTCTGGGAAACACTGGTAGATGAGGACTC • 

3 5 TCACTGGGTTTGCTGACTGTGACTAGAAGTGATTAGGGTGTCAAAAAACCC 

CTTTCATTGTTCTGCTCTTCTTTTCCTTGTCGCTTTCACTTTGGGTAGCTTCT 

ATG AG ACCCCTTTGGCCAATGCAGG AGTCTACACTCTG AT ATTCT AGAGTCAAAG CTGAATG CTG AC ACCT AGGAATTCATCTCTA 

GAATGTTTATATAAGGAATAGCCCCTCAGTATTCCGATCTCGTATCTTAGTAACGAAACTAACAAAAGC 

GTTTTCTTGTCTTTACCATAATACAAAATAAGTAATTTGTTCTGCACCCT^ 

4 0 AAGAATGGAGACCTTATTACATAAGACTTCCTGAAAAAGGAAACTCTGTTTTTGTTTC 

GTACACTTACCAAAGGGATTCCTATTTCTAAAACACTCATACTC 

AGAGCACITAAAGAGTCATGTCACITTTAGTAAAGAATC1AAAGGATACT 

TTAAAACCAGTATTTGTTTTTTTCATTTCTAAAGTTGGAAGAA^ 

GGAGTTGACATTCTTGGTAAGAAATCTTACCAAGATTATGTTATAGATTATAAGAAATCI^ 

4 5 CTCTTTT CC CAT AAAATATT AAAGTAT CAGCAATTT CATAGGATTCAAC CT AATG TATGCGAAATGCT AG AT AAACAGAT AAATAC 

TT AATAT CTGG CTTTTTTT CAAAGCACTC^GTT ATTTGTT C CTTG AG ATTT AT CCT AAATGTGGG CT AT AC CCTGGTTT ACAGTGT 
CTCACAGATGTGTAGTAGTAGACACTCCATAAGTGTTTACTGACrTGA 

G AGGAG AG CAT ATCT AAAGCAAGCT AC C CTTT C CTTT AGGG CACG T CTCACT AATT CTTTGGG T AAAG CGT ATTlTr CTTC CTTTT 
GTGTTTTTGGCAGTCTTTCCAAAAATACGTGTTATACCTATGCATTATT^ 
50 GAAAGTGAAGGATGTGGGAACTGAGAGATCTGCATCAGCATCCCACCTCT 

CTCCTGGACCACTATAGTATCATCTGTAACAGGAGGGACTTGAGCCAGTTGATCTCT 
TAGATATTGGATATTGGTTTAATGCTATTTGTAGTGTGTTTTTTTGGGGATATGGAAACCAG 
ATAAAOTGTATATATCTAAAGGATATGGAAACCTTTAGATATATATAATCTGCTTACGTAAA 
CAATC^AATATTTTATCAAGTTAAGCATAGTAAATCACATTGAT^ 

5 5 TCCTTTCAACCTCACAAGG ACCCACAG AAG AAAATACAGTTATCATTT CAACCTGCAGGG AG CTG AGACACAGAGAATTTAAGCA 

ACTGACCGGAAGTCCAACAGGGAGTCAGAGATTGCTCTGGGGTC^ 
TTATAGAGTTGAGATTCTATATTTTGAGCTTGTATTTACCCAGAGA^ 
GCATTTAGGAGAGGCGGGATGACAGAACrTTGTTGAGTGAATTATAA 
TTATACCATATCTCATCCAGTTTTCCCAGAACACrTCTCATAGGTAATGCTTTATT^ 

6 0 AAATGTGTATTTTTATAATTTAACCAGAAGTTTATTTCA 

TTAAACACTTGATGTCTTGCTGTCTCCGTCrrCTGTGTGTTTGCATGTCATTGTACA 
CAATATATACCTTATGTTTCTATGTGCATATAGTTTACCAAATAATA 
TTTTTATTTTATCTTCACCGCAACCCTCTGGGATAGACCAAAATCATG^^ 
TAAAAAAGGGACCAGTCrCATGTTCCCATTCGTCrrACAACTAATTGGTC^ 
6 5 CrCCTAGTCACTGTGTGTTAGTATTTCAGATGCCTGTTGGCTTGATTTAGTC^ 
TTTTACATTITAAATGGTTAACCAGGCAAATTGXjTTT 
TGATAACTTTTCATCATCiGAGATCCrCAGCIGTCAC^ 
CAGCZAGCCCGGACTTACACTTTGGGGCTTTTAC^CAAATCA^ 
GAATTAACACAGGGCCCTTGGTGGGCATCTGAATATATCTCACT^ 

TATTTTAGAATTGCGATCCCTACTGTTTCATTATTCCTTTTTCrCCCCCA 
AGGGACACTC^AGCACTGACATTTAGTTCAGTGCAGGAA 
TGTAGTGGATCTTGTTTTCTCATAGATACAGCAGTTAAATTTT^ 
CACTGAGCGGAAAGTACGACATTTCTTCATT^ 
75 AGGGTGT CT CACAC CAT CCTC CAGTG CTCCAG CATGAT CC CAGGGAJULAG CTATAGGAGGAAAA JGAGTCAATAJUIGTtTAGTTTCT 
CAACCTCCCACCTCCACCCCATAATAATGACAGCT

ATTGGGATAGCATGTCAGGCCAGAAGGCTCCATGGTCATTTCTATGAAGGTACTTTAGCAGGTC^ 
GTCCCTGCCrCCCCCAAATTGCAAGCTCCCTGCTTTATGTAGGAGACCTATC 
ATTCTTACCCCCACACCCACCCCCCAACCCCCCXSCTGC 
5 CTGGGAG TTTG AACACCATTG CTCT AGGAAGTCATCTT ATACAAAAAT AAG AG CAT AT ACCTTCCTG CGTT CT 

CCTATTTGGAGTTTTTCCCCATTTATGAAAGAGGTGAAAAC^ 
GGCCGGGCGCAGTGGCTCATGCCTCTAATCCCAGCGCTTGGGAGGCCAAGGCAGG 

TTGG CTAACACGGTG AAACC CTGT CTCT ACT AAAAAAAT ACAAAAATTT AG TTGGGCATGGTGGGGGAT ACCTG TGGTCCCAG CTA 
CT CGGG AGGCTG AGACAGGAGAATGGCTTGAACCTGGGAGGCCG AG CTTC TG AG C CAAG ATCG CX3CCACTGCACTCCAGCCTG 
1 0 C4GTGACAG AGCAAGACIXTTCTCTCX^ CAGTGAAAAT AGT AGTGACTT AATG CACATTG CCA 
AGGCTTTAGCATAACATGAACACITTCACTCAATGTCTCTCT 

CTTT AACT ATTCATTTTGT ATTGGCT ATCCAAATAT^ CTA 
AGCTGGAATATTAAATTGTGATGTCTCTTTTCCAGAGAATG 
ATTTTAAATATTCCAGGAAGGGTTGTTTrATATACTGAGGATGATTTTA 
1 5 TTGTGGAAGGTTTATTCAAACAAACAAGGACATTTCACACAA 

ttgcacacactctcaaaaatctaggtttgtctatgtgttcatatc^^ 

agg ctgttgctgttttg tt agactt cttcac^gaccttc^ ct aaaatgg tctt c cacacgtag ct at actg cat aagtt cacatca 
tctgtttcttgcatgtgggttgtgtctcaactcaagtttaagttagattt 
gagaaaagagcatttcctactagttatggcttccqvag^ 
2 0 ttggtcaaatataattctaatccagtcaaaataaatac^c<;w 

GTATAAGCAATTTTTGTATTTGTGTGGATTATATTTrATTTTCCT 
TGCGGAGAGGGGATCATTTTCCCAGTAGGCATGC^T^CCCT^ 
ACTAAGCAGCTGGTAAGATATTCCCTC^TTCTTGCAAAGAAAGTGAGCAC^TG^ 
GTGGTATCCTCAAACACAAATTGACAGGAGGGTGTGGTGTGGCAA^ 

2 5 GAGTTGACAAAAGATAGGAAGAACCTTAAAAATATAGATGCCTTTTATGCAGTGA 

TATTAGGAATATGTGCAAAGATTGGCTATTAGGATGITCATTAC^ 

CTCAAAAATAGGAAAAGAATTTAAATAAATCCTAGTGTACCCGTTATACATGAAATTATGGAA^ 

TGTACATGAGAATGTCTAAAAGCTAGTTCATTTTGAAAAACAAA^ 

TAAGAGTGGCAGTGTCTGGGATTATAGGTGATTGTATTTCTTCCCTTTTGCAC^ 

3 0 AAGTGACTTTTTTT ATAAAAAGAAAAAGGT AT ATG CATCCCAGCAG AGAAGCACTGG CTCC ACCCAGT ACCTGCCTCCTCATGCCA 

CCCTCTCAAGCCAAAAGCCGGGGGAAGCCCAGGC^CCTT^^ 

GAAAGCCCCCCTGTAAGCGATACTCCAGATGAGGGCGATGAGCCCATGCCC^TCCC^^ 

GCAAAGCTCCAAGAGTGACAGAGTCGTC^TAAGTGGGTCACCAGCGGCCT 

CGAAGACAGTGATGAAGGGATGCAAGTCATTTTATCCATTGTGTTC^ 

3 5 CAGGATGGGGGTCCTCTGGTGTGTGGGAGGATGGACACTCACAGGCCAGCATGGCCGTGAGAGCCACACACCCCGCAAAATGTCC^ 

AGTTGAGGAGCAATCCTGCCCAGGGACGCGTCTCTGTCACTGTCCTCTG 

TTGTAGAT CGTGAAAATT ACACACTTACXSTGTTTTGGCAAGAG^ CAT 
TTTATGTTATCCCTCTAATCTCTAGTTCTTAAATTTCAGATTTAACTGAAAATAGAAAGT 

AG AAAAAAATCAAAG CACACATTT ACAT A C CTTTTC CAT AAAAAAG TGTGTG TG AAAC CAT CTGCAACT C CACTGGTT AACTT ACA 

4 0 CATGCGACAGTTTGTGTTTGGAAAACTTCACCCACCCTCTTTATT^ 

AGGTGAAATGACAGAGTCAGAATTGAACTGAATTTCTGACTGAAAAACTAGTTCTCGCTCCATTATA 

TG AGTCAACAG TTCC CTGGAAT AC CTGTTTTCTTCTTTAAAAAG AG AAAG AG CTGTG AT ACTG AG AGCTAC CTATTGG CAT AAAAG 

AATATGAAGGACATACAGGTAAAATGAGGTAGACTC^ATCAATGTGGG^ 

TATGTAAAATTCTCAGGCTTATC^GAATCACAACTGTATGTCTATAGGACATCCTAATGC^ 

4 5 TTTTATCTACATGTATAGAAAACATATTTTGGGCCC^GCATGGTGGCT 

GCG<5ATCACAAGGTCAGGAAATCXiAGACCATCCTGGCTAACACGGTGAAACCCTGTC^ 
CGTAGTGGTGGGCGCCTGTAGTCCCAGCTACTCXXSGAGGC^ 
GCCGAGATTGCGCCACTGCACTCX1AGCCTGGATGACAGAGCCAGACTCCATCT 
ACATATTTTGTATGTTTTTTATCTGTCTTAGAA^ 

5 0 TAATGGCACAAACAGATAAAGATGGTGGCTAAAGCCATTTC^STGTAGA^ 

CCTATTGGAAAATGGCTACTATATGTGGAAAGTAGAGGGGAAGAGAAATGAACTTGTGAATTAT 
GATACATATATCATTTCAGGGTTTTCCAAAGTTGGGATGCAAAC^ATCATC 
TCTTCTATGTATTGCAAGTGATACTC^TTTTACCATGCACTGTACCACTACGAGGGT^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

5 5 NNNNTTACTCCCTGAAGCACCCAGCC^CAAGGGTACTGTTCTTCCTG 

GAACACAGAGGCTCAGGGACTCAACCAGAAGAAATGATCCATTAACC^ 
GCCAGCTCGC^H'CGCTGTGGAGAAAGAGATGGGCTCTCCCTGGAATCCCTGCA 
GGGCATCTTTAGAGAACTGAAAGTCAGGCCGATTCCTGGAGAGTGAAGAATAGC^ 
TCCCTTACGGACGTCATATTGTTGGTGGGTTTTCCTGGGTGGGAGATCACCCAGTATCTC 

6 0 GCTGTCITTTGGTTCAAAGTCCTGCCCTCGGTCCCAGCTGCC^ 

GCCCATGGGTCCTGGCCCTGCCCTTTATCATGTTTC^ 

TTT CAG AATTGTGAAATTT AAAAACCCT AG AAAACCT CTTTG CTTTG CTTG T AAAT AG CAG TTG ATG C AG CC CTGTGG AC AGG CGG 

GAACAGCCATGTGACGTGGGCACTC^GATGATTTGGGAGATTGAATGC 

CTGTAAAGAAAGCAAAAAAGATGCAAAAGCCTACATGACAAACCA 

6 5 AC-GGAGAGGATTCCACAAGACCGAAGTGCCCAAGGAGACATGGAG 

CTCCATGGTTCTGGAAGGAAACCAGCC^CTTTCATATGTGGTGACAG^ 

GAAGTATGTAGAGAACATTAGGAAACAATATAAAAAGGTAGGGGAATTCGTGGATAAAG 

ACATTATTCCATGGGAATAAGTCATCAGTGCAAAGGACTGTAAGGAGTGCACAGCAGGA 

AG AGGAGGAAATGGAGTTGCTGAAATGCAGGGTGAGCAGCJIGGGG CGG CACACCTCAGAAGGG ACA 

7 0 GGTCCAGGCCCTCTCCTCCCTC^CCTGGAGCTTCTGGCAAGGCTTT^ 

AGTAACTGAACCTGCATCACAGAATGGCCTTGAGGGTTAGATGGAACATATATAGACATAATTTGT 

CACAAAGTAAAAAGGACAGAAAAAAGAAATGCAACTTGTGCAAAATAGAACTTTCAAAGATT^ 

CX3TAGCTTCACTTGGGAAAAGGGAGAGAACAGATGACCAGCA 

AAAGAAT AAACAAACTGAAAACAGCCAAAAGAGA^ T AAGGGTG CT 
7 5 GGCCGCCACCTCrACTCAAGTTCAAGTCGAGCTGCCACTGTTGAAATTTCAAGGAG 
AAAAAAAAGTGGGAAATTATTTGCTAATAATAGATTGCAAGAGTGTCTGC^
GAGCTGTTATGACCACTGGCATCCTTCAGTrTTAGACAAATCTAATCTCAACAAGGAT 
GGGAATGGTGGAAATCAGTTCCCTATGAGGAGGGTGCAAAGTTATACTGATTGGTGAGTCACT 
TCACACCTCATTGCTTCATTCAGTGTGGAAGGAACATTCTW 
5 GAGCAGCCCTGCAGCCCTCAGTTGGCCTCCATTTTCTTCCCGGTCTGAAGA 

ATTTAAGTTAGACTTAAAGG AG AG ATGTGGTCTTTACT CTCCTTTTGG CAAGG AGAGTTTAGTCAT CAGAATGGTCTATTTAGTGA 
GACTGTAGCTTGTTCTTCTGAAGTGGTCTTTTTAAAAACTAGAGAACT 

AG CCGAGGATGCACT CAGTG ACCTCTTCAGG CAAC C CCAG ATTTACTCAGTCAGCCATG GCACACATGCT AG AGAAGT AAAAAAAT 
GAATTGAGACTTGTTCTGAAAACTCTTGAGCAGGGGCTCTTAA 
1 0 GCAGATAAGTGi ^^i"i"i\irGTGAGAAAAGTAAAAACTTAGTTAi'i 1 i u i VCCCGTATTCAGTAATCCTTGCArrTTriTCCTCCTCCT 
TTATACATACATTrTTGCCAGTTACTGACTTCT 

ATTCTATTGACAAGCCCACAGGTCTGGAGGTGTGAGTAAGGGCATTCCTCCCCCATGAAA 
GGGnTGGAAC^GAGGAGCAAAAGACCAGAGGACAGAGAAGAAAGATTGCAGGCTAGAA 
CAAGCAAATTTAGGGCTTAGAGTAGAATTGAAGAAAAAACCCAGGGCAGAGGCAC^ 
1 5 ATTGATGTAAGTGGAGATGGGTTGGTTGAATCCGTGGAAGGCCAAAGGATTGAGGCCCATAT^ 
TGAGTTGCTACCTGGCTTGATAATCXK5GACAACCTCAGGCAGTA 

TAAGGTATATTTCTATATGCCCTGGTACATTCCAGAATCTGCrrCTCTGGCTCTGTGCA 
AGGGGATCGCCATGGTTTCATGAGTGGTTTATTTCAGCTTGCATCTCAC^ 

GG CAG CACTGAGAGATAG AAACTG AATCAG CACAACTT AGGTCCTTCATGTTGAAT AGTTTG AACAGAATTTGATTT AC C CT AGC C 

AATT CI TTI ' 1'ATTCATTCAATTGAAGT AA7UVATTT CTTTACT CAGCCATG AG TT CT AAGACTG CAGTGGGAATTGT AAGAGGATT A 

CCATATTTTAAATTGGCTATTCTGGTGCCAAATCTCCCCTTCATTTTACAGT^ 

GGGGGTGGGTAGACAAATGGTGGTAACAGTCCTCTACACTCTGGGAGGGAAGGAGCGG 

AAAG CAGCCGCTCTGGAG ATTGGGCAGCAGGCTITTG AGCACGGATGCG^ CAGCTAG AGGAAGTTGT 

2 5 TTTT AT CTG CG AAATGCATCTCTT CTG AACCG AT AAACCCCATTTT AG CTGGGGAG AAAT ATT AT CT AG CAG CATCTG AG CATTAG 

CTGCAGCTGAACTOkAGCATGTTCAAATGTGAGAGAAGT 

TTTTOUVGTTTTCTTTTCT T TCT T CTTT G AGG 

TTCCC1*NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 ' NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 0 NNNNNNNNNTTAAAAGTCCTGTCAGAGTAGATAGATAA 

TCCATCTGTGTTGAAAGAATAGTTAGCTCTXSTGTATAAAACAACCT^ 

AAAGAAATGGGGAAAAAAAGGCTGGGGGGTrGAGCTGTGGTTGCCAGTTGCCACCTGTCCCCCT 

TTTGGGAAGTGTAAATATAACCCAGAAAAAACAGCTTCCCCTCCTTTTATGCAAAAGTAGAA^ 

ACATTGCCCCGGAAAGTTTCCACAGAACCCGTTAGACTAAATGAAAGCTCTTGCTCT^ 

5 5 CTACCAAGCCCTCTCTTG CTGGCAAGGCAGGG CTGGGCCHtXSGCTACTGTCrCCCACCAG AT AAGCCCAG C TTTC T T CAAGGCCCT 

AACAGGAACCTACATTTCACAAATTACCGCAGACATGCTCACAATTTT 
CTTATAGGGAACTCAGAAAGCATGCTCCGGTGAGCAGTGAGCCACCTAGAATCAGCCAAC^ 

TTGTTGGTAAACTTTTGGT AG CTTGAAGTCT ACCG TGGTTGGG AGT ATTT ACAC CACAGAAAT CGGTG AAC GCTGCAAAT CAAGG C 
TTTTTCTTTTTGAGAGCTGGTTTATCTGTGCACCACTAGCTTAA^ 

6 0 TTTGAAAGGAGGAATTCITTTGGGTAGGGTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

GGAGCrGGGAAAGGGTGAATCrATTGGGGAGGACTGTGGGGTACATATAATTTGTCCA^ 
TGCTGTTGAAGTCACAAACTC^jAAATGTTGGAGCAACTAGACTCTCCAT 
TGCrcGGGGAGTATGATTAGAAAAGGATTTGGCCAGGCATGGTGGCITACGCCGTAATCCC^ 
AGATCATGAGGTCAG^AGTTTGAGACCAGCCTGGCCAATATGGTGAAACCCC^TCTCTACTAAAATATGAAAAA 

6 5 GGTGTGGTGGCGTGTGCCTGTAGTCCCAGCTACTGGGAAGGCTGAGGCA^ 

GAGCCAAGATTGTGGCACTGTACTCCAGCCTGGGTGATAACTCCITCT 
GTGTCTTATTTATGAGCrATTATGCTATTGCTGTTATTTGGGGCTGAATGArr i I" 
GCAAGATGTCTAAATTACATTAATATGTTACTAAATTACTATGATTTTTCCAGAA 

CTGTGAIT AAGTTGGTCTTCTGCAAAGAAATTT CTGTTX5ACTTCCTTAGATTAAATGCAG CCCTG AGATAGGTGTGCAGGTCCAGG 

7 0 AAGTGGCCTGTGCAGTTCTCCCTGTGCCTGCAGGGGTGGCTTCrTGAGG 

CTAAGGAGTCTATCAGAAAACTACCTCCACTGTGGAAGGGCTGGGGATTGTAT^ 
ACTCCATTAACTGAAGAGCCGTTTGGTGCCCTAATTTCAGGAG 

GTCTCCGCAAGGCCTGCTGCCAGGAGAGTACTTCCCCCACAGGGGATGTGGCTGC^GGCTC 
GATGATTCTGGTGAAGCCCrGAACATCCGGTGTCACrCCTC^ 
75 • G CTG CATGG CAGAG CC CAG TUG CAGATGTGG CTGTTCTCT i 1 1 1 CAGAGTGAAGGACTGTTACCCTAGAAGTGAAATCCTGCTGAA 
ATG AG AAAAGTGCTGTT ATTATATTT CT AC AGG CAG CT AT ATG ATCATTTTT ATT ATG ATT CI^GGAACCTGGCAGAATTTTGACT
GCTTTTGCAATATCrrCTGTGCTCTCCAGTTGAACTTGGTGGC 

TGTCCATAATGTTTAGTGACAATTAACTATTACCAGGTGTAGCCACCCTTACTTAATCT^ 
TGAAATTATTGTTCTGTAGAGAATGCTGTGATCTAAGTGAAGTGACTTTC^ 
5 AAAGG CTGCATTT ACT CG CAGTTTT AAAG CATT ATTGCTACTT AT ATATGGTCT CAACACTG ATTTTT AT ATTTGACTTTGT AGCT 

TTAAGTGTAAAAATTTCTGTTAAGGTTTTATATGTGTTAGTTAAT^ 
TCCTAAAAATATACCCTTAAAATCTGCTTGCCCTGTGCCT^ 

GTGATTCTCCTGTCCCAGCCTCCCIAAGCAGCTGGCtATTACACGCCCG 
1 0 TGGGGTTTCACCACGTTGACC^GGCTGGTCTa^ 

ACAGATGTGAGGCACCGTATCCAGGCTGTATGGAGTCTTTCTAATCAGGACATAGGC^ 

TGAGACGCAACAACGTTGAACAGTCATTGTTTGAGGGA 

GTGAAATATAGAAATCAGAAATTCCAGCGTAGCACTGTCCAAGGGG 

ACT CACATT AAAAAAGAAAAATG ACACAGGTGAAATT AATT^ CTT AATTCAG TATG CTT AAAT ATT ATTT AAGT A 

1 5 TGTACTCAATATAAGCAATTGTTAATGAAATATTTTACTCT^ 
TTCACCACACATTTCAATTTGGGTTGGTCACATTTCAAGTGCT 
CAGATCTTGAGAGCTCCAAAGAACrGTGTGTCATTATATTGTGGAAA 
CCAAAGACTAGTCATTTATCAACTCTCTAAAAGTAGTTTTT^ 
GTATGTGAACATCTGCTTTTTAAGTCTTTCATTTTT^ 

2 0 AATTGGGTTTATTTTATGTAGCTGTGTTTTAAGTACAGGTOT 

AACAACATGTGAACATGTCAAAOVAATCAGTTTITCA^ 
TCTITCATCCCTTCATTGTATTCCCATGTACTCTATGGTAACTTTTTTAAG 
CTGCTTTATAGACATGAGGCAGGTTCCCAGAGAGCGCCCTTGAGGACAGGCT 
ACAACTATGGAATTGTTTTGCAAAAATTGTGGAGAGTGATGCCCC^ 
25 ACCGAGGAG AGGTGGTGAAGGAAAGGCT TTTT'l T TTT TTTTTTCAAAGAGAGAAAG AGAAGCAJGCAAGCTGCCTGCCTGCAG CCTG 

AGAAAGCAGTTCCACCTTGGCACCCATGTCCTCATAGCCCAAGCGGGGCCACCCrC^CATGTG 
CAAACCCCAGCCTTCCTGTCxAAAGGGACACCn'CCTGCTGCTAACC 
GGCGGCTGCATTTCCTGAGTACACCATGATCCAAAAAGGCACAG 

CCTTT CTTG TTTG CTTGC CCTTTTG TTT CTTT C CTCCTTTTTC CTT CCTTCCTGTCTTT CTTCTi'CCTT CCTTC CrTTCTTC CTTT 

3 0 TT ClTrri 'AATTITCCrCCTTTCrCrTrCTC C ^ 

TCTTNNNNNNNNNNNNNNNNNNNNNJ^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 



. NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNTOTONN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNKNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

KNNNNNNNNNNNNNNNNNNNNNNNNNK^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN1W 
45 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNTOWNNNNNNNNN^ 

NNNNNNNNNNNNNNNWINNNNNNNN^ 

NNNNNNNNNNNNNNNNN^^^NNNNNNN^ 

NNNNNNNNNNNNNNNNNNWNNNNNW 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNIWNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNHNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
5 5 ATGGGATCCCCGGGAGGTGC^AAATTGAGCCGTCTTTGGAGAATCAGCTAATGAGACAGATGCATGT^ 

GGCACTCTG CT AGG CAG AGGGGTG AACCAGAAGAATGAG ATT CATGGGGCCAAAG AATTTG C CTTCTGGTGT AAG AAAAG ATGG AG 

GCAGCTTC^CAGAAAGAAAAAAAAGGTAAAAGATAGAAATGAAAT^ 

GGTTTGTGGCTCACGCCTGTAATCCCAGCACITTGGGAGGCCGAGGCAGGAGGATCACT^ 



6 0 TGG TT CAG TT AAAAT ACTTCATG AAAAT ATTT AGCTTGATG CACTG AAATCCAT CACTG CTTTTGTGGG ATG CACTGACTG CAGTC 

TGACTGTTCTCCTAGTGGAAAGGGGAGCAATAGCTTCTGTCTGCATGTGCA 
AGACAACTTCATCCCACCCCCGCTGCCCACAGAGCTAGCCATGTGCACGGTGTGTTTGAACAATG 
AGATCCCCAGTCTCATAAAACATACTGGATGGAAGGCTGTCGAGAGATC^ 
TAACAGGTGATCACCTCGCCTATTC^GAACCCTCAC^AGGCAGCCCACTTAC^ 

6 5 ATCTGCTTGTCTCTAAAGCCAGCCCTGCAGATGTCTGCAGTGTAACTACTTT 

TGACAACAG CTCT ACAAAG CC CT CTAAAT AGC CCGTT CT CCTGG CT AAAGTGT GTTGGTTTCCT CAG C AT ATGATGAAAT AAAC AC 

ACCCAAAGGGCCr^CCTGGTCTTTCCAGGGAATCCA^ 

TAAGTGTACCTTCCCCCTCAGGGCATAATC^AAACAGTTTGCCAAAGCTAACTGTGTAAT^ 

7 0 TCAGAAAGCATGGCATATTAAACTCATTTTCCAT^ 

GAAT AACAC CTT CCTTG CTCCTC CTAAT GGGG CACATGTGCACAT AT AAAATTGTGTTCGAT CTTTGCAGC C^ 
CTGTCAACTGAGAACTCATGAGTCAGTTTCT 

TTGGAGC^AAGGGTAGGATCTGGCAATTATCCTTATCTCATTTGGTTCA 
TACTGTATGTGCTAGTTACAC^KjACACOTGATAAATAAGACZAGCIATT 
7 5 CTAGTTAAAAACAGAGCCTTTTTTC^AAACAGCCTACTAATCi^ 
TGCAAGGGCTGCCTCCTGGTCTGATCTCTGATAAGGATCAGATCCTAGAGCCTCTGAGATCCCTGTCCTCCCT
CTCXSCAGAAGGAGAAACAGTTTACGGTGGTTCATCATGA^ 

AACCAAGTTTTTGCGAGTTGCCTAAAGTGGGCAAAAATCCCACAATGCACCTGA^ 
AGGAGCCCTCCTTGACTTCTACTTTACTrTATCCATTGACACTAGT 
5 CGTTTTTATTTTCGTGAGTGGAAATATCACTTTGAAA^ 

GTTTGACGCGTGTCACATGATTCCACAAGTCACTCAAGGGCAGGAGGAT^ 

TG AAAGGGAAACCAGGG AG AAAGTGTGCTT CGTGAGAAT AGGT ATG CGGACGG CT CTTTGTGACG CTCTG TGCAAC CCTCGTGTTT 
GCATTGAACCAAGCTGTGCTGCAGATGGACACCCATCCCATTTCCCCGACTCA 
GGGGACCACAGAGCAGGGGGCCATGCAGGGGACCAAGGGACATTTTAGTGTAACCAAAG^ 
1 0 AATAAAGAAAATCACAGGTTTTCCCGCTGATATGCCAGGGACATTTCCAAGW 

CC CATCAGTCAG CCAT ACTG CAT AATTG TTAGAT AGTG AAAG AAATT CATTTTTT AAGTTTGTCAGAAAAATAAATTCTTTG AAGT 
CTTAAATTGTTGCCTCCGGACATGACATATCTGGCTCTTCCAGAATCCATGTTAGTCCT 

GCGTTTGTTGATT ATTG ATTTTGTAAG ATG CC CCACACGTTGG CTATT AG CAGAATTCTCACTT CT AAAAGGAAAATG AGTG TG AG 
CTATGTTCAATGAGAAACAGTrAlTTTTGGGACATTCTTC 
1 5 CCAAGATTAGAGCAAGAACATAGTGGTTTTCAC^CCCTGGACATCATCCACA^ 
GACAGGCCACCATTTTCTTTCGAGAATGAGCAAAGTGAACCACCATGCCAG 

AT AGT AAAAAAT AAACAAACAAGGCAG CACTTG AG ATGGT CATGGCAG AG CAAT CTCAACIAAG CG ATTTGTTATTTTG CACAGTG A 
TGT AC C CACT CATTGAAT AAAATG CACCAGAAC CATG CACT ACAGATG CT AAGG AGAGTTG TC CITCAACAAAGGAGAT AGGCC C C 
AC!TGG CCGTGGGG CAGTTT ATGTTATTTGGTCCTTGT CATGGG CAGG CATGGCC CTTTCT ATG CTT CACAGATG AGGAAGGT C CCT 
2 0 GGCACAGGTCCAGTCTCCAGCATGGCCTAGAGGTGGCAGGTC 

GTTTATAACACGAACGGAGGTTAATGAACTCATCTTCCCATCGCACAACTO 
CTCTTTAACTCTTAGTCATTACCACTGTCTAGTGTAGGCCTCTGTC 
GAATAACATACTGAGACATGTTGCTGTTGTGTTCTAAATAGC 
GTCTAGTTTTTCTTGCGAAGTGTTTGTTGAGTGTGTAATGAGGA 

2 5 AAGCAAGGAGAGAAAACAGTTTTGATGAGAAGCGCATGGAATTTTTA 

GGATTTTAGAAGAGATGGATGGGGAAAAGGAACAGGGAGTGGGAACAAAACGTGGA 
CTCACCGCCGGCTTTGGTTTATCGTGGGCCAAGGGGACAGACCCTGTGC^GAGGG 
CCTGCAGGCAGTAACAGCCACCCAGTCTATAAGCTGAGTGG<3CATCGGGGTG^ 
ACCCCAAAATGATAGCTAGCTGGAACCATTTATTCTATTGCATTTTATO^ 

3 0 CTGTCACATTCAGGGCATTCGTGCTTATGTTTTAAAACATGTTAT^ 

CATGCAAAGTTTCAGTCAAGACTTTGGAAACAAGACAGTGTCTra 
TTCCAAACTTAGATGTAAGAAGCTCTGAGAAACACATGAAATCACCCCCA 
GGCAGCAGGGTGTAGGGAGCTGGGGCAGCCCCGGGGTGGGCCTTGCAGGCTGGGCTTC 
AGGCCTGGAGCCGGCTGCCCACTCTCTGACCAGCACA 

TCAAG AAGCCTGT ACAG ATGTTCCCCTCACCACTGTTTCCAGTCACCTTTGG CTTTCACGGTG CAGATGCTAAGTTTGATTTTCAG 
AGCCCATCTGGGAATTTAGTGAACTGAACAGGTAGCATTTCTGAACCCACCCATAACCCATG 
AGTTTGCTGCAGGTGACTTTTGCAGCTGGGTAGAGAATCTTGGGGCAGGA 
CTGACGGCACXSTTACACCAGTGGACTCTAACGACGACCTCACCTCGTGCACAGATAAT^ 

4 0 TGTCACTGAAGTGTGAACATATTATGCTGTTAGATTTCCCATCATTTCT 

AAATGTAGATCTTTGGTATGATTTGTACAACTCCCGGGTGTCAATCTGTGAAAGAAATCGCA 

TTTATCCCTGCGTGCTGGCTTGCCCGGGTTGACTCAGAGGCAGTCT CACATTCAGCTGCGCTGGGGCCAAGGACCCAGGGAGCCAA 

GTGTGCITCTGTTTTCTGTATTTAGCAATTTAAGACCTGCGTTTAAATACTAG 

ACACAGTAACTCTTAATTGTTTAATTCAGTTCGTGTGTTACCTCTCGGAATAAAATAG 

4 5 AGTTTGGATTTAAGATCACCAAAACATTTACCACATCTCAATTGTT 

TGGGGAGTGGGGGTGTGGATTTAACCATTATATTTTTATGATATTGGGAGTAGATT^ 
TTAAAAAGTCACTTATCTTGATGTAAAATC^TGTCTTAGTAACCTTC 
CATATTTCATTTCATCATTTCCCAGAAGGGGCACrrGAATTCACCCATTT 
TATCTACTTTCCCGGCAGCAGGACCCTGGAAGCAGTCACACCAAC 

5 0 TGCAATCAAATCAACAATATCTTTGCTCAGAAATGGATATGTGAAGTAAAATGTGCT 

AGCrGGCCAGTGAGCCCTTCATTGCAACAAAGATTTTTCTAGAGCCCCTGCACTA 

TCACACGTTTGG AGG CT AC CTTGGC CAGT ATT AC CTT AAT CCAG CATTTAGGGG AAGG AGCATTTCAAG ACT AAATTTTCTAAACT 

GCTCAAGCCTACCTCATTTTATTTCTTGTGTATTTTAACACTTTTGGATGA 

GCCCCCATGCCAAGATTCTTTAAAGACTlTCrrGAAAACGCTTCAGTCTrrrCITC'r 

5 5 CTCTGGCATTACAGGGAGTTAATTTGTGGGCAC71CACAGT AAATT AT AAACCC CTTAAGCAG AGAAG ATTGT ATT AT CAGTTTATT 

AGTTTTCnTATTCATTCCTTCAGCACACATTTCTGTTGCCTCCAATGC^ 
AGATTTCAACTOTGT AGTG CTCTTTCAGCCAG AAAG TACACTTGTGTTTCCCT 
CATTGTGCATAGAGAGCTAATAGCTGTACCCTAAATGATCCTGGCTTTGAATTCTCTTATCT 
TCCTCTGCATTCTAATTTGCTACTTCTAATCTGCTGGGAATTACAATAAGAAAGAAC 

6 0 AGAGAGTGTGTGTAAGTGCCGAGAGAGTGTATGAGGGACTTGCCC^ 

CTCTTTTGG TTG ATT AC CT CAGAGATCAGTTTT ACTTTCT TTCTCATT CTTG ACCT ATCATCACT AG CTGATATGG ATG ATGTG T A 
CAACTCTGAGTAAGAATAATKNNNNNNNNNNNNNNNNl^ 

TTCTGTTG ATCTTAAAGTTT ATATATTTT AAG TTT AGTGTTT C AGAATG AG AC CAAAGCGGT G ACATTTTCAACCT CTTCGGTCTC 
TCTTCAGTTTTTTCATTTTAAGTTTTTGTTGTGCTTCT 

6 5 TT AT CT AATGT ATAAATGTG TT*1"I"1ATT ATTT ATT AGGAAATTT ATTTT ACTTGG CCCTCAG CCATGACATATCATGGCATAATCA 

CTATCCTAAATTrGTATATCTTATCCTTGCATAAGGTAAGACTTTCTATGAATT 
TTAAGAATTTTITTATACTTTGTTTTCrcCAAATGAAAT 
TTACCCTTGCTTAGATCTTATC^TAGAATCTTTTCATAGAAGACACA^ 
TGCCCAGTGGTCTGCTAGGATGCTGTGCCACAGCCCAGGCACAGX3AGGCA 

7 0 CAGCAGAATTTGCAGCTCCATGTTTCCAAAGCTT^ 

CTCAGTTCTGCCAGCCACACTCTTGCCATGATGAGCAGTTTC^ 
GAGGTCTGGAAGCCACCTGCGCCTGCrGGCCCCrCCTTTCTTGTTT 
AGAAGTTGCCCTCCTCCCTCTATCCTCACCTCCTGCCATGCTGTCACC^ 
CCAGTGATGCAGGAGAGGACATCCTGGCCGGAAGAGTCAGAGCTT^ 
7 5 G CC CAGGGGCCAGT AG ATG C CACTTTTGCT CCAGGAAGAATCl~TCAACT GT G T CCTTTTT ATTCAAC<KX3CTCTCTTTTCAG CC AA 
TCTCTAGATGTACTAGTCACAAACACTCGCATTTATTAAAATGTATCC^
TATCACATTTCTGAGGGCCCTGTGCCTGTTTTCTGCAGGAAGTTGCTGTTGTCCCGGGTC 
AGAAGCAGACCCTTCATGCCACACTGGGACCCAGGCAGGCCCCAAGCCAGGATGCT 
GAACATAGAGGAGGCAGAATTCCCCCCATAGCATCATCGTGGAGGGCGCTGATTTGTGTGCT 

TCTTGCAGCCTAGGCACCCTCCACAGCAAGCCCAACACAGGTGCTGTG 

GAGGATGAAAATGGAAAACCAGTGCAAAGGTGTGTCATCCAGTTGGTTACTGCTGTGGGTGCCCGGG 
CCCACTAGCCAGGGCAGCCTGGCGGAGGTGAGCATCATTCTGCTTTCTGGCTGCACCT 

CCAGGCACAGGGCAG ACATCAAG CAAGGGGCAGAGCTGGGG CGAGGGCTCTGGGTC CC CCCG TT ATTG AACACAGG C CTG AAGGGA 
1 0 GCCTAAGAGGTGGCTGGCAGGAGATACTACAAGGCAGAAAGACAGG^ 

GAACAAAAAGTCAAGGTTGAGAGATGAGAAATACTCTTGCCAAAAAGAGAACAC 
CTCTCTAGCATTTCTGCTTGTCTTTTCCCTAGTATTGGTC 

TGGTCACCAGGCTGTCTCCCCATGATGGGGATGGAGAGTGGTTAGTGCAGAAACTTAGACCTCCCCT 

GAAGTTTATCTAGAAGGGAAGATACTCAAGCGTAGGATTTCAGTGATGCATTTG^ 
1 5 ACGTCACTTTGGGTAGGAACAATAC^TAATGCGTGGCAATGCCTTTTGTGAGACCTGTTCAGGAG 

GGCAGAGTCCTCAGGTGAGGAGGGGGACAGGCCTGGGTCTTGGGGGAAGGTGGAGATGACCAGCCT 

CCTCAGTCTCCATCAGGCAAATCTTGAGAAGCCTCCTCCATTCTGCAGGCAAATC 

GCGGAAATTCACATGGGGAAGGGCACCTGTGAATGGCCTTCTAGAACTATCAGGAAGT^ 

TGGTCCTGGGTGCACTCGGATGATCCTGCCTAGCCAGCTC 
2 0 AGCAGTCAGTGTCATGTATAAGGATGGGTTGGGGTACCGGCCAACGCCCGCCnTGAGGCCT 

CAAGAGAGTCCTGGCTAGGTGTGCCCCGCTCTGCAGGGCCACACCATCCACTGGAAACCCTGTGCCCACT 

GG CTGTGG CCAATG CT CTTG T C CAAATTTT ACTG AAT CTGCAGTCT CT CTT AATT CACTC CAAAGTTT AATGTG TT AG CTTG CGAA 
T AAAT AAAT AAATAAAT AAAT AAAT AAAT AAAT AAAT AAATGGAAAG AAACAGTC CT CAG AAAGTC C CAGTCAAATTTT AATTC CA 
ACAGATATTCAGCAGTTTCCTCTAAGAACAATGAGAGTTCTGGGGCTAGCCAGTGTTTCT 

2 5 GGTGCATTCATTTAAAAACCTGCCCTAGGGAGGCAGCGTTCCCT 

CCCGGCTGTCCTGCCCCTCCTGCCACATGCCTGTGAAAATC 

GGATGTCTTTGACTCAGAGCTCTGCCACTGGTTATCTCCACGAACAGAAAATGCCACAGATCGG 

TTTTCCCTCAGTTTCAGGCTTTTCTCTCCTTGCCTGTTTTCCTCGCT^ 

ATAGATTTATGTTTTCTTTTCCATTAGTCCACTTTGCGTCTCAGCCCTAAAAT^ 

CTGTGGGGACAGATGGTCTGGAGCCAGCGCACTTGGATGATCTCTCT 
CCAAGGCTGACAACTATGAGCTC^TACTCTTCTAGCAGTTTAATTTAGACCCAAGAAAG^ 
GTGTGAGTGTGTCTTTGTGCCTGTGTTTCTGTGTGTGTCTGTGTGTACATGTGTGTCTGTGGGGTG 
TGTGTCTGCGCGTGTGCATGGCTTTGTCTGTGTGTGTGCGCACGCACTCTGCTAAGCT 

3 5 CCTTT(nX5TCACTTCTCCATAATTCTTTGTATT^ 

TCATTATTTGCTCITGTTCCTCTTCTAGGGCTTTTACAATATAGCCAGGAGGATGTGG 
ACAGTGTCACAATCGCTGTGCTTGGGCCCTACCTCCTGGAGACCAGGGTGGGACGGTGTCTCT 
ATCACGGGGTGGCAGAAGGCCCGACTGTCCGGTCTTCTGGAAGCTGGGGTCTCGTGGTCGCTGGGC 
GAGTAACCGAGAGCAGGATGACTGCACCCCTGACTGCCGACTCACCGGGCTTCGGAGGG 

4 0 CCTCCTACTTCCCCAGCCCACCCTGGTGGTCTCCCACTTCTCTCTCTCTGCCTGTC 

ATCTCCATCCCAACCCCAGACCAGTTCCCTTAAACCACCTCCCACCACAGGACCCCCACAAGCAGGGACCTCT 
GCCTGGGGACTTCATCTCAGCTTCCAAGGCCTCTGAAACCAGGCCCTGCATCCTGGGCATGACT^ 
GCCANNNNOTWNNNNNNNNNNmJNNNNNNm^ 
CCCACCTCCACTTGTTGAGATTCCCTCTTCCTGCTTCTCAT^ 

4 5 CCTCCCTCCAGTTGGCATGGCTTAGTCTTCCTAAGAACATGTGCTGCTTCCCAGTACCCAGGGCT 

GTTTCGGTGCATCTTCCTCCTG CTGTTT ATAGATTGCAACATCC CTGGACAGATCTTGGATTTTCC CTCTAT AATTTCT 

CTCCCTCGGGGTTTGCACACACATCTGTGGAATGCCCAAGTTGGAGATGGTCCTCCCTGATGTGTAGTC 

CTCATTTCATAAACTCATTAACAGCTAGGACGCGAAGACITGGAGAAGTTAGCTCAGGAC^ 

TT AAGAAATGGGCTTTGG AGTGCCTCG AATTTGGCCTCCACTTTTC^ CAGGTCGGGTGAC * 

5 0 CATCTGGAGTCGGAGTTTCTCTCATTTGTAAAGCGGGGGTGATATTTTTCTC^ 

CTAGGAAGGACATAGCATAGCACCTTCACCTGGTTGCAAG 

G ACTT ACATG TT ACACGCTCTT AGTTGTTCGATTTGT AAATG GG AAG AGTGTGTG ACC CAAATT AGTCGT^ C CAG ATATTT C CTG 

GTAAATGGTTGTTGAATCATAAAACTAGAAAGATGGGAAAGAAA 

AGGACAGGACTTTTTGAGACACTCTCATCCTAAGAAACAGCTGATA^^ 

5 5 GAGATGTGAGGTTTTGACAGAAGTGCCAGGAAGCCAGAGGTGTGTGAGTAGTGGG 

AGGAGAGGGGCAGCCAGCATTTCCn'GAGCGCCAGCTAATGGCGGGGACCTGGCACCTTTGCT 

ACATGAGGCTGGCGTTTTCTTTGGTTTCATACGTGAGGA^ 

CTCAGAG^CAGAGGCCGTCTGTAAACACCnTCAGGTGGTTGGAGG 

TTTATTG T AAGGG CT AAAT AAAAT AACATATGC CT AG CATTT ATACAG C ACGGGG CT TTTT AGG AAAT AGTGAGACG A CCATGGAG 

6 0 AGGTGGAGGAAGAGAGAAGGGAAAGAAGAAAAAGAAAAAAAAAGCCTTAAAGAGTTTCTTAA^ 

TGGAGTTTTTTTTTCCCTGGTTAGACATAAGTTTACATGAAACGTTAAACGGC^ 
TCACAGCCACTTGTGAGTTTCITATCAAGATACAGAAT^ 

ACTGAGGTGGAAGGATCATGTGTGCTCAGGAGTTTCAGACCAACCTAGGCAATATGGTAAAACCCTGT 
AAATTAGCTGGGCATGGTGGCATATTCCAATAGTTCCAGGCACTCGGGAGGC^ 

6 5 AGACTGGAGTGAGCTGTCATCGAGCCACTGCACTACAGCCTG 

ATAAATTCTGCAACAAGTC^CAATTCCTTGCTCAGAAATCrrCTACAGGTC 
CATCCTGACTTCTGAGTCTTGTAGAGACCAGATCTGACTCCTGCAGC 

ACAGTAAATAATGGACTCTCTACCrGTGAGTCGATTTTACTGAGCTGCCm 

7 0 GTCATTTGCACATGTACATACACATGTACACACGTGCACACGTG^ 

G ATGCAC CCACCCAATT C CACG CT CCT CAG C CCCTTG CTTT C CTCTG C AG ACAG C CAGTT CAG CT ACAGTT ATGCCAG ATTG CCTG 

CACTCTGATCTTTACCATCACCAGACTCTCAGCTTGAGCT 

CAAGGCAGGACTGAAGACTTGACCTCCACCTGTAGTGGGG 

7 5 GGATTAAGAGGGCAACTGAGTTGTTCTTCCCATCCCTCCTCCTCTG 
TTGGGCTTCTGGAAGGGAGGGACAGCAGAGGATCCACCCTCTGTGTGTCCTGGGG
GGGCCTGGGCTT<n^GJ^TCTTTGAGGCTGACACCCTGCCAGCCCT 
AGGAGGAGCTGAGCACAGCCTCCGGAGTTCTCTTCAGTTGATCGC^ 
GCCCTTTCTTTGCATGTTTGTTCTCATTTGAAGTCAGAAGTG 
5 CCCCATTTCAGGAAGCCMGTCCAGCTTAACAGTCAAAACATTTCCT^ 
ACCGGCAGCAACAGTrTAACCrGTTGCTTCTCTAAGAGTC 
CTAGAGCACATGCTAAACAGTTGATCAAAGGCTCTTGCCTTGTGG 
CCCACGTCGTCCCTTCCGCACTGCCTGCrGTGCCTCTCCTCTCCATC 
CCGGCGCTGCCCCAGCATGTCCTGACAGGGCCTAGATATGGAAAGGTGGCTCTCCATGCA 
1 0 GTGTGACCCACACTCTTATGGGCAGCCCAGTT^ 

TCCCTTATTAGAATAAAATTTGGAATGACAAAATT^ 

CGATCCCTGACAAGAGCCTTCCTATCTAATCATTGTTCCTCCAGCCCTCT^ 

CCTTTCCCrrCTCC rm 'AAAGCATGAACCAAGCTTTC ^ 

TCTCCTTAAATTATAAAACTTGGGGGTAATTTCTAGAGGTGCCATCATAGTGC^ 
1 5 TATCATTTTTACAAAAAAGTAGTATTTCrTC 

CTTCAACCTGTTGAATGTTTTCCCCTAAATTGTTATATGGAGATCCTGGACCC^ 

GCAG CTGAGGAAATGCTCCAACCTCAGGATCCAGGi CAAAACAATCATTTAAAAGCTAATCCTGGCCAGG 
CACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGG 

AACATGGTGAAACCCCATCTCTACCAAAAGTACAAATATTAGCTGGGCATGGTGCCGCGCGCCT 
2 0 CTGAGGCAGGAGAATCACTTGAACCCGGGAG<IAG^ 

CAAGACT CCATCTCAAAAAAG AAAAAAAAAGCT AATCCCAT AAAAG AACCAT CATTTTG AACCTGCTGTTTCCTTT CTTGTACTCT 
TTAGAAGTACCCATTTCTCCCTTTCTAAGCCATAGGTC 

AAATTGTGTTAATAAAACAAAAACAAAAATAGGACTGGTGCCTAGTTGTACTACATGAAGA 
CCTAATCTTCTGTCAAGTTCTCCAAATCATCTGCTCTGGAAGGTAGCTCCAAC^ 

2 5 GCCATCACTGACATGTCCCATTTGAAGCAACCAAAACTGTCCCTCGACCTGACACTCAT 

GaXTTCGGAAGGTCCACTCAACCCCATTGCCAGATAGAGTAAGTGTCTG 
ACTCACGGTTTGCAGGCACTGAAG^C^TTTTCCTGCCTTCTTTTTCA 
CCCATCTGGGAAACATGGTGAATGTTGGTTGGTTGTAGCTAATCCTATGGGTCTTC 
TCTTTATCTGCGTGTGGCTrTCTACTAAAACATGAGCTACAGGGCTCrC^ 

3 0 TTACTATTGCTTATCTGAATAATATTACCTGCTG AG AAGTTTATTCATTG CT CACCAGTTGT AGGG AG ATTTTGACACAGG ACTGG 

AGGATTTTTTTCTCATCGTAACMTGCAGACCCATG 

TCTTAGGG<MTGAGCAGGTTGATTGAGAGGGGTGCCTGAAAGCAAGGTCCCCATGATC 

GTGGACAGGAAACAC^GCAGGATGTGGTGCAGGCTGGGCTGCCTTGGCTGGGCAAAGGCA 

TGACGTGCTTGGAAAACAATAAGAATG<3AGACCTCTCCAAAGACCTCAGACATAATTT 

3 5 CCCCCTGTGGGTTGGAATGTATGGAATCC^GATGAAGCA 

GATGGGATTTGGGTTTGGGTGTACATGTTCCAGGCATGGAGGGAAGA^ 

CAGACCAGGGCCCTGGGCATATTTCCACTTGCCTTCTTCTGCAAACTTTTGTGTGCC^ 

ATCCAGGATCCGGAGCAGTGTAAGACATCATGGCCTGCATGGTGTGACCCTTGACATCT 

GG AGACAAGTGGCAGCAGAGTTG ACATCATCAGGACTGACTG CAGGACAGG CGGTGAAACGAGCCACTG AGGAGTT AC C CAGTGC C 

4 0 CTGGGGAGCTTAGATGTGAGACTTACACCACCCAGCACTCC^ 

TTGCCTCTTTTCTAAGACCGGAGCAACAATACCTTCCTTC>AGTC 
TGAGAAGACTCTGAGCCTCCTCACTAGAGGGTGGTCTAAAAACAAATGGATCCATC 
CTTGACTTGGTAGCACTTCAGGATCTAACTCGTGCTTTCCAGAAGTGTGGCCT 
TCCACCTCCACCTTGCTGTCCTGTCCTGTGACTCTCACTAGCTCAGTC^ 

4 5 GAGGCATTGCCAAGGTACCACAAACCATACTCAAACAAGGTCAT 

TTAAAGAATGACAGCATTTTTTCTAAATTTTATAAATTTTATAAATTTCTCCT 

GATTTTCCCAGAAAATTGTCAGTGTTCTCGAATTTGAAAGTAGATCAG 

AGCCTCTCCCCAGAGTGACAACGGCACAAGAAGTrTATAATTTTATGCACAGCT 

TTTCATTTTTCCCATAATGTTCATCAATTTGG CAATTCAAAT AGG ATTC AG AATCCAAGGTGTTCG CAGCATTTT ATATG AAAATG 

5 0 TGCCTGAGAGTTTGGGGAATAGAAATTCITACCAAATAAG1TAAATTTGCT 

TG AGAAAAAAAGTrTG ACACCTGTTTTATCTG CATCCTTTTATTAAACAGTGG CCTGAATTT ACT AAAGGACAG AG AACAGGGTTG 
AGAGGTAATClACTGAATAATAGCITCGCrAATCCCACAGTG^ 
GTCTTCAGGGGCAAAAGCAAGTCAGC^CCTACAAACCTCTCCAAGAGA 
GCCAGAAGCAATGTGGGAATATTGTCAGACTGCACAGACCTGG 
55 GTGCATACTCACCTGCTCTGTGACTTTGTTCAGTCCCTACCCTGCATGTGTGGTQ 
AGGTCCCTGAGGGGGCAGGGCTAAGAGCrrGTAGCAGTCGCACTTGACCAGT 
GGACACCATTACTCCCCATGCCTCACGTGCAGGTGAGTX3TGTGTATATTTGCATTAACA 
ATGTGCAGCTGCAGTACCrTACACATCTTCCAGTGCCTCT 
GCATCCACAGAGCTCCCAGTGATTGGCTCCCAAAAGATACTGAACTTTGGCATGG 

6 0 CATAATTACCTGTCCAGCTAGAATAGTACACACTTCTTTTACTGAGCCCAG^ 

AAGGGTGAGCGCTTATCAATTCAGCATTACTCTGGAGACATGAAACAAAC^ 
TTGTGAAC CACAT ATTTGGGACAAAAATCAACAAGT AAATTTGAATT ACACT 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GTGGCCCACAATCATATGAAATAAAGTTCATCATCATTGATCATTAGAGAAATGCAAA 

6 5 ACCAGTCAGAATGGTTATTAATAAAAAGTCAAAGAATGAC^ 

GAGGGGATTGTAAATTAGTTCAACCATGTGGAAGACAG^ 

TTACTGCATATATACCCAAAGGAATATGAGTCATTCTACCGTAAGACACATGCACGCAT^ 

G C AAAAACATGGAATCAACCTACATG CC CAT CAATGAT AG ACTGGAT AAAGAAAATGTGGT ACAT AT AC C CCATGG AATACTATG C 
AG CCATAAGAAAAAGAATGAGATCATGCCCTTTTGCAGGAAC^ 

7 0 ACAGAAAACCAAGTGCAGATGTTCTCACTTTTAAGTGTGA 

GTGGAGGGTGGGAGAAGGAAGAGTATCAGAAAAAATAACAGGTAGTAGATTTAATACCXSGATG^ 
CCCCATGACATGAGTTTACCTATATAACAAACCCGCACATCTACT 
TAATATATTACAGTATTCTTCATATTCAATCGCAATGTGTGAAG 
TTTGATGCCCAAGACTGCACAAGGCTACATTTTCTACT 
75 CCTTAATCTTCAGGGTACTGGTAAACaT^aAGACCTCCCCAGT^ 
ATCTTAGACTGCAGGGATATATTAGGAAGTTAA

AGTATCTCAGTGCTTTGTTGTTTAGTCAGAGTTTTGTACTCTATCACCC^ 
CTTTAAAACATCTTTGCITTTTCTCCCATTATCGAGATGCrrA^ 
AAGGGTTGGAGCTCAATGAAGACAACAAGAACAGCAAAGGTTATTGTAAAACTGGCT 
5 GAGGCAATCAGTTTATGCTACCTCTGTCTGTTTGATGGGATTCATAATATTGACTTTAT 
AAAATAAGCAGATGGAGAGTAAGGATTTGCTAGGAAATAATTCAGCCAGTCACTTTGA^ 
GTCTCTCAAACTATGTTTGCCCATTATCCCAATAATTTATTTCCCAATA^ 
AATCTGGAAAAO^CTGGTTTAAGCAAAGTTCAGTAGGTCTGCrr 
ACTCATCCAACAAGTTTAATTTAACAGCTACACTCT^ 
1 0 GCGTGTGTGTGTGTGTGTGTACACATGTGTGCACATGTGCAGAATCTACCAA^ 
CTGTGAAAGAXiAATAGAAACCTGAAGATTTGAGGCAGTGATAGC^ 

AG CTCTTTTGTTGGTTGGGGAAAACAGGAATTTTT CCC CC^ CATTTTCT AATTTT CT ATGAACACTTC CT AAG A 
AAAAG CTG AATGAAGAACATTTG CGATG CAAT CAG CTCATT AAGAAACACG CACTTTTG TGGAG ATACGTG CTGTCC CAGG AGATG 
CTCTGCGAGGAGCCGAGTGTTTGGACTGGAGCTGCTGAATGGTT^ 
1 5 TTGAACCCATCAGTAATTGCTCCAAACCACTTTATGGGATATAATGCTGTGAGTTGACAC 
AGTAATTACTTTT CTG TTGCCTATAGAAGGGCCAGCAATAGCAGATG ACTA 
TTTAAAAAAAAGTAATGCTTTCTGTTAAACTCTGACTATACTCTCTCCT 

GCAGTTACAT ATGGGGCTGATGACTTT AGGGATTTC CATG CAAT AATTCC CAAAT CTTT CTCTCGT AAGTATATG CCGTG CTTCTG 
GAAAACAAAAGCATGCCTTCATCTCCTATCATGTAAAra^ 
2 0 CACTGTTCTATTCGTTAGACACCTACCATATCATTTTTGGGTATTTATACCATAAAG 
GTGTCCTGGGAGCAT<^CAGTG<3AGTAACJVGTAAG 
GACAGCTTTGAGATTACTCCTGTAATCCTTTTTAT^ 
CATGGTTTCAAGATAATTTTTACAATTTTGTGGGACA 
AACTGAGAAGAGAAAATTGCAGGCCTCCAGGCCGTTATTCTGTGGTC 

2 5 CCGCAGGACCTTCTCTGATCTGAGCTGTTAAACCAAAAACAGATGTGAATTTC^ 

AT AAATATATTTGGGT AACATCCTCAAATGAGT ACTTTAAAGAACAG AGAGTG CTGAAT ATTT AA 
AG C CAT ACZAAGTTGATAATCCCCTT AGGACXTTGT ATT AGT AAATCCT CT AATACT ATA^ 
GTTAACACTGTGACAATTATAATTATAGGCTCTAATTTGAACAG^ 
CAAAAACTATCTGGTGTCTAAC^AATTTAACAGTATCAAAGTAC^ 

3 0 TAGAAGTTACAGAAATCriTClT'lTACTGATATTCC'lTTGATTCCACi rrAAAGCAATAGCTTGATACCTAcrrrii^GAGATGTAT 

ATGTATGTAC^TGTAAATGCTTATATATGTGTATTTGTTTACACGTGAGCTTATATGA^ 
AGAAATGATATTGATTTGAAGAATCATATTGTTGAAGTATTTTG<K3 

CATGTTTCTGTGAAGTCTGAC^AAC^TGTCAGCCAGAGCATATGTTGAAGCAGACCCTTCAGTGAA 
GCTGCATGTGAAGATGTGTTAATGTTATGATAGTTACAGTTTTTATAAAAGAGATGATATACACT 

3 5 TTTATCCAAATGTGTCCATAAGGTATTGGTGTCn'CrCTTTCT 

AAGCGGCTACTTGGTCTTGATGCCTTCCCCCGCAAAACAGCA^ 
AAGAATTGCCCCCTTTGCCCTCAACACCTCITTTCTTCTTGAAAAT^ 
AGTTTGTGTCCGTAGTTGCTGGAGGAAAGAAAATGCCrATTGCTCTTTTTTTTA^ 
TTTCTCTCCCTGACTTCATTCCATTTAAAGGCCTT 

4 0 GAAAACIAACCTT CT AC ATTTT AAATGAT ATT AAGTT AT AAT AAATGGTT C CATG AACTTTAAAACGCTT AAAAG TTTT AT AAATCT 

C IT IT AGAGGCAACACAAAAATATCTATATATATATCTTCTCTTCACCTCAAATTT^ 
TTAAATATTTCCATTTCATTTGATTTTTACTTGCTAGTCACC^ 
GAGCATTTCAAGCTTACATC^CCTAAACTCACACTAGA 
CAAACATTTACTTTTT(^GTAAATATGAAGAC!ACATTCTAAAAAC^ 

4 5 GGGACTAACAGTGTACATTATACCTATGCATTAAAAAATGTTTAATTATCCAT^ 

TCTTGGCTCCTCTTGGCATCTGAGTTCCTTGGA^ 
ACAGGGGTGTGCGGTCTCTCTTAGTACTGTTGGGTGCCAATAAC^ 
CCAACTTTAAATAGCATCTGGGAAGGAAAGCAGCTTTTGACTAGGAGTTTG 
TTGAGTAATAAAATTGGGCTCTTGTCCATTATCTCTCAGTCAATGAAAGA^ 

5 0 AGAGAGGACCAGTCTCCATTTCATGTTCfATATTTAA 

ggtgggg^gttacaaaatatttataactatctttcctatcttcag 
gttccagctccgagtaac^gag<3aag<sagtgag(^atttaggaact 
gttcgatagagacaggatccatggggttgaaccaagcacaatgcctgcaaat^ 
aggtatgatcagcaccatttaaggagtggcagttgttaagaaac^ 

5 5 gagccttgcatatcccttgggatgggttctogggtattatggagctgccaggaggtgctc^ 

tcccaacctcctccttcattagtggaagtatcagctggtgtcattgc 
aaactcacattctcgcagagcctccgtcacaactgcatg 
ggtaaaggaaaac<;tctgtgggacaaaggccctgaactcctcactccttgaa 
acaggtggcaggaacctgccatcagc^ttcccatcagcctccagggcttgc^ 

gaaacccctctctcagaagaaccacttcttgccct

TGAGGAAGTGTGCATCCTGTGCCACCAACCTTGCTCGCAC^AATTAGTTTCA^ 

G CCTATGCTGC C CTTTTGAAAGG AGG CGTG CXIH' ACAGC CTGG CTMT ACAG AAG ATTCCAG ATTTCAG TGACT ACAGGACG CTGTC 
ACCCCTTCTGTT1TTCTTTTTTCTGTCCACATCTCTGA 

TGGGAAGCCAAGACACTCTCATCCTGGGCITGTGTTTTTAAAATCTATATTTTA^ 

6 5 ATTTCAGATTTTGAG AT AC AATCTG T AGTTG CAACTTATG AAGAT AGTTT CAAG C C CTG ATTT CTGTCATCTATGAAAACAGT AAG 

CAAGTTGCTTGTATTCTCCTCCTAGCTGGATAAGAGACCCCCGTTT^ 
TTGTCTATGTAACCAGCACCCACTTTG(^TTTTTCCGCAAAGA 
TGAATCrTACCCGCCCCGTGGTCACGGTGTTGCCCACTCTGTTC 
CTTGTCTTAGACITCACCAACCTTTCCCGTCATAACATG^ 

7 0 CTCAGCATCTG C CATCTAACTTCCATTTAAAACGTTTTGAGCATTTG CTGAT ACAGGTT CCTCTCTC CTTTCCC CATT ATTTT CAG 

TTCCTTGGCCTTGCCCATAGTGATCTGTGGATATTGTCTGCATACTCITTTTTA 
CCTCTTCAAACTCTATATATTCCTGAGCTGATCATCCCTACTGCT 
TACATCTAGCCATCTAGGTGGAGATCTAGGTACCTATCTCCCTAA 
ATAGGCTCTTGTGCCTTTTCOACTAGGAGTTTGAAAACATCCACT 
7 5 CTCTTGTCCATTATGTCTCAGTCAGTGAAAGAGCCCAGGGIXK^ 
GTTACAGCTCTCAGTGACAGTGTCATTGAGGCTAACAAGG^
TCCAGGCCTCTGGCTTTGAGAACATCTGTGATATCTAAGGC^ 
CACATAGCTTTGCCTCCTCCTGCAAGCAGCCTGCTGTAGCAGAACCGGTGTTCCrGA 
CirCCCTGCTGTTCTGCTCCCCACCTGCAAGCGCCCACACAC^ 
5 ATACTCATTCTTAATGATCCAGAAGAACAAGTGATAGOTCTGGCAAGTTCT 
CAATTCTCTCTAGCTGGTTGCCTCTGAATrTATGAAGTCAAGCCGC 
TGCGTGGTGTCCTTTTTCTAGCCTCGGAAGCACTAGGCCTACACCTTCGCTA 

TGC CACACCAAT CTT AATTTG CTCCATTGTGTT CTAAT CT CATTTT AAAAAC CACAAGG TAAATGATT AAAAATAAT CTT AGATT A 

1 0 GCTTTCTCCCAGAAAGATGCACAGCCTCTACCTGAGAGTACCAATTCCAGAGAACT 

GCCAGCCAGCCTCACTTGCCTGCCTATCTCCTTTATTTTCCAAGTATC^ 

TCAGATGCTCTCGGTCTCTGATCTTCAGGATCTGAAGGGGAGAGCATT^ 

GCATAATGCCAGGATCTGATGGAGGAGACAGGCZAGCTCTGTTAATCCTCTGGTGCATCCT 

TGTCCCAGTTAATTCATTTCATTATTCATCTGTAAGTCATTTCCCTAAA 
1 5 CCATATCCATGGGTTCAGCCTCTGGAAATTCAACCCACTGTGGCITGAA 

GAAGGCAGATTTTTCGTATCCATAGGTTCTATAGGGCCATTTTAGGGACTTC 

TGCACTGCAGATACCAAGAGATGACTGTAAGTGGTTAGGAATTTG^ 

CCACTTGCCAGCTATGTGACCTTAGCAAGTTGTTCAACCTCTCTGTGCCT 

CTTCCCTTATGGAGTCCTTGTGAAGGTTAAATGGCAGAGTACAATTAATGTGCT 

TAGGCTGGTGCTATAATAATTTCCATTTTCATGGATGAGGAAATCAAGGCA 
CTTTTAAATAGTGCAGCTAAGATTCCAGACCAGGTATTTTTATTTCAGTG 
TGAATAAAATAAGCTTGGAGTGCTGTTAACTAAGTTGGTTATTTAAGACAGTTGTTCT 
GGAGGGTATGTTAAAACAGACTGCTGGGCCCACCGAGAGTTCCTGAGCCTGCAA 

2 5 GTAAGTTCTCAGGTGATGCTTATGCTGCTTCTCCCAAATTTTGAGAACCATTGATTGAAAATC 

GGCTGAGCACAGTGGCTCACAACTGTAATCTCAACACTTTGGGAGGCCAAG 
GCCTGGTCAACATGACAAGACCCATCTCTACAAAAAACACCAAAAAAAAATGGCT 
TACTGGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCGGGAGACAGAGCAAGACCCTGTCTC 
TGTCATCTATCTCATTGATCATTTTACTTGGCAAAATTTACCT 

3 0 CTTTCAGTGTAACTACAAATTCTCTTAGTCATTATAGTGTTGGATTCACC 

NNNNNIWNNNNNNNNNNNNHNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CTCTTCTTTAGAACACAGGGTTTGTTTTTATGTTATTTAA^ 

3 5 CTTCCGGAAATCTGAAAGGGAACTGTGGTC^GAACTCTGAGCATTTTTATGTTTACTGAGTT^ 

ACATGCCACAAGGAAAGGTAGCATCACAATAAGAGACGTTTTTCAGGCI^GATAA 

ACACATCCTAGAGAGCTGGATTGTGTGACCCAGAACCCACCCTCTAGGGCAAGGTGCCCATCTGATGGGTAGGCT 

CCTCAGACCACTCCTGACGTGAACCTGCTTAAAGTGAGGGCCCAATTCTAAAGTGGGAACTATGTAA 

TCAGATAATCGCCACTGGGCCTATGGATGGAGAGGGCTGGCAGATCTCCCTGTAACCCCAGGTGCATCCCGAGGCCT 

4 0 GCCCACTCAAGGCTGAATGCACGGCGAGCTCAGGCTGCTCTCCCCTTGGTATTTGCT 

CTATTTGATTGTCTTTTTGCTGCTGTGTTGTTTTGTTGAG'r 
CAGCCAGTAATGTTAAAGTAGAGACTCAGAGTGATGAAGAGAATGGGC^ 

TT ACGAATGCTTGATGCCTCGGGAGAG AAAATG AATGGCTCC CACAGGG ACCAAGGCAG CTCGG CTTTGT CGGG AGTTGG AGGCAT 
TCGACTTCCTAACGGAAAACTAAAGTGTGATATCTGTGGGATCATTTGCATCGGGCCCAATGTGCTCA 

4 5 ACACTGGTAAGGCCTGGCTCAGT1"1 rrCCTTTAGTGGCCTGGAGAAGGTGCATGGGGTTTGAAGGAGGAAAGCATCCrGTCTTCCT 

TGTGTT CTG AG CATGTTTCT AATTGACTGGTAG CTCAGTTGTTGCAAG CG ATT GG TTCCAAGTGGT AC CG AGT CAT AG AGTCCTTG 
TTCTGGTACAGCCTTGTAAAGGACTTCTCAACACGTACCAAT^ 
GAGAGGCTTGGCGAGGGCTGCTATTATAGTAACACATACTAAGTAGTCTCAGCTGA 
CCTCCACAAAACAAAAAATCCTGATACACCAAAACTTACTCCT 

5 0 TTGTATCCAAACTAACAGGGTTTGATGTTTAGAAACAATAATCAATGAT^ 

G ATGAGCTTTCAAATGGCTTG C CCT CT AACAGG AAAG ACTTTG AAT CAG ATGCTT CT ATTG C C ACTGGTT ATCAG CTC AAATTCCT 
AAAGAACTTCATCCCAAATACCCTGTCTTGCTGAAAGGTTTACTGGA^ 
GAACACGGGCACCCTAGTGTCAGCGAGTTGTGCTCAGGCTCACAGGATGCCCC 
GCTGGCCGCGCTCIGCCACTGTTTCTAATAGCCGGTCACGTTGATGGAACT 

5 5 AAACCACCACTCATGGTGCTGGAGTGTCTTAGAAGTAAAATATGAATAACACACACTTATT^ 

G AATTGAAG AGAAACTTTT CTTTG CGTGGGAAG CTGTTCTAAAGTTGG GT AAAACACAAT AG AT C CACC ACCTCT AG CAG CCACT G 
ATAGCTGAAACGTGAAACATAGAGACCCTAAGCTATCACTXSCCr 
ACTGCAAAACCCAAGAGGAAAAAAATAGTTGAAAATCCTATTTTAAAGGCCTAGCA 
ACGATGACTTGAAATCAGACCCTCGTATGCTGCTTCOSTGGGGGCGAACAAAAATATGT^ 

AAATTTTGTGAGCCAAAAAAGCTGTGTTATCAGGAAATG

TCAAAATGATCH'ATTCCATTTGTAGTGTATCTGAAAACCTATAAAAATAAGTTG 
CAACTTCTAAAATTAAGCAAACTTTGCTTTTTCTAATGGCC 

CCCTAAGTGAAGCAAGCGTGCATGAGAGTAGTCCCAGCCTCGCCrTTGTAATGAGCT 

TGATAGACACTT AACAGG ATACTCGGGGACC CATGGTAAT AC ATCCCTGAT AAGGAATAGACCTCACAAATGAACT ACTTGC CTGT 
6 5 TAATTCATTTAAAGCCTGACTGTACAGTGAAAATTCTCTTAAATAAATAT^ 
CAAAATATAATGACTGCCTGCCTTGATAAAAAGAATAATTACATT^ 

CTG CTGTTTGCTGCTCACTTCTCCCAACIAAATAGCAGCTAGGTGCCACCTG CAC^CCAATAAGCTGAGTTITTCACTTACGGAACA 

AATAACTTTCAGAAGTCAATTTTATAGTTTCTGCCTTGCCCTTTGT^ 

GTTTTCATCATGCATTTCCACTTGCCTTAAATATAAAGGGTCCCAGGTAGATACAGAAA 
70 T AACTAG ACATG CTAGAAAAGGTTTT CATTTTCCTGGGAT CTGAGTGGAAT ATGTT AGAAAAGG CATG CTTTCTGAATTCTCT ATG 

CTTAAAACATTTCTAGAGCAGTGCTTCTCAAACTTGAAAGAGCATC 

GACCTGGGGGGAGGCAGGAGAGCAAGCATTCCTAACAAGCTCCCAGCTTGTGGAGCACA 

AGCTGAGC^GTGACAJGT<?AACAGGGCCATTCAGAG^ 

GGACTTGCACTGTGGCCATTGTTCCTCCTCCTGCCCATGGCTGA 
75 GGGGCGGTCTCGCCTCTG CCCATATCCCCACGTTGGTGAAGTAC CACTGGCGCCATTrTCrAATCAGCTCATCGGCACCAGCACGT 
CACTTCCCCTGTTGTGCAGCTCAGTTTTATATTTCTGACT

TGAG AAC CAGAG AGG AGAGAGGT AGG AAGGAAGCAGGAAATGCAGGG TG CAAAT AC AC CACATT ATT ATT CT AAAT AAT GGG ATTT 
TTAATAACAAAGACCAAGAAGATTGTCTGTGCCTATCrAGTTCCCATCTGTAAGATAAAAAGG 
ACTVGCACCCTAATGGCACrcCTCCTCCCAGGTATGATTC 
5 TCTTCTGAAATGCCCTCAAATTCTATGACATAGTATCCATGAAGACTTCCCT^ 
CCTTCTTACAGAGGGTTCCTAAAATGGGAGATCTGTGGAGTCATTC 
TCATCCCAGTTAAATAAAAGTAAATACAACCTTGAAGAAATTTGACCT^ 
GATGGACCCCGGCCACGCAAGAA(^CACATXK5CGTGGGCTGCGTTCT 
CCCCTCCAAAGGAAACTGCTTGAGTGGAGTGCTCTC 
1 0 GTGTTCATACCAGCATAGGCACTCAGAAGCAAAAAAGGTTTC 

CAGGATC^AAGTGAACCCCTGCCCGCCrGCCTGCCTGCC CTGCTCTGCTCTGCAGAAGGG CATGTGTCTGTAG CAATCTGCCCTTT 
CTGTCTGCGATGGGCAGAGACCCTGGCTCTGCCTCCTGTT^ 

CCTG CCCAGCACGGAAGGGGGTCTTTC C CAGGTGT AAG TGT AGGG CTGCATTCACT AATTT AGGG TCAT ACCAC^^ 

GTGACCAGGATGTTGTCACATGCTTATTGTGAGCATTTTTGTGACATG 
1 5 TGTAGTTGGAGGCACATCCX3TCTCATGCTGCCTGGGTCTCCCTC 

AATGCACACATTTATGCACAAGTGCACAGAAGGCCCAGAACT 

CCTGGGCTACCTGGCTGCTAGCTCTTT^AGCCT 

AGTTAGTTTGTTTCACCATCACCTTCCAACCTGCTTACGTT^ 

TTGGAGAAATACCAGATACCTCCTGGTTAGTATCAGGACCTCTCCATTC^ 
2 0 TTTATAGTAATGCATCTAACCCGCCAGCTTAAAAAAAAAGTGATGATAAT 

GGTGTGGATGTTAATAGTGCCAGATATGCTTCTGTGTAATCATTTTTGAGA 

TCGGGAAATGGTTAAATAGACATTTTAATACA1TGTTTTTTCCT 

CAACACCAGGAATTGTGGTCACATAGCATTCrrCTTCT 

ACAAATGTATCACrCTTCGAAGTGAGAGTTAAATGAAATGTTCCCTG 

2 5 AGCATATTTCTTTTACTCAGATACCTGTTTATCCCTTTGTCAGCT 

CAGGTGTTTTCTTAGTCACAAATGCAGCAGTCTCACCCCTGJ^ 

GTACTTAAAATATAATTTGTCTTCTATGGATATAGACAAATAATAGCTTAAATGTTTT^ 

AATCTCAATGCAACTCTGACITGTATGTCCTGCCATAGGA^ 

TCITAACTTCTCIATTTTATTCAATCTTTCTCCATCTCT 

3 0 TTGTCTGCITAGGGACAGACAGGTAATGGAGAAGAAATTGAAAGGGAAAATACAGGGAA 

CTGTTATAATGAAGTGCCrTTCATTTAAATATAAGAATGTAACG CTGAAATGAACTTGTGATCCTGAAATGCATTTTTAATGAGTT 

t ccttttt attttgctgctt cag tgg cat atttt aaag accctttg aaaaaag c cacatt aaaaaac c ccaccaaatg ccacaaca 

cgcaaaattg^atattggtttattcgaaaactgc^^ 

atgagtttgaatttgcattgtgatgtgccattctgatttag^ 

3 5 gosttaaggagctggcaggtttagtctgaaagcctcatgt^ 

acaactttctcgtagcatcgtcctcatgtccccacgctgagtt^ 

cttccagtgcaatcagtg cggggcctcattcacccagaaggg caacctg (ttccggcacatcragctgcattccgggg ag aagccct 

tcaaatgccacctctgcaactacgcctgccgccggagggacgccctcacto 

tgcagtccc^gggctgtctgggtgtcccgggattcctccactctgcccgcctgggtcccggattgtgtcctt 

cccactgcaccagggagattgggcggagga 

gcctcctcatcccgtttgtcaggctc<;tatggtttccctaaaggattttgcgggaaat 

aaaacgttttcaagtacaattctttcrrgttgttaagtccc^ 

gctggtagagaagggtgatttac^gcgcacccacccagactgagccccgtgtggctctc^ 

4 5 tagagagg ccagtctggggctgtt ac cagatttt ag acag cagc ctttct ctttg aatt ag acagtt aaag t acaacc cac at aat 

ctggagtctt'gacaagatcatcataggcataaacgctctatcattct 

tttggttagatccrctgcctcctgagaaatggtcctgggtgtttcattatccagcagtcccataat^ 

agggctcttggccggcctgtcatggatcatgtttgcctacagtgtggtctatagaa 

tccagttggggatatttgctgtagcatactgcatgagacttcggaggcgaa^ 

5 0 tcctgccccagccagccaccagccagggcccttctccaagcagcagcctctctaa^ 

accagctactcctcacccaccctgggtagcaggataaggaggagcctccctcagggaggcagacg 
gcagcggcccac^ccatctcttcccaaatgtgatgcgtgtatttgat^ 
ttatcacacatttactgagcacccctgggtgcccattctgcacaga^ 
ggaggcatgacagagtgtctgaggggacagtgggagcataaggaaggggac^ 

5 5 CCATAGAAGGGAGGGGCTTCTGAAGTAC^TGCTGGAAGGCGAGT^^ 

TCGGCAGTAATCCCAGAGAGGGGTGCGGCACCCATGTTGGCAAGAACTCGGTCTCACrCCT 
CCCAGTTGGCTC^AGACAAAAAGGAGAGGAGTCACAGGGGCTGGTGCATCACCTCCT 
ACATGGAGAAGGATCCCCACATGCTCGCCATCAGAAAATTGTCATGATTTGGGTGATTTGAC^ 
GCTATACCAAGCTGAGGAGTTGCTTGCCCGAGAGGGCGGGCAGATCAGCAGGGGGG 

6 0 AGGGACAGGGAGAACAGGTGGTTCCCCTGAGATAGGAAGACAGTAGCATG 

GCATGTATGGTGGTCCCACCTGGAGACAAAGGTG^TACCTG^ 
GCAGTCTGTCTGCCATGCTAACCAGTGGTCTAGGAAGAAGCTTCACATCT 
TGGCAGCAATATTGCCTTGGGTTAGTGGGAAGCACCACTTCCCATGTCAGTGGAAA^ 
ACCATAGGTCATCATCACTCCTAGATTACTCTGATTGCAATGGAATCT 

GATGGTrTCCTATTCTAAGTACCACTAAGTAlTGAGTAGTT 
CACGGATGAGCAAGTGGAGAAAACCCATCATGGATGCAGGGCTGCTGGG^ 
GCTCTTGGGGAAAAAAACAGTGATGATTTCTTCAAAATGATTT 
AAATGTGGCCAGACTCTACTCCATTGTGGAATGGTGACTCCCCTGTCCGATGA 

7 0 AACTGGTGGTGGCCTGAGGTCrTATGAACrAGCATGTTAGCAGGGATA^ 

GCTGGGGGAGCTGGAGGGGTGGAGAAGTGATCTCCCIT'CCTGCCTCATGGC^ 

CAAAG AAT ATTGG ACTATG CTTCAAGAG ACACTTGAGTT C AGTTG CTT ACAGTGAACAT ACTTCTT ATGTTC CAG AGTT AAACT CA 
ATCATATTTTCAGAAAAATAGATATTCAGACATCATGTCATATTTACAGGAAGCT 

TTTTTCAGG AAAGGGT AT CG AGTGGTTATCT AT ATCCT ATT AATTTCTAAAAG CAATG CTCAATAAAATGTG CIAATTTTAGG CAAT 
7 5 ATTCTGTGTTTCAACATT ATTTCCTT AT GTTGGGGGT aCaT CACTTT AT AT AG AT AAAGAJ^GAT CCTTTT C ATT CTCT ATT AAAT 
GGTCGAGAAGTCAAATTTTGGCTTTCATGTTGGTCTTCTTACTGTAGCTTCT
TTTGGCTAAACCATTGTACCAAAC^TTTCTATTTGGATGCTCTTAACTTTTA 

TG AGAAAGAAGTAGAAG ATTCTGCT ATT AGGTTGGAAATTG ATATT CCTT ACAT AAGTG ATTAAAGAT ACTTTATTCAGTTTATC C 
CTTAAAATGG T AT CT ACTG AAGTCCTT AAACTT AGG CATGAT ATT AATTCCTGG CT AGT CTTTCTTTTTC CT AAAATATCACTCAA 
5 TAAAATCCAAGAAGGACTAAAACTC^TTATATAGAACTGCACT 

TAATAAGTTAAAATTAAACAAAATTAAAACTTCAGCCCTCAGTGG 
GTGGCTGCCATATTGGACAGCACTAATAGAGAGCATTTCTATCATCATAGGAAGTTCTC 

C CATC CT AAAG CTCTTTGG CAAGAAAG CAGAG CTTTTC CAT AAG CTCGG TTT ATG AACAAT AT ATTTTGT ATTTTCTATG T AT ATG 

TATATCAGAATACGTATACATAGAAATATATGAAAATAC^TATACATATAAATATACATATACATAA^ 
1 0 AGAAAATACAAAATATTTTGTATTACATATACATATGCATGGAAATACIAAA^ 

TTTAT ATT ATGT AAGAAAAT ACATTTGTGCAAT AAAACGCTTATTT CATTCAAGCAT ATTTG TCCAG C ATGGTTT CTTAGAGACTG 

AAGTTACCCAGTTTAGCACCrGGAACTGAArrCACTCTGGCTGTC 

ATAGACTGAGGCCTACTCTCGAGGCAAATCTGCAGACAATGTG^ 

ATCCACGTTATAACACAGCAGTGGCTCGTTTTCTC 
1 5 GT AAAGTAGACAAACTAAGACAAAACAAGAGTCG CATGAGTTCTTGGT ACAATT AAAG AAAC C CTTGG CCACCAACGTTTTTAAAT 

TCAGGAATTTCACCAAGTCCGTAACAGTTTTAGCTCT 

GCATTCCCCTTACAC^GAAGGCTGGCATTTAATTGGGGTCTTGAACTCAATTG 
GATATTGTGGCCGAAGCTATAAACAGCGAAGCTCrrTAGAGGAACATAAAGAGC 
CCGGGCACACTGTACCCAGGTAAGCGCTGCTGCTCXK3AGGCCAGCCT 
2 0 TCTGCATG CAGCCTTAGGAGC7VG AGCCTTGGGCCTGCTTCCTG CCGGGGCT 

TTCACCAGAAGCACGTTGTGCTTCCCAGCTTTCT 

ATCGGGAGAAACrCCrAGGATAGCAGTGACCTTGAAAGTTTTGGGGTTGTT^ 

AAAACACCTCAAGTAGTGATATTTCTTTGTAAACAAAATAAAATGTAAAATAT^ 

AATAAGAACTTTCIAGGCrcTGATTTTAAGC^TCAGTTCAGA 

2 5 TTGTTGGAAGCCCCTGGTAGTTGATAATATTGAGGTTGTTGAAGGATGAGAGG 

C CACTGT ATTG CAT AACG AGAT AGCACTTG ATTCAGACCC CAG AGC CTG T AGAAT AAATG ATGTGGAGAG AGT ATTGAAAAAGGCT 
GGATTT AT CTGGGGGAATGG CAATGTT AT AAAAGGTTCTT CCCT AATTTTTT AAGAGATAAG CAATTT AT AAAG AGATCAAAT AAA 
ACTAATCTCTTTCAAAACAGTATTCATTTAGCACTACCCCTGGAAAAATCATCACTAG^ 
GAAAATTTATTAAGCCTCATTCACCTTTCX^CTTTAAC^ 

3 0 TG<3CAG ATG AGAG C AGG AAAAAAGAATG TTG AG AATGG CTAAAAATTG ATT CCAATT AGTCTTGGCATGG AG AGG CAACAT CT CAC 

CTWCCAGAGCCCTGCAGCCAGAGTGCTGTCTCCTCTCCCAGTCAGCCAGGGATCT 

ATCTT CT CACCTGTGAAATGGG AAGATACCATTAAATT GT CATTTGG LT'l ' CT'TT CAAAATCT CAGGTCT AG AATGGAAAGGCATTG 

AAGGTGAGTGGAAGAGAAGAAACTGGATGTTAAAATAATAATAATGTAACAGTTATTAATTCAC^ 

GACTAATGGACGAGAGGATGATTATGTTCCTTTGAAATAAAATGTCAACATTCAAATATTGUTTT1T 

3 5 TGGGTCTTGGAAGGAAGTGCTCCTAATGATGTAGTTTACATATTAATGCTTAATAAAC^ 

GAGGACTTATAATCAATAATACCAAGCCCTATTGAATACAGTAATCT 
AAACAGACAGTATATCATTTCrcTTTCCTCCACTGTTTTCATTACTAAA 

TCCGG GGG AGCAGG AGAAGCAGCACT CATTT CACTGAG ACCTGGGTTTG AGTTCTCTGTTTTTG AATGTGAC CCAG AG CT CAT CGC 
TTAAATGCGACTGATAATTCGTGCCTGGCCACCCrCATAGCAGCGCTCTGAAAGAAACATAAAGGAGTG^ 

4 0 GAGCCTAGACATGTGTCAGGTACTTACCTAAACATGGATGTTTAGGTCACTGACATACTGA 

TGGTCTCTATGTCAGTAAGGCTACGTGCCAGTAATGTGTATTAAA^ 

ATTT ATTC C CTTTGGTGGTTAAGTT ACTTG G ACT AC CACAGTG ATGCCTCT ATG TCCCCACACTCAGG AGGTATGAAAG CGTCTCT 

AGGAAGGAGCAGTGGTCTGGTCTCAGTGTGGCAGCAGACAAAGCCAGGCTCGCACT 

GGCCTCAAAAGGTGCAAAATGATCATAAATATCTTTCGACAGTCACCTTTG^ 

4 5 GCTGCTGGTTTAAGGAACTTGGAGTATTAAGGGGATCCATAAAAGGACT 

AAGGCAGGCAGGGGCTTGGTCATTTCAGCTGTGCTGTACCCGGATGAGTGGAAACTGTGC 
TCCCCTCTAAGAACCCCTTGTCTGTGCTCAGATGGATGAGCAAGTAGTCTCTTCT 

ATTCAGTAAAAT CAAAGCCT CAGTTAGG CAATAT AGG AAAAGATGAATTTTTGTG AATATAAlT"lTCTnTAGTAAAAGGCTTTTGG 
TCACCTCTTATAATTTCTCAATGTCTTTATGGAAAAAATAAGTTTCT 

5 0 GAGGTGCATGCATCTAATGACAGAGCAGCCTGTGACTCGCGAGCATC^^ 

GG AGG CTTTT GGGAGG CCCTGCG CCATGGTX3CAG AGGAAGACCCCrGACCTCTGGGAGCCCTG CTGTGGGTGTGGCTTGG 
CAGAGAAGGATGGACATGTCAGGAAGCATATTAGCTTAATAGATGAA^ 

TATCTTGCATATAAATCAC^GTGTGGCCTATATATGTTTGCCCATTTAATAAAAAACACATATTCCAAATG^ 
TACCAGCAACAATAGCCAATTTGAAATATATGTTTTCAAAGTAACATTCATAATAG 

CCCAAATTAGTGTATCATTTCAACATAAAACTGGCTTGTAAATTTTACTGG 
TATACATGAGATAATTGGCAACTTTTTAAAAGAAGAATAATGTCAAGTGATGTATC 
GAAGAATAGACAAACAAATCAATAATAGAGATGGATACCCTACTGAAGACCCC^ 
GGAAGCATAAGGCAGGAACGGTATTGGAAGAATTGATCAGGCATTTAA^ 

6 0 ATAAATTGCCGGGAAATGAAAGAGTTATATTTTTATAACTCCAACCACAATAACrGATC 

OVAAGTTGTATCTTCAATTATGGAAAGAGATGACATAGAACAATTGAGAATGACT 
ATAATTTCTAAATTTTCTACAGCAGTCAGGGGTGCTTTGATTT^ 
TTCCTCCACATTGCCCAGATCTGCTGTAGGCTCTTGGGGGCCACATTTTTGTATCT 
TGTTAGATCACACAGTTCTTTGCGTCTCAGAGGGCTGCTGTTAGGGACTC 

6 5 AAAAGGACTTGTCAAAGGGACATCAATACTTAATCCCAGTCCAGCCTCTC 

AGGATGATGTGTGGTGAG CAGCTGTTGGAGGAAGAAGCATTTGCAGGTGGCTOT CCGGCTG CCTCTGCCTGT 

CTGGAAGTGTTGCTGGGAAGATTAGAATTAATCT 

GGACGCGACTGAACCCTTTAAACATAAGCCTTTCTAAACTGGCCTCT 

7 0 GAGCTCTATGCCTCAGAAATTTCTTC^TAAGAGTTAAATGTTTGCT 

TAGAAATG AG TTGAGGGTGGAAGAAAGGGAAAAAAATCTT ATITTTTC^ CTT AACATTC CTT AAAT AT 
CTTAGAATTTTTTCCAAT AAGTATCTTAAAAAT AACAAAC CT C C CATCAGTTTTTCCTAG ATTTG ATTTTG CAG CATCTGGGG CCT 
GCCCTGTGATCTGCCTGTGGACATCGCTCTTAGGGGCGGCTGCAC^ 
GGGACACCACACTGCACCAC^.CTCaaGGCCTGGCCGGCTTCT 
7 5 CC CTTATG GGGGACT CTGAAGGGTT AATGGGAGGAATCAT ACAGTGACTGACC C CTGAGAAGTGT C CAGTGAAG ACAGGGCTT AGC 
T AGGATTGCTGTTTTG CCTAATG CTCTGCGCG ATT AAAAAAAAAAAAG AAGAAGAACAAGAC CATTCGT CTCT CT AGGAGCATTG C

CCAGAGTAGGTATTAGACACACCAACACCACCATCCAGCCAGACGCTGCAGGGACAG 

GGCTTGGGAACCAGCTCAGAGTGAATACAGAGCCACCGCCACTCACCAACT 

CCTGG TT CCATC CTTT ASTAT CTCACAGG<3TGATTGTG AAT ATC CCATG ACTCCAAGATTG AG C CTCGGTG 

5 TG AAGGTT AACTCTG TCCGGAAAG AGGACCAGT AAAAG CTTCATG AGGCTG AG ATGCACTTTGGAAGAGG AATAGAGTTT CAG CAC 
ATTCTAGGTGTTGGAG<3AATG<^GAATCTAGGCAGATGTTTAAAATCAATC 
GGGGCCTAAGGACATGACGGAGGAGOU3GGTGTGTTCCCAGCTTAACT 

T AGG CAGCT CTGCCACAGG ATG AATGGCTTCAG ATG C CAGGTGAGCG AGGGAC C CTTCATT CAGTCAG CAGGAAAG AAG CACTGGC 

ATA 'IT 'X 1 ' 1 1 ATG AG AACAAAGGCT AGG AT AGTAAAGACAGCAAG T AC CAAAAAATGACTGG AAAAGGG AG ACT 
1 0 CAGCAGGCATGGAAAGAAGGGCTTGTGAAGGGGAAGGGGTGGTC 

ACCATGAGTCACACTGATTCTAGAGTAGTGTGCCCTTGATGAAAAGGATAAC 

GACGTGTTGATTTTCAAGGACTTCTGGTGTTTGTGACACAT^ 

TGAGGCCAAGGGTCCCAGAG<X3CRCCTGTTGACCTGCAGGATGACATGAAGG^ 

CAGCCTCCCTCTTGCAGCTTCAAATAGGGCTCCATTTTGACCTT^ 
1 5 GCAAGGGAGAG AAGG AT/TCAGTG T AC AAAAAG AATG AAAG CCACTGG CTG AG C C AGTGGGG AGTTGT C CACACACACATGAG C CTT 

TGGACCATGAGAACGAGGGAGGCCTTGCCTTCCTG^ 

AGC^CTGGGCAC^GAGCCCACGGGCTGCCTCCTGAGCAGCCAGCATCTGCCT 

CTGGGGTATGGG(IAGAGATAAGGCAGCAAGTGTGTGCAAGG 

GTC^GGTGAGGAGCCrGGAGAATGCTTTGGTGAATGAA 

2 0 TTCATCTGTGAAACTCTACTGKSAGCCACTCCCCA^^ 

GTTAGTAGAAGAAATAAACAACACAGTGCTCTGATGAGGCAA^ 

TGGACGAGGGT AG CAGAAGAGGAG TTTGGAG CAAAAGACTC ACAAG CAG CTGCAT AATCTG TTGGTG CTTGGCAG TTCATTTG T AA 
AAATGATGCCTCTTCC 

HUMAN SEQUENCE - mRNA

GAATTCCGOCGTCGCGG ACGCATCCCAGTCTGGGC<K^ACGCTCGGCCGCGG CGAGG CGGGCAAGCCTGGCAGGG CAGAGGG AGCC 
CCGGCTCCGAC^TTGCTCTTCGCCCCCGACGATCAGTCTTGGCCCC^^ 
ATGCTGATGAGGGTCAAGACATGTCCCAAGTTTCAGGGAAGGAAAGCCCCCCT 
ATGCCGATCCCCC^JSGACCTCTCCACCACCTCG^ 

3 0 AGAGACTCAGAGTGATGAAGAGAATGGGCGTGCCTGTGAAATGAATG^ 

CGGGAGAGAAAATGAATGGCTCCC^CAGGGACCAAGGCAGCT^ 
CTAAAGTGTGATATCTGTGGGATCATTTGCATCGGGCCC^ 
CCAGTGCAATCAGTGCGGGGCCTCATTCACCCAGAAGGGCAAC 
AATGCCACCTCTCCAACTACGCCTGCCGCCGGAGGGACGCCCTCAC^ 

3 5 TGTGGATATTGTGGCCGAAGCTATAAACAGCGAAGCTCTTTAGAGGAACATAA 

C CTTCCGGGCACA CTGT AC C CAGTCATT AAAGAAG AAACT AAT CACAGTG AAATGGCAG AAG AC CTG TGCAAG AT AGG ATCAG AGA 
GATCTCTCGTGCTGGACAGACTAGCAAGTAACXTCXSCCAAACGTAAGAGCTCTATG 

TCCG ACACGCCCTACGACAG CAGCGCCAGCTACG AG AAGGAG AACGAAATG ATG AAGTCC C ACGTGATGGACCAAGCCATCAACAA 
CGCCATCAACTJ^CCTGGGGGCCGAGTCCCTGCGCCCGCTGGTC 
40 CGATGTACCAGCTGCACAAGCCGCTCGCGGAGGG CACCCCGCGCTC CAACCACTCGGCCCAGG ACAG CGCCGTGG AG AACCTGCTG 

CTGCTCTCCAAGGCCAAGTTGGTGCCCTCGGAGCGCGAGGCGTCCCCGAGCAACAGCTGT 

C AACGAGG AG CAG CG CAGCX^TCTCATCTACCTG ACCAACCACATCGCCCCGCACGCG CG CAACGGCTTGTCG CTCAAGG AGG AGC 
ACCGCGCCT ACGACCTG CTGCGCGCCG CCTCCGAGAACT CG CAGGACGCGCTC CG CGTGGTCAGCACCAG CGGGGAG CAGATG AAG 
GTGTACAAGTGCGAACACTGCCGGGTGCTCTTCCTGGATCACGTCATGTA 

4 5 TTTTGAGTGCAACATGTGCGGCTACC!AC!AGCCACG 

TGAGCTAAAGCCCTCCCGCGCCCCCACCCCAGACCCCGAGCCACCCCAGGAAAAGCACAAGGACTGCCGCCTT 

GCAGCATAGACTGGACTGGACCAGACAATGTTGTGTTTGGATTTGTAACTGTT 

TTGATT lX^CriT TGAAAAGATTTTTATT^ 

CAGACGCTGGCTGAGATTCCCTCACCTGTCGCTTCCT^ 

5 0 T AAAA CACGCCAC1AG CCTGGG AAGGAG CGTGCTCT AC CCTG TGCT AAG C ACGGGG TTC G C G CACCAGGTGTCTTTTTC C AGTCCCC 

AGAAGCAGAGAGCACAGCCCCTGCTGTGTGGGTCTGCAGGTGA 

CAGGGCCAACAACCTGTGCCCAGGCCAGCTTCGAGCTACATGCAT^ 

TauVGTCATATTCTGCGGTAGGAAAATGATTGGGTTGGGGAAAGTCGGTGTCT 

GGAGGAGACTGTCTTCCCGTGGGCATATCrGGG

GTCACTACAATTTAAACACCAGTCCCGAAATTTCGATCTTCTTTCTT^ 
GCrTTATTTCAAATCTCTTCCTTCCCTGGCTGGTTCCATCT 

ATTTTAATTATGTACATCIGTTTGTAGCCACAAGCCTGAATTTCTCAGTGTTGGTAAG 
TTCTCGTTTTAAAACCCATAAAGGAGTGATTTAGAACATCATTAATTTT 

6 0 TTGCTAACACACAGAGCTOVCCTGTTGAAACCCAAGCTTTCAAAC^ 

TCCACACATACAT AGGATGGCTGG CTCTG CACCTGTAGG AT ATTGGAATGCACAGGGCAATTG AGGGNCTGAGCC^ CTTCGGA 

GAGTAATGCCACCAGATCCCCTAGGAAAGAGGAGGCAAATGGCACTG 

CACTGAAGCCCXSAGGAAGGTGTGTGGAGATTCTAATCCC^ 

CATTACTCTCAACC^CCTAGGCAATGAAGAATATACCATTTCAAATATTTACAGTACT^ 

6 5 AATGAAGCAACAGAGAGGAAATTGTACATAAGTACCTCAGCATTTAATCCA 

GGGCTGACTACTTATTTGTTAGGCGGGAGCTCTCCTGTC 

CAGTAGCACCCCGAATTGACAACCCAAACTCTCCAGACATCACCA^ 

CTGACCCAAATGAATTC 

HUMAN SEQUENCE - CODING

ATGGATGCTGATGAGGGTCAAGACATGTCCCAAGTTTCAGGGAAGGAAAGCCCCCCTG 
GCCCATGCCGATCCCCGAGGACCTCTCCACCACCTCGGGAGGACAGCA 
AAGTAGAGACTCAGAGTGATGAAGAGAATGGGCGTGCCTGTGAAATGA^ 
GCCTCGGGAGAGAAAATGAATGGCTCCCAOVGGGACCAAGGCAG 

7 5 AAAACTAAAGTGTGATATCTGTGGGATCATTTGCATCGGGCCCAATC 
CCTTCCAGTGCAATCAGTGCGGGGCCTCATTCACCC^
TTCAAATGCCACCTCTCCAACTACXSCCTGCCGC^ 

CAAATGTGGAT ATTG TGG C CGAAGCT AT AAACAG CGAAG CT CTTT AG AGG AACAT AAAGAG CG CTG CCACAACT ACTTGGAAAGCA 
TGGGCCTTCCGGGCACACTGTACCC^GTCATTAAAGAAGAAACTAATCACRGTGAA^ 
5 GAGAGATCTCTCGTGCTGGACAGACTAGCAAGTAACGTCGCCAAACGTAAGAGCTCTATC 

CCTGTCCGACACGCCCTACGACAGCAGCGCCAGCTACGAGAAGGAGAACGAAATGATGAAGTCCCACGTGA 

ACAACGCCATCAACT AC CTGGGGGC CGAGTC CCTG CGCCCG CTGGTG CAG ACG CCCCCGGG CGGTTCCG AGGTGGTC CCGGTCATC 
AGCCCGATGTACCAGCTGCACAAGCCGCTOSCGGAGGGCftC^^ 

GCTGCTG CTCTCCAAGGCCAAGTTGGTGCCCTCGGAGCG CGAGGCGTCCCCGAGCAACAG CTGTCAAGACTCC ACGGACACCGAGA 
1 0 GCAACAACGAGGAGCAGCGCAGCGGTCTCATCTACCTGACCAACCACATCGCCCCGCACGCGCGCAACGG 

GAGCACCGCGCCTACGACCTGCTGCGCGCCGCCTCCGAGAACTCGCAGGACGCGCTCCGCGTGGTCAGCACCAGCG^ 

G AAGGTGT ACAAGTG CGAACACTGC CGGGTCCTCTTCCTGGATCACGTCATGT ACACCAT C CACATGGG CTG CC ACGG CTTC CGTG 

ATCCTTTTGAGTGCAACATGTGCGGCTACCACAGCCAGGACCGGTACGAGT^ 

CACATGAGCTAA 

Table 38

MOUSE NOMENCLATURE 
ICSGNM Tbx21 
Celera mCG13239 

HUMAN NOMENCLATURE 
HGNC TBX21 
Celera hCG27200 

MOUSE SEQUENCE - GENOMIC

TTGGCAGCAGGGACAAGCAGAAAAACTGGGGTTTTGT 

TGATATGTCTAGGGTAGCAGGTGTTCCGAGAGGCCCCAGAGCCCAGGCCAGTGCACACAGGC^ 
AGGAAATGGACTGGTATTTCAACTTCCAGCTATTGCTTCAGATGCT 
CCCACAGTGTCAGAATCTGGTGCTGGTGACAGGAAAAAAACAGGCGGGTATTAGGGGAGG 
1 5 CTAATAGGCTACCTCACCAGCCACCCCACCCCCAGCCATTTCTAGGTCCT 
GAGX3TTTCCACGGTGCCTTGTGATACACACTGGGTCGCTGAGAGGGGTC 
GTGTGTCAGAGTGAATCTCCATCTGGGGTGATAAGTGGGGTTAATCTCAAGG 

GCC CTGGGTGC CCTG CATGGTGG CAG CTG AG ACT CGGATGT C C C CTTT ACT AG AG ATGGTGC CTTG CCTGG CTTGT CTCCC CACT A 
TGGTTATCGGTTTCCAATCATTACCrrTATTTCTATTCCATTA 
2 0 TTTAGGAGACTCAGAAAAGAAATGGGATTGAGAGTGA^ 

CCATCAGGCATCTCAGATAGATACACAAACCAGACTCTCAGCCTCr^ 

TCGGG ACTCACCTCAGGCTCTGC CTTTT ACAGG CTG TGACTGTCAC CATGTT CAT AGG AAGAGGGGGGACTAGG AAAG CTCT AAGA 

ATAAGCTGGAGGGCCAAAGTAGACTCCAGCAGGACTCAGAGTTGCT^ 

GGGTGAGTTGGCTAGCTGTCTTTGCAGCAGTTA^ 

ACTAAAAAACAGAATTCCAAACAACTCATGAAGAGGGCCAGGCGAGCA 

CCCCCCCCC^CAC^CCTCTGGAAGCCCTCCTCAGCACCCCC^CCCCACCAAAGCCCACAGCCCTGGGGAAG 
CACCACGGACCCAGTGTCGTGCCGGCTGTGGCTTCTTCCTTCTGACCTATTTT'ITAATCAT^ 
AAACCCCGGCCCCCGGGCCCGTC^CTCCACCGCGTGTCAAAACCACC 
30 TCCCCTCAGGGCCTG AGAGCGAAGGCTAATT ACCCAG CTGCTGAGTGCATT CCCCCCACC CCACCCCCTGCT ACTCCACCCCCAGC 

CCCAACAGAGGGGGCCCTGGAGGCCTGCrcCTAGAAGAAGCTGGGGCCTCCTTTCCCT 

CTGTCTAAC CTCCATTCTTTTCTCTGTCCTGCCCCT AGTGGTG CTG AATGGCAGATGCAAGGACTGATATTCTAG ACTGGAAGGGA 

CTTTGAGGCATCATCTCCTTTAAGGAAAGTAAAAGTCGCAGCCTCCTGACCTGGA 

TGTAGGGGATGGTCTCTGCCAAGGGTCTCAATTACAGGACAGAAAACTGA^ 
GAGCTTGTGGAGGCCACAGGTCAGTGTTAGGCATCTTCTTCAACTTGG

CGTGCAAGTGTTAGOVTACATGTATATATGTCATCATATGGATGTGTGGTGC^ 

TGAACCACCATGTGGATGCTGGGACTGGACCTGGGTCCTTTGCAAGAGTAGTCAGTGCT 

4 0 CTCCCCGCCCACACACACITTTATTTTTGGAGATGGGGTCTCTCACT 

GTCACTGGGTCCCAGGAATCCTGC^GCCTCTTTCITCCCAGAGCrrGGGATAATAAGTGGATATGTTG 

GGAGAGTTAACTCTGGTCCTCATTTCTTGAAACGAGCCCTTTATTGACTGAGCCCATTTTTCAT^ 

AGTCITACCATGTAGCCCAGGCTGGCCTGGAACTCA^ 

ATGTG ACACTACACCCAC CTTATTTTCTCACTGACTCAGTCCTTATTCCCTG AGCAAGGAC^ CCT 
45 CATCCTCAGGTCTGGGGTTGTCCAGCTTGCTCCATACACCATTAGAGATTGATCACTCCT 
CCCTCAGACTATCAGCTGCTCCATATCAACTTACCCTGTCA^ 

CTCAAAAAGGAGGC C CTG CACCAGGT ATACAGGCATAC AAT AG AGTG CAG TTGG ACTGG CCTTGAGG CTCTG ACAG ACTTGG AG AT 

GCAGAGACAGACAGACAGACAGACAGACAGACAGACAGACAGA 

GATAACAGGAATACTCGAAAAGGCAGGGACAGAGAGAGACCCTATTATTCGACA 

5 0 GGCAAGGGGGTACCTGGCACAACCATGCCAAGGTGTGTCGATGAGTAACTAC^ 

TGGGAGAAGGGAGAAGAG CAAGG ACTTGGAGCAGGGGT AGAGG CAGGAGAGTGGACATGTAG ACAG ACGGACAGATGGG AG CAG AC 

TCATGGGGGCAGAGAAGTGGCCAGGAGAGGGAGCAGAGAAGGACAGGAAGAGGGAAGAAAGAATGCT 

GAGAGAACTGGGNNNNl'INNNNNNNNNNNNNNNATGT 

ACAGTATTTATTGTTGTTTTAGTTTTAATTAATTTTCAGAT 

5 5 CAGGCTAGCCTTTAACTTGATCAGGAGTTTGAGATGACCrTGACT 

AG ACAAGTCCCACCAAGGCTG ACAAGACAGGAGCGGTTC CACAG ATCCAG CTG CA 

GACAGGACACAAGGCTCTAGAGACATGTGCAGGCCACCACCACCAAGGCCAAGAAAC^ 

TTAATCCCAGCACTCGATAAGCAGAAGCCGGCAGATCTCCGTGAGTTCGAAGCCAGTCTG^ 

G CT ACAGTCTCAG AAAAT AAAGG AAGG AAGAAGACAG AG AGAG CTACAG ACAG ACAGACAGG CT CAAG AGC CTGGG AGAAAG CAGG 

6 0 GCTCATGCAG ATGGTGAGGTTAG CTTGCCCGGCCTGG CACTGCTGGACTGTTCAGG AAGCTAGATAGCTTCCCGAGGAGACAGGGA 

CAGAGGAGTCCAGCTACACAGGAGTTAAGGTTGGCTCrCCAGCCTTGCCCTGXS 
TTACCCAAGGCCCTGGCATTCCCCCACCCCCACCCTTGGTAACC^GACCCTGCTTCCTGCTG 
CTGGGGTGGGGGGGGCAACACCrCCCCTAGGGCCTTCCTAAAGCTTGTGCTCACCCT 
GGTTGCTCCAGTGATAGCATTAAGGGAACCCTCATCCCTAGACATGGGGTGCCTGTGGAA 
TGGGACTGAGTTGTTGTTCAGGACCAGAGGGTAAACXXSTACCCATCT^

GGACATGAGATCCTGAAGCCCTTGACCTTGGAGTAAACCCCGGGTGCTGGCTACTGG^ 
TGGXSGGTGGGGTGTGCTGTCCAGGTGCTGGCCACTGCCCTGCC 
GCTCTCTGAGGACTGGCIX^TCTGGGCCCTGAGCTGGGAGGGCTC 
GCAC^TTGAGGGGTTATCTTGAGGAGGACTCTGTGTTTGGCATGTTTATC 

7 0 TGGGTTGCCAGTCTGCCTCTGGGCATAAGTGGCCTGACGTACAGGCGAGATCT 

TTGGGGGAGAAGGCTAATTCTAAGTTTTCAGAGTCCCCAGGCCTAGTTGATTACAACCGG 

GG AAGCAGG CTTGC C CCT ACTCACTG AG AGGAG TCTC CTG AACAGGTTCGCTT CTGGCCCCAGG T CCT AGTGG AG G AGGGGG TG AG 
CATGAGGGGTCTTTTTGTGGAGGACCCACAAAGCTAGATGGAAGA 
AGGCAGCACAGCCTGGGCCTCCATTCCCTTCCTTCCTCAGGGCAAGGCTCT 
7 5 O^AGGAAAGCAGAGCTGTCAGGATGCAGGGCAAAGGTCACCCAGAGAGTGAGGCT^ 
TGGAAGATATTGTTTCCCTTTGTCAGGCTTra
TCCITGTTCTCrrCCCCCCCCCCA£GGGCCTGAGACTTT 

AATCTCTCCTGAGCATATCCTGAGCCAGGGACTCCTTCCTCtGAGCGCCTCGTT 

CAGAGACTTGGCTCCTGGTGCCCTGGGCTCTGGGCCACCAGCCCTACCCACCCCCCACT 
5 GGATGGAGGACITGGTCCTGCACACATGTTCAGGAGAGGATCAGGCCCACT 

GTCCACOVGGGCACTGGAACGCCTGCGATGTXKKrrTCCCTTGCCTT 

AGATAGCAGAGAGGAAGAGGGAGGATCCTGGAOGTGGGATGTGAGGC 

GACACTGTCGGGCCCCAGCATCAACATTGTCTAAGGAGCAAGTCCCAGAAACCCACA^ 

AGAGGAATGACCAGGGCrGTGTGGCCTTGGGAGAGAATCCCTGGTACTCT 
1 0 GAACAGCACATGGTGAGGAGAACCAGTGCCAAGTTGGAiGGGCTTA^ 

CCGCTGCTCCCnXSTTTCCCTAATCTTTTCTAAGAGACTATCAGACCT 

TACATGCCAAGTGTGTCACATATGTGCGTCCATATATACCTGCT 

GACTTCTGGAAAGAACGCTGGGCTGGTCATCTACCTCAGACCCTCCGCCCTAGCTC^ 

TGTTCTGTCTC CCACGTTCCCCAGGGCCAGCCT ATGGGTAGCACGTACTGGCAGGATG AGAGA 
1 5 GGCAGAG AGGGG AT C CCAAGGGGG C CTC CTTTCCTCAATCCTGG CTTC C CACCTCT CT AG C CCTGG AG ACTGGG AAGGGGGGTTG A 

GGTGGCAGCAGGAGGTCTCACTGTCGGGGTTCrTCTTCTCACCCAAGCTC 

CACTCAGCCCCCGCTTTTCACAAGCACTTTCTTTTTTGGG 

AGGTTAATAAGGCAGGCTCTATCGACCC^TCAGGTTATGGGGAAACCATTCTCGGGCCCCCCT 
ACCACCCAGCTGTGATAGTCATTCCCTTCTGCTCK1AAGAGCTTGTTG 
2 0 TCACTTGACCTGAGGATCAGC<3TGGAAGATCAGGGATGGAAAGGCTGAGGAACAAGACT 
CAGCAAAAATGCrarCCCAAAGAGTTGTCGGTGTTCAAAAATTTTACCTCCT 
CTATTTTCCACAATAGGGAGCCAAGGGTGCAAAGGTCACCTTCTTTCAAGTC 

CCTTTATGGG CG AGT AGGGGGTGAGGGAGTG AGGAGG AGGC CAG AC CTCGATAT CTG C TG CTG T CATGGAT CC AGG CCT CATTCTC 
AAGAGCTGTGCTGATTGTCrGCCCCCACCTTTCCCTGGXSTCATGTCTGCCTCCTGGG 

2 5 TGTCTCTTGGTTCAGCTCTAGTTGGAAGGAGAGTGTCCCCCTGGGTCC^ 

GAGATGAAGTCACACCCCAGGGGAGAAGTGGGGTGGGGCAGAGATGAGGGAAAGAAATCT 
AAGTATTTTTTGAAGCATTCTTTAAGATGTAAATCTTTATATATTTGA 

GAAAAAAACCAAGAACCCACGTGGTAAAGCGGACTGCGAAGTTGATGTAAAATTAGACATCTGTGTGTGTC 
CAGAGGAGGGTCTATACAAGGGAGGTATCCAGAAGGGrrTTCrTAAAATGTTAA 

3 0 TCTGACTTCTGTTTTTCTTCACTGCCTAGATTTTCTGT^ 

AAGGCACTTGTTGTCCAAGCCTGATGACCTGAGTTCCAGACTCAGAACCCACATAAGGATGGAAAGAA 
CTGGACTCTGAACTCTAAACATGCACCATGGCACAGATGTCCACACACA^ 
CTCTGAGATTATGCCTCACTCGCAAGGGTCTCCTTCATCACTCTCCAGAAGAG 
CAGAGCCCCAATTCCTACTTTATGGATGTGATATGCCCAGGATGATATAAAATCACCAGTCT^ 

3 5 ATATGTAAGTACACTGTAGCTGTCTTCAGACACTCCAGAAGAGGGTGCCAGATCT 

TGCTGGGATTTG AACT CAGGAC CTTCGGAAG AG C AGTCAATGCT CTT AACCX5CTGAG CCAT CT CTCCAG CCCAAGT AACT CTT AAA 

CGATACTTTACTTCTTATTTACTTACTTTTGAGATAAGATCCCCTGTAACCCAG 

AACTTCTGATCCTCTACCJUVCCCrCTCAAGTGCTGGGATGACAGGCATOTGC 

ACCAG AAG CGTGGGTG TATG CT AAG CAGG CACTCCAT C AGTTGAACT AT AT C C CAG AC CC CGGGGATG CTTTTT ATTT CAAAAGAA 

4 0 AACTCCCAATGGATTCATTTTCAGTTAGGAGGTGAGATTCTTCT 

AGGAAGGACTGTCTGTCTCGATCAGAGCAGGTGCTGGGCTCCCTGTCAGCC^ 
AGAATGGAAACAGAGGCACACAGACACCGGTTGGTATAAGGTACACAGCCCCTA 
CCCCATCAGGCCTGAGGGACTTTCAGGCAAAGGAAGTGACTTACCCAGGTCCTAC^ 
GACTCTGAACACTGACAGCCAAGCGCTACTTCCCGTGTCTTTC^ 

4 5 TTTCAGCC^GCCCAGCCCACCGTGGGCTGCTTTGGGTGCAATCTT 

CAAGCATCATCCTCGGGCCTGGGGGGTGGGGTGGGAGGCTAGCGATG<3GGG^ 

TTTGTGTAAAGTCACAAAGCCTGGGCATACAGGAGGCAGCAACAAATATCTGTTGACTATGGATAGT 

CCAGAGGCGTGAGAATGCTCAGATGGTCCCATCTGGGX3A 

GAGAAACCGTTTCTCTCCTCGAAGTTTCAGAGTTTCAGGTGGCAGGTTGACT 

5 0 TAGGAGTGGGGGTCGGGGCTTCAGTCCTCACCAAAAGCGAATTCGCGKrrGTATTAGCCACGAG 

GGTGGCGGGGTX2AGGGGACCCAAGGAGCTTCATAAAGCCA.CAGCAAAG^ 

AGCCCAGGAGCAGCTGCGCGCCTATCCAACGCGCGCTCAGGAGCCAAGCGTCCCAGCTCCCGCTC 

CGGACG CCGCCCCCGTCGCCCCAGCCCTAAGGACCCTCGGGTCTCTTCGACGGCTG CTGGAAGGCGCCC AGCCCGCCTCGG ATGGG 
CATCGTGGAGCCGGGCTGCGGAGACATGCTGACCGGCACCGAGCCGATGCCGAGTGACGAGGGCCG^ 
55 AGCATCGTTTCTTCTATCCCGAGCCGGGCGCACAGGACCTOACCGATCGCCGCGCAGGTAGCAGCCTC 

TGGCGAGTTCTTCCCGCCGCCCCCX3GGTGCGGAG 

ACCGAGGGCCGCGCGAGGACTACGCATTGCCCGCGGGGTTGGAGGTGTCTGGGAAGCTGAGAGTCGCGCTC^ 
TGGTCCAAGTTCAACCAGCACCAGACAGAGATGATCATCACTAAGCAAGGACGGT 

6 0 GCAGGGCGGAGCCGGCGGGAGTCTGGCCTCGGTGGCCCGCCGGAACAACGACTTTCCTCTTTT 

AAAATTTGAAGCTGCAGGTTGCTGAGTCTTTGTGTTCX5 

GGCACCGCAAGCTCCGGAGAGCCGCGGGTGCTGAGGGTATCAGAGGGCTCAAAAAGGATGATGCTC 
TTTGGTGCAGAGGAAAGGATCACTCTCGCTrCACCGAGAGTTC 

GCTCCTTGCCTCAGTCTGGGATCTCAGTGGGCAG CGAGG AGTGTGTGTT ATAG ACCTGACTG ACCG CCTTTCGGTCCTT AGGTATT 

6 5 CGGGAAATTGAAG TGTTGGGGG CAG AAT ATTTTTTGAGATCTCAG TGTG CCTG TTCCTTTG CT ACT GCTGTG CTT AGG GT AG CTTT 

GGTGTATGGGGGAGGGGGGGAGCAGACTGGGATTTGGTGACATC^AGAAGCGGTCGCCAAGTTTCT 
TCATATGTCTAGAGAGGGAGAGAGAGAGAGAGAGAGAGAGAGGAGTGTCTGTGTGTC 
TGTAGAGAGGTGGGAAGCAGCAAAACTTATAAAGGCACAAATAAATGCCGAGGGCCTGAAC^ 
CTGGTGTCCAGTCTTGGACTCjTGCTCTTTGAGGCTAG 

7 0 TGGGGGAAGATGGAGGGGGTCCCCTAAGCCTGTTGCATTAGTGGTATGGTTTGTt^ 

TTGCTGAAGAAAGAGAAACITGCAGTTTCTGAGGGGTCATTTATAGTATGC^ 
AGAGGCAGCTACCTTTCCCAGTGAGGACCAGCCAGAACTTTGCCATTAGTGAG 
AT\STAATTGAGAGA7TGCAAGGGTGGCX3TTTC^ 
TCCCTTCGGATGATGTGGGGAGAGGGCAGATAGCTGCTAACTAC^ 
7 5 TGAGAATAAAACATTTGATTGGT1TTTGGCTCATCCTTCA 
CTGAGTGTCTCTGGCACTGGCCAGGTTGCTGGTCCTAGAGAATGTCTC

AG AG AAACAG AC CTAG AGACAG CAGGGG ACTCTTCGG AGGTTCCCCAG CTGGGAC CAGGCCTCAGATGGCACCTGTCTGCTCCTTT 
TTTCTCCATTCTCCACAGCCACCTACACTGTCTGAAAGC 

GCCAGGCAGGGCTGGCCCGTTTCATACCCACGAATTCTGAGTTGCTGGGATAACCAGTGGCTTACGCTAGTCCCGC^^ 
5 GTGGGGCCTCTAACCATGGGACTTTCTAGTATGTCAGCAACAGGAG 
TGTCAGGCCTCAGCTCCTCCAGTCAGAGAGAGCATCAGCCACAA 
GGCTGGGTTGCTAGGGGAGGCATTTGGAGGGACACACTCTATATCACT 

TT ATCATTGGC CTGG CT CT ATGG ACTC CTCTCCTGTG TG CG CCT AT AGG ATTTTTTTCCCATG AC CACTTTTGCT CTCTC CCGTTT 
10 GGATCGGTGCTCTTATTAi-rri'AAGTGCTGTAATTGGAACCrGGGGCCT 
TTTTAAAGCCAGCTCTGGTAGTGCACCCCTTAAGTAATGC^ 
TCCCTCTTTCTTTCTCTTTGCTATGTAGCAAAGGCCT 
CAAGGATGTGCCACCACGCATGTTTGACACTCTCT 

GC CTT ATGTGTGCT CTGCACACAC AGT CACACTG CT ACTG AG CCACAT CTTT AACT T CT T T TT'lTTlTTrri AAGATTT ATTT ATT 
15 TATTATATGTAAGTACACTGTAGCTGTCnTCAGACACTCCAC^AAGAGGGTGTCA^ " 

TGGTTGATGGGATTTGAACTCCGGACCTTCGGAAGAGCAGTCGGGTC 

TT ATGACAAATCACACTGT ATAAAAAT AAAAGG CCAGG CGTGGTGGCACATTCAATCCCAGCACT CAAGAGGT AG AGGCAGGCAGA 
ACTCTGTGAGTTCGAGGCCAGCTTAGTCTACAAAGTGAGTTCCAGGA 
CAAACAAACACACAAGACCAAAAACCCAAAAAACAAAAATA^ 
2 0 TTCAGTATTGGTCTGGTCTT/ITTTGTTGTGAGTTATTTTC 

GG CTCAGCGGGCCTGGAACTCACTATAGAGATGGGGCTAACCTTGA^ 
CATTAAAGGCATTTGCCACCTTCTGGGTATTrTGAGATAG^ 
GCTTTGAATCITTATGTTTCTCrGAAAATGAAT^ 
AACCCACCTGCTTCACCCTCGGGAATGCTGGGATCACAAGTGTGCACT 

2 5 GGGGGGCTGGCAAAACGGCTCATTGGGTAAAGGCACTTGCTGCC^ 

CC!AGGGCCCAAATC^TTGAAGGAGAGAATTGACTCCCATAGATTGTTCTCTGAACT 
TCCACCACITGCACCATCCAAGAAAAGAAAAGTAAAAATAAGAAAA^ 
AAAAAATCAACCAGGGTTGAAGAGATGGCTCAGTCAGTGAAGTGCITGT^ 
GCCGGTGGATCTGTGTAAATTCAAAGCCAGGTCTAOVGAGTGAAT^ 

C^GCCTAGAGTAATCAGAGACCTTTAGGCCAATGAAAGACCCTTG 

AGATAGATAGATAGAGTAGACAGACAGACAGACAGTGCCTGAGAAATTATATCTGAGTTGTCT^ 

CACACAT ACAT ACG TGCAAGTG CACACACACACACAC ATG AAT AAAT AATTTAAAAAATC C!AATT AG CACAC CTAAAAAAG AAATT 
CCTAAATGCCTTCTGAGTGT/TTCATAAGAGTGTGCTAG 

3 5 AAGGCCAC!ACTGCATGACATGGGGTCCCTGTTGGATGGGGCAGTGACrCCCCATCAT 

CCCAGGGGTCCCATCCTGGAAAGCAGACAGGGAATACCACATCAGTTCAGCCTTACCAAGCCCTGT 
TTGAGCTCCAGCTATCTGGGAGGTCCTGGCTGTTTCCTTTTGCT 
TTTGGCAGGCTGGCCTGTCCCrCTGTAGGTTTCAGTGACCTC 
ATTCTTCCATCCCAGACCTAGCCACAGCCCCrCAGCCGTTGGTCCTGGGC^ 

4 0 ACCCTGGGCTCACCCACTAGTATGGATCTGTGGTAGCGATGCTTTTGGTTCrc 

CGGTTCTAGTGGGGCTCAGAGAGGGAAATGGAGC^CAAGAAGAGTTTGAGGTGAGCCAGAGAA^ 
CCTGTAATCCTGACAAACTGAGTTCAGATCCCCCCAO^TACTC<5TGCGGAGTAGGTTA 
CAGAATAAAAATAATAAAATGTAATACTTGTCTTTTTCTTTGGTTCCCCCTAC^ 
CTCTGTAGACCAGGGTGGCCTTGAACrCAGACATCTGCCTGTCTCTGCCT 

4 5 TTCTATTAGATATAATACCTTAAAAACAAAGAAACAAACAAACA 

TGCACTTTTGGAGCAACCC^GGAGCACTC'CGTGCAGGCT 

GACGAAAGGCTTTCTCAGTAGGCAAGAGAGGCCGGTCCCCGTGGGCGTGTGGGTGGGTACTAAT^ 

GGCrGTGATTGGTTGATTATTATAATTAAGATAACrTTGGGGAGAGC^ 

GTGACGTCCGGGTGGCGAAGGCGCITCCTACTTCATGTGAGGTAGTGCCACAGCGC 

5 0 TCGAAATTTTTAATTACTTTTTTTTTTT^ 

ATGTGTGTGGGAGGGATCATGTATATGCTATGAGTATGGAGGAGCCTATAGAGATTGGAGGTCAGAG 

ATAGGTGGT ATGGAG CCTATAGAGAAGTCTAGTAC AGCAGGCGTGGTGGCG CACGCCTT^AATCCCCAGCACTCGGGAGGCAAAGG 
CAGG CAGATTTCTGAGTTTGAGG CCAGCCTGGTCT ACAAAGTG AGTTCC AGG ACAGCCAGGGCT ATACAGAGAAATCCTC 
AAAACCAAAAAAAAAAAAAAAAAAAAAGATGTCAATTATGGGTCCTCTG^ 

5 5 TCCAGTCTTTCATTTCTTTTTCTCTTTTGGTAAGACA 

GAATGTGCAGCACACCTTTTGCCTCAGGTTCCTAAGTGCTGAGATTTTAAGT 
TTTTTT/rTTAATTTAGATTTATTTTATTTGTAAATCT 
ATGAATGCTTTGCTCATGTGTATGTATGTCTGTACACAACAT^ 
GAGTTATAGGTAGTTGTGAGCCACCATGTGTGTGGGCTCTGGGTATTGAACC 

6 0 TTITTTTGTTTTGTTATGAGACAGGGTTTCTCTATGTAGCCCT 

TCAGAGATCCACCTGCCTCTGCCTCCTGAGTGCnKKSATTAAAGGCGTGCACCACCACACC 
AAAAAAAAATATTTATTTTTATTrTATGTACATCGGTGTTTTGCCT 

GAGTT ACAG ACAGGGGT AAGCTXJ CCATGTGGGTGCTTGGAATTGAAA CCAGTG CTCTT AAGTGG 

TACACCATCTCTCGAGCCCCCAAAACAAGTGCATTTAATTGCT 

6 5 ATATTAAACCCAGGGTCTCATGGATGCTAGGCAATCAATGGT 

TTGCCTGCATCTATCTCTGTATCAGGGTGTCAGATCCCCTGGAACGGGAGA 
AATAGAACTTGGATCITCTGGAAGAGCAGCCAGTGCTCTTAACCCTGAGCT 
AGTTGCGACCCCAATTTAACTTCTCACATCTTTACAAAACACI^ 

TGG ACT AAGAATCT CAAAATCTCTTGG AAACGG CT ACT CAG AAAAG CCTAGCTGTTCTTTACCTG CTCTG AGGAGGACTCAGCTGG 

7 0 TCAATCCC^TGGAAAGTTGATTGATGTCTGAAACTAGTCA^ 

CCTAGAACrCACTATGTTCCCTGCCATGTCCCACAGCCGTGTAAAAATCATCG 
ATGGGATCTTGACTCTGTAAGTAGTTTTAAGGAGGTCAACTTGC^ 
ACATGTAGATCAGGGAGCrATTTGGGGATTCTAGCrcCTGGTTATT^ 
CTGTATGTACATOTACAAGTGTGAACATTTC 
TTCAAACTCACAGTGATCTGCCTCTG CCTCTCAAATTCTGGGATTAAAGGCATG CGCCAGCATG CTTGGTTCATTT ATTAATTAAT

TTGTTTGTTTGITTATTTTTGAGACAGGGTCTCATTAGCCTAGAGTCTACCAAGAAG^ 

CT CTGCTTCCCCAGTTTTC-GAATTCTAAGTAGATGTCAC 

CCTCCTCTTCAGTGTGGGCACrrGGGTACTCATGTCTTCGTGATTGTC 
5 CCTCCTTTGCCTTTCACTTACAATCITCCTGTATCTTTTC 

GTGTGCXK3TATTCATCAATCTCTCTATAAGCTTCAGTGTCTTCAGGTGTAGAGTTT 

TGATGAGTTGGCTTGTGCGTAAGTGGAGGATGGATAAACATAGTATGAGGCTGGGTATGGG^ 

CATCCGCGAGGCCCTGGGTTCTATATTTGGCACCAAAAAGGTGTTTGCTATTCT 

TGGTCACTAGGTAAGAAAATATATATGTTTTATGTTTTTTT^ 
1 0 TGGCCCATCATGTGGGTGTGGAGAATCAAGCCCAGGTCCTC 

AGAGTCTCCCTGTCCCGTAGCTCTAGCTACCCTAGAACTGCATACGTAGACCAGGCTGGCCT 

TTTCCCTCCTGAGTGCTGGGATTAAAGACAAGTGCCAAGCCAG^ 

ACATAGCAAATTCTAGGATAGCCAGAATTACATATGAAACCCTGCTTCAAAATA^ 

TTTAATCCCAGCACTTGGGAGGAGAGGCAAGCAGATTTCTGAGI^ 
1 5 GGGCT ACTCAGAGAAACCCTGTCT CG AAAAAC CAAAT AAAT AAATAAAT AAAT AAATAAAT AAAT AAATAAAT AAT AAAT AAG CAA 

G CAAG CT AGGCAGTGGTGACACACAC CTTT AATCCCAGCATTTGGG AGGT AGAC C CAGG TGG AT CT CT AAGTTTG AG AC CAGTCTG 

GTCTACATAACAAGTTCCAGGACAGCCAGGGTTACACAGAGAAACCTTGTCTTGAAA 

GCTAAGAGTACTGACTGGCTGCITTCCCAGGGGTCCTCATTTCAACT 

GAT CTGATG CCCTTTT CTGGC CTG CAGGTGAACATG CAGCAGAG CACT CAT AT AT AAAAT AAAAT AAAAT AAAAT AAAGTCT AAAA 
2 0 TGTTTTTTAAAAGCAACAAATATTAATAAAGACATGCACCACCATGCC 
ATGTGTGCTTTTGTGAGTTTATGTGGACCGTATGTGT 
GACAACirGTGAGCTGCCTGGTGTGGTTCTGGGAACCAAACCCTGATCT 

AATCTG AATGGCTT AAAGTGAATGGGTGAGGG AAATGGATTGAAAATATTCTC C C CACCT CXX5G TTTTCCAT ACTT AG CTCTTGGT 
CTAGGGATGTAGCAAAGCTGGTAGGATTCTTGTCTAAAATTCACTATGTCTTAGG 

2 5 GTGGTGTGCACCCTAATCCITGC^CTTTGGAAGTTGAGGCAGGAAAATTC 

CTAGTCTATAGGTTTCTTGATTAAGAGACAAAAGCTTTCCTAT^ 

TCTCC^GTGAGTGTTTG AATCTGTTGTTCTG ACCACACCGCTCTC CCAGCTGGGG CACAAAGG CAGAG 

CTCAGTCATTGGAACTGGTGATGCAGAGACnXMCATGGTGG^ 

GATGGTAGCGCTTGGAGACAAGACGGCTGTGATGGTGATAATGAAAGTCAAGACTAGCGTAAT 
30 AACTATGATGGAGATGGTGGGTGTGGTGACGGGAGTGACGATGGCCATGGTCATGGCGGAGGCGTAGCCAGAG 
GGTGGCTTTGACC^AGGTGGTGGTTTGAGAGATAGAAGACATGGTGAAA 

TGCAGATGTGGTGGAGGTGGTGTCTGTAGAGATGGAGGTAGCGGTGTCTGTATTATGGTGGACATGGTGGC 
GGTGGGGTGGAC^TGGTACTGGAGACAACATAGCTGTAATCATGGTAATGG 

CTGGAAGTTGTACTGGTGGAGACGGTGTCACCGTGATGGTGGAGATAGCGGTGAGAATGATAGTGGT^ 

3 5 TTGTCGTGATCGTGATAATGGGGGGCAGCCATGGTGGCGGTGGTGGGGAGGCCTTTGCAGGAGAGC 

CACTTTAGAACAAGGGAGGGTCACGCAGGGTCCCACCCACCCACCAACCTGGGACCCAA^ 

TCAGG ATGC CTAGGGTGTTTGATGAGTTTCTG AAGGCCAG ACTTGGGGC AGGG CAC ACT AT AG ATGTG ACTGG AGTGTGGG AC AGG 

GAGGACCTGTCAGTGACTTCCATGGGGC^C^GTGTGTTGCAGAGCGGG^ 

CCTTCTGGGACTATCTTTAAAGGGTTGCCTGGTGCCrrTTGGCCCAT 

4 0 AATTATCTAGTAACTGATCTTTTGAGAGGAAAGCGTT ATCAAAAGT CTGCAGGCTGGG ATGTGGCT CAGTGGTAGAGTGCTTG CCC 

ATCATGCATTAAGCCTTGGGTTTAATCAGCAGCAG 
CTCTAAGCTACGAA.CATGACAGGGTTGAACACAGAATTGGGGGCAATG 
CAAAGACACAAGTAAGCTCAAGCCGGGGCCACAGTACAACACGT 
TTTTGGTAGAACGTGATTATTGTTAAGTGTGAACAATTTAATT/TTG 

4 5 TGTAGTCCTGGCTATCCTGAGACTTGGTTTTATAGACCAGG 

CTAAGATT AAAGATGTGTT CT ACCACACTGG CT AGTG CAATTT AACTTT AAAAAAAAGATG T ATTG CTGGGCGTGGTGGCG CACG C 
CTTTGATCCCAGCATTTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTCGAGG 

CAGGG CT ACACAG AG AAACT CTGTCTCAAAAAAAAAAAAGAT AT ATT ATT AATTT^ TG TG T 

GTATGGACATGTGGGTGTCAGTGGAATCCAGAGGCATTGGATCACCTTGGAGCT 

5 0 GCTGAGAACTGAACTCAGTCCCCCTGGAAGAACACTATATG 

TGTTAC!ACACACACAGACACACACACGTATATTTAAATTTATTTATTT 
ACACACCATAAGAGGACTACAGATCCCATTACAGATGGTTGTGAGCCACC^ 

AG AG CAG TCAGTG CTCTT AACCG CTGAGCCATCTCTC CAG CCT AC AT CT ATATTT AAT AACT AACAAGCT AGCT ACCTTCTG AC AC 
ACTTGTCAGTTGGCTCTTC^GGACAAGTGAAGGCCGGCT^ 

5 5 TGGTCCCTC^AACTTCTCTGTTCCCCAC^GGCGAATGTTCCCATTCCTGTCCTT 

ACAGGATGTTTGTGGATGTGGTCTTC^TGGACCAGCACCACTGGCC^ 

GGCAGCATGCCAGGTACAAG CITCGGGAGTGGTGGGGATGCGTTTCCTCCTAGACAGGG CGCCCCCTGGTGGATTACTCAAGCCCT 
TACCCCGATGTGAAGGATTGAACCCTACAGCCTATCCACCCACCT^ 

6 0 CTCTCCCTTGCATCTTTCTCACGGTCCACTCAGAAGAGGCAAAATCCCACATCT 

AACTGTCCACAGGGAACCGCTTATATGTCCACCCAGACT 
AAGCTAAAGCTCACCAACAACAAC^XXX^CTTCCAACAATGTG^ 
CTACCCCCTGACATTTCCTAAGACCCACTTTGTACTGTTGGA^ 
GCCAAGGATAACCATGAACTTCTGTTCCATCTGCTCCrGAGGX3 

6 5 AATCAGATATTGCACCAGACTAGACAAAGGTGCTAGCAATTC^GCTGCTCT 

TTCCTACCCATCTrTGGGAGACAATGAGATGGGTGGACACTGGATCTC 

ACATGJIGGGCTTCTTCACTAGATCATGATCTGTCACCGAGATGGTCCT^ 

TTACATTGAACAATTATGGGTCTTGTTTTGGTTTGTTTTACAC^ 

CCATCCATCCATTCATCCATCTATCCATCCATCCATCTATCCATCCATCCATCCATCCATCrATCT 
70 ATCTATCTATCTATCTATCTATCTATCTATC^TCTATCTACTTATC 

GGGGAGTAAGTTGCTATGATGTGTTTGTGGAGGTCAGAGGACAACTTGCTGAAGT^ 

GGATCAAACTCAGGTCGCCAGGCTTGGCAGCAAATGCCTTTATCTG^ rGriTiTTT 

G1TTTTTGTT1TT1TAAATTAAACAACAAAACAAAACCT 

AAAGACCTGAGriTCATCCCrACAACCrAGGTTAAAAGAG^^ 

7 5 C-GGAGCGGGTGGCC^GACACAGGTGGATCCCTAGGGCTCACrC^CCAGCC^ 
GGAAGTTGCATGGTGGTAAGGTGGTCGAAGCCACAAGATGAACTCAGGCATCCTTTCTAGTCTGTCTCAC^
ACGTAGGAGAGTCCATCTAGAGATGAGGCCAGCTTGAGGTCAGGAGGACCTTCCTTCC^ 

GG CCCCTCTCCAG ATGATCGTCCTGCAGTCTCTCCACAAGT ACCAGCCCCGGCTG CACATCGTGGAGGTGAATGATGGAGAGCCAG 
AGG CTG CCTG CAG TGCTTCT AACACACACGT CTTT ACTTTC CAAGAGACC CAGTTCATTGCAGTG ACTG CCT ACCAG AACGCAG AG 
5 GTGAGCCCGGGGACGGGCATCTGCCAGAGGGAGGGTGGGACAGACTTCTTGAGGAGGAAG 
CGTCTCTAGATCACTCAGCTGAAAATCGACAACAACCCCTTTGCCAAAGG 
TOU^TCCCCTTTGGTCTTTCTGAGATGGGACTGTGGTTCAGTGCT 
ATATTAGAAGCCTAATATAGCCTTGATCTCTCTTTCTCTCTCTCTGTCTCTCT 
TTTTATTTATACGAGTACACAGTAGCTGTCITCAGACACACCAG 
1 0 TGTGGTTGTTGGGAATTG AACT C AGGACCTCTGG AAG AG CAAG CAGC CAGTG CTCTTAACTGCTGAG CCATCT CT CT AGC CC CCTC 
ATTGATTGTTTTTATTTTTATTTATTTTT^ 
ACTGCTTGGGGGAGACCCCTTCTCACCTCTTCTATC 

CCAAGGATATGATCTCACAGCCTTACTGGCTGGGGACACCTCGGGAACACAGT^ 
CCCACACTCCTACCCTCTGCCCCGGGGCCCACTGTGCCCrrACTACCGGGGCCAAGAC^ 
1 5 GGCCCCTCAATACCCGCCCAAGATGAGCCC^GCTGGCTGGTTCCGGCCCATGCGAACTCTGC 
CAGAGGAACAGGGCTCCTCCCCCTCGCTGTGGCCTGAGGTCACCTCCCT 
GACACTAAGAGGAGGAGGATATCCCCCTATCCTTCCAGTGGCGACAGCTCCTCT 
AACCGAAGGCCAGTTTTATAATTATTTTCCCAACTGAGAAAATGCCGCTGAAT^ 
GGGGCTGAGAGCCCCGAGCTCTTCCCCATCCCTTCCCTGTATAGTGATTGGTTGGAGAG^ 

TGCTGCTTCTGACCCCATGGTTCCATGGAGAACGG^ 

GAGGTGGACKXMTCGGGGTGGGGAGTCCAGGAGAGCTGCTCTCTTCCCCTGT 
TGTGTTAATCTCTGACCTGAAAGTGAAGATACACGCATTTTTACAACAGCC^ 

ACTGGG CCACCTG CGAGG AGACAAG AGAGGGTGGGTGCAGAGG AAG GGTTTG AAGGGTG CACCAGG CGAGGT CACTTTG 

2 5 AACCGGTGTGT ACACACACGGG TGT CT CTTTTTTATTTCTTCGGGAGGGG GGAG G CT ATTT ATTGT AG AG AG TGGTGTCTGGATGT 

ATTTC1TCTGTTTTGCATCACTTTCTGGAAATAAACAT 
GTGTGTGTGGATCAAAGTTTCCTGATCTAGCAAAGrrGTCCTGTGGGCTCT^ 

ACC CCACTT AGGGAT AAACT CAG AGT ATGT CTTGTC CTCAG C ACTCCAG CTGATGGTTCATGCCAC CACATG TGACCAAAGG ACCA 
GCCCCTCAGCCTATGTTGCCTGCTCAGAACCCTGGAGGGAGGCAGTGTGGGAGCAAG^ 

3 0 TAGGAAAGCCAGGACrCTTGATTTATCrTTGCATCACT 

TATACCATGTGATGTTGCCACTCTTTTGTAGGTCACGA 

CCTCITATTTAGTTTTCATTCATAACCATAAACTTAACT 

CAGGCACTTCAGTGAGAGCAGAGATTACAGGGTGTTGCGAGCAGATGACGTGGGGGTGCTTTCC^ 

3 5 GGTGGGTAAGATGCTCGAAAGTCAAAAGAATTAGAGTTGTCCATGGTTGGAAGTGG 

CCTGGACCCCCAAACArTCCATGGCTTCCAOSATGTTCAT^ 
CAGTGGGAAGCATTAATGTTTGGTCCAGCATTTGCATGGACAAGATGCCAATGAC 
AAACTTCTCCCTGTAGATGGACCTGCCGCCAGTGCCACGGTGGTGTGTGACATCACC^ 
TGTGAGAGTAGGAACCCTTACACCCAAATCCTATCTCTCCAGACAGAGCTCAGAGCAC^ 

4 0 GTCTACAAACAG(ZTrCAAGATTATGTTGTAACCCAGGCTGGCTTCCAATT 

GGATTACAGACACAAACCACCATGCCCAGCTTT 

AGAGAATTTGAGTCAGGGGTAAATGGCTAGTTGCACAGACrGACCACCCA^ 

GGTAGCACACATCTGCCATCCCAGAGCTCCTAAGAGATGGGAGGCAGAGACAGGAGGGTGTGCTTG-GC 

CACAGTAGGACAGTAGGAAAACAGTGGGGAAGAGAACAGTTTTAACTO 
4 5 TCCAGGTTGGCCTTAC^(ZATAGACCAGGCTGGCCTCAAACTTACAGAGATCC 

GATGTGCCCCACCACACCCAGCTGTGTATGTGTATATTTGTGTGGGTGCCATGGAAACCAGA^ 

TGGAGTCAGATGTGGTTTTGAGCCTCCAGATCAGTGATAGTTTGGTGGGAACCIAAACT 

CTCAACTGATGAGCCACTGCTCCACACCCTAAAAITAAAAAAACAA 

CAGCACTCGGGTGACTGAGACAGGAAGGCAGACAGTTGGAGACCAACCCAGGCT 
50 AAGGACAACTGCAAATGATATCAAATTTTG CT^

ATAAAATTGTGACCCTACAGAAATCyVCGTTTTTGTCITAATGAAA^ 

TGAAAGGAACGCTGTAAGGTTAAGAAAGGTTTTTTGAGCTAGAGTTATAGATAGrrGT 

ACCTGGGCCCTCTGCIAGGAGCAACCAGTGCTCTTAACCACTAAGCC^ 

GCTCATTTTTATACGGATTAGTGTTITX3CCTGCATCTATGG 
55 GGGTGTCAGATGCCCTGAGACTGGAATTTCAGAGGGTTGTCAGCCACCATGTGGGTGTTG 

GAGCAACCAGTGCTTTACCCACTCACCTATCTAGCTCCACACCACAAACT 

ATCTCTCnTAGGAATTATCATGCTATTTTGATTCATGATCATCCA 

ATTGAATGGCACATGTTGGTTTGAAGTCTTGTCGCAG^ 

CAGGGCCTCGGGTGTGCTAGGCAGGCACCCCACCAGAGAGCTACATTTCCAGTCTCATCGATGT^ 
6 0 TGGGTCTCGACCCCCCCCTTCCAAAGCACTCATGATTACGTTGCGA^ 

GTGTGTGAGGGCAGTTCTCACCACAGCATGAGGAATTCTACTAAAGAGTTGCAGCATTAGGAAGGT 
TCTCTTrCACATTGAAGTCTTAAATACTGGGTGATCCTTATCAACTCAC^ 

CTTGGGTGTATAGGAGAAGTGGCTATCTCTCACGGGCrrTCCCATAGGGTGATCCGGGAGGGTGACCCT 
GGGTGCTGGATCTGACCTCGATTAGCTTATGAAGCCTGAATGTATO 
6 5 CTGTGAGTTCAAGACCAGCCTGGTCTACATAAGTGAGTTTCAGGACAGCTAGGGCT 
AACAACAACAAAAGACCCTCAGGCC!ACACrCGCATGCAATCIX31^ 




TCTCTGGGACAGAGTTGATGAGAAATACTGGACTGTGGTCra 
7 0 TTGACAGGTAGGCAAGGTTAGAC(^TGTAATATX3AAGCTTGCCCACTGA 

ATTTGAAGGAATACCTCTCCTCTTrCCACCTTTAAGGATCCATTA 

CTGAGTGAGCCTTCATGTCCCTTCATGCCCCATCAT^ 

GCCCCATGTCATGAAGCTGTTCTGAAGGGTGACCTOITCTCGGCTC 
AACAAGTATAGCTTTCCCTCACGGCATCCGGTX^CATCAGGTTGGTAGC^
7 5 AATAGATCCGTGCTGCAGTAAGGCACAGCCCCTCCCAAAGAGCTTGTTTCCT 
TGAGTGCACGGAGAGACCCCTCCTGAGAGGATCCATTCTCCA
GAGAATGGAAAAAATGGAAGGTCACTTTTTTCACCCCCAATTC 
ATGTGAGTGTGTGCACGTGC^TGTGCTATGGCATGTGCACTGAGGT 
GTTTGAGACTCTATTATTTTTCTACCGTGTGTATAAATTGCTGGCTO 
5 CCTGGGAGCACAGGGATTACAGGTGTGTGAACTGCGTGCCAAGCTTCrrCCATGA^ 
TGTATGACAAGGGCTCTTACTCACTGAGCCATCTCTCTTGACCCT 
CAAACCGCTCTTACTACCTTTAGGCTCCCAAGTCCTGCCTTTCT 
CTGGTGAGACGGCTCTGGGCGCTCAGCGTGAGAAGCAGACATGAGCA 
GTTCTCTGCAAGGGGCTCAACCCCCTCCTTCCACCGCAGCTGACTGCCCCITG 
1 0 TTAGAAGACATGCTAATTCTCTGCTCCATTTTGTGCAATGGTG 

GCACAGGCTCCATAAGACCATGTCTCAAAATAATCTGTGAGATCATAGTCCT^ 
AGTAGGTCAGGTTTATATGATGGGAAGGXX5TCTCCTCTTGTGCCTCACCTTATA 

CATGGAGGAGATGGGATTGAGAGTAGCTAGCAGATT CGGGTGGCCTTCAAATGGAGG CTGCGTG AACTGTGTTGTAATGGAGCTGT 

CAGGGAGATGGGCATGTGGGAGGGGTGGGGTCCTTAGTGCTGTGTTGCTATGAGGATACAAG 
1 5 GGGGGAGAAGTTTTTATATTTAGTTTACACATTCTGATCTTGGTCTATTATTGAGGA 

ATGGAACAATGCTGCTNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^
20 NNNNNNJTONNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNTCTCTCTCT 

AGTCCCTCAGTTGAGATGCCGACTTCCCAAATGACTGGAGTTTATGTCAAGTAGA 

2 5 GCAAATTAAATTAAAGTAAAATAAAAATAAAATAAAATATAAACACTCTACT 

G CAAG AGAGTAG CCATGAACTCAAAG CAAGCCCAAGGTATTGAGATGAGCGTGAACCTATCTCAGTGCACAG ACACAGGAACACAG 
TGTTAGTGTTGGTGGCCGGAGCAAGAGCACAGGAAGAAATTCCCTGGTCCCAGCTCCTCT 
CTGCAGCCACTGCAGGTGCTGGCGGCAGCGTCCCTTTGGTTTGGAGrGGGCGT 
CCTCGCCCAGTCATGGTAAGCAGTCTCTGTGAGCAAGAGGGCAAAGACGCGATGTACGCACTT 

3 0 AGCTCTTCCCACAGCAATATCGGCTACCCAGGCTGGGGGTGGGATGGAGTCAGAGACCACATC 

AGATCAAGAAGGTGGTGTTGTCGTTCCTAGATCTTGCTGAAGTT'l'ri 'G'lT CTCCT TTT CCTTTTGAGACAGG CTTTCCTG CAGAGT 
TAAGGAGACAGTTCTCTGAGACAGGGTTTCTCTATGTAGTCTTGGCTGTCCTGAAACT 
TGTGTGAAAGCAAAGCTTGTGGAGTTTGCrrACTGTATACTCCTAACCA 
AAAATATTTATTTGTTTATTTCATGTATGTGAATACACTGTTGCTGTCCT 

3 5 GATGGTTCTGAGCCACCATGTGGTTGCTGGGAATTGAACTCAGGACCTCTGGAAGAGCAGTCAGTGCT 

TCTCCAGCCCCCGTGATGTTGTTCTTCTTGAGGATGCAGGTCCTATTCCCATCACCTATCT 

CAGTCCCCGGGTATTCGATGCCCTCTTGTGGTCTCTGTATGCACAGATATACATGCAGACAAAACACCCATACACATA 

ATAATTTAATTTCTTATTATAAAAACTTTGGGAGCTGCTATGAGTTCAAGTGTGACCT 

AGGGCTATGTAGTGAAAACCTGTCTCAAGAAACAAACATTACA^ 

4 0 TC ATTCTGATTG AT CTG CCAGACATGGG CAGTCAATTTGG CTGTGG AG TTGG AAG CTGACTGATGGG CACAATCTGGGAG CAAAAA 

TGAACCCAGATTAGTTTAATTATCCTGGGTGATGAT^ 
CTATCTGTCCATCTGTCTGTCTGTCT 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNKN^ 
45 NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TATATGAGTACATACAAATATAGGTATGTGTGTGTGCATA^ 

GATACTGTAAGAGTAAAACCTAAAAACAAAATTCTAGGCCTTTAG^ 

TGTTCCAAGGTATGCTAATCCTAAAACAGATAGCGACATCCCrrCCACCCACCCACTTCCTG 

5 0 AAGTCACCCTGACCTACCCTGACCATTGCTGGAACCCGCTATGCAAAGGTGCTATTGTTAG 

ATAACCCTCACAATTACCAGGTATTCCTCGTGACTCTTGTGACTTTGCACr^ 

TTTTCCAACAATGCCCTTCCCACCTCCTTGAGTTGTGGTTTCriT 

CCTCTACCCCTGCGTGGTGTATGACCGTGGGCCTGAGAGCGCTCTC^ 

TGTGGGTGATTTTGGGGTGTCGCCTCTCCTGAGTCAGAACCT 
55 NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNIWNNNNNNNNNNNNNNI^^ 
60 NNNNNNNNNNNNNNNNNNNNNN^ 

ACCCCAACGCCCAGCACTGGGGGAGTGAGGACAGGCTGGTTCCAGAGGATGGCCCACCAGC 

TCCCTCCCTCCCCCAGGCAAGTGGGCTGCATCTGGGGATCA.CAGACACGCACATA 

CATGAGAACACACACACGGTATGTTTGCATGCATTGGCTCCCTTTG 

GTGGAAGACGGAGCCTTTTGAGCAGTGACCAAGAGGGATGAAAGTCCT 

6 5 CTCCTTGTTCTGTTTTATATTTACTCTCACCATCCCACTTTCTGTCA 

GGACACCCAGCTTCCAGAACCGTAATTTAGACCAACTTCT 

GGACTGAAAGCTAAATATCCATTTACATGAAACCAAATCACAGAAGGGAATAAAATCCTTCTTGTTCCC 

MOUSE SEQUENCE - mRNA

7 0 GCGGCCGCGTCGACCGCGCGCTCAGGAGCCAAGCGTCCCAGCTC 

TCGCCCCAGCCCTAAGGACCCTCGGGTCTCTTCGACGGCTGCTGGAAGGCGCCCAGCCCGCCTCGGATGG 

CTGCGGAGACATGCTGACCGG CAC CGAG CCGATG CCG AGTG ACGAGGG CCGGGGG C CCGG AGCGGACCAACAGCATCGTTT CTTCT 
ATCCCGAGCCGGGCGCACAGGACCCGACCGATCGCCGCGcAGGTAGCAGCCT 
GCCGCGCa^TCGCTTCCTTGGATCCTTCGCCTACCCGCCCCGGGCTCAGGTGGCT 
75 GCCGCCCGCGGGTGCGGAGGGCTACCCGCCCGTGGATGGCTACCCTGCCCCTGACCCGCGCGC 
AGGACTACGCATTGCCCGCGGGGTTGGAGGTGTCTGGGAAGCTGAGAGTCGCGCT

CAGCACCAGACAGAG ATG ATCATCACTAAGCAAGGACGG CGAATGTTC CCATTCCTGTCCTTCACCGTGG CTGGGCTGGAGCCCAC 

AAGCCATTACAGGATGTTTGTGGATGTGGTCTTGGTGGACCAGCACCACTGGCGGTA 

AGGCAGAAGGCAGCATGCCAGGGAACCGCTTATATGTCCACCOVG^ 
5 TCATTTGGGAAGCTAAAGCTCACCAACAACAAGGGGGCTTCCAACAATGTGACC 

CCAGCCCCGGCTGCACATCGTGGAGGTGAATGATGGAGAGCCAGAGGCT 

AAGAGACCCAGTTCATTGCAGTGACTGCCTACCAGAACGCAGAGATCACT 

TTCCGGGAGAACTTTGAGTCCATGTACGCATCTGTTGATACGAGTC 

AGACCCCTTCTCACCTCTrCTATCCAACCAGTATCCTGTTCCCAGCQjTTTCTACC 
1 0 TCTCACAGCCTTACTGGCTGGGGACACCTCGGGAACACA 

CCCTCTGCCCCGGGGCCC^CTGTGCCCTACTACCGGGGCCAAGACGTCCTGGCGCCTGGAGCTGGT^ 

CCCGCCCAAGATGAGCCGAGCTGGCTX^TTCCGGCCCATGCGAACT 

GCTCCTCCCCCTCGCTGTGGCCIXIAGGTCACCTCCCTCCAGCCGGAGTCCAGCGACTCA 

AGGAGGATATCCCCCTATCCTTCCAGTGGCGACAGCTCCTCTCCCGCTGGGG 
1 5 GTTTTATAATTATTTTCCCAACTG AGAAAATGCCGCTG AATTGGAAGGTG CCCACT AACTT AGAAAACAG ACG CGGGGCTG AGAGC 

CCCX1AGCTCTTCCCCATCCCTTCCCTGTATAGTGATTGGTTGGAGAGGAAGCGGGGCAAG 

TCCTGGCCCACAAGGAAATACGACAGGAGTGTCCCCTGCCCCTrTCTCTGCCCGAACTAC^ 

CCCCATGGTTCCATGGAGAACGGAGAATGGACTCCAGAGAGTTTTGGACCCAGAGGGACTT 

TCGGGGTGGGGAGTCCAGGAGAGCTGCTCTCTTCCCCTGTCCAGTCAGTAACTTTCAACT 
2 0 TGACCTGAAAGTGAAGATACACGCATTTTTACAACAGCC^ 

GCGAGGAGACAAGAGAGGGTGGGTGCAGAGGAAGGGTTTGAAGGGTGCACAT^ 

CACACACGGGTGTCTCTTTTTTATTTCTTCGGG 

TTGCATCACTTTCTGGAAATAAACATGGACCTGGTAAAAAAAAAAAAAAAAAAA 



MOUSE SEQUENCE - CODING

ATGGGCATCGTGGAGCCGGGCTGCGGAGACATGCTGACCGGCACCGAGCCGATGCCGAGTGACGAGGGCCGGGGGCCCGGAGCG 

ccAACAGCATCG' ri " rcrr CT ATCCCGAGCCGGGCGCACAGGACCCG ACCGATCGCCGCG CAGGT AG cagcctggggacgccctact 

ctgggggcgccctggtgcctgccgcgccgggtcgcttccttggatccttcgcctaccc^ 

gggcctggcgagttcttcccgccgcccgcgggtgcggagggctacc^^ 
30 gctctaccc^ggccgcgcgaggactacgcattgcccgcggggttggaggt^^ 

tgttgtggtccaagttcaaccagcaccagacagagatgatc^ 

gtggctgggctggagcccacaagccattacaggatgtttgtggatgtggt 

ou^gtgggtgcagtgtggaaaggcagaaggcagcatgccaggg 

actggatgcgcc^ggaagtttcatttgggaagctaaagctcaccaacaac^ 
3 5 ctgcagtctctccacaagtaccagccccggctgcacatcgtgg 

caciacacgtctttactttccaagagacccagttcattgcagtgactgcct 

agaacccctttgcc^laaggattccgggagaacrttgagtccatc 

aactgtcaactgcttgggggagaccccttctcacctcttctatccaaccagtatcctgttcccagcrc 
aggcc^gcccaaggatatgatctcacagccttactggctggggac^ 
40 gcatgaagcccacactcctaccctctgccccggggcccactgtgccctactaccggggccaagacgtcctggcgcct 
tggcccgtggcccctcaatacccxjcccaagatgagcccagc^ 

gggatcctcagaggaacagggctcctccccctcgctgtggcctgaggtcacctccctccagccggagt 
gataaggaaaccgaaggccagttttataattattttcccaactga 


HUMAN SEQUENCE - GENOMIC 

ACCATGTTGGCCAGGATGGTCTCAATCTCCTGACCCTGTGATCCTC 
CCACTGTGCCTGGCTGATACTTAACTTTTTTATATCTCAGTTTCCTTGTCTGTAAAA 
TCACACCATTGTGAAGTCTAATGAAATCAAAGGTATATTGACTAGTCTTAACCAAAAAAAAAAA 
5 0 GAGAAAGAAAGAAAGGAAATAAAGGCTAaSAGTTGCCTGGGGAT 

AGGCTGGAGTGCAGTGGTACAGCCTCAGCTCACTGCAACCTCCGCCTCCCGGGTCCAAGCAATTCrrC 

AGGTGXK^TTACAGGCACCTACCACCACGACCAGATAGGTTTTTTTGTATTTTTAGTA 

CTGGTTTCGAACTCCTGATCTCAAGTGATCTGCCCACTTCGGCCTC 

CCXjXSGATATTCTCTTATCTCATCCTTTACCCCTCAACCCAGAGTGGTGATAGGAGGAGGGGG 

5 5 TACAGTTTCCAAACTCACTGGGAGTCAAGGATTGAGACAAAGTACAAGCTAGATTTGGCAAC^ 

GGGAGCCCCATGCCCCACTGCCTATTTGCIAGGGCCCTGGGTCTTGGGTGGCCCTAGTTCAGTGTGTGCATGTGTG 

GTGATGAGGTCAGTATCTCAACTCCCAGTTACTGGTCCAGTTGCTAAGAACTTGCCTGACCA 

AATGTGAGGTTCTGATGCCAGTGATTGGAAAGAAATGGCGGTGAGCAAGGAGAGGG^ 

GTCCCCTGTGG CCAGAG ACGTGGGTCCAGGG CC CTTCTCAGTGCCCAGCTCGGTC CAG CC AG CTGCCCATGG CCCTGGTGGGAGG A 

6 0 GGTAGGCTGGACGCCITCTGACACITCCr 

TGTGTCAGAGCGGATCTGTGTCTGGGGTGGTAAGTGGGGTTAAACTCAAGGTCAGACCTAGCCAATCCAITTTCGC^ 
CCTTGGGTGTCCTCCATTCCAGTGGCTGGGACCCAGGTGCCCACTCAGCTGGAGACAGCACTGCr 

TCCCTCTGCTTCACACACTGGTTGCTTG CCTGATTTCTG CCTGTGCCT AGG ATTTATAAGG AAGGG AGTTGTGG CATGTGGATCCC 
AGGCCTCCGTTCTACTCCCAAATTTCAGGTGCTAATCTCTCCATGAGTTTCAGTGACCT 

6 5 ATTTAAGGGGTTGTGGGAAGACAGCTGAAAGAATGTCCCAGTCTTCA 

GGGGAACACTACCAGGAATCTCAAAT^CTGTACAATGCAGACACAGTCAATCAGCCTCCACCT 
AGATCACCAACCTGTCCCAGGCTCCTGAGTCTATTACCTCACTCCTCCT^ 

TGTGGCAGAAGTTGGGAAGGCAGAAAAGGAGAGAAGGAGGGAAAAGGACAATCTGAGAACAAGCTGGAGGGCC 
TGCAGAGTTGGCAAGATGGCAAAGGGGATGCAGTGGCAGGTGAATT^ 

7 0 GCCTTTGCGGAGGAGATCCCTAGGTGAGAAGGTAAGAAGG^ 

TATATATGTTGTGGGTGGGGTAGGGGGAACAGAAGAAGAGCTGCAGCATCCTGCCTTTG^ 
CAGGAGGTGGGCTGCAGGTAAAATGTCCCCGTGAGACAGGGTTGAGGGAAGCAGAGG^ 
CCCCACTCCCAGCTCACCCGCCC^CTGCGAACACTCCTCAGCTCCCCCACCCCAT^ 
CGTTCTCACCACGGACCCA.GTGCCGGCTGTGGCn^CTTCCCrCTGACCTATT^ 
7 5 CCAAACCCC<^CCCCCTGGGCCCGTCAGCTCCACCGCGTGTCAA 
GCCCCTCCCCGGAGGGCCTGAGAGCGATGGCTAATT

CAGAGGGGCCCTGGAGGACTGCTGGGAGAAGCTGGGGCCTCCTCACCCrCCCACTG 
CrCCATTCTTCCCTCTGTCCTGCCCCTAGTGGTGCTCAATCGC^ 
CCTrAAGGGACAGAAAGGGGCAACTAGTCTTCTGTCCTGG^ 
5 GGGATGGTCCTTGCTGTCTCATTTTACAGAGGRGAAAACTGCGACACACAGAGG 
TGGCGGAGCCAGAATCAGAACCTAGAGTTCCATGTTTGTCTTTCCACCTGT^ 
CCCCACATACCAACTAGGTCTGTTTCTGTTTCTGTGCAACTCCGCTGCCAGCCCTC 
CCCTTGCCCCCTGGCCACACAGAGCTTC^CCCTG^ 

TTCCCCCAATACCACCCATG CCCTTTTCCCTCTTCCTCTCTCATTCATC CCT AACCAAGTCCGTAAGTCTG CTCTGTCTCACCCGC 

1 0 ACCCTGAGACATTAACCAGGTGGACAAGGCAAA 

TCCCAGATGGACAGATAGACATGCAGGAGATACATGGAGCGCTGGATAGCCCTGGAGACACGGACA 
CAGACACAAACACGCTACAGAGACAAGGCAAGTTCCGAGGCCCACAGGTAACAGACAGACAC^^ 
CCTACACTCAGGACCAGCGTGCAGACCTCTCACAGGTCCCAGGGATAGGAGTGGACGTAAAGAGGATAC^ 
ACACAGGGACCCAGAGAGGGGCCGAATCAGACCAGAGGAACCCAGACACTCAGGAGC 

1 5 CACAGATCCCAC^AAGCCAGAGAGAACCTTGGGGAGAGAGGAGGCCCCAGGGGGTG 
CCTGGCAGAGCCTGACTGTGCCAG<»AAGTCATGCCAAGATG 

CCTCAGCCTGTGCCCTCCCTCCAAGTCCCTTTTCCTTGGCTCTTCTTTTTCTTCCCT 
TGGATATTTCCCGCCTCCCCAGCCCAACTCCTGGTAACCAGACCCTGCT^ 

2 0 AGTTACCCAAGTACCTGATGTCTGAATATCCCTTCAGGAGGGCGCAC^G 

TTCAAAATAAGATGCCACGGAGAAGGAGGACTGAAAGGCACCCAGGCCTGGCCTGGAGCCCAGGG^ 

TGGGGGGTGTGTGGGGAAGGGTGACCCAGCCTGCAGATGAGAGCCTGGAGCCCTTGACCTTGG 

ACACTGACCACCTGGTGCTGGGTGCTGGGGACrrCGGAACTCCTGCAGCTGTCACGGGGCGG 

2 5 CCCCTTCCCCCCrCTCTGGCTGTTAATCTGA 

ATTCAGGGGACGCXKSGGTGGGCAGAGTGAGATGCTGAAGGA^ 

AGGG ACTGGATCG CAGAG ATTTCCCT AGCAT AGCTCTGCATGGGGTGTGTGTGCACArc TGG 

TGTGTTGTGAGTCTACACCCCAGGAGCAAGAGGCCTGAGGGGGTCACTTTGCTAGGAG 

TTATCTTTGTGATCCCAGATCTCCAAGGGCCTGGCXTA^ 

3 0 AAGCCTGTCCCTTTCTAATATGTGAGTCTTATGAACAGAGGGGCTCTCTTTTGGCCCCAGGCCCTG 

CTGAAGGACACTTTGTGGATGAG CCACTAAG CCAGAGGGAGGAAAGGGCAGGTGTAAGTGGCTGGGAGTAGAGGAAAGAGGGCTT A 
GCACAGTTTATGCCTCTGTTTTGTTTCAG CCTTAGGACAGC^ CAGTCAGTCCTGGCAG AGG 

TCTGATAGCTCCATTTACTGACAAAAAAAACCCAGAGAGGTTAAGATACTITTAATAGG 



3 5 AGACTTTCCTCCCATTGGCCCATCTCTGTCTGTCTCCTT'lLr rTCATGCCCCTCTCGCAGGCCCAGCCTGAGACTTTCCCACTGAG 

TCACX3ATCTGGGTCCCTGACTTTGTGGTCCCTGTCCCTCTCTGGGTCTAAATCTTTCTTTCAGA 

TTGCTCTCTCTTTCTTTTCACGCCTTCTCCCCAACTCG CTGCCC CCTCTCTTTTACTATCTACCCTCTCACAGCCTCTAAGGCTCT 
GCCCAGCAGGGTGGAGGGTGGAAATCCTGGGTCCTGGTGCCATGCCATC 

TGATGCCTGTAATCCCAACACTTTGGGAGGCTGAGGCGGGCAGATCACGAGGTCAGGAGATCGAGA 

4 0 AAACCCCATCTCTACTAAAAATACAAAAAATTAGCCGGTCGTGGTGGCATGTGCCTCT^ 

AGGAGAATTGCTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCTGAGATCGCACCACTGC^CT 
ACTCTGTCTCT^AAAAACAAAACAAAACAAAACAAAACAAAAA 

ACCCATCCAAGAGCCTTCCAGGTGCCTAGGGAGCTCAGCACTCCATACGTTTTCAGGAGAGGACCCAGGTGGCTGGTA 
GGTGAGGAATCATCTGGTCCTAGAGCATCAATAAGGAAATCTCAATGCCTGCAGCCACCCT 

4 5 GCCCTCAGTGTGCATACCCAGCAAGCTCAAGGCACACTGACTCGCAGGAT 

CTGAGAGTGGGGTGCATGGACTCCCTGAAGTCCAACGCATCACCTGGAGTCCTTAAACATACAGATTCT 

CTGGGCCT CAGAATCT CT AGAAG CAAGTC CCAG AAAT C CACATCGTTGTGG CTT C CTGG CAT AGTCTG CAGC C CACAGGTGAAG CA 
AGTGCTGTGTGACCT7CGGGG AG CCTCTCTCATTTTG AGTCTTC^ CAGAGG 
ACAACAGCGTGGAGAGAACCAGTGTCTAGGCTGTGGGATTAGGCATGATTTTCTTTTCTGTTTT 

5 0 CTTTTTGTTTCTTTGAGACCGAGTCTCCCTCTGTCACTCAGGCTGGAGTG 

CCTGGGGTCAAGTGATTCTCTTGCCTCAGCCTCCTGAGTAGCTGGG ATTACAGGGGCCAG CC ACCACACCTG CTAATTTTTATTTT 

TGTTTTTGTTTTTGTTTTGAGATGGAGTCTTGCTCTGTC^ 

GCCTCCTGGGTTCAAGTGATTCTCCn3CCTCAGCCTCCTGAGTAGCTGGGATT 

TTTTT AGTAG AG ATGGG^TTTTGTCATGTTGG CC AG GGT AGT CT AGAACTT CTGGCCT CAAGTGATTT G CTGG CCTTGGCATC CCA 

5 5 AAG TG TTGGG ATT ACAGG CATGAG C CAC CACG CCCGG CCTGTTTTCT AAAC CTTTTCCAGTGTTAGT ATG ATT CTTCCTC ATTCTT 

GTGGGAAGAG CACTGGCCTGGGAGTCAAGAGACCTGGGTTCTAG CTTCAGCTCTGCCACAAATCCACTGTGTG AGTTTGGTGTCTT 
ATTCCATTTTGCATAGTTACCATCCACCGTGTCTCATAAAATGTGATGCTGGTTGGAGGCCATGTCTCTCT 
ATCTTGTGTTTCirCACTCCGGTCTCACCAGGACCAGCCTCATAATGCTCCCTGTATTGA 
GCCATGGGGAAGT<3AGGGGATCCCAGGGGGCCTCATCTC 
60 GTGAGGTCGCAGCTGGAGGAGAAGGTGTCACTGCCAGGGCCCTTATTCTC^ 

ACACCAGGGCCACTCAGCCCCCGCriTTCACAAG CAC T TT C T T T T T TGGGTGGAAG AAATTGG AGAGAGGGGGTG AGAAATCGGGG 
ATTGAGGAACCAGGCTGTAAGACCCTATAACAGCCTGTCAGGTTCT^ 
GGCCCCAGGGCCACCTGATTCrGAGATTCATAATTCCACCTGCCCCAGAAAGCTG 
AAGGGGATGAATCACTTGACCTGAGGATTAATGTGGAAAAGAGGAGGGGGAGAGAGAGGGAGGAGCAAG 

6 5 ACAGCCAAGCACTGAGCAAAAATGTCCCCCAAAGGGGTCTCAGCAT^ 



TGAACATCAGGCAGAAACTTCCCTGrrCCTTAAGGTACGGAGAAATG 



GGAGTCCCCAGCTTTGCCTTTGCCTGCCAACTTGCCTCTTGGTTC^ 
7 0 GTGAAGCCCTGTCAGGCTGGGACAGAAATGTAGAGCTGGGGCCAGTTCCC^^ 
AGAGGAGAAGTTAAACTTTAACATGACAAAAATATT^ 
GATATGCAAAATATTCCATGACACCTTGTGGAGTGAAAAA 
ATATTTGTGC^TGTCTCCAAAAGACAGAGACAGAGAGTATATAAGGA 
TTTGGCTTTAGGGGGTGGAATTTGGGGTGAvTTA^ 
7 5 TTTCATCrrTTATACACAGGAAAACTCCCAATGATGCT^ 
CTAATACATTAGTTAACATAAAAGGAAAATGGAGAAAGAAAAGCCTCCATCGTCCTATCT^

TGGGTTTCCTTCCTGTCAGCCGGCTCTGAATCTGTTCAAAGCAGCTTTTCCATCCAGGTATAATA^ 

ACGTACACCCTCTCATGTAAGGCTTGCAGCCCCCCTGAAAAGT^ 

TTGACTTTC^U3GCAAGGAAAATGACTTG CCTGGGTCACAT AT AC CTGG CAGGCAG AG CCAGACCTGGGACT CTCAG CACTGG CAG C 
5 CCAATGCTACTTCCAGTGCCCATGACTGCCATGCAGGGTAGGGGAGCTTTTGTGG 
ACCCCTGGCTtSCTGCTGATGCAGTGCGCTTTAAGGA^ 

ACAAGATGCAGCCAACTCAGCACATGTGGACCTCTGGTGGGGAGAAAGAGGGCAACCCGA 

ACACAGTAGGTATCAATAAAGATCGATTGAATGTTCATGGTCAAAGTTGCT^ 

GGAGAATGCCCAGCGAGTCCCACTTGWCCATCTCGGAAGGCTT 
1 0 TGAATTTCCCCGAGGGTCCAGAAGAGAGCCACGGGCTCGTGTGTCAGGCAGCGGAGCT 

CTACAGGGTGCGCGCAGGCTCTCGCTTCTCTCCACCATGGGGGGCCCTGCAGTACT 

GCCACGAGAGGGCGGGGTGGGGCGAGGCGGAGCAGGGCCGAGGTGGCGGAGTGGGGGGGAGCCGG^ 

AAAGCGCTGCGACTCTAGTGACAGCGGCCCGCTGGA 

AGGCGTCAGAGCCCG<X3CrCCGGTGGGGTCCCCCACCCGGCCC^ 
1 5 GACCCTCTCGCGCGCGGAGGGGCGGGTCCTCGACGGCTACGGGAAGGTCCCAGCCCGCCCCGGATGGGCATCG 

CGG AG ACATGCTG ACGGGCAC CGAGCCGATGC CGGGG AG CG ACGAGGG C CGGGCG C CTGGCG C CG ACCCGCAGCACCG CT ACTT CT 

ACCCGGAGCCGGGCGCGCAGGACGCGGACGAGCGTCGCGGGGGCGGCAGCCTGGGGTCTCCCTACCCGGG 

GCCGCCCGCGGACGCCGAGGGCn , ACCAGCOK3GCGAGGGCTACGCCGCCCCGGACCCGCGCGCCGGGCT 
2 0 AGGACTACGCGCTACCCGCGGGACTGGAGGTGTCGGGGAAACTGAGGGTCGCGCT 
CAGCACCAGAC^GAGATGATCATCACCAAGCAGGGACGGTGAGTGCGGCG 
AAGAACGTCTCGTCTGTTTI^CTGGCTCGACAATGCTTCTGACTCCGTGTCC 
GAATGGGGTTGTTAGGAGGACAGGGAAAGCTCCGGAGGGGCGTCTGTGCCCAGGCTGTTGCACCA 
GAGACGGGTGAGTGCGGGAO^GAGAAGTCACCTTGATTTAGGGGAAGGGTGACTG 

2 5 TGCCCCACTCTCTACTGTAGAGGAGTOSCAGCGGGCAGTGAA^ 

GAAACTGGAGGGGAAGGGAGCCCCTTATCTCCGGGCCCCCTGCGCCCACCTCCCCCGGCTCCTTTGCTGCTGGTGTG 
GCTTTAGTGGTGGTAGTGGTGGTGGTAGCGGTGGTGGTGGTGTGTGTGTACGG 
GTCGCCAAGTTTCCTTTCCGGTCTTACTITGAGATCATATGTCTGGTGTGTGTG 
TGTGTGTGTATGTGTGTGTGGGTGTGTGTGTATGGGGGCTAGTGCAGT^ 
30 TGCCCA03GGCTGGGCAGATGAAGCTCACCCAGGGGTCCAGCCTGGTAGCCAGCCCCACACT 
AAAACCTTCCCCACACTCCTGTCCAGAAATTCAC 

GGTTCCTGAGCCCCGTGCGTGATGGGGAGAGTTTGGGGCTGAGGGTGCTGCTTCCGGATAGAGCCT 
GAAACCTGTGACTTGTGTGGTATTTGTTTAGTAAGCAACCCCCGGAGTGGACTGTGTCTGAT^ 

CCAGTGCGG CCCATGTG AGCAGGGGACCAGCG AGGACCAGTGTGGAAGGGCTGTTGT C ATTGGTGG ACCTGGGATGCTGGGTCCCA 

3 5 GGTCCGAGAGGTGTGGATACCAAACGTGGGGCTTGGGGTGGAGGGG^ 

GATCTGGGCATTGCTGGAACATTCTTCCCTCCAGATGATTTTTGTGGGGTAGCCT 
GGCTTCAAGTTACTGAGCTGTTCCACTTTCCCTGGGATGAACCCA 

TAAGCCTTCTGATAATTTCTCTCTGCTCCCCCCAACCCTAGTCACCCTTTGGGTCrrACCCAGA 
ATGTGTCTGAATTGGAAGGGACCCAGAAGATCCTGTCCCTCTCACTGCACAGACAGGGAAAC^ 

4 0 TTCCAAGGTCACACAACACACTCATTGCAGGCAGAGCTGGCACTACAGCCCAGATAGTAT^ 

CTACGGCCACCAGTAAAAACAGCCACCCACGCTCTCT^ 

CGTTAGGTAGGCCAAGCAGTCTCCCCATTTTACAGTTGAGGTCACTGAGTTGCAG 

GCAGACCAGAAGTGGGGCCTCTTAACCTTCCACTCTGTTGTGTGGTCAGGAGGACTT 

CACAGCCTCTGCCAGGCCTCTGCCTCCTCCTGCCAGTCTTGAGGGATGGGGTGGATGGAGC^ 

4 5 CCAGTGCTAGGGGGCTGCCTCTCrcCCTGTAGAGCCAGCTTCAGGGAGGCT 

TCTCGAAGTGTGTATCACCATCTCTACTCCCATGGCCTCCTTTCCTTGGTGTTCCT 
TGTCCCCCTTTTAGATACATACACriTACATTTTTTTATTATGAAAAATTC 
CTTCCATCCCCAGCTTCCACAGTTGTCAATATGGGTCCAATCTTGTT^ 
TAGTAATATAGTAATTACTAAOTACATTCTAGATATTAT 

5 0 CATTAGCAAACCTGAAAAGAATTTCCTAAATATCACCAAGCTTCTCTGATTGCT 

CCTCACACTTTTTGCTCCIATTCrrGAGGGGCTCCATACGCGGGGTCCCT 
GCTTTTTCTTGGATCAGGCTTCATAGACCCCIACGCT 
CAGGTTACCTCCATACCTTCTCAGCTTGGGCTCCTGAGATCTGGGAAAT 
ATAGCACAGCTTCCTTTCOSTCTTCCTGCTTTTGTGGGCTGG 

5 5 CACCCATGACCTTCTTCCTTTTTGAAAGGTTCTCCCACCTCAGGCCCACCTACAG 

GACTCACGCCGTCCCGGATTCTTCCTCTCCCACCCACCCTGGATCTATCCCTCGGTATTGACCAGCG 

AACGGGCAGAAATGGGGCAGGGCACAGGGGGTGATTCCTGGGTCTGGAGATT 

AACTGAGAACAAAACGGTGGATTATCCCAGTAGGAGACTTTGGTGTCTACGTTTGTG 

CCCACACAGGCTTGTGCGAGGGTTTCTGTGTGGCAGTGTGTGTGTGTTGCGAATGTGTTGGCGAGTATATAA 

6 0 CATTCTCAGTAGAGGAAGAGAGAGGCCTGTGTCTCTGTGGGTGTGTGGGTGTGTATTAATTGCTGGC 

TGATTGGTTGATTATTATAATTAAGATAACTTTGGGTAGCTGTGTTGTGTTAAGCT 
ACAGAGGTTGGTAAAACAACAATTTCAGTCCCACACTATCCTTGCTATCCTCACAGTC^ 
CCAACTCCAAGAACTTTAATTTTTTTATTTGTAATATTTTTCATG 
GTCGATAACTTTCTACAT C ' r TTTTTTT T TTT l 'GAGATGGAGTCTCACrCTGTC 

6 5 TCACTGCAACCTGCACCTCCCAGGTTC^GTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACA 

CCCAGCTAATTTATGTATTTTTAGTAGAGACGGGGTTrCGCCATG 

CCGCCTCGGCCTCTCAAAGTGCTGGGATTACAGGTGTGAGCCACCGCACCTG^ 

ATTTATTTTTTATGGAGACAGAATATTACTGTGTTGCCCAGGCGGTCTCAA^ 

CCAAAG TG CTG GGATT ACAG G CATGAG CCATTGTGC CCAG CT AAAAACATTTTTTTT AGG CTT AAG CATG CTT AACTGGTTTCATT 

7 0 TGTCTTGGGGCAAGTGAAAACAACTTTAATTAAAAA 

GTGGTGCGATTATGGCTCACTGCAGCCTCAACCTGCTGGGCT 
GGCGCACATC^CCATGTCCGGCTAATTGTTTCTATT^ 
TTGTCTGGCTGTTTGCCAGTGGCTGTCTGAGTC^^ 
CATTTTGTACTTTGGTAGGTACX3ACACATTGCCAGAATACCCTCCCAA 
7 5 ;vGAGGACCCATTTTCTCACAGCCTCATCAGCAGTAGATATTATC^ 
AGGCTGAAGTGCAGTC^CATGATCTCTGCTCACCGCAACCTCCGCCT
AGCTGGGATTACCGGCATGCACCACCACGCCTGGCTAATTTTTGTArrrr 
GTCTCAAACTCCTGACCTCAGGTGATCCACCTGCCTCGGCCTCCCAAAGTGCTG 
ATATTATCAATTTTTAAACATTGACCAATATGACTGAGAAGAACTATCTA 
5 AAGCAGAGCCTGTGCCAGGAGCTCCGAATCTC^XJAGAGGTATGACGA 
AGCAGAGGAATTTGGC^TCTTGGACCCTGTCTTATGAGGGTG 
CATCACCACTCTCGGTTCCTTCCCAGCAAAGTCT 

TTAGGGAGAAACCGAGATCTCCTTTCTCTTCTCCCTTTTCCCCAGCCTCTTCCT 

<IAGGATGAGCTGGGCATGGTC*GTGAGCACCTATAACCC 
1 0 GCATTTGAGCAGAGCCTGTGCAGCATAGCAGACrCCATCTCTTAAAAAAAAATC^ 

AATCC CAG CACTTTGXjGAGGTCGAGCCCAGTGGATCACCTGAGGTCAGGAGTTCGAG ACCAG CCG 

TCTCTACTAAAAGTACAAAAATTAGCCAGGTGTGCTGGCGGGCX3CCTGTAG 

TGGCGTGAACCTTGGAGGCGGAGCTITGCAGTGAGCCAAGATCACACCACT 

CAAAAAAAAAAAAAAAATTAGCCAGGOSCAGTGTTGGGTGCCTGTACTC^ 
1 5 GTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGGGACAGAGCGAGACTCT^ 

AAG AAATGGGGAC CACAGAGCACAGCATGCACTCTGCAGT CAG AAG ACCT CACCT CTG CT CTCCCACCGAGG ACCGT CACCAAAC C 

TCTCGGAGCGTCAGTTTCTTCAAATATAAAATTGGAAATAACAX i i Lk'i i ITTl Ai i i i i r^iriGAGATGGAGTCTCACACrGTT 

GCCCAGGCTGGAGTGCAGTGGCGCCATCTCGGGTCACCGCA^ 

AAAGTAGCTGGGACTACAGGTGCATGCCACCACACCAGGCTAAT^ 
2 0 AGGCTGGTCTCGAACTCCTGAGCTCAGGAAATCCACCCTCCTCGGCCTCZAC^ 

GGGCCATCTTTCGAACTTTTTAACAGCTTTCAGAGGAAAGATC 

TTGGGCTGGGTGCAGTGGCCCATGTCTATAATCCCAGCACTTTGGAAACCT 

AC CAG CTTGGCCAACGTGAAACTCCAT CTCT ACT AAAAAT ACAAAAATT AGC TGG CACATG C CTGT AG TC C CAG CT C 

TTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGAAGG^ 
2 5 ATGACAGAGCGAAACTCCATTTCAAAAAGAAACTGAGATTTC 

TTTCTCAACTCAGGAAAGTCAGGGGAAGACAGGCGTGTGACAGCACCCr^ 

CTGGGGACCTCTGGGGATTAGAAGTCCCTCAAAAACAGTTCCTAGGCCCCT 

AGGTCCCTTTTTCCTCATCCTTCAGGGCTTCTCATGAGAGGAC^G^ 

CTTCT<3CCATTCrrC^U^CTCTt3CTCT^ 
30 GGTC<3AGGTGGTC^TTGTAGTGACGGTACTACTGGTC^TGGTTGTGACGGTGAAGGTGGTG<;CT 



TGCTA(nX3GTGGTGATGGTGCTGGTTGTAGTtX5TGGTGGTGACGGG^ 
GAGTGCTTCTTTTCTGAATCTGTTACTCTGGCACAACA 

AAACAAC^ACTTTGAACATGAGCCCCGTGTTCAATAGAGCCTTCTTTCAGGT^ 

3 5 CAACTTCT CTAAAGGCTGAGGGCAGACTATACCGCAGGTGGCAGACCCAGGAGCATGGGAG^ CCTGTG AGTGGGTGTCA 

TGGGTCTGGGAGTGTGTTAGCACrGTGAGAGCCACGGCJlGA 

CTGTGGAGAGCTGGTGGTGCCCCCTCCTAGGAAGCAGCTAAGGTTAGAGACAGGGGTGTGCAG^ 

CTGAAAGAACATTTTAAAAAACTCTGATTTGTAGTGTTTGCTC 

ACTCATGTGAGTCATTGAACACAGAGCCGGGAAGAGACTCACAGTGCACrGTGATT^ 

4 0 GGAACTAACCTCGACAGTGCAGACAATAGTAAAACGTAAAATAACTGGGC^ 

TTTTGAGATCGAGACTTGCTCri^TCATCCAGGCTCGAG 
AAGCCATTCTCCTGCCTCAGCCTCCTGAGTAGCTAC^ATTACAC^CGCCCACG 
ACTTTTTAGTAGAGACGGGGTTTCGCCATGTTGGCCAGGCTGGTCrCGAACTCCre 
CAAAGTGCTX5GGATTACAGGCATGAGCCACTGTGCCGGGCCTACTTTGACT 

4 5 AATTTTTAATTATGGCTGTGTTTAACCACTC^CTGGCA^ 

GCT ACAGCACAC CACTG ATGC CTGGGCACTG TTG CAGGGGGG ACTGG CTGTC AAG CTC^lAG CTG ATGC<nX3CTGGG^K3 CTTT CT CT 
CITCTCTCCAGGCGGATGTTCCCATTCCTGTC^TTTACTGTGGCCG<3GCTGG 
GGTCITGGTGGACCAGCACCACTGGCGGTACCAGAGCGGCAAGTGG^ 
GCGCCCCTGGGAGCXSGTGGGCTCTGTTTCGCTGGGACTGG^ 

5 0 CTTTAACCCCCTCCCACTCCATCCCACGCCATTGCATCCCTCCTG 

CTGTCCCGGC^GCTGCATGTCAAAGAGGTGAACTGTCCACAGGA 

CACTGGATGOGC CAGGAAGTTTCATTTGGGAAACTAAAG CTCACTUVACAACAAGGGGGCGTCCAACAATGTGACCCAGGTAGGACC 
TGCTCTTCAAAAGGTAGCCTCGCCCTGCTCCC^CCCTGGGTCTGAGACCT 

5 5 TCATTTTTTCTTCCTTCCTACAGGAAGGAAGGTTCGT T TTTCT ^ 

GGCTGTTT CTCACCCACCTTGGGAGGAAGATG AGATGGGATG AAG CTGAATCTTGGACGGGGGTCAT ATTCAGG CCACAC AGG CCA 
ACCCAG CCTCAGGGTITCTTCACOUATCATGGGTTG CCATTG CTCCGGCACCATGAAAGG CACAG AGTTGG CCCTGTGGTCCCAG 
GGAATAG ATCCCG AGTTTCTCT AGGTTG AACAATCCTGTTAAAAGTG CCCTTGCT CTCACACCCAT AATCC 

C!AGCACTTTGGGAGGCCAAGGCGGGTGGATCACAAGGTCAGGAGT^ 

6 0 T AAAAAT ACAAAAATT AG C CAAG CATGGT GGCGGG TG C CTGT AAT CC CAG CT ACTCAAG AGGCGG AGG CAGG AGAAT CG CTTG AAC 

CCGGGAGGCAGAGGTTACAGTGAGCTGAGATCATGCCATTGCACTCCAGCCTC 
AAAAAAAGAAAAAAGAAAAAAGAAAAAAAAGTGCCCTTGCCCTAAAGCCCTGTTTGTGC 
GAGGTAGAAGAGGGGGCAAGAGGCTAGAATTGGTGCCCCAAAGAGTCCGGACACT^ 
TGGCAGA^CAGGGGAATGGACAGGATGACCTCAAGGTGCTTGCTAGCCTCACGT 
65 TCGAAATCCTTCCTCCCCACCCCTACAACCGTCTTGCTCTGTCTACAGATGATTGTGCTCCAGTCCCTCCATAAGTACC^ 
GCTGCATATCGTTGAGGTGAACGrACC^AGAGCCAGAC^CAGCCT 

AGTTCATTGCCGTCACTGCCTACCAGAATGCCGAGGTGAGGGCTGCCTGAGCCCCGGTGGGGAGG 

TCTTCCTTGGGAGGGATTrGGAAAGTTCCCGAGCCCCAGACrCAGGACT 

CTGAAAATTC^TAATAACCCCTTTGCCAAAGGATTCCGGGAGAACTTTGAG 

7 0 CCTCCTGAGACACATCCTCTCCCTGCCCCTGAAAACAGG 

CGGAGAAC>GAATGTGTtlAAACAGGTAGGCTCACAGGTGACTGGTT CTGCTTGTGACC CGTTTl' CTTGCCTTCTATTTriTTCTAGC 
ATGTACACATCTGTTGACACCAGCATCCCCTCCCCGCCTGGACCCAACTGTCAAT^ 
ACCCAACCAGTATCCTGTTCCCAGCCGCTTCTACCCCGACCTTCCTC^CCAC<5CGAAGGATGTGGTTC 
GGGCCCCCCGGGACCACAGCTATGAGGCTGAGTTTCGAGCAGTC^ 
75 ATGTCCTACTACCGAGGCCAGGAGGTCCTCGCACCTGGAGCTG^ CCTGTGG CACCCCAGTACCCTCCCAAG ATGGGCCCGGC 
CAGCTGGTTCCGCCCTATG CGG ACTCTGCCCATGG AACCCGG CCCTGGAGGCTCAGAGGGACGGGG ACCAGAGGACCAGGGTCCCC
CCTTGGTGTGGACTGAGATTGCCCCC^TCCGGCCGGAATCCAGTGATTCAGGACTGG 
TCCCCCTATCCTTCCAGTGGTGACAGCTCCTCCCCTGCTGGGGCCCCTTCTCCT 
CTATTTTCCCAACTGAGCAGATGACATGATGAAAGGAACAGAAACAGTGl^ 
5 GATG AAGGACTGAGAAGGCCCCCGCTCCCTCTGGCCCTT CTCTGTTT AGTAGTTGG TTGGGGAAGTGGGG CTCAAGAAGG ATTTTG 

GGGTTCACCAGATGCTTCCTGGCCCACGATGAAACCTGAGAGGGGTGTCCCCTTGCCCCATCCTCTGCCCTA^ 
CCTGGTGCTGCGTCTTGCTTTTGGTTTCCAGCTGGAGAAAAGAAGACAA 
GGGTGGGAGGGGTCAGGTGTGGGACATGGGAGCAGGAGACrCCACTTTCTTCCTTTGTA 
TGTGTCTTAATCCCTGATCCAAAAAGAACAAATACACGTATGTTATAACCAT 
1 0 CTTTGGACAGCTGGCCTGGGCTCCCCCTGCTCAAACACAGTGGGGATCAGA 
TCAAGAAGCAAGATATTGTTTGTGGTXSGTTGTGT^ 

GGCTATTTATTGTACTGAGAGTGGTGTCTGGAT ATATTC CTTTTGTCTT CATCACTTTCTG AAAATAAACATAAAACTGTTGAATG 
TGCCTGCCTCAGTGCCAGCATGGGGGGACATGGATGGGGACTCAGT^ 

AGCTTTCATTTATGGGGAACCTGCTAAGACCCTGAAATGACTCCAGCTGAGTTACAGCAAGGC 
1 5 AGGGGGTTCMTTGATGCTACCTCTTAAGGCATCTC 

GAGGGGTGTGAGGGCAAGGCCAAGGGCTGGACTGGGAG CGGGGG7GCAGGG CGCTGTGAGGCGGTGGCACTTtlATTTTTCTTTGCA 
TTTCTAGCAGCCCTCCACCTTTATCTCTAGGGTTATTCAAGGATTAAA 

TTGT AGGTCGAAAAG CCAAAT ATT AGCAATTTTATGTG ACTCAAG CT AAT ACATGT AAAGGGT^ CCTG ACAAACA 

GT AAGCACT CACTGTGT AAG CT ACTGTT AC CAACAGTTTCT AG CTGTTT CTGTCTG TCTTTTT AT ACACACTGAATTGTGTTTGT A 
2 0 AAAT AAT ACAT ACTTTTTTTTTTTTTTTTGAGAC^ TTG C C CAGG CTGG AATG CAATGG TGCGAT CTCAGCTC 

ACTGAAACCTTCGCCTCCCAGGTTCAAGTC^TTCTCCTGCCTCACCCTCCTGAGTAGCT 
TGGCTAATTTTTGTATTTTTAGTAGAGACGGTGTTTCGCCATATTGGCCAGG 
CACCTTGGCTTCCCAAAGTGCTGGGATTACAGGCGTGCGTCACCAC7VCT 
GAAGAAAAGTATAAAGACAACAATAAATTATCTCAAATCTTACCCATCAAGAATTA 

2 5 AGTTTTATAAATTGGAACCATACTGAAAAGGCTTTTTCTTAATGAAAAT^ 

AGACACACACACACACACACACATCAAGACAAGCAAAACCTCT 
AATTTTTCTTCACTACAAGAAATATTTTCTTGGTTTTTT^ 

GCGTGTGCCACCACGCCCGGCTTTTTTTGTGTTTTTGGTAGAGACGGGGTTTTCACCA^ 

3 0 ACCrrcTGATCTGTCTGCCTCGGCCTCCCAAW 

AGAACACTCTCAGGCTAAGACAGGTGCTTATAGGAAACATTAAGAAGTTAGAGTTACT 

CCCCATTCTAATGGGGAAACATCATCATGGGCTTTGGGTAGTTCTCCCA 

TTTTTTTTTTTGGTCACAGTTCACATCCCATCCTTT^ 

G CAT ACTTGGCTCC CTTT CCTTTG AATTTGGAAT CT CAT AATTTTGT AAATGAC AGG AG AT CTG C CACATTGT GCT CTG TG AGTTC 

3 5 AGATTATTTGTCCCCCTACTCCTACCCCGTGGCTGTCACTTT 

TGGTTGATAATTATAATACTCTTTTCGCTCACCATTTAAAAATGATTTTTGCTCCCT 
ATCAAGCCATACTTTACCATCTTTTCTAGAAGTCTGACTGCATGTTTTGA 

TATTTCTTCACTATTAGATGTACCTTTTACTGTATATACTGTGTATATATACACATACCACATA^ 
AATACCACACAACTCAAGATATTTCTTCAAGATTTAAACACC^ 

4 0 CC CT C AT AG AT AAC CAGT ATCTTT AATTTTCTGTTTTG CTTTTCTTGC CT TTT ATTTGTT AT ATTTTTT ATTTCT 

GATAGGGTCTTGCTATGTTGCCCAGGCTGGTCTTGAACTCT 

TTGACTCAACACTGTTTTTAAGATTCATCCATTTGTGGCAGGTAGCT 

4 5 CTATGTGTATGTTTGCTATAATTTACTTATCTATTTTCCTGTTGACAATTTACACATTT^ 

TTTTTAATGTGTCTTCAGGCACACAAGTGCTAAAATTTCTCAAGTCAAGTCTGTATCTAGAAA 
ATTTAACAAGAGAATGCCAAATTACTTTACCAATTTACACTCT 
TTCAGCAGTAGTAAGGTTGAGC CTTT CTTCATATGTTTTCTAAT CAT AC^TTTGT 
TGCTATTAGAGTATTTGTCTTTTTCrrCATTGGTTCTC<3TT 

5 0 ATTGCAATTATCTTCCTCTAGATTATTGTTCCrrCTTTTCATTTTC" 

GTCATTATAGCTCACTGCAGCTGCAAACTCCTGGCCTCAAGTGATCCTTCCACT 
TGCCACCATGCATGGCTAATTTAAATTTTTTCTGTA^^ 

AAGTGACCCTCTCG CCTTGACCTCCCAAAGTGTTGGGGTGGCATG AG CCAC CIACACCTGGCCAACATTTTAT AGTTAGTACTTTTC 

5 5 CTTGTCTCATTTAAGAAAACTTTTTCTGCTTGAGGGTTAGAAAGATATTCATCT 

CATTTGACATTTAATTCCTrTTAGTCTTTGGTGTGAAGGATTCATTTTTGT 

AATAGTCTTTCTTTTG C C C ATGGT AGT AGTCT CACAT ACT ACTT CTGGT AG AG C AACGT CCT CTTTTT AAAG TTT CTT CTT CT G AA 
GTG CTTTGACT ATG CCT AG CT CTTTGC CCTT CT AT ATG AT AT AT ATAAT AACTTC^ AAATTTTCAT AACAAC CAG ACTGAG ATTTT 
TATTG AGG TTGG ACT AAAT CT ATTGTT CGATTTGAGAG AAGAGAT AACTTTG TG AT ATTGT CTTTCT CATG AAC ATGGG AT AT 

6 0 TTCTTCATTTATTTAGGTCATCTTTAAC^TTCTTTAGCAAAGTTTTCCAATTTGTAGGTGTAGGACT 

TTATTCTTC^GTATTTCTTTTTTTCTCTGTAAAAGGC 
TACAATTGATTTTTGTATATTGATCATGAGATCCAGCCATGTTC 
TTAGAGCTATCTCTGTGGCCAATCATATTATTTGAAAAAATGAGAGTTTT^ 
ACTCAGGAGGCTGAGGCAGGAGAATGGCATGAACCCGGGAGGCGGAGCTTGCAGTC 

TGCTTCTAATTTATTTTCCTTATCTTATTCGACTGGTCATAACCT 

CT ATTTTTTG ACCC CACCT CTG ATGGGGTCTCACTCCGTCACT CAGG TTGG AGT ACAATGGTGTG ATCAT AG CT CACTGCAG CATT 
GAACTTCTGGGTTCAAGTGATGCTTCTGCCTCAGCCTCCCAAAT^ 

AAAA TTTI TTTTTTTTTl T 1 IT rTTTT GTAGAAACAGGGCCTAGTTATATTGTCCAGGCTGCT 

7 0 TCCTCCTGCATTGGCCTCCCAAAGCACTGGGATTAGAAGAGTGAGCTGCCAT^ 

riTTTCCTTTrACTTGTTTGCATTTCTATAATTTATTTATTTTA 
TGATGTTTTATCCTCCCTTCTCGTCAGTAAGATAATTTTAGATCACTTTAATTCT 

TT CTGACTTACT AAT AAAAACTT ACHT CCTT CTTCT T AG CCATTCTAAT AACC 
TTTTTTCATTCATGTTTTTCrrC C"lCTCAACTT ACT GkTTTCAGTT CTTTCTTTTTTT AAC C C CT A AAAATT AG ACTC CTTTATT AT 
7 5 TTT ATTG AG ACT ATG CTTTGT CTGGACTT AC CTTT ATGTTACC ATT CTTTTTG TT CATTTT C C CTTTTTTGCATTTCTCTTTTTTG 
GGGGATCATTTACCTATATCTTTAAATCCTTTATAAGTTCTT^

TAAATGTTATCATTTTCCCrATTGGTATTGAGTGA 

TTTTTTTTTTTTTGAGGTGAAGTCGTACrCATTGTC 

CCTCCCAGGTTCAAGTGATTCTCCTGACTCAGCCTCCTGAGTAGCTGGGATTACA 
5 TTATTTTTTTTTTTATTTTTACT 

CGCCTTGGCCTCCTGAAGTGCTCGGATTACAGGCATGAGCCACTG^ 
GTCTTCTGGCTTCATTGTTGTGAGATCTGCTGCTATTCTAATTATCATT 

G ATCTTCT ATTT AT ATT CATGCCCTAAAGATTCACT ACGATGTGAGAAGG CATC ATTTTATTTTT CCTTATC CTATTTGGG AT ACA 
TTGTGTTTCATGGATCTTTGCATTCATCATTTTC^ 
1 0 TTCTCTCCAATTTCATCCTTTAGGACTCCAATCAGATGGGTGTCTT TTTTTUTl "^ 
GGCTGGAGTGCGGTGGCACTT^CTTGGCTCACT 

GCTGGGACTACAGGCATGCACGACCATACCTAGCTAATTAAAAGAAATTTT^ 

CCTAGGTTGGTCTTAAACrCTGGGCTCAAGCGATCTTCCCACTT^ 

ATCCTTATAATTGCATTATTTGTTTTTOTAAACATTCTATACCT 
1 5 GATGATAGGGTTGTTGTCTAAATCTGTTGTTTCTTCTGATTCTCAATGA 

TT uT'riGTTT ATT AAAT i v i i G CTAAAG AAAAT CTT CAAGGAGT CTTT AGG CAAT AACACTTCTGAGT ATTT AACATGTCTTT ATTT 

TACTTACACATGTGAGTGATAGCTTGACCCAGTATAAAAGTCTAGGTTGAAATCACTTTCT 

AGTATGATAGTACTCTCAGATTTTTTTTTTTCCTGAGATGGAGTCTCAGTCGCTC^ 

CTCGGCTCACTGCAACC^CTGCCTCCTG^GTTCAAGTGATTCTCCT 
2 0 ACCACGCCrGGCTAATTTTTATATTTTTAGTAGAGATGCGGTTTCAC 

GACC<^CCTGCCTTGGCCTCCCAGAGTGCTGGGATTACAGGCA 

TTTTTCATTGT CACTT AGTTTTT AGT ATTG CT CTT AAG AAG TCTG ATG C CATTTTGATTCATG ATC CTTTTTGGT AGG TTTTT AGA 

TCCTCTCTTGCTTCCCAGTGTTTTAAACATTCTGTATTATATACCTTGGTCTGAAGCC 

AATGAGAACTATCAACCCAGAAACGCTGTTCTTCATTCTT^ 

2 5 CCAGCACTTTGGGAGGCTGAGGCGAGCAGATCACTTGAGGTCAGGA^ 

TACAAAAGTACAAAAATTAGCCGGGCGTGGTGATGCATGCCTGTAATCCCAGTCACTCAG 
ACCCAGGAGGCGGAAGTTCCAGTGAGCCAAGATTGCACCATTGCACTCCAGC 

AAGAAAGTTTCTCTTCCI^rTTTAAAAAACATTTTT TT IT 'l ' X " 1"1T'1 ' A GAGATGAAGTCTC 

ACTATGTTGGTCAGGTTGCTCTTGAACTCCTGGCCAAAAAAT^ 

3 0 CCACACCACTGGGAGCTGATTTATTAGGCrGGGGCCCCCACAATGTTAGTGTGTAGAGATATGAGTTCT 

AAATTT CTT ACCAGCTGACTGGAGAAT ATTGT AGG AG TG AAAT ATGAGAAG AGAAACCG AG GGGCTAATTTTTT'l'Cl"l"ri'G C AAG C 
TTTCACTTACATTTTCrrCTTCCAGCTTrcCCT 

CATTTTCTTTAG AATG ACACTCTATACTTCTAGGTGGT AGTGGT AGTGAAAT A CGGGAAGCAGTCAT 
GGGCATCTTGATACATCTGATTTCTUGACTTTTAGCC^ 

3 5 AGTTCTGAGCCTCTTAAGCATTCTTTCGGGCrAAATCAGTTCACTCATTG^ 

ATGTTGCTTAATCATGCACTACTCCTCCGTCTGCTTTCT 
GCTGTTCTCITTGTTCCTGTGGTTTGATACATTTGAAAAA 

GAGACAGAGTCTCGCTCTGTCGCCCAGACTAGAGTGCAGTGGTGCAATCZTCGGCTCA 
ATTCTCCTGCCTCAGCCTCCTCAGTAGTTGGGATTACAGGTGCGCACCACCATC 

4 0 GGGTTTCACCATGTTGGTCAGGCTGGTCrCGAACTCCTGACCTTGTCATCT 

GCATGAGCCACCGCGCCCGGCCCAAATATACAAATTTTAACCTAAACACTCTTCAT 
CAGCCTTCATGGATTGGCCTTCrrcrTTCTCTACCCCACACCCCa^AA 
TCCTATCCAGCTAGTATTTATTTITCAGAGACATTCATTACTCTTCGACT 
TTCTTTCITTCTTTTTTTTTTTTT^ 

4 5 CACTGCAAGCTCTGCCTCCAGGGTTCACACCATTCTCCrGCCT 

CACACCTGGCTAATTTTTTGTATTTTTAGTAGAGATGGGG 

CCACCTGCCTTGGCCTCCTAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCCGAC^ 

GGATCTCACTCCTGTTGCCCAGGCTGGGGTGC^GTGGCGCTAT 

CCCACCTCAGCCTCCTAAGTAGCTGGGATTACAGGCACATGCCACCATGCC^ 

5 0 TTCATCATTTTACCTAGGCTGGTCTCAAATTCCTGGGCTCAAGCAATCTGCCCGC 

GTGTGCCACTGCACCCGGCCTTTGTTTCTCTTCCTATTACT 

TCCAAAGTTATCTTAGTAGTGGCATATTAACTTGGGGATGGCTGAAGATTAATTTCT 

TCTGTACTTATGATTTACATGAATGCCTTTCTGAGTGTCTGTGGTTACAGCT 

ATGGCAAGAGTTATAAACAGCT 


HUMAN SEQUENCE - mRNA 

CGG CCCGCTGGAG AGGAAGCCCG AG AGCTGCCG CGCGCCTGCCGGACG AGGGCGT AGAAGC CAGGCGTCAGAGCCCGGGCTCCGGT 
GGGGTCCCCCACCa^CCCTOKXn'CCCCCGCCCCCrcCT 

GGTCCTCGACC^CTACC^SGAAGGTGCCAGCCCGCCCCGGATGGGCATCGTGGAGCCGGGTTGCGGAGACATGCT 
60 G CCGATGCCGGGG AGCG ACGAGGGCCGGGCGC CTGGCGCCG ACCrc C AGCACCGCTACTTCT 

CGGACGAGCGTCGCGGGGGCGGCAGCCTGGGGTCTCCCTACCCGGGGGGCGCCTTGGTGCCCGCCCCGCCGAG 
GCCTACGCCTACCCGCCGCGACCCC^GCGGCCGGCTTCCCCGGCGa^ 
CCAGCCGGGCGAGGGCTACGCCGCCCCGGACCCGCGCGCCGGGCTCTACC^ 
TGGAGGTGTCGGGGAAACTGAGGGTCGCGCTCAACAACCACCTGTTGTGGTCC^ 

6 5 ACCAAGCAGGGACGGCGGATGTTCCCAT^ 

CGTGGTCTTGGTGGACCAGCACCACTGGCGGTACO^AGCGGCAAGTGGGTGCAGTG 
ACCGCCTGTACGTCCACCCGGACTCCCCCAACACAGGAGCGC^ 
AACAACAAGGGGGCGTCCAACAATGTGACCCJW»ATGATTGTGCT 
GGTGAACGACGGAGAGCCAGAGGC^GCCTGCAACGCTTCCAACACGCATATCT^ 

7 0 CTGCCTACCAGAATGCCGAGATTACTCAGCTGAAAATTGATAATAACCCCT^ 

TACACATCTGTTGACACCAGC!ATCCCCTCCCCGCCTGGACCCAACT 
GAACCAGTATCCTGTTCCCAGCCTGCTTCTACCCCGACCTTCCT^ 
CCCCCCGGaACCACAGCTATGAfiGCTGAGTTTC 
TCCTACTACCGAGGCCAGGAGGTCCTGGCIACCTGGAG 
7 5 CTGGTTCCGCCCTATGCGGACTCTGCCCATGGAACCCGGCCCTGGAG^ 
TGGTGTGGACTGAGATTGCCCCCATCCGGCCGGAA^

CCCTATCCTTCCAGTGGTGACAGCTCCTCCCCTGCTGGGGCCCCTTCTCCTT^ 

TTTTCCCAACTGAGCAGATGACATGATGAAAGGAACAGAAACAGTGTTA 

GAAGGACTGAGAAGGCCCCCGCTCCCTCTGGCCCTTCTCTGTTTAGTAGTTGGTTGGGGAAGT 

GGTGCTGCGTCTTGCTTTTGGTTTCCAGCTGGAGAAAA^ 

TGGGAGGGGTCAGGTGTGGG AC^TGGGAGCAGGAGACTCCACTTT CTT C CTTTG T ACAGT AACTTTCAACCTTTT CGTTGGCATG T 
GTGTT AAT CCCTGATCCAAAAAG AACAAAT ACACGT ATGTT AT AAC CAT CAG CCCG CCAGGGTCAGGGAAAGG ACT CACCTGACTT 
TGG ACAG CTGGC CTGGGCTCC C C CTG CTCAAACACAGTGGGGAT CAGAGAAAAGGGG CTGG AAAGGGGGGAATGGC CCAC ATCT CA 
10 AGAAGCAAGATATTGTTTGTGGTGGTTGTGTGTGGGTGTGTGr T T 11 111 cri TCT' lT l'T Ari ' I ' l TTTT GAATGGGGGAGGC 

TATTTATTGTACTGAGAGTGGTGTCTGGATATATTCCTTTTGTCT^ 
AAAAAAAAA 

HUMAN SEQUENCE - CODING 
1 5 ATGGGCATCGTGGAGCCGGGTTGCGGAGACATGCTGA 

CGACCCGCAGCJVCCX3CTACITCTACCCGGAGCCGGGCX3CGCAGGA 
' ACCa5GGGGGCX3CCrTGGTGCCCGCCCCGCCX5AGCCGCTTCCTTGGAGCCTAC 

CCCGGCGCGGGCGAGTCCTTCCCGCCGCCCXKTGGACGCCGAGGGCTACCAGCCGGGCGAGGGCTAC^ 

CGGGCTCTACCCGGGGCCGCGTGAGGACTACGCXSCTACCTOaKKlACrrGG 
2 0 ACCTGTTGTGGTCOU^TTTAATCAGCACCAGACAG AG ATGAT CAT CACCAAG OVGGGACGGCGGATGTTCCCATTCCTGTCATTT 

ACTGTGGCCX3GGCTGGAGCCCACCAGCCACTACAGGATGTTTGTGGACGTGGT 

CGGCAAGTGGGTGCAGTGTGGAAAGGCCGAGGGCAGCATGCCAGGAAACCGCCTGTACGTCCACCC 

CGCACTGGATGCGCCAGGAAGTTTCATTTGGGAAACTAAAGCTC^ 

GTGCTCCAGTCCCTCCATAAGTACCAGCCCCGGCTGCATATCGTTGAGGTGAACGAC^ 
2 5 CAACACGCAT AT CTTTACTT T C CAAG AAAC CCAGTT CATTG CCG TGACTG CCTAC CAG AATG CCG AGATT ACTCAGCTG AAAATTG 

ATAATAACCCCTTTGCCAAAGGATTCCGGGAGAACTTTGAGTCCATGTAC^ 

CCCAA CTGTCAATTC CTTGGG GG AG ATCACT ACTCT CCT CTCCT AC CCAAC CAGT ATCCTGTTCCCAG CCG CTTCT ACCCCG ACCT 
TCCTGGCCAGGCGAAGGATGTGGTT CCCCAGGCTTACTGGCTGGGGGC CCCCCGGGACCACAGCT ATGAGGCTGAGT^ 
TCAGCATGAAGCCTGCATTCTTGCCCTCTGCCCCTGGGCCCACCATGTCCTACTACCGAG 
30 GGCTGGCCTGTC^CACCCCAGTACCCTCCCAAGATGGGCCCGGCCAGCTGGTTCCGCCCTATGCGGACT 
CCCTGGAGGCTCAGAGGGACGGGGACCAGAGGACCAGGGTCCCCCCTTGGTGTGGACTG^ 
GTGATTCZAGGACTGGGCGAAGGAGACTCTAAGAGGAGGCGCGTGTCCCC 
GCCCCTTCTCCTTTTG AT AAGGAAG CTG AAGGACAGTTTT ATAACT ATTTTCCCAACTG A 
Table 39

MOUSE NOMENCLATURE
ICSGNM Stat5b

Celera mCG20224

HUMAN NOMENCLATURE
HGNC STAT5B

Celera hCG16577

MOUSE SEQUENCE - GENOMIC

CAGGGGCGGGCACTGCTGCTTGTCTCGTCACGATTCCT 
GCCCGCCCTATGGGTGGGGTTGCCCAGCCCCTCCXyrTAGAGTGTTGTCC^ 

TTTTTGTTTTTGAGGTCCAGGGAACTGATAGCAGGGC CTTGTGCCT CTCCAACTC 
CTTCTCATTCTGTGACTAGTAAGAAGCXSTGCAAAGGCAAGAAC^ 
1 5 TTGATGCTCCTTGTCTGAGTCTACTGTGTGTGTGTATGTGTCT 

TCTCTGTGTGTGTCTCTGTGTGTGTCTGTGTGTGTGGGTATGTCTGTGTGTATCTGTGTGTGTGTCT 

TCTGTGTGTGTGTGTGTCTGTGTCTCTGTGTGTGTATCTCTTTGTGTGTCT 

TGTGTGTGTCTCCGTGTGTGTGTCTGTATCTCTGTGTATGTATCTCT 

TGTGTGTGTGTCTCTGTGTATCTCTGTGTGTGTCCAAGTGCACGCATGTATGTGCACAAGTCT 
CCGATATTTATATTACAACGCGTAACAGTATCAAAACTACAGTT^ 

CAACATGAGGAACCATATTAAGGGTCGCAGCGTTAGGGAAGTTGAGACTGGGTATCT CCCACTG AG CT 

GTGGGGGCCAGCT^GCTCCAAGGATCCTTCTGAATGCCTTCCCAGAACTGGGAGCA 

TGGGAATTGAACTCAGATCCTAAAGACJ^CACAGTGAGTGCTTTGCCAACCAGGAGCCG 

2 5 TGAGGGAGCACGTGTTTTAATATAAACATGGTTGTATTTAGGTGAGAAGTT^ 

TCTCCACGCAGCAAATAGCCACCCAACCCCCGCGCCCCCCTITTACCTTCTGTCCTGTCCTTCTGTT 
AAAGCGACACACGGTGGGTGAAGATTTATGACTGTCTTCCCAGAATTTTATTATCT 
GTTTCTTTGTGGGCCACTTTCACTCCTGTTCCCTTCTCATGG 
GGTTGCTTATAAGCTAGTGGTTAGGTTGTTTTGTTTTGTTTTGTTT 

3 0 GATGTGGTCATTTACCCTAACCTTGCTCGGTCAGGCCAGTATGAACACAGTCTTAATCTTCT 

CTCTCAAAGTCCTTGT ATTCGGT G ACCACTCAACACAGT AATG ATG TTG ACATTGACAGTG AG AG CATC CACATGAC CAGAGAC CT 
GGTCCC!TTATCCCCCTCCCAGGATCCCTCAGTTCTCAGCGGGTGCT 
ACTTTTCAGAT(^AACCAATGTTTATCCATTGAGTCCATCTGCCAACT 
TGGGTTTCTCCTCTCTCTTTAAATTATTAAGTGTGACAACGTGTG 

3 5 CAGGACACAGG CGGTGCTGGGCAGTGGCAACAGGGCACTGCACAGTCAGGG CCTCTTGGCACTGAGAG ATGGGGTCATCTTGGTAA 

TGGCATTAAGGGCATGTGCCACCAGGCCTAGCTTTITTCT 
TTCACCTTTTTATTTAGTTTGGGGTGTG 

GTTTGCTCTTCACTCCACT ATGGGTTTCAGGGGTCAAACTCAGGT CATCAGACTTG AATGACC AATG CTTTT ACCTG AG AG CCATT 
TCCCTGGCCCCATCAACATTCTAAAATGTCCAGAATTCAGTCTCCCGGGCGTTGTAGTCCAT^ 

4 0 TGTGTTTTCCTCACTTCCTCGCTTCCCCAAAAGTTCAACCTCACTCTT^ 

GTAGTTCAGGAAGTTTGTCCTCCACAGTTTTCAGTGTTCT 

TTCCATAGGTTCTTGTCTGTAAAGCCCTTGTGTCAGCGGGACTGATTAAGAAATCAGAGT^ 
AATTAAGAGCACTGGCTCCTCCTCCACAGATCCTGAGTTCAA 

CTAT ACAT AAAAAGGGAAGG AAGG AATGAAAG AAGGG AGGAAGGX3 AGGGAGG AAGGAAG C CAGCAAAAT 

4 5 GACTCAGTGGATAATGATGATTGCCACCAAGCCTGACATCCTGAGTTTG^ 

TACAAGTTATTCrCTAACTGCOJ^GGCGTGCTATGGTGTGTGCGCACACA^ 
TAAGAAAAAGAATACCTTATTCTTAAAGGAAGGTAACTTTTGTCCTGTGGAGC^ 
TCAATCAGATGAAGTTTGGAAAAGCTTGTTGGTGGGGTGAAGATGGAGGTGGGGAG 
TCATTGGCTCTCCTCAGGGGCTGACCACACTCAGGCTCTGTGTT 

5 0 AAACTCCGAATAGTAAAAACTAACCTGGCAAACCATATCTCTCrGTGAGT 

TTCAGTCCATTGGCAGAGTTTCGCTGGAAGCTTCCAGCAAGAGAGCCTGCTT^ 

TGTTCTGAAAACCTTCAGAGTAATGGCTTTGTTTGCAAACGCTTCT 

ACTAACAGGCAGAAATTGACCCCCAGAACTAGAGTAACAG^ 

GGTATTGAACTACATAATGAGTCCAGTGCCAGAGGCAAGCCTGGGCTACCC!AGAGCrG 
55 TTCTGATAG AATGTTAGCTATTGATACAACATAACACTG ATCACC!AGCAT AGTAG ATATCC CT C AATAAG CCTG 

TGAAGTTGAGGTFrTTATAAACAGGTACITCTGGCCC 
GTGCTGGGTGTGTTTAGCCGGTATAGCACTCTGTGTCCAGGCCCTO 

GGCTG ACTGTCAACACTGTCCTAAAAATATG ATGGTG AATCCAG TTCC CAGGGTTTAGAG CGGC CCCTGTGATGATTTACCAGCG A 
TACAAACCACTTCCCTTGTCAGAATGTCACGAACTGAAAACTGAGCrGCCGGGTC^ 

6 0 GGTAGGCTGTCTCCATGCGGGTGGAGGAGCCGCCCGGTGGCTGGATTGACCATGGGTACAGGGCCGGTGG 

ACCTG TGTGCACAATGTTTCCTGGTGAGTGTGGAGAGTTCTGACAGGCATGGGAGAAAAGC^ 
ATTTAAACTGGTAAGGAGTGGGAAGAGTTTTTTGTCTGGAGT^ 

GTACAACAGTAACCTTCATAAATCTATGTGGATAAAACCCACAGTTTAAAAGTAAAACG^ 
ATATATATACATTTTTTTTATTTTAACTGACACAACT 

6 5 ATTGTCAGATCTCTCACTCAACATTGATGATGTGTTTCT 

ATCCACACCGGAGCTCTATTTTCTATCCCTATTAAAGTACTTTTATGTCGTGAAACTATC 

CCACTTTGTATTTACCAGCTATTATCCrTCAAATACTGTACTATTAA 

AATATTCTGACCAAAAAATTAATATATATATTTrTTTTCrGACAAATTCT 

7 0 CATCTTTTGAAGCTGGACACACCTTTACTACCCACATGA^ 

AGCTGGTCTACATGTT AAGTTG CAG ACCAG CCAGGTCTACTGAGTGAGACTCAGTCTCAAAAAAT AAACAAAGG CGTTTGGGTTTT 

TTTTAGGTTTGGTTTGGTTTGGTTTTTGTT 

AGAGAGAGAAC^TATCaTvTTwAAA 

CCCCATTAAGCTGTTTCTCTCTTAAAACAACAATAAACTCC^ 
7 5 GTTCATTTAGCCTATCGGTTCCCTACCAGTGTAGGTCACAACC 
ATCGGAAAACACAGATATTTACATTACAATTCATAAC^^

TGAGGAACTGTATTAAAGGGTTGTAGGATTAGGAAGAGTGAGAACCACAGA 

GCTGCTACACATGCATTTTGGGGACAGGGCAGTTAAGACATGTCCTCT 

AAGAAAACT AGACAGAAAAATGACAGG CCTGTGTGATGAAAG AG AG AT CACC CACAT AG AGTCAC CCGAG TT AT ACAGTT AT AG CT 
5 GAGGAGCTCATTAAAGATXnTTC^TTCT 1 G I'TTGTTTATTATA 

TGAGTACACTGTAGCTGTACAGATGGGTTGTGAGCCACCATGT^ 

TG CTCTT AACTG CTGAGC CAT CTCTCCGGCCCT ACACCG T AGAATTTCAT ACT CATGTT ATGTTCTGT AATAACTTTG AAAAGAG 
TGATCTGAAGCTGGAGAGACGACTCAGTOSTTAAGAGGCrrGCCCTT 
CTCACAACCATCrGTAACTCCAGTTCTAGGAGATCTGACCCCCTCTTCTGCT 
CAACATG CAGGCAAACAC C CACACATT AAGTAGATCGGAAAAAT AT AT AT CTTT AAAAAAGGAAAAAGCAGAC ATGTCCAATTGTGT

TTGCTCATATTTGTTTATTAAAGCAACAACAACAGAATAGT^ 
TTAGTAAAGATGATGCATGCCAGACAGNNNNNNNNNNNNNW 



GCGCGCGCGTCTGCATGTGCGTGTGTCTGTGTGTGTGTGTGTGTGTGTGTATACTTATG^ 
1 5 TTATTTTTGAAACAGGGTCTCACrCTGTTGCCCT 

AGCCACCTACCTCTGCCTTGTGACTATGACACCATAC^CAGCCCCTTCCCTCGA 

TTCCTGTCTCCATCTCCGAGCTCTGAGATTACAAGTGTCCACC^ 

GCTTGCAAGGCAAACACTGGCTGAATCACCTTCCTAGCCC^ 

ATGAATGTTCTGGACACATGTATATTTGTGTACTA^ 
2 0 CTGGAGTTACAGATGGTTGTGAGCCATCAGGTGGGTGCTGGGAAACAAGCCCAG^ 

CTGAGCCATCTCTCCAGCACCCCCCTCCTTGTTCTTAACTAAGAAAGAGTC 

GCATGCTTAAATCTCCAGTCTCCTCCACrCCCAATCTCr^ 

CTTCRCTAAAACATTGTACAGTTGTCTTCCAGATGTTTGACT 

TCACACCCACrTGAACTTTGCAGTATTGGAAAATCTC!ACTGTTA 

2 5 TATCCTATAGCCGCACACACATCTAAAATAGGTAACCAACCGGCACCCACCCTACA 

AAAT AG CTCAGTG TT ACTCG ATCCG ACAGCCACTGCTCACAG AG AGAT CTTTGTTT C CTGGCACACATGTGCGTG CAGT CT CTGGG 
AGTCAACAAAACCC^GGTCCAGTGCACTGGACCATGGCTGTAAC^ 
TTGAGATTGGATTCAAGACGCCCAGATCCGTATTACCAAAAGCAGTGTTAC^^ 
ACATAGCTTCTCCACTCTGGGGAATCCCAGGTCTCTGCCAAA 

3 0 CAGAAAGC^TCTATTTGTTTGTTTGTTTGTTTGCTTGCTTGC 

CACTAGATCCCATGACAGACGGTTGTGAGCTATCATCAACrrTAGGACCTCrGGAAGTA 

CTCrCC^GCCCCC^GAAAGCCATTTTCTAAAGCGTGTTTTTGTTGGTGCAGCT^ 

AGTC^CAGGAAGCCCACGCTGGCCTTGAATTCTTGATCCTCCTC 

ATCCTGTTAAAAAGAAAAAAAAAAAAACAGATAATCTCACTATTGCTGGGACTAGTGGCGCC^GCCT 

3 5 AGAAGACAGCAAGTTTAGTGGGCCATGGTGGAACAGGCCTrTAATCCCAGC^ 

GAGG C CAACCTGG TCT ACAAAGTG AGTTC CAGG ACAAC CAAG AGAAAT CCTG TCT CAAAAAAT AAAACAACAAC AAAAAAG AAATA 
ATATAAAATGAATAAATAAATGAAATAAAATAAAATGATGATAAGTTGG^ 
CTCTTCCIAAAGGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCACAAC 
ATGTCTGAAGACAGCTACAGTGTCCTTACATATAACAATAAATAAATCITTAAAA^ 

4 0 TACAGAGCGAGTGCATAGTCAGCCTACACAACITAGTGAGACACrGTCrCt^^ 

CAG CCATGGAG CT CTTGCCCAGCATGCCCCAGGTCATAGGTTC AATCTCCATT ACAA 
GCCCTGCCGCTCACCATTGTCAjCACAGCAATCCATTCTTCT 

AAATTCCATAGCAGTAACTCCAGAGGTGAAAG AACATTGGGGAG CTTGGCCTG AT AAGCCTCTG CTCTGGGTAAGAGAATGGCCAC 
TAAGCCACTAAG AT AACACACAGG CCTCAGTCTGTC CCGAG ATTCTCCGGTCTTTCAGATCCTGTTCAAATCTTCCTGTrCATCTT 

4 5 CCATTTCACAAAACCCAAATACCAAACCTAGAGTTAGTTTTCAG 

TTATAACCCACGGCGGTTTCAAGCAAACGCGAAGACCAGGTCTAGGATATTAA^ 
TTTTGATGGTTCTGCTGCAATAGTCrATGCTTCATAGGTCCATTTGGGGCCC^ 
AACCTGACAAAGCTGGAAGTCAGTTCX1AGAATATTGATCCTGGGTAGCTGAGA 
(nXSGCAACCTGCGTTTGATCCCTCGAACCCACATAAAGATC 

5 0 ATTCACCGCGGCACGTGTGCGCCrcCACACACATCACATACGATAATAAAG AAAAAAG 

AAAAGGAGAGAAAGCATTCATTCIACTTTTAAACCCCGTGCATAT^ 
CAAATAAATCAAGGTCTTTGTTTAACAGGTTGTGAACCATGGCTATGTGGA 
CAGATGCAGGCCTTGTACGGCCAGCATTTCCCCATCGAG 
GAACAGCTTCTAGATCATCTTTTGGGAAGTACGAGGATATTGAACTTTGATGA 

5 5 GAGTAACCTAGGCTTAGTGGTCCCCATCTGTAACCGTCATGA 

TTGCAGAGTGAGGCCCTGTCCCTATCTTGGAAGAAGAAGAAGAGGAGGAGGAGGAGGAGGAAGA 

TCGGGCTGGGTGTGTAGCTCAGTGGTAAAGTACAGGCATGAGG CTCTAGACTCAGCCCTCAGG ACCTGTCTCCAAGAAAAG CTGTT 
TGAAAATCTTCATGTAATTCATAAAAAGGTATTCAGCCTTCTTTTA^ 

GTTGTTTTTTAAATTATGTATATATGTGCACGTCTGTGTATGGATGCGTACACATGAGGGCACCT 

6 0 GCnK3AGCTGGCGTTTCAAGTGGTTGCGAACCACCTGGTGTGGCOT 

CTCTTAACCAGACAGGCCTCTTCTCCAGCCTCTGAGCTTGCC^ 
TCTCACTTTTTGAGACAGGCCCTGGCTGTCCTGGAACTCAAAAGTC 
TTITCCCCCCAAGGTAAAGTTTCACCGTGTAGTTATGAGTGGCCTGCAT^ 
AGCCTTTGTCTCCCATGTGCTGAAATTTATATCTTATAAATCAGATATTTAACATATTGA 

6 5 TCTGGTTTTTTGTTTTTCTGAGATAGGGTCTCATACTATATAGCTCTGGCTGTCCCGGAA 

CTTTTTAAAAACTAATAATGGTTACATTTCAGTGTrGACT^ 

AAATTGTTAG AT CGG T AT AACACAGTT CAG TAG CAATG CAT CTT ACCGTG CAGTG CT AGG ACAAAG CTT ACAACACACGAG ACAGC 
TGTACCACTGAGCGGCTGCCTCCTCAGCCCTGCCGCATCCTC^ 

GCCCGTCGTTGGCCTCAGTCTGTCCGTGAGTGTCCAAGTTCATCACAAGTT CATC AG ACT AGCCAGG CTTTGACCAGGGTGGTG AG 

7 0 CTCCTCTTTGGAGAGCCAGCCTTGTTGCTGCTGGGAGAAGG 

CTTGTTAGCAAGGCTCCGTTC^GAAACAAGCCTGCCGAAGGAAGGCAATTAAAAAAAA 
AAAACTGACTAAATATGAGAACATCAAACACACCTCAATTAGTCC^ 
T"TCGGAAAACATTAATAGGAAGTTTCTGC!TGATTA?JL^ 
AGACTGGCTCCATACCACAAAAGAGAGAACAAAAGCCAACTTTCGT^ 
7 5 TCAGGAAGCTGAGGCAGGGGAATTACTTAAGCCTAGGAACTCX3AGACAGTCCTAGGCAAATGGGCT 
AGT AGTCT ACTT ATTGAAT AAG CTTTGGG AAATTGGGGTTG AT ATT AAGGAAAACGTGG C C

CATAGAGAAATAGAGATATCACCGAAGAGTTTCCAGCTCTCCGTATTGAAGCnXr^ 
GTCCAAATTCAGAACOVGACTCCTCTGTTCTtSTTCAAAGACT 
GCACTTGGGAGGCAGAGGCAGGTCGATTTCTGAGTTCAAGGCCAGCCTGGT 
5 AGAGAAACCCTGTCTTGAAAAAACAAAC^AACAAAAA 

GG CTGTG CCTG CAG CCCGTCT AGCCTTCCACTT CAT AC AT ATTTTCTTCAAC C CAT AG AG AAT ACACTTCT AAAAATTGGCTTTGT 

CCCTACCCTTTCCAGACACAGGTTGGGTGGTCTCTGGTA^ 

AGCCACTTGCTAACTTCAGGGATTTGTGGGTGGCTGTc 

TAATTAGATATTATTTCAGCCTGACAGCTGTTTGGTTTGTC^ CTAAGTCACTTAAGTATTACCCAGCAG CAATTGTTGCCT 

1 0 AACTT AGTAG A CAAAAAT L T 1T T AAGG ACAAAAAGAAAGAAGAT AAGG AGG AAGTGGAC CCAAC C CCTGTCAC CAGCTG AACAGCT 

CAGGATGACA.CGAGGTCTGGGGACrcGCTTCTCGCCTTCACTGGAQ 

CAGACTATTCTCCTGGGTGTGAATACTTGTAGGCATCTTCTGTCC^ 

GGGGGCCTGGAGCCAGTCAAGGTTGGAAAGAACATGCAGAACTATATACCCT 

GAAGTTAGAAGACCCCTTATTCCCTTTATTCCTAGGAGGGGGTGGGT^ 
1 5 GTATTTAGAGAAAGAAGTGTAAAGCTC^CTAAAAGAATTGTAGAGTCAGGCATTGTAGACA 

TTCTrAACCCGTGAATTATAGAAGAAATCACGTTCCAGACTCCCATGATGGCTCACACCT 

GTAGTATGTAAAGGCTCCATGTGTAC/VGGCTCCTATGCATCTATGTAT 

ATCTTGATAATCCACAGGAGAACATTAAGGCCACCCAG CTCCTGGAGGGC CTGGTGCAGGAG CTGCAGAAG AAGGCGGAGCACCAG 
GTGGGGGAAGATGGGTTTTTGCTGAAGATCAAGCTGGGGCACTATGCCACA 
2 0 GGCCCTGCCCTCTTGAGGACTTCTTCATTTAATTTTCT^ 

TCACGCACATGGTTTTGATTGGCTTTGTGGTTGCGG CTGTT ACCrTCTTTGAGACAAGAGTCTCACCGTGGGGCTGGAG AGTTGGT 
TCTGTGGTTAAGAGCGCTTCCTGCTTGTCTTAGGGTTATTATTGCTGTGATGAAACACTAT 
CCGTTTATCTGGCTTACACTTCCACATCACAGTTCGTCATTGGTGGAA 
CAGAAGCTGATACAGAGGCCATAGAGGAATGCTGCTra 

2 5 GAACGACCAGCCCAGGGGCGGCCCCACACACAATGGGCTGGGCCCTCCC^ 

TCCTACAACCCAAACCTATCATAAGAAGGCATTTTTCT 
GAAAACCAGCCA.GCGTGCTGCTATTGTAGAGGACCTGGGTTCAGCT 
TGGAGCCCTCTTCTGACCTCAACAGGCATCCACCCACAAGTGAC^ 
AAAGAAATGAAAAGTTTCACTATTTAGGTCAGATTGGCCTCAAACTGGG 

3 0 GGCTCTAGGTAAGCACCACTCCCAGCCATGACTITTTACTCT 

CCCCACACCTTCATOTACACATGTACACATACACACATACACACTCT 
TACACAATACACACACATACAAACATACACATACAGATGCAGACACATACAC^ 

3 5 TCTAAGAACATGAACTTCCCTTACAGGGTCACAATACAGTTATTC^ 

TTCTGTCTCTATTATCCATTTGTCAACTGACTCA.CAAGTG 

TGACAT ATCTCTTTAATGTTTTAT CTGGTTTGGTTTCAACTTTGTGACGTGTCTGTTTTGGAGATAGGG CCTCCATAGTGTTGTAG 
CCTATGCTAGTCTTGAACGTTAAGCATTGAACATTAAGCAGTCATCTTGTC^ 

CTG AAAAC ATT CATTITTCG AC CTT ATTTTGG L~1 _ 1~I"GATATTAAC!ATTTT AAAATAATC ACTTATT CC CIT ACTTTIAAT Al l l lJ ri'T 

4 0 ACATTTTATTTTTAAATTTGTGTGTGGGTATGTATGTGCACA^ 

TGGAGTTGGAGTTAGAAATGGTTGTGAGCTGCCTGAGGTCGGTGCTGGGAATTGAACT 
AGGGCCTCACCATGTTGTCTA 

G ATGCCGGTCTTCCCTACTGTG ACAGT AATTCTG Al"l C'l AACTG ACAGG AAATTTCC 
TTAAAGACTCACCCGTTTCCAAGAATGTTGCTGTAGTGcii ~r i GGC^TTGAGGTTCAGAAAGTAAAGCAGCCTTACCTTCTTAGTG 

ACCATGAGTGCTTCCCTGCCCTGGGTCCGGTTAACCCACAACAGAGTGGA^ 

GT ACGACCG CTG CCCCATGG AG CTGGTTCG CTG T ATCCGGCACATTCTG T ACAACGAA CAG AGGCTGGTTCGCG AAG CCAAC AACG 

TGAGTGTGCCCATGGTTGTGGGAGAGGAGTTGGGGTGCCCTTCTTC 

ATATGCACTAGTGTCTCTTTAGGG AG CTGACCTTAGCTTTAT AACTT AAGT 

5 0 AACTAACITAACCAAAGCAAATCAGTTTTGTGTTGAA 

CTCGGCCAGCTGCTGAAGGAGTGGATCATACCTAGTCACTTAAACAGCCAGATCCAC^ 
TTCCTCTGAGGGGCTACAGACACCTAGGAGCCTTCGGGTGTGGCTTCATCCACGGTAGAAGCA^ 
AGCACCAGTCTGCCTTGAGATAGATACAGACCATGTTTTATTCCCAATGTGTCCCCAGCGAAATCACAGT 
TGAGTGAATGTGGAATCAGGGTTATCATCATCAGGAGGAAGACCCAAGTGACCTCTG 
55 AATTGGGC ACACAC ATCCTCTGTGGGACACTCATGTGGAGGCCAGAG CTGAAACAGAAGTAGGTTTTT ACCGTGAAGAAAG CCTGT 

CGAGTCAACCGCCTCCACTCGCTTTCCATTCTACCCCGATTCTCACATGTGGC^ 



GCT CT CCAG CTGGAAGT CTTGCTGACG CCATG T C C CAG AAG C AC CTT CAGATCAACCAAACGTTTG AGGAGCTG CGCCTGATCACA 
CAGGACACGGAGAACGAGCTGAAGAAGCTGCAGCAGACCCAAGAGT^ 
6 0 TGAGGCTGGCAGTGGAATCCTGGAAGCTCCGGGGGTGGGAGTAAGGTCTAGGAGCAAGCTCTGAGGA^ 
CCATTCTGAGCTGTGCCTTTGTCTGAGCAGAGTGGAACCATTCAGGCTGAAGAAGGAGCrc 
CCTGATGGGGAGAGTGCAGGGTCTCACTCAGGCCTTCCTGTTCTCATGAAGTTTGGGGTCGCTGTA^ 
ATTTAAGGTXjTTCATGGGCTGGAGAjGATGGCTCAGTAGTGGAAAGCACT 

AACCACAT AGCAGCTCACAACCATCTGTAACTGTAGTCCCATAGCGGCCAATGCCCT CTTACGGTGTG CAT AT ATG CAGACAAAAT 

6 5 ATGCATATATATAAGTAAATAAATCTTTGCCAC^CATGGTGGCTCACATCTTTAATCCT 

TCTATGAGTTTGAGAACAGCCTGGACTACAAAGAGAGTTCCAGGACAGCCA 
CAAAAAGCATTCATAGCTTAATAAATAGCTTACAGA 
TTTATTTATTATATGTAAGTACACTGTAGCTGTCTTCAGACACT 
ACCATGTGGTTGCTGGGATTTGAACTCAGGACCTTCAGAAGAGCA^ 

7 0 TTTTAGAGTAATTGTTTGTTTGTTTTAGACA 

CCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCCT 

TTTTATGAATGCAGCCTTGCTGGGATGGGAAGTAGGAAGTACATCACATCTAGGAGTGT 

CCTAATGTGTGAGAGCCTGG CCTCCATCCCCAGCACTGGAGG AT CTCTATTTGTGGTGCCTG^CATTtjAGTTGTATGCAGTTATTC 
AAGACITTCCTGTGACCCrGTATGCCTTCTGGACAGTGTTCCCT 
75 GAG TTGTGGAGCCTC CG CAG CG CT C C CTCTTAAC CL~1 ACGCTT7 CTTCTCT ATG CGGG CG TC C CTGCCTCAOAC AO AC AGA(^GG«S 
CTGGGGCCATGGCACTAGTGATTAAGAGCACTGGCCTCTTTTGCATAGGAGTTGGATT^

AAACTGTCTGTAACrCCACTTCCGAGTAATCCAGGACCCrCTTCTAACCTCTGTGGGCACC^ 

ACACGTATGCAAAACACTCATACGCGTGCAATAAATCTAAGACCAAT^ 

CCGCCTAAGGTCTCCTAAGGTCATGTGTGTAAAGCAT AGCGCATTG CT ATGTTGACATT ATTT ACTG CCTCCCAG^CTTCCCAGAT 
5 G ATAAATCATTGAG AAGGAACTCTGTCTATTAGCTCTGTGCCCTGAGG CCAGC ATAAATTACAGTTG CGGGCTTATCCTGGCTGTC 

TTAGTGTTGCTTTTGTCCATCC CAG CTTGTGTGTCCCTGGGG CAAGGACAAGGT CATTGTG ACATGG CCAG C CATG 

AG CTT CAATGAGACATG CC CGTATG CACAAATTCTTGG CAAG AC CAG AAT AAGG ATG CTGGGCT ACT CAAGGG C CACCGG AGGAAA 
C AAT AATGC CAGG ACTC CGCTCTTTTT CTGGGCATGGAG CAAGG AACAAG AAGGTTTG CAG G CAGTG AT AGTGGACAGGT AG TTCA 
ACCCAGGACGTACTTGATGGAGCAGAGCCGATGAGGACTCTATGTGACT 
1 0 TATTGAAAGGTACCAAAAGAAAACCGACGATGTAGCTCAGTT^ 

AAAAAAAAAAAAAAACCTGGTTTTATCCCCAGAGCAGCCTCCTGAGCCG 
CGXMAAATACAGGGATTTACCCAAGTTCGACCAGCTGATAAACCACAG^ 

GAGGGCAGGATGCCCTCTGTGTGGGGAGTGTGGAGGCACACACT ACTAAG CAGTTGCTTCCCCAACAAACACCTGCTCCCGG CCT A 

GGAGAGGCCAGGTCAGGAGAACCTTGATCCGCCTTCTGAGAGAGCCCCTTC^ 
1 5 TTACTGTTGATGGGGTCTCTAGATGACCCTGAGAGTGGGTGATCCrrCAGA 

GTCCTGTCCCCTCCTCCAGCTCAGTTTGCCCAGCTGG^ 

GAAGCAAGTGTCCCTGGAGACCTGGCTGCAGCGAGAGGCACAGACACT 

TOTGCCCCAGAGGGAGCTCTGTCACCTGAGATTCAGGGCACCAGGTGGGGGACAGAGCC^ 

GGGGTGGAGCGGGTGTCAGGGGGCCAGGCCCTGGGTTCTAGGCCAGGCACTGAGGGATGCT 
2 0 C CCTCG CACAGGAG CTGG CTG AG AAG CAC CAGAAG ACCCTGCAG CTG CTGCGG AAG CAG CAGACCAT CAT C CTGG ACGACGAG CTG 

ATCCAGTXK5AAGCGGAGACAGCAGCTGGCCGGGAACGGGGGTCCCCCCGAGGG CAGCCTGGACGTGCTG CAGTCCTGGTGAGGGTG 

AGAGGTGGGCGGGGTGGGCAGCTTGGGCAAGGATGCCAGCTGCCTCTTCTGTC 

CrGTGACAGCTCrCACTTTAGCAGTATCTTGGCTCTTGATGGG 

GTGCCTTGGGATGGGCAGGGCTACCCTTGAGCTGGTTGGTTCCAAGTTGGGGCCACCCTG^ 
2 5 ACTCTGCTCAGTGTGCCTGTGCAGGAGAGCCK3TGGCC 

GAG TAG CATG CAT CTTCCTCCCCTGGCCTGGGCTGGAACAG CAG CCGC CATC CCCCTC 

AAGCTGGCCGAGATCATCTGGCAGAACCGGCAGCAAATCCGCAGGGCTGAGCACCTGTGCCAGC^ 

GGAGGAGATG CTGGCTGAGGTCAACGCCACCATCAhTNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NMJNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNl^^ 
40 NNNNNNNNNNNNNNNNNNNIJNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNIWNNNNNI^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNIJNNN^ 

NNNNNNNNNNNNNNNNNNhfi^NNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NN>HWNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNTINN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNIJNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNlttJNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNIJNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNIINNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNIJNNNNN^ 
65 NNNNNNNNNNNNNN!TONNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNMINNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNlJNITONNNNNNNNNNNNNNNm 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNIWNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNIWNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNm^ 
7 5 NNNNNNNNNNNNNNCTCACTTTTTATGGAGTACTGGG 
ACTATAACTATCCCCAGCCCATGATGAACATTTTCAG

ACTCTCAGAAGCGTGATAAAAGTGTCCTCGTTATTCGACACATCCAAGTCTGTCTTTCTCTAGTGTGAGACT 

TGTCATCTTT AG AGAACCGGG CTTTATTGCAAAGGCAGGGTTTTT^ CAAAAGCAGAGTTT AG A 

CTC CT AAAAAAT AAAAAAAAAAATT AAAAAAAAAAAGGCCAC^ CAAAG CCTT ATGGATTAGATTGC 

AATATGAAATTTTTTACATTCTCAGTGATGCTGATGACTAA^ 

AGTCGACCATCTCAGTACTTGGGCAGCCAGGACTACATAGAGAAACCCTGTCTCAGGA 

GAGTATCCAGACACCACTGTATGCTGGGATTTATTCAGGATAGGCTTGTATGCATTGTATC 
1 0 CAGGGCTGTG TATATGAACCT AGGAG AGTGG AAAGGGGTTCTT CTT AAAGTGAAATTACATTGG CTGAGG CTTGGGGGT ACCT CAT 

TTGCATGGAGAGGCATTCCTTGAATTTCACCATGGATTATGTGACCT 

AGGAAGCCCrGCrTAGAGAAGGTTGT'lTrm TGI 11TG TT T rm 'GGTTTT 

GGTTTTTTGAGACAGGGTTTCTCTGTGTAGCCCTGGCTGTCCTGGAACT 
CGCCTGCCTCTGCCTCCCGAGTGCTGGGATTAAAGGCGAGAGCCACCACGCCCGACTC 
15 GGAAGCTGTCCGGACATG<3TAGACTATATAACCTTAGTAGTGAGGTACAACAGAC^ 

AGTCATGGTAATGCACTCCTGTAATCTTAGCCCTTGGGAGGTAGAGTCAGAAGGATCAG 

TAATTTCAAGTTCAAGGCCAGATTGGCCTGCATGAGACCTATCCCAAA 

TGCCATGCAAATATGGAGACATGAGTTTTTATATTTTATTTTATACATGTG 

TGTGCCTGTGGCCCATGAGGCCAGAAGAGAGCATCAGATCCCCTGGAACTGGGGTTACAGATGATTGTGAGCT 
2 0 TGCAAAAACAGCCAGTGCTCTTCATCACTTGAGCTATCT 

AGACACCTGAGGTTGACCTCTGACCTCCACOTATGTGCACATATGTGCITCA 

GGAAGCAGGAAGGTGAAGAGAGAGGAAGTCCCCTCTCCTGGCCATCTTC 

CCTCTGGCCCTAACTACCCTCGTACCTACTGACCAGCCAGCTGAATAGAATGAGAGG 

TTTCTCAATCAAGTTGCCAACTAGAAACTTTCCACTCCTCCATGCCAAGTTC 
2 5 AGGACLAAGCAAGTATTATACTTGGGGCTGCACATGTGTGCCCTCCCTCTGCTCA 

CTCTCTCTCTCTCTCTCTCTCTCTCACTCCCTTC^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNIWNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NhWNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNfrWNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 
35 mWNNNNNNNNNNNNNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNNNNNNNNNNNNNNNN^ 

CAGCAAGTGAGGGGGGGTGGCGGGGCGGGGTGGGGGGAGTCCTGTTTACTTAGCAGTGGATTCC 

TGAAGGGGGGAGGCATGGCAGGGAGGGGGGATCCATCTCTCCTGAGGG 

CTTGAACAAAGTAAGGAAACTGGCTTTGCAGAGACCCAGAGGAGAAGCATCCT 

AAAAGCTCCCCAGAGCCCCTAGGCTGCCTCCAGCCAAGCAGGAAGTTGGTGAT 

4 5 GT CGGGGTGAAAGGTCAAAGCGTG CAT AATC CTAAGTGCCCTGATTTCACACACACACAC 

ACACACACAGAGTTGCAGGTCZAGTAATGCATTCATGTAGCACTGTCTTC^GGA 
AGATCACGGTTAGGCAATTGAGATTCTTAGTATTTAATTTACTTCATTTTC^ 
ACCTTGTTCAACTATTCCTTCAGATAAAITCCTAAGCAGAATTAGGTCA^ 
CATCCAAAACTGCATTCIAAAAACCCAAATTCACATTTCCCAC^ 

5 0 AGAATAGCCTCC^TTCATGTCTGCCAGTCTGGTTGGCATGA 

AACCTTTGCATATGATGATTAATAATTCCAGATTAATTAGCATGGTAGC^ 
GAGTATCCGTCTTAGTCACTGTTCTACCACTGTGGAGAGACACCGTGACCAAGACA 
GCTGGCTTATAGTGTGACTGGAACTCCTCTTTTACAGGAAACTGAGGCA 
GGGAAGGGGCCAAGCCAGGGTTTCAGCTTGGCriTAGCTCCTGAACCTAGAT^ 

5 5 AATTTTTTTCrGTGAAAAACATGTTTGTATTTC^ 

GTGAGATGCTCAGTCTACAGAGTACTTGCTGCGAGTTTAATCCCTAGA 

TAATACTAGTGTATGTGTTCCTCCACATGCCTGGGCACACACATGCA.TGTACACACATC 

TGTGTATATACACACA.CACAAGAAAAATAGTTAATCCCCTATTGATTTATCTATAC 

TTTCT CAAGAACAAAGACTTGTGATTT CCAT AGAAACCCAATCTGTCTGTTTTTAGTGTGTGTGTGTGTGTGTGTC 

6 0 ACATATACCATGGTCTGCATGTGGGGAGTAGGAAGACAGTCAC^^ 

GGACTTGTGTGATCAAGCTTTGTGACAAGGACCirTACCCACTGAGCTCTCT 
ATTTTTCAATTATGCCTGCAAGCACAGTCATA^ 
GTAGAGAGTGAGTTCAAAGCCAGC!ATGTGCAACTTAGCAAGACACTC 
GCCTAGTATATACGATGCCCTAGTATATACGATGCCCTAGGTTCTT 

6 5 ATTCATAAGGTAGAGAC^GGCAGATCTCTGTGAACA 

TTTTTTCACAAAT ATG CAC CA.CT AGAAGTT AACT ACACGCG CATGGAAGTGTCTGC CTTCT 
AGCTGACCTGGAAGGT^GCCCAATTTGTACTTTC^TGCCT 
TGTCGGTC^TGTTCTGAGTTGAGATTAAGAGGCTGTGTCCTAACCCT 
CTGGTTTTCACTGTTCTTCTCCCCXSGCAGCGAAA 


MOUSE SEQUENCE - mRNA 

AAGGTCCTGCAACGGGGAGGGAGCGAGCGGGCftTGCGAGAGAGCGTC 

GGTTGTGAACCATGG CT ATGTGG AT ACAGGCTCAGCAG CTCCIAGGGCXSATGCCCTTCACCAGATGCAGGCCTTGT ACGGCCAG CAT 
TTCCCCATCGAGGTGCGACATTATT^^ 

7 5 CATTAAGGCCACCCAGCTCCTGGAGGGCCTGGTGCAGGAGCT 
TGAAGATCAAGCTGGGGCACTATGCCACACAGCTCCAGAG
ATTCTGTACAACGAACAGAGGCTGGTTCGCGAAGCCAACAAC^ 

G CAGCTTCAGATCAACCAAACG TTTG AGG AGCTGCG CCTG^ AG CTG AAGAAG CTG C AG CAG ACC C 

AAGAGTACTTCATCATCCAGTACCAGGAGAGCCTGCGGATCCAAGCTCAGTTTGCCCAGCTGGGAC^G 
5 ATGAGCAGGGAGACGGCCCTCCAGCAGAAGCAAGTGTCCCTGGAGACCTGGCTG 
AGTGGAGCTCGCTGAGAAGCACCAGAAGACCCTGCAGCTGCTGC^ 
GGAAGCGGAGACAGCAGCTGGCCGGGAAOSGGGGTCCCCCCGAGG^ 
GAGATCATCTGGCAGAATCGGCAGCAGATCCGCAGGGCTGAGCACCTGTGCCAGCA 
GCTGGCTGAGGTCAACGCCACCATCACGGACATCATCTCAGCCCTGGTCA 
1 0 TCCTGAAGACCCAGACCAAGTTTGCGGCCACTGTGCGCCTGCTGGTGGGGGGGAAG 
AAGGCXLACCATGATCAGCGAGCAGCAGGCCAAGTCCCTC 

CAACTGTTGCGTCATGGAGTAC CACCAGGC CACTGGCACGCTCAG CG CC CACTTCAG AAACATGTC C CTG AAACGAATCAAGAG AT 
CTGACCGCCGTGGTGCAGAGTCAGTAACGGAAGAGAAGTTCACGATCCTGTTTGACTCA 
GTCTTTCAAGTCAAGACCTTGTCGCTCCCGGTGGTGGTGATTGTTCAC^ 
1 5 GGACAACGCCTTTGCAGAGCCTGGCAGGGTGCCATTTGCCGTC 

TGAAATTCAAGGCTGAAGTACAGAGCAACCGGGGCTTGACCAAGG^ 

AGCAACCACCTCGAGGACTACAACAGCATGTCCGTGTCCTGGTCCCAGT^ 

CTGGCAGTGGTTTGATGGCGTGATGGAAGTATTGAAAAAACATCTCAAG 

ACAAGCAACAGGCCCACGACCrrGCTCATCAACAAGC CGGACGGG ACCTT CCTGCTG CG CTTCAGCGACTCGGAAATCGGGGGCATC 
2 0 ACCATTGCTTGGAAGTTTGACTCTCAGG AGAG AATGIl ri GGAATCT 

CGCTtSACOSCCTGGGGGACCTGAATTACCrCATATATGTGTTTC 
CGGTCCCCTGTGAGCCCGCAACTGCGAAAGCAGCTGACGGATACGTGAAGC 
GCIATCCACAGATGCTGGGAGTGGCGCCACCTACATGGATCAGGCTCCTT^ 
CCCACCCAACCCGGACTCCGTCCTTGATACCGATGGGGACTTCGATCT 

2 5 TCTTAGGCCGGCCCATGGACAGTCAGTGGATCCCTCACGCACAGTCATGACCAGACCT 

GAGGAACTTCCTGTGGATGTTTTAATTCCATGAATCGCT^ 

MOUSE SEQUENCE - CODING 

ATGGCTATGTGGATACAGGCTCAGCAGCTCCAGGGCGATGCCCTTCACCAGATGCA^ 

3 0 GGTGCGACATTATTTATCACAGTGGATCGAAAGCCAAGCCTGGGACTCIAATAGATCT^ 

CCCAGCTCCTGGAGGGCCrGGTGCAGGAGCTGCAGAAGAAGGCGGAGC^ 

CTGGGG CACTATGCCACACAGCTCCAGAG CACGTACGACrcCTGCCCCATGG AGCTGGTTCGCTGTATCCGGCACATTCTGTACAA 
CGAACAGAGGCTGGTTCGCGAAGCCAACAACGGCAGCTCTCCAGCTC 

TCAACCAAACGTTTGAGGAGCTGCGCCTG ATCACACAGG ACACGG AGAACGAGCTGAA^ CAGAC CCAAGAGTACTTC 

3 5 ATGATCCAGTACCAGGAGAGCCTGCGGATCCAAGCTCAGTTTGCCCAGCTGGGACAG^ 

GACKKSCCCTCCAGCAGAAGCAAGTGTCCCTGGAGACCTG^ 

CTGAGAAGCACCAGAAGACCCTGCAGCTGCTGCGGAAGCAGCAGACCATCATCCT^ 
CAGCAGCTGGCCGGGAACGGGGGTCCCCCCGAGGGCAGCCTGGACGT^ 

GCAGAATCGG CAGCAGATCCGC AGGGCTGAGCACCTGTGCCAG C AGCTG CCCATCCC7V.GGCCCCGTGGAGGAG ATGCTGGCTGAGG 

4 0 TCAACGCCACCATCACGG ACATCATCTCAGCCCTGGTCACCAG C ACGTT C ATCATCGAGAAGCAGCCTCCTCAGGTCCTGAAGACC 

CAGACCAAGTTTGCGGCCACTGTG CG CCTGCTGGTGGGGGGG AAGCTG AATGTGCACATG AACCCCCCGCAGGTGAAGGCGACCAT 
CATC^GCGAGCAGCAGGCCAAGTCCCTGCTCAAGAATGAGAACAC 
TCATGGAGTACCACCAGGCCACTGGCACGCTCAGCGCCCACTTCAGAAACATGTCCCTC 
GGTGCAGAGTCAGTAACGGAAGAGAAGTTCACGATCCTGTTTGACTC^ 

4 5 CAAGACCTTGTCGCTCCCGGTGGTGGTGATTGTTCACGGCAGCCAGGACAAC^ 

TTGCAGAGCCTGGCAGGGTGCCATTTGCCX3TGCCT 

GCTGAAGTACAGAGCAACCGGGGCTTGACCAAGGAGAACCTCGTGTTCC^ 

CGAGGACTACAACAGCATGTCCGTGTCCTGGTCCCAGTTCAACCGGGAGJ^TTTGCC^ 

TTGATGGCGTGATGGAAGTATTGAAAAAACATCTCAAGCCTCACTGGAATGATGGGGCT 

5 0 GCCCACGACCTGCTCATCAACAAGCCGGACTGGGACCTTCCT 

GAAGTTTGACTCTCAGGAGAGAATGTTTTGGAATCTGATGCCTTTTACCACTA 

TGGGGGACCTGAATTACCTCATATATGTGTTTCCTGATCGGCCAAAGGATGAAGTATATTCTAAGTAC^ 
GAGCCCGCAACTG^GAAAGCAGCTGACGGATACGTGAAGCCACAGATCAAGCAGGTGGTCCCCGAGTT^ 
TGCTGGGAGTGGCGCCACCTACATGGATC^GGCTCCTTCCCCAGTCGTGTGCCCTCAGG 

5 5 CGGACTCCGTCCTTGATACCGATGGGGACTTCGATCTGGAAGACACC^TGGACG^ 

CCCATGGACAGTCAGTGGATCCCTCACGCACAGTCATGA 

HUMAN SEQUENCE - GENOMIC 

AGCCCTCCCGTGCCTTCCCACCCTGACCCTGTGCCCAGTGGCTGTGCCCTTGGTTGACCATCCCTCATTTGGTCT 

6 0 CCC CG AAT ATCTGTGT CTTTCATTG C ACATTTCCC CTT CTGGGTTTTTTTTTTTTTTTTTTTTT^ CG CTCTCGT 

TGC CCAGACTGG AG TG CAATGG CGCG ATCTTGG CT CACAG CAACCTC CAC CT CCAAGGTTCAAGCG ATTCT CCTAC CT CAG CCT CC 
CG AGT AGCTGGG ATTACAGG CATGTG C C AC CACTCCTGG CT AATTTTGT ATTTCTAATTTTGT ATTT^ 
TCCATGTTGGTCAGGCTGGTCTCGAACTCCCGACCTCAGGTGATCCACCCGCCTCGGCCTCCCAA 
AGCTACCATGCCCC^CCCCCACTCTGGCTTTTATTCTCTCT 

6 5 CTGTCTGGGTTTCTCTC<^TTTCTTCAGGCCTGTTCTCCCTT^ 

ATGGG CAAAGGG CACCCAAGGGAGTAG AGGGAAGGGTGTCAG AGTTGGGTTGGGAAGAGGTGACTGTAGTCGTGGCT 
ACAACTACCCAGAAATGTCAAATTTGGGGTTTCTTTTTGCT^ 
AAAGTGCCCrGGTGATTTTGGGGAACCAC^CTGTGGAGGAGAAAACTGAG^ 
CAAGGGTCTGACTGAGCTGCGGTCCGAGGCTGGCCTCCTTGCCCCT 

7 0 GGGGAGTGAGGAAGGATTTTAGAAATCTTACAGAAATATAT^ 

TGG G AGGCTGAGGTGGGAGG AT CACTTG AGGC CAGGAGTTTG AG ACCAGT CTGACCAACATGG TGAAAC CTCGTGT CT ACT AAAAA 

TACAAAACAATTAGCTGGGCGTGGTGTCACGCACCTGTAATCCCAGCTACT 

GAGGCGGAGGTTGCAGTGAGCCGAGATCATACCACTGCACTCCAGCCTGG^ 

AG AAAAAAAAAAA AO C AAT AT ATT CTCC^^ CAG CCT AGGCTTCAG 

7 5 TTC^CACACTATATTCGTATCCTGAACTACATCAAGGCACATTTGTGTAC^ 
CAATAACCTTGAAGGCytf3TCAGGGCCAGA^

GTG CACAG ATGG CAC CAGG AT C CAAGTTGCCGGGCCTTGTGGG CAGGGTCTCAGT AT AACCTTT CT AAG CTTCCC CAAGCC CAG CA 

CACACCGGCGGAGTGAAGAGGCCTCCCTGGGCTGGGCTAGCCCACCATTTTTCTGCGGTGCGG^ 

CAGCCTACACCCCTGCTCTATTGAATGGGACAGTGTGCTCTGTGCACACTTACATGAA 
5 ATT AT CCCAAAGGGC CGGGCATGGTGGCTCACG CCTGTAGTCCCAGCACITTGGGAGGCrGAGGCGGGTG CATCACCTGAGGTCAG 

GAGTTCGAGATCAGCCTGGCGAATATGGTGAAACCCCGTCTCTACTAAAAATCCAAAAAT^ 

TAGTCCCAGCTACTCAGTACGCTGAGGCAGGATAATCACTTGAACCAGGGAGGTGGAGGT^ 

CACTCCAGCCTGGGCAACAAAAGCGAAACTCCGTCACAAAAAGAAAAAAAAAAAAA 

CCTTTCTGCCTTGCCCTTGACGATCAGGTCTGCCCATGCCCCTACCCTGGGGA 
1 0 TTGGGGGGTGG AGGG AGGTGG T ACAGAGATG CCTTTGTTTCCTGTGCTCCAAGT C C CTT CTTCAAGG AG AGTT C CTGTT ACATGGG 

AGCCAGAGGGGAGGTGGGGT AGGGGTGAAGGGCCTT CC CAAGAAAG CTGGAGG CTTT CT AGGAAGT AT CTTTTCACCCTGCGG CAA 

AGGGGAGAAGCCCTGGAGCACTTAAACATCCrcTTTCACCCCCACCTTGCCTCTG 

CCGGAGAAGGCTTCTAGAAGTGGGGCCATGGGCCCATGCTCAGAGTATACCCCACCT 

GAAATGCCACCCCCATTCTTGTCC^VAGAAGACAGATCAGGACCTAGGGAGGGTGTGTGTTTATATGC^ 
1 5 GCAAATCTGGGGGAAGGGAAGGAGCTGCCGGAGCTTGGGTGAAGCCCGTTCCTGCCGCTGTCCCCTATC 

GTCACTTTACAGTGAAGTCACCAAGAGAACACAAGGCCATCCTGCCTGAGGCATAGTGACTG 

AGGGG AAGG^TGTGGGGCTGGACAGGGTGGGT AGGAGAGGAGT CCTGCTT C CTCTGCCCAGCTGCCCAGCCTGCCTGCCCTGGGGG 
AGAGGCTTGTAGGCCGGTTTCTTTCTGGACGAAGAGAAGAGGATGCAGAGTCTGTGCCAGGTTCAC^ 

GGCAGGAGGAGCC CAC CCTTTGCAGC CCCCTCATTG CAGGG CCAGGCCCAGGAGCCTGGAGAGAGGAGAG AGGG CACACAG AGC CA 
2 0 GG CAG CTG AGTGG AATTGG CACTG TCCTGGG CAGCCCTGGG TCAAG CCTGG CTTCT CAT C CT CATCCATG ATG ACT ATCAGG CCG C 

TCTTCCCGCTGAGCTAGTTTCTCCTCCTTCCAGCCCAACA^ 
TCTGGGGGTTGTGTGCGAGGGGGCTGAGCATCTTCCACGACACCTCCCAGTACT 
TTCCTITCTGTAGAATCTGTTTCTC^TTGTTTCTTCGTTTCTTA 
AAAGTGCTTCGGCCAGGCTTTTCCCCTTACGCTGGCCTCTCAACTCCAAGCA 

2 5 TTCOTCCCrrTGGCCACCrGGTGAAATTGGCCrCATCTTTCCTA 

TAAGGTCCTGGTGCAGACCCTGGCTCTCCTGTAGGATCTTGAGTG 

GAAATAGCAAGACCCACTGCTX3GGATTTCTGTTAAGACCCAGTCACAGGCCCGGTC 

TGGGAGGCTGAGGCAGGTGGATCACCTGAGGTCAGGAGTTTGAGGCCA^ 

TTGTCTCTATTAAAAATAAAAAAATTAGCCAGGTGTGGTGGCAGATGCCTGTAATCCCAGCTATT^ 

3 0 TCACTTGAACCTGGGAGGCAGAGGTGGCAGTGAGATGAGATCACACCACTGCACT 

AAAAAAAAAAG ACTCAGT CACAGTGG CTGGTGCATTGTAGG TG CTCAGAGG CCTTGGCCAGTAGAGCTGGGTGTTTTTTCCTCTGC 

CCTTCAATCCCATCTGTATTACAGGCCCCTTCTTATGGCCCTAAC^CATTC 

GAGTCTTGATAAATATCTrGTTGGATTAATGAAGGCCCCTAAACGAG<^ 

TGTTOVCTC^CCACCCCCACCATGGTTCCCAGA 

3 5 TTTTCTACAO^CAGTGGCAGCAGCAGCCTCATCTTTGCCTGGTGCTGGGCT 

AAGATACTGGCTGCAGTGCCACTCTGCTCCCTCCTCTGCGTCCGCCCCACCCT 
GTGGAGAGAGCCCAGTGACCCTGTCCCAAGAGGCGGGGTCGGGGGATCCTTGG^CTT^ 

TCTGGCTGGAAGGG AGCCCAGATTGGTG AGG CACACAGGAG CAAGACTG AG AGCAGGTGTGTGAGACTCTCCAGGTGG CGAGTCCT 
CCATGCACCTGCACATCCATCGCTCCCGTCCTCTCCCCTCCCCTC 

4 0 CTGGGATTAGGCTGAGGGAAAGAAAGAGAGAAAATGCTTTTTTATCAAG 

TCCTTTTTCCTTTAAAAAAACTGTCTATTTTACTAATGTTTTAATAGG^ 

ATTCAAAAGGGTATGTGGTGAAAAGTTAGTCTCTGCCCTTTCCTGTCCCCCAGGGCCCTTCTGGA 

G TCTTTTG TG CATTT AAATGCACGG AAGGATTCTGTGGATTTGAAACAT AT ATT ACftCT CT AC CCC CT AACT ATG ATTT ACACGTG 
ATGGGAAACTGAGATCCTCAGAGACATTAAGTACCTAGACCAAAGTCACACAGCTAGTTTTTTT^ 

ACGCCATTCTCCTGGCTCAGCCTCCGGAGTAGCTGGGACTACAG^GCCC^ 

GAGTCAGGGTTTCACCGTCTTAGCCAGGATGGTCTCGATCGCCTGACCTCCTGACCCACCCGCCT 

TTACAGGTGTGAGCCACCGCGCCCGGCCAACAGCTAGTAAGTTAGCAG 

ATG CT CTGGC AGAAGT AAGCAGGGG CACTGACAG AG CCCTCCGAAACTGCAG AG CC CATTCCAAAC CG AGGC CACG CCATTCTGAG 

5 0 GCCTTGGAGATCTGGCATCCCCGCCTTCTTTTCTGTTCAGCATCCTATACCTT 

TCC CC CT ATC^IAAGGATTCAT CCC CACAGGGAAAAAGGGACATT C C CACAGTGCCCTTT CTTAG CAAATGTGTGGC CGGTGGTTGC 
CCAGGGTCAGCCTGACTCCTGGGATGCAGGAGC^AAGCCATAATAACGAAGGCCTCTGGTTTCTGGCCA 
GGCITCAGTGACGTTCCCAGAAAATGGCCTCATGATCTCTGGGGCAG 
GAAAGCCATGTGTTCATCTATGAGAGAGAGAGGGGAGGAGGCTGTAGGATGGGAGCAG 
55 AGACCCCCAGGCTCTTCCCCCAAGCCTCTCrCCTGGGTCAGGCTGGTGCACTAAGGGATC 

CTCCAGGTACL"lTCT'rGGGTCATTCCTGATGGGTTGGACTGTGGGGTC i l i CAGGATCCTAGCTTCCTGTTCAGTTrCTTTCTTTC 
TGTCATGTCTTAAGTCTGGTCrrCCCCTCAATTTAAGGGAATATGTCCCCGAGGCCTAOAAGA^ 
GTCAAGCAGAGAAGCAGGTGTCAGGCCTTCCCATCCCAGGATGACAACATTCTGTTTTGCTGAGG 
CCTCAGCATCTGTGCACACACCCAGCCAC^TCTAGCTTGGTGCCTGGTGACCATC^ 

6 0 AAGAGGCATTCCCACCCACCCrTCAGGACATTTTr riTlTl^rrr CTGAG 

ATGGTGTGATTTTGGCTCACCGCAACCTCCGCCTCCCAGTTTCAAG^ 
GGCATGTAATCCTGTAATGGCACCACCATGCCCAGCTAATTTTG 

GTCTCGAACTCCCGATCTCGGGTGATCCACCCGCCTCGGCCrCCTAAAGTGCTGGGATTACAGGra 
CCTTCAGGACATTTAAACTAGCGCCCIGTGGCTGCCACAAACCAGCT 

6 5 TTCATCTTAGCTGCTACTAAACACCACAGTAGTAGTTAACCGCTATACAGTGCTTTACAG^ 

TTATTTCCCGCAACCCGATGAGGTAGGTACTGTCTCTATCACCACATTAC^ 

CCAGCTCAAGGAATTGACAGAGTCTGGTTTGAGAACCTGTACTT^ 

CCAGGCTOGAGAGC^GTGGCACAATCATGGCTCACTACAGCIT 

GGAGG CTG AGGAG GGAGGGTTG CTTG AG C CAT AAGTTCTTGAACAGCC CAGG CAAT AT AATG AGACCCTGTCTTTACAAAT ATTTT 

7 0 TTTTTTTTTGAGATGGAGTCTCACTCTGTCGCCCCAC^CTGGAGTGCAGTTTCGTG 

GGTTCAAGCAATTCTCCTGCCTCAGCCTCCCCAGTAGCTGGG^ 
AGTAGAGACAGGGTTTCAACATATTGGCCAGGCTGGTCTGGAACA.CCTGACCTrc 
GGGACTACAGGCXSTGAGCC^CCGTGCCCC^CCTACAAATAATTTTAAAAAA 
AGTTACTCATAAGGCTGAGGCAGGAGGATTGCTTGAGC^ 
7 5 TTTTCTGTGAGACTGTCTCCAAAAAATAAAAATAAAAAAAAATGATCACACCCTTGACCT 
TAGCCTCCTCAGTAGCTGGGACCACAGGTGTGTGTCGC^
CCTCCATTTGCATTCCTAGAGCATAGCATTTGTTCCAAAA^ 
TTAAAAGTGAATAGCATATTGCGTCTGCCCTTGGGGACAGAGA 

CTGT AATAG AGGCATGAAT AACG CGGTGGGAAACAAGG AGGG AGGT CATG CTG CCTG AAG CAAATGT CAC CG AAGGG CAG ACATCA 
5 AGAAGGCAGGACATTCCAGGCAGAGGGAATAGCATGTGTGCAG^ 
GGCTGATGCCTGTAATCCCAGCACTTTG 

GTGAAACCCCGTCTCTACTACAAAAATACAAAAATTAGCCGGATGTGGTGGTGTC 
GGAAGGAGAATCACTTGAACCCAGGAGGTGGAGGTTGCCGTGAGCT 
'GACTCCGTCTCAAAAAAAAAAAAAAAGAAAAGAAAAAGAAAAGATAAAAG^ 
1 0 ACATCATGGGCAGTAACTGTCAGGGAGATTGCCAGAGAGCTCGGGGTTGGTTGG 
GCCTCCAAGCTCCCTGAACCrrACTCTGTGCAAAGCACTGTGCCAGGTGT^ 

GCGGAGAAGGGTTTTTTGAC CAG CTGG AG ACCTTTGTGGCTGG TGT CATTCTGG AGGTGAAG AGAAGG CCGCT GGAAAGAGCTG CC 
CAGGGAGGACTATGGGAGAACTTGCAGCTCTGGGCCATCTCTGGAC^ 
GAGTGGGGGCGGGAGGGGTCCTTCTCTCTCTTTTGGCCCCTGCAGCT^ 
1 5 CCCTTGGAGGGTTGCGAGGGAAACCGCTGGCCAGCTCCGGGAGATAGACGTAGC^ 
TTCCCACAACCGCAACCCCCTCCTCAGCACGTGGAAAGGATCTGGGGGAC^ 
TCTTTCTTCTCTAGAGAGAGCAGTCCCTCCCGCAGCTCTGACTGGTGGG 
GGCGCCCCXTTCCCCCGTAGGTTCACCCTGGAACTCGGACAGGTGAATTCGGCCTGGGCGAATC 
GGCGGCAGGAGCCCTGGCTGAGACGCCCTCCCGGGGGCAAACCC^GGATAATTATTGATCGGGG 

CACGACAGTGCGCCCTAGCCCTGCTAGTGGTGGTCGACCC^ 

AG^CCGAGCCXSGGAAGACACCCCACCGCCGCAGAAAGGGGGTGCGCGCGGCGCCGGCAC 

CK3TGTGCCGAGGGGGTCCGGGTCCTTCAAGGGCGGTCGCGGAGGGGG CG CGAGGG ACAGCGGGCT AG CGAG CG CGCGCCGGG CGGA 
ACCAGGAAGCGGGGCGCGCTCCGGGTTGCCAGCGGCGCGCGTCCCTGGGCCTCCGCGCGTGAGCGGCGCGCGCGCCrc 

2 5 CCGGCGGGAGGAGAGTCGGCGGCCGGAGCCGTCACCCCGGGCGGGGACCCAGCG 

AGGGAGCGAGCGGGCGGGCGGGCAAGCCAGACAGCTGGGCCGGAGGAGCC^ 
GGGGCGAGCCCTGGGCCCGGGGCCGGGGGAGCCAGGCCGGGGAGCCAGCCCGG^ 
TCAGGGCCGCGCCGCCGCCATCACTCCCX3GGCCGCGGTGGTTGCGCGCTGGGCGAGGGG 
GGCTCCCGGGGAGGCTGAGGGCAATGOSGGGTGCGCGAGG^ 
30 GCTGCGTGCGCXSTGTGCTGlAGTGGGTGTGTGGGGACCCCGCGCCGACCCCATAGGAGCrCAGGGGG 
CCnTCACGTTGCAATTCCGAATATCCCCAGCCGTCTTCCTTG 

TGGGATGAGCCTGCGGGGGTGCCTGGTGa^TGGGGGGTATTAGCAAAGCTTCAAGAAA 

AAATACGAACGTTTCGATAAGACTCTGGGGGGCGGGATCCCACATCAGGACCCGCC 

TGCAGTGTGCTTTTTCITTCTAGTCAGGGATAAAGGGTT^ 

3 5 AGTTGTATCACAAAAGCTGGTCTGAGGCTAAAG^AATCTGGCAAGATGGTCG 

AGAGGCGGTTGCAAAC ACTTGTAG CCAGCAGTG ATGG AAAAGC CAAAGAAG AAAAC CAAGTGTCAGCTTTTAG AAAGACGGG ATTA 
ATGAAAAG CACAAGAATTTTAAAG ACT AAAAGAG AT ACCG ATT CT CT CTTTG C CAAAG AGC CAGAGT CAGTTCTTTT CTT CTTGGT 
CCCTGTGATCCCGCCATACCTCGTATCAAATAAATAGTGCCAAAAGTGGGTCAAAGAATGTTAGAGCTGGAAAGGAACCCAGCGAT 
GATGCCATTCTTTTATTGTACAAATGAGGAAACTGAGGCCCAGAT^ 

4 0 TGAGTATCCAGGGATTGAAACTTCAAGGACTCTGAGATTCG 

ACAATTCCATACATTTTCTCTGCAGTGTCCTCAGGGAATGCCCTTTGAAAG 

TGTCATTTTTTTCAGTCTTCAAGTGGTGTTTATGGTCTC CTGTTGG ATAG CCG CCTGGGCCAT A 

ATAAGCTTTCAGCTGTTGGGTCTACATTATTTCAAATAATTTTGTT ATlti 1 i CAA 

ACATTCTATTG TTGT CCATTTGCTT ATG TTAT ATTGCG TT ATGTT CATTG CCTGTG AGTTCCT AGGTG AGGATGT ATTTT ATTG AG 

4 5 TGTGCCTTTCATGTTTTAAAATTTCArrriGTAACACTGTAAATATTGGAGTG 

GATTGATTTTATTTTAAGGCTTTGGGTTGTTTTGTTTATTC1A 
ATGGACATGGCTAAATAAGAGAAGGGGCTGCTGATTTTAAGA 
CAAACIAAGTCCACACCTTG<3GTCAGGTAAGCGCGTTATCT 

AAGCTCCAGATTGAGACACAGGGAAGGAATGGAGAGGTTTTTTTTTTTTriTTTIT 

5 0 AAAGTTG ATTTCT ATCACCAGT AGTTTGGTT ATTGCATAACTTGAAGCT AAACT CTTTG CTAT ATACCCAAC 

TAGGTAAACTATCCCACCITCTCATCAGTTCrCACCnTrTCGGTGCTTGTA 
CAGAAATTTAAGGTGTGTACTTCATAGTATGTATGATTGTAGGAGAAAAATGTGCTATC^ 
TTTTGGTQlACACTTCCTATCCTGCTCrGACTTTTCATCTCATCTCCATT^ 
ATGTAAAAGCCACZATCAAGACrAAAGTTGCCATTTATTACT^ 

5 5 ATTCAAATTGTCTGTAATTTTTGTTAGGOTATATATTATGAATCCATTTG 

TGACACAGGTAGTCITATTGTAAAAGGAAGACGTTATAG 
CGAAGGAAGCCTAACTATTTTCGTGCCCTACAAAAATGTCT^ 
ACACATTTTTTCCCITTCTGTTAAGAGTGCAGATTACTGCATTATTCCT 
TTCTGATAATTTGAAACrTATTCGTTTAAAAAGCTATCTGGCAATTC 

6 0 GATACTATAAAATATCAAAGTATCAAAATACTTTAACTGTGCAACTTAATAAAAATA^ 

TGTCCTCAAATGTGTGGTATTTAAAGGAAGAAAATGTTTTTAATTCTGAG 

TT T'l'TTT CT CCTGCAGTGTT GGAAACACAACAG TTTT ATG CTT ATTT CTG GG AAT AAG AAAG TCAAACTT AACAATCTAGTTT CCA 

TTGATCAGACAGTTTTTGTACTTAAATTTAAGTTCTAAGC 

AGTGGCCTTATTTGAAGTAAAATCTTGACATT^ 

6 5 GAAGATTTTGCA.CATCTAAATCTATGCTGTTTTTACTAATCA 

ATGAGACCCTCACTTTTGATGGTTTCAGAGTTGAAGTGACGTTTT 
AAGGTATCTTTGAGGTTTG^TTTCCTTTTTTTTTA^ 

TCCACATGTTAACCTT CTTG TAG TGTCACATT'l T 1 ATrTIAAACTTATACAGACAAACATCAGATGCCATGTGGTTTGTCTCTGAC 
CCTTTGCGCTACTTCATGACAGCTTTACTGAGTGAAGCCTGAATACCCAATCGGGAAGGCAAGAT^ 

7 0 TCTTCAAAGTTTAACAGCCTTGGAAGGAGTCTT^ 

TCCTCGCAGGTACTTGGAAAATGTATAGCACTAAGATAGAGTTGAAACATGTTATGTAGAGCCTTGAA 
GAGTTGCTCTGGGTTTACATGATACAGAACCCCATGGTTTATATTAAGTAGTAA 
CATAC^CAGTGTCATGGAAAAGAGTTAAGAACAAGGTCAAATTCAGGGTATGGTATT 
AAAJ^CTCTTCCTTTACTCTCCTGGATCACT^ 
7 5 AGAAAT ATG TGTAGATTGTTTCAGTG ATT AAAG CAAGTGTT CTTTTC CTGGC CTTT AACTCG CTTTATGCT AAGAAGTG T AAT AAC 
T AAAT AGACTCCTAATTGAATTG ACATT C AGATG TGTTTGCAGTTTAGGGGGAGTTT A CCT CTTCT AAATTTTCATTTC GATCCCA
TT ACT AAAAG CAGT ATTGCATT AG CATGGTGCACAATT ATG T ACAATT CAATTTGCAAATG AAG XT AATACTTGGAGT AT CTTGAC 
TT C CT ATTCC C CTGCTG CCC CAAT ATTCAAGAAAGGAGAAGTT AT AATTTGTCACAT ACAAGATGG T'T"l'CrrCTTT AGG TTT ATT C 
TTAATAACTATGTGCATTTTCACATGGATGACACCACTGAGATGATG 
5 TTTCTACCTACTTTTGGAAGCTAGCTTGCAAAATCTACTGC 
ATAACCTTTTAAAAATTTTTGTACTTTTCTAATATGAAGAG 

ACT CATTTTTTAAATT AT ACTAAG ATATAGAG CATGGTTTGAGAGCAGT AATGAG ATTCTCAGAAAATATGAAAACTTC 

GACTAGTTAGAGAAAGGTTGGGGAAGACAAGGGGAAATACATCAATGAGC 

GACTAATATTTTTAATGCCC^TCAGGCTACTAAATGTGAATT^ 
1 0 TAACATTGCTTGTTTTCCCCTGGTAGGGGATTTCTTTGCATA 

TTGTATTAAAAATAACTATTCATCATTCTGGGCTAGAGCTCrcCTT 

AT CATTATGATGC CAC CTTTCGAGAGG TG AGCAG ACAATT AAC AAGTC CTC 

TTCCAACGCCTTCAAGTGTGAAGATAATAAAAACATTACCTAATGCTTACCGT^ 

AGCTCTGGCCTTGTGTTAACTCGTGATTCTCAGAGGAGTGGGTATTAT^ 
15 AAGTAATTTCTCCAAGGTCACATTACTGTTCTTGGCGG 

CTGGCCACTGCCACITTGTTATAAGACATTTTGTTTCTT^^ 

TGAATAGAAAATATTT M ITATATGTTTAGGATTTTTTTAAATGT^ 

TTCTGGTGGAAATTATATTAAAGAAGAGAAAGTCTTAATTTT^ 

TACATCATAGAAGAAGAGCTATAAGCTGTCTCCTTGGCCTCTGTTTACA^ 
2 0 GTTG ACATGGACGTCTGTACTGAGAACTCAACAATCCCACT CATTTT ATTT ACAACTTT ATTTTG CGGGGATTT CTGTC CTATTCT 

TGTTGCATTAAAGCTTTTTAGAGCATTAGTCATGATTTCTG 

TATTTCTTGAGACAGAGCCTCGCTCTGTCACCCAGGCTG 

TTGAAGAGATTCTCCTGCCCAGCCTCCGGAGTAGCTAGGACTACAGGCATGTGCCACCATG 
AGAGACGGGGTrTTGACATCTTGGCCAGGCTGGTCrCGAACTCCTGACCT 

2 5 GGATT ACAGG CAT AAG C CAC CATG C CCAGCCT ATTTGGG CGAATTTTTAAGG CAACTG AACT ATTGT ATG TCT 

GGCCATTAGAAAATAGTCTCAGATTTTTTTT^^ 

GTGAAGTTAAGTCTTGGAAACAGAATTCACAACCCTAAAGCGTAGTGCTTTA^ 

TCAATTCCCTGTGCAGAGTATAAACTTGTTCAAATAAACAGAGGATGT^ 

TTGAATTTTCTTTGAATGTTGACAGAAGTTACAATGTCAGTGGAAA 

3 0 GTTTGTGTCAGTGAGATCCTACAATTAAAGCTGCTTCITTTTTT 

GCCTTTAGGTGAGAGTAAGTTAAAAAGTTTGTGAGAAACTATTTTT^ 
C^CGCCTGTAATCCTAACAATTTGGGAGGCCGAGGCAAGCGGATCACGTGAGG 

GAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGTGCACGCCTGTAATCCCAGCT 

GAAG AATCACTTGAACCCAGG AGG CAGAGGTTACAGTGAGCCGAGATCGCACTACTGTGCTCCAGCCTGGG AG ACGGAG CG AGACT 

3 5 CGATCTCAAAAAAATAAATAAATTAAATAAATTCAGAGCCTATAGCTC 

AAATCTAACCTGGGCTAAATGTTAAGTTACATTCATACATGTCATTGTAGCCT 
AAGTACTAAAAATTGGAATTCATCAGCTTTGTAATATAGCATTCATTCTACAGTATATTT 
TAATGCTAATTCTGTTTTTTTGTTTAATGTTTCTAAAAATCTT 
TACAGAAATCTACAGCTCTTTAAAAATGAAACAGTTCACCCTGTGAATTT 

4 0 AGGTGGGACTCCTATCTCATTTCTGATAGAAAGTGAGGGAAAGCTATGTATAGTGAGAACT 

CATGGAAGGTTTAATATGTAAGGATACAATGTAAATAGTCCCTCITTTTGTATT 
TTTGTCACTTAATGATACATGGTGACAAAGTTCTAAGAAATGC 

ACACAAACGT AGATGGTGAG CC CT ACTGCACAC CT AGG CTGT ATGAT AAAGCCT ATTC C T AGACT ACAAACCTG TGG AACATG 
TATACTCTATTGAGTACTGTAGACAATTGTAACA(^GGGTAAGTATTTGTGTATCTA 

4 5 TAAAAACATGGCATAAAGGATAAAAAATGGTACATCTATATAGGGTACTTAACAGGAAT^ 

GGGCAACTCAGTGGGTGAATGGTGAGTGAATGTGAAGGCCTAGGACATTACTGTACATTACTGT 
TT AGCCT ACATT AAAT GT ATTTTG AAAATTGri"!"^ 
TTTACTTTTTTTITTTTTTTCTGAGATGGAGTCTTGCTCT^ 
CCTCTACCTCCCGGTTCAAGCAATTCTCCTTCCTCAGCCTCCCGA^ 

5 0 TTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAG^CTG^ 

CCTCCCAAAGTGCTCGGATTATAGGTGTGAGCCGCCGTGCCCGGC<ZAACTGTTTACTTTTTTACAC 

ACACTTAGCTTAACACATAAACATGTACAGCTGTGC1AAC!AATATT^ 

AATTTTTAATTTTTCAAAACTTTAAAAACTTTTTAATTTT^ 

ACATT AGTCTAGGCCTACACAGGGTCAGGGTCATCAGTATCACTGTCTTGCACC 

5 5 GGCAGTAACACATATGGAGCTGTCATCTCTTGTGATAACAATGCCTCCTGAAGGACCTGCCTG^ 

TTTTTTAAGTAGAAGGAGTACACTCTAAAATAACGATAAAAAGTATGGTAAATACATA 
TATCAAGTATGTACTGCACGCAATTGTATGTGCTAGACTTTCACATGACTGGCAGTGCAGTGGG 
CAAACATGTGAGTCATGCATTGCATTACAACATTATGATGGCTACAGCGTCACTAGAG 
ATCTTATGGGACCACAGTGATATATGTGGTCTGTCATTGACTGAAACGTCATTATGTGGTG 

6 0 AGTGTAGTCATGTTTTACAAATGAGTTTATCTTGCAGTGAGTTTAAATATCAA^ 

AGATTGGCCTTTCTCCTCATGTTGAAGGTATCAAAACAAAAT^ 

CTGAGTTGGTGATGAGGCAAGATCCCTTGACTTTGCAATACCAGAAAAATGCACATAGT^ 

CTATGTCTCCAGGTGATCCCTCCCACCTCCACCTCCACACCTCCCACCTCAACCTTCCACAGTATTC 

CTGCACO^CCAAAAAGCATATAATTTTAAGGGAATGATTTTT 

6 5 GCCCAGGATGAGTGAAGTGTGCCAGGAGTGGGTCCTTAAGTAAGAAAACGTATTACT^ 

AATGTGATATTCTCAAAATTTATGGTTTTGAATTTTATTTATTATTTTAA 

TATGTTGCCCGGGCTGGTCTCAAAGTCCTC^GCTCAAGTGATCCTCCCACCTCAGCCTCCAGAGTAGATGGGACT 
CCCCTGAGCCTGGCTAGTTTIGAATTITTATTACCCAAATGTGAAG 

TATTGGTAAATTTGAACAGGATATAAGTGGCTTTAATAACTAATAAAAGAAGATACATCTTATAGATAC^ 

7 0 TGT ATACAT AAAT ATTT AAAACTTCTTT ATG CCAAAAAATGC CAGAAA CCAAAGCT AAG TGACAAAAT AAGAAAAAT ATTTCT AAT 

ATATATGACAAAGCATTATGGGTAAATATTCGTAATATATAAGTAATCTATATAAGTAGTAAGAAAAAGATGATTCCC^ 

AAATGAACAAAAGAAGAACTACAAATAGCCGATAATCATATGAAACAGTGTTCACCCATACT 

ATGACAAGTTTTTTC^GTTAACAGATTTAT^aAGACT 

GCACTTTCAT ACT CATTGGTTG AAATGT AAATGGGTG CTI'TriTTGGATGGCAGTTTGAT ACATGTATTGAGAG CTT AAAAG AGTG 
7 5 CATACTCAAGCTGGGCATGGGAGCTCAGGCCTGTAATCCCAGCACTTTGGGAAGCTAAGGCAG 
AAGACCATCCCGGCCAACATCGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCT
CAGCTACTCGGGAGGCTGAGGCAGGGAGTCAGAGGTTGCTG^ 
GACTTCGTCTCAAAAAAAAAAAAGAAGAGTACATACCCTTT^ 
AATATGTTTAAAGATTTGTACAAATAGGCATGGGCAAGGACTTCATGTCTA 
5 GACAAATGGGATCTAATTAGACTAAAGAGCTTCTGCACAGCAAAA 

GAAAATTTTCGCAACCTACTCATCTGACAAAGGGCTAATATCCAGAATCTACAA 
ACAACCCCATCAAAAAGTGGGCGAAGGACATGAACAGACACT 

TGCTCACCATC^CTGGCCATCAGAGAAATGCAAATCAAAACCACAATGAGATACCATCT 
AAAGTCAGGAAACAACAGGTGCTGGAGAGGATGTGGAGAAATAGGAACACTT^ 
1 0 CATTG TGGAAGTC AG TG TGGCG ATTCCT CAGGG ATCT AGAACT AG AAAT ACCATTTG A C CCAGCC^ 

C CAAAGGACT AT AAATCATGCTG CT AT AAAGACACATGCACACGTATGTTT ATTG CGGCAGT ATTCACAATAG CAAAGACTTGGAA 
CCAACCCAGATGTCCAACAACGATAGACTGGATTAAGAAAATGTGGCACATACACACCATGG^ 

ATG AGTTCATGTCCTTTGT AGGGACATGGATGAAATTGGAAAT CATCATTCTCAGTAAACTAT CGCAAG AACAAAAAAC CAAACAC 

CACATATTCTCACTCATAGGTGGGAATTGAATAATCXX3AA 
1 5 GGG TGGGGGGAGGGGGGAGGGATAGCATTC^KIAGATAT AC CT 

CATGTATACGTATGTAACTAACCTGCACAATGTGCACATC^ 

ATTTGTACAAAAGGAGTGTTACTTAAAATAGCAATAAATTGGAAAAA 

TTATTACACCTGTGTATAGCTGTCCAATAGAATACTGTCCAGCTATTTAA^ 

AATTCACTATATATTACTAAATGAAAAATGAAATTACAAACAATCATATA 
2 0 GATTTCTTCIGGGTCGTGGAGTTCCCTGTGTGTGTGTGTGTGTGTG 

CATTTTTCTATAATAAACAGATTATGGCTGG^ 

CACTTGAGTCCAGGAGTTCAAGATCAGCCTGGGCAACATAGCAAGACCCTATCT 
TGGCACATGCCTGTXXSTCTCAACTACTTXWGAAGATGAAGTAGGAGGATCACTTGAGCCT 

GATTGAG CCACTG CATTCGTC CTGGGTGACAGAGTGAGAC CTTGTCTCAAAAACAACAAAT AAAAACAAACAAAAAAACAGATGAT 
2 5 AATACAGGGAATCCTTTTGTAATCAGGAAAAAAAAGTTATC^ 
GTTTTTTGTTATCAGTGTTTTCTTTTTAGAAGCAGAAAAAAG^ 
TTTGACATCITTTGTTCAACTATGTTAAAATCATCATAAT^ 

TGTT ATTT C CTGTGCTTCGTGGT AGC CTG T AT ATTTTTACAC AG AAG CTT ATGGAG AATT AT ATTGTT AT AT AAT AT AAT AAATTG 

TTAGGGAGACAGAGGTGAAAGGATTGCTTGAACCTGGGAAGTGGA 
30 CCGGGCAATGGACAGAGACNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNIWNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 
40 NNNNNNNNNNNNNNNNNNNNTO^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

KNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

ATAAAAATACAAAAATTAGCTAGACGTGGTGGTGTGTGCCTC 

CCCAGGAGGCAGAGGTTACAGTGAGCTGAGATTGCACCACTGCACTCCAGCCT 

4 5 ATAAATAAAAATAAAAAATAAAAAATCTATCTTAACTTGGATTGTGCA 

TATTTTAAGCCAGTTTGGGGAATACCAAAACATTTTTAAAAATCACGTAGTACTT 

TAG TTAATT AAAATG C^TGAAGTAAATAAG TGTTTTTCTGT ATT AAAGAT AAAACT ATAAAC^ 

AAAT AAAT CATTCT CTTCTITTTGGTT ATTTTTTG CTT ATCACCTCTT ACCTCCTCT ACTT CCT AAAAACATG AT AATCTCGTTT A 
ATTCAAAATTATGTTTTCTGGCCATACATGGTGGATTATGCCTGTA^ 

5 0 AGTCAGGAGGTCGAGACCAGCCTGGCCCGCTTGGTGAGACCCTGTCTCTACTAAAAATACAAAAAT^ 

TGTCTGTAGTCTCAGGAGXSCTGAG^CAGGAGAATCATTTGAACA CAGTGAG CTGAGATTGCACCACTGCA 

CTTCAGCCTOTGCGACAAAATGAGACTCTGTCTCAAGAAAAAAAAAAAA 
TGTGTAGAGTATGTGGAATATTTATGTTCTAGACCAGCACTGTTCAAT^ 
CTGTATCAGAATAAAAAGTAAAAATAAAATTAATTTTAAGAAT^ 

5 5 AT AAAAATTATTG ATTAATGAGGTATTTTGGGCCAGTTTGGGGAGCC AAAG CAGGAGGATCACTTGAGTCCAGGAGTT CG AGACCA 

GCCTGG^CAACATAGTGAGACCCAGTCTCTAAGAACAACAACAAGAAAATTA^ 
TACTTGGGAGGCCAAGGTGGGAGGATCACTTGAGCCGAGGAGGTTAAGGCTGCAGTGAGT 

TGGG CAACAG AGTGAGACC CTGTCTCAAAAAAAAACAAACGAAC CCAAAAG C CGACAAAAACACTGAG ATATTTT AC ATT CCTTTT 
TTTGTACGAATGCTTCAAAATCTGGTGTGTATCTTACAGTTATAAAATCTCTCA^ 

6 0 GAACTGTGTTCAGATTTCATAAAACTTTCAGTTGATAAAGTAGATTC^ 

AAGTATTAAATTGAACACCAGTTTCCAGGTTTACATTT^ 
CTAGCTACATTTCAAGTGCTTAGGTAGCACATGCTGCTAGAGGCT 
AAATGAAATATAGGK5CTGTCTCCTGCTTTATAACATCTGGCTTACAATCAAC^ 
CCATGGCACATGTTTACCTGTGTAACAAATCTGCACATCCTGC^CATGTATCCT 

6 5 TCTTGCCCTGTGGGAACTGAAGCCACTGCTCTGTAGGAAATAAAAGTGTCITr 

AGTAGCTGGGATTACAC^CACCTGCCACCACGCCCGGCTAATTTTTGTA 

CTGGTCTCAAACTCCTGACCTCAGGTGATCCACCCACCTTGGCCTTCCAA^ 

CACAAAAGTGTCTCTTACCTAAAGAAAGTGATGCCCTTAAAGCTTCCTACCATrCC 

7 0 AGATGCACCCTCTTG lTlXi AAAACTGCrGCATCCTTG CATGCCTGAAGCATTACCC ATGTCCTTG AAG CTCCTGCTTTCAGAGGAC 

CTCTCATTTCTTC^AGTAGAAAAGCCCTCTCTC^CCAGGTGCAGTGGCTCACGCCTG 
GGGAGGATCGCTTC^GCCCAAGAGTTTCAAGTCAGCCTGG^ 
ATAAACAGCCCACCCTGTTTCAGGTATACCTCAAC^3TCAAATCJi 
GTGTTGTTTAACTTTATTTTTGAAACTGAATTTGTCT 
7 5 AAAAT ACATCAAG TTGATG CTT AAAATTGGGG ATCTAGTT ACAGATGAAATTTTT C CTTTCAAAAGAGAGTTCCCCCCAGTTTTGC 






WO 03/008583 



PCT/US01/51291



TCTAGTCTTAGGAGATACTTAAGGGAGGGGAATCATTTTTAGTTCCCCCTAGGTGAGGTGGTAGCCT 

CACCATGACTTXn^AACACCACTGTGTGAG CTGGAAT CTAGG 

TAATGCCACTTCCTACTTAGTTTATCTTAATGGTTTTGTTTG 

AGTGCAATGGCGTGATCTCAGATC!ACTGCAACCTCCGTCTCCCAGGTTCAAGTGATTCT 
5 ATTACAGGCAAGTTCCACCATGCCCAGCTAATTTTTGTATTTTTAG T AGAGACAGGGTTTCACCATGTTCX3TCAGG CTGGTCTCAA 

ACTCCTGACCTCAGGTGATCCTTCCGCTTCAGCCTCCCAAAGTGCTGGG ATTACH 
CTTTTTAAAGGATACCTGATTTCAGAAGGAACCTTTGAAGAAATGGCTAGAAC^ 
AGCCAAATTAGTTGACCCTTTTAATGTTAAATAAAATATTCAA 
TATGTTGCCAAAACTAGGTTCATTATAACATTGGAAAC^ 
1 0 GGATGATTTCCATAAAGGGAAACAAATTCCTTTCTTCCAAATATACTGTGACAGCCT 

AGATGAAAATT AAAGAG CGGACACAAAGTTGTT ATGCTCT AAGGAGTCATT CTTTTGATTTAAAATTCATTAT AATTCCCTGG TTG 
TCTAGTGGTTAGGATTCT^CACTCTCATAAAATTCATTATATTGATTCTATAA 

CATTTTT AAGTGTTT CAAAAT ATTTTTGTT AT ACCT AGG'rrrr CACTGTGG TGGTATTTGTTGTTTTAGT AGAATTTGACCT ATAT 
TTT ATTCT ATGGG AAAAAGAAG AAAGT AAACATTGG CTAG CT AT CT ACCA CCCTG T ACTG CT ATTTG TGTGTGGTC CCATTGTG AA 
1 5 TTTTT AACTCCATGCTCTAT ATTTTT AGTGCrTTGTGAAGATTGAATCTGC^^ 
TGTGGCAATCATGAAAGTTATCCTGTAAATAAAACTGTATGTCCTGGA 

TGTGG AAT CCTTG ATTGGAC CCG ATriXJTlT'IGCACAT ACAAT AG CT CGAT AAAGATGAT AGGCTTTCAGCAGGGATGAACAGATG 
TTCTGGTTTTATGAGAAATTGCTTTTGCATCACCAAG^ 
TAAAACTTCTTCCCAGATACTGGTTTGAACACACAGACTTTGTCAT^ 
2 0 CCCARCAGAAATAAAGAGAAGAAGAATAAAAACAAAAACTAGAAAGTGTAG CAGGGG CACAATT ATTT ATTT AAACTTT AAAG TAT 

TTAATATTACATACCAGAATCTAAGTTACATAAAGCATGTCTGTTTTGTTCACTGCC^ 
CATGTAATAGTTACTCAATAAGTATCTXSTTAAATAAGTATAGTAGTCCTGCCTTATCCC^ 

CT GAACCCT ACAT AT ATAT ATTGTTTTTT CCT ATGCAT ATATAATT ATG AT AAAGTTCAATTT AT CACAG TAAG AG AT 

T AAGAACAATGAT AATAAAAT AGAAGAAGT ATGT AGT CTTTTTTTT^ T CTCG CT CTGT C 

2 5 ACCCAGGCTGGAGTGCAGTGGGGCAATCTCCGTTC^CTGCA 

CAGTAGCTGGGACTACAGGCATCTGCCACCACACCCGGCTAATTT^^ 
CCAGGATGGTCTCGATCTCCTAACCTCGTGACCTGCTCGCCTCAGCCTCCCAAAATG 
TGGCCGAATGTAGTCTATCTTAAAATACTGTATCTTACTATGCTGTACT 
AAGGAAAGACTATTGTATTCCCCTTTAATCTCAATACAGCAGGTC 

3 0 CnTTTACTCTGGAAGAAAGAAAAGAGAGGGGGCrGGGTGCAGTGGCT 

AGGGTCACTTGAGCCCATGAGTTAGAGACCAGC 
AGAGAGAGGAGACAGCAAATATCTCTTTGTTTCCTTTGGATTTC 
TGGTTTTTTAACCCCATTTTAAAGATTGTGGAGTTACGTATATATATCATA^ 
AGTTTCTCTGTGTCACCCAGGCTC^AGGTTAGCTCATTGCAACCT^ 

3 5 GAGTAGCTGGGATTACAGGTGCACGCC^CC^TGCCrGGCTAATTTTGGTATTTTTAGTAGAG^ 

GGTGGTCTGGAACTCCTGACCTTAGGTGAACCGCCCTCGGCCTCCCAAAGTGCTGGGATTACA^ 

ATTTTCACTGTTTTAAAGTACAGTTCCGTGGCATTAAGCACATTCACA 

TTTTTTCATCCTAAACTAAAATCCTGTATTCATTAAACA^ 

AAC CGC CATTCATTCTTT CTGT CTCT ATG AGTTTGACTACTC CAGG CAT CT^ TGGAATCAT ACACT ATCTGC C CCTTTG 

4 0 TGTTTATTCCTTTTTTTGCAACATAATTCTTTTGGTTGGAGACT 

ACCTAGACCTGGCCAGTGATGCATGTGCAATGAGTATGCAAAGACAAGTTAATTC 

TAAAGCTTTCAAAATGTG ATTC CT CAT CTGT ATGCCTGTGGCTTATCTACTCCAAG TCTTG CTG AT AAACCTG CACTGAGAGT AAT 
AG CAAAG ATT AGT CTTT CTC CATGTTT CAT AAAACAAATTGTTCTCATT CTT ACAG CCTGATT ATG AACTGGAAAATTACAGG TAC 
ATTGTTGTAATTGTTTATAGAGACAATGAGACATACTCTCCACA 

4 5 AG AAACGACAGTTAAGAAAC T TTTTTTTT' r TTAGACGGAGTCTCGCTC CATTGCCAGG CTGGAGTGCAGTGGCACGATCT CGGCTC 

CCTGCAACCTCTGCCTCCTGGGTTCAAGTGGTTCTCCTGCCTC^GCCTCC 

CAGCTAATTTTTGTATTTTTTGTAG AG ACAGGGTTTCACCATGTTGG CGCCTG 
CCT AGG C CT CCCAAAGTG TT GGGATT ACAGG CATG AG C CACCG C AC CG A C CTI^C"rriT'I"ri' J rr CCAlT"rri AAAAATTTTCTATTC 
TCATTACATCTTCCTTTCTCCCTACTTTTCCTTCCCT^ 

5 0 TTTTTCATTCTCATTTTTCTG AG TAG TTG CTG AAGAC ACAG CCTGACC CT AC<XX3T CCAAAT AT AG ATGAAACAATCATAGCT CTC 

TAGCAGTTCACCTGGAGGGAGAGAGAAAAAGAGAGTAAGAGAACCAAAATGAAGGGGAAATAT^ 
ACAGAAACTAACTTGTTGAGGAAAAAGAGAAATATACTGGGAGTTCTGACTGAT^ 

TTTGAAAAGT AAT CT ATGG AATTT AGCTTTAAAG CAGG AT CTTGGG CTCTAG ACTACTTT ATTATT ATTATTTTTT AAGAG ACAG A 
CTGCACTGGTGCAGTCATGGCTCACTGTAGCCTCAAACTCCTGGGCTCAAGCCATCCTCCC^ 

5 5 TT ACAGG T ACAT AC CAG CACGC CTG G CT AArrrHT AAlTTri ATI 1 1 U IG T AG AGATG AGGT CTTG CTGTGTTG C CCAGG CTGG TCT 

TGAACTCCTGGCCTCAAGTGATCCTCCTGCCTCAGCATCCCAAACTGCTG^ 

GGCTAGTAAAGTGATTITCTACCACTGTATCTGTGCTCATTTACA^ 

TAACATGAGGGGTCAACAGGTAGATTCGGAACCATC^GAGTTTAGGCCGAAGCCCATA^ 

TTTT ACAATGT CTACAAATG CAGTTGTTAAAG TG CCAGAGACAG AAGTATTGACAATGAGAAATTATCCAAAACCAGGCCAGG CGC 

6 0 GGTGACTCACG CCTGT AATC CCAG CACTTTC^GGAGG CAAAGGTGG C^U^G ATG ACTTG AGGCCAGGAGT CAC AGGCTG CAATGAGTT 

CTGTTCATGCCACTGCTCTCCAGTCTGGGCCACAGAACAAGACCCTGTCTCT 
CTCCTATTC<5AAAGGTGGAAATGGTTGTGATGGAATTCAGCAAC 

T CTTTCAGT CCTTCCTGTTT AAAT ATGAATGG AT ATGT ATTGCCATT ATTG C C CTG C CCCT ACATCTC CCAAATTTAG ATTG CTAG 
TATTTATGTAAGTTTACTAGTCACAAATAGATTCTTGCATTTCCCATCTCTTGT 

6 5 ATTGGCACTT ACTT^GGTTTTCrriT ACAG CAG CAG CTG CT AATT AGTG ACAG CTTTTGAGCAAG CAG CTTGT CATTTTTGG CAG AG 

ATAGGGGACATCCATAGCAAGTCAAGACTATATTITCATTTGCTAAA 
AGTCTCTAGTCAAAGGTGGCTGTGTTTACTGAATTATCTTCCTGAGT^ 

AAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGGTTCAACGTCTATCAGACTGATGCCATAAGTCCT 
TTAATCTACATT CTGGGT AT AGAAC CT CAT CATCTTGTTTTGGCACAATCATAAATCTTTCTGGATTGTAGCTAGTGGCATACA 
70 GTAACTGAATGAGTGAATTGTTGACAAATGTGCTTTTCGTT^ 
TTGCITAGAGAAGATCCTTTCTTTGGAATTTT 
AATAAAGTGTTAGCTCTATTTTACAAACITGTCACGATGTT^ 
CAATGATGAGTCAAAACCTCCGTCATGGTTTCTATAAAGTGTGATTCACT 
CTCCACAGT^rTAAAAATATCCTCAA^ 

7 5 CTATAAAAATTGTATTCAATAC^CAATTACTAAATACCAACCATGTATATCAAGTACTATAT^ 
AGACATGGT CCTTG CT CCAGGATGG CTT ACAGTC^ ATT AGTTGG CCAG CAGAT AACTT AGTTGCCTTGG TT CTT ATG T CCAG AGTG
TAGGGTAAATTCTAATTCTAGAGGAGGAAAACATCCCATCTTITAAAAATAAGA 

T ATTGTACTGATT CT ATTTCT AC CT AAAGGTG TT AAAG ATTGTTTGAGG CAC^ CCT AGG CTTGTGG AACTGCCAGG CAT C CTT 
GTCCCAAATTTCCCCTTGATATATCGCACAGCACTGTTAAATTCAGAAGTTGC 
5 TAGCTTGGAGAACCCAG^TGATCTTCTAGCATTC^ 

TTCTTITTTTCTTTTTTTI 111111 1 G AGACGG AGCCTTC^CTGTCACCCAGGCTG<^ 
CAACCTCCA.CCTCCTGGGTTCGAGCCATTCTCCTGCCT 

TAATTTTTTGT ATTTTT AGTAG AG ACGGGGTTTC ACCATG CTGG CCAGT CTGG TCTTG AG CTC CTGACCTTGTGATTCGCCT ACCT 

TGGCCTCCCAAAGTGCTGAGATTAOVGGTGTGAGCCACCGCACCCAATCT 
1 0 GTAG^AATTTATCATGTATACCTTGTCTCCAACCCAATGC^ 

ACAGGTATCCAGATGGCAGCTAACTTGAGTATCCTTTGrc 

CTATTAATGACATTTCCCCCTGTATGTAAAATGTTTTCTGCCATTAGAAAGCCT 

ATGGATAATAATGTAACTAATGGAAGCCCCTTAAGGAATTTGCGAACAAG 

TGTTGCATTGCTAATTCTGACCAC!AAGTGGTCAJSGATCTAGGTTTTC^ 
1 5 TTATTCTCCTCCCACTCACTTTTACAATCTAAAGCTTTTGGCT 

C CAAATATGGCAT C CT TTGGTTGTGCTTGAAGCTTTGTCTTCTT AGTGCT AAC CCT CAGGGAAGGG AGT CTT ATTTAAAG CCTTTT 

XTTTTGGATTOTCAAATAATCTTTTTGTATTCATTTC 

TTAAACTGCAGGTATCACTGTCAGGAGAAACTCATTTGACTAGCTGATGAT 

TATCACCCAGGCTC^GTGCAGTGGCATGATCTCCACTCACTGCATCCTCC 
2 0 TCCCGAGTAGCTGGGACTACAGGTGCCTGCCACCACGCCTGGCTAATTTTTTGTATTTT^ 

GCCAGGATGGTCTCGATCTCCTGACCTCATGATCCGCCCGCCTTGGCCTC 

CCGGCCTCGTC^TGATTTTCAATAAGTAAACCACTGTC^TTCACAGCCT 

TACTACTACrGTTATTATAATTCTATTACTACrGCT^ 

TAAGGTTTGGTGATGAAGTATAGGAAGGGCTTGGAAATGAAGTCATTTATT^ 

2 5 TGTGGGTGGCTTAATTTAACATAACAAACTAATACGTTTTTCCTTGAACTCCATACT 

GTGGGGTTAAAATACTCTCCATCATCAAAGCCCAGACATTTTAGAACAAG^ 
ACCACAACCTGAAACTAACGAGTTAGAAACCACATAGA^^ 

GTAGATAGTTAAGCCC^AACCGATCTC^TTGAGAAAAATGTTTAAAATCATCATTTTCCCTTCCT 
AAAAGGGTCTAATTTTGCTCCTCTGGGAAGATTGAATGCXST^ 

3 0 GTTTGGTGCAGTGGCTCACGCCTGTCATCCCAGCACTTTGGA 

GCCn^GC^CATAATGAAGCCCCGTTTCTACTAAAAATACAAAA 

CTCAGGAGACTGAGACAAGAGGATTGCTAGAGCCTGGGAGGCAGAGGTTGCAG^ 

GGTGACAC^GTGAGACCCTGTCTCAAAAAAAAAAATTATATTTTCTAT^ 

TCCATGTCTAATCTAAGAAACTTATATATTCATTGTATATATATCTACTGGTTGTCAGCATTG^ 

3 5 TTGCAGAATGACTGACTTGACTTTATTTTGAGAATAAGT^ 

GTTGCATTTATTATCTCTAAAATACGATCTTCTTTTTCT 
TGCAGTGGTGCAATCTTGGCTCTCTCCrACCTCCACCT 
TACAGTCACGCGCCACCACACCCAGCTAATTTTGTATATTTTTGGT^^ 
CTCCTGACCTCAAGTGATCCACTt^CCTTGGCCTCCCAAAGTGCT 

4 0 ATTTTTAGATGGAGACTTCTGGACATTC^TATAAAATCTGGATGGT^ 

TAACACTTCCACCACGTTGAAAAAGCGTAGCTCCATATGATCTTACTC 

ATTCTGGGTCTCACATTTG CTTTGTTTTTAT CTGGAGCTTT ACTC^ AAACT AAATG CT ATTGCGT ACT A^ 
GGAAGTTGGCAAGAATTCGCTGCrTTCTGTTTGGTCTC 

CCGX3TAACAGTGCCGCCTGTGAAAG AACACATACACAGAAGGGCTCTG CTGATG CCTTG CAGCATAAGCTGTTA 

4 5 GTTAAGAGGTGGGAGCCTTTCAGTGCATCTCTTGTGTGCT 

GATCTACTATTAGAGGTATAGTGAGAGACGGTCTGTTACCAGAAAAAGTAGGTT^ 
CCCACTATGAGTGCTGTAGTAAATGTGCACAATTATTTTGTG 
TTTT ATACCTGG AT AACCGTTG TTGTTTTTTTTTT^ 
ATGTCA.TTTCITCTCAGAATTAGAATAAGATTTTTTTTTTT^ 

5 0 TGCAGCGACACAATCACGGCTCACAGCAGCCTCAAACTCCTGGGCTTAAG CAGTTCTTCTGCCTGTCTCCTGAGTAGCTAGAACTA 

CAC^CATGCGCCACCACACTGGGCTAATTTGTAAATT^ 
CTGGCCTAATCAGTCTTCCCATCTCGGCCTCCCAGAGTGCTGGGT^ 

ACTGCAGCTACTAACTCCTAGGCTCAAGTGATTCTCCCACCTCAT^ 

5 5 CTACTACTTTTTGTTTTGCTTTCTTTGTAAAGACAGAGCCTTGATATGTTGCCCAA 

CCTCCTGCCTCAGCCTCCCAAAGTGCTGAGATTACAGGTGTGAGCCACT^ 

ACTCGTTGG ATCCACTTTTTCTG CAGG ACTTG TTTT ACTT AAACACAGG T AGT AGG ACAC C CCACAACTTGC<5 ATT ATTTTT GGAA 

AGCTGTAGTTGCGTTTCATATCACCCAGTAAGCAAACAAGGG 

TCAGTTTTGTATTGTTTTAACTAAAACATATTCAGTTCAGTTTTTATTT^ 

6 0 AAAAATG AAAGATTTT AATTCATGG CAAATT AAG AG AACACAG CT AC CT GG AAAACGGGG CAAACTG CCATATCCTT CTG T ATTT A 

GTTTTGGTTCGGTGGATAGGTGTGCTTTTCTAAGGTATTATTGGTCTCT^ 

GAAGGTTTGTGTTGATGTACATAAAAGACATAGAGATTAGTACAGAGAGAAACATCTCAGG^ 

GTGCTTCTtSTTCTTGGAAACTGGTCTGATCrGTTCCACTTTCCT 

6 5 AGCGAAGTTAGGAACGAACAGCTGAGGTTTCCTGTC 

AACTGTCAACC^CCTTCACGTAAGCCAGTTGTGTTCAC^TATGTGATTAGAG^ 
GTCTCACTTTGTGGCTCAGGCTTGAGCATAGCGGCGTGATCATAGCT 
CCTCCrGAGTAGCTTGGACrGOUSAAGTGAGTCACCATCCCCAGCT 
AGTGACTTTCTGTGTArrrCGTTCTATTTACAGTTGGTAC^GTTTGCT 

7 0 ATAAATATATGTCAGAATTACAAATTAATCTGCTATTAACAGTCTTAAATCACT 

GAAAGTGAAAGAATTG CATTTTAAAAAAAATTTTTT T AT AATT CTGAATTTTT ACTTT AT AATT CTGAATG TT CT ACTT AAAATT A 
AAATTTTTTTATAATTCTGAATTTTTACTTTATAATTCTGAATGTTCTACT 
CCTGGATATTATCATGTTAGGAAATATlT'CrrCCTGTGTTTTATGTCTCTTTAAA 
ATTCCXCATTGATTATTAAATTTATATCTTAAGTTTT^ 
7 5 AGCCCCTTCACCTAAATGTAACAACTGTTTACACTTCTCTTTGACCTT^ 
TTCATTTAACATTCTTTTATTTCTACTCTAAAAAAATGTGTCTACAGGCCAGGTGTGA
GGGAGGCOSAGGTCGGAGGATCAOVAGGCCAAGAGTTGGACACCA^ 

AAAAATTAGGGCTGGGCGG<^TGGCTCACGCCTGTAATTCCAGCACTTTGGGAGGCCGAGGTGGG 

TT CGAAACCAG CCTGGC CAAGATGGTGAAAACC CGT CTCT ATTAAAAAT ACAAAAAATT AGCCGGG CATGGTGGTGGGCACCTG T A 
5 ATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGGGGCGGAGGTT 

CTCCAGCCTGGGCJUITAAGAGCAAAACTCCGTCTCAAAAAAATAAAAATAA 

TCCCAGCTGCTCAGGAGGCTGAGACAAGAGAATTGCATGAACGCGAGA 

TTTAACCCTGTTGAAAGAGCAAGACTCAGCCAGGCGCGGTAGCTCACGCCTGTAATTCCAG 

ATCACAAGGTCATGAGGTCGAGACCATCTTGGGCAACATGGTGAAACCCTGTCTCTACTAAAACT 
1 0 TGGCGCGTGCCTGTAGTCCCAGCTATTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTG^ 

GATCGCGCCACTGCACTCCAACCrrGGGTGACAGAGTGAGACTCCATCTCAAAAA 

TGGCAGATGCCTATCTATGCTCCCAGCTGCTCAGGAGGCTGAGACAAGAGAATTGGTTGA^ 

CCAAGATTGTGCCACTGCACTTTAACCCGGTTGAAAGAGCAAGACT 

ATGCGTCTACATGGTTTITATATTTCAACATAGCTTTTATATTTCATTTAACT 
1 5 T AGT CT AATCAT C C CCCT ATTTTGG ACATTTTGGTT ATTTCTGTTTGT CC CG ACGGT AT ATATC CCTGG<1AATTGGCCTG AATCAA 

GGCACTTCTCAAAGAAAATTTTGCTAGGTAGAAACTAAACACrGTAATTCTCAATGGTA^ 

AAAATATTGGT AACATG TGTGG ATTT ATT AATAAATTGCCTG TTT^ CAAACAAGAGG AAAGGGATT AT AT 

TTCATTTAAGCAGTATAGAAACAGGGTATTGTAGTAGCTGAGGAGTTTTTTTCCTC 
GATTCTAAGATCAGTTGATCTTGAATTACAGTACAAATATAAACCTTCACTAGAAA 
2 0 ACTTGTCATCCCAGCACITTGGGAGGCCGAGGTAAGAGGATCACTTGAGCCCA^ 

ACCCCATCTCTAAAAAAAAAAAAAAAAAAATTAGCCGGGCATGGTGGCACACACCTGTGGTCCCAGCTACTCAGGG^ 
GGAGGATCACCT<3AACCCAGGAGGTTGAGACTGCAGTGAGCAGTGATTACGCCATAGCAC 
CTGTCTCAAAAAAAAAAAAAGAAAAGAAAAGAAAGAAAAAAAT^ 
TTATTTTTCCGTAATGCAGATGGTCATTAATTTTGGGCT^ 

2 5 TTCCCAAAGGGATCGCAAATCCATTATCTCCATGGATTTGGAC^TGGA^ 

ACAAGAGAGCCCAACTTGCATGCTGGTGGCCTGCAAGCTGCAGCACAACGCCTAAGAAGTCCAGAT^ 
AGTTTCAAAGTGAGAGTCAAACCTCTGCTTTCATTGCTTTGTTTTTC 

AGTGTACATGACTTAAAAGTGGAAAG AATAG AACAATG AGTACT CG CAT ATACTTCACCTAGGTTGAGCCAGTGTTCACAGTTGTT 
AGATTTAC^TTTCCCACACTCCCCACCACrcTCACTCTTCCGTTT CTCCTGCCAAAGTAAGATGCAGACGACATGG CCATTTACCCT 

3 0 CAAATACITGACAGTGTATCTACTCCCAGCAAGGGCAT^ 

TGATAAAGTAATATCTAATATATAATGATATAATGGCCTITAGAGCTGTGGGTGTTTTTGTT^ 

GTTGCCCAGCCTGGAGTGCAAGTGGCGCAATCTCAGCTCACCACAACCT 

TTGAGAGTAGCTGQUlCrACAGGCGTGTGCCACCACGCCC^ 

CACAT ATATTTTTT AAAGTTT ATTTTT AAGATTTTTT^ CATT CTTGGGTGTTTCTCG CAG AGGGGGATTTGG CA 

3 5 GGGTCACAGGAGAATAGT<X3AGGGAAGGTCAGCAGATAAACAAGTGAACAAAGGT 

GCCITCD3CAGTGTTTt3TGTCCCTGGGTACTTGAGATTAGGGAG 

TTAACAAAGCAC^TCrTGCACCGCCCT^AATCCArrCAACCCTGAGTW 

ACAATTTI^GTATTTTTAGTAGAGACAGGGTTTCACCATGT^ 

CTTGGCCT CCCAAAGTCCTGGGATTACAGGCATGAGCCACCACACCCAGTCGTGTTTTTGT^ 

4 0 CTCTTCAGTCTCCTTTAACCTAGAACAGTTCCACCTGTTr^ 

GTCTGATTGTTTCTTCATGATTTAGTTTGTGTAT^ 
TGGGGATGTTTGATCACTCGGTTAAGGCATTGTCCGCCAGATTTATCC^ 
ATGTGAGTAGTACTTTGAGACTAAATGCTGTTCCCCAACAACTGTCACCTAATGGTTTTACT 
CTACTCTCTTTATTTTTGAGATTGTGCATCTTTGGCCAAGACTTTATATCCTATATTTA^ 

4 5 TGCTGATTGAAACCTGATTTTTATATTTAGTAAGTAGATCCTTTCT 

CTCCATTCTCTGTTCTTCCTATTGTAATATGCAGTGATTTGGCTGTGACATAAAGATTAATGGCGGAT^ 
TCATCTCTATGTTGCCrrGGGCACATTTTTCCrTGTTTGCTlX"!"!'! 

GTTTAATAATGTGATTTTTTAAAAAGAGAATTGGCrTCCTGATTGGATATAGGGTAGTA 
TGGCTCTGGTATTAACCTGCTTCTAGATGTGATAGCClTC'riACTTGAT 

5 0 TTATTGCTGCTATTG AATA 1 TTT TCCGTCCCTCAAAATCG CTAT ACT AGGCT AGGCATGGTGGCITCAGG CCTGTAATCCCAACACT 

TTGGG AGG CCG AGGTGGGTGG ATCACTTG AGGTCAGGAG TT CAAGAC CAG C CTGG C CAACATGGTAAAAC CCTGC CTCT ACT AAAA 
ATCCAAAAATTAGCCAGACACAGTGGCRCATGCCTGTAATTCCAGCTACTC^ 
AGGCAGAGGTTGCAGTGAGCCAAGATCATGCGACTGCACTCCAGCCTGGGTGACAGA^ 
ATTGCTGTACTAGCn'GACCACCACCATTTAATATATTTAATAATGTTGGTATCAA 

5 5 CTCTrmTTTTTTTTTTTGAGATTC 

CGCAACTTCTACCTCCCAGGTTC^GaiATTCrCCrGCCrrCAG 

GCTAATTTTTTGTATTTTTAGTAGAGACAGGGTTTCTGCATGTT(^ 

GCCTCTGCCTCACAAAGTATCGGATTACAAGCATGAGCCACCACGCCTG^ 

AGTCATTCACTGGAG CTGAT AGGTTTAACAGTTTTGAACCCTGGCTCCTTG CCTTX3T ACTTTTCAGATCAGAAGC^AATGTTTAAAC 

6 0 GTCATCrCTTTGATCTGTCACCCAGCCCTAGCCTTTC 

CTTTGCTCTCTTTAAATTATGAAATATAACAAACAGGCAGAAAAGTAC^ 
TGCAGCCTGGTGCAGTGGTGCAAGTAAGTGCAGGTTCTCAGGTAGGTCCAGCCCAGATAGTA^ 
TGGAGCAAGTGTTTCACCTCITTCTCITTTTAAAAATA 
TATTCTCAGAGTTATACAATTATCACCACTATCTAATTCCC^ 

6 5 CATTCCCCATTTCTCCCTCCCCTCAGCCCTTGGCAACCAACAATCTATTTTCC^ 

TAAAAATGGAATTTACAGTATGTTCCCTTGCATGCCTTT^ 

GTACTTCATTCCTTTTTAACATCTG CATAATATTCCATTTTATTC^TTGTATTTATCC^TTCCTATCC^TATTTAT CTGTTT ATC C 
TTTGGTGGACATTTGAGTTGTTTCCACTTT'GGGGCGATTATC 

7 0 ACTCCAACCTCCACCTCCOVGATTCAAGCAATTCCC 

CGGCTCATTTTTTTCrrTTTTTAATTT^ 
GATCCACCCGCCTCAGCGTCCCAAAGTGCTGGGATTACAGGC^ 
ATGTTTTCACTTCTCTTAGGCATATACCCAGGAGTGGAACT 
CAGATAGTTTTCCAAAACTGTTTCACCATGTTACAATTCT 
7 5 TAATACTGCCTACGTCTGAGGGATTTAGGATAAAGAGTAGATTCATAAAAGCA 









CTGGGGCACTATGCCACACAGCTCCAGAACACGTATGACCGC^ 

TGAACAGAGGTTGGTCCGAGAAGCCAACAATGGTAG CT CTCCAGCTGG AAGCCTTG CTG ATGCCATGTCCCAGAAACACCTCCAGA 
TCAACCAGACGTTTGAGGAGCTGCGACTGGTCACGCAGGACACAGAGAATGAGTTAA^^ 
ATCATCCAGTACCAGGAGJU3CCTGAGGATCCAAGCTCAGTTTGGCCCGCT 
5 GACGGCCCTCCAGCAGAAGCAGGTGTCTCTGGAGGCCTGGTTGC^ 

CCGAGAAGfCACCAGAAGACCCTGCAGCTGCTGCGGAAGCAGCAGACCATCATCCTC 

CAGC AGCTGG CCGGGAACGGCGGGCCC CCCGAGGG CAG C CTGGA(XITG CT ACAGTCCTGGTGTG AG AAGTTGG CGG AGATCATCTG 
GCAGAACCGGCAGCAGATCCGCAGGGCTGAGCACCT 

TCAACGCCACCATCACCK^CATTATCTCAGCCCTGGTGACCAGCACGTTCAT 
10 CAGACCAAGTTTGCAGCCACTGTGCGCCrrGCTGGTGGGCGGGAAGCTC , 

CATCAGTGAGCAGCAGGCCAAGTCTCTGCTCAAGAACGAGAACACCC^ 

TCATGGAGTACCACO^CCACAGGCACCCTTAGTGCCCACTTCAGGAATAT 

GGGGCAGAGTCGGTGACAGAAGAAAAATTTACAATCCTGTTTGAATCCC^ 

CAAGACCCTGTCCCTCCCAGTGGTGGTGATCGTTCATGGCAGCCAGGACAACAA 
1 5 TTGCAGAGCCTGGCAGGGTGCCATTTGCCGTGC 

GCCGAAGTGCAGAGCAACCGGGGCCTGACCAAGGAGAACCTCGTGTTCCTGGCGCAGAAACTGT^ 

GGAGG ACT ACAGTGGCCTGTCTGTGTC CTXK3TCCCAGTTCAACAGGG AGAATTTACCAGG ACGG AATTACACTTTCT CAATGGT 
TTGACGGTGTGATGGAAGTGTTAAAAAAACATCTC^AAGCCTCATTGGAATGA 

2 0 GAAGTTTGATTCTCAGGAAAGAATGTTTTGGAATCTGATGC 

TGGGAGACTTGAATTACCTTATCTACGTGTTTCCTGATCGGCCAAAW 
GAGTCrcCTACTGCTAAAGCTGTTGATGGATACGTGAAGCCACAGATC!A^ 

TGCCGGGGGCGGCAGCGCCACGTACATGGACCAGGCCCCCTCCCCAGCTGTGTGTCCCCAGGCTCACTATAACATC 
ACCCTGACrCAGTCCTTGACACCGATGGGGACTTCGATCTCGAGGACACAATGGACGTAG 
25 CGGCCAATGG ACAGT CAGTGG ATC C CG CACGCACAATCGTGA 
GCTCTTAAATCTGGATGGGGGATAATAATAACACTGATAAATGTTATTTATTTCTTATCCT
TAGATGAATTGAAATATTTGTTGCTAATAATTCATAAAGCAGT 

TTTAATTATTAAAAGTCAGATATGTCCTTGGAAGGAATAGGtAAAGGCATGGCTCCCTC 
ATATTAGAAGGAATGGAATAGGTAGTTTTCTTCTCCnTTXTTTTT^ 
5 GGAGTGCAGTGGCATGATCTTGGCTCACTGCAACCTCTGCCTCCTGGGTTCAAGCG 

GGACTACAGGAG CATGG C ACCACGCCCAG CT AATTTTTGTATTTTT AGT AG AGATGGG TTTT CACCATTTTGG C CAGGATGGTCTT 
GATCTCTTGACCTCATGATCCACCCACCTCGGCCT^ 

TO ' Crmi 1 1 rTTAAGAGTTTAGTATGCATAGGGCAAAATCAAAGCCTTGCTGTTACTGTACCAC 
CATTCTTTCATTCAAAGATGAGAGAACAGAOTCATGGTG 
1 0 GTGCCAGTGAAATTGATTAAGAAATAGGAATGCCATCTTTTAAAAGGGCCCAGC^ 

AGGTGGGTGGATCACAAGGTCAGAAGTTCAAGACCAGCCTGGCCAAGATGGTAAAACTC 

CAGG CGTGGTGGCGGG CAG CTG TAATCCCAG CTG CTT AGGAGG CTG AGGCAG ATAATTG CTT AAACCCGGGAGGCGGAGGTTGCAG 

TGAGCCCAGGTCACGCCACTGCACTCCAGCCTGGGTGACAGAACGAGACTCTG 

ACAATGGCTTGTGCCTCTAATCCCAGCACTCTCGGGGAGGCTGAC^ 
1 5 GGCAACATAGTAAGACCTTGTCTCTACAAAAAAAAAAAAAAAAGTTTTTTTC^ 

GTCCTAGCTACTTGGGTGGCTGAGGTAGGAGTATCCCTCAAGCCCAGGAGTTTGAGGCT 

CTCCAGCCTGAGTGATGGAGTAAGACCCTGTCTCGGATGGATGGATGGATGGATGGA 

AGAAAAGTTTATACTGGAGGATATTGGTTAAGGAGAAGGATAGCAAGAGCTATT^ 

ATTGAGTTAAAATTAGAGATGGGAGAAAGAAGCCGGTTTTGTAAGTTTAAGGTAACAG 
2 0 ACCTGCTCTTAGCTAGATACTGTTTTAGGTATGTACATAAAACTAATTC^ 

TATTCCCATTGTACAAATAAGAAAACTGAGGCACAGAGAGGCTTAACAG 

ATCA.TCTCTAAAAGTTTAACCCCAGTTTAGAGACTTTGGCCCTGGGATGGG 

AAATT AT ACATATGAAT CCAGCAGAC CAAAT CTGCTGG ATT CAT ATT ATT CAT ATT ATG CAT AAT ATTATT CAT ATTAACC C C CTA 
CTCCCACCCCACACTAGAATTCATTCACTGAGTTGGCAGATACACAG^ 

2 5 TAGCTAGGGCAAGACCTTGGTCACAGTGTTCTACAAGTCTTCAGAGAGAG 

TTTAGAAAGGAAAAG CATCTGAGAAAAGATATTT ATG ACTGAGACAAAATTATTC CAAAT CC CACTCACTT AT AT CATTT ATTT AT 
GTGGTGGACATTaVTTTTTATGGTTTTCAGTAAGCCTGTAACATTATGAG CTATTAGACTTTG AAACTTTTATTAACAAGTG CCTG 
AGGGCCCGACTGATTTATTTGGCTCTCCAAGCCCATGGGTATATCCAGATATATAAG^ 
AGTACAGGGTTCTGCATCCAGGGCTGTCTTAGGTTGTTCTG 

3 0 TCTCATCTTTGGCCTAAAAATAAGATTGTGA^^ 

ACTTCCCTTGTCTGAGCATCACGCACTGAAATTGGAGCT 

TCTTTGTAGATGCAGAGTAGCGTTCGATGGCTAGATGAATGCCAGTTAGAAGGATGAATGCCGGTTAGATGAAT 
AGGCATGTGGACATAGC CTGATAATGGGGTGAAGAGTTGTGACATGTATGAAG AAAAATACT ACTT AACTG CCAGGCACGCTGGCT 
CACGCCTGTAATCCCAGCACITTGGGAGGCCGAGGCGGGCAGGTCA 

3 5 GAAACCCCATCTCTACTAAAAAAAATTAGCCAGG CGTGGTGGTGAGTGCCTGT AGTCCCIAGGT ACTCAGG AGGCTGAGG CAGGAGA 

ATCGCTTGAACTCGGGAGGCAGAGGCTGCAGTGAGCCGAGATCATGCCACTGCACT 
TCAAAAAACAAACAAACAAAAAACCACTACATAGCITCTAGGTG 
GGGTGTTTAGAATTTATAATGATAATCTCAGTAAGTCATCTCATATTTCACATAT 
ATGACTCTGTACTAGTTAGTGTACTTTTATTTTACACTTTTGAATAGCA.GT 

4 0 TATGTGTAGTGAAACTATATTGTATTTATCAAAATAAGGTGAITGCCACTTGGTATTTACCAGCTATTAT 

TATTTTAGAAAATAATTATTATGGCATGTTCATAACT^ 

TTGATGAAAACTAAGAAATAATCTAAAATTTAAGGCAAAATAGAGTGACATTTTT^ 

AGTA T TTCTTTCCTAAAACAGTAATAGGCTCCTAAATATGCTTC 

TCCGCAAGTATTTTTTGACCACTTTCCATAGGCAAGCCACCGTACTAGACACT 

4 5 CCGAGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGA 

TGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAA^ 

GGAGGCTG AGGCAC^ AGAATTGCTTG AACCCGGG AGG CAAAGCCTG CAGCG AGCTGAG ATATCTG C ACTCCAGTCTGGGCGACAGA 

GCAAGACTCCGTCTCAAAAAAAAAAAAAATTGTAATTGATTATTTAAATGG 

CTGATATATTGTTCAATTATAACTTTGAAAAGAGCCAATAGAATCGGTCAAATTGTC 

5 0 TTATGATGCTTATGTGCCAGGTACCGTTCTAGACCCAGGAGAAAGCACGGTGAACAAAACAAAGTC^ 

ACCCTGGTGGGAAGTCAGTAAGCAAATATATGATCAATATCAGGAATTGATAAATGATAAGAAAAATAA^ 

G AG AGTG ACCAGAATGC CG1TTT ATTTGAGTGGG CTGGG AAGGAGT CTCTGGGGAG ATAAAAG TTTT CATAGCTTTC CTTCTTTTG 

TCTCTGAAGTCAATTCACCTAATGAATGCAGATGCAAGAGAATTTTTGTATCT 

GGTGAGGGCCTACTTCTTACTCTTTACTACITAGGAATACTA^ 

5 5 CTT AAGTGCCTAAGCACCTTCATTCATCGATCCCTTATAT 

TGCAGTG ATGTAATCTCGGCTCACTG CCACCTCTGCCACCCAGATTCAAGCG ATTCTCCTG CCTCAGCCTCCCAAGTAGCTGGAAT 
T AT AGACG CATGG CACCACAC CTGGCTAATTGTTTTGTAATTTT AGT AG AGACG GGGTTT CG C C AT GTTGGCCAGG CTTG T C CCG A 
ACTCCTGACCTCAGGTGATCCACCCACCTCAGCCTCCTGAAGTC 
CTTATCCTTCATCCAAAAATCATGATTTGGTATCTGTTCTGGTTTTCCA 

6 0 TTGTGCAGTTAAGTGACTTCCAGTGTTTGATTATAAAGTGGAATTGCTT 

T CACAACCACTGAAATTTGATGTCTTGGGTCAAATTGG^ CTT AATG CCT AAAAG CC CT AAATAAT ACAC 
TTCTCAAG TGTTCT AT AG CATTG CACT AGTCT AAAATAG AT AACTGG CACT C C CCT ACAC CTC ATTG TTGG CT AACTGACTTG TCC 
CATAAATAGCCCAGTGTTACGTGATCAGACAACTACAACTAACAGAAGAGATTT 
AGTAAACAAAATCC^GTCCAAGTCACCGGCCCCTAACTGTAACCTAGTTCTTAGAAGAATTGTC 

6 5 TGGGATGGAATTCAAGATAACCCTGCCTTATACCAAACTCTACA 

AGAGTTACCTGAAGTCCCTTCCCAGTCTACCCAAATGTTTCTGCCCTAATCTGGTT 
AAAATTGTCAGGTGAGTCTGAGGGGGGCATCATGGTAGTT^ 
ACTACCGGCTGGGTGCAGTGGCTCAC^CCTGTAATTCCAGCACTTTG 
GACCATCCTGGCTAACAOXSTGAAACCCCATCTCTGCTAA 

7 0 AGCCACTCGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGCGGG^ 

GCCTGGGGAACAGAGCAAGACTCCGTCTCAAATTAAAAAAAAAAA 
CCAACTCTTAGGTGGAATGTATTGTTTrrGTGTCTGAAAGATTTGT 
GCAGTAACTCAGGAGGTGAACCrrCGGGCATTTTTCCAGATAAATCT 
GTGATAATG(ZATAGGGTTGAATCAACACTTCAGATAT^ 
7 5 ATTT CTG AAGT CAAAATAC CAAATCT AAAACAAGTGATTTTTTAAAAAAAATCTCrC a -G CTGGGCG CAGTGGCTCACACCT AT AAC 
CCCAGCCCTTTGGGAGCTGAGATGGATCACTTGA

GAAAATACAAAAATTAGCTGGGTGCGGTGGCGGGCACCTGTAATCTCAGCTACTTGC^ 
CGGGAGGCAGAGGTTGCAGTGAGCCGAGATCTTGCCACTGCCCTCCAGTCTGGGCAA 
AAAAAAAAACTCTCTGCTAATTATTTATTACTGTTGAAATGCTGCT^ 
5 CTACGATGTCAAAGTTAG CAATTCTTTCCCTAATTCTCCTTTTGTACTTTTGCTT AT AAGACCTT C 

TTCATAGACCCAGCTGTGTATCTTTCCATGGCCAGGCTGAGTCT 

AT ATTGTCTTT ACT CTAC CAAAAGGGCT AAAAACATTTACTTTGGC CAAACGTGG TGGCT CCTGT AATCCCAGCACTTT AGG 
AGGCTGAGGTGGGAGGATCACTTAAGGTCAGGAGTTCGAGACCAG 

AAAATTAACCAAGCATGGTAGTGTGCCCCTGTAATCCCAGCTACTCAGGAGGCTGAGACAGGAG 
10 • GAGGTTGTAGTGAGCCAAGATTGTGCCACTGCACTCCAGCTTGGGTGGCAGA^ 

AATAATAAAATAAAAACATTTACCTTGGTAACTAAGTGAATATTTACTCTGG 

AAG CACTGATTGTTTGTTCAACAGATTGT AAAC CATGGCTGTGTGGATACAAG CT CAG CAG CT C CAAGG AGAAG CC CTTCATCAGA 
TGCAAGCGTTATATGGCCAGCATTTTCCCATTGAGGTGCGGCATTATTT 
AAGGATGTTTGCATCATATTTTGTGAAACACAAGAATATTGGAGTTTGTT^ 
1 5 TTTTTAAACATTTTTAAATGTGATCCATAAA 

AGATTGAGGGGATGGAATAACTCTTGGGATTTTTAAAACAGG 

TATTACTGCTAAGAAAGAGCAGCCAACAAATATGTTTCGAGTGCCCATGGG 

GAAAAAAATTAAAGCCCAACATTGTGTTGTAATCCCJUVCACTTT 

GG AAGGCCTG AG CAACATAG CAAAACCCTGTCTCTGTACAAAAT ACAAAAATT AG CCAGG TGTGGTGGTGTG CACCTGT AGTAC CA 
2 0 GCTACTTGGGAGGCTGAGGTGGGAGGATCACTTGAGCCTGGGAGGG 

CCTAAGCAACAGAGTGAGACCCTGTCTCAAAACAAACAAACAAACAAAAT^ 

GGACATGGTGGTGCACACCTATAATCCCAGCTACCCAGGATGCTGAGGTGGGAGGATTGCTTGACCCCA 

T AGG CAACATAG CAAG ATC CT ATG T OU^AAAAT AATAATAAT AATTT AGTGTTT AATGGGG CAGAT AGG TG TGAAT ATG CAT ATAA 
TTTAATTATAATTTATTTCITATAGTTTATTAAATATAAATTATTTTAC^ 

2 5 ATTAATAATTTAACAGTGGAGACATTGGTGTTTTTAATTTATATGTATT^ 

GGTGTTGTGGCTCATGCCTGTAATCCCAGCACTTTGGG^ 
TGGCCAACATGGCAAAATCCCATCTCTACTGAAAATAOUVAAAATTAGTTC 

TGG AGGCTAAAG CATGAGAATCGCTTGAAGTCGGG AGGCAGAGG TTG CAGTGAGCCG AGAT CGCACT ACTGCACTC CAG C CTG GG C 
AACAGAGTGAGACTGCCTTTAAAAAAAAAAAACAAAACTAAAC^ 

3 0 GATTTGATAAGCATTTTAATTTTTCTTTATTTTCCAGATO 

TAGAATTAGTACAGTGCAGTCGACTAACAGCACAC^TTTTTAATTGGTATCA 
OTGCTCAAGGTCTGGGAGGTTGCTGGTTGGTTACAGTC^ 

TAGCCATGGGTCCTCTTTGACAGTCCCTGAGAG AACCAG CTTTGATGCTGCTATTTGCAAAGGG CCATATACCTGTCTGTAATGTT 
GCCTTAGAATAATCGCAATGCGTGTTAGTAGCAAATTTCTGTTCAGA 

3 5 TAAAAAAAAACTCAACTTCATGTAATTAAGTGAGGGAAAAAAATTAGCTAAATA^ 

TCCACTCCCTTTTCATTTTCTCTCCTGTTTTCTGATGTGGAGGATTTAGAAAAACATGAGT^ 

TC ACTT AGCAG AGTTCT CATTTGGTGTT CTCACTG AATC^ TG T T G AT ATTAAGG AAAACTTG AAATTTTTT AG 

CTTTGGTTTTCTTCAGAATATAAAGATTTTAATGAAGAATT^ 

4 0 GACTGCAGGTGCCCGCCACCA.CACCGAGCTAATTTTTGTATTTTTAATAGAGAC 

AACTCCTGACCTTGTGATCTGCCCATTTCGGCCTCCCAAAGTGCTG 
G TTCTTTATGTTTAAGCCATCTAAGCAAGTCTTI T T T ' I TTITT^ 
CTGTGAGTTTTGCrCAGAGACTGGATCCTTTTTAAGAGTGGGCTCCTGGCCGGG 
GGGAGGCTGAGGTGGGTGGATC^CTTGAGATCAGGAGTTTGAGACCAGC 

4 5 ACAAAAATTAGCCAGGCGTGGTGGTGCACX5CCTGTAATTCCAGCTACTTC^ 

GCAGAGGTTGCAGTGAGCTAAGATCGTGCTACTGCATCCCAGCCTGGGTGACAGAGTGAGACTTCGTCT 
TTTCCAGACCAAGGGCATTGTTGCCAGTTGAAACAGAGGTGCAGGGCT 
TATTTAATACATAGGGCACACATATTTAGAATCTGACTTGCCTTAACTGCCrTCT 
GACTATGGTATCACTGCTTTTTTTGTTTTGT^ 

5 0 GGCGCAATCTTGGCTCACTGCAACCTCCACCrCCTGGGTTCAAGCAGTTCTCCT 

CACATCCCGCCACGCCCGGATAATTTCTTTTGTGTTTTAGTAGAGATG 
GAGC!TCAGGTAATCCGCCTGCCTCGGCCTCCCAAAATGCTAGGATCACAGGCGTGAG 
GATACACTTGTTGGTGCCTCTTTTTTGAAATTAGTTGCTATCTTCAGGGA 
TGTTTGGCTCCAGACCCAGTCAGCAGTTATTGAATGTTATTTCAGTCCGACAG 

5 5 TACTTCCCAGTGTCAATTGTTGCCTAAGTAGTAGAAAGGAAGG 

GGGAGACCATCCAACCCCCGTCACCAGTTCAGTAGCITGAAATCAATGCAAAGTCT 
GTTGGCATTGCCTTCAGGTCCTGGGTCTGCAACTAGTGACATGAATGTCTGACT 
ATGTCTCATTTCCAGGGCTTCTTAGAAATAGCAATAACTTGTTTC^ 
GAGGCTGGAAAGGACATGTAGAAGAGTCTTGGCCTTGTGCCTCCTTTAC^ 

6 0 GTCCTTATTTCCTTCAGAGTACACAGAACCCTTTCCTTTTCCTTGCATTTCTCGG^ 

GAG AG AAAGGAAG AAGGG CAGCAGATTGTT AAAAATTTTTTTT^ CT ACTGT AT AATC CAACT ATTT AGT AA 

ATTACAAATAGATTAACATAGGCATTGAGTCCATTTCrGTTGGGCT 

AGTTGAAGAAAAAGTAGGATGTGTTGAGGCTGTATTTATACAGGCTCACCT 

CT AGGGACT CAGT AG AT CTTGAT AATCCACAGG AG AACATTAAGGCCACCC AG CTC CTGGAGGG CCTGG TGCAGG AGCTG CAGAAG 

6 5 AAGG CAG AG CACCAGGTGGGGG AAGATGGG TTTTT ACTG AAG AT CAAG CTGGGGC ACT ATG CCAC ACAG CTC CAGGTGGG TGTG GG 

CTCTGGGGTCACACTTAATTACCTTCCCTTGGGGGTCTTCTTCCTGT^ 

TTACACGTGC^TAAGGlTClTGTTGTGTGGl , l'I"l'GGTL'l , l'l"riTT , rri'T I I "1 1 ACTTTTTATTTGAG AATAATTTCAT ACTT CCAG 
AAAGTT ACAAGAATCAT ACAAAG ATTCCCCACAT ATGCC CAT ATTCACCTGTTGTTGACTTTTATGTTGTCATATACTC CTCGCTG 
TCTCTCGACACACACGTTTTTCAGTCACTC^AGAGTAAGTTGCATACCTTGTGTCCCT 

7 0 T ATTT ATTTATTTATTTATTTATTT ATTT ATTT ATTT ATTT ATT^ CTCTGTCG CCCAGGCTG 

GAGTGCAGTGGAGCAATCTCAGCTCACTGCAACCTCTGCCTCCCGGGTTCA 
GATT ACAGG CTC CTG CCAC CATGCCTGGCAGTTTTTTTT^ 
AGTGCTCTTGAACTCCTGACCTCAGGTGATCCGCCCACCTCAGCCTC 

GTCCCTeTGTGCATTgT CT ' ri ' r: I i I J. TT T TTri" I ' r rGAGATGGAGTTTCACTCTTGTTGCCCAGGCTGG 
7 5 CTCGGCTCACCX3CIAACCTCCACCTCCCGGGTTCAAGTGATTCTCCT 
ACCACACCCGCCTAATTTTTX3TATTTTTAGTAGAGATGGGGTCTCT

GATTCACCCTCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGCACCCAGCCCCTCTGTGC^ 
ACCTTGTCTTG C ATGGTT AT AAT ACACTG ATCAACTTCACT AAATTT A^ 

CATTTTGTCAACTAATTCAAAC ATGTCCTTTAGGGT ATTTTT CCC CG CTG CAG TC CAGAATCCAG CCCAAG ATCACAT ATTGCAT C 
5 TAGTCATCATGTCTCTTTAATCTCCTTTAATCTGGAACCATTTTGCAACCT^ 

ATTAATGTAGGCTGGGTGCAGTGGCTCACACCAGTAATCCCAGCACTTTGGGAGGCTGAGG 

TTCGAGACCAGCCTGGCCAACATOTTGAAACCCCATCTCTACTAAA 

TCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCTCTTAAACCTGGG^ 

TCCAGCCTGGGTGACAAGAGTGAAATTCCATCTCAAAAAATAATAATAATAATTCATG 
1 0 TCCTCATTTGGGGTTTGTCTGATGTTTCCTCATGATTACGTTAACAT^ 

CCATGTCCATGGGTTCTGCATCCTGATTCCACCAACCGCAGCTCAGAAATATrCAGGAAAAGAAAAAAA^ 

TAAAGATATGACAGAACAGCTATTTACATAGCATTTCTTTTCTATC 

GAGG ATTGT ATG CATTCT ATGCAAAACGATG CC ATTTTATAT AAGGAACTTGA.GCAT CTXS 

GGAACCAGCCCCCACAGATGCCAAGGGACAATTGTAGGTGACTTGTCCTT 
1 5 GTCCCCTCATTGGTGGTAGTAGTTTTGATTGCCCATTTGAGAG 

ACTCCGTTTTTAAGAATGTTCCCCGGATACTTTAGGCAAAT^ 

GGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGGAACCCCATCTCTACTAAAAAGA 
GTGGTGGCGGGCCCCTGTAATCCCAGCTACTCGGGGGGCTAAGGCAGGAGAATTACTTGAACCCAGG^ 
CCGAGGTTGTGCCACTGCACTCCAGCCrGGGCAACAGAGTGAGACTCAGTCTCAAAA^ 
2 0 TTTTCACTGCAATTGGATTTCCATGATGTGCCTTCTTGTTCCT 

CTCAGGAGGCTXXCCATCCTGAGGCATGGTTAGGGCCCATAGTGT^ 

ACACGTATGACCGCTGCCCCATGGAGCTGGTCCGCTGCATCCGCCATATATTGTACAATGAA.CAGAGGTTG 
AATGTGAGTGTCCCATAGGGATGGGGAGGGCAGTCGGGAAGTCTTTTCTT^ 

CTCAGGGTGCACTATTGGTGTCTCTTTAAAGAGAGCTTCATCCTAAGGGGCACAAAAGAAATAGGAG 

2 5 AGCACTITGGK^GGCTGAGGCAGG CAGATTG CCTG AGCCCAGGAGTTCGAGAGCACCCTGGGCAACATGATGAAACCCCGTCTCT A 

CCAAAAATACAAAAAAATTAGCTGGGCATGGTGGC^TATGCCTGTA^ 

AGCCCAGGAGGTAGAGGCTGCAGTGAGCCATGATCGCACCCCTGCACTCATGGGTGATGGA^ 

AAATGAAATAGAAGATGTGGTCTCTGATCTTAACTGATTTATAACCTTGATGAGAT^ 

AAGTAATTCAACCTCAGCTTGAATAATTTTT^ 

3 0 CCCTGAATCAGGTGTGGAAGGAGTGGTCTGATCAGGTCACTTACAGGGCTGGGTCCACAGAGAC 

CTTCACCCrGGGGTCTCCAGGATGCCTGCAAGAGGTTTGGCCTCTTCCATG^ 

GAGTGGAAGAGGAATCACTCCACCTCCTCATTAGTAGGAAAACCCAGGTGACAOT 
TAATTCAATGGACCTCGTTTGGAATACTCTGGTGGAGGTCAGATCTGAAATGGAAGCTTCTC 

3 5 ATCTGTCCGCTCACCTGTTTCTCACITTTCCC^CTCCACCCTGTGTCTCATGATG^ 

CCTTTTGGGGTCTTACCCTGGTTGTAGGAGCTGGCTTTTCCTGGGGATTAAGGCCTC^ 
CTGCCCCCCATATGTTCCCAGGGTAGCTCTCCAGCTGGAAGCCTTGCTGATGCC^ 
GTTTGAGGAGCTGCGACTGGTCACGCAGGACACAGAGAATGAGTTAAAAAAGCTGCAGCAG 
ACCAGGAGAGCCTGAGGATCCAAGGTGAGGCGCGGTGGAGGCACATTGTGCAACTTGAGGATTGGGCCTGG 

4 0 TCTGGTGGGGTTGGGAGAAAGCACACTTTCCATTCGGAGGCAGCGACCTTTCTCT 

GTGCTGAGAGGATGAGATTTAGGAAGGGGAGATGACAGGACCCCCCC^TTAACTGCATAT 
GCTTCTGAGCTGCCAGGGAAGACATTTCTTCTTTTGAGAAGGAGCTCA 

TAGTAGGCATATTATAAGTGGAGCTTT ATTGAAATGATAAATAGG AAATT ATATAATTACATGTTT AAAAATTTTTCTGG CTGGG C 
ACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGCGGATCAC CTGAGGTCAGGAGTTCGAGACCAGCCTGG C 
45 CAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAATTAGCTGGGCATCCCAGTGGCGCATGCATGTAAT , 

AGGCTGAGGCAAGAGAATCTCTTGAACCCGGGAGGCGGAGGTTGCAGGGAGCCGAGATCTCACCATTGCAC 
AAGAGCAAAATTCTGTCTCAAAAAAAAAAAAAAAAAATTCTAAGCTGGGAGTGGTTTTGCACACCT 
GGCCAGGGCAGGAGGATCACTTGAGGCCAGTAGTTTCAGACT 

AACCTTTCTAAGCAGGTACGTGTAAGTACTAAGGACCTGTATATGATTTCTAATGTTCTATGCAGTTATT 

5 0 AAACAGAGTACACTTAACGCCCCTTAGGTAGCCCTTATGGACAGAGCTTCCCTAGCCT 

ACAGAGAGCTTCAGTCTGTGGTGAGGCTGTGGAGCCCCACCAGCGCCCCCTTTTGAC 

ATAGTTTCCTCTCTGCTTCCATG CCCAGCTCAGTCTCAGAGAAGCCTTGGCTCAACCACCCAGCCCCT CAAACACACACACACAC 

CACACAGAGACACACAGTGCACACACACAGCACACACACACCACACACAGCACACACACAGACACACAG 

5 5 ACACACACC^CACCACACACAGCACACACACACCACACCACACACAGCACACACACACCACACATAGCAC^ 

T ACACACCACACACACAT AGACACACAGCAC ACACACACACACACAG CACATG CT C CTCT AATT CCATG T ATGT AGGTT C CATTTT 
ATGTCTATAGTGATACTACTTATTTACTCTTTATCTTCTC 

T AC CCTG AGG CCAG CACAATGC CTG AAACAT AACAGTTATTG AGTAAAGGTTGACAGG ATG AT^ 
CTGAATCCATCCTAGACTGGCCCTAGAAGAAGGGCAGCTCCTT<^^ 

6 0 AGTCCTTAGACTGTCAACAGTATTTCTGATCCCGTGCCTGGCTTTTCTGT^ 

GTTGGG TTTGG AT CT AT CACGCAGTGTCTTCTCAGTCCCATTTGTCAAGTTCACTCAG CAG^ CTTTT 
GCCAGGGAAGGCTCAGCTGCCTCCAGTGGAGTAGGGAAATTGCTTCCCGTCTCCCTGGCCT 
CCAGGAGAGCCCTAAGTGGGTCCCTTGCATAGGAAGGCTGGGCCCAGGTGTC 
ATTCCTGAGGTAGCCGTGCAGATACCTTGGATCTGTCCGTCCCTGGAT^ 

6 5 CCATATCTCTGAGTTCTAGCTCTGGGCTGTTCCTGGCTACATAGAGTT 

GGCTGGGGTCAGAGTGACGTGGCCCTCAGGGGCCAGCCAGGAGCTCGACCAAG 
AAGCAAGGACTCTGGGCTGTTGAGAAGGTGATGGGAAGGGATTAATAGTCCCAGTCT 
GGGTGGGCCTGCGGTGGTGGTTGTAACTGTGGACATGTTTACnTTCTTA 
CGTTCCCTGTGTCTTACCTATTTCATAATAATTTTTAAAAGGTAATGAGGCCT 

7 0 CTGACAGATGGTGTGGAGGGAAGCCGCCTCCCTTCXKJCCT 

GGACATGCTCTCCCTCTGCGGTCTGTCTCTCTCTCCGTGT^ 
ATGCTGGGCCCCTTCCAGCCTCTCAAGAATCTCATCCTCCCCTGAGGA^ 

GCTGGGTTTCCCTGAAGAGGCCTTGCTGCGGGAGGGATGGGGCCGAGCAGCTGCTTGTCTCTGTCAC 
CCCCGTTAACCAAGAAGCACGAACCTCACACAGAGCTCACCCATC 
7 5 AT CTTAATG CAC CC CACTG AGG CAGGTG CCG AtiAAT ACT CCCGTTTT ACACGTGG GC AAC CCAAGG CA.CAGG A A^-GGAAAG CGATT 
TGCCCAGGTTCACCTGCAGTCAAGGGCATGGGC^
GCAGCCCTGTGAGGCAGGCGGGCAGGAGGCAGGCAGGCCTCCTCCCC^ 
CAGAGTCAGGAGGCTGGTGAGCAGGGCCCAGAGGCGCCTCCTGAGGCGCCATTCATCTTC 
CTTGCTCTTGATGCAGTGTCTGAGCCTGGGAGGGTCTCGCTCCA^ 
5 GGAGGTGATGGTCATGGTTTCCTCTTCTCTCCTCTAGCTCAGTTTGCCCAGCTGGCCC^ 
GGAGACGGCCCTCGAGCAGAAGCAGGTGTCTCrGGAGGCCT 
GTGGGGTCCTG<X3CCTCTCCTGGGCGTGGGTGCCATC 
GAACCTGGGAGGGCAGGAGGCCTTTTTTCCTGGGGGCC^ 

1 0 GCACCAGAAGACCCrTGCAGCTGCTGCGGAAGCAGCAGACCATCATCCTGGATGACGAGCTGAT 
TGGCCGGGAACGGCGGGCCCCCCGAGGGCAGCCTGG^ 

TCCAGGGAGCCTGGTTCCTCAATCAGACTTTGGTCCCCATCCrcTC 

TGGGGCTCGGGTGTGTGGGGTGATGCTTGGGCTGTTTGGGCCTAGTCAGGGTCGCCCCTCCT 
1 5 CAGGGAGCTCTGCTCTTCCCATGGGTGGGAAGTGTGGCGAAAGCACAGAGCCTTCCTGGG 
GGCGTCTGTGCGGAGGAGCCATTGTCCTCCTGTTGGCCTTGGGGCTCTC^ 
AACCGGCAGCAGATCCGCAGGGCTGAGCACCTCTGCCAGCAGCTGCCCATCCCCGGCCCAGTC^ 
CGCCACCATCACGGACATTATCTCAGCCCTGGTGACCAGGTGACTGCrGCC^ 
CCTGCGTGGGGGGAGCCGCAGGTGCCTTCCAGACCAGCAGATCCACTTCCrGCCTCTCATCCCT 

CCTCTCCATCCTCTCCCTGGACTCTCACGGCTTCCCCAGGAACAGAGGCIGT 

GCAGCCCCACCCCCACCACAGAGGGACTGAGAGCCCTTTCCCCTGCACCAGCCCCTCATC^ 

GCACGTTCATCATTGAGAAGCAGCCTCCTCAGGTCCTGAAGACC^ 

AAGCTGAACGTGGACATGAACCCCCCCCAGGTGAAGGCCACCATCATCAGTGAGCAGCAGGCCAAGT 

2 5 CACCCGCAAGTAATTGTGCAGCTTCCTCTCATTCCCTGCTACTTGTCTCATG^ 

GTGTCAGATCCCTGTTTGCCCTTCTGGCCTCTTCTGGGCATCAGCAAAGGA^ 
ACACCTGTAATCCAAGCACTTTAAGAGGCCGAGGTGGGCAGATCACCTGAGG 
AAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGTGGTGGCAGGCACCT 
GGAGAATCCCTTGAACCTGGGAGGCAGAGATTGCAGTGAG 

ATTTTGTAGAGATACGTCCGAGAAGATTACATGTATGTGTTATGTAGCTAACATTTCCT 
TCTCTATTTCCTGCTACCCCAAATTATATTATTTACTCCGCATGTCCTGAGACCCT 
TTTGCAGTGATTACAGTGGCGAGATCTTGAACAACTGCTGCGTCATGGAGTACCACCAAGCCACAG^ 
AGGAATATGGTGAGGGTGGAAGAAGAGGAAGAATGAATTTTTCCCTGGTCTTTGAGG 

3 5 ATGTTTCTTCTTTGTAGTCCCTGAAACGAATTAAGAGGTCAGACCGTCGTGGGGCAGAG 

CTGTTTGAATCCCAGTTCAGTGTTGGTGGAAATGAGCTGGTTTTTCAAGTCAAGGTAAAGCCTTC 
TGCGTGTGTGTGTGTGGTGTACTTTATGTTACAAAATA T T l ' lTl T l ' Tr ATCACATAGCATCrrCl'l 
TGAAAATGGACTGACCTCACCCTGAGAACTGTTTCACTTAGGTTCTCTTAAGAC^ 
TGTAATCCCAGCACTTTGGTAGGTTGAGGCAGTCAGGTCACTTGAGGTCAAGAGT^ 

4 0 C CATCTCT ACT AAAAAGT ACAAAAATT AG C CAGG CGTGGTGGCACGTG C CT AT AAT CCC AG CT ACTCAGG AG AATCACTT G AAC CT 

GGG AGGCTG AGG TTGT AATG AACCGAGATTGTG CCACTG CACTCCAGCCTGGGTG AAAGAGTGAG AC C CTGTCTCAAAAAAAAAAA 
AAAAAGAC^TGAACAAGCAAGGCCCAGTGCAGTGGCrCACATCTGTAATCCrrAA 
GAGTTCAGGAGTTCCAGACCAGCCTGGAGCTGGTTGTGAGACCCCGTCTCAACA^ 
ATCAACAAACAATAATGAACTGAACTAGCTGGAGAGGTAGACTACAAA^ 

4 5 AAGGTTGTGGTAAGTTTTAAATTTGTTATTTGTATTTTCTTCC 

TAACTTTGAGAAACTTTATCCTTCAGAAATATCTAAGCAACA^ 
AAAGTCATATTATTCCTGGGGCCATTTAGATTCrGTGTCTGAGAGGA^ 
TAAGAAAAAAAAAAAAGAGAGAAGGAGAAGAACTTGAGGCTC?GTGGGCGCT 
GCCAGTGGTGGTGATCGTTCATGGOVGCCAGGACAACAATGCGACGGCCACTGTTCT 

5 0 GTGTATTCCTCGGGCATGTGGGTCCCAAAATCAATGCCGGTGTCATGAATCTCATTTATC 

AAGGCCGGGTGCAGTGACTCACACTTGTAATCCCAGCACTTTGG 
CCAGCCTGGCCAACATAGCAAAATCCTGTTTCTATTAAAAAATACCGAAAAATTAGCT 

AGCT ACTTGGG AGG CTGAGGCACGACAATCACITG AACCCGAGCGGCAG AGGTTG CAGTG AGCCGAGATCGCACCACTGCACTCCA 
GCCTGGGTGACAGAGTGAGACTCCATCTCAATAATAAAAAAAAAGCATAGGAAAAAAGACGCTX; 

5 5 TAGAAAGAATGTGGAAAGTGGAAAGAACATGGGCTTGACIATC 

ACCACATGGGC^GTTCACTTTACCTCTCTGAGCCTG AGTTTCCAGCAAG CAGCACAGGGAT ACTCTTG CTCT ACAGCATT 

GTTAGGAAGATTAAATTAGATAACATAGAAAATGCTTGCCACACAGTCAGTACTCAG^ 
CTATGAATGGGATTTGGGACTCCTAGGTTCACTCCAGAGGTTAGATCTCT 
CCTCTrCCTTTCAATAAATGAAGATGCTGTTAAATGTCTTTTCCAAATTT^ 

6 0 GAGATACTTTGCCTCAAGGAAGATATAATTTGGTGGATTTATGCACCCAAACGCTATCTGAATC 

G CATTTTAAAGATCTCATAGG CTGGGTGCGGTGGCTCATGCCT 
AGCTCCAGACTAGCCTAGGCAGCATAGCAAAATCCTGTCTCTACAAAAA^ 
TAGTCCCAGCTACTTGGGGGGGGCTGAGGCGGGAGGATCGCCTGAGCCCAGGAGGTCCAGGCT 
TGTACTCCAGCCTGGGTGACAAAGCGACACCCTGTCrCAAAGAAAAAAATTTT^ 

6 5 CIAATAGAGATTAGTCGATAAAAGCATGATAAAAGGGTTA 

CCTTGATTATGTTTTGTTTTGTGGCTGTTAGAATATAGGATTTGACTGTT 
ATGGGTTACCTTCCAGTTGTGTGTAAAGAAGGTGAAATGGAACCTTCTGGCT 
CTTTAGAGAAAAAGTATTTTACTGCAAAGGTGAGATTTTTAA 
ITI'TT'GAAACTGTCTGTCTGCCTCTTGTCCTGATAAGCCTCCTAG^ 

ATTCTCAGCTATGTTAATGACrACrrACATCITATGCT 
AGGCTGGAGTACAGTGGCATTATCTCGGCTCACTGClAGCri^ 

AG CTGGG ATT ACAGGTG CATGC CACCACG CCCAATT AATTTTTT AT ATTTTT A CAGGTCT CACCATG TTGG CCAGGCT 

GGTCTCGAATTCTTGACCTCAAGTGATCCACCCACCTCGGCCTCCCAAA 

7 5 GACCTTATGCTATGTTAAAAAAAAGACTTTCAGCTGGGCACGGTGGCT 
GTGGATCATTTGAGGTCAGGAATTCGAGACCAGCTTGGCCAACATC
GGCGTGGTGGTGCGTGCCTATGATCCCAGCTACTCGGGAGGCTGAGGCA^ 

GACCTGAGATGGTGCCACTGCACTCCAGCCTGGGCAACACAGTGAGACTCTGTCTCAAAAAAATAAA 
AAGACTTGTGGCCAGGCACAGTGGTTTATGCCTGCAAT^ 
5 TTTGAGGCCAGCCTTGTCAACATAGCAAAACCCTGTCTCTACAA 

TCCCAGCTGTTTGGGAGGCTGAGGTAGGATAATTACTTGAGCTAAGGAG^ 

TCCAGCCTGGATGACAGAGCGAGACCTTACCTAAAAAAAAAAATAATTGTGGAAATTT 

CTGTGGGTCAAATAACATAAGAAGATTGCAGCATTGTTAGTAATAGCTA^ 

TCTGGTGAAATTTCTTT ATG TGCTG ATGGGAGGTCT CC CAG ATAG AGG ATT AAGTG AAAAG AGGAAG 
1 0 TGTGCTGTCATTTGTGAAGAAAAGGAAAAGGTATATGTCATTGCTGGC^ 

TGTTAGGTGCCTTTAAAGAAAAGAACTGGGTTGCTGGGAGACAGGGATAG^ 
AATTTTAAATCATGTGAAAATCACCTATTGAAAAA^ 

C^TTTATTAAAAGTTAATCTTCTAACCTATTTCTCAATAAAGCCATCCAACT 
ACTTTGAGCTAGAATTCTTGGCAGAAGGAGTCGAGGGCAATGATTCCArc 
1 5 CACACTCTCTCTGTTTGTACCTAGAGGACCAGTCTTTCAGATTTCTAAATTC 
TTATGTTTTGGTTTGTGGCCGTTAGAACATAGGATTTGAC^^ 

TACCTTCTAGTTGTGTGTAAAGAAGGTGAAACGGAACCTTCTGAATTTCACAGAACCGTGT 

TCAGGCCCTGCAAGACAAATGTGTGTAAC CACTCCCAG<:CATCAGCG'rTrT rC rX"ITGTGGCTTGAGGG AATAAAT CATGCAACTG 

2 0 GAGGCGCTCAACATGAAATTCAAGGCCGAAGTGCAGAGCAACCGGGGCCTGAC 
GTTCAACAACAGCAGGAGCCACCTGGAGGACTACAGTC^ 

CCAGCCGCTGTTAGATCAGTCCTTCCCAACCAGATGCTCTCAGGAGTCTCCCTGCCCTGGGC 

AATGTACCGAAAGTAATAGAAGCTTCTGACTAGATGGACAACITGAGAGTTACCCAGGTC^ 

ATTTTAAATAAGTTGACCTTTGTCTACAGTTAATGCATTTrCTTC 

2 5 CATGAGAAGCAAAGCTTTTGAGGGGATGCGAACTCTGGGCCTCTCT 

CAGGCTCCTATGGGACCCTCAACCCACAAAGAGGGATGCTGATGAAAGAAACTCCTCCT 
TTGATTTTA TTTlT ATTCA TriTl " lTl ' TlT TGAGACAGAGTTrCGCT 
TCACCGCAACCGCCGCCTCCTGGTTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGT 
CCCCGCAATTTTGTATTTTTAGTGGAGACGGGGTTTCTCCATGTTGGTCAG 

3 0 CGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGTCCACTGTGCCCACCCTATTTAi i o i GTTTATTTATTTA'ITITI'GA 

TGTCTCAGCCTCCrTGAGTAGCCGGGATTACAGGCACCCGCCACCATGTCCAGCC 

ACCATGTTGGCCAGGATGGTCTCAAACTCCTGACCTCAAGT 

AGCCACCATGCCCGGCTATCACCTTGATTTGAATTTAATTTTTGTTTC 

3 5 TTAATTTGGAGCCTAACAACGTGCATTTCTCAGAAGCTCCCAGGTGACACT 

CAAGGGACGGGTCAGCCCAGTCCCAGGTGAAAGTGTCTGTAGCAGGAGCCCACTGGG^ 
CTTCTTCCCTCTTTTCCTCTCCATGGGGCCATTAGGTAAAACTCAGGTAGGCGGGGATATT^ 
TTTGTTTCCCATTTAATTTTTCCAATTAAATATGTGTATAATATTCCAAGTGTTA 
CCAACCAGGAGAATTTACCAGGACGGAATTACACTTTCTGGCAATGGTTTGACG^ 

4 0 CCTCATTGGAATGATGGGTGAGAATCATTATTTOT 

TCTG CTTCTGTCCTTTGTCCTTGGTG ATCC AGGG CCATTTTGGGGTTTGTAAAC AAGCAACAGGC CC ATGACCTACTCATTAACAA 

GCCAGATGGGACCTTCCTCCTGAGATTCAGTGACTCAGAAA 

CCTCCCTTCTTATGTCCAAAGTTAGCTGCTTTCGTTAC^\AATTAT^ 

TCAGT ACTAATT ATGGG AAACAT AAATG T AT AG C CATG A CATACT CATAAT AAG GCAGC CT ATG AAAAATACTATT ATGGG AAAC A 

4 5 CAATTGCTAATTTGGTTATAC^TGAAAGTATATGGCAGCATGGTTACA 

AGAGTCCGTGCAGAATCTCAGTTGCTTCCCTCTTAAAATGTTTGCCTTATTC 
TTTGATGATTTGGAGTTCTTACAGTAGTAAAAATATTCCITTATCCT 
AGATGTTAGTTCTTTTCTTTTTTTGAAACAGGGTCT 
GCCTTGACCrCCTCGCCTCAAGCAGTCCrrCCrCCCn'AGCCT 

5 0 ATTTTTAAATTTTTTGTATGTTGCCCAGGCTGGTCTCGAACT 

GATTACGGGCTTGAGCCACTGCACCAGGGCATCAGGAAAGTTAGTTCT^ 
ATTTTCTGTGAATTAAAGTTTTTAATAACTTTTGGCTG 

AGGAGGATCTCTTGAGGCCAGGAGTTCAAGCCCAGCITGGGCAACATAGTAAGACCCT 

5 5 CTGGGAGTTGAAGGTTACAGTGAGTCATGATCAGATCACT 

AAAAAAAAAAAGGCTGGGTGCGGTGGCTCACGCCTGTAATCCCAGCACT^ 
ATCGAGACCATCCTG^CCATCACAGTGAAACCCCCGTCTCTACTAAAAACACAAA 
TAGTCCCAGCTACTTG^GAGGCTGAGGCAGGAGAATGGCGTGAACCC^ 
CIACTCCAGCCTGGGCAACAGAGCAAGACTCCATCTCGAAAAAAAAAAAATT 

6 0 GTGGGCATGCTTCCTAGCTTTTATATATCTTTGTATGGGAAATCAGGGATTGGGGGCCAGTA 

TAGAATACTGTTTGAACTTGAAACGTCTGAGATCATTGGAT^ 
TTTTATTTTTATTTATTTATTTATTTATTTATTTT^ 
TCACTGCAACCTCCACCTCCCAGCITCAAGCCATTCT^ 
GCTGATTTTTGTATTTTTAGTAGAGACAGGGTTC^TCATGTTAGCCAGGC^ 

6 5 CTAGGCCTCCAAAAGTGTGGGATTACAGGCATGAGCCAACATGCCTGGCCCCTACCCATG^ 

ACGGTGGCTCACGCCTGTAATCCCAGGACTTTGGGAGGCT 

ACATAGGGAAACTCCATCrCTACTAAAAATACAAAATTAGCCAGGTGTGGTGGCGCGTGC 

GAGGCAGGAGAATCGCTTGAACCAGGGAGTTGGAGGTIGCAGTGAGCCAAGATCG 

AGACTCCGTCTCAAAATAAGTAAGTAAGTAAGTAAATAAATAAAAGTATTTCATCTATAACCCATGG 

7 0 TCTATTGGAGCCATTATAATGGCAAGCTTGAATTTATAGTATGT^ 

GTGATTGTTCIGTTTATTGATCTAGAGGAAAGAATGTTTTGGAATCT 
AGCCGACCGCTTGGGAGACTTGAATTACCTTATCTACGTGTTTCCTGATCGGCC^^ 
CAGTTCCCTGCGAGTCTGCTACTGGTAACAATGTTCG(^TTCTGATTT 
TGTGTGTGCATACCTGCGTGTGTTATCTCrCITGTGTGAAAACCCACACAATTAC 
75 GGGCIAG ATGT AAACCTTGGGCTGG AX rCTT CAGT CAT ACAG AAAACACAT CT AC CACTAACC CCAA AT AT CAAATTGAAG AAAACA 
AATTCTT T T r AATACAAGCTTAGATCAAACAAAATTAGGTAAAATTTTGTTTTTC^^
TTAGAAGTATTACATATTTATTATAAAATTTTCAGAAGACACATAA 

CCTGTAATCCCAG AACTTTGGGAGGCCAAGG CAGGCGGATCATTTGAGGTCAGG AGTTCCAGACCAG CCTGGCCAACATGGTGAAA 
CCCTATCTCTACAAAAAATACAAAAATTAGGTGTGGTGGCAGGCACTTC 
5 ACTTGAACCCAGGAGGTGGAGATTGCAGTGAGCTGAGACCACGCCACCAC^ 



AGGTCGGGAGTTCAAGACCAGCCTGACCAACGTGGAGAAACCCCACCTCT 
TGCCTGTAATCCCAGCTACTCX3GGAAGGCIG1AGK3CAGGAG 

GCCATTG CACTT CCAG C CTGGG CAACAAG AG TGAAACTCTGT CTCAAAAACAAAAG AAAAG AAAAGAAAAAAACAGAT ACCA 
1 0 CATCTCTCATCTAGAGATTAACCTCTATGGATGATTTAGTGACTATTCTTGCACr 
ACCCGTTGCTTGAACTTCCTGTGTGGAGGTTATCTAGCAGCAGAACGAAGACCA 
TGGCTCATGCCTGTAATCCCAGACTTITGGGAGGCCGAG^ 

ATGGTGAAACCCCATCTCTACTAAAAATACAAAAAGTAGTCAGGTGTGGTGGTGGGCGCCTGTAA^ 

AGGCAGGAGAATCGCTTGAACCAGGGAGGCAGGAGAATTGCTTGAACCAGGGAGGC^ 
1 5 TG CATT C CAGT CTGGTG ACAGAG CG AGACT CCAT CTCAAAAACAAACAAAACACAAAT AT ATT AATTT A^ 

GACAAATAAAGCAAGGTAGAGGGATAGAGAAGTGGCAGGGAGGAGAGTGGAGGTC 

TCTGAGGACATGTTTGTTTTTTTTGTTGTTGTTTTC 

ACATTGCTCACTGCAGTCTCAAACTGCTAGACTCAAGGGA 

CACCAATCCTGGTTAATTATTTTATTTTTATAGATATGG 
2 0 AATGTTCCTTCCTCAGTCCCCCAAGGTTCTGGGATTATAGGCATGAGCC^ 

G AATG AAGTGAGG AAGCT AG CCATACAAAG ATCTT AGAGAAAAACATTGT ATG C CAAGAGAACAAG TACAAAAAT C CTGGGTTG AG A 

ATGGCCATGTCTGGAAAGCCTCAGGAGGGCCAGGGAGGCTGGAGCCTCTTGAGC^ 

ACACTCCAGCCITGGAGTTCAGGGTGAAAGGTTGGAGTCTAGAC 

AAATATATCCTAGGTAGCTGGGCATGGTGGCTCATGCCTGTAATC^ 

2 5 CAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAA 

TACCTGTAATCCCACCTACTCAGGAGGCTGAGGCAAGAGAATCACTTGAACCCGGGAGGCGGAGGTTG 
CCACTGCACTCCAGCCTGGGCAACAGAGCGAGACTCTGC 

GTAATCTTCCAACCTCATTTTGATGACTGCATACAATCCCACTGTATGGTGAGTTACATGTTGGTTAT^ 
GGGCCTGGGCACAGTGGCTCATGCCTGTAATTCCAGCACATTAGAAGGCGG^ 

3 0 CCAGCCTGGGCAACATAGTGAGACTCCATCTCTACAAAAAAGAAAAAGTTAGCTGCGCT^ 

ACTCAG AAGG CAG AGGCTGG AAGATTGGTTG AGCCCAGG AATTTG AGGCTGCAATGAGCTATGATTGTGCC^ CACTGTAGCCT 
ATGTGACAGAGTAAGACCCTGTCTCAGAAAAAAATAAAATCCTA 
ATAAAATACTCTGTACACTGTAAAGCACAAAAAACATGTTATGAAAAGGTrc 
GTAATAAGCAATTGGTGAACCCTGCAGCGTTTCACCCTCTTGGGCT^ 

3 5 TCGTCCTTGCCTAGTATAATTAACTTTGTACTGATTCCCATATTCACGTACCCAGAAACAAGACTTCATT 

TCACATTCTTACCACCTGCTTACCTGGATTTCCATAGTCAGCCTTCTGAAAAGGTAACTG 
ATCTGCTCATCTGCCACATCCCAGTGGGGATTCTGTGGGAAGCAAAGGACACATCGCATTCTTGGCACCAAC^ 
CTAGTGACATCAAAGTACCCOVAATGCAGAAACACCAGGCGTGAAGGAGCGT^ 
AAAGACTAGACACTCAGAGTTAGAGGAACCTCAGACATCATCAGATCAGATCAGCGGCT 

4 0 TCCCTGACGTGTATGCACCTCCTGCITGAAAACCTC 

AGGCCAAGGCGGGTGGATCACTTGAGGTCAGGGGTTCGAGACCAGCCTGACCAACATGGAGA 

AAAAGTAG CTGGGC^TGGTGGTGCGCACCTGTAATCCCAGCT ACTC CCAGAAGGCA 

GAGGTTGCAGTGAGCTGAGATCGCACCACTGCACTCCAGTGTGGGCAACAGAGTAAGACTCr 

CTCCAGTGACAAGGAAATCAGTCCTTCCTAAGATAAATCATTTCTTTTCCTGGCAGTTCT 

ATCTTGGACTTTAAGCTTTTCAAGGTGTTTTTTTTTTTTTAATTAGT^ 

TAGTTTCCATGGTTTGAAGAATTTATTG AAAC ATCCAAG CCAATTTGG AAG ATAT CTTGAATTTCAGTTTTAGTGCTGACCTTCTC 

CTCTACCAGGTCTTCCTAACAGATTGAAATATTAAATGCTTGGCCAA^ 

GCCAAGGTGGGCAGATAACTTGAGGTCAGGAGTTAGAAAACZAGCCTGGGTGACATGGTG 

5 0 AATTAGCCAAAGTGTGGTGGTGTACCTGTAATCCAGCTACTCAGGAGGCTGAGGCATCGGAATCATTTG 

CTGCAGTGAGCCGAGATTGCACCACCGTACTCCAGCCnXSGATGAGAGAGTGAGACTGTrr 

GTGCTTGTGTCATATCCTTAGATAAGTTAGGTGGTTGTGTTCTGTCCTAAGAAAAGATGTGATTCATAAGGAA 

TTTCACTTGAGAACTTTGATTTACTACATTCTTCCCTTGGCAGCTAAAGCTGT^ 

5 5 GCCCAGCTTTCCGCTCCCCCGGGGAAACTGTCTCAGCCCTGGGGAA 

CACTCCTGTAGCTGGAGCCCCAGGGCCGAGTGGGGCACTTTGTCCTTCTACACCCTGAGAAAGACAA 
CCCCAAGTCAGCTGCCCTGGCCTCCCTGGCAGCTGAAGCTCTGTTCTCTTCCT^ 
GGGGCGGCAGCGCCACGTACATGGACCAGGCCCCCTCCCCAGCTGTGTGTCCCCAGGCTC^ 
GTGGCATTCTTGTGGGGGTCCACAGGGGAGGAACTGGGGGCI^GGCCCCAGGCTG^ 

6 0 AGAGCTCCTCTAGGTCTGAGACCACACAGCCCTGGCCCGGGCTCTGGCCTTTC 

TTTAGAGTCGGGTTGGGCAAATGTCATCTGTTCCCCTCTCTCAATATG 
TGTGTCTGGACAGCCGGTGTGAGAGGTAACAGGGGAGGGTATTTCAGTCTGAGTTGCTGTA 

TTTTCCCATGAAGCCCCAGGCTGAGGGCATAGGCTGGCTCAGACTGCTCCCCTGGTGGCCTGTGGGGCTTGGGGTGG 
GGCTTTCCCCTTCCCTTCATCCCTCTTCTTTCCCTCCCAAGCCCTGACTCAGTCCTTGAC^ 

6 5 CACAATGGACGTAGCGCGGCGTGTGGAGGAGCTCCTGGGCCGGCCAATGGACAGTCAGTGGATCCCGCACGCACAATCGTGAC 

GCGACCTCTCCATCTTCAGCTTCTTCATCTTCACCAGAGGAATCACT 
AAACAATACTCATAATGTGAAGTGTTAATACTAGTTGTGACCTTAGTC 

ATGTGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCGTGTTTGCACGTTATGGTGTj I lTC 
TTGTAGCAGAGGGGCCACAGACAGAAGCTGTGGTGGTTTTTACTTTGTGCAAAAA^ 

7 0 CCATG TGT CTT AAGAGTGG CT GGACTTTGAC ATGTGGCTGTTTGAAT AAGAG AAGG AC AAAGGGAG G AGAAAGC ACATGTG CT 

GTGAGTCTTCGTCACTCTGTCTGCCAAGCAATTGATATATAACCGTGATTGTCTCTGCTTT^ 
TTTGACAAAGAGAGCCTTCCCTCTCCCCCACCCCTGTGTTCT^ 
rAAGGGAAGTCACAAGCTTTCGGATGGGCGGTGGCTTTTCACAAAACATTTAGCT 
CCTGCCCGCCCGCACCCTGGAATTGCCACTCAGTTCCTCTi^^ 
7 5 ACGTAACTGAGAGCTTACAGTGCCAATGCCGTTTGTGTTCTGGC 
TTGCCTGGTTACCCAGGAAGCTGCTGTTCCGGCTGCCC
TTATTGATGATGCTTTTTGCAAATGTTGAATCATGGT^ 
GAC!ATCAAAATTCrCTCTTGTGCATTGGGCCCTGGGTCAT^ 

CTTGAAGAGT AGGTAGCAGCAGTGTGGGCTGAACCTAGGCCAG CTTGCTTAGCGGGTCACCCTG CTGTGAAGTCCTGGCAGGTGTT 
5 GGTAATGTGTGGAAATCCAGTCAGCAAGTTTGCTGGGGAGTTTGATAAAA^ 
TGTTCCACGACTTCITCrrcTAGCTTTACACCAGAAGGAAGGAATGGGCT 
TATTAAAATAGCTTACC^TAAGGCCTAAAAGGTCACCCCTCGG 
TGGCCCCAAAAACCTTATTTTTATACATGAGAGTAAATAAACA^ 
CCGTTAAAAATATTCCTCTATAGTAAATTATTTATTGGAAGATGACTTTTTT 
1 0 CTGATTTATTTTTCTATGCCAGGCAGTAGAGTCTCTCTGCCTCTGAGGAGCAGGCTA 
CTCAAGATTTGATGAAAATTCCAACGATGAGGATGGGT^ 

CTCGTAGAGTCACTCCCTGCCCGTCTCCCAGAGATGCTTCACCAGCACCTGCCTCTC 

GTTGGGGGGTCAGACTTGATACACTTTCAGGTTGGG^ 

GCTCACTTTCCAGTTGGCACTGGGTTGGGGAGGAAGAGAGCTGATGAGTC 
1 5 TTGTGAGCTGTCCTCGGTGTTACCGAGGCTGTGCCTAGAG^ 

TtCTCTCTCCTCCTTGTTCCTGCAAACCACAAGATAAAGGTAGTGGTGTG 

CACACAAGTCCTCAAAAGTTTCAGCTCCGTGTGTGAGATGTGCA 

CTTAAAATGAAAGTATACTTTTTATAATTTTTCTTT1TAAAACTTGGT 

TTAGTTATTTAGCCACCAAAAAAAAGTATTGTGTACAATTTGGGGCCTCAAATTTGACrCTG 
2 0 ATGCAGAGTTACAGTCACAAAGTTGTGTATTTTATGTTACAATAAAGCCTTCC^ 

TTTTTCTGTCACATGCCAAGGGAGGGCAGCTTGAACACGGAAAAG 

GGTGGTTGGGAGTTGGCTGCCTCAGAGGGCACGTCATTAGCrCTGATGCCAGATGGGGCCT 

TTTTGAAAATAAGAATGATGGTCTCCTAGCCTCCTGACTCCAGAAAGCrTGTTAGCCAGGATG 

TCTGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACT 

2 5 ATTT AGAGT AC CTGAC C CTG AGTTCAGCGTTGTGTGTGGGATG AG AAATGGTG C CTT CC CTG CAG AGG CTCACAGTTGGAG AAAGG 

CAGCTACCACAACTGATTATGCTAGGATATCGCTTC 
AGGCCACTGA^CAACTCAAGCTTTTTTTTTATTTTA 

CTAATTTTTGCATTTTTAGTAGGGACGGGGTTTCACCATGTTGGCCAGGCTGGT 

CTTGG CCTCC CAAAATGCTGGGATT ACAGG CGTG AACCACCATG CC CAG C CTGGAG CTGG CT CTT AAGGAACAAAT CAGGGCCGGG 

3 0 CACGNNNNNNNNNNNNNNNNNNNNAT AG GG C CGGG CACGGTGG CTCATGC CTG T AATCCCAG CATTTTGGGACGCCAAGG CAGGTG 

GATC^CCTGAGGTCGGGAGTTCGAGACCAGTCTGGCCAACATGATGAAACCCCGTCTCTAC^ 

CtntSGTGGCACGCGC CTGTAATCCCAGCTACTTGGGAGG CTGAGGCAGGAG AATCGCTTGAACCCGG C AGGCAG AG ATTGCAGTGA 
GCCGAGATCATGCCATTGCATTCCAGCCTGGACAACAGAGCAAGACTCTGTCTCAAATTAAAA 

CACTTGT AGGTATCT ACATAAGAAC CATGAGAACATATATGTAC AGAAACACTTGTGACAAATGTTCAT AACAGCATT ATG CATAA 

3 5 TAGCCAAAGGGCAGAAACAACCCAAATGTCCATCAACTGATGAATAAACAAAATC 

GGC^TAAAAAGGAATGAAGTGCHGACACACAGGACATGATGTTC 
CCAGGCCGGGCACGGTGGCTCATGACTGGGACTCCAGCACTTTGGGAGGCT 
CCAGCTTGGACAACACGGTGAAACCCCATCTCTACAAAAAAATACAAAA^ 
CTACTCAGGGAGGCTGAGGTGGGAGGATCACTTGAGOTGTGATCATGCTACTG<^ 

4 0 AAAAGAAGCCAAGTACAAAAG ACCG CAT ATTATATGATG CCATTTATAGGAAATGTCTAGGATAGGCAAATCTATGG AGACAGAAA 

GTAAATTTGAGGTTGCCTAGAGGCTGGGAAGGAGTTTGGAGGTAATAGTGAAAGGATACAG 

TIT TTTT ri GTTT G TTTTTC?TTGAGACAGAGTCTCGCrCTCT 

CACTC CCTCC CAGTTTCAAGCG^TTCTCCCACCTCAGCCTCCCA^ 

TTTTGTATTTTT AGTAGAGACAGTGTTTTGC CATGTTGGCCAGGCTGGTCTTGAACTCCTGACTT CAGGTGATCCACCTGCCTTGG 

4 5 CCTCCCAAAGTGCTGGGATTATAGGTGTGAGCCiACCACGCCCAGCCTGAGTT^ 

TTATTTTATTTTATTTTATTTTATTTTATTTAACG 

TGTTTTTGAGGTAAGGAAAATGTTATTATCTGTGGTGTTGGTTGCACAT^ 

TGGGTGATTTGTATGGTGTGTGAATTATATCTAATAAAGCTGTTACTGGGCCAGGTCT 

TTGGAAGGCCAAGGTAGGAGGATCXSCTGAGACCGGGAGTTCCAGACCAGTC^ 

5 0 AAAAAAAAAAAATGTTTITAATTAGCrGGGTGTGGTGGCACATGCCTGTAGTCCT 

TTGATCCCCGGAGTTCAAGGTTGCAATGAGCTGTGACCAGGCCACTGAACT 

ATAAATAATAAAAGCAAATGCAAGATCTC^CGAGAAAAAACGTGAAAGTGAGGTGAGTTAGCC^ 
AAGAAGCTCAGCTGTGAAGGGAAGAGAGGTGCTCAGCAAGTAGTAGCTCT 

5 5 TAAAATTAGTTGAGCX3TAGTGGTGCGTACCTGTAGTTCC 

TGAGGTTGCAGTGAGCTATGATCGCACCACTGCACTCCAGCCTGGGCAATAGATC^ 
AAGAGAAGTTTGTAATTTTCCTTGGGATAAAGGCAACAGGCTAAC^ 
ATATGTGACTGGCTGC^TGCAGTGGCTCAGGCCTCTAATCCCAGCACTTTGGGAG^ 
GTTTGAGACCAGCCTC^CCAACATGGTGACATGGTGAAACCCCGTCTCTA 

6 0 GTGCCTGTAATCCCAGCrACTCGGGAGGCrGAGGCAGGAGAATTG 

GCCATTGCACTCCAGCCTGC^CAACAAGAATAGAACTCTGCCAAAAAAACAAAAAA 

GCCAGCAAG CT AAC ACAGATG TTCAC CT AAT AAC CATG T AAAGTT AAG AT AAACT ATGTGACTGG CTGGGTGT GG TGG CT CACAC C 

TGTAATCCCAGCACTTTGGGAGGCCAAGGAGGGTAGATCACTTC 

CCATCTCTACTX3AAAATACAAAAATTAGCIGGGCGTGGTGGCAC 

6 5 TCGCTTGAACCTGGGAAGTGGAGGTTGCAGTGAGCTGAGATTGTGCCACT 

CAAAAAAAAAAAAAAAAAAAAAAGATAAACTATATGTGACAAGTCCTCTTACTTGAGGACT 
TATGTAATGGGTTGTATGCGCTTGGCTATACAAAAAAGTAAGATTTCCTCC 
TACATCACATTGTC^TTGAATGCTTATTCAATAATAAAATTGTTTTCTTTCT 
GTTGTTGTTTTTGTTTTGTTTGTTTTT^ 

7 0 CAACCTCCGCCTCCCAAGTTCAAGTGATTCTTCTGCCTCAGCCT 

TAATTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGTCAG^ 
CAGCCTCCCAAAGTGTTGGGAATACAGGCGTGAGCCACCGCGCCCAGCTGAGT^ 
CCCaGCTCAAGTGGCTTAATCGCGTTCTT^ 
GGTCCTAACTGCCCCAACTGCCAGCCTtXJGCTAGCACCCGAC 
TGTGTGCTGAGACCCAGAGTCACCaUSGGGTCTCCGT

TGGACCCCACAGCCTGGGGCTCAGGAGGGACAAGGAGCTGGGACAGGGTGGATCCCGW 
AAGAGATATCGTTACGCAGCTGCTGTGTGCCCGACCGACC 
TCC lTlTnT T T ClWiVnTm 
5 GTAGCCTTGACATCCTGGGCTCAAGGGATCCTCCCATCTCAGCCTCCCAAGTAGCTG 
AGTTTTTCAATTTTTTGTAGAGATGGAGTCTCCCTATGTTG 

CAGCCTCCCAAAGTGCTGGGATTACAGATGATAGCCACCTCACCCGGCCCACCCCT 

TCC CATTTTGCAGAT CAGGAAACAG AGCTCAGTG CAGC CCACTAAATTGCT CAGGGC C CTACAG CTAACAAGCGG CAGAGG CAGG A 

TCTGCACTCAGGTCAGTGGGACTCCAAAGCTTTGATAGTCTCTCCTTTC 
1 0 TTCATTTTTCCTCGGTTTTCCTGTATGCAGTAGAGCGTGG 

CCTCTTCCCAGGAGCTGCTTGGAGATGCTGCTGTGGCCACTGCTGCTGCT 

GCAGCGGTCCCAGGATGCCAGGCTGTCCTGGCTTGCTGGCCT 

GGCAGCGGCGGAGGCTGGAGCAGAGCACGCTCCATGTGCACCAGAGCC^ 

CGCCCCCACTGTTCCCTCAGAAGGAGCACAGGTGTGTTTCCCAGGGTCT 
1 5 CGTCAGATTCCAGAGCCTGATGGCCCCCTTACCTCCTCCACTGCCAACGGA^ 

GTCCAAGATGAATTTCCCATGCTGGGTCCTTACATCCn'CTCTACCCCAGGCCTATTCT 

CTTGCCTGAAACTCITGGGAAACTGGCTTCCTGAGGCGTTTCTGT^ 

TCCCTCTGACCAAGGCCAGCCAGACCCAGCAGGAAGACAGTGGAGAGCAGCCACTGCCCCCGACCT 
GCCTCTCTGCAGGTAATCTAGGATGGCCAGAGCCCAGTCCTGGCCACAGCCCAGTCCCAC^ 
2 0 CCTCCAGTCATGGTGGGCAAAGAACCCCCAAATGGGGGCTCTGGGTCTGGCTCT 

ATCCCCTTGGCCCCGACCCCTGCCTTGGCCTGGGAATACCCTGGGCCAGGTGGGCACCCCTGK3AA 

TGCTGGACG CACTGAGGTCCCCAGGGCTG AGGGCACTGGAGGCTGGGACGG CTGTCGAACTTCTGG ATGTTTTCTTGGGCCTGGAG 
ACTGATGGTGAAGAGCTAGCTGGGGCGATAGCTGCCGGGAACCCTGGAGCGCCTCTCCGTGA^ 
2 5 CCTAGAG CAGGGGCCACXK5GGACTGGCCCTTCGG CTCTGGCCAAAGCTGCAGGTGGTGGTGACTCTGGATGCAGGAGG CCAGG CCG 

AGGCTGTGGCTGCCCT CGGGGCCTTGTGGTGCCAAGGACTAGCCTTCTTCTCTC CTG CTTATGCTGCCTCGGG AGGTCAGTGGGAC 
TCAAGGGCAGTGAGGTGGGGGCAACGGTCAGTGGTCACATGACAAGCAGCCATGCAT^ 
CTCCCCAACAAGCTGCATGCrAGGGCCAGCAGCCTC 

GGGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGATGGATCGTTGA 
30 GGCCAACATGGCG AAACCCCATCTCTACTAAAAATACAAAAACT AGCGG AGTATGGTGGTGCACG CCTATAATCCCAG CTACCTGG 

GAGGCTGAGGCGGGAGGATTGCTTAAACCTTGGAGGCJU3AAATTATA 

GGCTGAGGTGGGTGGATCACCTGAAGTTAGGAGTTTAAGACCAGCCTGGCCAATGTGGTGAAACCCT 
AAATTAGCCAGGTGTGGTGGCAGGCGCCTGTAATCTCAGCTACTCAGGAGGCT 
35 TGAATCACTGAGCCGAGATCGCACCACTGCACTC 

HUMAN SEQUENCE - mRNA 

GGAGCCGTCACCCCGGGCGGGGACCCAGCGCAGGCAACTCCGC^ 

GCCAGACAGCTGGGCCGGAGCAGCCGCCGGCGCCCGAGGGGCCGAGCGAGATTGTAAACCATGGCTG 
4 0 GCTCCAAGGAGAAGCCCTTCATCAGATGCAAGCGTTATATGGCCAGCATTTTCCCATTGAGGTG 
TTGAAAGCCAAGCATGGGACTCAGTAGATCrTGATAATCCACAGGAGAACATTAAGGC 

GAG CTGCAGAAG AAGGCAGAG CACCAGG TGGGGG AAGATGGG TTTTT ACTG AAG AT CAAG CTGGGGCACTATGC CACACAGCTCCA 

GAACACGTATGACCGCTGCCCCATGGAGCTGGTCCGCTGCATCCGCCATATATTGTACAATGAACAGAGG^ 

ACAATGGTAGCTCTCCAGCTGGAAGCCTTGCTGATGCCATGTCCCAGAAACACCTCC^ 

4 5 CTGGTCACGCAGGACACAGAGAATGAGTTAAAAAAGCTGCAGCAGACTC^ 

GATCCAAGCTCAGTITGGCCCGCTGGCCCAGCTGAGCCCCCAGGAGCGTCTGAGCCGGGAGACGGCCCTCCAG 
CTCTCGAGGCCTGGTTGCAGCGTGAGGCACAGACACTGCAGCAGTACCGCGTC 

CTGCTGCGGAAGCAGCAGACCATCATCCTGGATGACGAGCTGATCCAGTGGAAGCGGCGGCAGCAGCT 
CCCCGAGGGCAGCCTGGACGTGCTACAGTCCTGGTGTGAGAAGTTGGCGGAGATCATCT 

5 0 CTGAGCACCT CTGCCAGCAGCTCCCCATCCCCGGCCCAGTCGAGGAGATG CT 

TCAGCCCTGGTGACCAGCACGTTCATCATTGAGAAGCAGCCTCCTCA 

CCTGCTGGTGGGCGGGAAGCTGAACGTG CACATGAACCCCCCCCAGG TG AAGGCCACCATCATCAGTG AGCAG CAGGCCAAGTCTC 

TGCTCAAGAACGAGAACACCCGCAATGATTACAGTGGCGAGATCTTGAACAACTGCTC 

ACCCTTAGTGCCCACTTCAGGAATATGTCCCTGAAACXSAATTAAGAGGTCAGACCGTCGTGGG^ 

5 5 ATTTACAATCCTGTTTGAATCCCAGTTCAGTG 

TGATCG TT CATGG CAG CCAGGACAACAATGCG ACGG C CACTGTT CT CTGGG ACAATGCTTTTG CAGAG CCTGG CAGGGTG C CATT T 
GC CGTG CCTGACAAAGTG CTGTGG CCACAGCTGTGTG AGGCGCTCAACATGAAATTCAAGGCCG AAGTGCAG 
GACCAAGGAGAACCTCGTGTTCCTGGCGCAGAAACTGTTCAACAACAGCAGCA^ 
CCTGGTCCCAGTTCAACAGGGAGAATTTACCAGGACGGAATTACACTTTCT 

6 0 AAACATCTCAAGCCTCATTGGAATGATGGGK3CCATTTTGGGGTTTGTAAACA 

AGATGGGACCTTCCTCCTGAGATTCAGTGACTCAGAAATTGGCGGCATC^ 
TTTGGAATCTGATGCCTTTTACCACCAGAGACTTCTCCATCAGGTCCCT 

GTCTTTCCTGATCGGCCAAAAGATGAAGTATACTCCAAATACTACACACCAGTTCCCTGCGAGTCT 
TGGATACGTGAAGCCACAGATCAAGCAAGTGGTCCCTGAGTTTGTGAACGCATCTGCAGATGCCGGGGGCGGC^ 

GGGGACTTCGATCTGGAGGACACAATGGACGTAGCGCGGCX5TGTGGAGGAGCTC 
GCACGCACAATCGTGACCCCGCGACCTCrCCATCTTCAGCTTCTTC^^ 

CATGAAT CGCTTCTCTTTTGAAACAAT ACTCAT AATG TGAAGTGTT AAT ACT CTGTG CATGG TG G CA 

CCAGCGAAGGGAGTGCGAGTATGTGTTTGTGTGTGTGTGTGTGTGTGTC 
70 CTC1ACTGTCTGAGAGTTTAGTTGTAGCAGA 

HUMAN SEQUENCE - CODING 

ATGGCTGTGTGGATACAAGCrCAGCAGCTCCAAGGAGAAGCCCTTCA 
GGTGC<^CATTATTTATCCCAGTGGATTGAAAGCCAAGCATCGGACT 
75 CCCAG CTCCTGG^AGGG CCTGGTGCAGG AGCTG C!AG AAGAAGG CAG AG CACCAGGlXiGGGGAAGATGGG i VrrT ACTCAAG ATCAAG 
T ACAT AT AAT CAATAAAT AAATTAT AAAAAAAAAAAA CTATTTT AAAAAAA

AAAAAAAAAGAAACCAGACAGTCTCTCCTTTAGTACCGGACAGACTAAGGGAC^ 

TCTGTGGGAGGAAGGGTGGGGCAGGGG AGG CCTGG CAAGTGGGT AGATGCCATGCCAGCATTGGGCTCACATCTGGM 
GCTCCCTTCAGCTTCTAGTAACCACACCCAAAGGTGAGGCGGATCTGTGTATCAGGGCAG 
5 TGCATCCTGGGATCCTGGTGTGTGCTGGGCCTCAGGCTTGGCTAGGGCAGGTGAGGACAGG^ 
CAAGTCCATCAGACAGCAAAGCCTGGTCTTCTGTTTCT^ 
ACGGCTTCCTGGTGAGGCAGGAAGAGCTCGGGTTGCCTCCTGGGCT 

GGGCCCTGAGGCTATGTCTTCATGGGTACGGAGGTTAGTGTGTGCTTACATACTGTATX3TATG 
TCGTGCACATTTGTGTATGGCTATGTGCATTCTGTCTGTCTACCCATC 
1 0 GTAACCGCAAACCTCTGATGTCTGGTAATATTCCT 

CGTACTGTTCCTTTCAGTTGTGGCATGTCACCAAGCACCTGGCTGTCCCT 
CTTGGTGTGTTTTAGCCTTGTTGTCTTCTCTCTGTGGGATCT 

TCAGCAGGCAGGTTCAGACAGGATAGACAGTAGAAAGGGATGGAGGGGACTGAGAACCATCT 
GTCCTTTCCTTCTTTTGCAGGTCAGACGGGAACACCGGCAGCCTO 
1 5 GGGGACTCAGGAACCCCACCCCTTATGGGCTCCAGTCTGTGCTGCTGGCCCCAGCTCTGGG^ 

GG CTGTTCTTGGCCCTGGTGGTAGTGTTGAGAACCGCGGGTGGCATTTGCACCT 

TGTCAGGGCAGGCTTAAGCCGCTGGGATGTCAAAGGATGCTGTGTGACCAGGAG 
2 0 GGTGTATGTGACCTCAGTGGCTCACGGCTTACTTAATGCTGACGACATCCCAGGTACT^ 
CTCAGAGAGGTCTCCAGGCCCGATTTCCAGGATGGGGACACACTGGCTGTCCC^ 

G C CGGGAAGCAAAGTGCACTGAG AATGT C CAACTCTG AAGCT ACCTGGAC ACTCCXKX5TGCTGACTTGT AGG CAAGTTCAG ATG AA 
GTTCATTG ATGGGGGAGGGGG CAG AATG CTG AG CT CAG CATCAGAG CAGG AACCTCCACACT CAGG AAGTTG ATG CAGGAGG ATCG 
TGAGTTCCAGGCCAGCCTGGGCCACGGAGAGAGACCCTGTCTAAGAAGACCAGTATTGTGTATGT 

2 5 TTAAACTGGTGCAGGTACGAAAAAGCATCACTTGTCTCTGGAA 

CCCCCCCACCCCCACCCCCATCATGACTTTTCTTTTTTTAAAGGGGG CACCGGGG AAAGAACATAAACCCTG CAC ACACT AGGCAG 
ATGTTCTACCAATGAGCATCTCAGCCCCTCGTTCCAGAGTTTC 
GAAAGTTCTCAGTCATTACGTATTTTGAGCCTACAGTGGGTGGTTAAAGACACATT 
GTCAGGGTTTCCACTGCTGAGAACAGACACCATGGCCAAGGCAACrC^ 

3 0 TTAGAGATTCAGACCATTATCATCAAGGCAGGAAGCATGGCAGCATCCAGGCAGGCATGGTGCAGGA 

CTTCATCTGAGGGCAAACAAGAAAAGACTGATTTCCAGGGAGCTGGGAATGATTCTCAAAGCCC^ 

TCCAACAAGGACACATCTCCTAATAGTGCCACTCCCTGGGCCATGCATATTCAAACCACCACACCAAGTGTCT 

AG CCC CAGC ATGAG AGGTTTGG AT CTGGG AGACAG AAAAGG AGATCAGG AGTAT AG C CTGGGGATGCATG CCATCCTCAGGG AT CC 

AGCCGTGGGCTAGCTCCTCTCTTGTACTTGGTGGCTGCATTGAGGTTACCTGGGGTGGGGGGG 
35 NNNNNNNNNmJNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NIWNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNfoNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNAACACACACACACA 

CAAACT C C CAC AAGTTGT CCCC ACACAG C CACTT ACACACCAT C ACACATT CACATCT AC ACAT AAT AAAG ACGT AATTGAG C ACA 
AGGAGATGGCAGCAGCATCGTGGGGGTGGGGAGTAGGTAGGGGGAGGTG^TGAA 

4 5 AAACGTCAAAAAAATGTAATAGAAAGCTTTAAGGCTGATAAAAAAGAACAGATGGAA^ 

CTATACTTTCATTATGTGGGGCTAGGGGGTCATAGGACGACAACCTTGGGTGT CTTTCCTTGGGTGCTGG CCACTTTT AAAATTTC 

ATTTTTTTTGAGACGTGGTCTCTCAACCAGCCTGGAGCTTGTTAAGTAGACTGG 

CACACCTAGCCTTTTCT^CATGAATTCTGGGGATCGAAT^ 

CTCCCCAGCCCTCTGCTGCITTGTCCGAAGTTCAGGGCTGTG 

5 0 GCCCTAAGGATGACTCTGGGGTCTCAGTCCCTGCCTGCTCCCTGAACCTCTTCCT 

AGCTTTGAGCTGTGGTGTGCCTTCTGTTGCTGTCTGCTTTATC^ 

GGCTCTGACACGTTCCTCCCTCACAGGTATATTGGAAGGTCTCTGAAGACAAAAAATCCAAGTG 
CAGGTAATTGACCGCACGTACAAAGAGCCCACGCACACAGTTCGCAGTTATTC 
ACCTGGTATC^CGTGAGTCTGGTGGGCTCGGTCCTGCTGTCT 
55 GCATCCTCCGCACTGGAGAAGCCTGTTCTCACGGCTTC^C^ 

T AGAATTGG C CT CCC CT AT ATGTTT CTGGC CTT CTTG AACCAG CTG AGG C CT AAGAGG TGTTGGGGATTTGATG GGATTTGTT ATT 
GTTGGGTTTATTTTTTGTTGTTGTTGGGTTTTGTTTTTTTGTGGGT 
GTATGCGCACTCAGGGAGACTGAGCCTTGTATCCITTTTCTCAGGAGCT 
CCAGTGTCCCAGATGTTTTTAGATCACGAATGTTTTTAGATCACAG 

6 0 GTGGCAGAGTTTATCAGCTGTTAGCTGATTTCAGAAACAGGGATTA 

AATCCCAGCACTCTGGAGGCAGAGGCAGGCGGATTTCTGATTTCGAGGCCAGCCTAGTCTA 

GCTACAG AG AGAAAC C CTG TCT CGG AAAACAAAAACAAAAAC C AAC CACTAGGTT AAACTTT CTTGG CTAAATGG T AACTGAACAC 

TTTCCCCGTCTTACTGGTCTCTAACTGTAGTGTTCACTCAGGACCCAAGGCAGGCAGG 

ATCTGTGTAGCGGAGTGAGCTGATGCCTTGCACCACTGGATGTGGGGACCCACAGTGACCTTGGCATGGAGC 

6 5 CTGT CCATTTG AGGG TTGTTGGGG AAAAAAG CAACAACAACAACAAC AACAACAAAAAC C CCTTT CCCAACTTGAG AT AATTTCAG 

GCCATAGCC CAT AGACTCT AAGTGCTGTTCTCCTGAAATAGCACATGGG CTCCAAATTGCCGTCTTTGTTTTCCCACTGGGCCCTC 

GAGTATAGGGTGGGGTCCCCAGGAAGGTAGAAGGAGCCTGTACAAT^ 
TGTTCAGCAAGGGTAAATACATTGCCIATCTCTAGTGGGCTCCCCAG^ 

7 0 TGGTCACCAGATACAGGCTGGCCATCTACTTAC^ATAGAGCTAAAGTCATGCCTGGCACCTG 

AAAGTTTCCGTTGCAGAGCCTGGCTCATGGTGCAGTGTGCC^ 
AAGGCAGCAAATGACTGGCCTTTACCATGTGGATACAAGGGTTCTrTGATGG^ 

AATG CCT AAACTACATTCG AGT ACTACAGCCACTAAGC1AGCACTTCCCT CTATGTGTGTGGG ACCAATGCGTTCCAG CCCACCTGT 
GACCACCTGGTAAGACC^CAACATATC^ CAG CCTGGCTAGCGACTAAAGCAG AGCCAGCCTGTCT 

7 5 CAGTCAGG CTTTCTATTC CTG CACAAAACATCATG CATGACTAAGAAGCAAG CTGGGGAG GAAAGGGTTT ACT CAG CTTACACTTC 
CACATTGCTATTCATCACCAAGGGAAGTCAGGACT
TGCTGCTTACrGGCrrGCTTCCCCTGGCTTGCTCAGTTTG Cr 

CAATGGACC CT CC CAC CCTTGATCACT AACTGAG AAAATG CCTT ACAG CTGGATCTC ATGGAGG CATTT CCT CAAGGGAGG CT CCT 
TTCTCTGTGATAACTCCAGCTrGTGTCAAGTTGACATGCAAAACCA^ 
5 GGACTAAGAACTATACAGTCGGATTTCTGACAGCAACAATTGCCCCGTrCCT 
CTATGGTAGCCATGAGGTAGCTGTACTTTCCAAAAAGAA 

TG CAAAACATC CAG CAGAAAT CAGGG CAATCTGATT AAGGAAG CTGG CTGGGGTT C C CAG CAG AGTT CTAGTCAG AG CAAGAATGT 

CCATTTGGGTGGGACTTTTAAATAAAAGJ^CAAACGGCAGCAGAGAGATGCC 

TGACTGGGGAAGTTGAGGCAGTCAGCTGGAGCCATCTCTCTGT^ 
1 0 ACTCCCCAACCCCCCCACCCCGCCACTGTATTGTGAGA 

CAAGGGCACAGTGTTTCTCAAGGGCACAGTGTTTCTCAAGGGCACAGTGTTT^ 

AGAGTCAGGGACCTTGGAAGACACTGGGAAAACATGCITGGGT 

CCCTTGAGCTCACACAGCACAAGAAAGCTTACCAATACGA 

TACITGATAAAATAAGTAGCTTAAGGGGAAAAGATTATCGAAGGCACCTCAG 
1 5 GGCCTGTAAACACCCAGGCGCACCCCCAGAAACCCGCTTTCACGGTAGGTCT 

GAGCCCATGGGTGACATTTTACAAATAAAACGATAACAAATATGAAAArc 

GATCCAAATTTTGCACCAAATGCCAAAACCTTGGCTCTTCCC^ 

AGGAAAGGTAGCTAGCCTGTTCAAGGAAAGGTTGGTCTGCTGAGCACGGGGTGAGTGAT^ 
T^GAGGCAGGAGAGTAGTTCATTGCTCTGCTGCAGCCGGCTGCAGATC 
2 0 GCCTCATTCACCTAAACTCAGCCAGCTGAAGTTTAAAGC^ 

CTGCTGTGAACAGACACCATGACCAATGCAAGGCTTACAAAGGACAACATT^ 
CCATTATCATTAAGGTAGGAACATGGCGGCAGCCAGGCAGTGCCTGGTGCAGGC^ 
CTGCTAGCAGAATACTGGCTTCCAGGCAGCTAGGATGAGGGTCTTATAGCCCATACCCACA^ 
ACACCTTCTAATAGTGCTACTCCCTGG^CTGAGCATGTACAA 

2 5 TGATCCCCAGGACCACAATACACTCACCACATGCCTATACACCATA 

CGTACATACACACACACAAACTCCCCTCAGTTGGGGATATAACTAGATAGAACAG 

ATTTGTATAGATGTCTGAGGAGAGAGGCCACTGCACAGGTTTGACTGGCTAAACTAACT 
GTGTGTGTGTXTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAC^ 

3 0 AGCATTTCTCCTAAGAATGGGGACATTTTCCTGCCAACTGGTTTCATCATTGCAGATAA 

CCCCTCCCACCTCCCCAGTATCCTCCGCCCCTCCrc^ 

GGTGGGTCACACTGTCACCCCAGCTCACACTTCCTAACCTGACTGGTGGCCTTTATT 
35 CACTGTGAGGTCCTGTGTGCTTGCTGGCCTCTGGCTTCCAGC 

GCCTTGCTCCTGGCCTCTCCAGTAAGCAAAGTGTAAAGTGATGG^ 
GTCCTGTGGGTGGCTCCTGTCCCTTTAACGGGTTGCATCTGCTGATTTCTTCT 

GTGAAAATTCTTCACCTTCGTTCTGTGGGCATTCCCCCCCCCCCCCACACACACACAGTTTGTCACTTC 
ATTTTTTCCCCCACATGAGAACATTCTGATTCATTATGTGGTGATGTTTGGGTGGGTCT 

4 0 TGTTTAG T 1TU T TTT T T TT TTTT 1T T T GTCAGCT 

ACCTCATAAGATAAGATAGGTCTGTAGTCAAGTGTCTAGGGGCATTT 

GGGTGTATAAGAAAG CAGG CTG AG CAAGCCACG ATGAG CAAACCAGTGAG CAGCAT T CCTC C^TGGCCTCTGC CT CAGTTC CTG A C 

TCTAGACTGCTGCTGCTTTGAGTTCCTGGCCTGCCTTCCCT 

ATCACAACAATAAGAGAGCTAAGACACGTGAAATAGGTTTCCTTGTCAGGTGGA^ 

4 5 AC CCTCATGAGCACACTGTG CCAGAAGCATGGACAAACACATTTGTTGTGT CCTCGCACAGTGCCAGAATTTATTCAGCAACAATA 

AATTCCCAACG ATT AG CAGTTT AGT AACATCAGG TT ATCACTT AATCCTG CTT CCCT CAG TGAT CAGGCCG AGCAAGGCAAATG CA 

GATTCTTTATTCCCGTCTTTTTGTTTGTTTGATAACCATATTTCTCT 

GACTGGCCTCAGACTCAGAGATCTGCCTCCCAAGTGCTGAGATTTAAAGCGTTCGACACC^ 

TTAGTAACGTTAACTG TCCAACCACACAGCACAGAGAAAAT ATGTGT ATATGG CATGTAT ATGGTTAT AGAGGGAATACAGGTTAA 

5 0 AGTCGCTTGAGGTTTCAGACCTGGGCTCAGAGAATCTGCACT 

GTGTTAGAGAGCTAGACACCTCCGAGAAGCTGGAAGCTAGAGTCTCCCAAGGCAGAAGCATGTCTCTGCT 
AACTGATAGTTT'CGTGTCGGGCTGTCCATCATAGTGTTGGGTGTCT 

CTATTAATT CGT AGGCCTGG CTTC CT AGGCCTTTTTCGTACAACTGTGTGCCCTGACACACTCAGCTTGT 
GTGGCTCCCCTTTCTGTTCCCAGGAGTAAACCGTTTATCCATCTGTTTGGAGTGGATGGTG 

5 5 TCCACCCAGGGCCTACTCATCCTTTTCCCCAAAGTGGAATCAAAAGCTC 

GGAGACCGTTTTATACAATATGGGGCAGCTTAGAGCCAC^CACAGCCTGATCCCGGCATGACCT 
GTAGAATGGCTTCTTCAGGAGCTCCTGGTGGTGGATGAGCGGGGGTGAGGGT^ 

GTTrTGTGAGGACTGTAGTTGAGCAAGCAACAACTG ACCCTG CAT ATCTCAGG CTGTCACACACATGAGATGTCCTTCAGCTGTCA 
ACGGTCCCAGGGTGGGACCC^CATGGATCAGGGCCCACTTACCCCACACTCTCGTCTTCT 

6 0 TGATACAGGGACAGGTCCTCAGCACCGAGAGGTCAGATGTTTTCTTTT^ 

GGAAAAGTGAAGATGGCAAAGGAAGATGCCCCTTCGACCCCGCCCACAGCTA 

GG CTCTTTCACT CT CTGAGTTTATGG TGG CATG TG ATGTTG C CAC CTGGG ACACTCAGGT CTT C CCT CTTCAGCCCTG C CTT CCTT 
GGCATCATCTTGGCACGAGCTCCCTTCAGAGAAGGTGAGCGACTTGTGGCACAGACT 
C^GAGGGAGACTCAGGCCCCACCCTCTCCAGCAGCATAT 
6 5 GTGGCTCTGTGCCTCCCGCTGCGGTCCTGCATATCTGAGGCTAGGGGTCACACGCT 
CAGCCAGTGTGAAGGTTCAAGCTTCACTGTCAATGTATAACATTCAG 

GTCATGTGTGACACTCCTGCGCAGTCATAGGCTAAAGTATGGGTTCTGAGTATATTAAAGTAG 
GCAGTCTAGGTGCCTGCACATCTTTTTTTACTTACIAGTGCr^ 

ACTTGCCATTGCTATACTGACTGTGG CAAATTCTCATTTGTGATT AG CT ATT ACCCTG 
70 TGACTGATTGTTCTC^TC-AAAGATGCTGATGTAGGAAGCAGGAATGCAT 

TATAAGGAATTAATGAGGGTATAAATGAATTATGGTAGCCTGGTATGG^TGTGATAGCCTAGCAGAGG 
TTACAGTCCAGCCTC4GGCTAATGTCAGAACTTGTGTT 

TCTCTACTCCCrrCTCCTCCACCACAGGGGGCGAGCTCTACTCTGGGACGTCCTATAA'nT 
GAAACTCTTCCCACAGTCCCTTGAGGACGGAGTATGCCATCCCGTGGCTGAACGGTA^ 
GTCAGATCCTAGGCAGTG CTCTTTGGAACTG CTCAAGGCCCG CACCAGAGTAGCTGTCTTGCTCACTGCTAATAGCATTGTTTAAA
TCTGATTCTCTGACATACCCCACCCAGCCCCACTTCCACCCCGTCCCCCAAATGCAGTTTGGG^ 
AATGGGAAGTTGAGAATGTAGACACTGAGTTTGGGTGTCATCrTAGATGTGG 
TGTCTTTACTCCCAGACAAGCATATCAGK3GTGTGGATGATGGGGGCCCTTTGTG 
5 TGCAGGGCTTTTTCGAAGCCCCTTGATCTGGAGTCAGACCCTGGAGTCT 
AGGAGTCTGGAAATCCTTCGGGTGGCrGATTTGGCCCTTG^ 
GAGGCCTCACCCGGCTTCCTCTGTGGCTATCTTGGTGAAGCTGAGGAGGGTGGCGG^ 

G CAT ACAT ACTGTAGACGTCCT AG AACGGAAGCTTCCAGG CT CT AAGG CTG CCCTTG CCAG C CTG CAG CACCC CAGGGAG CAGGT A 
GAGAGGTGGCTTGGTAGCTCCGTGTACAGAAACCATCCTGTGATGCCACCCCACAGAGCCT 
1 0 AGAAAAGCCCAGATGGTCCGGAGGGTGAAGATGACAAGGTCTACTl'L 1"1 1 1 1"1 ACGGAGGTATCCGTGGAGTACGAATTCX3TCTTC 

AAGTTGATGATCCCGCGAGTTGCCAGGGTGTGCAAGGTGGGTGCTGTCTCCCTGTTCCCACC 
AAGATTCCTCTCTGGAAGTGTGGCCATCTCCTATGGGTAGAGACCCCrcCGG 

ACCAGCAAT AAG AGTTTT AGAGGT ATGTG CATGGTTT CT CTTG AGAC CCAAAGT AGG ATTGGG AAG ATtSG CTT AGTGGTT AAAATG 
CTTACCACGGAAACAAAAGGACCAGCGTTCAGATCTCCAGAACCTATAGATGCT 
1 5 GCCTGCTAGTGGGGCTGTCCTtATCGGCAAGCTCTGAGTTTC 

ATCAACTTCTGGCCTCCAGATGC^CCCACACACGAACACTCATATAGAGACACAC 
ACTCAAAGAGGAAAAAATTAGTCAGAGTGAGCACTTGTGATATTT^ 

AGCTCCCTGAAAATGACTAGGATTGAGCAGGCAAAGAGGGCACAAAAGGAGCT 
2 0 GTCTGGGGGGCCAGGGTCTGTGGTGGGAAGGAGGCCTCCCATCACTTGTTTAA 
CAAAGCCAGTGGCACGCTGATGTGACACAGATGGGGTGGTTTTC 
CTGTCCCACATGGAAGTATTGCTTTGGTTACATAACTTGTCTTCAGTATTCT 
TGCTTCTGTGGCTAGTCATCTGTCCTGCCrCTACC^TGTC 
AAAAAAAGTGGACCTCCTTCCTAAAGGCCAGGCTGATCTGCTC^ 

2 5 TTTGTGCTGAGGGCCCCGGGCCTCAAGGAGCCTGTGTTCTATGCGGTCTTGACCCCACAG 

GCAGTTGGGGCCTGGTCACCCTGACGACCTCCTGCCCTCCrGA 

AGCGGTGTGCGCCTACACACTGGCCACGGTGGAGGCAGTCTTCTCCCGTGGAAAGTACATC 

ACACCAAGTGGGTGCGCTACAATGGCCCAGTGCCCACTCCCCGACCTGGAGCGGTAAGCAGGGTCT 

TCTTCTCTGGATCTCAAGGCCCCCCAAGCATGAGACCACACAGATCCT 

CACTGCATG CTGCCCGG ACGAGAG AGTCT ACCGCCGGG CTCCTTTGAG AAGTGGACATTGCACAATGGCAGG CTTGCTTCATTTCC 

CATTTCCTGCAGACAAAAAAAACCTGTTTGTCAGCCTGG^ 

TTTCCTCCTAGATACAGTAAATTAGTAAAGAAAATTCTAGGG 

AGGGGCTTGGCTCTTGAAGCTCTCCCAGGATGGACTGTTCTGTGAGTGTCTGGCCT 

3 5 TCAGGTGTTCAGTAAAACATGGCTAGGGAGCTGGGACCAAAAA 

ATCTGGCTTTCTCTCCTCTCTCTCCTCTCTCTCTCTCTCrrCTCTCTCTCTCT 
CTCTCTCTCTCTCTCn'CTCTCTCTCTCTCTTTCTCTGTGTGTGTGTGTCT 
TTCTTCTCATAGAAAAGCTGTGAAATGAGTGCTTTITAATCTGGAAGTT^ 
ATCCTGAGACAATTATGTCCCGTGGCTTTAAAAGCACAGCTCTTTC 

4 0 AAGTGTGACCATGGCCCTCCAGGGCTTCTCTTCTGTGTGCTCAGTTGAAGCTA 

AGTGT ATCG AC AGTGAGGCCCGGG CAG CC AACT ACACCAGCTCCTTGAATCTCCCAG ACAAAACACTGCAGTTTGTAAAAG ACCAC 
CCTTTGATGGATGACTCAGTGACCCCGATAGACAACAGACCCAAGCTGATCAAAA 

CAGGACCCAGG CCCTGG ATGGGACTTT CTACG ACGT CATGTTCATCAG CACAGGTGGGTTC CTTGGT AAG ATTGGGAAATTCACTT 
TTAGCCATCTATAGGTAGGTTTGGTCTCrCCAGGTA 

4 5 TGGGATTATGAGTCTCTCTGGCCATCAGGCACCrGGGGCACrGGTrACT 

GAGTTAAATAGT CC CTGTTGTCATGGG ACTAGTGGGACrGGCCTGGTATTTCCr AATTGGTAGTGGGAGACTCTGCC 

GTAGGAGCCAGGTTCTGTCCTGCCCCGAGCTGCCTTTGGAGATGCAACTCTCTCTCA 

GGAGCATGGCAGCTGCCAGAGCCCCAGGGGCTACCCTTAGAGCTTCCCAGATGACTTGAA 

GCCC CT AACCTCCCCTGGAAG ACCAGTAAAGTG CAACGGCCACCTTAT ACTGAATGTGGCH'CIAGGTGTCAGGGTGTAACCTTG 

5 0 GTGAAGGTCATTGGAATCCACCATAGGACAGGGTTGTCCTCTTCTGAGTG 

TGAGTGGTTGCCGCCAAGGCTGGAGTCTCTCTCCTAGTTTCATTTCTGTTGCCGGGCGTGGT 

TCGGGAGGCAGAGGCAGGTGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTA 

AAACCCTGTCTTGAAAAAACIAAAAACAAAAACAAAAACAAAAAAATTAACC^ 

CTTGCTCIACCTGTGTTCrG ACCTTGCAGACCGGGGAGCTCTG CATAAAGCAGTCATCCTCACAAAAG AGGTGCATGT CATCGAGG A 

5 5 GACCCAACTCTTCCGGGACTCTGAACCGGTCCTAACTCTGCTGCTATC^ 

TGGTTTTAGGGX3TGGGGGTCAGGTCCAGG(X5CTTGTCTCCCTTGT^ 

GG CTTCGCAGCCTCntSGC CCAGGTGAACTTGGCGTAC AC AGCIATC CATGGGAGAGCCAGGGG CC 

TTGATTGTACTAAGCCACTGGAGCTTTTGGGAGGTCCCTAACCTGGGGAAATG 

TGTG CT GGTGG^a ACATG ACATTTTGCT AATTTGT AAGG AG AAAT AAGAGG CATTGAC 

6 0 TGCTCCTCCAGAGGTCCTGAGTTCAAATCCCAGCAACCA 

GTGTGTCTGAAAACAACTACAGTGTACTTACATGTAATAAATAAATAAATC^ 

CCCrCTAGAGCTCCTGCCATACCGCCCrGTGCATCGGGC^GCCITrGCAGAGGCCT 

CTCCCACGAGCCACGGAAACACATAGATGACCAAACCTGCCCTCGAAGCCATGCTAT^ 

CAGCCAGATGGTAGTAGTGCAAACCTTTGATCCTGGCACTCGGGAGGTGGAGGCAGGTAGATCTTTGTGAGTTCAAGC 

6 5 GTATACATAGTGTGTTTCAGGGCAGCCAGAACTATGTAGGGAGACCCT 

TGAATGAATAGGTATTTrTTAAGCCCCTAAGTGGGGTGCAGTATGGTTCATACCTCT 

G AAG AT CAGGAGTT CAAGG CT AGT CTT AG C CAT ATAGCAAAGTCAAGG C CAG C CTGAGCTAAGTG AGAC C CCATTG CAAAAAATAA 
AAC AAGG T AT AT AGGAGTGAAT AGAAATGG CCTGGGTGG GG CTGGAG AGATGG CT CAGTGG TTAAG AGCAGTG ACTGCT CTT CCAA 
AGGTCCTGAGTTCAATTCCCAGCAACCATTGTGGCTCACAACCATCrGGA^ 

7 0 TGAAGAC^CGACAGTGTACTCTATAAATAAAATAAGTCT^ 

TCCGAGCACTTGGGAGGCAGAGGCAGACAGGTTTCTGAGra 

TACACAGAGAAACCCTGTCTCAAAAAACCAAAATGAAAAAAAGAAAAGAAATGGCCTGG 

CTTGTTCCTTG TC CTCTGG CCTTGGUAA^'T GTGGACAC CAG ATTTT ACAT CT AC^ AT AAAAC AJXGG AAGTTCZAGAACATGGGTTG 
TCTCCAAATCCTGACTACACTGCCTGCTGTCATGTTACCCTGGGCATCACrATA 
7 5 GCTTTTCATTAATGTGACTGTTAAGAGGACATGTGAGCCT 
GCTAGGAGCAGGATCAGCTGCCACCTGCCCTGCTTCATGCCTGCTGGCCAAAGGTGCTCCCCT

CCCCCCTGAGAAGATGGGTC CAGGGCCACACTTTTGGCTTCAAGTG CCAGTGCT AGC CTCTTGCCTCTACCCTTGTCTCTGATGCC 
TGGCT CTGGG ACACTGCTGAACTG TGG C CTGGTGTG ACTGTCCC CGTGG ACTG CACATCTGTGGG CCCTGTGTTTC CAATTCAGTT 
CTCTTTAAGTAGCTGGCCACAGCAGTTACCAAGGGTCCATCAGTCTCACTTAC 

GGCTGGATTCAGAGATGACTGATAAGGATGTGTCCTTATCACAGGTACCTACTGTCTGGCT 

TGTGCCCTGTGGATTCCAGCTGTAGTGCTGTCTCACTTAC 

GGAAGCAACTGGAGAGGGCTTTTATGTGXnTTTCT 

CAGTCGAGCTGAGTGAGCCAGTGAGTTTTGGGGAGGTAGCT 
1 0 TCGGAGGCAG CTGCATCCATAT ATCTCCCTTCAC^GCrGGTAGGCAGCT CGGCAG^CAG AGTCTCCTCTCTACCGCAATTGTGTG 

CTGCTTATATAACCTTGGGGCAGCGGGCACTrGTGAATCTTAAACCTCGTC^ 

AGGCTCCCTCGGCGCTCCGGGAGGACGTGTTTCGATTGGGCAGACATAGAAACAGAGCTCT 

GGGCGTGCAGTGTGCTTAGCCCAGCTACGGAAGGGTGGCGTCAGATCCCAGGCCCT 

TGGGACAGGGGACACCCTGCrrTTTCCAAACAAGATTTGTGTCITAATGTC 
1 5 GCAGCTTGCAGGAGGAGAAAATTTCCTTCGGTTCACACTTCCAG^ 

TGCCCTGCCCTGCATGG(^TCACTC^CCCant3CTC^ 

AGCCTCGTGGCCATCAGGTATACAATATAAGGCTTGGGAAGAGCCAAGGTTTTGGCATTTC 
CAGAGAAGAGAGGAAACCCATCTATTACGTTTTCATATTTGTTATCGTTTTAT^ 

TGGATTATT AAG CTTGC C CG CTGGCTTCTGCTTAG CTG CT ATCTT ATAAAGTCAGGGGTGG C ATCACT CACACTGGGTTG CAC CCA 
2 0 CCCTAATTACTTAGCAAATAAGAAAATGGCCCACAGGCT 

GCAGGAGG CCAG C CTGG TCTACAG AGT AAGTTCCAGG ACAACCAGGGCT AGACAGAAAC CTT GTCTCAGAAAAG AAAAAAGAAAAA 

TGTGGCAAGGTGACAATGAAAACCCATCAGGACACTGTCATTTA^ 
GTCCTTCCCTCCATGCAGTGTTTGACTGTGCTGTCTTCCCAGGAGCTGGTGG^ 

2 5 CCTTACACTTGCCATCCGTCTCrrCATACACAGGGGAGGAAGTTTGTCTATGCAGGCTC 

GCATTCTGCGAAAAGCACGGTAGCTGTGAAGACTGTGTGTTAGCACG 
TGTTACCCTGCACCAGGAAGAGGCCTCCAGCAGGTATGCCrAAGGA 
AATGTGCGCCTGCTGTCTTCTAGGGGCTGGATTCAGGACATGAGCGGTGACACATCCT 
TTCCCACCCCTGGGCTGGCAAAGCTCCCGGGGCGTCAGTTCCATGGTTTTCCTGTGCGGTCTGTCT 

3 0 GAAGTCAAGTGTCACGCAGCAGACCTGTGACCTCTCCTTGTGGTGAACCTGTGATCTTT 

GCTCAAAGGCACACAGGGTGTATCGGTTCACTGGAAAGCACACCCT 
GTAAACAAAACTGGAGCCTCCTAGGAATGAGAGCGAAGGAGCCCTT 
GAACAGTGTGGGACAGTGTCATGCTTCAAGATGAAAATCCTACCCTAGAAAX5AAGTC 
GTAGGATCTGAGTTCGATCCCCAGAACCCATGTAACAGCTGGGCIATTGGTGGTTCATGCTTGTGATCCT 

3 5 GG CAGGTGG ATCCCTGGAGCTTGCTAGCCTCACACAGC CT AGCCG ACTTGGCAAGTACCAGACAAATGAGAGACTCC ATCACAG AA 

AAAGACAAAGAGGGTAACTGCTATCTTGAGGATGATACTCAGATT^ 
ACAG^CIACACACACACACACACACACIACAGAGCAAATGGCATC^ 
GTAGCITTCAGTCCCCTTCCAAAGATAGTTGTCTCGTTTTATATATC 
TTTCAGACAAAAGACAGAGTTGTGGTTAGTTCTGGTGTCCTCTTGGGGCAC^ 

4 0 GACOIGAGGTAGGCATAGAATCTTGCCAGAATTCTAGTCTGCCTTCCCAGCCCCGTACCCCCTTCT 

AAGAAGAGGGTCCCTGGTTGCTAGCTGGCCTGCTTTAGCCATTAGCAGAAGTATTTGTCA 
CTGAGCCGTGCCCAGCCCCTTCCTCAGCCTGCTGACCITTCTCCAGGGCC^ 

TC CTTCAGTGT ATGG AAGGGGG ACGGGGTGGGGGCTG CTCT AGGGTCATGGAT G TCTTT ACCTTGGTC CTG AAG ATTGCACCTGG C 
TCCAGTGGGTGGAAGTCTCATGCCGTCCTCTGGGTATT^ 

4 5 TGAGCACGTTTATTTTGATGTTCCAGATAAGAGTAAAGAAAGTTTCAACCAGCA 

AATGTTTCCAAAAGTCCAACCTAGCCCGGGTGGTATGGAAGTTCCAGAATGGCGAGTTGAA 
GTGGGCAGGAAGCACCTGCTCATCTTCAACCrcTCGGACGG 

TAAAACGGTCTCCCAGCTGCTGGCCAAGCACGTTCTGGAAGTGAAGATGGTACCTCGGACCCCCCC 
CTCAGACAGAAGGTAGTAAGATCACATCCAAAATGCCGGTTGC^ 

5 0 ACCTCCCCCAGAGCCGCCACCCTACCTCCCAAGTCCTCCTCC^CACATCCTGTGAACCA^ 

GCTCC^CTCAGAGAAGACGGTGTATCTCAAGTCCAGTGACAACCGCCTGCT 

TCTGC CTCTTTTCCT ACAACTGCTACAAGGGCTACCTG CCCGG ACAGTGCTTAAAATTCCGCTCAG CCCTGCTGCTTGGAAAGAAA 

ACACCOUGTC^GACrTCTCTGACCTGGAGCAGAGTGTGA^ 

CGACCCCAAGCCAGCCCTGGATACGGGCTATGAAACGGAGCA^ 

5 5 AACGGATCGATGAACTCTCTGCCCGGGACAAACCGTTTGATGTCAAGTGTGA^ 

TGAGGCCAGCGTGTCCCAGCCCATGCCCCTCTGTCTTC^TGGAGAGTGTTGTGTTG 
TGTGCCAGCCTCAGTCCTGTGTCCCTTTTrCTOT 
TTCCAGTCTCAAGTCCTGCAGTTGCTGGAGCGCTTAC^ 
TGCGAAGAGCCACCCCT^CCTCTTCCGATTCTCCTAGCAGCCACTGAGA 

6 0 TATCAGATTGGTAACTTACATCCTGCTGCCCAGATGGCACX3G 

CGCTCCTATTGTGTTGATGTCTTAGGTCATTTTCTTTT^ 
GAAGAGGAAC^TTTTCTAGATTTTTTTGTTCTTATATATT 

TTATTTTTGTTAACTTCATTCrTTCCTGCrrGTTAGGAACATAGGCCTAAAATTGTCT 
AGGGTTTTTTTGTTGTTGCTGTTATTGTTTCTAG 

6 5 TTAT ATTTTCCTG CCATACATCTACAAAGTGGGTTTTGAGTGAGGGCAGGTGGCCCAGTGG CTTTGGGTGGCGACTGAG CTGGTCC 

CACGAGGGGAGGAGGGTTTATATACCXCATGACCCTGCGGCTTCTrGGCGCCTCCTC 
GCTTCCATCTCTCATCACTGCCCTTGGACTTCCGCCTT^ 
TCGGATGGTGACCGCAACATCCCGTGTGGGCGGCCGGCCGCTTCCTGC^^ 
GGTGGTACGATTCTGGCATTTGTTTCTCGCTCACCGTGTG 

7 0 CCACCCCCACCCCGTGCCGTGTAGACACTCATCTTCTGCATGACATGATCrACCATT 

ACTTTGTTTTTATTTTTCTACTTGGAACTGTACACAT^ 

CTTTGCCTTAGTGTGGAGGCTGGGGGCGGTGGGGTGTCCTCCTGTCCCTCTGTGACG^ 

TGTCTGTTCTCAT CCCATGTGT C CGGTTTCTTG CTTT ACCTT CTC CGT CAT G GG AATGG TT CT G ACG ACTT CTTCCTCAG TG C CTT 
crri iGTGTCCTCTGTCrCACAGCCTC^ 
75 CTTCTCCCAGAAGCCCTCGGCCGGCCTTGGCCTCGGGGCCCAACAGCCGT 
CAGGCGCAGAAGGTGCAGGCGCTGGGGACCTTCTACCTCTTCrGCCAGGCCACAGGTGAGTT
TCCCCTCGCTCTGCCCTTGCTCCTAGCTGTGCCTGAGGTTCAGCTGTCTGTGCTGA 
TGTTGCTAGGGCTGCCGAAGGGGTCATCCAGCCTCTCXj^ 
CCAATGTTGTATGTCAGAAAACTGGACCTACAGGGCAG^ 
5 ATTCAAGG CCAACCC CAGGCAACHT ATG AG AAC CCTGT CTTGG AGTTTTAAAAT AG 

TGGAGACTATCACATAAGGACATGGACrCITGAAAGATGGTAGTTTACCCAA^ 
TCTTTCCCTGTGGGCTTTGGCCTCTGCACAGAAG 
GCTTTTTAATTGTCGAAGCTATGGAGATCTTGAAAGTGCTATGTG'r 
1 0 CTGCTGTGAATGCAACATGCCGTTGGCAGCTGGAGGCCCCGTGGAGGCTGTGG^ 
CCTGACATGTCATCAGTGAATACATGGGCCTTGATGGCCCTTTTGAff 
GACACCTGAGCTTCAGCCCCCCTCTCCAGGTCCCTGGGTCTCCTGGTGTGTG 
TGCTGTGGACCAAGCCTTCATGTCGACACCTGTTCT^ 

aU^TCTGTGAGCGTATAAAGGGGCAGAAGAGACGCCTTGAAGCCAGAGTCACAC^ 
1 5 GGACTGAACTTGGGTTCAACTTGGAACCTGCGAGAGAAGCAGGTGCTCTCAAC^ 

GTGAATGACTGGGGCCTTCCTTAAGGAAGATAAG^ 

GAATTCGAAGCCGGGAGTTGTTTGCTAAGGTGATGGGGAACCTAGGAGTGAGACAGGAAGGTGGGCTTCT 
GAAGCCACCAGCATGGGCAGAAGCAAAATAAAACCTCACTCACACTTGCCACK3CCT 
2 0 CTTGCCGGCAATGTT CCTAC CGCCG CACTCCTCAGG T AC CACGAGAGACAGGAGATGCGCriTl"lXlAA,LTTTTT AG TG ACCT AAGG 

GAACACTGGAAGCCAGGGACTCAGGAGCCATGGCTGGGGACTGTGGG^ 
AAGAGTGAGTTCCTCTCATGCCCCTCAGAGCGGACTGAGCC 

TGTAAGGACTGGGGTTCTCATCCCCAGACCCCCACCTAAGTGCTGGGTAGGCTTGGTG 
TGGTGATGGAATCCCC^GAGCATGCTAGCTGGCAGGATGAGCCATTTAGCAGTGCTCT 

2 5 AAGCCTTTGCATGCATCTACACACACACACACACACACACGCA 

AGTCTCTGrTCITGAGCAGACATGGGACCAAACCCTTACTCTCCCTTAC 
ACACCCGGGGCCTGTCTTTTCAGTCCTGTCCTCCACCAAGACAAGGCAC^ 
TGGGCTTTGGTGGTGTGAACATGCCAGGGCCTCTGCTATCTTTGAGC^ 
TGAAGCAGAGGCCATGTGGTTTTGAACTCAACTCCCTGTAACCTTTCT 

3 0 GCCAAAGCAGCTGTCCCAGGTGAGACCCCAGTTGGCTCTCCTA 

TTGGGAGCCTTGACCTGClSTTTTrcrCTCTCCTAAAACAGCT^ 
GCCTAGTGCGCTGTTTTTAGAACACAAGACTTAAAGAGAGGGTGAACAGGGACT 
CTCTGGCTCTGCTGGTGACTCTGAGCTATCTTGGGGGAGCCTCACCCCTCTAGAGGGTTGATTAGAG 
TTGTACTCAGGACTATTCTGCCCACTCCTGCTTCCACCCAGGACCTCCATGTCCTCTGGATTGGAT 

3 5 AGGCAATTCAGACGCCCTGTTCTGCATACCCAAGTGAGATGACCAGGGAAGACATGA 

GGTCAGGCTGCCTACAGCCTAAAGCTTGCAGGAACTCC^ 
NNNNNNNIWNNNNNNNNNNNNN^ 

TAGTTCCAGGAAAGG CAGCAGG ATCCGGCTCTGACTTTTCCTCCAGTCTTTGTG CCCGGGTCTGCAGCCCAGCA 
C^TACCCAGCCAGGTGGGTGGAGGGCTGGAGCCGCTGCACTGTCTGACCTTACACT 

4 0 GGGAGGTGCATGTTTATTCTGTCCTGCTGCCAGCTGCCAAAGGTTGGCATATGTACAGC^ 

GTGACATACACTACACCACACCACACATACCACATATACCAGGCACCACAC^ 
CAACACCACACATATAATTACCCACTGCACJVCACAACACATT^ 
CCACATCTTACATGCCACACACCATGCACC^CA^ 
CACATACACACTCCATACCATATATCATGCTACACAT 

4 5 AACCCCATGTGTAGCACACAGCCACACCACCACCACATCTCACCTATACCAAATACCT 

GC CCACACCG T AAAAGT AC CACC C AATG C CATGTGCG CACAAAC CCH" ACAAACTTAAGACACTT AC CT CAGGTG CTTT ACGA^ 
CATAAGCAAATTGCTTTTACCAAGCGGGGTTCCTAAGCAAGCAC 

ATGGATTCGGGGGGCCCTAAGAGAGCTGAGGAATTGGGGTAGTGGGGACAATGAAAGCTGCrrC 
GGGCAGCTTCAGGATCTGGGCTCAAAGAAACTTTTG^ 

5 0 TGGGAAAAGGGAGTCCCTGGTACCTCCCTTTCTCAAG 

GAGTGTCCTCAACCATTGGGG CACTTCCTCATTCTGGTTCTTCIAGGGGGTGGGTGTGTT AAAGAT CACATGACCCTT AT AGTGAG A 

TGCAGAACTGGGTGGGGTGTGCCTAGGGTTCCTGCTGCTCCAATTTAGGAATCCAT 

ATCAGCCTCATCCITGAGCAGTATCTTTAGTCCCC^ 

AGTCCTAAGAAGCCACTGATGGCTACATCGTCATCTCTCAAGACITCCAGGTGCAGTCCCGATTACCT 

5 5 GGGCAGAGACAGAGAAAGGGCCTCAAGCTTAGCCTCCCCTCACAGGTAACAGACATC 

CAGGGCCAAATAAGATGCCATCTGAACTAGCAGCTGTCT 
ACGGTATGAGAGAAGTTTCTAGAAGTTGAGGGAAGCTTTGAAAGAGCCCT 

CCATGGGGAT AGGTGGGTCACAATCCTTCCCAGGAGG CTTAGTAGTCACTAACTTGATGT CAGGAAGAGCCCAGCCCGTGTGCATG 
ACTAAATACCTGACAAAAGCGAGTCAAGGGGAGGTTTTAATTAT 

6 0 ACCTAT AGTCAGGAAGCAGTGTTGCTCAGTTTGTGTTTCAAC ATCCCCTCCTCTGG^ CATCACTGTGTAGCCTTGG CTGGCCTG 

GAATTTTCTAGACTACCACGTCTTCCAGTTCCC^GAAATCCrCCTGACT 
CCACCATGGCCAACCTCACAGGCATGCCTAGAGGCACATCT 

CATGAAGGTATTTGAGGCAGGTGGCACAAGACAGAAACAACTGTTAGCTTCAGTGAACC 
CTGGGAAACCTGTGGATGGGACGTGCCTGGGCAACTGTGATGCATAGGTCCAGTTCT 

6 5 GTACTGAGGGGGTGGCACCGTCCACTGCTTTTAATGCTCCT 

AAGAACCAACAGAGAGTAAGTACCTCTTCACCTCrCTCATGTTT^ 

GGTTTGTGGCCAATGTGACCTACATGTTTTCA^ 

TTCATGTTCATTCCrTGGTTGTAATACCTCA^ 

TAAAACAATATGGTTCCATGAAAGGCACTGAGTTGCAAAGGAAATGOT 

7 0 AGCTCTGAATCCAAATCCACAAAGCTGCTCCTGTCCTGGATCTACACAGGCTCTC 

CTGTTTAAGTAGTGTCGGCATAGTATTAAGTACTGT 
TTCrGCAAACATGTGATTTGATGTCCTCAGGGATC 

TCTGTATACAGATGTGTGTGCGTGCATGTGGATGCGTTCATATACACCTAAATACA 
CTTGAGTAGaAAGCAAATTTAAGTCCGC^CGGCTTGTCCACT 
7 5 GACTGGCATGTTCCrGTCrTATGTATGTTTGTTTACTTTTTAATTT^ 
TCTCTGGCTAGCCTGGAGTTATGAGACTTACTCTATAGCTCAGGTTGGCCTTGAACTTATGGTGGTTAAC
CTGAGTACCGGGACTACAGGTGTGCTTCATCGTGCCCACTCAATTCT 
TTATATTTGTTATTCATAAGCTCTTGGAAATTTATTTTTAAAAATTCAA^ 
TTCTTGGAAAGGACATGGGTTTGATTCCCAGCACCCGAATTC 
5 GCCTCCTCATGCAGGTGGTGCATTGGTAACAAGCCAGCAAAACACCCATAC^CATACAATAA 

TTTTAGTGGACAAAGCACATGCTGTGCGGCATGTACGTAATGCCTGGCAGGCAGTTGCAGTAGC 

CAAGGTXX3GAAAGGGAGATCCTTGGAGCATGCTGGCTAGCCAGT 

TCAATATATATATATATACAOiGTGCCAAGCAATA^^ 

CATTCTGTT AACC CTCTGG AGC CT CACCAAGCACTG T ATG ACCT ACATGTTTTT CAGTTTG C 1 1 1 n'T O T J AAATCAGTTTCTCTG 
1 0 AGAGGGTTCTCCTGACTGTCAGTCACTCTAGTGGCTT 

TTTCCTGATTGTTACTAACAGAAACAAGCCTGTGGCCCTCC^ 

ATTTCTACnTGATCCAGTGTGAGAGGAAAGCCTGGGACAACAGCACAGGGGGTCATTC 
GGGTTGGTGAAGTGAGGAGACTCCAGGCCCAAAGTTATAAACTAAAAATCTCTTCAAAAGTCCAGTC^ 
TGGTAGAATGGTCACTTAGTATGCATGAAGTCCTAGGTTAGATCCTCAGTACTCCATCAACrTGTTCAGGGGTC 
1 5 CCCAGCACGTGGGAGGTGAAGGCAGGAGGATCACAAGCGCAAGGTCATCTTTGGCT 

CTGAGGTCCTGTCTTAGAAAAGAAAAAATGCAACCTGTCACCTGTAACTGAGTAGTGCTAACT 

CCAGGGGAAGTTCTCCCACCCTTGGCCACCCACACACCCTGGCATC^ 

TTCTGCCTTGGTGGTTGTTATTTGTTG1TATTTITTGTTGTTTTTC 

TTTCTCCCCGTTTGAATGTCTCCCATCAAGCGTGCGTGTGTGCGTGTGTGTGTGTGTGTAGATGGCATCT 
2 0 GGCTGGCTGGTCCnTCCTGTCCAGCTCAGCTTGGTACATCACATACCT 

GTTTAAACCATTGGGCCATCTTCAG CCAAGCCCCAGGTAGGGCCAGGGAGTGAACCATTTATG AGTGTAG CTGCTTTGTTTTGCCT 
TCTCCTTGGTGGATGGAGGGCAGCCTGCCCTTGGATGGCCTGAAGCTCTGGGGATCTAGGGTGAAGTCTGAGA^ 
TACCCATAAGTCAGCACTAACAGGGAAGTGACCTGTTGCCATCACAGC 

MOUSE SEQUENCE - mRNA 

GAATTCGGCACGAGGCCATCCATGTGTGCCCGTTGCTGAAGGCCTCGGTGGCCCCTGCCCATGAGGATGTGTGCCCCCGTTAGGGG 
GCTGTTCTTGGCCCTGGTGGTAGTGTTGAGAACCGCX3GTGG 
GTCTGGTGCAGTTTCACAAGCCAGGCATCTTTAACrACTCGGCCTTGCT 

CGGGAAGCAGTCTTTG CAGTG AATGCGCTGAACATCTCTGAG AAG CAACATGAGGTATATTGGAAGGTCTCTGAAGACAAAAAATC 
CAAGTGTGCAGAGAAGGGGAAATCAAAGCAGACGGAATGCCTAAACTACATTCGAGTACTACAGCCACTAAGC^ 
ATGTGTGTGGGACCAATGCGTTCCAGCCCACCTGTGACCACCTGAACTTGACATCCTTCAAGTT^ 
AAAGGAAGATGCCCCTTCGACCCCGCCCACAGCTACACATCAGTCATGGTTGGGGGCGAGCTCT 
CTTGGGCAGTGAACCCATCATCTCTCGAAACTCTTCCCACAGTCCCTTGAGGACGGAGTATGCCATCC 
GCTTCGTCTTTGCTGACGTGATCCAGAAAAGCCCAGATGGTCCGGAGGGTGAAGATGACAAGGTCT 

TCCGTGGAGTACGAATTCGTCTTCAAGTTGATGATCCCGCGAGTTGCCAGGGTGTGCAAGGGCGACCAGGGCGGCCTGCGGACT^ 
GCAAAAAAAGTGGACCTCCTTCCTAAAGGCCAGGCTGATCTGCTCCAAGCCAGACAGTGGCCT 

GTGTGCGCCTACACACTGGCCACGGTGGAGG CAGTCTTCTCCCGTGGAAAGTACATGCAG AGTGCCACAGTGGAG CAGTCTCACAC 
CAAGTGGGTGCGCT ACAATGGCCCAGTGCCCACTCCCCGACCTGG AGCGTGTATCGACAGTG AGGCCCGGG CAG C CAACT ACACCA 
GCTCCTTGAATCTCCCAGACAAAACACTGCAGTTTGTAAAAGACCACCCTTTGATGGATGACTCA 

CCCAAGCTGATCAAAAAAGATGTAAACTACACCCAGATAGTGGTAGACAGGACCCAGGCCCTGGATGGGACTTTCTACGA 
GTTCATCAG CACAG ACCG GGG AG CT CTG CAT AAAGCAGT CAT CCTT ACAAAAG AGG TGC ATGT CAT CG AGG AG AC C CAACTCTT CC 
GGGACTTTGAACCGGTCCTAACTCTGCTGCTATCGTCAAAGAAGGGGAGGAAGTTTGTCTA^ 
CAAGCGCCCCTGGCATTCTGCGAAAAGCACGGTAGCTGTGAAGACTGTGTGTTAGCACGGGACCCCTACTGTG 
CATCAAGG CCTGTGTT ACCCTGCACCAGGAAGAGGCCTCCAGCAGGGGCTGGATTCA CATG CC 

TGGAT AAG AGT AAAG AAAGTTTCAAC CAGCATTTTTT CAAGCACGG CGG CACAG CGGAACTCAAATGTTTCCAAAAGT CCAAC CTA 
G CCCGGGTGGTATGGAAGTTCCAGAATGGCG AGTTGAAGGCCGCAAGTCCCAA CACCTG CTCAT 

(HTCAACCTGTCGGACGGAGAC!AGCGGCGTGTACCAGTGCCTGTCAGAGGAAAGGGTGA 
CCAAGCACGTTCTGGAAGTGAAGATGGTACCTCGGACCCCCCCCTCACCTACCTCAGAGGATGTT 
ACATCCAAAATGCCG^TTGGATCTACCCAGGGGTCCTCTCCCCCTACCCCGGCTCTGTGG^ 
ACCTCCCAAGTCCTCCTCCGGCACATCCTGTGAACCAAAGATGGTCATCAACAC^ 

AT CT CAAGTC CAGTG ACAAC CGC CTGCT CATGTCTCT CCTCCTCTT C ATc r 1TGTC CTCTTCCT CTG CCT CT^ C 
T ACAAGGG CTACCTGCCCGGACAGTGCTTAAAATTCCGCTCAGCCCTG CTGCTTGG AAAG AAAACACCCAAGTCAGACTTCTCTGA 
CCTGGAGCAC^GTGTGAAGGAGACACTGGTCGAGCCTGGGAGCTTCTCCCAGCAGAACG^ 
CX5GGCTATGAAACGGAGCAGGACACCATCACCAGCAAAGTCCCCACGGATCCT 
CGGGACAAACCGTTTGATGTCAAGTGTGAACTGAAGTTTGCAGATTCGG 

TGCCCCTCTGTCTTCGTGGAGAGTGTTGTGTTGAGCCCATTCAGTAGCCGAGTCTTGTCACTCTGTC 
CCCTTTTTCT CTGGTTT 

MOUSE SEQUENCE - CODING

ATGAGGATGTGTGCCCCCGTTAGGGGGCTGTTCTTGGCCCTGGTGCTAGTGTTGA 
GCTCACCTC^GAACATGGAGAGGTAGGTCTGGTGCAGTTTC^CAAGCCAGGCATCT 

ACAAAGACACTCTGTATGTAGGCGC CCGGG AAGCAGTCTTTG CAGTG AATG CGCTGAACATCTCTGAGAAGCAACATGAGGTATAT 
TGGAAGGT CT CTG AAGACAAAAAAT CCAAGTGTG CAG AGAAG GGG AAATCAAAG CAG ACGGAATGC CTAAACTACATT CG AGT ACT 

ACAGCCACTAAGCAGCACTTCCCTCTATGTGTGTGGGACCAATGCGTTCCAGCCCACCTC

AGTTTCTGGGGAAAAGTGAAGATGGCAAAGGAAGATGCCCCTTCG 
CTCTACTCTGGGACGTCCTATAATTTCTTGGGCAGTGAACCCATCATCrCT 
TG CCATCCCGTGGCTGAACG AGCCTAGCTTCGTCTTTCCTG ACGTGATC 
AGGTCTACTTCTTTTTTACGGAGGTATCCGTGGAGTACGAATTCGTCTTCAAGTTGA 

GG CGACCAGX3G CGGCCTGCGGACTTTGCAAAAAAAGTGG ACCT CCTTCCTAAAGGCCAGGCTG ATCTGCTCCAAGCCAGACAGTGG

CCTC^TCTTCAACATACTTCAGGATGTGTTTGTGCTGAGGGCC^ 

AG CTGAACAATGTGGGT CTGTCAG CGGTGTGCG C CT ACACACTGG CCACGGTGG AGG CAGT CTTCT CCCGTGGAAAGTACATGCAG 
AGTGCC&Jl&GTGGAGCAGTCTCACACCAAGTGGGTGCGCT 
TGAGGCCCGGGCIAGCCAACTACACCAGCTCCTTGAATCTCCCAGAC^ 
ACTCAGTGACCCCGATAGACAACAGACCCAAGCTGATCAAAAAAGATGTAAACT
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CTGGATGGGACTTTCTACGACGTCATGTTCATCA^ 

TGTCATCGAGGAGACCCAACTCTTCCGGGACTTTGAACCGGTCCTAACTCTGCTGCTATC 
ATGCAGGCTCCAACTCTGGAGTGGTCCAAGCGCCCCTrGGCATTCT^ 
GACCCCTACTGTGCCTGGAGCCXAGCCATCAAGGCCTGTGTTACCCrGCACCAGG^ 
CATGAGCGGTGACACATCCTCATGCCTGGATAAGAGTAAAGAAAGTTTCAAC^

TCAAATGTTTCCAAAAGTCCAACCTAGCCCGGCTGGTATGGAAGTTCCAGAATGGCG 
TTTGTGGGCAGGAAGCACCTGCTCATCTTCAACCTGT^ 
GAATAAAACGGTCTCCCAGCTGCTGGCCAAGCACGTTCTCGAAGTGAW 
ATGTTCAGACAGAAGGTAGTAAGATCACATCCAAAATGCCGGTTGGATCTACCCAGGGGTCCT 
GCAACCTCCCCCAGAGCCGCCACCCTACCTCCCAAGTCCTCCTCCGGCACATCCrrGTGAACCA

CCAGCTCCACTCAGAGAAGACGGTGTATCTCAAGTCCAGTGACAACCGCCTGCTCATGTCTCTCCT 

TCCTCTGCCTCTTTTCCTACAACTGCTACAAGGGCTACCTGCCCGGACAGTGCTTAAAATTC 

AAAACACCCAAGTCAGACTTCTCTGACCTGGAGCAGAGTCT 

CGACCACCCCAAGCCAGCCCTGGATACGGGCTATGAAACGGAGCAGGACACCATCACCAGCA 
CGCAACGGATCG ATGAACTCTCTG CCCGGGACAAACCGTTTGATGTCAAGTGTGAACTG AAG CTG ACGGG

GACTGA 



HUMAN SEQUENCE - GENOMIC 

CCTCACCCAGTCACCTCCGAAAGGCTCCCTCTCCAGAAAC 
CCAGACATTCAGACGATAGTATCACCITTGAAAGACATGACACC^

AGGGGCTGGCCAGCCACTTCCCAGAACAACCCCACACTGTGCCTGGGAGCCAGT^ 
GTGGCAGCAGGTCAACATTAGGCCAGTGGAAGATGATTATTTTTGCTAGCACT 

CACACACCCTCCAGAAAAGACAGTATTGTCTGTATCCAAGTCTGAAGAATAGAGTGATGTCACACAAA 
CTCTACCATGCrGGCTGCTGCITAGAGGGGTCCCCACTCTTCCTCTGAA 

TAGCXSCTTGTCCTGCCGACATTGCrGATAGATGGATGCCTGTC

ATTCTGTACATGCrTGCACCCAGGGGTTTGGGAAGTGGGTGAGGGTTAGAGCATGTCTCTG 
CCCTGCAGGGATGTGGCTTGAGGGTGAAGTAGACTGGTCCTGGCAGCTGCCCTG 
CTAGGCATGTCCCTGCAGAGCTGGAGTGTTGGTCTCCAGGGATCCATTTTAAITCTAG^ 
TTATTATGCTTGTGACCTAGACAGCTCCTGAAACCCATTCCT^ 

AGTGACATCAGCATCCTAACTGGAGGTGGGAGAGGGGTTT

GGTGAGTGCX5TGTCCTCCACTGTCCTGGTTGTAGGGCCTGGAGAGACTCTAAGCTTTACCTGCCCCATCT 

TAAGAAATAAGCCAGAACACGCTCCCCCAGGCAGTAGTTTGCATGCAGTGAGGTCAGGGGTCCAGCT 

TTGGCCAGTCCACAGCTGTCACCAGAGATCCCCTTTCAGTTTCCCTGGGATGCCAGCCAGGGCATT^ 

TGTTGCTGGCAGAAGCACTGACACCTTCCATGTGGTAGTGACTGGTGTCCCCTCTGTAATCTACCCT

CCAGGAAGGAGCCGAGACACGGGTGACTGGACAGTCCCCAGATGGCATTTTGGTTCATGGTTCCTGCCTTAC^ 
CAGTCACTATTCACATCTAAGATGCAACACTAGATCAGAATGTATGTTCTG 

CACATGCACACACAGATGCGCACATACATGTCTGCGTACACATACACAGATGCGCACACAGATGCATGCACGTATGTGCACA 
GCTGTGCACACATACATGTATATGCATACACAGATACATATG 
CATTATACACACACATACACAGATGCCCATGCATACACATATGTATACACATGCAGAGATCCAAGCAGATGC^
AGGTGTACATATGTATGCACACAGACATACACATGCATGTGCAGACATGC^^ 

TG T ACAAATG T ACAAACATGGAT AGGTGCACAT ATCT ATG CGGGTTCATG TG CACACAT ACAGAAAT ACAGATGCACATG CGTG CA 

CATACACAGATGCATACACACAGATGTACATGTGTCCACATATATATATACATAC^ 

TACTCCCATTTTCTCTGCTGTGATCTCCTGCCrrGATTTGAAGACCCCTC 

GCAAAAGAGTGTGGGG CTCAGAG CCCCCTTGGAGTTCCATTCTGCTGACATTT AAAGTGGG CT CGTGCACCCTCATGAACAGCCC C

AATTACGTGTAAGTAATGAGAGTGACATTTCACAACCCTGTCI^TCTAATGGGCTC 

GGG CAGCAATTT CACACTGTCAGG G CAAAATG CACATG C AT AC AGC TCAG CTCCCAG CGGG AT ACCTTTG CTTCTCAGAAATGG CG 
AAG AAAGCCAGCAAAATGGCCTTTTT CTC CTTTTGTTTT AATTT AACT ATTT AGAG TG AGG TC CTTAATG TAACACCCATCAATGA 
TTATGGAGCC^CTCTTCTGAAAGAACrCGCAGGGCCGGCAGATCTGTGTGAGTTGTGGTG 

ATTATATGAAACAATGCATTCTTGTTGAAAAAGGAAAAAAA

CTGGGAGGAGCAGTATGTCGCATGGX3GTCTGTCCTGACCTTTG 

CAGGGCCTGATTTATCCCATCGAGAATCTGGTGTCTGCTGATGAGATCAATACACT 

GCCTCGTGGTCAATCATTGTTCTCAGGGCIAGGGCCTGGAAGCC^ 

CTGG ATTGGT CCAAGGT AAAAG CAATT AAG AAAAAGGTTTCTTTTCAAC^ CAG TTTG AG T AAC CGCAT AAACAG CACCAGTC 

CTGACAGTCCTTCGTAACCATTTAAAGCCACACAGGAGAACCTGCT

CTTCACTGGCGATTTTGTTTTGTGGAACTTTCTACTCTTGGCTCCT 

GGGATGTTTGTGTTTCTCCTGTTCTACCTCAACTAAGTCAGGGAGGCGTT^ 

CACAGAGGGGGATGGAGTGTGGGGGTCAGGGGAGGCAAAGGGTGACTGT 

ACAGTCTGGAAGAGGGAGCAGCAAGTGCCAAGGCTGCAAGGCAGCCTGAGAC

GAGTGGAAGAGGCTGGCCACATTAGGTGCTGGCATTCTGCATTA 
ATCCTATTTTTACAGCGTGTTTGAGCCCCCAGATCTGGCTGCCCTAGGGACCCAGGCT 
CClTGGGAGCCrCCCrrrCAGGGGGTCGTGTCITCTGCTTCTGGATCC^ 
GGATGTCTTTTTAAGGTTAATTCATTTTATGAGAGTAATTAATGCAGGTAAT^ 

AGGAAGCAGCACTTGCTTGTCrTGTCCCITTGTCCCT

ATTTTCCITCAAATTAGATATGCCTACATTTCCTGACTTTC 

TGACTCrrGTCCCCTCCCTTTCCCTCTCTTTCTTCCCCCAGACCTCCAGTGGAGTTA^ 
ACTXKK5CAGTGATAAGTACACTCTGGTTACATTTCCATTCATGGGTCAC 

CTGTGT ATCTGTT CT AATTCAT CITCAC^CTCTTTG C CAGAG C CTT AAAATGCATTTCTG AGAACAAAG CCACAG ATG CAAT CAGT 
AATCC^CTGTGCCTTCTCCTGCCCCGCCCCATTCGAGCCTCTGCCTtSCrCCC
CTTCCTAAGAAAGAAGACCTGGGAAGCAGCTCCCTTGTT1TGGTGGAGCAAAT 
AGGTAAAACATTTGAGAACTTGTAGGTTTGAAGCAGTGTTCTGTACT 

CT AGG TATGAAGTCaT CTT CC CTGGGATTCAAAGG CGTTG CTTTG CTG CCT G CAGTGGGG CACTCAGAGG TAATTTCTTTGTGG CC 
TGTTTCTTTCTGGAAGCCTCATCTAACCCTCCTTTCAGGATGATGTGTCTGGTG 

GGCGCTX3TTATTCCGAAAGCTCAG<n , CCT / rC J lt31TCTAGTTAi-i TTCTTTTTATTACTCCTTTAATAATTCCTCATTTTTTTTCCC










GGAGCTCCTGCCAGTTCATGTGGGACTCCTGAATGGATCGTCCAGTTGTCTGTTCCCCCTTTTTrCA 
TTCTAGGAAACCTCTTTAGTTTACTTTCCAGCCTTTCATTTCTTT 

GTGTTTCTTTGTGACTTCTTCCAGG CCCTGGCTCAACTCTGGTCT CTCTGGTGGTGTGTGACAGGGTT 
GGTGGTAAACACAGCCACTGTTAAAAATTAAATATGTGGACTTAAAATGA 



GTGCTGGGTGCTGCTGTGCACCTCTCCC CAGTTCAGCGGCATCATTCTGGGAG CTCGAAAG CAGCATTTACACCACAGAAATCAGC
GTGTCCTGCAAGCTGGGCACCCCCCACCTTACCCC^ 



TTAGAGTCATTTCATTGGGTTTCCAGAGAGAGTTGAAATAAATCTCTGTGTTCT 
CTGACCCCAGCATGCTTATTGGGATACGCAGAGGACTGATTGCAGTTGGGG 

TCTCTCCAACAGTGATCCAGCATCTTTGGTATrTGAC CCGAAATGAACTGGGCCTTGT^ CGTCTCTG CTCCTAGCCCTC 

CGGTGTGGGCACTTACITCCTTAACGTGTC 

TGTTCCTTGTTGTTACTGATTTTAATAGACTTTATTTTTTA^ 

TTTCCCATATGCTCCTTCCCCACACATGCACAGCCCCCTCATTTTCAGAATCC 

AACCCACATAGGTACCTTATCACCCGACATCTGTATCTGGACATTAGGATTGCCAGTTC 

AATGCAT ACTG ACATGTGTCCAC CGTGGTAGT ATC CCTG C CAG CCT AGTTTC CCTGC CCT AAAAACCCT CTCTCTC CCCAG C C C CT 
GGGAACCACTGATCTTTTTACTGTCTTCATGGTTTTGCCTTTCCC^ 

AGATTGGCTTCTTTCTCTTTAATGATAAGCITTTAAAGTTCCTCT ll'l'l'AATGCT 

GAGTAACGTTCCCCTGTCTGGATGTAGCGGTTTGTATACCCACTCGCCTGCCCCTGGGCTGTCT 

TTGGCACGCCTTACCTCTGCAGGCTCCGCATCC 

GTGTATGG ACCTTTG CC CAGCTGGGGG ACATG C CACG AGAG AGGTTGTTT CTG CTTT ATC CTTG CTGTGCACAG CTTGG CACTGG C 
TGGGACCTGGGTCTGGGTGTGGCTTGGAGGAGCCAATGTCAGTTCATTGGGTT^ 

GTGGCGCGGGAGCCAGTGTGAAGTCTCTCACAATGCGAGTGTGCAGTCCTCTrC^ 

GTCACTTCTGGCTCT CACTGG CTGTATGATGTTTGTCACATGGA^ 

CTCCAAAAGGAGGGGTAGCTGGTTTTTAATGGGAGCTTTTCTCTTC 

CTTATGTCTGTCATTGCATGCAATGAGGACTGAGAAGAGCAAGGGCAGCAGGACCCrG 

GGCCCIGGGCIXSGTrGCTTCCACGQVGCCCCACTCCAGGGCGTGGGT^ 

AGCTCCATTCGTATTTGTOVGAGC^TAAACATTATT 

AATTCTCACAACTTTGTAGGCCAAGTAGGTCAACTTCTGTTTTGCAA^ 

GCCTGGAATTCAGAAGATCTGAACCTATAATTTATTGAGTCTTCTC 

TCAAAGTGGAGGAGCITTTGGGAGCCACTCCTAATTACCAA^ 

GTGGGATTTAGACCGGCAACTTCIAGAAGAGACA^ 

AGCCATGCAGAGGGGCCGAGCCCTGTGTGTTCAGGTGTAGCCAGGACAGTGCAGTGGAGGGAGTGTGGATGAGGA 

GAGGTAGAGTAGGGTTGTCCATGGCCTTGCTGGCCATGATGAAAATGTGGGCCTTATTCTGAGTC 

TCAC^TGGGGAAAGGATATGATCTGGATTTATGATAAGAGCATGACTTTGGCCACTGCTA 

TTGTGTCAGTTTGGAATGGTATTTGTTGGT AACZAAAAATCTGCAAG CAAGTAATTTAAM ATTTTAAT AATTTT ATTAGCAAGT CT 
GC^TAGACTCATAGCCTAGGGTCTCATAGGCTCTTGTGGGTTTC^ 

AGGATGGCTGCCAGAGCTCCAGCCATCACATTCT AGTTCCAGGCAG CAG CAGGAAGAG AAGGACTG AGGG AATTG CTAGCTGAATC 

AGCCTCCTTTAAAGAGCTTT'CCTGGAATTGCCATCTAATAAATTCCACATAC^ 

GGGAGATATAGTGAGCTGGGCACACTGCCAACCCAAGTAACCTCGGAATTT^ 

AGGAAGCCAGTGATCTCTACCATGGAAGCAAGAGGCCAGGATGAGAG^ 

CAGGAGGAGACAAGCCCCAGATGTGTTTGAGGCTGAGCAGCAGCACCA^ 

GGGGTCGG CCCCTGACTGCTGGTG CTTTTTTITAGGTGGGGACATGG ACTGTAGGAATTGGCTGG AAG CAG AGTGAGGTTCTGTTT 
CGGCTGTCTCTGGCTGGGTGTGGAAGGCGGCIATCTGGAGGTGGTAGGGAGGAG 

TTGTTCCIATGACCCT CACAGGACTCCTGG ATAGAAGGGGGAG CAAGGGACrCAGGGATCCCAGGTTGCTGTTTTGAGTGAACTGTT 

TACAGAAGTCTAATGTGAAAGTGCCCAATCACTGGGIGCCCAGCTCAGG^ 

CCAGATCAAGGTGCTAGACGCCCTTGGGCCAGTCCTGTGGCGGGT^ 

ATCCAGTCATCAGAGAATGTTGGGGAClATGAGAGATGGGGCCAGGAGGCCrGTGCATGCCT 
ACTAAGCAGCATTAAGCCTGTTCITTTACrrAAGGGCTGT^ 

TAGACTTCTTT AAGTG AG ACTGGAAGAGCT AG CAGGGTCCCATTTGGGGCAGGAGGTCAGACACTCCTGCGTGGTGAGG CCTCACG 

TGCGTGTGTTTCTACAGGTGTACATCCGTGC^TGTGTGTGTGTACATATGTGTGCGCATATGTTGTGTGTGTGCA 

TCACATGTACATCTTAGCTCTrCTCTGCAGCCTCTGGTTTT^ 

GAAACTGAAACCCGAGTTCTCTTCCTTCATTTTCATCT 

AGCCGGCCATGTGTGTCTGCCGGGTGCCCCACACCAGGTCCCCTCCACGTGTCTGTCCTC 
GGGAAGCACTGACTCCTGAAGACCCTGGCTTCGCTTTCCTGTGTCCGTTGG^ 
GCCTTCCGTATTCCTCAGCAGATGGACCCGGGTGAGGATCCAGGGCAAGCAG^ 
TCACCATGGGTCCTGCCITTTCTTTCTCATGTCCT^ 

GCGGTGGTGAGAGCGACCCCTCCTCACCTGGAGAACTGGGAAATGTGGATTCT 
GCTGCTGGCCCTGGTCCTGGGGCGCTGAGCCGC^TCT 

AAGCCCCTGGGGTCGCCT CTACCTG ATGAGGATGTGCACCCCCATTAGGGGG CTGCTCATGG CCCTTGCAGTG ATGTTTGGGACAG 

CG ATGG C^TTTGCACCCATACCCCGG ATCACCTGGG AGCACAGAGGTGAGTGACG AAAC CTG CAGGGGTGCTG ATGGCCTGGG ATG 

GGGACCTCACTGCCTGGGATGCTGAGGTGGGTGGCTGGGGACGGGA 

GAACGGGACAGGGACTCC^CriTCCrGAGATGCreAGGTGAGTGGCTGAG 

GGGTGGCTGGGGATGGGATGGGGACCCCACTTCCTGGAATGCIGAGGCGGGTC 

ATGCTGAGGCGGGTGGCTGAGGACGGGATGGGGACCCCACTC 

ACTTCCTGGGATGCTGAGGTGGGTGGCIGGCX^TGGGATGGGGACCCCAATTCCTGGGATGCT 

TGGGGACTCCACTTCCrGGGATGCTGAGGCGGGTGGCrGAGGATGGGATGGGGACCCCACT^ 

GAGAACGGGACAGGGACCCCACTTCCTGGGATGCTGAGGTGGGTGGCTGAGAATC 

GTGGGTCGCIGGGGACGGGATGGGGACCCCACTTCCTGGGATGCTGAGGTC 

GGATGCTGAGGCGGGTGGCTGGGGACGGGATGGGGACrCCACTTCCTGGGATGCT 

TCTGTTGGATTCTCTGCTCTGAGAACTTTTGTCTGAGCAG^ 

ATACTCAGCAAGGCTCGTGCAGACCTCCI^TTTGGTTTCTGAATCT 

TAATGTTCGCGGCGTGCTGCCAAAGCACCCGCCCCACTGAGCGTCTGCTCCAGGCGC 




'CTTGTATCTGAGATGGCAGTGCCAT 
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GGATGTGTCTGGGGACCCGGGTTTCTGACTCAGTGGATTG 

TG AGAGCTGTCAGCAATAT AGGCCATTGTTGTGCTGAGAGAGJU3GACCTG CTGCCTGTGTG CATCTCrCTTCCACATGGGAAAATT 
AG AACAGAAAGT ATGAAC ACCTG ACT CT C 1 1 1G1 1'1'riTCTC^AAGTCAGGGTCTGGCTCTGTTGCCCAGGCTAGAGTGCAGTGGG 
ACAATCIATAGCTTACTGCAGCCTCAATCTCCTG<1ACACA 
CCGCCACCACACCTGGCCC^CCTGACTCTTAAATAAAGCAGACT

TTGTCAGGACAGTTG AAGAAACAG GT CCTAAAAAAAAT AATGCTGAT ACATTT A CAGCT ATTG CTCAAATCTT 

CATTTTCCTAAGTAGCTATTGATACAGTAACCTAATAAGCAAAGAGACGTCAC^ 

CACTGAGAATTGCATTCGTTGATGGATATATTCTCCAAAAGAAATTTTGTGAAG^ 

GAAAAT CCTAAGT CATTGT ATTCCTGTACACCT AATCTGTGGATGACAAGGCATG TAG ACCAGAG TTGGGTGGG CAG AGGG TGGG A 
GCCATCACAGGAAGCATCTGGCAGAGCAGGGTTTGGGATTCTG

AG TG AGCACACTGGCTCCGGG CTTG AGCTCT AGGG CTGTG AGTGGTT C CCATGAG CACAC CTGATGGCTACACTGTGGG T CTGGCG 

TTGACTGAGGCCACCTCCACTGTAGCAGTGCCTGTTGCAGCAAC^ 

TCACTCAGACAGGGACCTAGGATGTGCTCAGGTTTAAAAATGCTTCCATCG 

ATCCCACAGAGGTGCACCTGGTCCAGTTTCATCAGCCAGACATCT 
TTGTACATAGGTGCCCGGGAGGCGGTCTTCACTGTGAACGC

CTGCATGT CCAC ATAG CCCAG CTTG AG TGGG CAGC CTCTCCTGG CCCCACTGTGGGGCTGGGGGCTGACTT AC CTGG AG CCTCCAT 

TCTX3TAGCTGTGACCATGCAGCCCCATCTTTGGGTGTCCAC 

ACTTTTTATTATGAAAAAATCTCAAACATACCTATGAGCAGAGACAGCTA 

GCCGGGTGTTCAGTCTTGCCTACAGTACCCTCCACCTCCTGGGGCATTGAGGCAGATCC 
TCTTACTGGAAAGGGCCCCTTGT CGTGCAAATAATCAAAAATAGCTTCTCCAAAATTTATATCATCAT ACATTCAG CCAGTGTTCA

TATTTCACCGACTCTTGTGGATGATGCTTGTTTAGTTGGT^ 

ATTGCAGCTGGTCACCCTGTCCATTCAGTCTCCGCTTCTAGGAGCCAGCCTC^ 

CATCCCATGTCCCACTCTT'CACATTGCTCCTGATTCCTCX^TGTC^ 

TGTAGCTCACTGCAGCCTCAAACTCCTGGGCTCAAGTGATCCTCTTGCCTCAGCCT 

ACCACACCCAGCCCCACTCATGGTGTCTTTAGCATGCACCTCTGTCTI^

CTCAGGGCTGTGAGGCGGGCTTCTCCATGGAGGCAGGTGTTCCTCCTCCTCAGTCT 
CTCTTCTCATCAAAGATTGCAAAACGCAATGTTCTGATTCTACCT 

AGTGCCCATCACCAGCTATTTTGGTCCGTAGAGGTCCCATTTCCTTTGGGAAATGCAGAATCGATG 
TTACCAGTTTTCAGAATAATGAGTTAATGAGTCAGTTTCCCCT 

AATGAAGTCATAGATTTAAAGATATTTGCTCTTCTTTGACCCATCACAGT

GGTCAACAGCAGCCTGTTTGGGTTGGCrCCTGAGACCrCXSCGGTGTG^ 
ATCCTGGACATTTTCTGCCCAGATCTCTTTITGTTTTCATC^ 
AGCTCTGAGCATTATGCTCTGAAAAOtfU3TATTTGGCTGGGTGCT^ 
TAGGCGGATTACTTGAGCTCAAGAGTTTGAGACCAGCCT 

CAGGTGTGGTGGCTC AGGTCTTGGGAGTCTGAGGTXSCTTGG AAGTCTG AAGT CCCAGGAGGTCGAGG CTG

CAGTGAGCTGAGATCIATGCCACTGCAGTCCAGCCTGGGCAACAGAGCGAGACCCT 
ACTGTGAGGCACTCTGAGCAGAAAGATGTGGTGGTCTCACCTCACTCAGCT 
TGAGCGCCTTCCTGACCACACAGGCTCACACTCGGTCrTGGGGGAAT^ 
TGAGAGCCIAGACATTGCTCTGTTAATGACACCATGGTGGTCCCTCGGGGTTGGGGAGAGG 
40 AAAAAAT AAAG AG AC CCCC GCC AAG CAT CAGTG AAAAT C CC TGC AAACGTGATCAAG AGT CTGTTG AAGTGTACGTTCC CTTGTCC 

TTATCCATTTCATGCGAGTTGACGAAATCATCTGTACITT^ 

TTGTGTGCCCCTATCAGGACTTCCATTTCTATGAAATGACACTCCCTTAGTACTGTC 
ACAGAGTTAGTTAGAAACCTAAAGTGGGGTTCCTCTCCTCATACrCTCCCTCCTAGAAGCCCTGTGTC 

GGGGCATTTTGGGGAAGGTGAGGCAGAGATAGATATAGAAACCTTCCCTTT^

GAGGGCAAGGGGCTAAGGCCCCCACCCCCTGGAAGTTTCTGATCACGAATTGTCCCTG 
GATCTTACTAAGATGGTGCCrGAGCTGAGCTGGTtSGTC 

T AAAT CTC CCTTTCT ACTGAT ACAGTGAAT ATTGTGG CTTTTTTTTTT AAA CTGGACTGT AG AG T (^CTTTTTTTTTTTTTTTTTT 
TTTT'rriTTTG AG ACGGAGTCTT ACT CTGTTG CCCAGGCACACGAGGCT 

CGCCTCCCGGGTTCAAGAGATTCrCCTGCCrCAGCCTCTCGAGTAGCTGGGATTACAGGC

TGTATTTTTAGTAAAGATGGAGTTTCACCATGTTGGCCAGGCTGGT^ 
CCCAAAGTGCTGGGATCACAGGTGTGAGCCACTGCGCCCGGCCTGTAGAGTCAC^ 
TTTGAAAATACTTATGGTTTAGGTTTTTGAAAAATAATTTAA 
TCAGTGATGGAACTTTTGGAGCTGGTCTTCCAGACTCCACTGGTTTCAGG^ 

GGCTGCTCCCCTGCTCTCTGAACCTCCCTTGCAGTCTGAGAATTGATCATCTCAGGA^

TG CTTTT AAAATTGTGC!ATTTTT AT ACT AGT ATCT CTTT AAAAG AATATG CACTG AT ATGGACTCT AT CC CAACAGGTGT ATTGG A 

AGGTCTCAGAAGACAAAAAAGCAAAATCTGCAGAAAAGGGGAAATCAA 

TTCAAGTATTTCTTGGTTGTTTTAATCTACTGAATGCCAG 

TG CTGGG CTTGGTTTTGTTGTG ATGCGTCTTGGAG CTCATCCCTCTATTCCTG AGT ATCAGGATTGGGTTTCTCAAAGCAGGTGCC 

AAACATATTTGGCCTTTGAGAAATACTTTTTTAAAGTTTATATGAATTC

ACTGGTGAATAGTCAAGCCCACTGGTGAATAGTGCAGAAATTCAGGGTTrGrrCT 
AGGAGTTGCTAGTCATGCAGTTTTTCTCAACCCCTTCGTTGGACTTGCGAC^GAGAT 

CCTCTTG CAGG AGGG AGCACATGAGTG AG CAAGTGTGGG ATCTGG CCAGCTGTTTTGGGTGCCAGCAGG AGCAGGCTTTGTGTGAG 
CCCTGAGGGTGGCACCCAGGTTGGGGTGCCTGTGACCCCCGCAGCCCCAGAGGGTGCGTTACAGTC 

6 5 ATGGACAGTGG TGTGTT AT AAG CTCAGTGGG C CC CTTTT CTCATTGTGTGGG ACAG CTGTC CT CTGC CAGTG AGG G CAAAGGGC CA 

GTATGACAGCCITTTTTGGGTACACACACTCAGGGTCCTGAGCTC^ 
AAGGATGGTGGAGTCAGAGAATTTTGTTTAGTGTTGGAAGTGGCTCrC^ 
ATAATCTTCCCTGAAGTCTGGCCAGCTCTGGCCGGCrCTTCTCCAAAGT^ 
TCAAGTCACCTCTCTCTAGTCCAGCACCTTCTCITCT 

GCTGTGGGTAGTTTCGGAAAAGGCAACATCTGATTGTAAAAAGATGTTCAGAA^

TAGAAGTTCTGACTCTGGGCTGCGGGTTTTAGGCTTTTTG^ 
CTTTCTCTGGCTCCTTTCTCTATCACTG^ 

GTACTCATTCAGGGAGCCCTGGGCCGAATACTCATTCCCGTCTGCA^ * 
CTACCTGGCACTGGGCATTCTTCCTCAATGGTGGTCAAGACA^ 
ATGCTAAACTTTCTTAGCAAAATGCTAAACACTTGCACTGAAGTCTTTGT
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AAGCAGGTGGTGCAGTGTAGCTCCGGTTTCCGGGGGTGTGCAGGTGGGTTGGGGTG^ 

AAGAGAACTCTCCTAATCTGGGATAATTTCAGACTATAGCACATC 

GGTATCTTTGTTTTCCTGCITGGGCCTTTC^ 

CTGGGACCCTGGGTrGGGTAGACCTAGTGTCGCTGTTCCTAG 
5 GGAAATTCTGTAGGGTGGACACTTCTCCAGGCCT^ 

CAACAGTGGGTGTCCCAGTAAGATTTGAGGTAAGCATTCATTGAG 

GACTGAAACAAAATCCTGTTTGGCACCTGAACAGCATGTGGACA 

TGCCTAGGGGTGGGGTTGGGGTCTCAGTCAACCTTCCACTC^ 

ATGACAAGACAAGGCTGAGAATGCACACATCCCTCGTTGGCCTGG^ 
1 0 CCTGAGATGAACGGCGTGAGCAGTTTGAATAGTGGAGGGCATCTG<3^ 

CAGAGTGCCTCAACTACATCCGGGTGCTGCAGCCACTCAGCGCCACTTCCC^ 

TGTGACCACCTGGTAAGGCTGATGGGCACCTCCCTGTGCT 

ACGTAAAGCTGTCTCCACCCTAAAAAAAAATAGGATTTTCT 

AGTCTTGCACAGCTGCrGTCAGGGGCACACAGCCCACTTGGCTCCTCT 
15 GCAGCTCAAGGTGCTGGGAGCCCTTGGTGCTAGTGCAGA^ 

GCATTGGGGATGTGCTGGGAAGGACAGGTAGCCTGAACATGCT 

CTCTGGGCATCTGGGTCAGGTGGAAAAGTC^GGACT^ 

GACCCACCCTTCTGGTCGGCTGTGGTGTGCAGCrTGCT 

2 0 CCTGTGTGACGGTCCTTTCTATCCAGATAATCTGATGAGCAATGTGAT^ 

CATGTGAACATGGAAAATGAAGCACATTTTCTACTGTGTATCCCTGGATT^ 
GAAAAGCATCATTTGCTAGAACCCTAAATTAATTGGAATTAA 
ATGGCCTCATTAGAGACTTGATGAGGAAGCCTGGTATTTGCACACAGACGT 
GCCTGTACCACAGATGCTGGCAAOSAAACCTGATAGGGGGTG^ 

2 5 CGTTTCTATTTACTTTTGTGCAGAAAGTACGGTTGGACCAGAAA 

AAGCTTAAAGTTCTGTCATTTAAACTTACATTCTGTAGATAGGT^ 
GACCTATAATTTTTCCTAATTTTTTTTTTTTTTTC 

GGCTCACTCCAACCTCC^TCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGC 
CGTCTGGCTAATTTTGTATTTTTAGTAGAGATGGGGTTTCTCCATGTTGGTCAGGCTG^ 

3 0 ACCCGCCTTGGCCTCCCAAAGTGCrGGTATTACAGGCATGAGCCACCACGCCCCGTCAA Tl ' l 

TTCACAGCTGCAGAAAAGATGTATCACITGTATATCCTGGCAACCrrAGATTGCA 

TGCCTGTGCATGTCTATGTGTCCACATGTGTGTCTCTGTACA 

CACAGGCATCTCCTAAGAATAGTAACATTTTCCTATGTAACTC 

GCATCATCCAGTCTCCAGTCCCCGTTCAGATTTCCCO^CAGCCCCGAAACCTCCTTTCAAGGT 

3 5 GGGGTTCAC ATT CCCT AACCT AACAGGTG AAG AG CTG TG TG CTG CTTT AACTGG C CTTT CAAGG G TGTC AGCTG CGG CCTTCCACG 

TGGTCTGAGCCTTCCTGCTCTGCTTCCGAGAGCGTG^ 
TGGGCCCTGCCTGTCTTCAGCATGCCCAGGCTGTTGGCT^ 

CAGGCTCAGCCCCTCTCTCGTCTGTGCTGTCCAGATGGGTTTCTTCTGTCACTTGCTGGA 
CTTTCTCACTCGTTTGTCCACACTTCTCACATGTTCAGGGATGGAGAC^ 

4 0 TCACATATTTAACCTTGCCGTATTATTTCCCCATG 

GTGG CTGTGGGCACTCTGGCCAGGTAAAGAGAGG ACCCCCAG ATGGAAACGCCAG CCCCTGCTCTGTTTTAGTACAGTGCGGTTCT 

GTTTTTAGTATTCAGATGK3TTGATCCATCTGX1AGTGTGTTTT 

G<rrTCCTATTTATTGGGTCACCTAGCTGCTTAGAATGCCGTCTTTATTGATGTTA 



4 5 ATTG AGGGTCTGAG ATCTTGAGJAGCTTGGCCAGGGCAACGGAG CAAGGGC ACC AAGGTGCTGATGTCTAGACCTGAGCTGAG CAA 

CACAAGCCCTGGCCCTAGTGGGCGGGACTCCTGTAGCTCCACACCTATCTGCTCGGCCCT 
GTAAAATGAAGTGGTACAGAGACACATTGTTACTATAGAAATTCAGC 
TTAACATCCTTTAAGTTTCTGGGGAAAAATGAAGATGGCAAAGGAAGATGTCCCT 
GGTTGGTGAGTCCTGCCCCTCOVGTTCCTTCACCGTGAAGATTACAGTGGCATGTT 

5 0 TGAAGTGCTGTCTTCCTTGGCTTTGTTTAAATAGGAGCT 

ATTCATTGTTTGCAAGGATAATAGCCGCCATGGTAAACGTC^ 

TGCTGAGCAGATGCTCCTGTTTGAGAGTAAGGTTTTGACTGAGCCACATGCTGTCTGCA 
CATAGCCTGGTGTGTACCCACTGTTGCATCTGCCAGCATTTTCCTGGATGTGGGAA 

GACTCTT AAAATGCAG CTGAACATGG AGCATGAAATTATCCCCATCAAAGTGAGAGCACACCTGTGGTC CTAGCT 

5 5 TCAGGCGGGAGGACAGGAG1TTGAGACCAGCCTGGGAAACAGCAAGAACTCATCT 

ACTTGAGGTCAGGAGTTCAAGACCAGCTGGTCAACAGGGTTAAACCCCATCTCTACCAA 
GG CG CGCACCTGTAGTCCAAGCTGCTTGGG AGGTTGAGGC AGGAGAATTC 

TCG CACCACTG CACTC CAG CCTGGGTGACAG AGTGAGACTCTGT CT C AAAACAAAACAAAAAAAC CT CAT CT CG AAAAAG AAAG AA 
CTAGAAGAC AOVGGACACCCCAAAAGTTGAGGAG ACTGTCTTTC CCTTTCCCCAGG CAGGG ACTT AATTGGGTT AT AAAGGAAAAC 

6 0 ACCGTGTTGAATGTTTGTCCCAGTGTTGGTGCCATAGCCTGAG^ 

GGAGAACTTTATTCGGGGACGTCGTATAATTTTTTGGGAAGTGAACCCATCATCT 
AGAATATGCAATCCCTTGGCTGAACGGTAAGXiAAGAGCAG 

TGATCTCTCCTTTTCTGTCTCTGAGCCCTGGGGGCCCCCGAGCTTGTGTTAGCCCCTGTT^ 

AGCAATGGCCAG AGTCCCAGGG CACCTTGTTGGGTGTTTTCACTGGT AACAGAGTACTGTTTTAAGACAG CTCCCCACTAACCCCC 

6 5 CAAAAATGGATTTGGGAGACTCTCATAAAATTGAGAATCTGATAGTAGATGGTAAACTAAGACTAAT 

ATTTCCAGAATGATATTAAAGTGTGGCACCAACACTGGTGT^ 

TCTGGGTGCCTTGTTGCTGCCAGCAGCACATGTCCGATGTCCGTCTTCAATGTGCAGO 
TTTCTTGTGGGTGCCAGCCAGTAGAGCCTTGGGTTTTGAACCTCTTGAC 

TCGGCCATTCCTCAGGGTGGGG CAGGATCTGGTGGTTAGAGTTTGTTTATGTGAAG CATCTGTTGTTTAAACCAGCAAATGCATGC 

7 0 CTTACGT AGGTCTCTGATG CTGGTACCATCTGGCTCCCTG CCTT CCAGTCAGGAGG CCCAGCCTCAGTTCTTGGGGTGAACCTGGG 

TGGGATGCACTGCAGAACCACTGTCGGGGGCACCTCTCTGAGGCCGAGGAGGTC 
ATCATCCTCAGATCCTGGAGTTTCTGTTTGGATTGTGGCATGTGCACACT 
GGTAGTGTTGGCCGGCCTCGTGACCTCATGGGTGCAGGC^ 
GAGCCTAGTTl'CGTGTTTGCTGACGT^.TCC^ 
7 5 GGAGGTGTCTGTGGAGTATGAGTTTGTGTTCAGGGTGCTGATCCCAC^ 
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GCTTCGCTTGGCCCCTGCCATGGCAATAGTGTGCCTGGATG 
CTTTATTCCCrTGGGCCTCAGATGCAGCTCTCAATGG^ 
GTGACCCATTAGCCTCCTCTTCTCAGGAGGTTCrr^CTTTTAATGCA 
TGGAATGTATTCACCAGAGTCTGTTGGAATTTCACCATTTGGCT^ 
5 TT AAGAAG AGAGAACCAATACAGAAGTTTGT CAAAACACAAAT CTTT AAGT ACATT AAAAGGGGAGGG TGC CTTT AAAATGACACA 

CTGTAGGCACAGTGCTCAGCACAAGAGAATGGCCTATGGGC^ 
GGTGGATTGTCAGGCCTCAGGGGAGCTGGGGGAAACCACATGAAACACAGCCG 

GGTAGCCATT CGTCTCTCATGGAAATTGGTAGATAGTTCTTG CGTGG CCCTTTCTCATGTAGTCATGACCATCAGGACATCAG AGA 

AGCAGGGTTCCTTTC^TGCCTATGGCGCACAAAAGTGGa 
1 0 TTGGGTGATGTGCACACCTGAGTATCCACCTGGTGCTTTGCTCC 

CCAGCTGCCCCCCAGCCGTGGCTGCCATGCTCCTGTCTTGCAGACCAGGAAACCATCCT 

CCAGTGCCCCTTGAGATGGGGAGGGTGCTGCAGGGCCAGAGCTCCTGGCAGGCGTG 

Cn^CCAGGGGACTCGAGCCCTCCCCTGCCATCGGGAAGCCAA 

GGTTTGCAGCAGGATCAGTGAGCAGATTCAGCGATGCACAGAGCAGCCACCT 
1 5 ACTCCTGTCATCCCCTCTGATCCCTTTTTTTT^ 

CTCCTG C CT ACCACG CAGGCACACATG AACCCTTCACCTGCTG C CCGTGAC CTG AAAG CCACATGAG ACCCCTGACGAAAAATGTG 

CT AG AAATTATT CTTTG CT AAAACATCAAGGAGG AAAATG AAG TCCTTTT AT CAAAAAGAG AATTTGGG CTC CTGTAAT CACACAG 

CAGAGATGAAGGGGGAACATTTTCCCTGTGTCCAAAATAAGT^ 

20 ' AGAACTTAGGTTCAAAGCACTTTAGCAGATACTTTGGGTAGAAA 

CATTIGCCCAAGACAGGCTGCAGGGCCGGTGGCTGTCCCCrGGAAGG 

ATGGGCCCTATGTCAGGAAAGTCACACAATGCCAGGGGCCTTGGCAGTTG^ 

GGACTGAATGCTGGGCAQVGGCAGGCATGGACGATCAAATGAGC^ 

CCCAGGAGCGTTCTGGGATCCAAGCTGGAACCTGGACGTGGTGCTGCGGGCCACTCACATA^ 

2 5 CGAGCTTTGTTTCAGTCAGGCACAGCCGCGCTGTGCTGGGCCATGGAGGGGTGC^ 

CCACTCTCGGCCCACCACGTCTTCTTGTTGGGGTTTTAGGGGGACCAGGGCGGCCT 
CCTCAAAGCCCGACTCATCTGCTCCCGGCCAGACAGCGGCTTGG 
GCCTGAAGGTGCCTGTGTTCTATGCACTCTTCACCCCACAGCTGTAAG 
CGCCAGGTGGTCCCGTCACCTCTTTCCCCTCCCCAGGAACAACGTGGGGCTGTCGGC^ 

3 0 AGGAGGTCTTCTCCCACGGGAAGTACATGCAGAGCACCACAG 

CCCAAGCCGCGGCCTGGAGCGGTGAGCTGGGGCACTGGGGGCGGGCCGTGCTTTCCC^ 
. AGGC^CTGGGGCCTGCTCATGGGCTGGGTACCTGGCCACGCCAAGGCTCTCTC 
ATCCTTCTTCCCTTTTGCATGTTACGTAACAGCCATTC 

T AACATTG CAT AACAA.TT AGGTCTGTTTCATTTTCAG TTTCTT ACAG AG AA^ CCTTGG CAAAGG C CTCTG CACTGT C 

3 5 CACTAGTCAACAGGAGTCACATTTTTTCCTCCTAAATATAGTGAA 

TCACATXX3GGAAATGGGTGTGACAGGCATGTGCACAGCCCTCTGTGGGGCTGTGAC 

ACTGCAGCACCAGCTCCCTCCGAAGGGCAGCTGTCCCTGACACCATCAT 

TCATGGTCACAGATTTTGGTTTACACACATCAGGGAGCTAGAACAGGAA 

TCCTGGTTTTAGCCATGTTTAGCATTCTT 1 1'CATGAGAAAGCTGCAAATCAGTGTTCTGTTTTTTAATTTCAAAAGCTCAATAAGT 

4 0 GTTTTTCACTTACAAATAGATACAAATTCCTTCCCTGAGTATGTATCGTGAATGCAA 

TXjTCCTCTGCTGGCTGGGGAGAGGCCACCTTTCCrCATAGTA 

TTTCCAGGTTTTGCAGCCCCCCCTGCGTTGGCTGCrrTGTGTGGAAGCACCCGTTG CGTGG 

G ATTTTCT CCCTG CAGTGCAT CG ACAG CGAGG CACGGGCCG CCAACT ACACCAGCT C CTTG AATTTG C CAGACAAGACG CTG CAGT 
TCGTTAAAGACCACCCITTGATGGATGACTCGGTAACCCCAATAGAG^ 

4 5 CAGATCGTGGTGGACCGGACCCAGGCCCTGGATGGGACTGTCTATC^ 

CAGGCTTGACAGTGACAGCTlCATCTGTGTCCrCTGGTAGGAAAACT 
CCCTCGAGGTCACCTGGGAGGTTTGTGGGAGCGTAAGAAGATTTCTC 

C C AGGCT ACTGGAGGT ACTTGTT AAAAAT AAAAATGCAAG TCCTGG CCTGTGGCTGGG CTGGGCTGGGGCAG AGGAGTACTGGCCG 
CCGAGGAGTTGGGGOIGTGCCCACGCAGGGGGCAGGTGTGAGCCACCTGGGAGGAGGCGTCAG 

5 0 AGAGGACCACGTCCCTGCTTCCTGGCCGAGGAGCCAGGTTTTAGATGTGCCCT 

CAGGGAC AGGGG C CAGGAGCCACGGTGGCTGG CAG CG CATTGTGGCCTGCCCTTGGAG CCTCCCAGCCAACCTCACATGCCCCCTG 
TCCTCACCGCCTGCCTAGGGCACTTACCAGTGACACCCCGAGGGCT 
AGTGTAATGTCGACTGTTGGTGAGGATTCTTGGGGTCAACCATGGAACA^ 
ACACTTGGGTTTCCAGTTGTCTGTTTTGCCAAAGCCTTT^ 

5 5 TGTGGAGGGAATGACTTGAGAGTGAAGAGTGTGAAGTTTGCCGCTGTGGTCACTGTCCTTGGTTTCTG 

TGACCTGTAGCTTTGGCCCTGCAGACCGGGGAGCTC^ 
CAGCTCTTCCAGGACrTTGAGCCAGTCCAGACCCTGCTGCTGTCTTCAA^ 
TCrCCXSCTCGGTGGCTTTGATTCTAGGTGACAGGAGAGTT^ 
CCTGGGTGAGCCAACCnTCAGAGTTTGACACAGAGCCATTCCGTGAGCT 

6 0 GGGGAGACGGGACTTTCTCrTCGAAGTTGCTGGA 

CTCCATGAAGACACCTTTTTATGCCCTGAGCTTGGGGG^ 

CACCGTGGAAAATCClACAAGAGAAGTGGTTCGTCrCATCTGTAAAGCTGTCTCCACCT 
TACTCCCAAGCAGCCCTTGGCAGGGGTGGGTGGGCGGGCATCCACGCT^ 

CGGGACCTCCAACCACCTTGTGGGATCCAGGCC CAGTCAGTCCTCTCCTCC CTC AGCTGGG AGCAACAGGC AGAGGAAGAGGGCCC 

6 5 CTGTCreGCCGGGTAAGATACCTACCCTTGGCAGGTGCGTATTTGCA^ 

ATCCACAAAGCTTGGGAGCCTGGGTCCTTTTACGTGGTCACCTCCCCTCTGCT 

AGAGAATGAAATAGCCTGAGCAAGTTATAAGCCACTGGGTACTGGCA.TCTGTTCA 

GCCCTGGAGATTTGAAGGGTCGACGCTTTAGCCACTGTGGGGCCT 

TTCTGGGX3CCATCGTGCTGTTCAGTGGCCTCAGCCAGACTGGG CAGG CACCTACAACATCTCTAAAATGTGG ACTTCTCATTAGAA 

7 0 TGAAGATTGGGCCCCTGGTGTGGACATTCGTGAGAATGACACTCTGTTCCAAGGGTCAC^ 

CAGGGCACATGGGCACTGATCCTCAAAGGCAGAAGGGGGTAGGAAGAGGA 
CGCCATCTCCCAGAACTCGGTCACATGGCCACACCAGGGCAGCTGGCCACAC^ 
GTTTGGGGAAGCGCTTGCTGGTCCTACTGGACCACCCGACCCGCCT 
TGCCTGGACAAGCGGAAATACATCCTGCCTGCCGTGGGGTCCTGTGCCTTCT 
75 AGGCCCAGCTCCATGGCOvTGGCAGCAGGAAGTGGCTGT^ 
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GGTGATCAGCAAATGGAGAAACGGCCAGTACCACGCCCTCCCCCTGTCCTACAGCTGGGGAGGTGCGCTGTGCCTCCACC 
TCCTCTCCCCTGATGACAAGCATGGGGTrrTGCTTTTTATTTTGGTAA^ 
CATTGTGCACGTCAGTGGCATTAAGAACATTCACGTTGTTTTGCAGCCATTACCACT 
CAAACAGAAGCTTTTTACCTGTGAAATACAAACTCTTC^ 
5 CTCTCTGGGTACCTCATATGAGTGGAATCACACAGTA 

TTCCTCCCTGCTGTAGCAGGGGTCAGGGTTTTCCAAGGCTACATACTCCTCTGTCGTATG 
ATTTTGTGTGTGCGTTCATCCCTTGATGGACACGGTTGCT 
GCAGATACCTCAAGTCCTTGCTTTCCATTCCTTGGGCGCACACAGCAGATGC 
GCTGTGTCTGTATTTTCACCAGGACTCCTreGGATGCTGAG 
10 AGTTGCCTCTGATCACGTCCACCGTCACTCCAACCTGCC^ 

ATGAG AG CTG C CTG CAAGGGG ACAAT C CT CAGG CCCAGGGAAGGCGGAGACAGGGGG CGT CGAGGGGCAGG AGTG CACAGGACAGG 

CTTGCTCTCAGCGGGCCTCACTCATGCTGCCAGGTGCCTCGTCA^ 

TTTAAATTTACAGAAGAGTTTTGATGATTGCAGAGAA 

C CAGTGTG CATTTATCTGAG CCAAGGTG CTGTCATGGTGCAGTGCG TT AC CTG AACC C CAG ACC CTGG AC C CTCCT ACCTCC CG CC 
1 5 CACCACATTCCTGGCTCAGCACCTC^GGTTG(^CTGGA 

CACTCCTGGAGAGGAGTAGTCTGGTGTTTTGTGGTGTGTCTGGTGTAT^ 

ACCAAAGAGGGCGCTlTCrreGGGATGGGGCATCrGTATGACT 

GTCTTTTTCCATGTCGAGTGTGTCGGGGCATGTAGT^ 

GGGAAACCCCCACAGTCATTCATGCTTATTTGTAGGGACAGACTTGGAGGC 
2 0 CACATTGACCAGTGGGTCCTGCCTGGAGCAGT^ 

CACAGTCATCCCTTGGTCTCTGTGTGGCXSTCGTTTAGTTTTTAAAGCT^ 

AAGCGGAAGCTGGGGCTGAGTCTCCCGCAGGGTGCAGGAGGTGGTCTGGATCCCCTTG^ 

CCACCCTCCACAGCTACCCTCAAGCCTCTCTTTGGGGTGAGGAGTCAGGCG^ 

ACTCCATGGTGTTCACG CCACCTCX5 CAGGTGCCCCCTX5CX3GGACCCAACAGGACCAGCTt3TC CCTIGG AAGTTACCCAGCTTGGCT 

2 5 TTGCTGCCAGAGAATGTTTCCGCCACCTCCACAAGCCGCGCGATGCCCGAACT 

AAACGTGGCTCACACATGGCCTCATCTGCCGAGG 

TTCCTCTATTGGCATCAGGGCCTGTGATCHTCCAACCAC^ 
TTCATCTGACTGTAAACCACTAAGTCATGTGTAAATTTGG^ 

3 0 GATGCCCTGGGCAGAGGCTAGAGAGCGCGCTCCACGGGCCACCTCCAGGTCT^ 

CATGTGGGGCACCCTCCITCCAGCTGAGTTCCGTTCAAAAGCAGTGAGC^ 

AGGTGGGCTGATGGGATGGACCTGAAGCTGGTGTCACTGGGAACCAAGTGTCCCAGAAGGTGGGA^ 

GTGGGTGTGGCTGCTCTGGAGCTCGGAGGTTGACTTCCC^ 

GCATTGTTCCTTTTTTATGTTTTGTATCACGATCTCCTCA 

3 5 GCTGGATCCAGACCCTCACGAGAGGCTCCCCGCCCACIAGGTGCTCACATTCTGGCTCG 

TGCTTCAGCCTCTGGCrGOVGCCCCTTCTCACCrATAGTGAGGCCGGGA 
GCAGGGAGGAGATCTGGAGGTGGGGACTGAACAGCCTCCCCACTGGCT 
AGGCACCTGTGGGTGCCCAGACAGGGGAGAGAGGGATTTGGACCTAAGGCCCAGTGAGCCCT 
AAGGTCACAGACCTGCCTTGTTTGCCrrCTGGTTCTGAGTACCTCTCCT 

4 0 TTCCCACACACTGGTGCAGAAGCAGAGGCCATGCTCCTGGATGCCTGCTT 

GGCAACAGGTTTGTCTATGCTGG CTCTAACTCGGGCGTGGTCCAGGCCCCG CTGG CCTTCTGTGG GAAGCACGGCACCTG CGAGG A 
CTGTGTGCTGGCGCGGGACCCCTACTGCGCCTGGAGCCCGCCCACAGCGACCTGCGTGGCTCTGCACCAG^ 
GGTATGGGK3CGGGATGCGCTGCCTGGGGATGTCCCCCATGTGTGTGGGTGAATAACACGTGC 
CAGGAGATGAGCGGCGATGCTTCTGTGTGCCCGGGTGAGTGACTCCTC^ 

4 5 CCATTTCACGGTGCTTCAGTTCTGTGGCATTTGTGTGTGTTTTGTTTTT 

CIAGACTCGTGGTCITCTCTGTCrTTTTTTCTTGCCGAAA 
TCCTTCCAAAACCATCCTGTTTTCTTTCTGCr™ 
GCCCATGAGTGATTAATCCTCAGGCAGTGTAAGK5GCTGGGGTGGTC 
ATGTCTGAAATAGCACCACACCCAGAGGGAGGGGTGGCATCAGGCCAGA 

5 0 GGCAGACTTGACCCTCAGTT CTGTGTCTTCCTGACCCTG ACAAG ACAGCTGTCATCAG ACTAAGAAA CTGGG CAT 

GGTGGTACACGCCTGTAGTCCCAGCTACTAGGGAGGCTGAGGCAGGAGGATTGCT^ 
ATCATGCCACTGCACTCCAGCCTGGGGTGACAAAGTGAGACCC^ 

ATGCCTGTAATCCCAGCATTTTGGG AGTCTGAGG CAAG AGG ATCACTTG AG CTCAGG AGTTCGAGACTACCCTGGGCAACGTGGTG 
AGACCCTGTCTCTACAAAAAATACAAAAATTAGCCAGGCCTGGAGGTGGTCACCTGT 
5 5 GAAGATCCCTTGAGCTGGGGGGGTTGAAGCTGCAGTGAGCTGTGATTGCA 

TGTTTCAAAAAAAAGAAAAAAAAGAAAAGTTTAGGC^GGAGGAGAGTGCTGAGATGCACTC 



TCTTCTGGGACACTGGGTGAGAAATGGGATTTCTGCrGCTCAGAGCTGCCT 
TCATCTTTGCTTTTTGGTCACGCATGTCTTTATCrCCTTAGTTGTTCTGGAAGATTCCATGTT 

6 0 GTCATCTTGTGGGTATTTTAAAGAAGTTGTTAAATAGCAAAT 

AAAGT AAAGG AAGTT AC CGG CAG CATTTTTT CAAG CACGGTGG CACAG CGG AACTGAAATG CT CCCAAAAATC CAACCTGG CCCGG 

GTCTTTTGGAAGTTCCAGAATGGCGTGTTGAAGGCCGAGAGCCCCAAGTACGGTCTTA 

CTTGTCAGAAGGAGACAGTGGGGTGTACCAGTGCCTGTCAGAGGAGAGGGTTAAGAACAAAACGCT 

ACGT CCTGGAAGTGAAGGTGGTTCCAAAG CCCGT AGTGG CC CCCACCTTGTCAGTTGTTCAGACAGAAGGT AGTAGG ATTGCCACC 

CAAGCCTGCGCCCACCGGCACATCCTGCGAACCAAAGATC 
TTAAGTCCAGCGACAACCGCCTCCTCATGTCCCTCTTCCT 
AAGGGATACCTGCCCAGACAGTGCTTGAAATTCCGCTaSGCCCT^ 
TGAGCAGAGCCTGAAGGAGACGTTAGTAGAGCCAGGGAGCTTCTCCCAGCAGAAT 
70 GCTATGAGACCGAGCAAGACACCATCACCAGGAA 

TGACGTCAAGTGTGAGCTGAAGTTCGCTGACTCAGACGCAGA 

CGACGTGTCCAGGCCGTGGAGAGTTTTGTGTTTCTCCTGTTCAGTATCCGAGTCTCGTGCAGTG 
TGCAGACAACCTCAGTCCTCITGTCTATTTTCTCTTGGGTTGAGC 

GC ATTG CATCC CAGTC'iTGTGTT CCGAAGTCAG TCGGAG T ACTTG AAGAAGG CCCACGGG CGG CACGGAGTTCCTGAGCCCTTTCT 

7 5 GTAGTGGGGGAAAGGTGGCTGGACCTCTGTTGGCTGAGAAG^^ 

ATTTAATTCCAGATTGGAAATGACATTTTAGTTT 

CCACTTAATTA'rni i 11 AGGATTTTGCrrTTGATTGTGTTTATGTCATGGGTCAT TTTTTT 1 1' AG TT AC AGAAG CAGTTGTG TT AA 
TATTTAGAAGAAGATGTATATCTTCCAGATTTTGTTATATATTTGGCATAAAATACG^ 

AACTTCCTTTTGCTAAATGCATTCTTTCTGCrTTT HIT 

CTTT A rTTTTT T 1 TAACTTTATTCCTTGAGGGAAGCATTGTTTTTGGAGAGA l " riTC " riT CT ITTT C l Tl IT 

TTTTAACrTTTACTCTCTCGAAGAAGAGGACCTTCCCACATCCACGAGG 

AGTGGAGCCAGGCTGGCCCAGAGCTGAGATGGGAGTGCGGTACAATCTGGAGCCCACA^ 

GATGGAACCTTCACAAGGGCGCCTTTGGTTCTCTGAACATCTCCTTTCTL^ 

CTTTCTATCCAGCCTCACTGAGCTGCCCACTACTGGAAGGGAACTGGGCCT 



CTCCCCGCTTTGTAAATACCCATCGGGAAGAGACTTTTTTTCCATGGTGAAGA 

GACAGTCTTAT CTT C CAG CATGAT AGGATTTG ACCATTTTGGTGT AAACATTTGTGTTTTAT AAG A (J 1" 1 ffl TTTTATTTTT 

CTACTTTGAATTGTATAC^TTTGGAAAGTACCCAAATAAATGAGAA 

GCTGTGCAGCCTCCCGCCCCTG CATGGCCCAGGCGGGACAG CTGGGCTCGCTCTTTTCTGTCCGTTTCCTG CTGTGTTTTGCCCTC 
TCTGGCCTGTGACGTGCTCCTGGCGACTTCTTCCTCAGTGCCCTCTCTTTTCT 



GCCTCGGGTGCGGGGCCCGACAGCAGCTCGAGGGTCTCCTTGCTGCCGCCCTTCCTGAGTGACCAGG 



rCCTTTACCTTCCTTATCTGTCCTGGCTGCGTCCCTCGGGTTGA 
2 0 GCCGTTTCACGCTTTCATTGTGGGTCCATCCCGAGGGAAACTGTAGACAGTGTGAAGCCACTTCTAGTGTA 
GTTTTCTGTAAAGGGCTAGATATTTAGGCCTTGTGGGCTGCAGGCTGTGTCACAGCCACT 
AGCCATAGTCAATACACAACGAGTGGTCAGGGCTGTGTGCCAGTAAAACTTTATTTATAAAAATGCAOT 
CAAAGGCCCCAGTTTCCCAACCTGTGGTATATAGGATTGCTTCTCCCTTAAAAA 
TGTCACCAAGGCTGGAGTGCAGTGACACAATCATAGCTCACTGCAGCCCCTGCCTCCT 

2 5 TCCAGAGCCAGAGTGCTTCTATTTCTGCTAACAATTTGGAGGTTGATAT 

CAG AATGGGAGTTGTTTT AATGACTTG CTTCGTGGT ATTTAAAAG CGG AAAAGATTGG CGGGCATAATCCCCCTGCTTTGGGAGGC 

CAAGGCAGGAGGATTGCTTGAAGCCAGGAGTTTCACAGCAGCTTGA^ 

AAAACTAACCAGGCTTGGTGGCACGTGCCTTGAGGATTGTTTGAGCCCAGGAGTTTGAGG 

GCACT CCAGC CTAGAT CACAG AACAAGACC CTGT CTCAAAAATC ATCATCATT AT AAAAACAG AAAAG ATTG AG A 

3 0 TCCTCTGGGACCTGGAGGCATTTGCCCrGGATTGGAAAATACTCGGATGGGATATCTG 

CACACTTCAAAGATGTATCCTGTGCTTCAGGCAGTGCGTGGTAGCCCTGGTGGCCCCTGTGTTGTCATCT 
CAGAAAGAAAGGAAGTGGCCTGTTAGAGCCTGCCCCGGGGGAGGGATGAAGTTAATGAAGAGGAA^ 
TTTTAAACATGTTAAACACACTGTCAGTAGCTCCTGAGAGTTTTAAGGTGCT 
GGTGGCCCATGTCTGCTGAGGCCAGAATGTCCCTTGGCAGCCAGNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNl^ 

CTGTGGAGACGGAGCTTTGCTAGGCrcGCTTGTGTGGTTCCTGCCCTCCACTGGGTGCT 
TGTGGCCGCTCTCACAGCTGAATGGTGCTCCTCGGCCAGCTAGTTTGCCTTGGTGCAGGC^ 
GAGACTGTGTCCTGTCGCAGCACTGGGGTCCTTCAGGCAGACTCAGGAAGCTCCTCCT 
CTCTATGAACTGGCCGTGGACCCAGGGCTCCTTTGCGGGCA 

4 0 TTTATTTATTTTTGAGGCAGAGTCTTGCTCTGTTGCCCAGGCTGGAG 

CCAGGATCAAGCGATCCTCCTG CATCAGCCCCCCTAGTAGCTGGGATTACAGGCAC ACCTCACCATG CCCAG CT AATTTTTGTATT 
TTTAGTAGAGACAGGGTTTCGCCATCTTGGCCAGGCTGGTCTCAAACTACTGACCTCAGG^ 

GTGCTGGGATTACAGGCATG AGCCACCATGCCCGG CTGCTGTTACATTTTT AAATGTGAGG CTCCAGGTGTACCCCCTGTAGCTAG 
GCAGGCATACACACAAGCCCCTGGGGTCAGCAGGGAATGAAAGATGGGTGCCCAGGGAGAAGGGACATCACTGTTAC^ 

4 5 AAC CAC C C CAGAT ATTT CAAC CACAGGG CATTT AAC ACAGGG AATTGG TT ACAAAGG TG AT AG AAGGG CTGGG AACAG TGGGAAGG 

TGAGGTTTCCATTATCAGCCATGGGCAGGAAGAGTGAAGTGTCAAACACTCAGTCTCACTTGCT 
GTGATCTGCAGCCTCTCCCTGCCCGCCACACTGCCrrCACTGCTGGAGGTCCTACT^ 
CTTCCTGTCTCCTGTCAATGACTCCCATGGCCAGACCTAACAGAAAGCTGGCGGTCACAGATGGA 
. CTTG CAG CAGTGTGG C CAGGAGT AGTGTGTAGAGGG T AGGGG AG CTG AGGG AGG AGGGC AGTC CTTTG CCAGTG CTCAGG ATGG AC 

5 0 TGAACAGATTGAGAACTGAGGCCCCCAGCCCTCCGCCACCCCCTTT^ 

ATCTGGGACATTCTCGTTCACTTGTT'CCACTGAGGCCACAGCCT 
AGAGTGGCTTCAGCCTTTTCAAGGGTGGGCCGAGGCCAGGGCATCATGGGC^ 

TTC CTGAGGAG AGTGAGTGC CATTITCTGCCTrGTGGGCCAGGAATGAGCCATTCCAGGTGAGG CCCAGAGCCTGGAG ATTGCAAT 
55 ACAGTGCACTCAAGGCACAGGCCTACCGCTTCTGCCTGTGTCCTCCC 

GTTTTTCT TTG CT AT AATGGCTG rni CGTGGT AGCTGTCACAGTGGTTTGGGGG CTGT CTGATTAAAGGGCT AGCAAAGTG TCTT 
TAGAATATGAAACTTAAGAGGACATTCGTGGCATCTTCAGTTCCCCATGCTGCGTCCT 
GTGTCTTGGGGAGCGTCTC^CCCCCCAGAGAGGGCTGACTGATGGAAGGTCAGCTGG 
CCCAGAGCTGGTGGAGGCCCATGTGTCACCCAGGAGATGTCCCC 

6 0 GGACTTCGCCATCCAGGAGAGGTGAGGGCCCTCTGAGGCCAAAGCGCTTCC^ 

ACATAGCCTGTGGTGTGTGCAGAGGCTGCCAGGGCACAAGGG CCCAACGTGGGGTGCGGTGGAGGTG ATGTGG CCACGTGGG CGG A 



GGCCTCAGTGCCCATCCAAGGGAGGGACAGGGCCACACCTGTGGGCCTGTGTCAGGAGCTGGGGTGGGAGA 
ACTCTCCACCTAGACACATTTCATCCCTTCrCCACCACAGTTCTC^ 

6 5 AG ATCCACTGATGG CCCTGGCACAGGCC AGCCTTGTCCCTGGGG CAGGATGGGACGGTCCTGAG AAGG CCCTGGGACACCCTAGGG 

GTGTCATAGTGGGAACTATGCCTTTGCAGTGCCCAGGTGAGAC^^ 

CAAGAGGGGTGGGCAGGTCAGGACTGCACTGCCTTGGGGCCCGTGCCCTGGGAGCCGCTAGCCCT 
GACTGAGTGCATCGGCAGGGCCTGGAGCAGTGGGAGTGGCCGANNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNIWNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 

CAGCATACAACACACATCAC^ACACACACCAATCATACCACTC 
CACACCACATAGACAC^TGC^OSCTACCCAraCC^ 

7 5 TGACTTAGGATCGTTTCAACTCACAGTGGGriTATTGGG ATG CAGCCCCCTGG CAAGTGG AO CAnCATGTGTATTG ACAAGGGGAG 
GAAAGCCGTCTGGAGCTGGTGAAGTAAAGTTAGCATGCAGCCGAGCTCTGCTCGTCTGAACTGATGAACCCAAGTTCCT
CACTCAG AAT ACAGATGGGTGAAAAAAT AAC CCCAC CAC CCGTGTG C CAG CACAT AGCGCATCTGG CAT ACAAACGT AAG GATCAA 
TAGCGATGAATGGCGATGGGTTGTGTCTACATGCA^ 
TATAATTTTTTAACCAAAGTGCAGTCTTTACCCTGTCAGGGCCCT 
5 ACCCGTGTCATTTTCAAAGCTAATAATGGTTCCCCCATACTTTAGGCAG 
TTAA<n\A.CAGTGTTAGAAATCAGAGCCCTTGGAACCACTG 
GCCCAGGGCTCCGGAGAGGGCCCCCGTTCCTAGCAGCCACAGTGGCCTGTT^ 
TTTCCCTTTGTTTCAGACTCTTGAAAGGCCACGGCCGTC^ 
AGAAGGTGGTGCTATCAACTCCCTGTGCTCTGAGGTCCTTCGCAT^ 
1 0 CTGTTATCTGGGATAATTCCACGTGCCTTTACA^ 

GCCTTCCCACAGGCCCCCCATGGGAGCCAGGCGATGCCTACA 
TTCAGGATTCCTGCCCACATTGATAAGGCGGCTTGGCCAGT^ 

TCCAGTG CCCTGAGTGGTCTAGAGCATACCAGAGTGAAACGTTAGG CTGCGTCGGCCTGGTGACTGTCAGGG AGATGAGGACTCAG 

1 5 TTCAACAGAGGACAGAAAGGATGACCAACAAGGCCTCAGGGCATACCCC 

GTCTGGGAGGGAGGAGGGGCCATGGCCAGGATT AGAAAAG AC CGTG C CCCAAGGAGAGAGCACAGAG ATGGGC CAGGG AAG CAGT G 

GGATCCTCX5GGAAAAGAGGCTCGGAGCAGGTGTCCGGGTGAGGCCC 

ATGG AAGACATG ATGG CCAG CXX3CCAGCACTGTGTGCAG AGAATGG CCCCTCTGG 

AGGCCACCTGGATGTGAGTTTGGGAGATGGGCTGGCCCCGAGGC^ 
2 0 CGGGAGGACACAGAGGAGTGCTGCAGGAGATGGCAGTGGGGGC^ 

TCTCACCACCCCTCCAGGCCTTCCTCAGG^ 

TTTGTAATCACATCCACGAAGCTCTGAAGCCATGGTGGTC 

GTGGGAGGTGCCCCAGTGACAGGCTGAGTGCTGGAGCCC^GGCTGCCAACAGCCT 
AGCCCTGGTGAATAAACCTGGCCCTTGCTTATACCAAGGTGATCAGTCTTCTC^ 

2 5 CCTACGGCATCTGCCACAGCAGTTAATTTGCCITCAACTTAAGCT 

CAGGACCTCTACCCTAGATCCITGTAAGCCACAACAACTTCCCl"r 

AGAGCCAAGGCACATCCAGGTTGGTGGCTGCTGAGCCCCACAGCATCAGACCAGCCT 

GTGATCTGGGAGTAGCACCCCAGCCTGGCCGCT 

GATGGT AACT CTCACTITT CCCTCTCT CATG CATT CAG AAAG AG ATGGATAGTTT AAAAACG AAT ACATGG AACGT AAG T ACTT C 
TTTAACATCTTTAATGTCrTATGATGATCAAGCATGTAT^ 
GTTTCTGCTCACACCCCAAGAGTGTAGAGCriTTGAGGGTTTTGCAGTCAGTG 
CCAGCTTAGGCC^CCCAGATTCTTAGTCTCAAGGGCTTGGTCATGAGA^ 

3 5 CCTGGGTGGTTCTTGGTGCAGGTCACATGAGGGAGGAGACCGGGC^ 

TACTTGATCATAACAGCTCAAAAAAATCTGTTTCTTAAA^ 

TCCCCAGTTGATGGGCACCTAGATGTACATAGAGACATATGTTTGTACATCTATG 

TGGAATTATCTGTGGCCAGAGTCCAAAACATTTTGAGTGGGAAAAAACAAATTATAGAGTC 

TGCACAAAAACCAAACCTGTTCACAAGGGCTTGTCCACCAGGC 

4 0 TCTGAATTTCCTTGAGTTTACAGAATGTTTCCTTCAAA 

TAT AAG TTTGG AAACTC CAGGG AAGT AT AAAGAAAAAATTTAAAC AAT CC CAAATTT AACCAT CCAGAGG T AGTT AC CACCAG CAT 

C C CXX5C CATTTTTGGGCC ATG ACT AT CT ACGTT CACACATG T AT ACATGGGTGTGTACAT AT ATG CACAC ATTC CT AGATATTGTT 

TCACnTATTTTTCTGTGTTGCTGAATATTCTTCGATGGCACTTCGAATGAG 

CCTGACTGATAGAAACAGGGCTTCACTGGGCCCATC 
4 5 GGTCAAGTCCTTGGGAGGAGCATTCTCTGGTGATCCTGGACAGTGGGGAATTGTTG^ 

CCACAGTCAGTAAAG CAGGG AAAATGCATTACTAAAGTCAGCTT CATT A 

CCACAGAAATGGAAAGCTCTCTGGCCACTACTGTCCTGCCCAX^GAGGGG 

TGGATGCACCTCTGCCCCTACACCTGAGAACAAGGCCAGTTTTTGTGTGTGCAC^ 

TTTACCTGCTCAGTGTTCTGAACTCCTTCCTTTTCACTA^ 
50 GATGGGC ATCTAGGG C AAATCTAANNNTtfNNNNNNNNNNNNNNNN^ 

NNNNNNNACCAATTCTTACGAGGTGTCATGGTCAAGO^ 

CCTTGGTGGGCCTCGAGGAAGGGCAGGTGCTGGTCAGGGAGCCGC^TCGCTAGGCAGCA 
TGGGGTGCCTXMCCTGATGCAGGCTGTGCCTGAGGGAGACAGAAGCAGCAGAGTA^ 

CAGGAG CTTCAG AGACAG AAG<5 AGCACCTGCACCCCCTCACCTG CCCTCCAGGGCCTGGCCTGGCCCTCCCAGCTCAACTCCAG 


HUMAN SEQUENCE - mRNA 

CTGAGCCGCATCTGCAATAGCACACrTGCCCGGCCACCTGCTGCCGTGAGCCTTT 
GATGAGGATGTGCACCCCCATTAGGGGGCTGCTCATGGCCCITGCAGTGATGTTTGG^ 
GGATCACCTGGGAGCACAGAGAGGTGCACCTGGTGCAGTTTCATGAGCCAGACA 
6 0 GACAAGGACACCTTGTACATAGGTGCCCtX5GAGGCGGTCTTCGCTGTGAACGCACTCAA(^TCT 

TTGGAAGGTCTCAGAAGACAAAAAAGCAAAATGTGCAGAAAAGGGGAAATCAAAACAGACAGAGTGC 
TGCAGCCACTCAGCGCCACTTCCCTTTACGTGTGTGGGACCA^ 

AAGTTTCTGGGGAAAAATGAAGATGGCAAAGGAAGATGTCCCTTTC^CCCAGCACACA 
ACITTATTCGGGGACGTCGTATAATTTTITGGGAAGTGAACC 

6 5 ATGCAATCCCTTGGCTGAACGAGCCrAGTTTCGTGTTTGCTGACGTGATCC 

AGGGT CT ACTT CTT CTTCACGGAGG TGTCTGTGGAG T ATGAGTTTGTGTTCAGGGTGCTGATCC CACGG ATAGCAAG AGTGT G CAA 
GGGGGACCAGGGCGGCCTGAGGACCITGCAGAAGAAATGGACCTCCTrCCT 
GCTTGGTCTTCAATGTGCTGCXSGGATGTCTTCGTGCTCAGGTCCCC^ 
C^CTGAACAACGTGGGGCTGTCGGCAGTGTGCXSCCTACAACCTGTCCACAG 

7 0 GAGCACCACAGTGGAGCAGTCCCACACCAAGTGGGTGCGCTA 

GCX1AGGCACGGGCCGCCAACTACACCAGCTCCTTGAATTTC 

GACTCGGTAACCCCAATAGACAACAGGCCCAGGTTAATCAAGAAAGATGTGAACTACACCCAGATCGTGGTGGA 
CCTGGATGGG ACTCT CT ATGATGTCATG TTTGTCAG CACAG AC CGGGGAG CT CTG CACAAAGCCATCAG C CTCGAG CACG CTGTT C 
ACATCATCGAGGAGACCCAGCTCTTCCAGGACTTTGAGCCAGTCCAGACCCTGCTG 
7 5 TATGCTGGCTCT AACTCGGG CGTGGTCCAGGC CCCG CTGGCCITCTGTGGGAAGCACGGCACCTGCGAGG ACTGTGTGCTG^ 






WO 03/008583 



PCT/US01/51291



GGACCCCTACTGCGCCTGGAGCCCGCCCACAG CG ACCTGCGTGGCTCTGCACCAGACCGAiGA.GCCCCAG CAGGGGTTTGATTCAGG 
AGATGAGCGGCGATGCTTCTGTGTGCCCGGATAAAAGTAAAGGAAGTTACCGGCAGCATlTrrT 
CTGAAATGCTCCCAAAAATCCAACCTGGCCCGGGTCTTTTG 
TCTTATGGGCAGAAAAAACTTGCTCATCTTCAACTTGTCAGAAGGAGACA^ 
5 AGAAOUIAACGGTCTTCCAAGTGGTCGCCAAGCACGTCCTGGAAGTG 
GTTGTTCAGACAGAAGGTAGTAGGATTGCCACCAAAGTGTC 

TCCCCCAGCTCCACTCGGAGAAAACCATGTATCTTAAGTCCAGCGACAACCGCCTCCTCATG I X I U CTTTGTT 

CTCTTCCTCTGCCTCTTTTTCTACAACTGCTATAAGGGATACCT 
1 0 GAAGAAGAAGCCCAAGTCAGATTTCTGTGACCGTGA^ 

ATGGGGAG CAC CCCAAG CCAG C CCTGGACACCGGCT ATG AGAC CG AG CAAGACAC CAT CACCAGCAAAGTC C CCACGGAT AGGGAG 

GACTCACAGAGGATCGACGACCTITTCTGCCAGGGACAAGCCCTTTGACGTCA^ 

TGGAGACTGAGGCCGGCTGTGCATCCCCGCTGGTGCCTCGGCTGCGAC^ 

GTATCCGAGTCTCGTGCAGTGCTGCGTAGGTTAGCCCGCATCGTGCAGACAACCTCAGTCCTCTrc 
1 5 GCCTGTGACTTGGTTTCTCTTTGTCCTTTTGGA 

TGAAGAAGGCCCACGGGCGGCACGGAGTTCCTGAGCCCTITTCTG 

CATCCCTTCAGCTTCCCCTCCCCGTAGCAGCCACTAAAAGATTATTT 

GTAACTTATCGCCTGTTGTCCAGATTGGCACGAACCTTTTCTTCCACTTAAT^ 

GTCATGGGTCATTTTTTTTTAGTTACAGAAGCAGTTGTGT^ 
2 0 TGGCATAAAATACGGCTTACGTTGCTTAAGATTCTCAGGGATAAACT^ 

AGACATAAACACTCCCCGGAGCCCACTCACCTTTITT 

TTGGAGAGATTTTCTrTCTGTACTTCGTTTTACTT^ 

ACGAGGTGGGTTTTGAGCAAGGGAAGGTAGCCTGGATGAGCTGAGTGGAGCCA 
AATCTGGAGCCCACAGCTGTCGGTCAGAACCTCCTGTGAGACAGATGGAACCTTCACAAGG^ 

2 5 CTTTCTCTTCrTGCTTCAATTGCTTACCCACrrGCCT 

CTGGGCCTCGGTGGCCGGGGCCG CGAG CTGTGACCACAGCACCCTCAAGCATACGGCGCTGTTCCTGCCACTGTCCTGAAG ATGTG 
AATGGGTGGTACGATTTCAACACTGGTTAATTTCACACTCCATCTCCCCGCTTO 
ATGGTGAAGAGCAATAAACTCTGGATGTTTGTGCGCGT^ 
GTAAACATTTGTGTTTTATAAGATTTACCTTGTTTTTATT^ 
30 GAAG CTTCT AT C CTT AAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGAGGATGTGCACCCCC^TTAGGGGGCrGCTCATGGCCCTTGCAGTGATGTTTGGGACAGCGAT 
GATCACCTGGGAGCACAGAGAGGTGCACCTGGTGCAGTTTCATGAGCCAGACATCTA 

3 5 ACAAGG ACACCTTGT ACAT AGGTG C CCGGG AGG CGGTCTTCG CTGTG AACG CACTCAA CAT CT CCG AG AAG CAG CATG AG G TGT AT 

TGGAAGGTCTCAGAAGACAAAAAAGCAAAATGTGCAGAAAAGGGGAAATCAAAAC^ 
GCAGCCACTCAGCGCCACTTCCCTTTACGTGTGTGGGACCAACGCATTCCAGCCGGCCTGTGACCACCT 
AGTTTCTGGGGAAAAATGAAGATGGCAAAGGAAGATGTCCCTTTGACCCAGCACACAGCT 
CTrTATTCGGGGACGTCGTATAATTTrTTGGGAAGTGAACCCATC^TCTCCCG 

4 0 TGCAATCCCTTGGCTGAACG AG CCTAGTTTCGTGTTTGCTGACGTGATC CG AAAAAGCCCAG AC AGCCC CGACGGCGAGG ATGACA 

GGGTCTA O 'TCT'l t-'l rC^CGGAGGTGTCTGTGGAGTATGAGTTTGTGTTCAGGGTGCTGATCCCACGGATAGCAAGAGTGTGCAAG 
GGGGACCAGGGCGGCCTGAGGACCTTGCAGAAGAAATGGACCTCCT^CCT 

AGCTGAACAACGTGGGGCTGTCGGCAGTGTGCGCCTACAACCrc 

4 5 AGCACCACAGTGGAGCAGTCCCACACCAAGTGGGTGCGCTATAATGGCCCGGTACCCAAGCCGCGGCCTGGAGCGTGCATCGACAG 

CGAGGCACGGGCCGCCAACTAC^CCAGCTCCrrTGAATTTGCCAGACA^ 

ACTCGGTAACCCCAATAGACAACAGGCCCAGGTTAATCAAGAAAGATGTGAACTACACCCAGATCGTGK3TG 

CTGGATGGG ACTGTCTATGATGTC ATGTTTGTCAGCACAGACCGGGG AG CTCTGCAC AAAGCCATCAGCCTCGAG CACG CTGTTCA 
CATC^TCGAGGAGACCCAGCTCTTCCAGGACTTTGAGCGAGTC 

5 0 ATGCTGGCTCTAACTCGGGCGTGGTCCAGGCCCCGCTGGCCrrCTGTGGGAAGCACGGCACCT 

GACCCCTACTGCGCCTGGAGCCCGCCCACAGCGACCTGCGTGGCTCT 

GATGAGCGGCGATGCTT CTGTGTGCCCGGATAAAAGTAAAGG AAGTT ACCGGCAG CATT I'TTT CAAG CACGGTGGCACAGCGGAAC 

TGAAATGCTCCCAAAAATCCAACCTGGCCCGGGTCTTTTGGAAGTTCCAGAATG 

CTTATGGGCAGAAAAAACTTGCTCATCTTCAACTTGTCAGAAG 

5 5 GAACAAAACGGTCITCCAAGTGGTCGCCAAGCACGTCCT 

TTGTTCAGACAGAAGGTAGTAGGATTGCCACCAAAGTGTTGGTGGCATCCACC 

GCCACCTCCTCCGGGGCCATCACCCTTCCTCCCAAGCCTGC^CCCACCGGCACATCCTGCGAACCAAAGAT 

CCCCCAGCTCCACTCGGAGAAAACCATGTATCTTAAGTCCAGCGACAACCGCCTCCTCATGTCCCT 

TCTTCCTCnSCCTCTTTTTCTACAACrGCTATAAGGGATACCrc 

6 0 AAGAAGAAGCCCAAGTCAGATTTCTGTGACCGTXIAGCAGAGCCT 

TGGGG AGCACCCCAAGCCAGCCCTGGACACCGGCT ATGAG ACCGAG C AAGACACCATCAC CAGCAAAGTCCCCACGG AT AGGG AGG 

ACTCACAGAGGATCGACGACCTTTCTGCCAGGGACAAGCCCTTTGACGTCAAGTGTGAGCTC 

GGAGACTGA 
Table 41

MOUSE NOMENCLATURE 
ICSGNM Mdm2 
Celera mCG33 9* 


HUMAN NOMENCLATURE 
HGNC MDM2 
Celera hCG19697 

MOUSE SEQUENCE - GENOMIC

AGGGJVGGCAGAGCAGGTGAATCTCTGAGTTCAAGCACAGCCTGGTC 

AAAAACAAATATGTATCAATTTAGAGAGGGTCTAACATAACTAGTGTGGTTTTC 

AAAGTGAGAAACTAAAGGCAGCCTAGAGGAAAATGTTTTGCACTCTGTGTT 

AG AGCGTGTT AG AG AAATGTCTTTG CG AAGCTGTG TGGGTTTTGTT AG CCAAGCAGTGCGGTG ATGGG C CAT AAG CCAAACAGT CG 
1 5 TGTCAGGACAGTAATGGATCTATTCAGCAAAGCAAGTCACACTACT 

TGTTTACTCCTAGCATTCAGGTGAC AG AGGTAGGT AGAT CT CTGTGAG CCTG AGG CT AGC CTG CT CTACATGGCGAACTC CAAGGC 

AGCCAAGGTTTTTGTAAGTAAGACTCTATCTCAAAATACAAAATAATO 

GAGAATCCTTGCTCACTGAAACAGCTTTCAGCATAAGA^ 

ACACATACCTTGTCATATTTACGAGTGTGATGAACTTTTAGCCCGTGTCACATACAC^ 

2 0 T AGATCACATTACAGTCACG ACTCATAG TTTGTTTTTTT AAAATGCT CTT AAAAATTCG ACTAAAACAAAAATAAAATAAAATAAA 

ACAAAACAAAGT CAACGCAAG CTTT CCTTTCCT AAAGTCTCTGCTG TG CC CG CAG TTT AGACAGAGTTT CTGTCTTCTG ACTGCGC 
TCACTCCAC^CCAGCCAAGTCCTGGTCCCAGTGAACCAATGTrcrGTCT 
TAGACTAGCAAACTACATTCCTTTCAGCCTTAGGTCrrGATGGAGGAAAT 
CACACAGAACTTTACTGAACCTTTAATAAAAACATAGGACTAAAAAT^ 
25 ATTTATCTTATGTGTGTGTACGTGTATGTGCGCATATC 

GTACGTGTATGTGTGCATATGTATATGTGTGTGTGTGTGTGTGTGTATTTCT 
C^ATATGTCTGTATGTGTTTGTGTGTATGTATGTGTC 

TATGTATGTGTATGTCTATATGTGTTCCTGTGTGTGTATGTGTGTTTGTGTGTATGTGCATGTGOVTGTGTG 
CAAAGCAATCATGTGGCAGTCAGAGGAAAACXTGCAAGAGTC 

3 0 TGTCAGGTTTGGCAACAAGTACCTTTACTTACTAAGCCATTGTGCT 

GGCAAACTAAATGATACCTGGAATAGTCAGATGTCCGGGCACCTTAAATTAG 
TTTTGGTGCCTCTCTCTGGCn'CATAGCAATCCTTTGCCACATATTTAT^ 
GCAGGCTAGCTTCC^ACCTTCAGCAGTTAAGAATCCTAAGTAGTGATTTAACT 
GGCTGGTGCTGTTTGGGAAGTGTTTAGGAGGTACGGCCTTGCTGGGGGAAGTCCATCT 

3 5 TAGCCCCACCCTACTCTCTCTTGGCTCTCTCTGCT^ 

TGTCTTCCCAACCAGTGTGGACAGCCAGCCCTCrGGAAATGTTAAGCTAAAA 
TTATCACAGAAAAAACAAAGTAGAAATTAATACACTGATCCTCCTGCGTC^^ 
CATATC^GCTCATCTTTATGAATCTCTGATTTTTGTTTTCAGTAC^ 
CrCCCATTGGGCTGTGTTTCCAACTCCATATTTTTTGTTATTGTTGTTGT^ 

4 0 ATAGCCCTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGC 

ATTAAAGGC^TGCGCCACCACGCCCAGCTCCAACCCCATATTATTAACCCCACTC 

TTC^TTTGACTTTTCTATAATGTTTTATAAGAAGCTGCATCATTTCT 

CATAGATCTTGGTGGCAGAAGGTCTTGTCCAAAGACTGCGAGTCACACCT 

ATCCAGCCCAACATCTGTATGCTGACATCTAATCAGTCAACATGCTGGGACATCTTCACCAATAAG 

4 5 TCTTTTAAATAAATGTAAAAGATGGATATTT\AAGGGAGGAGGGGCAGGAGAG 

CAACAGTCCTGAGTTTAATTCCCAGCAACCACATGGTG^CT 

GAAG ACAG CG ATGG TG T ACCCA CAT ACATGAAAT AAATGG AT AAAT AAGT AGG CTGGG ATG CAGCCCCGGT GAT AG ATGC CTG C CT 

ACTGTGCACGAAGCCCTGGGCTCATTCCCTCGCTACACATAAAGTCTGTG^ 

CTCACCGTTGTCCTCAGCTACACGGCAAGCTTGAGGCTGGTCTGGGC^ 

5 0 ACTTTTCAATTAAATAAACATGCAGGGAAAAAAGTCAAA^ 

GTGACT AG AAAC AT CTCT C CT CTTG T ATAAGGG C CCAAGTCGAGCA(^CACACACACACACACACAC!ACACACAC AC CCTTG AAAA 

TAATAATGGTATO^TGGTGACACACACCTTTAATCCCAGCAGTCGGGATGCAGAGGCAGG 

GGCTTTCACAGTGAGTTCCAAGATAACCAGAATTACATGGTAATAGCCTGTCTCAAAAAA 

TATATATGTATATGTGTATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTGTGTGTGTGTGTGTGTGTATGTATGTT 
55 TGTCTTCACAGTGCCCTCAATCCACATGTACTTC^CAAGAATGTGTGTGGTGCCGTGCTTG 
ACCGTGTGCAGGGGCCAGCCAACCTAACAC^CAAGTTTACTAGCTTCAAGCTGCT^ 

CCTTTTTTCCCAGC ACTCTGGAGGCAG AGGCAGACAGAT CTCTGTGAGTT 

NNNNNNNNNNNNNNNNNNNNNNNNNNW 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ CGCTACCATTTAACCACCCCT 
CCAGCTCCTTGTCCATTTGGTCTTTT T T riT lTlTl TTT ^^^ 

6 5 TGGCTGTCCTGGAACTCACTTTGTAGAACCAGGCTGGCCTCGAACTCAAAAA^ 

AAGGCGTGCXSCCACCACGCCTGGCTCATTTGGTCTTTTATTCACrA 

TAAAATG CTAGAATT ACATGAATGAACTAATGAAGTTAACTCX5TCGAG AGTTTCTCAACT CCATGGTCCTTT 

TTCrrATTTCCTTCrTATAGCACTGGGATO 

CACTAAGCrATGTTCCCAATTCTTTACTCTGTCAGCAGCATG 

7 0 TTGAGACCrcCCTGGGTATACTGGGGCTACATAGTGAGACCCTATAGCAAAAAAAAGGGG 

AAACAGAAAGCAAGTCCTGCATGCAGCACX3CTGTACCACGCCCAGAAAAGTAGAAGGGGTGCT 

G C CACACACT AAACGG CTACATGATGACACATGAG AGATGACACTG ACTGT ACCAT ATC CTGTGACTCTG ACTCACCAAAGG AACA 
TGACTC^TAATGTCTTCatCCAATTGT-TCATC^G 
GTCTGTCTCTCTCTGTCTGTCTCTCrrcrCTCTGTCTCrCT 
7 5 TTTCAAGACAGGGTTrCTCTCTGTAGCCCTGGCTGTCCTGGAACT 
TGCTT CTG CCTGCTGGGATT AT AGGCGTGCAC CACAACTG CCTGGTTCAGT CTGGTCTCTT AAG7VAT CCAGTTTTGTTTTG TTTTG
TTTTT AAGATTTATTT ACTT ATTTT ATGT ATG CAAAT ACAC CAT CATTCTTTT CAGACACAC CAGAAAAGG CCATCAGAT C CCATT 
ATAGATGGTTGTGAGCCACCATGTGGTTGCTAGGATTTGAACTCAGGACCrrCTGGAAGAGCA 

ATCTCTCCAG CCTCCAAGAACCCAGTTTCTTTTCTT CAAATG CCTCATTGCAGTTTGTTATTGGAT AGCAATAG AGAGT CAGTTT C 
5 TCCCTTCCAACACTTCAAGCTTAAAAGATCATATGTACATATGTTGATCAGCCTATCATC 
ACCATGGCCAAATTCCATTTCAAAGTATGTCCTTC 
TTACAGGCCACATAGGCTCTATGTAGTCATAACCAG 

TTAACTTTCAACTATGAGCCGGGCGTGGTGGCATACTCCTTTAATCCCAGCACTCGGGA 
GAGGCCAGCCTGGTCTACAGAGTGAGTTTCAGGACAGCCAGGGCTACACAGAGAAACCCT 
10 AAAAAAAAAAACCAAAAAAACCTTTCAACTATGAAAACCAAACTATGATGAAAACCAAAGGTGAT^ 
AATAATAAATAATATTCATGCCTTTATGCGATGCGTTAGGGTTTAAGCCTTTA 
ATGTCAACAGAGCCTGGTGATGGTGCTCACCTTTGACA^ 
CCAGGATGTCGAGGGATACAAAGGAGAAACTCTGTCTCAGAAAACAAACA^ 

CAGG CGTT ATTTTTTTT AATCACACt^ACTT ACITTTGAAAT CACAT AAGGTC C CC AGGCTTGGGAACAAACCG CT AGTCAGG CTT 
15 TGCTGGGTTTTAGATGGACTCACAACTTTCTAAATCTAAACCTACTAACAAAGAAACT 

CAACCAAAGTCTOTCTCTCTCTCTCTTCCCACTTCCTCCCTCTCCCCT 

TGTGAACTCCCAGGTTCAAGCAATACTCCrrGAGTAGCrAGGAAGACAGGTGTGCACTC 

TTTGTTTGTTTGTTTGTTTTAAAGATTTATTTATTATC 

CAGGTCTTATCACAGATGGTTGTGAGCCATCATGTGGTTGCT^ 
2 0 ACCGCTGAGCCATCTCTCCAGNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNKNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNIWNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
. NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNHNNIWNNNNNNNNN^ 

NNNNNNKNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNJWJNN^ 

NNNNNNNNNmWNNNNNNNNNNNNNNNNNNin^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNN>WNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNN10JNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNJWNN1W 

NNNNNNNNNNNNNNNNNNNNNNNOTJN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNIWNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNKNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNTTAAACGCTGCCCCGTTTCCGCAGCCGTCTGCTGGGCGAGCGGGAGACCGACCG^ 

5 0 TCTCGGATCACCGCGCTTCTCCTGCXXX3CCTCCAGGTAAGGGACAGCTCGC 

GACG'x J i ri CTGTCTCCTTCGTAAATGCATGTATCTATTTGTCCCri i rCGTAGATGTTTTATAATTTCAAGTTTTTATCGTGTGTG 
TTGTTGTTTTTTTTCTACTTGTAGGCCAATGTGCAATACCAA 

CAG CTTCGGAACAAG AGACT CTGGTTGGT ATTTCTGC CTCG AGTGTGT AAATATAGCTTGTTTTACCAG 
ATTGAAACCTGGATTATCGCACAGTCX3TTTTAAGTATTTA 
55 TCACTGCGACCTATATTTCCACAGCCCTTCCCCTGACGCAGGCTITAGAAGTGCACAGTGTGTGGG 
TGTTCTATCACTGTTCTAAATGGTAAACCGAAAGTCATAGAAGCATCCGTGCTTTTTGTGAGGA 
TAAAATATATAAAGTTCCAGTCTTTGCTCTGTATCTCTTTAAGTTTTCTACTTTATATTAGCATATAAAG 
CAGTAAGTTACAAAATTACTTTTAATAGCTGCATAGGATTTGATAATGCGATTGAACCTCAGCCTTTA^ 
GTACTGTACTACTGAGCTGAGCGACAGTCTCAGGCTCACGACTACTGGGGGATGTAAAAGTTGCCCATGTG 

6 0 GGTGAGGAAGATAGGGAACTAGAATCCTACGTCCTGGTAACCACATTTGATTGCAGACTGCTCACTCTGCTCA 

CCGTCTTCGATCCTGCCAGCACGAATATTGCCAGCCCTCGTTTTCCCTTTCTGATGTAATTTC^ 
GTATGCAAGTTAAGTCAGTTCCTACGGTTACTGCTCTGTGGAGTAAAATGTATTGTTTCAGTG^ 
AAGAGCCTGTTTGTTCTGGGTTTGCCAATGTCTCTCTCTATTTACTC^ 
TTTAGTAGCTTGCATAATTTTCAAGTATAGTTAACCTAAGAAATTTTT^ 

6 5 GGT ATCAAATTTGTTT ATTTTGAT AATTTTTGAATTT^ CTT C CATTTCAAG ATT ATG CACAT AT 

TTAAAGTATTTTGATATAGCATTTTAAAAACCTTTTCCGGGTAGATCnT 
GTTGTGTTTTTATAGTAGTTTTTAAAATTAACTTTATCTTGTAAGCTTTGATAAGA 
GAGTTAATTTAATATGCAAAAATGTAGAACTTCTTGCTTTG^ 
GACTTCrATATGAGTTACATAAGAAAATTAAGCATTAGTAAG 

7 0 CTTGGGAGGGAGAGGCAGGCAGATCTCTGAGTTCGAGGCCAGCCTGGTCTACAA^ 

GAAACCCTGT CTAG AAAAAAC AAAGAAACAAAAAGTCTTTTTCA^ 
AATACCCTTITTTAAAAAACTGATTTCrATAGGCTGGAGAGATGGCT 
AGTTCAACTCCCAGCAACTACATGGTGGCTCACAACTGTCTGATGCCCT 
ACTCATACACAGAATACATAAATAACAGAGTCTTTAA 
7 5 TTCrrCCAAAGCvn"CCAGTATACCCCTTCTCTCTTTCW 
GATACAGCCTGCTCAGTCTGAATAATGCCATTTGTGTATGTATGTTTTCAGGGCT
GCTCTTCTGGAATAAATAACTGAAAAGTGCATTTTAAAATTGTGCTAAAAG 
TGGAGAGAGGGCTCAGCAGTTAAGTGCATTTGCTGTTGTTACAGAGGACCCAGGTTGGA 
CCTCTTGTAACTCTAGTTGCAGAGGATCTGATGGCCTCTTTTGGCCT 
5 AGGTGAAATACCAGTATACATAAAATAATAATAAAAGTCTTCAAATTACCA^ 
CTCAAACTAAAACAGCTGAGTTTGCAGATACAGAGAGCATAGTCATGT^ 
CTTAGTCAGCrCAGTGTACTTACTGTCCTTGAAGCAC^ 

TCCTATTTGTCCTCAGTACTCTGCCATGAGGCCACATTCTAGCCATGTAGACAG 

AACAATAGAGCCATACAAAATAACTGTCCTGACAGAGACAGCCGGCCAAGCAGCCAGGAQ 
1 0 TGGCTTCCACTCTCAGTACCCACATGGTGGCTCCT 

GGGACCAGGGACACAAGTGGT GC ATGGAAGT ACATG CAGGCAAAACATGCATATG CATGGAACAG ACAGAAAGCCAG CAGG CT CTC 

CGTGCTGGGGATCCACTAGCTATAGAACACGAGTGAACAGGAGCCTTGATCATCAAAGCTGACCT 

CTGTGGCCGATGGTAACATACTTTGATATCAGACTGACGTGGCTAGAGCAAGCT 

CGTTGTCAGGCTGCCAGGCTGGGCTCACTAGCTGCTTTGTGGGTGACACCriTG 
1 5 TTTAAGGATT CATrrCXI* ATTTT AT ATAT AGGAATGTTrrG C CTGCATGTGT AT ATGTGT AC CATGTGGATGTCTGG TGG CT AT AG 

AAACCAGAAGAGGGTGTCGGATCCCTTGGAACCAGAGTTAGATAGTTTTGATCTATCATATAG 

CTCTTATAAGAGCAACAAGTGGTCITAACCTCTAAGCCTC^^ 

AAATTGTAGCTGGCTACTTGTTTTTATAAATAAAGCTTAATGTO 

2 0 TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGAGAGAGA^ 

GCATGTATTTT AAGTG AT AAAT AAATG ATGCAT AAACCACAT AT AAAG AAAACAATG AGTT^ CTG AGGG CTGTGATTGT ACTC 
TTTGAATTG TTCATTAC CTTG TGACTTTGGTTTCT TTAG CAGT CAGG AATGTGG T AT ATGAT AGATTGGTGAT AGGTTTTG C C CT A 
AGATATAGTCTATTCTCCTAC^TTCTATAAGCTCCCTTTCACT 
CTAGGTTGTGTGGTGGCAAGCAGATGATGGAGCTGTAGGAAGGAGTCAGTGCT 

2 5 GTCTGTGTGTATTTGTAGCTCCTGCCTGCAGAGACTAC 

CTAGGAATCTAGAATTGGGTCCCCTGCAAAGCAGCAAGTGCnTTTAAT 
TCAGAACAAATTGAGTTTGATTATTTGACATGACACCAGGAT^ 

TTTCTGTCCTAACGCTT AGTGGTGTGCACTTGT ACTTAGTGTAAAACC TG CAGCTCGGTAACAAATTGTTATA 

AGCACGG CGTTTGCTG C CTG T ATTGGG CTTCTTTCTC^G CCAT ACAT AAAGGG CTTGTG CTGT AG AG T ATGTTCATGGTGT AACAT C 

3 0 AGCTCTTTTTCTTTTCTTTTACAGGTTAGACCAAA^ 

CACTATGAAAGAGGTGAGGAGAACTGAGGGGTGGAGAGCCGATGGCITCATGGT^ 
TTGAGTGAGATTGGCCATAGAGATTTTGTAATGAGGAAGGACATTAAGTTTT^ 
TCTAATTTACATGAATTTCTTCCTCCTTTCCTTTCTTTG 
GTACACTGTACCTGTCTTCAGAGACTCCAGAAGAGGGCATCAGATCTCATT^ 

3 5 TTTGAACCCAAGACCTTCAGAAGAGCAATCAGTACTCT 

CTGACTGTCCTGGAACTCACTATGTAGACCAGCTGGCCTCAGACTC^ 
CTCTTTTTNNNNNNNNNNNNNIW^ 
NNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNftlNNNNNNN^ 

4 0 NNNNNNNNNNNNNNNNNNNNNNIWN^ 

TCATAGAGACAATGTTGTATTCTCTGCCITAACGTTT^ 

GTTTTGCATGG CTG AG AT AT AAGTGTTTCT CTGG CT CACTT AGG TT AAGG C" I'lGTlTACTT AAAACACT CAAGTTCTTG TGGCAG G 

TTAGGGGCTTCTTCTCAGCTACACATTTGGAAACGTCT 

ACTGTCAATACTTTGTGATCrATAGACATGTAAAAATTGCCT 

4 5 GTGTTAAGTGATGTTTTTAAGGTCATTAACTTTGGATTGACCTTTATAAAGGTTAG 

TAGGTAGATATGAGAAAATCGATGAAGGCCATCAAAAAGATTTAGTGCTTATGCTAAGCTGTGTGCAGT 
GCTCTCTGCGGGGAGTTGGCTTCAGGACACTGCCTGCGAAGTCTC^ 
TTAGACATTAAGTTATCCCTAGAGTATCTGTAATACACAATGTGTGTAGT^ 
AAAACATTTCTACATTCAGX3ATGGATGCCAGGAATATTATTAGACCCACACT 

5 0 TTTATAGTGTCTGATACCAGTTTTCAGTTACAAGATCTTTATTCT^ 

CGCTGGGATAGGTGTGATGTCCTGGGTTAGTTTCTCATCTCTGTTTATGTTTTTA 
AAGGGAGGACATTITACCTACTGAACGACACTCGTGACCTTGTTTTCATACAGG 
GACTGTACTGACAAACCTGTGGACTGTAAAATGCTGTCTAGAGTTACATTGT^ 
CTTAAGCTGCCATAATGGAGCACATGGAATATAATAATAGTGATGGTTCX3GGCCCTGGACTGTTT 

5 5 TG AG AATATTC CATTGGGAGATGTTACTCATGATGTCCTGTCTT CCTCTGTG ATG AC CTCAG AGCTGTAGAAGTATACGTTTGTGA 

CATGTCACTATTCTAATAC^TTGCTGTATriTACCTTACTCTi 1 1 1 TTXTATTCAATTITTTrTTAAGATTTATTTATTTATTACA 

TGTAAGTACACTCTAGTTATCTTCAGACACCCCAGAAGAGGGCATCAGA 

AACTCACAACCTTTGGAAGAGCAATCAGTGTTCTTAACCACTGGGCC^ 

CTTTTTTCTTTCCTAGTATCATATGTTAGTATCT 

6 0 CATTTAAGTAATGTGGAGAAACAGTTACTTCTATTTTCATCAGTGTG 

TCTTGTACATGATTATGGGTGTGGCTGTTAGTGTAAAGTGTAACTTGTTCAGTTAAGTGTGGTATTGCT 
ATTCTATGTGATTTCCTTAGTACTTAGAAACCATATTTGTGTTTCAGAT^ 

TTATATGACGAGAAGCAGCAGCACATTGTGTATTGTTCAAATGATCTCCTAGGAGATGTGTTTGGAGT 
GGAGCACAGGTAAGCCTrGTGCGTGGTGGGTTATAAGAACAGCGAGCGGCTGTGCATGGGGCTG 
€ 5 GGACCCAGGTTCAATTCCCAGCGCCCCCATGGCAGCTTACAACTGTTGGTACTGCCAGTTCTATAGG 
ACATACATGCAGGTAAACATAAAAAACATCTTAAAGAAGTACC^ 
ACCATGGATACAAAATGCTAACATTGGGTGTCATTTTTrGAGGCCCTACT 
CTGAACTCTGGAATCTTAGTTTTGATTGTGTGTGTGTGTATGTGCATATGCC^ 
CCTTCCACCATGTGGGTACTGGGGATTGAATGCAGGTTACCAGGCTTGACCT 

7 0 GTCTAAATGGATCTGACAf CTCTCAGCTGTAGGCTCTCTACTCAC^^ 

ATATATGCAATGATCTACAGAAATTTAGTGGCTGTAAGTCAGCAAGGTAGGTAATTTTG 
AACAGTTCTTTTTCATGTAAGATCTTATGTATCCTAGTCTAAC 
AAAAAACCACAAACCTTTGAGTAG C1T1T1G TTC 
GGAAGTCAAAAAACAGCTTGGCGGAATTGGTCT 
7 5 G CACTTTGACCTC CTG AAACAT CTT AC CAG TCCC CTAATCATGTTTT AAAGACTTGGATG 1TG T CTT AG TCAGGGTTTCT ATTCCT 
GCACAAACATCATGACCAAGAAGCAAATTGGGAAGGAAAGGGTTTATTCGGCTTACACTTCCATACT

TGCAGG ACTGGAACTCAAG CAAGTCAG AAAGCAGG AG CTGATG CAGAGG C CATGG AGGGATG TT CTTTACTGACTTG CTTC CC CTG 
GCTTGCTCAGCCTGCTCTCTTAAAGAACTAAGACTACCAGCCCA 
ATATTGAGAAAATGCCTTACAGTTGTATCTCATGGAGGCATTTCCTCAAC^ 
5 AAGTTGACACAAAACTAGCCAGTACAGATGTATACITCATGTTTTAAAGACT^ 

CTTTAACCXriTAG<lA.CTCTAAAAAGGCAAAATATCACTGTGTGCT CTTCTGG CTCTCTTT ATT ATTTTT 

AACTCTTTCATTGCCTACCAATTTTTTTTT AAT AATT A 

TGTGTGCAGTGCCTTTGGAGGACAGAAGAGCCGATCAG^ 

GAATTGAAGCCAGAACTTCTGGTAGAGTAGCCAGGGCTCTTAACCATGAAGCT^ 
1 0 GACCAGATCCCCATAAATT ATCCAGG CT CATTTGGAAC C CAGGCTTCCTG CT CCC CAAGT AG CTG ACGTTTGTTCATTT ATT CAG C 

ACTGCTGAGAGTTGAACTTAGGCTCACACACCCTTGCGTTTAGAAAGTA 

TCTCATTCTCTCTTAAAAAATTAAAGTGACriTCTTTCATTTCACT^ 

ATACATGTGCACCACGCGTGTGCAGTGCCCAGCAAGGTCAGAGGAGGAGCATTGAATCT 

CCTCTGGAAGACCTTCAAGTGCTCTTAAGCACCAGAGTCATCTCTCCAGCCCCTCTTTGGAA 
1 5 GTTACACAGACATTTAATCTCAGCAAATGGGCGGATGGACTTGTCCAG 

AAAGCAAGTTTTAATTATGTGTGCGTCCCTTTATGTGAATATGTGCACATGAGTGCAACTGATC^ 

TCCCTAGAGCTGGAGTTAGGGAGTTTTGAGCTGTGGTATCTAGGAACT 

AGAAAGTACATTTATTCGTGTTTAAGAGCATCCTTATGCC^ 

CACGACCCTGGGTTATCAGGGTTTCTTCTGAAGGACAGGTAATGTTAAAAGCT 
2 0 GTTGAGATGTTTCAGGAAGTAATATAATACCTAGGTAGGACCGTGAAAGAGT<^GGTTTCATCT 

TGTATTCTTGCTCTTGTTTCTTGAGTGGACACTTAGCATTTTTAGGTA 

GACTTX3AGCACAGGTCCTTGTGTGTGCTTCACAAGCATC^ 

TGCAACCTTITTGAGAACCTAGTAAAGGCACTTGATTTTCATTCr 

TGGTTTTTTGAGACAGGGTTTCTCTGTATAGCCCTGGCTGTCCT 

2 5 CTGCCTCCCTCTGCCTCCCAAGAGCTGGGATTAA^ 

TTTCTCTTTTTAAAGACAGGCTCTCACTATTAGCTATG<3 

AGATCCATCCACCTGCCTCTGCCTTCTAGACTTACATACCGCCATGTCTCCTATTAGTTGTATTTGAATACT 
CTGAAACCCCGTAATCCTTACCGTTACCCTTGAACATAGCTACTGTGTTCAGAAGCTTGC^ 

3 0 TAAAAAAAGAACAAAGTAATTAGTTTGGTAACTGA 

CTAAGGCTCTTTATGTCATGCTATACAGTTATGGAATCTGA 
GACAGATTACTGACGATTTCTCTTGATGCTGTGTTTCAGTTTAGTTAAAAGTA^ 

GTT ACTTT ATT ACAT CTGTTTT AT AACT AT ACACAG C CCAAG ACAGAG TT C C CAGGT AG CAT AG C CT ATCTATGT ATT CCCTTTAA 
AGTTTGTATTATGCCATAATACAGGACAGTTCACTC 

3 5 GTTGATGGATGTGCTTTAGGCGGTAGTCTAGGGTATGGATTGTGATTTTTGAGTAATTTTGTCCATTCC 

GTTTTGAGAGCTTGGTCTGTGAAACTAAGTACAAACTACACCTG 
TAAATCGTTCACCTCCCTCACCTCTCTTGGTGAGTGGTACCACACATT 
TTACAGACATCTAGTCTTAACCAGTTTTTATGTCTTAATTTTTC 
GACAACATGACCAAGGCAACTCCTGTTATTAAGAAAAACATIT^^ 

4 0 CTCAAGGCAGTAAGCATGGCAGCATGGAGGCAGATGTGG 

AAGTAAGAGGCAGGTTTCTTCCACATTGGGCGGGGCCrGAG 

ACCAGGT(^CACCTCCTCTAGCCAGGCCACACCCTGGGCCAAGCATATTCAAACCA 

TGTATTGTATGCCGACTAATGCCACANNNNNNNNNNNNN^ 

TCAAATACTGTTTGCITGTTTCTGTGCTCTGTTTGCCXrCCC 

4 5 ACTATGGCTTGACAGTCTCACrGAGTTGCCCAGGCTGGCCTGCTGG 

GTGGTGGTGGGTTTGTTGTTTTTTAAATTCTATTTTGTCAGAGT^ 
TTTGAGACAGGGTTTCTCTGTGTAGCCCTGGCTGTCCrrGGAACTCACTTTGTAGAC 
GCCTCTGCCTTTGCCTTCCAAGTGCTGAGGTTAAAGGCGTGTGCCACCACCACCT 
ATGATTGCTCTCTGGGGAGAATAGTTCTCTGAGCTAGGAAATTAGCTT^ 

5 0 CAAGATGGCCTCAGGCAAGATTATAGAACTGCTTGAATA^ 

GCTCTGTGCACAGCGACCCTGCACTTTACTGTT^ 
TTCATGATTTCTCACATACTAATCCTTCAGTCTC 

ATAGCrCTTCAGAGCCCCTCCCCCACGTCTACTAAACATTTTTGTTTCCCCrGCT 

ATAATT CCTGTTGAATTT AT AT CTTC CAGCATGCAAATGACT CITGAT ACAAATGG TT AAT AG TTGTT AAAAGGTT AG ATTTTC TAT 

5 5 TGGGGATGTGTTAGACTGGTATATGTTGAATTCTTGATTTCTTTGTT^ 

TGGCAC&TCGCTGAGTGAGAGCAGA03TC^GCCTGAAGGTC 
CACTTTTCTCTTTAAAATATCCTGTCTGTAAGTGTGG 

CTGAGTTACAGGTGGATGTGAGCTGCCCCAGACATGAGTGATGGAAACCAAACACACCCCAGC^ 
TCTGAGCC^CCGCTCTAGCTTCATGACACATAGrrcrTGAAAATTGGGAGGGTTACATA 

6 0 AATGTTTATTTTGCAAT ATTTT ACTTT CTTTTATGTACTTAGGA 

TTTAATTTCTAGACTGTCTACCTCATCTAGAAGGAGATCCATTAGTGAGACAGGTACATAGGGACA 
TTGCITGGTGTTTCCATTTTGGGTCAAATGACCCTTTGACAGA 
GATTCATCACATTAGCAAAACTATGCCTATGAAGTAGCMTGAGAATAGTTTTA 
AAAGAGTCACAGTGCCGGGCGATGCTGGCTCTCATCTTTAATCCAGCACTTGGGAGGCAGAG 

6 5 CCAGCCTGGTCTACAGAGTGAGTTCCAGGAC^GCCAG^ 

GTGTTAGGAAAGTTAAGAGCTGCTACCCTAGACTTCTA^ 
AGATCAGGTTGACCITTGCATAGTCTTTTTrrriTTTCTlT-I^ 
TTAACCACTGAGCTGTTTCACCAGCCCAGCCTTATTCTTAAAGCT^ 
GCACTAGGAGGATTAAATGAGTAACTTCTGGAGGTAAC^ 

7 0 AACAGATCCAATGAAACACTGCTGACTTrTGAAACATTAAT^ 

AATAGGTGCTGAAGATATTCCAATCrCAAATCAGTCCCAG 
CAGATGAGCTACCTGGGGAGCGGCACCGGAAGCGCCGCAGGTCCCT 
ATGTGCAGCGGCGGCAGCAGCAGCAGTAGCAGCAGC^CAGC^ 
GAAATTATCITACTTTTTAATATGGAAGTC 
7 5 TGTGGAGTCCCCTGGAACTGGGGTTAGGGATCGTTGGAGCCA 






CTAGTTTTGAGTGCTGCAGATTAAATCCAGGGCrCTGTGCGCACTGGGCAAGTGTTCT 
CAGTCAGTGAGCCAACCTCTCTGCGGATCTCTGCCTATCTAGTT^ 
TATATATATATATATATATATATATATATATATATATATAT^^ 
AACTTAAGTGAAAATGCTTTTAATACAACTGTGTTTTGCTTT^ 
5 TTTATTTACAGGGAACACTATGCCTGATAGTGTTTCAAATAATTTC 
GGOTAGAACTTGAAAGCTCAGCGGTTAAGAGCCCTGACTGCTCTTC 
TCACAACCATCTATAATGGGATCTGGTGCCCTCTC 
TCTTAAAAAAAAAAGACTACTTTTATGTCAAGCTGACTCTATT^ 
GTAGCAAATTGAGAATGCCTGTGTCCTGAATTCn'AGAGCAGTGTATTTTTATC^ 
1 0 GTGTATGATTTGAAGTTGAATCTCTTGTTTGCATTTTTC 

TCACTG TATAGAC CAGG CTGG TTTTGAACCT ACAG AGAT CTTCCAGTCT CTGC CT C CTG AATG AATG CT AGGATT AAAGGCATACA 

CC!ACATGCCTGGCCTGAAGTTGAATCITTTAATCT 

AGTGTCACATGAGGTAATAGCCAATATAGTGTCACATGAGGTAA^ 

CACATG AGGT AAT AG CCAATAT AGTGTCACATG AGG TAAT AG CCAATATAGTGT CACATG AGG T AAT AG CCAAT AT AGTGTCACAT 
1 5 GAGGT AAT AG CCAAT AT AGTG TCACATGAGG T AAT AGCCAATAT AGTGTCA(^ CCAAT AT AGTGTCACATGAGGT 

AAT AGCCAATAT AGTGTCACATG AGGT AAT AGC CAAT AT AGTGT CACATG AGGT AAT AGCCAAT AT AGTGT CACATGAGG T AATAG 

CCAAT AT AGTGTCACATG AGGCC C CAGG C CAACT AAGG CTG CATCGGAAAACCC AG ATG CCAG CTTGGTGTTTGTG TGTCGTCACT 

AGCACTCCGTCACACTAC^CTGTGAGTCAGGGAGGCCTTCGTCTTCXriGTC 

ATAGTGGGTGTGCATATAATCACACACGTAGGAAAGGTCATTGTGGAA 
2 0 AGATGAGTGGGGCTTAGAAACTCAGATTATCTTACTGAGAAAAAGTTAGTC^ 

CATGGGT ATTG AAT AGGG TGG AGGGTGTG TTTAG CTGG ACATTTGAAAATGTT AATG CAG T CATT CT G AGTGTAAC CT CGGGGCTA 

AGGGGCTCIGACTTTTCCTTCATGAGGTAGGACTTTTGA^ 

C^AAATAGTGGCTCGGGATGAACCCGGGAGCTCTAAATGAGGCCCAGAGGCAATC 
GATCTTGACG ATGG CGTAAGTGAG CATTCTG^TG ATTGCCIGGATCAGGATTCAGTTTCrG 

2 5 TGAGTCTCTGGACTCGGAAG ATT ACAG C CTG AGTGACGAAGGGCACGAGCTCT 

GTCAGCCTTTTAGTAAGATGAAGAGGCCATCrrATTCAGAGTCTCT 
TTTGTGGATTTGAGGTGAAATTTTATAGTTTCTTTGCTTGATTT^ 
CATGTATGTCTGCAGGCCAGAAGAGGGCACTAGATATGGTTACTAGCCACCATGT 
AAGAACAGACAGTGCrrCTTAACCTCTC^GTCATCTCTCCAGCCCTTTGCTTG 

3 0 TTAGAGGAGAGTTTGAAt^AATGA'rr'ri'AAACrrrnTlGAGTTGATGGGGTC^ 

GTT riT TTTTTTTGTTTTTGTTTTTGTTT^ 

GGAACTCACTTTGTAGACCAGGCTGGCCTCGAACTGAGAAATCC^ 

CACCACGCCCGGCGAGACTTGGTTTTTCTGTATGGCCCTAGCTGTCCTGGA^ 

CTGCCTTTG CTTCCCAAGTCCTGGJ^TTAAAGGTGTG CTCTGCT AATGATGGATGAACATTCTCAT AATGG CTGG 

35 ACCAC^AAGTTCTCCATTTGGTCTTCCCATTTATGTATGTATGTGTGTGTGTGTATTT 
CCCTGGCIGACCCTGAACTGGCTATCTAGACCAGCCTGGCCT 
ATGAAAGGTGTGAGCCCCTACITCrrGCCCCTCCCTTCrrCAGTACTGATTTGTTTTT 
CCAATAGGACCCGAGTATCTCTTTCTCGGGAGCAACTCTACATTTCTAGTT^ 
CTTCTTGAGTGTTTTGTTTGGTTTTAAGTATCTAACAACTCTTGCCATTGCGAG 

4 0 CCTTGAACTGCATTTAACTTTCTTTCAGAACTCT^ 

TGGCTTCTTGGTTGAAGGGTTGAATTGATGCTAATGAATTTGTCTTACTAGGTCTATC 
CGATACAGACTCTTTTGAAGGAGATCCTGAGATTTCCTTAGCTGT^ 

ATCAGAATTAGGACTATAGGTAGTTGTGAGCCACCACGTGTGTGCrCAGAACCTGAGCCATCT 

4 5 ATATTCCTGATGTTCTTTGCCTATTTATGTAAAATATGAAATGCTGAAAA 

GAAGTTATTAGCCTTGCTATGAACACTGACTGAAGAGTTACTGTCT 

AAATGAATCCTCCCCTTCCATCACACTGCAAAAGATGCTGGACCCTTCGTGAGAACTG 

GTGGAAATCTCTGAAAAAGCCAAACTGGAAAACTCAGCT 

GAATGATGCTAAAGAGCCATGTGCTGAGGAGGACAGCGAGGAGAAGGCCGAACAGACGCCCCTGTCC^ 

5 0 CCCAACCATCGACTTCCIAGCAGCATTGTTrATAGCAGCCAAGAAAGCGTGAAAGA 

AGTGTGGAATCTAGCTTCTCCCTGAATGCCATCGAACCATGTGTGATCTGCCAGGGGCGGCCTA 
CAAGACTGGACACCTCATGTCATGTTTCACGTGTGCAAAGAAGCTAA 

T C CAAATGATTGTG CT AACTTACTT CAACT AG CTG AC CTGCTCACAAAAAT AG AATTTT AT ATTTCT AACT AT ATGACC CC CAAAT 
T AG AC AACATGGGT ATTATTTTT AT ACATT AAAGCCAC^AAAACTGTCT^ T C CACAT AAAATT C ACTT AT AATTT AT C CTGGAGA 

5 5 GTAAATATGGTGAATATr r i V nCCTTTTrAGGGAAA 

TTTATATTTTCCTTATATTTTAAATAATCTCCGCITGGAAGGACrTTGG 
GATGGTGGACCTTCCTCATCAAGGGCTACAGAAGTACTTG 

GTCTTTCATGTAAAGAGTTAAAGACTATGTGAAGGATTGTATATTTAAGTTATTGAAATT 
TGAGTTGTGGGCTGCAGAGAAGACTCAGCCAGTAAAGGCACCTGCTC 

6 0 AGGAAGAGTGAACCAGTTCCACAAAGTTGTTCCCTGATCTTCAAATGGATGC^ 

ACACACACA(^CACACACACACACACACACACACAGTTTTAAAGGC^ 
TTATTTTCCAGCATTTTCAGCITTTTGTCATAGGGGTG 

AGAAGCGCCTTTTTGCCTCAGCTCCGTGGTTCCTGGTGGTTGCCTATGTTTCAGGCCT 
CATTCAGAAAGACTGAATCAGAACATGGATAAAGTGAACTCATTCrAAGATGACT 

6 5 ATAATAGGCCTCTTCCCTTTGATTGAAGGGTCACGTCTAAGTATAGAAAACATAAAACTGTAAGGTAGAG^ 

TTGTATTAATGTTGCGTTAAAGCTTCAGAGACAAGAACAAGAACACTCCT 
CGGCAGCCTGGGCAGCTTCAGTCCCGATTTACAATAAAGTACCT^ 
GATGTTATTTATTTGCAAATAAATGGTTTATTGAAGAATTGTGAAAGAGATTT 
TTAAAGATTTGTTCATTATATGTGAGTACACTGTAGCTGTCTTC^ 

7 0 GTGACCCACCATGTGGTGGCTGGGAGTTGAACTCAGGACCTTTGGAAGAGCAGTCAGTGCT 

CCCCCATCAGTGATCTTTTTATAATTCATTTTTATTTTATGTTCATTGGTGTTATC 
TC^AACTGGAGTTACAGACAGGTGTGAGCTGCCCTGTGGGTGCTGGGAAATTGATC 
GTATGAATAATT03TCCACATGTATATGTATTGTGTGCATACACTGCCTTGAACTACATG 
AGGGCATCAGATCCTTAGGACCCGAGTTCCAGGCTAGGAGCCTAACCCTGTCCTCTGTAAGAGCA 
7 5 GCTGTCTACATGCCGCCTTTTTTTTTTTTTTTAAG^ 
AGCCTTCATATGGTTGTTGGGAATTGAATTTTAGGACCTCrc

CTG T AG CTGT CTT ACT ATAAGAGGG CATCAAGTCTCACT ACAGGTGGTTGTG AG CCAC CATGT AG TTG CTGGGATTTGAACT CAG A 
ATTTCTGGGAG AGCAGCCAG TGTTCTT AACCACTCAG CT ATCTCTCCAGC C CCT AAT AGCT ACAAATTT AAATACTTT AGG CAATT 
TTGCCTAGGGATTTCCCAATATATTATATTGCAATGGATTTTCTTTTGAGTCGG^ 
5 TACTTGTGTGGAGGAGATGGCTCAGTGCTTGTCTTC 

ACCACAGCATGAG AGATG ACAGGCAG ACC C ACAGGCGAATCAT CTGGAAG CTTGCTGGTCAG CTGGCTTG CAGT ATG CATCATGAC 

AGAAACAAGAGGCrTACCACAAGGTGGAAGGCCGGAGACTGTTAGGTTTTG 

GCTGGGAAATTCTCACTTAAACCCAGATGATGCTCTGTAATCTTGTCCCCCAAGTTATC 

GCCTGT AGCTGTGTGGAAGAG AG ACAGGTGGGGTTTGGGGGT CTGAGG CAGGAAG AAGAAAGAGGTGGAG AGAGG AGAG TGT AGGT 
1 0 GAGTCTGGAAGAAGTCAGTGCTCTTAATCCCTGAG CCATCTTTCTAGCCCCATAT AG QVGTCTTTTCTA^TTTQTAG ATTTTTGTT 

ATGTTTGCAAACJVAAACAGTAGTGGAAAGAGACAACATTCACTATATC 

AAGCTCAGAGGTAGTCATGGTCCCTCCCAAGCAAACGTTGTAATCTTA^ 

GGCTGTCTTGGAACTCACTGTGTAGACCAGGCAGGCTGGATTTGAACTCAGAGTCCCCT 

TAAAGGCGTGGTCCAGCACTGCCACGACTCAACCTGGGCACC^ 
1 5 AGAAGCTTGTTAAACAAAAACCCGCATGAAGTCTCTCGCACAGAGCTC 

TGTGAGACACAGTACAGCATCTTATTAGTGATGCAAGTCTAGAACAAACTCGGTGAAGA^ 

GCTCTTCACCTAATCCAGCATCATTCTCTCCCGTACCTGAGGTC^CCAGGAAGGCAGAATTGAGCT 

CCCCTCCGAGACTGAGAATGAAACTCCTGTTGCAATAGGAAAATAAGTGTGCTATATGGCriTTC 

ACGTCTCAACCCAGCTTCAGCTCATACATTGGAAAGCAGATCTGAT^ 
2 0 GGACTGTTTTTGCTTGTTTGAAACGGGGGTTTTTCTGCGTAGCCCT 

CACAGAGATCCACCTGCCTCTGCTGCCTGGGTGCTAGGATTTTACCATCTCAAT^ 

TTGTGAAAGTTT CCTTG AAAAGGACGTGGTTTTTGG TAACCAG CATTTCCG TTTT CAGG TCACCTTT AGT AAGGT AT ATTG CTTTC 

GGGGTAGTTAATGGAGATTAGACCCAAGCCTTTC 

2 5 TTTTTACACTGGGTTTTGATATATATTTCTGATTTAAGTAATC 

TCCAGCGT CT ACGGCAGGCAGAT C CAAAG CCAGTTTGG CTTACT ATG AG TT CCAAGTG C AGGCT ACACAG CTTAGTGAAATG CTGT 

GTCTGAACAAAAAGTAACAAAATAAAGACGTAGCTGTGGCTCAATGGCC 

GGCACTTGOIAGTTGTTTCAGTACCAAGTCTCCGGTTAGGC 

AGGAAAGGGT CAACCAGT CAG AACT CAGAACACCAG CTG ATGG CGAACCAAGTTTG CTCTT ACAT AT CTT ACTGG TTCT 

3 0 TAAGATGGCTAGTCCAGGCTAGCCTTGAGCTGTGTCGATCTTGCCT 

TGGCTTCGTCTTATATATATA<nTCAAATGTAATATACCCAGATCCCTTTTTGAACT 
AATGCTTTATTTAAATAAAATGATCAAGATACTCATGCTATTTCCAAATTT 
AAAATATAGATTAGGTTTGGCACCAAGAAAGGGCACCAGGGTGCCCACGTATCTCTT^ 
ACCCCGCATGTTCTTACTTCCTCTGGGAAATGCATGTC^^ 

3 5 TTTATCCTTTTTTTTTTCTTTTAATTTATTTACTTTTGAGACAGG^ 

CTGAGTGTTGGGATTAAAGGTGTGCGCCAACACGCCCAGCTTTCT 
CTITACTTCGAACTGTAAGAATTTGTTTTCrGGTCrAGAAGGCAGATTG 

CACC CATTTG CATTT AT CTTTGGATTTTGTT CTTTG ACAT ACACGTG AAAAGTGTGT ATGTGTG TGTGTGTGT ACT AG T AAGCTG A 
ATGTTGCCAAACATAAAACGCATTTACACAAAATTACTCAGATCCTTAGCG 

CCCCATCCATTT ATTTCCTG 1 l i GGCTAG ACTCTGAGAAATT ATTTCTCAA

CTAGAGAAGGCCAGCACAGAGCAGTTCCACACCACGATGCTGTAAAGTCACT 

TATGACTACATGGTTGCTCTCTTCAGACACACCAGAAGGCAGCCTCGGATCCCATTACAGATGGCT 
TGGGAATTGACCTCAGGACCTCTGGAAGAGCAGTCAGTGCTCTTA^^ 

GTGACACAATCAGAATCCAGACCTGGGACCGAGAGCACTTACTGTGTTGTGT^

CCCAGGTTCTGTGTGGAAGACGGACTTGGCGAAAATACCCATGAGCAGCAGCTC^TCAGCGGCGGAGG^ 
AGAGGCTGTATACAGCAGCTTCCAGTGACTGGATATGGGAATCGAACAGCCTTGCATGGCGTCT 

CAC^AGGGGTCATCTTCGTAGGACTGTGGCATCAGGTGCAGGGGGCAAGGGCAAAGGCArc 

CCCACATCAGACATGTTTGTCCATTTTCrCTCATTTC^CCTTAGACCA

ATTTCTTCCTTTGGGGAGAACCACTTATAGACAATATCAGCTCTCTACAAATAACACGAAAGGTTO 
AAGTGCCCGTCCTTGCGGCrCTGATCCACGTGGCTCTCCGTAGACAAATTGTTTT^ 

TTTCTT ACAAG TG TTTGGG ACTCCTTCGGGGTGG CG TTTGTG TGGG CAG CATG AGCTAAG CAGTGACCT CCACTTT AT CCAGACTG 
AAGATGGATTTGGTCGTTACAGGTCGCTCTAGGCCAGGTTTGGAAAC^^ 

CAATCGCAGTCCCACAGGAGGGAGGGTTGCCATAACCGGGAGT

T ATTT AAAAAAT ACGT ACAAAAG AG TCATGAAAAACACG CC CAG ACT AGG CTTTGTGG CAG CCG AGTGG CTTGG CATGAATTTGT A 

CAGCGGAATCATCGGGCACGAAGGATTGGATACGGAGATGGAATCCGGCTC^ 

CACTGAAGGGCTtXXJATTTCCCTGGAATAGTAAGCTTC 

AACAATTACTTGAGTGATTTGAGATGGGTGTGAGGTCCACGC^ 

AAAGCGGTCCACGCCTCCGGGG CTAT AG CGCCCGGAAAGTTGCC^AGTGTTTCTTG ATGTAGGCCITTGGTGCAAATG CT ACCAAGA

ATTACAAAATGAAGTAAGAACAC^GAGGGGGTTCAGCATCTAGTGAAGGAAAAGACTAAATGGAGG 
ACGGTCATTTTCAGTGTGCCGGGGTAAAGGTGCTTTTTTTGGACCAAGCCTAGAGTATCCT 
ATTCTTCCTTCAACTTACTCCCATTCTCCTTGCCACTGACCTAAGGTGTATAACTCT 
AGAAAGAAAGAAACAAAACAAAACCAAACCCAAAACCTTGCACAAGAGTO 

GTGCTTACAGAGGCCAGAAGAGGGCAACAGATTTCCTAGAACTC^AGTCGCGGATGGTCAGTTGTGAGCC^

AACTTCAACCCAGAC CTTCGGAGCC CTTCT AC C^ClTTCTTrr AAAACTTTT AATTG TGC CCTTT AA 
GCCCAGCACTTGGGAGG CTG AG ACAGGCX3GATTTCTG AGATCGAGGCCAG CCTGGCCTACIAAAG 
TATACAGAGAAACCCTGTTTTGAAAAAACAAACAAACAAACAAAACCCCCAAAAC 
TTAAAAACTAGTTTTGTTCTTGTGAATCTTCAGGTAGGTTGCGA 

CACTGGGTCCACCTGCCTCTGGGCAGTGCTGGCATTCAGGGCTTCCT^

GGAGAAGTCATTTGCAGATCTTTTTCCCCAGGTCTGAAGACACAGGGACAATCTG 
GTTTTTCTTTGTCTATCTTAATTTTC/TGC 
CCATTGTTTTCAAGCACGCACACACACACACAC^CACA 
AACATCTGCATCTGCAGCTGTGGCCAGGAGAAGTGAATGGTAGGTCTGGA 
CAGCTGGC^CCACCTGTCTAAGITCCTGATTGACAGGTAGGA
GACTCCCCTAGATCTCGTGGGGGCAGACTGAGACAGCAGCGGGGCITGTGGCA
TTCTTCAGTAAAATGTGCTCTGGACCAAGGCTTCAGCCAGAGTCCAAGATGGC^ 
TGGCTATGGAAGGGCAAGGAAAATGACAAAGAAAAGCCACTTTCAGA^ 
TTCATGGTTCCCTGCCACACTGGTGCTTGCTGGCATCTCCCATCGTCCTCAGT^ 
GCTGAGTGTTTCTCTCTITCCTCrGGGGTCATTTCCAGGCACACC^
GTGGTGTCIGCTTAGCTGCCCTGACTTCTCTGGCAGGGCTACCTCT 
GATTAGGTTATCCGTGTGTGTGTGTGTGTGTGTGTGTGT 
GGGATGGTTCACTGGGTGAAAGCGCTTGCCCAGCAAGCCCGA 
CTCAAAGCTGCTTACCAACTTCTCCAAGTGTGCTATAGCCCACAGATGC 
GAAAACATATGACTTTCCTCCCGGAGACAGTGTTTCTCTGTATAGCCCTGGCT

TTGAACTCAGATCTGCCTGCCTCCGCCTCCCGAGTGCTGGGGTCAAAGGCACATGCCACT 
CTCCATTTTAGAAAACCTGACCTAAGTGTG^ 

CAACAGAACCTGAGCCTAGCTCCCGTCTCTAGTCTAATCCCCAACAAGAGCAGAGTTTCCAG^ 
TCTCCAGGCTATTTCCCCAACAAATCTGCACCCCCAAGTTACAAGCCTACCCTCTGCCTAACGACCA 
AAGTTAAGTCTATC^TAGACTCCCAGCACCAGCCAATCATGTTO
CCCTGCTTGCTGCTTACTATAAAGCCTTGACCCA.TAGACAT^ 

ATGGTGGTGCGTTGGGGGATGGGCGGGAGGGTTGTTCGAACAGAATAAAGACCCTGGTGAGT^ 
TTCCACTCCTCTCCTATTGCAGGCCTGATGTTAACCTTGCCl^ 


MOUSE SEQUENCE - mRNA 

GAGGAGCCGCCGCCTTCTCGTCGCTCGAGCTCTGGACGACCATGGTCGCTCAGGCCCCGTCCG 

AGGGTCGGAAGATGCGCGGGAAGTAGCAGCCGTCTGCTGGGCGAGCGGGAGACCGACCGGACZACCCCTGGGGGACC 

ACCGCGCTTCTCCTGCGGCCTCCAGGCCAATGTGCAATACCAACATGTCTGTGTCT 

CCAGCTTCGGAACAAGAGACTCTGGTTAGACCAAAACCATTGCTTTT^

CACTATGAAAGAGATTATATTTTATATTGGCCAGTATATTATGACTAAGAGGTTATATGACGA 

GTTCAAATGATCTCCTAGGAGATGTGTTTGGAGTCCCGAGT1TCTCTGTGAAGGAGCACA 

AATTTAGTGGCTGTAAGTCAGCAAGACTCTGGCACATCGCTGAGTGAGAGCA^ 

TCCTTTG CAAGCG CCAC CAG AAGAG AAAC CTT CAT CTTCTG ATTT AATTTCT AG ACTGT CT AC CT CATCT AG AAGGAG AT C CATT A 
GTGAGACAGAAGAGAACA.CAGATGAGCTACCTGGGGAGCGGCACCGGAAGC^
CTGTGTGAGCTGAGGGAGATGTGCAGCGGCGGCACGAGCAGCAGTAGCAGCAG 
GGATCTTGACGATGGCGTAAGTGAGCATTCTGGTGAT^ 

TTGAGTCTCTGGACTCGGAAGATTACAGCCTGAGTGACGAAGGGCACGAGCTCT 
GTCTATCAGACAGGAGAAAGCGATACAGACTCTTTTGAAGGAGATCCTC 

CAATGAAATGAATCCTCCCCTTCCATC^CACrGCAAAAGATGCTGGACCCrT

ATAAAGTGG AAATCTCTGAAAAAGCCAAACTGGAAAACTCAG CTCAGGC^^ 

ACAGAGAATGATGCTAAAGAGCCATGTGCTGAGGAGGACAGCGAGGAGAAGGCCGAACAGACGCCC 

CTATTCCCAACCATCGACTTCCAGCAGCATTGTTTATAGCAGCCAAGA 

ACGAGAGTGTGGAATCTAGCTTCTCCCTGAATGCCATCGAACCATGTGTGATCT 

C^CGGCAAGACTGGACACCTCATGTCATGTTTCAC^TGTGCAAAGA^

GCCAATCCAAATGATTGTGCTAAGTTACTTCAACTAGCTGACCTGCT 

MOUSE SEQUENCE - CODING 

ATGTGCAATACCAACATGTCTGTGTCTACCGAGGGTGCTGCAAGCACCTCACIAGATTCC^ 
AC CAAAACC7VITG CTTTTGAAG TTGTT AAAGT CCG TTGG AG CG CAAAACG ACA.CTT AC ACT ATG-AAAG AGATT AT ATTTT AT ATTG

GCCAGTATATTATGACT AAGAGGTT ATATGACGAGAAGCAGCAGCAC ATTGTGT ATTGTTCAAATGA CTC CTAGGAGATGTGTTT 
GGAGTCCCGAGTTTCTCTGTGAAGGAGCACAGGAAAATATATGCAATGATCTACAGAAATTTAG 

TGGCACATCGCTG AGTGAGAGCAGACX5TCAGCCTCAAGGTGGG AGTGATCTGA^ CGC CACCAG AAGAG AAAC 

CTTCATCITCrGATTTAATTTCTAGACTGTCTACCTCATCrrAG 
CCTGGGGAGCGGCACCGGAAGCGCCGCAGGTCCCTGTCCTTTGATCCGA

CGGCACGAGCAGCAGTAGCAGCAGCAGCAGCGAGTCCACAGAGACGCCCT^ 
CTGGTGATTGCCTGGATCAGGATTCAGTTTCTGATCAGTTTAGCGTGGAATTTGAAGT^ 

CTGAGTGACGAAGGGCACGAGCTCTCAGATGAGGATGATGAGGTCTATCGGGTCACAGTCTATCAGACAGGAGA 
CTCTTTTGAAGGAGATCCTGAGATTTCCTTAGCTGACTATTGGAAGTGTACCTCATG 

ACTGCAAAAGATGCTGGACCCTTCGTGAGAACTGGCTTCCAGACGATAAGGGGAAAGATAAAGTGGAAATCTCT

CTGGAAAACTCAGCTCAGGCAGAAGAAGGCTTGGATGTGCCTGATGGCAAA 

TGAGGAGG ACAG CGAGGAG AAGGCCGAACAG ACGCCCCTGTCCCAGG AG AGTG ACGACTATTCCCAAC CATCG ACTTCCAG CAGCA 

TTGTTTATAGCAGCCAAGAAAGCGTGAAAGAGTTGAAGGAGGAAACGCAGCACAAAGACGAGAGTGTGG 

AATGCCATCGAACCATGTGTGATCTGCCAGGGGCGGCCTAAAAATGGCTC 

TTTCACGTGTGCAAAGAAGCTAAAAAAAAGAAACAAGCCCTGCCCAG

TCAACTAG 

HUMAN SEQUENCE - GENOMIC 

CGAGTAGCTGGGATTACAGGCACCTGCCACCACGCCTGGCTAATTTTTGTATT^ 

GG CTGGT CTCGAACTCCTG ACCTCAGATGATCCGC CCGCCTCGGCCTCTCAAAGTGCTGGGATTACT^ CGTGAG CCACCGCACTT

GG CAGTTCTlTTVlTTCTTTTTTCrTTGAGATGGAGTCT 
GCAAGCTCCGCCTCCTGGGTTCATGCCATTCTCCTGCCTCAGCCTCCAGAG^ 
CrAATTTTTTTGTATTTTTTTAGTAGAGACGGAGTTTCACCATGTTAGCCAGGATGGTCT 
GCCTCAGCCTCCCAAAGTGCTGGGATGACAGGTATGAGCCACCACGCC 

GTTATTAATATTGACrTTCTTCATTTTGAAACTTCAAATAATATTTGATTCAAAG

GTGGCCCTGGTGACCTGGAATTGGAAATGGGGGTGTGGGATGTAGGGCATAG^ 

TATTACTTGTGGTACTCAGAATTCCAAGAATGACTCCCAGTGACCTTGCCTATGGATGATC 

CCTGTGAATATGAGGAGATATTACTCCCATCATTCTGTTACATTACATGGC^ 

ACL n .TG AGC CTTTG AAGTCAGAG 'X ' TrTTT T CTGG CTG AT AGCAGG AG AG TAAG T CAGG TT CAAAG CACAAGGGTG ATTCAACACCCT 
GTTGCTGG CTTGAAGATG G AGGGGAGTGTGT AGCAAGG AATGCAGGAAGCCT"! CAGG AG CAAG
AAAAAGGGGACTACCGTCCTACAACCACAAGGAACTGAATTTGGCT
CCTTAAGAGCCCAAGCTGGCCCACACCTTGATTTAGGCTTTATGACACCT 

G AT ATATGGACTGTG AGCT AAT AATTTG TGTTGCTTT AAG CT ACTAC ATTT ATGG T AATTTG TTACACAGC CAT AG AAAACCACAC 
ATGACTCCTGCCATCCCGATACCAGTTGGGGTGGCAAGGAGGAAGGAGAGGA^ 
TTAGTTGCCCTTTTATTAGCAGGACATAACTGTG^

TCTGTTGGTTGTCCCAGTTTCTCTGATTATTACCAAT^ 

GAAAGACTCGTGGCATGGTCCTGGTAGGAAGTTGAGTAGGGGTATATGAATGAAGCT 
CTACCAATAGATATTCTATGTCTTTTGCTTCCAGGGTTCTAT^ 
ATACTCCTTCCATCTTCTCCAGGAACCACTTAATCTGGGGGAAACT 
ACTTCACTACCTTTCTTCTGCCAGCTGAA^

GAGATTATAAACTCTTCGTTCCACGAAAAGATGACCTCTATTATT^ 
TGG CGCACGCCTATAATCCCAGCACTTTGGGAGGCTGAG^ 
ATGATGAAACATGTCTCTACTAAATACAAAAAATTAG 

ACAGGAGAATCACTTGTATCTGGGAGGCGGAGGTTGCACrGAGCCGAGATCAAGCCA 
AACTCCGTCTCAAAAAGAAAAAAAAAACTCTCAATGGTAAAAGTTATCAAACCAGTTC^
GTCTCAGGCCAAGAATTGGGCCTTATTTGAACTTAGCT 

TGAAATATTTGTTATCTTCTAAGAGCAGGAACTCAATTATGCTCCCATATCATT 
ATAAATGTCCAAAAAAAGAGATCTATTGTAACTAAGTAGAATTTTATCTGAGTAAAA 
ATATAATCTCACCACATAGACAGATTAAAGGAGAAAAGCCACAGGGTC^ 
CACTGATTCATACnTTTTTATAAGCTCAAAAAA

AGCAAACATCACATTTTAAAAGCATTTTTACTAATG^ 

TGGTTGTAAGAAAATGAAAAAGAGGTTTAAGGAAGAGACAAGTTAGTGATTTTTGCA 
AGCATCTACTGAAAAACTAATAGTACTGTATAACAAAACAAAATACAGCCTGTAATCCCA^ 

ATCACGAGGTCAGGAGTTCGAG ACCAGCCTGGCCAAGATGG CG AAACCCCATCT CTAGT AAAAAT ATAAAAATTAGCCAGGTGTGG 

TGGCACGCACCTGTAGTGCCAGATACTCAGGAGGCTAAGGCAGAAGAATCCTTTGAACC^

GATTGTGCCACTGCACTCCCGTCTGGGTGACAGAGTGAGACTCCGTCTCA^ 

AACAAAAAAGCAGTATTGCTGGCTGGCTATAAGATCAAAACACAAAAATGAGTAATTGTTTCAT 

AAATAAAATGTTACXKSCTGGGCATGGTGGCTC^CGCCTGTAATCCCAGCACm 

AAG ATCG AGACCATCCTGGCT AACACGGTGAAACCTTGT CT CT ACT AAAAAT ACAAAAA CGGGCG TGGTG GCAGG CG 

CCTGTAGTCCCAGCTACTCCGGAGGCTGAGGCAGGAGAATGGCATGAACCCG

ACTGCACTCCAGCCTGGGTGACAGAGCGAAACTCCACCTCAAAAAAAAAG 

AAT AAAACTTAT AAAGCACTTCAGAAAAAAG C CAG CCAAACAT ACAAC AG CTTTAAGGAGGAAACAAT AAAGCATT ATT AA^ 
GTTAAGAGAGACCTAAAAATGGAGTGATATATCATGT^CTTATCTGGGAAGAATTC^ 

ATCCAATG CAATGAAAAGCAAAAT C CC AAGAGG ATTCTTTTTG TG AATT AG A CAAACTGGTT CT AAAAGTT AACATC CAGG AAGAG 

CC!AAAACAATACTG AAAAAG AT AAAGAGAG CTTTT C CTG TT ACATATTGAAACTT ACT CT AAGG TTT ACT ATT CTGTTG C CCAG CA

AGTCCTCTTATAGGTATGCAGCCTACTGTCCGTGTTGTTTATGGATTCATAAA 
GACAGGCATCAAATTGATCATTATCAAGAGAAGGGAGGGGAATGGGATAAAGAATAACAGTGAAA 
TTCATTAAAAAAATTTATTTAGGCC!AGGCTCAGTGGCTCACACCTGTAATCCTA 
TGAGGTCGGGAGTTTGAGACCAGCCTGGCCAACATGGTAAAACCCCATCTCTAC^ 

TACAATCTTGGCTCACTGCACTCAGTGTGGGCGACAGAGCAAGACTCT

TTTGATGCAGGAATGACAAATGTGAATTCTGGGTAGTGGTTTCTTTTTTTCTT 
GTGGTTTCTTTTGTTGAATATTAATTGTTCTGTGAATGTTT^ 
AAACAAAAAGTCTTTGTATTCACTACTTGCTGGGAl'l lTGCTTTTT 
TCTCACTCTCACTCTGTCTCCCAGGCTGGAGCGCAGTGGCATGATCT 

TGGAGTAGCTGGGATTACAGGCGTGCACCACCATACCCGGCT

CAGGCTGGTCTCGAACTCCTGGCCTCAGGTG ATCCACTCGCCTCACCCTGCCAAAGTG CCACCATG C 

CCGGCCTATTTTTTACTTTTTTATTTTCCAGGACAGAGTCTTGCTCTGCCGCCCAGGCTCGAGTG 

CTGCAACATCTG CCTCCCAGCTTOVAGAGATTCTGCTGCCTC!AGCCTCCCGAG CAGCTG AG ATTACAGG CTTGCACCACCATGCCC 
GCCTAATTTTTGTATTTTT AGCAGGGACGGGGTTTT ACATC 

CCCAAAGTACTCGX3ATTACAGGCGTGAGCCACTGCGTCCGGCC

GGAAAGACATAAGGAATTTGAACTACTTCTGTAATTTTAGATGGTCTCATAAATAT^ 
TGAGCTAAATCTTGTTTAGTTTTCACCATGCTTTAAAGATTA 
TGGGAGGGTGAGACGGGTGGATCACCTGAGGCCAGGAGTTCGAGACCAGCCTGGC 
TACAAAAATTAGCCGGGTATGGTGGC^CAAGCTTGTAATCCCAGCTACT 

GGTGGAGGCTGCAATGAGCTGAGATCGTGCCACTGCACTCCAACCTGGGTGACAGAGTGAGACTC

AGAAAGACAAAAGATTAGTTATGACTAATAAATACAAGTACGAAGTGGCCCTTGGGT 
TTCCCCrTGAGACATGGTCTCACTCTGTCACATAGGCTGGAGTGCAGTGGCAGGATCACAC 
CCTCCCAGCTCTGCCTCCCAAGTAGCTAGGACTATAGGTGAG 
GGGTTTCGCCATGTTGCTCAGGCTGGTCTCAAACTCITGGCC^ 

AGGTGTGAGCCTTGTTTGCAGAGTTTAGGTCCCAGAGAATATAAAGACACTOU^

ATCAAGAACAACGCTACTAGAAATAAGCAGTTTGCCTCTAGAGAAACTTCAA^ 

ACTGTGGAATGCAAGCAAACAGCAGCAGCTGGTCTGTATTTTTTTAAATGGCAAAGTCACAAA 

AGCAGGTAGTTTGCTTTCriTTTTTTTCAAATGGAACT 

TTTGTGTTTTGAAATCCCAAATTAATGAGCATGATAAACCGCTATTGCTGGCCCTC 

TTCCCCCCAACCTTCTCCTAACCCTCACCCTTTTTCCCTTTCCATTC

AAGGAG^AGAAGGGGGGAGGTCACAGTGCATGGGATTCAAGGTCACAG 
GTTCCCGCTGCTCAAGTCCAGCTGTCIGCCACGGTAGTGCAACCCTT 

ATCTCTTCTTGCCTGCATTTATCTGCGCTGTTTGAAGCCGTGTGACCCATCAGGACCCAGAGG 
GTCTGCTGTCTTCGGGTTAGGAGATCCTGGTTGGCCTGAAGGGAAGAAGATCAGAGGAAGACCT 

TTCT C CTCGGG T TGG CG CTGCCCAGGATCCG CTTCCTGG CCTCGGTGTACTCGG CTTCC CACTGTG CCAGGGACTTGACreGAAGG

GCGGGCCTGCTAGCGGACTTGGGGCrGCTGACCACACCGTTGCTGGTGGGC 

GGGAAAATTATCATCCTGAATCACAATGGGCACTTTGGGAGATTTG 

ACGTCTATTTCCCCGCTGTCTGCCGTCTCCTCCCAGCTCTCAGCAACCTCCT 

TTG CTTT CTTATTTCCACAITTT CT CTGAC i L i1TG CAGTATT CJL^T ATT ATT ACCTG ATTCTG ATTTTTCAG ACATCACT AT ATGG 
TCCTTGGCTTTTTGGTCGTTCCTCTCATTCCTTTGCTGTTTCAAGCTCTC
TGTGACCCATAACTT AAAACT AACC CAACTTTTGATT CC CT C CAGACTTTTTTTGG T AAGCAGT AT CAT CATGCTTT AGGTGGACA

CTGAGTCATACTGCTGTGGTAGTGCTGGTACACGTAGTT^ 

CCCCACCCCCGCTTCTCCCTTCTGCCTTCATTTCTGACCCTtCTTTTCTG 

C TT rTTTTTI CTGCTCT AAT AATTTTGTCAAGAT AATTTTTT CAAAG CAC AAGG CCAATG AGAAGGGGG AAAG T CGGT C CAG AAGG 
CAG CCGG CCCGACT ATT CTG AGTTTGAATCT ACTCTTGGTGGT C CTGTTAAAATACAGTTTGAAG CAAATCACTTGTTTTT ATGCT

TCTAATCTCTACTCTGTCGAAAGCCCAAATTTCCrTCCT^ 
ATGCAAGTCAAACGGATTTATTTATTGTATCTCAAACAGTCCT 
TGGAATATTCTCTGCTTTTATTTAAGTCTCACCAT 
CAGCATTTACTAAGCAATTAATTCTCTCTCTGAACT 
CTATCACTTATCrcnTTTTTTrTTTTTT^

CGCrCACTGCAACCTCTGCCTCCCGAG ATCAAG CGATTCTTCIt3CCTCAGCCTCCCAAGTGGCr 
GGGCCCC^CTAATTTTTGTATTTTTAGTAGAGACAA^ 

ACCCACCTCAGCCTCCCAAAGTGCTX3GGATTACAAGCGTGAGCCACCGTGCGCAG 
CrCACTGCriTGAGATTTTCTTTCATCCAGGGTCAAGCACTGAGTCrrATTAGAA^ 
T ATCTI'TIT AAAAG CCACAT ATT AAATCTCIT AAGTT AAT AAT ATTGG CAATCAACTT AAATCAACTG AGTG ACCACTATG TTT AA

GG AAGTTT CCTTTCTGGT AGG CTGGG AAG CGGG AGAAGGG AAGGATAT AACTTT AT AAAAAAAAAAAAG CTGCAGAAGGG AAGGAT 
AT AACTTT ATAAAAAAAGGTCCTTGTTCACAAGG AG CT AATCACTTACTGG CCAGAGAAGAAACGTGTTCCTTCTTTGCAATGGCT 
GTGTTTTTGGGTTTAGCTAACTCAATTCITTATGT^ 

AAGTAGGTGAGTCAG AATG AG TG CAT AAT AT AT AAG T AT AT ATTTTAATG ATAATG AATGTGT AGATTAACAAACCCAAACTTTG T 
TTTTGTTTTTTTGAGAGTCTTGCCCTGTCGCCCA^

TTCACAC^ATTGTCCTGCCTCAGCCTCCGGAGTAGAGTAGCTGG 
TTTTAGTAGAGACGGCGTTTCCCATGTTGGCCAGGTTGGTCT^ 

AGTGATGGGATTACAAGCGTGAGCCACCX5CX3CCCAGCCAAACCCAAACATTCTAAAAGCG 

ACGGTAAC CAGCAT CTTCG TT CT CC AACCAGAACAG CTGTATG CACCGGTGT AT AAG C CTG C CTT AAG TG CT ATTTTTAAT C T T TT 

TTAAAAACAGGCCTTAAAATAACTCATTTGGGTACAACTCCAGCGGCTGGTGGAGT

TGGTTTCTTTTTGGTAACAGCGACACGGAC^TCCGCATT^ 

AACATAGCGCAACGGCTAAAGG AGTGT CACAGCGCC AAACCTTG CTGG CTCCGCGC CAGG CCGAGCCC CACCCTCT CAG CTCGCGG 

CCAACACCCCt^CCCOSCCrCACAGCCaSCCGCGCCCGCGG 

CCCGGACCCAATTGCCGGAAGCGGGGCCGGTTGTC^ 

GG<5CGGTAGGGGGCG03CACCGAGGCACCGCGGCGAGCTTGGCTGCTT

CAAGAAGCCXIAGCCC^AGGGGCGGCCGCGACCCCTCTGACCGAGAT^ 
TTAGTGCGTACGAGCGCCGAGTGCCCTGGCCCGGAGAGTGGAATGATCCC^ 

AAGGAAACTGGGGAGTCTTGAGGGACCCCCGACTCCAAGCGCGAAAACCCCGGATGGTGAGGAG^ 
CGGTCACTTTTGGGTCTGGGCTCTGACC^TGTCCCCTCTATCGCTGGTTCCCAGCCT 

CCAGGGAGGAGGG CGGGATTTCGGACGGCTCTCXI CGGCGGTGGGGGTGGGGGTGGTTCGGAGGTCT 
GGTCACGGGGGCC«3GGGCTGCGGGGCCGCTrCGGCGOXX3A^ 
CTGCGCGTAGTCTGGGCGGGATTGGGCCGGTTCAGTGGGCAGGTTGACTCA 
TCCGAAACTGCAGTAAAAGGAGTTAAGTCCTGACTTGTCTCCAG 

GGCGATTGGAGGGTAGACCTGTGGGCACC^ACGGACGCCACTTTTT

AGTTTTAACTtiTTGTTTATGTl'ClTlATATATGATGTATTTTCCACAGATGTTT 

TCCTTGTAGGCAAATGTGCAATACCAACATGTCTGTACCTACTGATGGTGCTGTAACCACCTCACAGATTC 

AGACCCTGGTTAGTATTTTTGTCTCGTGTAACTTTTAAGAATAATTTATTTTATGAAGTGAT 

TGTAGCATGGCTCTGTAAATAAAATTGTATGTGCATAGCTTAAAGTGTGCT 

AAGAACTGGTCAGTATTTTTTTAGCCATTTGCCACATTTGCTTTAAAAATTAGTTCCATTA^

GTTTGTTACGTGACTGATTTTTTTTTCCTTTTTGACCTCCCAGTATTTAATTT^ 
TCGTGCTTGTTTTAAAGATTTTTGTCATTAGAGAAATTTTAAAGTAAAACCTGAAAAT^ 
TAGATTTTTTGTGTGCGTACCCACATATATTTACATTTGAATTTAAATCT 
ACTGTAGTTGTTCTGTGACTTGCATCCTTGTCAATAAATAAAGTCTAC^GAATT 

GAGTATACCATTTTAAAAATTATTTATGCCTTTACTGATAGGTTTTTAAAAATTGCCT

GCTTTGTACATGCATCCCGTGTGACTATTTCTGTAAGAAAGATTTTATAGAATTATAGGCTGGCCT 
CCCATCACTTTGGGAGGCCAAGGCGGACCGATCACCTGAGATGAGGAGTTCAAGACT 

CT ACT AAAAAT ACAAAAATT AGTCGGG C ATGGTGGCG CG CG C CT AT AATCCCCTG CTGAGG AGG CTGGGG CAGGAG AAT CG CTTGA 
ACCCAGGAGGCAGAG<3TTG(XSGTGAGCCAAGATTGCGCCATTTCACTCCAG 

AAAAAAAGAAAAGAAAAAGATTTTAGAGAATTAGAACTCCTGAGTTCTCAATTTTACT

CAAAAGGCTCTAACAGTTTCACTCCTGCCAATACTGAATATCACCAATCTTTAA 
TCTTGCTCTGTCTCCCAGGCTGGAGTGCAGTGGCACAATCTCX5 

CCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCCACCACCACGCCCGGCTAATTTTT^ 
CCATGTTAGCCAGGATGATCTTGATCCCCTGACCTGGTGATCCGCCCGCCTCGGCCrrCCCAAAGTGCT 

CACTGCGCCTGGCCTCTTTTATTATTATTACTATTTTTTTGATAAAAGGT

CTTC^JTGAGGCTGGGCACCTTTCCTTTTTGTnT 

GCAATGGCGAGATCTTGTCTCATTGCAACCTCTGCCTCACGGGTTCAAGCGATT 

ACAGG C ATGCGCCACC ACG CTC AG CT AATTTTGT ATTTTTAGTAG AGATGGGG TTTTTCC ATGTTGGTCAGG CT 
CCAACTTCAGGTGATCTGCCTGCCTCG^CCTCCCAAA^ 

AGTCATTTATTTCTTCTGGGACTTGCCTAGTCTTATCTGTCCATTTTCCT^

GCrCTATGTGTTTO\AGTATATTTAACTTTTGTTGTT^ 

TTCTGTGAAGTAAAATTTGTTGGC1TTTGAATTTCATATCATG CTTTGGAAGAACTCTCCrGTCCCAAGATCITGOUUlTATATTC 
CTATATrTTCTTCTAGATTTAAATTTITTAAATATTTAGATC 

ATG CT ATT AT CTTTT AT AG CTGTTTT CTCATT AAT ACATGT CTTTTAAGT CACTAATATACTGATGG CTTTCCATTACAG ATTTTT 

T r i TlVrriTT lTT l ' l XjAGACTGAGTCT^

TTCCTGGGTTCAAGCAATTCTG CCG CAGCCTCCCGAGTAGCTGGG ATTACH CACCCACCTCCACACC CGGATAATTTTTGTACT 
TTTAGTAGAGATAGGGTTTCACCATGTTGTCCAGGCTGGTCTTGAACTTCTGA^ 

GC~T GGG ATTAAAGG CATC AG CC a .CTOCG CCTGG CC CACACCTGGGT AATTTTTGT ATATTTAA CACTGTGTT 
GGCCAGGCTGGTCTCACACTCCTGACCTCAGGTGATCTGCCTGCCTCGGCCT 
CACCTGGCTTCCATTACAGATTTTTACrTTG
AAAGGTT ATTTTTAAATCTTTTTTGAGAGAGTC CATTCTTT AG C CTG AAAG T AAAGTG CAT ATG T AAGCACAG TT AAAAAAAAAAA
AGTG AT CAGAAAAATT AAT AATT CAATTCT AT ATGT ATTTT AAATTTTTCAATTG TTAAAT AAT ATTTGTTTT ATTGATT AGCTGG 
AAATAGGCAATTTTTAAAAATCTGTTTTTAAGGCATCCCCAATGGGGGTGCTC 
TATTAAAGAGAGGTGCAGAGGTACCAGTTTGTGTGGTAGTGGCAAGCT 
AAGGAAGGGAGAAAGTGCATAGTATATTACATTCCCCTCCCCCATGACAAT^
AAGGGGCAGGauATGTCAjGCCTGTCAGCCAACTTTGGCC^ 
GAGTCTCTCAGTGTGGCCCAGGCTGGTCTTGAACACCTAGCCTCAAGCA^ 
GGCAAGAGCCACCTCTCCCAGCCAGCTGCCTTTCTTTCTCT 

GTAGCCATG L'TITTTACT^GGATAGAACCATTTGTATGAAA CTCTTTACAGAAAAAAATTGAC 
CCCTAATCTAGTTGGCAGGCTCCTTISCATACAGGTCATATCTCTTAT^
TAGTAGAAGCTCAATAAATGTTTGTCAAGTAGATATTTGAAATGT^ 

CTGAGTTGT AGTTG AGGG ACCTGGG TTGTT AT CT CTGATCTG TT CTT AATT ACCCTTGTCAG CTTGGGTT AG TT AG TT AGGTT CT C 

TAGCCTTCAGGAACCTAATCTTTAGAAAGAGAAGATTGATAACAGGACCCACTCT 

AGTTAGCTTGCAATAAGCTCGTTGGATCAGTACGGTAAGCT 
CTAATATAAATAGCTGAAGGAAAAAGTAATCTTTGCTCTTTTGG

ATGATACTTGCATAATGATTAGATCCTCCCCAGCATTTTT^ 

CTATAGTTCTGGGATAATTTTGGAAGTATAATAGCAGTTCTTTTCTCTT^ 

AAAGTCTGTTGGTGCACAAAAAGACACTTATACTATGAAAGAGGTAAGCTGAATCAAGAGA 

TAGCCATACTTAAAGTTTTCAAGACCATGTGGAGAAATTGGCCATATAG 
GTCGTTTGGACT AT ATTTT AAAGCTGAAAATTCTAATTATACTCGAGTT^

CGTAAGGGTGCTTGACCATTTTATGCTGTGAACCC^ 

TATAGAATTACAAAGGAAACATACTAAAATACGGTTACCAAAATAGTAAACTAGTTTGTGAT 

ATAAATACTAAGATTTAGAAGAAGATATAATAAATATAATGTTTTTTTGTTT^ 

GAGTGCAGTGGCACGATCTCAGCTCACTGCAACCTCCGCCTCCTAGATTCAA 

GATTATAGGCACCTGCCACCACGCCTGGCTAATTTTTTC

G AACTCCT AG T CT CAGGTGAT CCACTTG C CTTGG CCTCC CAAAATG CTGGG ATT AT AGGCATG AG CCACCATGC CTGGCCAAT AAA 

TATTATCTTGATATAGTGAGGAATATAAGTCATATTTTGAAATGTTTGCCATAAGT^ 

TGTTGGTGATGAAATCACAGGTATTAATACrrcTTGTGA^ 

AGTTTGTGAAAATAAGAATAGAATTTTTTTTTTCCCA.TCCAAA 

CCTGGATCCGAGGTTAAGAACTTCTGCA.CTAGAGATACATGAGTACAGTATA

GACCATATATTCTATCTAATGGTTCCAGCATTT^ 

TAGAAACTTCCTTAAAGGATGTTTGGTAGAATAAATCTAAGTT 

AT CCAGAGTCACAT AAG AAT ATTCCTTT AAGGCCGATGC CTG CTT AGCCAG TT AAATCCAAAGACAG TG TTTGT AG GAAAG GG C CA

GG TTAAATGGTATTTTTAAGGCTAATTTGGGATTGACCTTATTATCATTGGATA^ 
GCAGTAGATAGAAATGAGAAAATTGGCCAAGX3CCATCATAAAGACTTAGAATGCTTATGCTA 
TTGTCTGATATAGTTCTCAGTTATAGGATCrTTCTTCTGGGGCACA 
GTTGGATTAAGAGTCAGAAGAGCTGGGGCCAGTTTCTTCTTCTTT 

CTTGC CCAGG CTGGAGTGCAGTGGCATG ATCACTGCTCG CTG CAG CCTGGACCTCGTGGGCTC AGGTG ATCCTCCCACCTCAGTGT

CTGCAGTAGCTAGGACTACAGGCTTGTACCACTATGCCTGGCTA 

CCAGG CTGGTCTGGAACrrCCTGGGCTCAAGGGATCTGCTTACCTCC^CCTCCTAAAGT^ 
C C CAACGCAATTTCTT GATTTCTAATACAGTG AT AAT^ 
AACTGTAAAATGCCATATGTATTTAGGTAGTTGTTAATAGGTTTATATCT 

AATGCAGCACAGGGAATATAATTTAACTTTTATTAATC^TT

TACCTTAGAGGGTTTATTTCATTCTCAGAATTCTCrrCAAACTGTTTTGTA 
CTGGTGACCTCAGAGGCTATTGAGGTTTATAGTATTTATGACTATTTACTAAGT 

TTTGTTT CTG CAACTTTTTTGGCTATCAGTTTTTTATTTTGCTG CCTTTTCCCTTC CT AAT ATGTT AGT ATCT AAGTATCTGGG CA 
GAGTTAAAGTCTACAAGGAAAAAAACCTCACATTTTAAGG 

TAAATGGAAAAGGCTTAAAAGTAG CTTr riT T ^ ^

TGCAGTGGCX3CAATCTCGGCTCACTGCAGCCTCCACCTCCTGGG 

TACAGGTGTCTGCCACCATGCCTGGCTAACTTTTGTATTTTTAGTAGAGATGGGACCA^ 

TCCATCCGCTTCGGCCTCCCGAAGTACTGGGATTACAGGTGTGAGCCAC^ 

TTGTTCTACATAGTTGTGGATATGGTTCCTGGTTGTTTACCCCTATTCAGATA 

ATTTAGTTTTCTTTAATGCTCAGAAATCATATTTGTATTTCAGG

ATG ATG AG AAGCIAACAACAT ATTGTATATTGTTCAAATGATCTT CT AGG AG ATTTC TTTGG CGTG CCAAG CTTCT CTGTG AAAG AG 
CACAGGTAATTCTTCAGTTTAGTCCATTGTAAAAAGCCATCTGGGCTAACATTT 

GAG AAAAACTGTTGAAACAT ATT AAAG ACATG CTGAAACTT AATTTTTT AG CT CTG CGG CAT ATT ATTTG AG AAC CTGAG CTCTGG 
C^TCTTATAAACCAGTGTTTGAAACCTAGCCCAATCATTTACTAGTTTTGTGACTAGATA 

(IATTT AT AAAATAGGGAT AATT AAATACTTCATGTGGAGGTT AAAG CT CTT

GCACATACAAAGC^TTCAAGTATTAGCTGTAGCrATTATAACTTTT^ 
ATTTGAGCCTTAGTTATTTTC^AGATCATTTTCATTGTCACCTAT 
TTCTAGAAATAGGCCTGGTGCGGTGGCTCACACCTGTAATCCCAGCACTTTGA 
AGTTGGAGACCAGCCTGACCAACATGGAGAAACCCTGTCTCTACTAAAAATACAAAAT^ 

ATTCCAGCTACTCGGGAGGCTGAGGCAGTAGAATCAOT

ACTGCACTCCAGCCTGGGCAACGAGCAAAACTGCATCTGAAAAAAAA^ 
TTGTTTGTTTGTTTTGAGAGGGAGTCTTGCTCTA 
TCCTGGGTTCAAGTGATTCTCCTGCCrCGGCTTCCCGrAGTAGCT 
TAGTAGAGACAGGGTTTCACGATGTrGACCAGGCTGGTCTTC 

GCTGGGATTACAGATGTGAGCAACCACACCCGGCCT

AGCCTTAGCATTCATTACATTAATAAATGGAATGAATACGTTCACCTGAGACAATTA^ 
TAGTATAGCTGAAAAGATGATAATTAAATGGGATTTTATTTTATTAAAAAACATAACTT 
TTTAAAACAACCTTATTATTGTAAACATACTATATATTATTATTGAAC^ 
AAAAAGCACTGTAGTTTGTTAACAAAAGGTATTTT^ 
ATATAGTACTATATTTAAATTTCATCTAGCTTCTTAATTTGTATTATAA
TTATATATATTACAAATATATGTATAATTATTAriTTTm
AOSTGGCCTTGGTATATATGGTTTAATTTTATAAAT^ 
TTQAATCTTCTATTTTTTTTCTTTTTTTTTK^ 
CGATTCTCTTGCCTGAGCCTCCTGAGTAGCTGGGATTACAG^^ 
ATGGGGTTTTGCCATGTTGGCOVGGCTGGTCTCGAACT

TACAGACGTGAGCCACTGTGCCTGGTCTGAATATTGAATCTTCGGATAATGTTTGGAAGTATC 
ATGTTG AGAGATAAG AAAT AGGAGAAATT ACTGACTTGAAAAGGAT ATGAATT A 
ATTCCCAAGGATGAAAAAAGATAGTTGGGATTCATATTTC 

TAAATCCTTAGTTTAAAAACAAAAGGAAAAATCCATTGTGACATCAACGAATACTC 
T AAACTTCATG AGGGCAT AT ACATTT CTGTTTCATT CTTTGTTTTCC C CCGTGT CTGGTACATAGT AC CAGTGACTGGT ACGTGGT
AGGTTTT CAGTCTGT ATTTATTGAATGAATG AGT ACT GCTTT ATCTGTT AAT AAGTri'Crrri'ri'l'l'CTT CCCTG AGT CGGAGTTT 
TGCTCTTGTTGCCOW^CTGGAGTGCAATGGCGTGATCT^ 

TCAGTCT CCTG AGTAG CrGGGATTACAGG CATG CGCCAC C ATGC CCAG CT AATTTTAT ATTTTTAG T AGAG ATGGGG TTT CT CCAT 

GTTGGTCAGGCTGGTCTTGAGCTCCAGGCCTCGGGTGATCTGCCGGACT 
GC CACTGCG CCCGGC CT CTG TT AAT ACGTTTTAAGAT CAAGAAAGGAAGTTGAAGCITG ATG AAAGTCAAG ATT AT AATCZAAATT C

CT ATTT AATGGATTTTG CGT AACT CAATAGTlTTrriGG CAGTTTGT ATTT AATGGATTTTG CAT AACTGAATCCTTTTTTT'rGG C 

AG TTTGCTACT AAGCAGAGAG CT AAAAACATTC CTG TTTGAAAAT AAG TATG TT CAACCTCTTCCTG CTTGCAT ACTCATTTTTTG 

GGGG TCTT CTGGT AAAGTCCAAACAAAG CACATTTCAGGAA TAATT ATT 

AAAAGTAGCTGTACATAATCATTGTCCTACTAATTATTGTGCTTTTAA 
TGTGTGCAGTAGTTCATACTAAGTATGTATGTAGAAGTCTGGTTAGATCC^CTTAAT

CTTGTTATTTTTTTTTTTTCTGTCTACAAGGAAAA 

TAATTTTGAGCATCATGG ATAAAT AC CAT AAAAACGTTTTAAAG ACATTT 

TTTAAAAGTTGCrTTAATTAAATATTGTAATAAAAGTATGATAAATTTATTAGA^ 

TCTAATGTAAAGTTAAATATCCTGTTAGATCAATTTAGACCTGGGAT^ 

AAAGCTCCTGTTTAGGAGTGATCTACCTTGAAAATGTATTGAAA

AAGGATGAAGGATGAGTCTTTTTCTTTGGTTATAGACAAAATTT^ 

CGTTTCTT AACTGGTT ATTTTTGGGT CACATCTCT CTTT AAT AAG CTGAT AAAAG C ATTGGGTTCTTGTT CCAGAAAGATG CACAT 
GTATATATACATGGGTAAGTCAATAAGTGTTTTACAACCTACTT^ 

AGTATAATACTCTCATTTGAAGGCTAAAAACATC^AGTTGGGAAGAGATGTAC^^ TT 1 1 TTTTTTTTTTTTTTT GAG 

AAGG AATCTTG CT CTGTCACTCAGGCTGG AGTGT AG TGG CATG ATCTCG ACT CACTG C AACCTCCGC CTCC CAGGTTCAAG C AATT

CTTCTGCCTCAGCCTCCCGAGTAGATGGGACTAC^GGCGTATGCCACCACG 
CTTGAGCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCT^ 
TGAGCCACTGCACCCGGTCTCACAGTTTGCTTTTGAGTGCTGCTTAGAGCTGTCT 
CTCGAACTACAGTTAAGAAATGCTCTTAGGCCAGGTGTGATGGCTCATC^ 

GATCACTTGAGCCGAGGAGTTCAAGACCAGCCTGGGCAGCA.TGGCAAAATCCT

TAGTGGTGCATGCCTGTAGTCCTAGCTACTCAGGAGGCTGAGGCAC^ 
TGTGATCATGCCATCCAGTCCAGGTGACAGAGCAAGACCTCATCTCTTAAAAA^ 
TCAGGAGCTTGTTTAGTTCCTAGAATGAATTGACCACTCCCTTGTACTTCCTCTATT 
TTGTTTTACITGTGGGTTTriTTGTTTGTTTTTC 

CACTGCATCCTCCACCTCCAC^GCTCAAATGATCCTCCCCCTCAGCCTCCCAAGTTGCTGGGACT

TGGCTAATTTrrCTGTATGTTTGGTAGAGACXSGTGTTGCT 

CT CGGCCTCCCAAAGTTTAGGGGTTACAGGCATGAGCCACTGCGTCTGG CrGTTGCACTTGTTTAT ATTTTTGTTACCTGTGTCCT 

ATTCAGGTTTGCATTGTGCATCACACACTATAGTGTTTTATTAATATGGTATT^ 

CAAAGTTAGGATTTAAAGCCCAGAAAAGTATCTCCATTTTATCCCAAAATAAA^ 

TGTTTTAAAGCTCTlTTfTTTTITGAGCTTTAGATTGTATCCT

GTACATGTGTACAATGTGCAGGTTGGTTACATATGTATACCTGTTGTTTTAAAGCTCTTT 
CAATTGTAGTATTTGTAGTAGTAGTTTGAGTTTTCTGAAG 
TGGCACATTACTAGAAAAATTGTATTATTGAGATGGAAACTTACAAAT^ 
GAATTAACITCAGAACTCCTATTCCATGTTCTCAACTTTCT 

GGAGTGCAGTGGTGCGATCTCGGATCACTGCAACCTCTGCTTCCCGGGTTT

GGATTACAGGTGCCTGCCACCACCTCCGGTTAATTTTTGTATTTTT^ 
AAACTCCTGACCTCAGGTG^TCCTCCCACCTTGGCCTCCCAAAGTGCT 

GGGCTT C CTTTTT ATT AC AT CTTTTTTGTAGCAT CTGTTT AT ACT AAT CCAAAATTC CTG G AC AAT AT AAC CT ATT ATGT AC AACT 
T AAG AAAATT AGTGT AATG C CTT ACCCATAGTAT AAAGAGAAAATT AATCTATG ATAATG CAT AATTT CAAAATG TTAATGTC CAG 

GCATGAGTTCACTGTTAAGATGAAAATGTTTCCAAGGGGGGTAGTAAAGG^

AAACTTTGACTTTTGAGCTCAAAGTCTGTATATCCAACTACAAATGAAACCT 

TCCAATCTGTTTATCTTTCTAAAGTTTCTCTTTTAGTGAATGGTACCACTCATTTATTCACCCAA 

TAACTCCTTTCTTCCTTGCCCTCTATATCAAATACCAGG 

GCCTTAGTTTAGG CTTTAATAGTTTTTAGTTTGT ATT ATTGCAATTGTCTAAT ACTGGTTTTCCTC T T T 1 CTT A 

TGTTTTGCrAGAGTGATTTGTCACTCTTCTGCTTAAAATTACCATTTAGGGGT

GAGCTAGAAATCAGTTTCATTTCTTGGACAAAAATCCTTTTTAATGGTGA 
ATCTGCGGGGAAGAATGCAGTGATAAGGGAATGACAGCATGAGTCCTGCTT^ 
GCCTACATTCGACTAATAATGAACTGCTTGTC^TGTCCTGCAA 
CGTTTTCTTGGTCTGGAATGGCCTTTCCTAGCCCrATTTTCATA 

TCCCTGTCCCTACCAAAATAATGACTCTCCTGTCTCTCTGCTAATTTATCTAAC^

TCTACATTAAAGTTGAATTATTTTATTTACATCTGCAACATTTAGTATGAGGATC 
GAAAGGTTAAACTGCATAAGGGTTTGTGCTAGACTGATAGCATATCTACT 
TTCTGATCCTTTTTCTTTTCTCTCAGAATCATCGGACTCAC^ 
ATCAAAAGGTAATCTAAGTAAATTTCTCATTCTAATGTAATACTATCTGCAA^ 

GGTTATGTTAAGTTTGTTGTATTTTATTTTTTTCCTAAATGCTTAGGACCTTCT

CAC^TTTGGTTTCTAGACCATCTACCTCATCTAGAAGGAGAGCAATTAGTGAGACA 
ACACAGCmTTGATATTCTTTCTCTAATGAAATTAGTC 

AAC CT CTG AGTTTTTTTCATT CG TGATTTTATTTG ATTTACAAATTG CTT ATTT AGCAAT ATT^ 
CAGTTTACTGTGCCCCACTGGAGTTTATAGTTAGCCAACTCT 
TGGCTCACttCCTGTAATCCCGGCACTTTGGGAGGCCGAGGCTG^^
GGTGAAACCCCATCTCTACTAAAAAAAATACAAAAAGTAGCTGGGCCTGGTGG CG CACACTTGTAATCCAG CTACTCAGGAGGTTG
AGGAAGGAGAATCACTTCAACCCGGGAGGTAGAAGTTACAGTGAGCCGAAATTGCGCCACTC 
AGACTTCGTCTAAAAAGAAAGATAATAGGAGACAGGCATATAAACTCTATTTTCAC 
TTTTAAATTGTTATTTTTAATAAATGGTAGTACATACCTATGAGTT^ 
AAGGGCCGGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATC^

CATCCTGTCT AACATGGTGAAACCCCATCT CTACT AAAAATACAAAAAATTAGCTGGATGTGGTGGCGGGTGCCTGT AGTCCCAG C
TACTCGGGAGGCTGAGGCAGGAGAATGGCATGAACTGG^ 

GGGCGACACAGTGAGACTGTCTAAAAAAAAGAGAGAGAGAGAAAATAGTTGACAGAG 
TAGTAGACGAGAAGGCTGTTGCCTGAGGAAGTCGCAGTAACTAAGACrc 
CAGAATAAATGAACCCAGACCAGAATCTCGTACATGCGGGC^

AACAAATGGTAGTTACTTGGGGAAAAGGIXIAAGTTAGATCTGTACCTTATG^ 

GAAAAATAGAATACAACATATTTGAAAGATAGTCACTTTAAATTTGACT 

AATAAGGAAAACATTAAAACTTCAAATAGOVAAAAGGGCAGACAGTTC^ 

T ATGGGGAGAGGGTG AATTTTGGTGATTTTT AG CTTT ACAGAT AG T AAAAAATG C CAAAAGGG TGT CCTTTTGAT CT ATCAAATT A 
GTAAAAATAAAATTTTTACTCATCCTTACTC^TCAGTGCTAATAACTTG

TAAAAGTGGGTAAAAATTGATTAAATTTTTCGGATTATAAAAAAGCTTAGATGGCCTG^ 
CACTTTGGAAGGCCGAAGCGGGTGGATCTCTTGAGGTTAA 

AAAAATACAAAAATTAGCCAGGTGTGGTGGCAGGCGCCTGTAATCCCAGCTACTTGGG 
TGGGATGTGGAGGTTGCAGTGAACCAAGGTCGCACCACTGCACTCCAGCCTGGGGTACAGAG 
AAAAAAGAAAAGAAAAAAAAGTCTTAGAGTTCTTAACATTATCCTAGTAACT

AAAATTCAAAG ATTT ATATGTAGAG AT ACTAATTACAG CATTATTTAT AATTG CAG CCCGGCCAAG ATCAC CTTTTT AAAAAT AGG 
GAACACAAACATGATTAAGAACTCACGTGCAGAACTTGTGATCT 
CTGTAATCCCAGCACCTTGGGAGGCCGAGGAGGGAGGATCACGAGGTCAGGAGATTG 
CGTCTCTACTAAAAATACAAAAAAAATTAGCCGGGCACAGTGGTGGGCACCTGTAG 

AATTGCTTGAACCCAGGAGGTGGAGGTTGCAGTGTGCCGAGATTGTGCCATT

TCTCZAAGAAAATGATAAAAAAAGTCCTTTCAGAAATCTGTGATACTATCT 

ATATCATTTCACTTTGCATT U ' C1 " 1T GATCCCTAGGAGGTAAGAG1'T1"1 " 

TATGTTTATTAGTCATTTGTATTTCTTCTGTGACTTGCTTTTTC 

GACTTTGTATAATATGTATACTATGGGTAATTTTTACTATATATGTTGCAGATATTATCT^ 

TTTGTTTGGTATTTTATTTTGTCATGTAAACCTTGA

AAT ACTTCTCCACC C CAGCATT AT AAGAAT ATTTTG TT ATAGGT ACTTG AATGG TTTTGTTCTTT AT ATTT AG ATCTTT AATC CA 

CTGGAATTCATTTTTATGTCTAGTATAAGGTAGTATTCTCA^ 

AATTTATTATTTAAAATGAAATGGAAACTTTTTACTGGTTTTC 

GCTTAGGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGAGGATCACTTAAGGCC^ 

GGTGAAACCCTGTCTCTACCAAAAATACAAAAATTATCCAGGCATGGTGGTGTGCACCTTTAATCCCAGCTAC

TCACAAGAATCACTTGAACCTGGGAGGCAGAGGTTGCAATGAGCTGAGAATG 

ACTCTGT CT C AAAAGAAAAAGTG T AGGGT ACT G AAG AAATTGAAACATT AAGAAT AC CAT ATG AG T AAATT AAAC ACTTTGGCT CT 

TTTCGGAAAAAAACAGATGAGCTCTTATATTTTAAAGTTTGGTTTT 

TTATTAGATAGATTAAATTGTGATGTATGGAGATAAAGTTCTGAAAGAAGAATAG 

GGCAACATGGTGAAACCCTGTCTATGCAAAAAATACAAAATGGTGCATTCCTATGGTA^

GGCTGAGGTTGAGGTGGGAGGATCACTTGAGCCCATGAGGCAGAGGTTGCAGTGAG 
GTGACAGAGCAAGACCCTTTCTCAGACAATAAATAAATAAACAAATAAATAAATACACACAGATAG 
ACAGATTCAATAAAACAGTGTTAACTTTTGAACTAATTTTATTGAAACT 
TTTCTAAATCACAGTAC^GCAATTTATTTTTTTTCCTTAC 

AAACGCCACAAATCTGATAGTATTTCCCTTTCCTTTGATGAAAGCCTGGCTCTGTGTGTAATAAGGGAGATATGT^

CAGTAGCAGTGAATCTACAGGGACGCCATCGAATCCGGTAATGTTCTCATT^ 
GTCCTGGCAGTCCTAATAAGGATACGGATAGAATGTACATGTGTCTTA^ 
AAATAAAACTACTATATTGATCTCC^CCTTCAACCCAAGGTATTGTC 
AAATTAGTAGTAGTATTAGTAATTATCAATGTCACTGTATGTATTAGAGA^ 

TAAATTGGAGTATAGAGTTGTTTGAGAATTTGAGAGTTAC^CTTTTTA^

TTTTAGCATTTTAAGGCAAATTTGACTATTTAGAGTAAAAATATA 

CTTAACATTTGCCAAATAGGTCTAGACATCAGGTAAGTTCTATTTACTTATGCTGTAAAAT 

GTGGCTCATGCCTGTAATCAGTACTTTCGGAGGCTGAGGCGGGTGGATCACCTGAGGTCAGGAG^ 

AGATGAAACCATGTCITTACTGAAAATACAAAAATTAGCCAGGCGTGGTGGCGCATGCTT 

GGCAGGAGAATCGCTCGAACCCAGGAGTTGGAGGTTGCAGTGAGCTC^GATTGTGC

GACTCCATCTCAAAAAAAAAAAAAAAAAATGCAGGTTTATTATTGCATGGATGACGGGAGA 

ACAGACTTCCAAAATATTAGTGAGCAGCATTTCTTAAGAGATGT<^CTGA 

TGCTTGACAAATGGAATATTTrGTTTTGTATGTCT 

TTGAATGTGGCCCAACACAAATTCATAAACTTTCTTAAAATATTATGATTTTTTT^ 

TGTTAGTGTATTTTATGTGTCGCCCAAGACAATTCrTCCAATGTCGCCC^

TGGCAGTGTCTCATGGCCATTATTCTGGGTATTAGTAATCATTG 
AGTAATCCTAGTCTTTTACCTATTCTTGTTACTTCCTGTATTATCTTTTTTTCT 

TG CCCT CT AG AGTT ACTGT AC CTAACGG AG CTG TT ACTT ATGT AGACC GCT AG ATGG CATGATG AG CT CCTG CT CAGT CTG AGAG C 
AGGTTCTTCATTTGTAGATATTTAAGTTTATACCAAATTCCATACAGTTGAGATGGCACT 

TATGGTTCTTGCCTTCATTGGGAGATAGATCATTAATGA

AGCGGAAATATATGTGATAAAATTGAATCCAATTAAAACTTGATGAGTATCCAACCTACAAGTTTGG 

ACCCTATTTATTTATTTATTGAGATTGAGTCTCTCTGTCTCCCAGGCTGGA 

ACCTCCCCGGTTCAGGCGATTCTCATGTCTCAGCCTCCTGAGTAGCT^ 

CCAAAGTGTTGGGATTACAGATGTGAGCCACCTGCCCGGACCCCTATTTAATGTATAG'I I'lTTGTTAAAGTTATATCAGAATTCTA

GCTGAATGATATTTAGAGTCCTTGTTCCTGCTTTGTACCTGGGACACTGCCTTGTTACT 

CCAGGTTTTTTGTTTTGAGTTGAATTAGATAAAACGACACGGACACACGTGG 

GAAGGGAGAAGGCAGAAGGAAGAAGCTCCCCCTTACACAGACAGAGGGAGG 

TG ACAAT ACT AG C CAGTTTTATGTG ACGGCTGGAGGGGGTGGTGTCTT ATTTG CATAGGGCTCAGAGGATTGGTTTGACCAGG CAT 
GTCATTTAGCCCTCAAGAAAACTGGCCCTCCCACCCCAGTCCTTTAATACACAAATGCA
TTATGTGGG<K3CGGCCAT6CTGTTAGGTACCTGATGGAGCAAGGGCAAGAGGACAAC(X»TGGGAA
CTTGCATTTGCATCTTAAAGGTTGCCAGCCTTTAAGAGCCAGGGCTTTTATC 
CTTCCAAGGACCACTTTTCCTCTCCATTTGCCTAAAATAAl'l'TCTTAATAACT 
TTTGTTAGTCCTGTCCTGCATTCTCTTAAAAAATCCTATCTATAAAAAATCTCAGCCGGGC^ 
GCACTTTGGAAGGCCCAGGCGGGCGGATCACCTGAGGTCAGGAGTTCAGGACCAGCCT
ACTAAAAATACCAAAAATTAGCTGGGCATAGTGGCGGGCACCTGTAATCTAGCT 
ACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATCACGCCATTG 
CAACAACAAAATCTCGTTTTTCAGGTTAAGTCCCTAGTTGTGAAAG 

TTCCCTCATTTGAG AATGTTGT AG TCATCATTGGGTGT CATO^TTCAC CAGTTTGTGAAGGG AGTT AAAGG AAAGTG CATT CCT CT 
CCCATCTTTTCTTTCTTTGGGGTTTAATAAATTGTGGGAAACT

TCTTCTGTCATCTTTGCAGCCTCAGATTCCAlT'r-l l AGCTAAGACrGGGGTTTTAAATAACTTGTTTAAGGTTCCTACAACTAATT 

GTCACAATTGGAATTCAAACAATTCTAGGTATTTTTGACTACAAA 

AAAATATATATTTTTTAAATTTTTGAGACAGTCTCTCACTCCGTTGC 

CCTCAACCTCCTGGGTTCAAGCAATCCTTCCTCCTCAGCCTTC 
ATTTTTTTTACITITriTTTTTCAGAG

TG CTTTGGCTC CCAGAG TATTGGG ATT ACAGGCATGAG CCACCAC CT C CAG CCTGATG CITTTT AAAT AT ATAG TATTT AGG TCTG 

GTGCAGTGGCTCATGCTTGTAATCCCAGCACTTTGGGAGGCTGA^ 

TAACACGGCGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGTGGTGGCGGGC^ 
GGCTGAGGCAGGAGAATGGCGTGAACCCAGGAGGTGGAGCTTGCAGTGAGCCGAGATC 
GAGCGAGACTCTGTCTCCAAAAAAAAAAAAAAAAAAAAAAAATATATATATA

AATATATAATTATAAATCTATTATATATTAATGTATATTATATATTACATTGITATATATATTTATATA^ 

ATATAAATATATATATTTATATAAATATAAAAATATAATTATATAAATATA 

ACCTGTATATATTATATATATTACTATTTATATAAATATAAATATATATTTTAAATAITTA1T 

ATAGAGAGAT AATTT AAT AGTTTCTGGGG CATTCTTTTGCG AACATT AG C CAAAACTTT AAAAACCACTC C CAAT ATTG CATTTTT 

TCATGGTATTTTTTATAGTATTTTTATAGTATCCT

GGCTCTGCCGGGCCATGGTTTCTGTCACAGCTACTCAGCCCCGCTCTTC 

AGGCATGGCTGTGTTCAAT AAAACTTT ATTT ATAAAAGTCGATGGGTGATGGG CCACAGTTTG CAGAC CCCT ACCGT AGAGTATGT 

TTTCTGAGATTTAATGTTTCTTGTGATTTACCCCCTTATTCTAAAATTAGTC 

TGTTGGGTTACTTTGTTAGTGTTAATTGTATCCTAACTTGACTrGTGATTCTA 

TGAGCTGCAACTTGATGAATGTTGCTTTTACTCTCCTTCCTA^

ATATACITTGTTTCTGTTTTTAAAACTGATACAGATATG 

TTTAGAAGrTAAATGATATGGGCTGGGCGAGGTGGCTCAC^GGTGTAATCC 

AGGTCAGGATTTCGAGACCAGCCCAGCCAACATGACGAAACCCCGTCTCCACTAAAAACATA 

ACACCTGTAAT CTCAGCT ACTTGGG AGGCTGAGGCAGGAG AATAG CTTGAAC CCCGGAGGCGGJU5GTTGCAGTGAGCTGAGATCAT 

GCCACTGCATTTCATCTTGGGTGACAGAGCAAGACTCTTGTCTCA^

CTAACACTTTGGGAGGCCAAGGCGGTTGAATTGCTTGAGGTCAGGA^ 

T ACTAAAAATACAAAAATT AGC CAGGC ATGGTGGCATG CGC CTG TAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTG CTTG 

AACTTGGGAGGTGTAGGTTGCAGTGAGCTGAGATCAGGCCACTGCACTCCAGCCTGG 

AATAAAATAAGTTAAATCATGTTGTTrAAGGTTAAATTAGCTACTTAGT^ 

TTITTCCAATTCTAGCCTTCTGCTT AT ATATGCT CCTACCCTTAGCCACTG ACAG C AAATATT ATTG C CCTGAAGGTCTT ACT AGG

TTCTTAT ACAATTTATTGAAGTC CTG AAACGAAG AATCAG ATGAG CAG CTACAG AATTTTTTTT AATTGT CT AAAGG CTCITT AAA 
AGAAAATCTC AAAGTTT ATT AAAAAACAT AAGT ATCTATGACT CG AC^ T 
GTTGTGTCTTGTTACGTGTATTCTACCAATTTGGACATAGAAAGAT^ 
AGAAACATGCCTGCAAAATTATGTTTTGCATAATTTATG 

4 5 AGGTGGAAGAGCClTTrCAGTTAT<K3TATTGACAAAGGCTTA^ 

TTAGAGGGGGAAAGTGTGGAAGTTGAAATGCCAAGATTAGTTGAGGAAGCAATCTAGATC^ 
GGAG<3AGGGGTGTCTAGTTGAAGACAGACGAAGATGTTAATACAGTCACACTGTAGGTACTTT 
TTTCCTTTATGCAGTGGGGTTTTGAACAGGGAAAATAATGTAGTAATAACCTT 
ATCTGAGGAGGAAATGTGCGTGGCAACCGGCTCCGGGATGGCTGGCTGGGGGGCCT^ 

GTCCCGGATCAGAGCAGCAGGCCCATCCCCCAACTGTTACACCCTGCTGGC^

TACAGAGGTCAAGAGGTGATGTTTATTAAGTTATATATTTTT^ 
TGATTGGTTGGATCAGGATTCIAGTTTCAGATCAGTTTA 
GTGAAGAAGGACAAGAACTCTCAGATGAAGATGATGAGGTAGTATTTTTTTT 
TTCTATGTTCATTGACTTTGAGATTGAAATAATATTATT(^ 

ACCT CAATG AACATG CTT ATT AT AATGG TCTGTT CACTTTT AAAAACATAT AATT CAACTACTGTTTTAATATTAACTTACTTT CT 
ATAAAAGT AAAACATG CATACAGTTAAAGTCy^GTAGTACCCrT AGAGATCTGTTTCCCAG AAGTGACTG CCTTTAACATTTCTAG 
CTATTTCCCTATTATTTAATATCTCCATATTATTTATTTATGAAGTAAAACCACCTAGCTGAA 
OlTCCTTTGTATTGACrrTTACC^TTGTGGGTAAGGATTTCTCTCT 

ATATTGTCTAAGGCTTTCTCATATATTGTAGTACATGATATTTGTITAGGA

GGGCGATGAATTGATGCTAATGAATGTGTTTTATTAGGTATATCAAGTTACTGTGTAT 
TTGAAGAAGATCCTGAAATTTCCrrAGCTGTAAGTATACATCrACTl'r 
GACTATATCTAGCTTCTTTCTGAAATGAACTGATTTTTATAAAGTTAAAATGTTTT^ 
CTTTTATTTGTTGTGATGAGACCTGTAGCTATAATTTTTATTTTTAGGTGCTG 

GGGTCTTACCCTTCTCCCAAGCTGGAGTGCAGTGGTGCAATCTCAGTGCACT

TGCCTCAGTCTCCTGAGTAGCTGGAATTACAGGCATGCACCACTATACCCCACT 
ACCATGTTGGCCAGGCTGGTCIATGAACTCCTGACCrrCAAGTGATCT 
AGTCACTGCCCAGCTGAATTTGTACATTTTGATGTTATTTTGGAAAATT^ 
CTATTACCAACATTTTGCCAACCTTGTCAAACGTAAACTTTTTTTC 

GAGTGCAGTGATGTGATCATGGCTCACTGCAGCCCCCACCTCCCAGGCTCAAGTGATCCTC^

AATAACAGGTGCATGCCATGACACCCCAGCTAATTTTT 
CCTGGGCTCAAGTGATCCACCTGCCTCAGTCTCCCAAAGTGC^ 

TTTTT ATT ACTGATTTG T ATT AT AGAATG TGGT AGG ATTTG CCATT AAAG AAATT AACACTT AAATGTTT ACAT CT CATG CAT ACT 
TCACTGAGTT CCTGAGG T ATT AT AGGTITT TIT AAAACT ATTGTG CTTT AG I a i 1 T GTTCTTAATATTGCCTTTATTATATTTGAA 
7 5 TAGGAAATAAATT CCAT1TTTTCTGAGTTATAGACAGTT ACT AATT ATTCAAGTG 
AATTTTTACTGTATTAGAAAATAAATAACTAATTGAATCATTTGATGTCAAGTTGAGTAAAA

AAATCT ATTTT AAG ACTCAAATT ATTTT AGAT ATT CATCTTGG AATTT ATTGG ACTTTTTGATG CACTTTT ACT CC CAT ATTG CAA 
GACC l'TT'ITC'll I GGT1TT ATGAGCTGTG AAAAACCC CCTTCTATAACTTGAAAAAAGAAGAATGAATAGTCTATCTTTATTC 
CTGTCTTAACATCATTTGTGAGGCAATCTCAAAATGGTTACGGAAATACCT 
GGCAAGAACCATGCTTCAGCTAAGGCAAGAAGATAGGTTATC

GG TGG AAAGGTTTGTGGG AT AAAACGTTGGGAAAAGGAAG AAG CATTG AG CAGTTACAAAGATG AG AACACAGG AAAGGAT AG ACA 
GG T AAAGGAAAGGATATT ACGGTT AGTGACCAAGGATG ATAGGTTTT AGCA^ CATTT CT 

CTTGTGGATGGAGTTAGGGAATACTCAGTTAAGCCAT^ 

TCTTT CTCG AGG AGG CAGGTTT CATCT AGATTG ATGGG ATGGACTCTTGGAAT AG CGC CT AGGG TTGAG C CTTT AAGGG AG TGG AT 
ATAGGCTGCATGACAGAGTAAAAGATCCCATAGACITGCTTC

TATAGGTACATGATTAATATCACATCACAGTTAATATATTTTTAGTT^ 
TACTTACTCTACTTTAGGAGAGACTGTTACATATTGACCTTTAAAAATCT 

TGACACACATGTAAATAGTGAAATGATC^CTACAGTCAAGCAAATTAATGTAATCCATTATCTTGA^ 

GT ATGGTT AAAAT ACCTAAAAT CT ATT CTTTTAGT AAATTT CCAGT AT ACCAAT AT AT AACT AACT AT AGT CCT CATG CTG TTT AC 
AGTGACTATTTTATATAAAACATGAAACACTGAATATT^

CCTGTGACTGAGCAGTTAAAGGGTTACAGAAACTGACTGTGTGTCT 

GAAATGAATCCCCCCCTTCCATCACATTGCAACAGATGTTC^GCCCTTC 

AGGGGAAATCTCTGAGAAAGCCAAACTGGAAAACTCAACACAAGCTGAAGAG 

TGAATGATTCCAGAGAGTCATGTGTTGAGGAAAATGATGATAAAATTAC^C^^ 
CAGCCATCAACTTCTAGTAGCATTATTTATAGCAGC

GAGTGTGGAATCTAGTTTGCCCCTTAATGCCATTGAACCTT 

GCAAAACAGGACATCTTATGGCCTGCTTTACATGTGCAAAGAAGC^ 

ATTCAAATGATTGTGCTAACTTATTTCCCCTAGTTGACCTGTCT^^ 

TAGACAACCTGAAATTTATTCACATATATCAAAGTGAGAAAATGCCTCAATTCACATAGAT r 

ACTTTGGTAGTGGAATAGTGAATACTTACTATAATTTGACITGAATATG

TTTCTACTCTGTCTTAAATGAGAAGTACTTGGTTTTTTTTTTTC^^ 

CCATTCTCCTGCCTCAGCCTCCCAATTAGCTTGGCCTACAGTCATCTGCCACCACACCTC^ 
ACAGGGTTT^CCGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCACCTCGGCCTCC 

CAGGCATGAGCCACCGCGTCCGGCCTAAATGTCACTTAGTACCTTTGATATAAAGAGAAAATC

TTTTTGTTTGTTTGTTTGTTTGTTTGTTTTGAGA 

CGCTTCCCGGGTTCAAGCCATTCTCCTGGCTCAG CCTCTGGAGCAGCTGGGATTACAGG CATGCACCACCATG CCCAGCTAATTTT 

TGTATTTTTAGTAGAGATAGGGTTTCACCATGTTGGCCAGGCT 

TCCCGAAGTGCTGGGATTGCAGATGTGAGCCACCATGTCCAGCCAAGAATT 

3 5 TTTTTTTTTTTTTGAGACAGAGTCTTGCT 

CTGGGTTCAAGCTATTCTCCTGCCTCAGCCTTCCAAGTAACTC^ATTACAGGCAT 
TTTTAGTAAAGACAGGGTTTCACCATGTTAGCCAGGCTGATCnTGAACTCCTAAACT 
AATGCTGGGATTACAGATGTGAGGCACCTGGCCTC^GATTTTTGAT^ 
ACTCTTTCTAGGTTAAAAAAAAAAAGGCTCTTATATTTGGTGCTATGTAAATGAAAATG' 

AGAGACAGGGTCTCCCTGTGTTGCCCAGGCTGGTCTCGAACTCCTGG^

AGTAG^ATTACAGGCGTTAGCCACCACACCCC^CTGTAAAAATGTACTTATTCTCCAGCCrrCT^ 
TGGGAGTAATGATGTTATCTGTGAAAATAGCCACCATTTACCCGTAAGACAAAACTTGTTAAAGCC^ 
ACATC^GGCCCTTTTTCACACACAAAAAAATCCTTTATGGGATTTAATGGAATCTG 
AAGACACTTTAAAGTACCTTCTTGGCCTGGGTTACATGGTTCCCAGCCrAGGTTTCAGA 

CCGTGAATTCAGAAAAGTTAATTCAGAAATTTGATAAAC

GAATTATTCAGAAATTATGCATCATTTTCCrrTCAAGAATGACAGGGTCAGCATGTGG 
AAGGTCCGTGTTTGGTCAGTG^AGGCCC^TCCXSAGCTCAGCACTGAGAAGTG 
TCATGATGTTCATCTGCAGCTGTTGCAAGGTGTTCAGATTGTAT^ 
AAAAGGTTAGGTGGACTAAAGCATTCTGTAAAGCAACTGCTAATAATGAGCT^ 

AGCCTGTCAAATATCTGCAAGAACTATGGAATAAAACTACTGATGCAGTGAAGACAGTTGAA

ATTTATAATGAACAAATTCAAGAAAAAGGACTACGGAAAGTTCAGGACATCAA^ 
GCAGGCAAAGGAACGCAGCTGGAAGAAAAGATGATATAACAGTTAAC^ 

AT CT ATAACCATTT CTAT ATTTACATTTGAAAATCT CCTTTGG AG ACTTAGAAC CT CT AAATT ATTGACTT ATTTTTT AT AT AAGG 
TCACTCCGATGAAAGGTGATTACAAAATCATCTACATTGCTGTCTACAAAACAGATAATATGGATGTTTGATCGCATCT 

AACTCTTTACTGATATGTTTGTAAATACAGAAGTGAAATGTGGACATAAAATAGT^^

TGTAATTAATGACATTCAAAAATTTATGGCTAGTGATATATATAAAGTAAAAT^ 
AAGGG AGGATATAAGGAACCAACAGTTTGTATGAAAATAGCTCAAATAATATCTTT^ 
TTATTAGTGTCTTAGAACAAAATGGCCTTATATAATGAAGCCTAGTTAT^ 
ATAGTTGGGGATGAGAGCCGAATAAGGTTTGCCTGAAATAACTGACACTATATAATTTCTC 

6 0 TGTCATTCCTGGTAGAACAAGCriTATTTTTCG AGCCTAGCAATG AT CT AG AAG CAG ATGTTATCT CAGTGCCT1T1X3CAATTTGT 

TGTGTGGGTTTTTTTTTT'rTTAAAGCCAC^CAATAATTTTGGAAAACAA 
AACCACTTTTAATGGGTACAGAGTTAAATTTGAAGGAATAAGTTCTAGCTGAAGTA 
TCCAAAGGTAAAAGTACTAATCCCTTTGGCCATTTATTGAGAGAGAG 
TGTTCTTGCTACAAATAAATGATATTTGAGCTGATGC«GTGTGCTA^ 

6 5 ACATACACCATATTTACAATTATGTATTTAA<^TTTAAAATTTCTAATATAAGTATCTCTCAAACT 

ATATTTAAATATGAATCTTAAGCAAAACAGTGAAAATAACCATCTTGATT^ 
AGGTGAAGACAATAAAATCAACTGAACTGTAAGCTTAGAATAGGACTGAGGTAAT^ 
TTGCTTCAAAACTCTCTCTCTCTCTCTCTGTCTGTCTCAATAAATGGC 
CATTATTTGGAGTTGATAATACTTCAGCTACAACCAAGCAGAATCTCTTTT^ 

AATACTTCAGCTTCAATTTGGAGTTGATAATATTTCAGCTAGAACCTAOT

TTATTGGAGTTCATAAT ACTG AAG CT AG AACCAAGCAGAATCTGTTTTTT T CTG AGGAG T ATCGGTAG CAT AAATG TGATT AT AAA 

CATAGTACACTTC^TATATC^AGGCAGTGACAGCTATTTTTACAAAATT^ 

TAAAATGGCCTGATrTCTTAGGTTCTTTATAGTACACGTC 

TCTTAAAl'ITT TATGAGTTGTTAAAATAG AAATT ATTTGAATATCATATATTTGGG 
TTTTTCTGGAATGGCCATGCCTGCCCACTCTAGAAATACAAATATCACT^
CTG AAGAATCTGGTCCCAGTT AGTAGAGTAAGAAAGG CTGGTTCTCCCTTTTTTTTTTGGGTC CCTGGCCAGCCTGGCCTAAGTGT
TCGTGATTGGTCTATAAGGGAGGTACCTGGCACAGGGAGGGGAGTTTGGGTTAGCCACC^ 

ATTGTACTGGCCTGTTTAGTTTTCGGCCTC 1 1 C'ri' l 'GAAAG^CCACCAAGCTGGGAACCTCTTGATTGTGAGGCACAAATGTAAG 
T AC ATCAGAAAAAAACAAAAAAACTGG CTTT AAAG CAGG AG CTTGTGGGCC CCT AAGC CAGACGGGGACT AGCTTTTGGCATT ATA 
TAATTAAGATTTTTTAAATCCTTAATAAGGGTTTTATTTT
TGGAGTACAGTGGTGCAATCTTGGCTCACTGCAACCT 

TGGGGTTAGAGCACCCTGTCACCACGCCCCGCTAATTTTTGTATTTCTAGC^ 
TCAAACTCCTGACCTCAAGTGATCTGCCCGCCTTGGCCCCCCAA^ 
AAGGGTTTTAAAGATAATTAGTCTGTAGGTCTGTAGGCTTATGATGGTAACCA 
1 0 TTG CG CTTT ATGGGTGG ATGCTGAATT ACATTTTGATTTG AT AC^ 

AAAAGTTTCTGATCATATTGTCTACCATGTAGCCAGCTTTCAATTATATGTAAG 

G AGGT AGAT ATCTGAAAG CAC CAGCACTTGGAAGGTGTTCAGAAGT AACAAATT AT AAAAT^ CAAAATA 
AAACCGTAAAGCAAGOIGATGGGAGGCGTGTTCAGTAACTTATTCATA^ 
TTAGAGTAATAGTATTTTGAATGAAAACCATAGTTGATTGTCTTTAGAGAG 
CTTTCCTTTAAAGCTCTCCTTTAAAGACTTTAAAAAA

TGCCTTCAGAATGCTAC AACTTAGATGTCCAGC CAACAGCTTAAAGCC AATTTATAAAAAT AG CATCTTTCCTAATTCCAAAATTT 
CACClTTlGTTTG'lTTrGTlTTGAGATGCG^ 

CTCAACCTCCTGGGCTCAAG^TCCTCCCACCTCAGCCTCCCAA^ 
CCTTCCCCCC^CCTCCCCCGACrcCGCTGCCACCCGCCCT 
ATCTCGGCTCATTGCAACCTCCACCrCCCAGGTTCAAGCAATTCTGCCTCAGCCTCCT
ACCACACCTGGCTAATTTTTTTTGTATTITrAGTAGAGATG 

TGATCCGCCCACCTCGGCCTCCCAAAGTACTGGGATTACAGGCGTGAGCCACCGTGCCCGGCCTA^ 

ACAGTTTCACCATGTTGCCCAGGCTGGTCTCAAACTCCTGGGGTC^ 

CAGGTGTAAGCCACCACGCCAAGCCCAAAATTTCACCTTTTAAATTTATTCCAT^ 

GGGTTAAATTTTAGCGCTTTTGCCTGTGTCTATTTTTGGT^

AAGATCACAGTAATATCTCCCTAAATGGATTTC^CTACGTTTGGTCCT 

TGCTTCCCTGATTTAAATATTGGGGGAAAAACTTGCTTTTTGGA^ 

ACACTATGAAATGGGTGCTTGCTTTTCACTTTTCAAACCT 

AAGTCACAG ATCTGG T ACTT ACTCX5CCTAGTGAC CTT AG AG AC A^ CT AT AGAG CCT AG T ATGGGGTCTGG CAGGTAGTAAG 

CACTTTTAATGAATGGAAGTTTAATATTCTGTTTGCATCCACTCGACCATAATCAC

AC CCCT ACT C CAC CTTG AAGTITI'IT ATTTG TTTTTTG CTGAG ACAGGGT CT CATTCTGTCACCCAGG CTG GAGT AT AGTG GCGCA 
AACATGACTCACTGCAGCCTCCTGAGTAGCTGGGAACACAGGCCTGTTGCCACCATGCCT 
TACATATATTACATATATATATATAATACATATGTGTITATGTATGTATC 
GAGGCAGAGTCCTGCTATGTTGCCCAGGCTGGCCTTGAACT 

TGCTGGGATTACAGGTGCGAGCCACTGCATCTGGTGAATTAAGCCT^

TATGTTCCTTTGTATCTTCITGCCATAGCAGGTTGTAACATCCTTTTGTATCT 
GTCCCGCTGTTTAGCCCAGGCCGGATTGCAGTGGCGCAATCTCGGCT 

GCCTCAGC CTCCCGAGT AGCTGGGACTACAGGCGCCCGCCACCGCG CCTGGCTAATTTTTTGTATTTTTAGT AG AGACGGGGTTTC 
ACCGTGTTAGCCAAGATGGTCTCGATCTCCTGACCTTGTGATCCGCCCGCCTCGGCCTCCCA^ 

CCACCGCGCCCAGCCGTATCTCCTITCAAATGTCAAAT

CAACTATCAAGTTCTTGTTGATTTAGAAGACAAAAAGACAAT^ 
ATCTCAA»CITTGGGAGGCCAAGGAAGGAGGATCACTTGAGGCCAGG 
ATCCAAAAAATTGTGGTGGCATCCATGGGAGACTGAGGTGGGAGGATCACTTGAGCC 
rrGCGCCACTACACTCTGCTTAGGAGACAGATGGAGACTTGTCTCI^ 

TCTCACTGATTGTGATTTTTTGTTTGTTTTTTTGAGATGG

TTTTTAGTGGAGATGGGGTTTTGCCATGTTGGCCAGGCT 

AAG TG CTGGGATT ACAC<JCATTGAG CCACTGTG C CTGGC CTG ATT ATG ATTTTTT AAAGT CC CC C C C CAAATT ATTG TTTGTG CAA 
TAAATGGCAAAAAAGAAGCATTCTAAGAAAAAAGTAGTCTTAAAAGTTGATATGGTGT 

GAAAAGCCTTTTTAGAAAAAATTGCTCCCTACAAAAAA

TGAGAAGACCAAATCTCTTCAAAGTTAAG C TT T TTG rm ^ 
TGGCATGATCTCGGCTCACTGCAACCTCrGCCTCCTGGGTTCA^ 
GCGCCTGCCACCATGCCCAGCTAATTTTTGTATTTTTAAT^ 
ACCTCATGATCCGCCTGCCTCTGCCrCCCAAAGTGTTGGGATTACAGG 

TAACCACAAGGTGGGAGGTGGAAGTATCTCGTTGGAAGACTAAAGGTGTCC^

TGGAGTCIT ACTCTGTTGCCCAGGCTGGAGTGCAATGGCGCG ATCT CCTCCCAGGTTCAAGTGATTC 
TTCTGCCTC AG CCTCCTG AGT AGCTGGG ATTACAGGTGC ATG CCACC ATGCCTGGCTATTTTTCATATTTTT AGTAG AGATATGGT 
TTTGCCATGTTGGCGAGGCTGGTCTTGAACTCCTGATTTCAC^ 
ATGAGCCATTGCACCCGGCCTCAAAATTATTTCTAATGTGTGCAAAGATATCT 

6 0 NNN^GCTGGAGTGTAGTGGCATGATCTCGGCTCACTGCA^ 

AGTAGCTGGAAATTCAGGCGCCTGCCACCATGCCCAGCTAATTTTTGTATTT^ 
CTGGTCTCGAACTCCrGACCrCATGATCOSCCTGCCTCTGCCTCCCAAAGTGTTG 
CAAAGTTAAGCATTTTTTAACCACAAGGTGGGAG^TGG 
TTTTTTTTTTTITGAGATGGAGTTTCACTGTCGCCCAGACT 

6 5 AGGTTCAAGTGATTCTCCTGCCrCAGCCTCTAGAGTAGCTAC^ATTACAGGGG 

CAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCT 

G CTGGG ATTACAGGTGTG AGCCACTGCGCCTGGCCG' l"! T ' l TT1 i 1 1'1'GTTGTTGTTGTTGTTTGTTCGTTTGTTTGTTTTGAGACA 

GTCTCTCACTCCAGGTTGGAGTGCAGT^CACTATCTCAGCTCACTGCAAACT 

AGCCTCCCAGTAGTTGGAATTACAGGCGCATGCCACCATGCATGGCTAATTTTTTGTA 

CTAGCCTTGAACTCCTGACCTTGTGATCTCCCTGCCT

TGAACTCCTGACCTCGTGATCTGCCTGCCTCGGCCTCCCA 

HUMAN SEQUENCE - mRNA 

G CAC CGCG CGAGCTTGG CTG CTT CTGGGGC C~1^TGTGGC CCTGTGTCTCIC-G AAAGAT^G AG CAAGAAG C CG AGCC CGAGGGGCGGC 
CGCGACCCCTCTGACCGAGATCCTCCTGCTTTCGCAGCCAGGAGCACCGTCCCT
CTGGCC CGGAG AGTGGAATGATCCCCG AGG C CCAGGGCGTCGTG CTTC CG CAGTAG TCAGTCCC CGTG AAGGAAACTGGGGAGTCT
TGAGGGACC CCCG ACTC CAAG CG CG AAAAC C CCGGATGGTG AGG AGCAGG CAAATGTG CAAT ACCAACATGTCTGTACCT ACTG AT 
GG TG CTGT AACCAC CTCACAG ATTCCAG CTT CGG AACAAG AG ACC CTGG TT AG ACCAAAG C CATTG CTTTTG AAGTTATT AAAG TC 
TGTTGGTGCACAAAAAGACACTTATACTATGAAAGAGGTTCT^ 
AGAAGCAACAACATATTGTATATTGTTCAAATGATCTTCTAGGAGATTTC
AAAATATATACCATGATCTACAGGAACTTGGTAGTAGTCAATCAGC^ 
GTGTCACCTTGAAGGTGGGAGTGATCAAAAGGACCTTGTACAAGAGCTTCAGGAA^ 
GACCATCTACCTCATCTAGAAGGAGAGCAATTAGTGAGAC^ 
CACAAATCTGATAGTATTTCCCITTCCTTrGATGAAAGCCT 
1 0 CAGTGAATCTACAGGGArcCCATCGAATCCGGATCTTGATGCTGGTGTAAGTGAACAT^ 
TTTCAGATCAGTTTAGTGTAGAATTTGAAGTTGAATCTCTCGACT 
GATGAAGATGATGAGGTATATCAAGTTACTGTGTATCAGGCAGGGGAGAGTGATAC^ 
CTTAGCTGACTATTGGAAATGCACTTCATGCAATGAAAT 

AG AATTGGCTT C CTG AAGAT AAAGGGAAAG ATAAAGGGGAAATCTCTGAG AAAGC CAAACT CTG AAGAG 
GGCTTTGATGTTCCTGATTGTAAAAAAACTATAGTGAATGATTCCAG^

AGCTTCACAATCACAAGAAAGTGAAGACrATTCTCAGCCATCAACTTCTAGTA 
AGTTTGAAAGGGAAGAAACCCAAGACAAAGAAGAGAGTGTGGAATCTAGTTTGCCCC^ 

CAAGGTCG AC CT AAAAATGGTTG CATTGTC CATGGCAAAACAGG ACATCTT ATGG CCTGCTTTACATG TG CAAAG AAG CT AAAGAA 
AAGGAAT AAGCCCTG CCCAGT ATGT AGACAACCAATTCAAATGATTGTG CT AACTT ATTTCCCCTAGTTGACCTGTCTATAAGAG A 
ATTATATATTTCTAACTATATAACCCTAGGAATTTAGACAACCTGAAATTTAT^

CACATAGATTTCTTCTCTTTAGTATAATTGACCTACrrTTGGTAGTGGAATAGTGA^ 
CTCATCCITTACACCAACTCCTAATTTTAAATAATTTCTACT 
TATATGACATTTAAATGTAACTTATTATTTTTTTTGAGACCGAGTCTTGCTCT 
GCTCACTGCAAGCTCTGCCCTCCCCGGGTTCGCACCATTCTCCTGCCTCAGCCT 

CACACCTGGCTAATTTTTTGTACTTTTAGTAGAGACAGGGTTTCACCGTGTTAGCCA

CCGCCCACCTCGGCCTCCCAAAGTG^GGGATTACAGGCATGAGCCACCG 

HUMAN SEQUENCE - CODING 

ATGTGCAATACCAAGATGTCTGTACCTACTGATGGTGCTGTAACCACCTCACAGATTCC^ 

ACCAAAGCCATTGCTTTTGAAGTTATTAAAGTCTGTTGGTGCACAAAAAGAC^

GCCAGTATATTATGACTAAACGATTATATGATGAGAAGCAACAACATATTGTATATTGTTCAAATGATCT^ 

GG CGTGC CAAGCTTCT CTGTGAAAGAG CACAGG AAAAT ATAT ACCATGAT CT ACAGGAACTT GGT AGT AGT CAAT CAG CAGGAATC 

ATCGGACTCAGGTACATCTGTGAGTGAGAACAGGTGTCACCTTC 

AAGAGAAACCTTCATCTTCACATTTGGTTTCTAGACCATCT 

3 5 GATGAATT ATCTGGTG AACGACAAAGAAAACGC CAC AAATCTG ATAGTATTTCC CTTTCCTTTGATG AAJiGCCTGCCTCTGTGTGT 

AATAAGGGAGATATGTTGTGAAAGAAGCAGTAGCAGTGAATCrrACAGGGACGCC^ 
AACATTCAGGTGATTGGTTGGATCAGGATTCAGTTTCAGATCAGTTTAGTGTAGA^ 
TATAGCCTTAGTGAAGAAGGACAAGAACTCTCAGATGAAGATGATGAGGTATATCAAGTT 
TAC^GATTCATTTGAAGAAGATCCTGAAATTTCCrrrAGCTGACT 

4 0 CATCACATTGC AACAG ATG TTGGG C C CTT CG TG AG AATTGG CTTC CTG AAG AT AAAGGG AAAGAT AAAGGGG AAAT CT CTG AG AAA 

GCCAAACTGGAAAACTCAACACAAGCrGAAGAGGGCTTTGATGTTCCT 
ATGTGTTGAGGAAAATGATGATAAAATTACACAAGCTTCACAATCA 

GCATTATTTATAGCAGCCAAGAAGATGTGAAAGAGTTTGAAAGGGAAGAAACCCAAGACAAAG 
CCCCTTAATGCCATTGAACCTTGTGTGATTTGTCAAGGTCGACCTAAAAATGGT^ 
GGCCTGCITTACATGTGCAAAGAAGCTAAAGAAAAGGAATAAGCCCTGC
CTTATTTCCCCTAG 
Table 42

MOUSE NOMENCLATURE 
ICSGNM Prlr 
Celera mCG18319 


HUMAN NOMENCLATURE 
HGNC PRLR 
Celera hCG3710X 

MOUSE SEQUENCE - GENOMIC

TCAGAATTTGACTGTCCTAAAGAACCTTCTTCCGCTGTGCGAGTCT 
TCTCTTTTTAAAATTTCCTTTTCATGCTCTGTT^ 

CCCTGGAATGACGTGCTTTCCTGCTCTTTATTTATCAGAGCCT^ 
GTTTTTGTTATTTTCCAGTTAGCATACAAACTGTTTC^

TTTACTTTGTTTAATTTTACCCTTGTCCCTCTTCACCCTTCCT 

TCATATAGATATGTACACTCCCCCCACACACAC^TACCCCA 

TTCTATTGGGGAAGGTAGGGTAGGTATGCCCCACXXn'GAGCAC^ 

CTGTGGGATCCAGGGTTCAAACCTAGGTCATCAGGCCTGGACCAAAGGGCCT 
2 0 G TT T1TT CCCCIT (IAAT CTGCAT ATG AT AGAAAACTTGT AGT ATI"!' CTTGTTCAAAAT CTGG CTT CTTTG ATTT ATTTG CCT CTCT 

AGTTCCATCrrAGCATACAAACTATTTCAGAATGACACTCT^ 

GATGCATACATACATACATACATACATACATACATACATAGATACAGATAGACAGAGACATT^ 

TCTGCT ATGT CT ATG CAC C ACATTTG CTCTATT CAGTCAT CT GCT AAT AGACACC CATGCTGGTTC CATATCCCTACT ACT AT AAA 
TAGTGCTACATGAACATGAATGTACAAGTCATCTCTCTAAAATGCTGCCTC^ 

TTTGGTCATATAGTCATTCITTTTTGTTTCTGAT^

GCGTATGAACATCTGTTCCCCTACATATCTGCTCCAGAACTTATTGGATTTTT^ 
TGGGATGGAATCTCAGTTCTGTTTGTTGGTTGGTCrc 
TGGACCAGGCTAGTCTCAAACTCTCCCTTGATGTTCCTGCTTATTTTTCCT 
GCTTCATTGCAGCTTTCTGAAGTTTTATTTTTAGAT^ 

TGTGGGCCACATATGTGTGTGGTCCTTGCAGAGGTCAGAAGAGGGTGTCAGATCTCCTGGAACT

CATATCTGAAAACTATACCCGAATCTTTCCACAAGAGTATOTATGCTCATAACTGCT 

AATTTACATrrrCCTATGGCTAATGTT CCTATGAACAA TTTI AT ATITT ATATGTATTTATTGGCTAATTCAC TT C TTT f GAGAAC 
TGCCTTTTTTAATTTATCTGTCCATTCATTGATTGACTTTTGTCT^ 

AT ATTCTAGAT ATATAAAGATT AAT AGTTGGC AGG AATTTTTCT C C C AAAG AT CCTGAACGTTAATG CTTT CC CTCAAGGTTCACT 

TTAGCTGTCTTCCTTAAAGCTTCATAAAAAAACTAAAGTCCTT^

TGCAGTGCAGGAAACTATCTGGATCAAAAACGTGTTCTCGTGATTC 
GTGCTGATAACAGTCCTCCATACCriTAGCACGTAAGTCAATCATTAACAATGGGCT 
AGTCAGAGCTACAGAAATCTCCTTTCTTGAGGTAGGCAGTGGAAGGAAGAGTC 
GCAGGCTGCAGCCCAACACGCAGGGGACGTGTCCGAAC^GAGGAGAGTC^CAGGTAAGGAATACGG 

4 0 CCATT CTGATTCTCTCTTCTCAC CTTTCCAATCTGTGG ACT AAAAATG AACAGGGTCAAGGTGTGATTACAGTATCAGTGGTAG AA 

AGAGX3TTTCTATGGACTGACCTGTCTTTTAACAAGAGATG 
GACTGATTTCTTACAAACATCTGACTTCTTCAAGACTCXK3AGTT 
GCAACAGCAGTX5GTCACATGACAGGCCAGTCCTTGGGGAGGATAAAGATC 
TCAAAGGCAAAAGGAGGTTAGATGGTAAATTTGAAAGAATTGCTACAAACT 

TTTTGACAGACTTTGGGGGTTTTTTGGTGTTTT^

TTTTTTTGCAATGATTTGTTTTAAAATAAATTTGTCTTCT 

CTG CT ATT ACATGTGG CAT TCTT GTG ACACATTT ATGACG ACG AAC CAACGTT AG CATG CT ATCT ACTTAAGCT CTT ACACT AT CT 

ACATTAACATTGAGTATTTGTCTCAGATATTTTTATG 

CCATGATCCTAAAAGTCCCATGTTCTACCTAGTTCTCTTTCCCCACT 

GACTCCIATGGCTTTACCCGGTCTAAAGGGTCGCCTGGTTGCAATC

GTGCTTTTGAGTTTTTTCTGCATCITTCTGTGG 

CTACAGCAAATGTATCCAGTTCGGOVAATGTTTTAGCATGGTAGAAAGTG 

GTACTATAAACGTGATAGAAGCAAGAAGAACACGAAGTAGAAAAACTAGCT 

AATTTTTTTAAAAGCTCTGGTTCTAGGGTGGCGCCAGCTCTTTTCT 

CAAGAGCCTACACATGGTAGGCACCCCATAACATTTTT^

(ZACCAGCTTATTG^SCTCAGAAGCGATATGAGGGAAGCTGTAGCT 
ATTCCCTTGATAACCTTTGTTTGACCGTGGCTTATGGA^ 
TAGCCCAAACAATTTCCATGTAACAACAACAAAGGCTATGAGAAGATGAGCC^ 
CAGAGAACAGTCATGTACCATTATCAGTAGCCCATGCAATGATGTAGGAGCCC!AGTCCAAGGTGATG^ 

6 0 GTAGTTGTCCAGGGAAACTGATGGGCCAGCTGGGGCTTG CAGAACCTAAAGAGACCTGCAAGTAGCAATTTG CT AACTCAGTGGGG 

CT ACCATG CAAAAAAAGCTGAT CACCCAGG CT AGT C AATGT ATTG GTTCAAAAT C AT CAACTG CTT AAAAC CAGG TTCTCTGC CAG 

AGCATAGCATTGCTATTCCTTAATGCCCAGATTCCl'rTCTTACTGTGTGATGAAA 

TTGTGATTCAGAGCITTTCATGTCCTCrACTAAGTTAACTTACACA 

AGACAAATCTGGGAACTGCITTGCCCAAAGGCTAT^ 

TTATGCTTACACAACAAGAGGCCnXSACCATCATTCT

TGTAGCTCTCTATCCCACTCAAAGCrCCrGGATOGGAATACITGAAGAA 

CTG CCGAC CT AG AGATCTT CAAG CCACAGGG CAATAG AAACATCT CAAG CAACTTCCGTGTAATGAGAAATGAG CAG AGGAG AATC 
AGACTCTGATTCCAACACAGCATCAGCTGGGGAAGCAGCrrc 

TCTAGAGCATATCCTGAACATAAAGTGCCTGAGACAGGAACAAGCAGTACCTAGGCT 

CTTGTGCAAAGCACCAGACTCACTTTCAGAACCTGGGGCAAGTCT

CTATAGGATGGATTTTTTTGTCATTTTCATGTGAG 

TGTTCT LTTTTT CTGTGTGT ATGG TG CCAGAT AATTGACTC CAAATTT1T CTG GG AG AAATGTGCATCT AT AT CT CAAG ACTTTGT 
TTTCAAGGAACTAACCAAATAGGATGTCCATCTTACTTCCTTGTTTATTTGT 
AAAATCTAAGACCAAGTCCAACATTTGoTAGTAGGGGA^ 
CAGACAAACCTTCTCGT C TT C TTGTCCACATTTTCTCT
ATGCTCTTCTCTCCACCTGCCTGATAGTACTCAGCTC

T ATCC C CCAAGGGAT ATGG CGTGGGTGCAAAG ATG AG AGCACAAGGCAG AG AATGTCACGAG ATGG ACACAAT CT CTTC CAT AG AA 
GCCCTCATTTTTAGGGATATTAAAGGACAGACACTATGGC^GAAAATTGT^ 
CCTTTGACCATCCATATGTTCTCTCCATGTAGGTTAATCAACCACAAATTATATAAC^ 
5 TG AAG TAGC CCTTGTTC CCAAAGAGCT AATGGGTG AGACTGG CTCATGGGT C CATG ATGGTGGCAAAAAGT ACGG TGAC CTGGGAC 
CACATCACAGAAACTTGTAATGAGGTCAGCT 

GAAAGATGAAGCTGAGAGGAAGATGTCTGCGGAGAGCATTACACTGGAAAG^ 
GTAATACAGTGGTAGCTGGAAGACTGCAGGTCAAATAAAGGAAATTGT^ 

ACAGGG CTAGC CCCTCTGGATGGACTT AAG C CATGCAAGAAGAT AG AGACT AAGGAAAAC CATCAGAGCATTG CAGAT AG ATG AGG 
CATGAAGGGAGATTTGCATTTGAGGAAGGAAAAGTCT

AAACACGTGGGAGGCTGTTGACATGAGGTGGCTTAGAAACAGTGTTCCTAGGAAG 

TGATGACCCGTGAGCAAGACTTTCTAGAAAGAGCCGGTTCATTAATCACGTCACTTCT 

TTTATTTTGTATCGACAACACTATCATGTAAATATTCC^ 

AGGGAGCCAG<rrACCTTGCATTTCTTCACTGCCTCTGTAAA 
TGCTTGTTCTAGCACAGCGGAAGAGGATAACTACCAGGGAAACTCGCCAATCACAAGG

TGTTG TCAGTT AGGTTTCATTTCAAAG T CAAGATTGTT CAAGC CAAAGT ACAATTGGTTTTAAATGCTGT CTTT AAAG AAGGAAT A 

TGTTAGAG AAAATTGCTTGACACTCATTAGTGTGAAG CATGAGGG CAG CACTT CATGG CTGTG ACTCAG T CT AAAAATTT CATCAT 

CTCTACTAGTTTGCCAGGGTGGCCATAGCGAAGCCTAGCCAACTGGGGGATCT 

AGGAAACAGAAATCAAGGTGATCAAGGTAGC^AAAGTGGCTG^ 
TCACTACTAATGGGAATCGGTTGCACTCCTTXK3CTCATAGAACAC

CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 

ACCGAGCCATCCCATTTTAATAAGTTAGACTGAAGCAAAGCCCATCATAATGGGCTC 
TTTTTCCAAATAAGATCCAATTCTGAGATGTCAAGTGTTCTTATGT^ 

AGTCCAAAT ACCACC T AATATGATTTCAAAATT AAAT AAGTT AT CTTCT ATTT AAAAAAAAT AAAAC CTTGAAG AT CCTCCAG TTT 

CTGAGTTCTTCTTCTGTTTCTTTCTTGAGAGACT

ATATTTTATATTATTTGTGAATATTATGAAGGGTGTTGTTTCCCTAATTTCTTTCTCA 

CTACTGGTTCGTTTGAGTTAATTTTATATCCAGCCACTITG 

GAGTCGTITATGTATACTATCATATCATCTCCAAATAGTGATACCITGACTT^ 

GTGGGATTGCTTCAAGTATCTCTCCATTTAACTTGATATTGG CT ATTGGTTTGCTGTGT ATTG CTTTTATTATATTTAAGTATGAA
CCTTGAATACCTGATACTTCCACCACTTTTAACAT 
GATGTCATTTTTTTTCTTTCAGTTTATTTATATAGTG 

GAAGCCTACTTGATCATGGTGAATGTGGTTGATCGCi riGATCrGTTCTTGGATrTGTTTTGTAAGAATTTTATTGAGTAT'lTTTG 
CCT CAATGTTCAT AGGG AAATT ATT CTG AAGTTCT CTTTTTTTGGTTG GGTCTTTGT ATAGTTTAGGT ATT AG AGT AATTTTG G CT 

TCATAAAATGAATTAGGCAGTATTCTTAATGTTTCT

AAAGCCATCTGGCCCTGGGCTCTTTTATTGGGGTGGGGGGATTT^ 
TAATTTACCTGCTCI^GATTTAACTTTGGTTCA.TGATTCTAG 
GCCCTGAGTTTGACTATTTCCTGTCATCAACTCCTTTGGGGTGTGTTTGCT^ 
GTACTGTTATGTTGCTAGAGGAGAATCTGTCCAATTTCrrTACCAGGGCACTTAG'l ,l l 
TAAGTTTAGGTATGCTGTGTCATCATTTTCATAGAATTCCAGGAAGTCTTTAATATC

C^TTGAGTAGAGAGTTGTTCAGTTTCCAGGTGTATGTGGGii iiCIGTTGii 1 liGTTGTTACTAAAGTCTATCCTTAGTCTGTGG 
TGATCTGATGGGATGCATGGGACTATTTCAATCTTCTTATAAAGGTTGAGGT^ 

GAGGGTTCCATGAGGTGCTGAGAAGAAATTGTAi iU i Tl rGTTTTCGAGTGAAATGTTCTGTATATATCTGTTAAATCCATTTGGT 
TCATAACCTCTATTAATTTCATTGTGTCTCTGTTTAGTTTCTGTTTC^ 

4 5 CC<^CTATTATTGTGAGGGGTTCAATGTATGTTTTGAGCTTTAG 

TGATGAGATACAACrGATTTTCAGTTGTCACTGAGAGTTTGTAGTTTGG^ 
GTCCTCCTGAAGAACCCAGAGAATGAGTTTTTCAACATTTGA 

AGAATGTCATTrCAATOTGCCCTGGATGGGATGAACACATGCTGCCTAGGGGAGTAGCTGACGTGC^ 
CAGATCACATCAGAATCAACCCTTGTAAAAATAAATAAGGTATGTTG^ 

AATGAGTTCTAGGAATAAGGGATATCAGAGGGCAAAGCTGCTGAAGATAGCAAGAGCTC

CTGGCAGCTGTGAGTGGCTGTTCTAGGGGACTGCAGTGCCAAGAG 

G CAAAGTC AAG CTGAGCTGAATTCTAGGGAAACGT AGAAGAG C AATCATCCAG AG AGGGGCTCTGAT AGAGCTCCCTGGCCACTGT 
TTTCACTGATGCTCAGGGAGACTTGTGACTTGAGCTGGAAAATA 

CTT CTGGG TG AAATGG AT CTT AG AAAC CTAGCCCAG TTTTGTGG CTG AAGAAACTAATGCTGAATTGAGAGATTT CATT ATTGT CA 

GGAAACTAATCTATTAAAGGCAGGGTACATAAAACTTGAAGATCTTCCTTCTCACCA^ 

CAGAAGCCTACATTTGTTCCTCCTGAGAATCCTCTGAAGTAC^ 

GG CTTC CTG CTGCTGAAT AAAAAGTCAG AACCTT ACATGTCAG ACACATTT CCTG CC AC CT C CATTT ACCCT CCTGCAGTTC C C TC 

TTTCTAACTGGCTTCAAGCTGAGTAGAAGCAAGGATCTGATTGTTGTGAGTGGG

AAAAGACTTATTCCAAGCAATATCCAGTTTCCCAAAGGAAACTCCTATCCAAT 
ATGATTTGATATGAATTTAAATGATCCGTTGTGAAATAGAATGCAGGGATATGGCAGCACAGGT^ 
TCTGCAGTGATTGGATTCTCCTGGATCAATATCCTCCTCCTCCCGCCCTCCCT 
CAAGTACAGGATTCCTGATCAGCATCAGCACCTCTCAAAATCTAAGAG^ 

6 5 GGTACTGATGCCTCCTCrGGGTGACTATGTTACACCTTCCAAAGTGATTTGGL i i iTATGTCCCAGTTAGATGGTTTGGTGTCAGC 

AGCATATTGTAGCrrTACATTGAGGATAAGTTAAGTTATTGGTCCCACCCCCCA^ 

TCTCTCACTGAACCTGGACCAATTTGCCAAGAGTACGAAATCAAGCC^ 

CAAAGGCATGTGCTACTAACACCTGGGCTrATAGATGGATGCT 

ATTACMACAAATCTGTCTCCCCAGCCCTCATTTTGTTCATTCAAATATCATCCAAATTTATCT 

TGCCATTCCCCITCTCACACTAAGCCTTTGAAATTCAGGATGAATTTC^

CCGATGGAGTCTGCTG CCAT CQVGGAT AACACATTTGTGCAGG AATATGTG CTTTG AGCCCTGTGCT 

ACCCAGATCTTTGACX3TGTTAGAGAGCAGTGACATGGTTGGAGTCACGCATTCTC 

AAATAGTGACCGTTCCCTGGCTAACCATTCCTCCCCAGAAGC^ 

CTTG ACATG CCTC CAATG CT AGTTCCCAGCAGT AAG CATGT AAG ACACCTG CCTTCCCTTGGT AG AGTT CCAj^GTTAATT ATT7L A . 
CAAACAAATCCACGGTGCACCACAGCAAGATGTCACTC
CTGGGTTCCTAGTCTGAGTCCAGACCAGTGTATACACAAACAATGGGGGTGGTATTTTTAT^
TGGAJGAACACACTTGGAAGCTTTCTCTAGGAAAATGTCTGTTCCTATT^ 
CACACTACCTAAGAGGGAACTGCAGTAACCATCCATCACCTC^^ 
AATGGAGGATAAGCTTTCAGGAGATTTGATTTACTAATGACATA 
GAGGGGATGTGGGAACTTTGTCCCATTCTAGAATATGCATGAGGCCTCTGAAGTAC^
AACAAGAAGAGGGGCAGATTGCTTCCTACCCAAGGAGAGCTG^ 

CAAATTTTGTTT ATG TGAAACAAAG CTTTTC CT AATGATGT AATGTGGCTTTGG C CT AAGT CACATCAAAGTACTGTCATCTT AAA 
CTAGGGGTTAATTGTCGGGTAGTTACTATCTGTATGGCCCTGGCACACTCGACAGCTACTCC^ 

GACTCACTGGGGACGGGCATGAGCG TTCTTG C CTT ACT AGG AAGATTGAAT AAGCACAG TCAAGGTG CTT AG CCATCTATTACAGC 
1 0 AG CACCATAAGTCCTG C CTTTGTTCAGG TAG CATCATGGC CTG AAGACAGGTC CAXK5T AGGGT CATTCTCC CAGGCAG AGACi^ 

TCCAGGTGTCTATCTAATGATACGGGTACCAGGATCTTGCCAAATCTGACCATCCTAGTCCA^ 
CTTCTAACTTAAAACAAATGCTCTTTTGGAGGAA^ 

CCAGCACTAAAAGACCAAAGTCTTATGAGCAAGGGCATACAAACATACATGTCTTAGGACTCAGT 
TGTTATCTAGAI"i rrAACACTCTTGGCTTACTTGTCTATTACAAATACTATG^ 
TTCAGAAAAATTATTTTGAAAATCCTTCAGTTCAGAAGAAATTGAGGTTCT^
ATTATTCCATAACCCTGAGGTTGCAACACTGTTaVGTAATGGGCTGTGAACCT 
TATGGCCAATCCCAGGCCAAGTAATCCACTTATTCCCTGTGGCTTTTCCTCCAATATCAAT^ 
CATCCTGAAGACCCTGTTTGCTATGGCCTGGAGCTCCTTGGACCTGGTGAAA 
GGAGACCTGGGTCTGAAATCAXMATGCTCTCTGGTCATAGGAjGGCATGGTGTGGGCT 

AAGACCAGGGCCTTTTATTACAAGCCTTTTATTGGTGATGTACCTCTAA

AAGCTGCTAAGAATCAGACTGAGGAGCCTAGAAACTCTGAGGAGAAGTTAGGGTGC 

TGTTTGAAAGC CAGTC CT AT AGAACAGG T CTT CTTTG TTCCAAAT AAT AAAAACAGG AG CT AGGT CAAGGAAACTG TTGCAAG TT A 
TTCTCTTACTTCTACACGCATCATGCATGCTGTCACACACACA 
ATATATATATATATATATATATATATATATATATGCACACACATGCAT^ 
AAATGAATAAATAGGGAAATAATTGTAAAGAAAATAGTTGAAGTCAGAGCAGTG
GGTCCTTTAAGCTGCCTGCTCTCTAAGGTGATGCTTTTC^ 
CACTAAAGAGCCTGTTAGAGAGATGCTAAGAGGCTCATTTGTGGATTTT 
GTTCTGCAAACTGTGATTTTCGACTTAGGCCATCCCACATATT^ 
CATCCCATCTTAGCATGTCTGTTATTGTTGTCCTTGCTCC^ 

TGAAATTACTAGC7UVGCACAATCTCACAGCAAACTCCCTGATCCTCTGCCT

CTGAGCCTTGGGTATAGGAGTGGTCrrrGTAGATTGGGACTGGGATCCTCCATCC^ 
TCACTGTCTGTTGCATACJUVAGGTTTCTTCAATGAGGG 
AGTTAGGAACTATGCCAGCTTAGCAAAGTGCTGCTTGCAGTTTCTC 
TACCAGGTGTGATTTTCCTCTTGTCAAGTGGGTCTTAGGTCCAATT^ 
TACTGAACCCTTTGTATTACCATGACATATTGX3TTATAATGATAACCATATGTGTTATATGTGTGTAATATACATG
GCTATTTATGTGTAGTATATATATATATTTATGTCTATTAT 
GTGTGTGTATGTGTGTGTGTGTTCCAGTTCATGGCTC 
TCTCTAACCATGAAATCTAATCCTCAAGGAACAGGCTTTCA^ 

TGGTGTCTTCAGCATTAAGAACAGAATGCTAGGAAGGTGTGTTGTAAGTAGTTTGTGGTATT^ 

TGCTAAGCCAATGGGTCACAATAAGCAGAATGX3TGATGTGGCTAAGAA

CAAGT CTTCTCTTATTGG AGTCTGAGAATGTGG GG C AG TT ACATAAT^ CTTTGG TG TGGAGAT AG AAATGAT 

AATGTTTGCCCCTGCTGTTATCACTAGGTTTTAATCACAACTATCTTT^ 
CTCATAGAAGGAATATTACCTTTTATTTCTTAAGCCAATGTGAGAACnT 
TTTTGCATTGTGTTCrrAATGAAAGGAAAATTAAATGCACATACATTTTTAT 

GTTGGTCTAGGGAATGACTACTCCTATGACATAAAGGAAAATGAAATCAAGTCCAGTATAGATTT^

TAAGTAAGATTTCTGGAATTCCTATCATTCTATGTGGCTCTATGGCTTATTCCCCAA 
AGGTGCTTCCACCCTGGTGTAGACAGTGAAGTAGTAACAGCTGATC 
CATGAGTCTCAGCTATGTGACCACGGGAAACAGGAGAGAAAAAGATACAATGG 
ATATTGAGGTTGGGGGTCTCTGAGCCATTTATTCCTCCTCrrGGTTCTTTGGACCACTTC^ 

TCAAGGCACATTCTGTGGCTCTTGGGAGCCTTCCTGATCCTATATGG

CAGTCAGTCAAAGTGGTTTCCACTCCTAGGAGGAACACACCAGT^ 

ACGCTAAGCAAACCATTGTGTGGCAGAACAAGCTTCITTT^ 

TCCCTTTGGAAACAGAATGGTGGCAAGAATGAGCAGTTATGTGGACATC^ 

GAATGCTGACAATGATATGTTTTGTGGGCATGCTCCAGGGGAGGATTCAGTACATGGAAAATACATGCrr 

TGAGGGAATCATAAAGAGTGGATGATGCCCTGAGGGAATGTGTATTATTTACAGAGATATCT

ATCCATGGGAAATCGGCAAAGCCTTCTCTGGTCTGAGTATGACTGTTGACAGTGTC 
CAGTTTGTGCAAAAGCTAGCAACTGCAGCTTCCTCCCTGCAGGTGTGTACTGCCT 
AACCAACCCCCTGTCTCGTAGATAACAAAACCCC1TACGTTTCAGACTGCCTCAGTAACT 
CCCATCCAAACGCAGCTACACGTGTCTGTGTCTGTCCTTTCTTCATTCCCTC^CTGCTCAGTCAGGTC 

GAACACAGAACATCCCCATCACCTTGGGCCCAAGCGTCTGGCATATGCTTCCAATCT

CTGGAATCTACAAATGTG AG CTX^ AAGCCATTCCTCATGATAACCCTCGCTC'rX"!' LTTGCCTTTTTCTTTTCTGTGT TT T T T TTCT 
T(^TTTATTGCTTATGACACTAGCAACGCAACCTATTGGAATAAAAATGTGTG 
GAAAGGGACCATTTCTGTTGATAGTAAGACTTGGCTTGAAAGGGCCAGTTAAC 
AGGCGTCCTATATTTGATCTAGCACCTACAGAAAGAGTAGCTACGCTCAGAAGAAAAAAATATCT 

GAAATATTTTATACACTAATCTAGCTGAATCGGTCTCTTTCTGGT^

GGTAAGCCTGATAAAGAAGACCTCAGATCATGTGTGACCAGATAAGCCTAGCAATTGTCTGAA 
TACTGAAAGGATGCCTGAAGAAAAACATCTAGAAACAAGTTTTGTGTGG^ 
GAGTCTATGTAACTGCCCCAGTCATTTCTCCTAAATCTTTTTCTGCTAGAAA^ 
AGACrTACAGGATTGAAATGATTTTAGCCTCTTCTTTCTT^ 

AAGTCTGATGCTGTTAAATATATCAGGGGTTGAAAAGGGCTTAGAAGTGTAACATTCAG

CCCAGCACTCCGGGGGGCAGAAGCAGGCAGATTTCTGAGTTCGAGGCCAGCCTGG 
TATACAGAGAAACCCTGTCTCGAAAAACCAAAAAGAACCCAAAC^ 
ATGTCCCTTGAAATGTTTCTGAGCrrTACTATGCATTAAATAACTTGAAATGAATGC^ 
CTGAArrrAAGGATGCCAAACAATTCTCTa^TTTTTCCTTTGCTGGAA 
ACCTCTAATACCTAATTAGTGCAAAACAATGCTACAATTATTGGGCATGACTT^
AGTTAAATAGATTGGCAATGAAGTTTCAATGCAAACTTTGTCAATGAGGCCAGCCTACTA

AATGG CTGAACAG ATCTTCATTCCT ACAAAAC CT CAAGAATT ATC CATTTTG CAACTT AAGT AT ATG CTG TTCATT ACTTTTG T AC 

AATGGCTTACG AATC CACATAGGATT LTTGT ACATATT CAT'TTTGGTT ATTT ATTCCC C CTT AATT AATC CC C CTTCAT CTCCATG 
5 TCTCTCCCTGCCCTGCTGCACCTAACCCTCTTCl'l'ItjCTATAUTCTCn^ 
GTGGACCOTTTTCTCATTACCTAGGCTTTATCCATACTCACTTCTACT^ 
ATCTGAAAAAGGGTGTGGGGATATGGCGTGTTTATTTTTCTGAGTC^ 
CTATCAG^TGTAGGATTGGTAACAATCTTTTCCCAATCTGTTGGTTC 

GGAAGCAATGCTTCTCCCATTTCTTGGAGA

AATGTGGTG CAT ACCT ACAGT AGGATTT ATTCAACAATAAAGAT AAG CAAAATT ACAAATTTT ATAGGAAAATTGATGGAACAT CT 
CTCTCTGTCTCTGTCrCTCrCTCrCTCTCCCTGGTG 

CACTCAAGAGCCTCAGACAAAAGGITGTCATGTGAGC^ CCTAT ATATTTCTCTACGTTGGT AACAG AT AT 

AGTATGTAAAAGTGTTTGTAGCAGTACrATAACACrATGCATAGGTAAAAAACAGCT 
ATGTGCTAAGTAATATGATGGAATAGTACTCAACTGCAGAAAAGAATGAACTAC^
AGGATCIAACAGTATTATGCTAGGCTAAAAGTCACTTATTTGTTGTCT 
TGCTAGAGTAGTGTTTCCAGGGGCTGAGAGAAGACTAAACTGAGTGTAGATGGAT^ 

CTGGCTGACGACAGATGTTTTGACAATGGATGAGAAGCTGATG T TTTTTAAAAAAATACTG AACTTT ACGTTTTAAAAGG 

GTAACTGTTATATGAGTACTCTCAATTTTAAAGGAAAAAAAGCAATGT^ 
TTAAGAGCAGATTCACIAAATAAGCACTGGA^

AGTTTGGATGAGTCAGAAAATAGGGAGACCGGCTAGAGAGATGTCTC^ 

GTTCAAGGTCC CAG CAAC CAC ATGG TG C CTCACG ACC^TCTGT AATGAGAAC CAATGCC CT CTTCTGGTGTTT CTGAAAAGAGTG A 
CAGTGTACTTG CAT AAAAT AAATAAATAAAT AAATCTATTTAAG AAAATAGGGC^ ACAATTGTG TG TT AGCAAAATACAGT AGAG A 
CTAGACTAAGGTTCATTAAAACACAGGCCAATAAAGATGAGGCC^ 

ACATTAAAAAGTTCAGACTTCTTCAGGCTCXSTGAAAAAC^

AGTCATCTCCCTTTCCTTTCCAGCTAGTTTGCATCTTACGTCCGCGTGTC 

TT CCTGTTTG T CCTTTT C C CAATTTTTT ATTTGG T ATTTTCTT CATTT ACATTC 

CCCTCCCCCTCCTCTACCCATCTACTCCCACTTCTTGGCCCTGGT^ 

GGCCTCTCTTCCCAATGATGGCCAACTAGGCCATCTTCTGCTGCATATGCAGCTAG 

AAAGACTGTATGTAACACCAAATATTAGAGCAATTGTCTTCACTTA

TCAGrTTCCATGTGCAATGAATTTTTTTACAGGTGAG 

AAAATG ATG CCAATTTT AG AACG CATCTTCT CTTT CAG ACITGG AAAATT AACTTTT AT^ CAAACT AAATTT AAA 
AT AAGCTTTGGATT AT AC C C CGTG CGTATGAG CGGACGTT ATGACATTG ATTG ACAACG TCAG ATCT AC CCTCAACCCTCTG AGAG 
TATGATTCACACATACCTGGGATCAGAAAAGAAGTGTTTTCCAG^ 

TTCTTCCCCATAATTCCTAJGCTAAAGATGAAAGATACAGACAAATGACCCATCCTAGGAG

TAGGTTTCACLAACAGCTCATGAAACAATTCTGCAGC^ 

ACTCAAAAAAAAAAAAAAAGGAGGAAAAAAAAACCCTATGTGGGGCTCIAGC^ 

AGTTCAATTCCCAGCAACCACATGGTGGCTCACAACCATCTGTAATGAGATCTGATGCCCT 

CAGT ATACTCAT AT ACAT AAATCTTTT AAAAAAAAAGTC CT ATGT AAAAGG CAT AATG GT AG ATG TTG CAAGAAAG CAGTG TG TGT 

ATTGGTATCTGGCAGGAGGCTCAGGAATGTTCACATCA

GACAATTTAATCATGGGGTTTTTTTCCCCCAATAATAAAATTTT 
ACATAAGGATGJVAAGATTTCAAAACCCGTCITTTACAAAA 
TTTGCAGCCC^ATGTAACAAGCCCTTTTGAAGAGCTAAGAGGGC^^ 
CTTAGCCTCGGACCAGGAAGGGCATCCTTGCAC^AGACCCTGCAGGTCTGTGCAGTTCAT^ 

4 5 GCCACAGA(2TTCAC^AGTCTGTTCCCCCCTGAGTGATTCTGCAAC^GTTGTTT 

AATCACATTGGAGCTTCCAGACCATCANNNNNNNN^^ 

GAGGAAAGGGAACACCTGTTGCCACCATTTTGAAGGGACAGTGGTGACCCATAAAGAGGGTGCT 
AGGACTTGGATAAAATATCAAACTtlACAGTGCTGAGCAACTGTCAGGTTCT 

GGCCATCCACACAACCCCTGGAGGGAGCGTTCTATAGAACCGGfrTGCCTTGGTCAGCTT

GGAC^GC^TATAACCCAGGCTCACCTCTCCCAACCCCTACACAGG^ 
GAAATCAGAAGGCAGTAGGACTCAGGTGCAGACC^TATAATTCAGAA 

GAG CAC!AAACATTGGCCAGC!TGC!AGCCCGGCCCCAATTGGGA CCACATCCTGCTCTG AG 

CCTTTTGAGTGACTCrGAGATAGAGCGCCTCTGTGGAGTTCCAC 

5 5 CTCCTTG CCGGTTTGCAAGTCTTGAAAGCTCACTCCrCTAAGCrGATC CAGCCCTCATGCAAAGCAAT CTGATTG CTGTGTCTGG 

GGTTCAAT ACACACACACACACACACACACAC ACACACACAC^ TG C ACGTG CAT ACAC A CAAAG ACAGAACAGTTTG C 

TTGCAACATTGTCATAGCTTAGTGAACAGTATAGCATTCTTCTG^ 
ATTATTTGCATCACAAACAAAAATTGTTTTGCATTTTTTAGATAATGAAG^ 
AAAATAGGTGTCTTTGTGAGACTGGCTTAATTCACTTACCATGATCATATTCAGT^ 

TCCAATGACGTGAGATTACAGTGAACTATCTAGGGAAACTAACAGGAGTGGA

TGGGCACATATGCTCAAAACCATAATGTACrGGGGCTXKSAG^ 
TGAGTTCAATTCCCAGCAAACACATGGTGGCCCACAACCATCTGTAATGGG 

G ACAGTGT ACT CACAT ACAT AAAAT AAATG CAT CTTTTTTT AAAAAAT CATT AT AT ACTTG CATGG AAATG ACCCT ATGT AACCG A 
GTTTCATTTAGTTGTTTTTCAATTGTTAGAGAAAGACT AAATTT^ 

CCAGAAGCCCATCTGGAACTTAGCATTTTGAGCAAACCAT

CCT CACATTTCT AAGTCTG CCC CTGTG ACTG CCTT AG CTCCTTCTTCC CAAATT CT CCAGTT ACCTGG CCCTT AGGGGTGG TAG CG 

AAATCTCCTTCCTGACAGCTTCCTTGGCCAGGTGCTTCTGTGTGGAG 

ATGCACACACAGGAACGAClTGGGCAACACCCnTTGCCTTGT^ 

GCCTGTGAAGTGCTCC^GAATAAATCGGAATGTCCCACrGAAGCCAGACACCGTGGTCTTCATGTTAATC 

ATCATTTATTTATTCTTTGATTCCCTTTTGCAGAGTTTTTATTC

CAGGATTTCTCCAAGGTGACrTrAGCGCCAGTGAAGGACTTATTGTC 
TAAAATGAC^TGCCAGGTGGATTCAGGACAAGCAGACACAAGCAAGCT 
CAAGCCTTCTCGGCTACTGAATTTTGAACTTTCAC^ 
GCATGGCATGAGGTTGGCGCC^CCl^TTTATTTCTT^ 
ACACACGTGTGCATGGGC^GAC^TGCCCAC^CTATTTAAATCTTCTGCCCTTT^
CAAGCATTGCTTCACCTTTTCACTTAAATTTCTCCTCT
TTCCATATTAATTTTTAGGTTTTCATGTTCTCGTTTTATTCGTACTTAT^ 

ATTTCTGTAT ATTG CATCCCTGTTGCCCATCTATGCTTCTCCTCCAGACT A l ' lU ' C T f CCTGTGCTTATTCGCTTAGTGAAAGTACA 

TTGCCAGGCTAGAAAGGTGGTTTGTTGGCTAACAGTGTTCTCTGCACA^ 
5 AGGAGAAAAGCAACTCTGAAAGTTGTCCTCTAA Cri T TTCATGTGCACCGT AGCCCATGTGCACCCACACCCACAAATACACACCA 

CACACACATACACAAAATAATAATAATAATAATTTTAAATATATCAACTGACTTACAAACAT^ 

TgrGATGTTAACGCATAATTTGATTCTAATTTAGTTGTAAGCATA 

AGCTGTGGTGGTGAGCTTTCCACATCAACTTATCATAACCAAGAATCACTGGGAAAGA^ 

CAGAGTCATCTGTGGTCJ^GTCCACGAGAAAATGTCTTGGTTGCT 
1 0 CAT CAGGGGACAGTGAG CTGTC CAAGGCAAG AGAAAACG AG AAGACCTCAAG CAAG CAGGC CG CATGG CT ATTTGTT CATGC CTGC 

TTGTGACTGTAGACATTATGTGATGAGCTGCCTTGAGAGTCTGCTTCAGTTTCTCCT 

GACAATAAACTCTTTCCTCCCATAAGCTGCTGCTTCTTGGATGC 

TTCTGCCTGAGTGCGTATCCCAGGTGCCOU3AAGTGTTCG 

GTCCTTCCTAA TI T T TTAGCAAATGTGGTTTA ITI ^ 
CCTCTCAAATGATATCCCCCTTCTTGGTTACCCCTCCCCAGCCCCATCCACCTTCTC

ATTAACCAGTACAGAAAAAGTCCACTGAAAAATCCATGTCCTGGGCCCCAAAATACAAATAAAAT 

GCTCACATTTCTTCGTGAGTAGTTATAACTTCCCTGAGATTT^ 

TCATCCTGAGGTTAGTGGTTACTGTCACTGTCTCTATCCrCATCT^ 

GGTTTAAGAGTAGCTTCAAGCTGAGAAAAGGTCAATTTTCTCCAAA^ 
2 0 AAGTACACTCATTCTGAAATGTCAGGGCCAATT 

AAAATGCACAGTCCTCCAAAGGGCGGCCTCTTTGTGCTCATCCGAAAGTGATGGAA 

GTCACCAGAAGTCAGAAACCATGCAAACAACCAGTAATGAATAGTGAATTCTAGAATTCA 

GCTTTCAATTTT CJVTTTTAAACATG AG CAAAACCTGTGT TCTGAAAGAAAAG ATTT AGAGAAGAG AAAAAGCAAAAAG AATAT CT A 
ACATATTGCATGACCCACCACTCCCAGAATCTGCTCCnt^ 

2 5 TCAAGGAGAGGCCTGAGTTAACrcrCTGAAATTCCATATCTTAGGCACTGGA 

CTTTGAAGTGAAGACAACTGCTGTCCTGATCAATGACTTTGAATTTAACT^ 

GG ACAGTTTTG TT C (HTGGTT CTTGGGCTT CC CCTTCCT AAG AG TTTTGTTTG CATGT CT ATTCAG ATGCT AAAAT ATTGTGTT AA 
CATCGATGGTGATTArri l AATGTTTGTTC^ 

TAGATGAATATAGAATTGTATCCTTATGATGAATAAACCTTTTATAATTACTA 

3 0 ATCTTTTGTCTTTATCTGATATTAGGTGTAGCTGCATCAGCTTTCT 

TTATTTTCAGTCTCCTACTTTCrrrTTC-GATTATTTAATCCATTTACAGTT 
CTCACAATTTCTTTTACTTCTATAACTTATATTCTTTTATATTCTTCTTGA 
TATTCrCTrTGTTCTATrTACTGTCCCGTTAATACrGTCTACTGTGATTCT 
TGACCTGTTACAGTTTACTTTAAGTrAGTCTTTCTTTTATTTTGTATCCA 

3 5 GGGTACAAGCTACCATTGAGTCACCACCCTACCCTGTTAATTATAATGAATAATGT 

TTGCAAATATTTCTCAGCTTTCCATTATTATTATTATATACrTTC^ 

TACAAGATATTGTTGTATTAAATGATCAATGAATGTTAAAATTTACCAAATATTTACTCT 

TG AGTAAATTGTAGCTGTGTTCAGACACACCTAAAGAGGAT AGCAG ATTT^ CTGTG AG CCACCATGTG ATTACTG 

GGAATTGAACTGAGGACTTCTGGCACAGCAGTCAGTGCTCTTAAC 

4 0 CCTTCTGGTGTTAGTATCATCCTTCTGTGATCATGCTTITTACT^ 

ATGGCACTCCACTGTGGG AATGTGAAATTIGGGTCATTG CCATCT AAAACT CAGTClACTTTtSGCAGCATGGCTCAGTTG AATGGCC 
TCAAATCCCAGGACACTTATTTCATTATGTATCTGTCTCTGCTCCIAACAATTGTCC^ 
GACAAAAAAGAAGAAAAAAACC AC CCAGGCCATTATC AAAAT CACCTCACTTC^ 
GATCIAAATGGACCATTTAAAGAGCTAGAAATGATAGATAGATAGATAGATAGATAGATAGATAGA 

4 5 GATAATAGGTAGATAGATGATAGGTATAGATAGATAAAAGATTAATATTTGATAGATAAATAGATAAAAGATAGGTAG^ 

AGAGGTAGATAGATGATAGGTAGGTAGGTAGGTAGGTAGATAGATAGATAGATAGATAGATAGATAGATGATAGGTAGGTAGGTAG 
AGAGATAGATAGATAGATGATAGGTAGGTAGATAGATAGGTAGATAGATAGATAGATAGATAGATAGATAGATAGATAGAAGTAGT 
TGGTGGTAGCAGTGATGATAAGAAGATAAAAAAGCAACTTTATGTTGCCTTAGAA^ 
ACAACACTGAATCTATTATAAAACATCCTTGAACATTATGTCATGTGGACAGACAATACrcr 

5 0 TTTATCATCTATTTAATGGPTCAGTATTAGTAACCAAGTT^ 

TC!AATATATAAGCCTGAATTATGAACGCTCTATAATGCAGCC!Ara 

ATTTATATTTCAAATGTTTTCCCCTTTCCAGGTCTCCCCTTTGGAAA 

TGCTCCTCCACCCACCTACCCACATACCGACTCCAACCTTCCCATCCnKSTATTCCC^ 

CCAAGAAGGGCCTCTGCCACATATG CGACTGGAGCAATGGGT CIACTC CATGTGT ACTCITTGGTTGGTGGTCC AGTCCCAGGGAGC 
55 TCCAGGGGTTCTGGCCAGTTGACCCTGCTGCCCCCCCCACTCCACGGGGTTGCAAACCCCCCCCCCCCCTCAGCT 
TTCTCCAACrCCrCCATCTGGGACACrGCACTCAGTCCAATTAT^ 

AGCCTCTCAGG AGACATATCAGGCTCCTATCAGCAAG CACTTCTTGG CTTCX3GCAATAGTGACTGGGTTTGGTGG CTGCAT ATGGG 

ATGGAGCCCCAGGTGGGGCAGTCTCTGGATGGCCTTTCCTTCAATCTCTC 

GTTCTCCCTTCTAAGAAGCACTGAAGCATCCACACTTTGATTTTCCT 

6 0 TTCCAAACTTTTGGGCTAATATCCACTTATCAATGAGTAC^ 

TATTTTCAAATTCCATCAATTTGCCTGGGAATTTCATGAAGTCATTGTTTTT 
(^TATTTTCTGTATCCATTACTTTTATGTATCACAACTTGGAl'l 

GTTAAGATTGCAAGCCTATAATCACATGTATCTGGGAAGCrGAAGCTAGACAGTTGGAAGT^ 
AGCTCAAGGTCAGCCAGAGCAACrGACTGATACCTTATCTC^ 

6 5 AATGTTT CAT AT CCT AGGTG C!ACITTC CAGT ACT AAGAAAG AG AAAGAAAG AG AG AAAG AAAG AAAGAG AG AG^ 

AGGATGGAGGGAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGGAGAGAGAAGAAGAA 

GAAGGAG GAGG AGG AGGAGGAGGAG G AGGAGG AGG AGG AGG ATG ATGAGGATG ATG AGG AT AACAACAACAACAACA^ 

AGGAGGAGGAGAAGGAGGGGX3GAAGAGGAGGNNNNNNNNNNNNN 

CT AAT ATTTTT C"I T CAAGATT ACAACAAGAACATTGAACACT AAT ACAG AG AACftTCTG 

7 0 CCAAAGGCTCACTTATCCAGGCACTTTGAATCACrTTCTTAACT 

AAATTTGAGATAAGACTGTTGGAAAGCAATATGTTCTATAGTTATTGTCATGAATTATTTTC 
TGTCCTCATATAACCCAGCCTCATCACACAGGGCAGrrTTATC 
AGT(nTGAGAAGCCTAAAACACATACAGAACAATGTGGAATTAACTTCT 
ATTTATTTAGAGAAACTAGGCTAAAGCAAAGCAAACAAACAAAAGAATTG 
7 5 CTACCTCAGCCTTGGTAAACAGAGCTCTAAATT^ 
CACCATACACTCAAGTATGTAAAGAAAGTGTACACACAGGAATCAATGTGTTCCA^

TTATCCCAGGAGACAGACTATGCCAAGTTATAGTACGAAAGTTATAGTATCTGCATTGTACAATAGCACAGG 
CTACACCCCAAGTTTITCTTTTATGGTAATAATAATTTGTTCTTTAACATl^ 
TTGTGATTTCTCTTCTGTGTTCATTACAGGAC^U^CACTGTC 
5 TCAAGACAAAATAGTTATGTTCTACATACTCTCrTGCCCTTAGCTTCCGTAACTTTT 
CTCTAGTTATACCAAC^TGTGACTTTGAGTACCATAACTG 
CACAGGGTATTAGCCCATTGGGATACAGTGAAAAAAAAGTCTTTAAAAA 
CTACAACTCATAAAGAATCTAAATGCTAAAGAACAAATTATTATTACCATAAAAGAAAA^ 
CTCCTGTGCT ATTGT AT AATGCAGAG AACTTGTAAG AAAGTGAGC CA 
1 0 ATATAGCTATGTGAGTCCAGAAGTGTGTAATTAGTATATTATTCCA^ 
GAATGATTTTGCCAGAGGCAAGTTAATGACTGA 

GG AAGGAAGGGG AAGAAAGGAAGATGTGACTT ACTAGAAG TTG AT AG T AG AATTCTGACTGG ACTCT AAGAGTTTAAATTTATTAC 
AGGTTC1TCCT C CT AG CCTCATGCTCTTAGAAATAAGACTCAGATTCAAAATATATTTT CAT AT ACTTTGGCCATGTGG CT AGG CT 
CTT CTCTG ACT AAAAT CAT AACTT AAAAC CAACC CATTGATTTTCATCT AC AT ACTGC CACAT ACCACATGTT ACTTG TG CT ATG T 
1 5 GCCTGTCTCCTCACATCTTCCTGGGTGGAACTCTTGTCTGGCTCTATCCCAGAAT^ 

TTCCTGCCTAAG CCAT AGG CCAT AAG C CITTT AATTG ACAGGTG ATGCACCCAT ACAGT ACACAAGAT AG TCT CT ACAGCT AAG 

TTTTGACATTGTTTCCTAATGTAGAAATTATTAGGAGCTTUVTTTCrc 

CGTTTCAGAAAATTGTACACAGAGAACTCTTATTTCCTTrTGCCT 

T ATGTCATTGAT AATCACT AT ATTTTTTC CX3CT(nT^ G CTCGATTTT CAC C 

2 0 AGTCrTACTGCCTCTAATCCTAATACCTTTTTGAATGGAGAACTGTTAGAT 

AAGTGGACTATAGGATGATTTCCAGAAGTTAGATAGAAACCAGGATTCCTTAGAAATT^ 
GATTCACTTCTCTTGTCCCTGGCACAACGGACATAGAGCTCCACCG^ 
CCAGGCTCCTTCAGCCCCTACniAAATATACTTGTTACA 
CCTATTTGTTCTCTTTTCAACATGAATGATTTTCCCCTTACCCT 

2 5 TTTACTCTCTGACTCTATGGGCTATTCTACATTCTCTCATAGATGC^ 

CTGATTCACAATAAAAACCTTCTCCTCACCTGTACCATGGAG^ 
GATCACCCCAAAATTTGGTGCCAATTCTTCTACTAGTAAGC!AATCCATAAATA 
TCCTCCAAATTAATAAAACTTAAAATATAACAAGTTAAGAAGGCATGTAAC^ 
GGTATTTTAATGGAAATCTGTGACTCAATTTACATGTGGAAGTTCTAGTACACAC 

3 0 GGAAGTTACTATATCTCTTGCCCTTTGCCTTGCAAATTCCACTCAATCAATGCCAAGA^ 

CTTTCCCITCACGTTCAATGTGACTCAGTTGTGACC 

TC CATCTCTGGGATTGACAGCATCTGTGTC CTCTG ATTTCTACTCTT^ CAT ATTGGT AACC 

TTTCAAAAAGCTCAAGAAGAGTGACAAGAAAGCAAAATTGCTCCCA 

CTCCTGGCAGAAAGCACAGACATTCCAGGTCATTCCAAA 

3 5 TG ACTCTT AAC AT GGTTATTT ACT AACCACGTGTTG C CATGG AAAC AAGGT AAAGCIAAGATG CT CTTGAGACACAG AG ACT C A 

TGTTAGAGAGGTGGTGCCTAAATC CCATTCTACAGCGAGTAGGCTGTTCACACAG CT 
TCCI^GGGAC^TGAGAACCCTGTAGCCACTCATCCT(2AAGACATGGTATCCGCTCTCCrCT 
AGGCTAAAGGGCGGAGCCAGCCATCTTCTTGACCGGTTGGGCTTACAATTATGTACTTC^ 
AGGAATACAGTAAAATTCATTCATATGATGCTCTGGTACTCCTGCTTATTGAAATAGAAAAAA 

4 0 AACGTTTG C AGG AAAAGAAGTTG AAAAT ATGTTCAAG AG TT CAGATGGC TT CT1TG ATT CGTGGT AAGTTTGAGTT CT ATGG ATTT 

GGGTTACTCGAACTCTGATTGGTCTCTGGAGTACATGTAGACGTTAAGATGCTATATGCT 
AGCCTATAAAGTAGTGACACTCIAGATATAGAAACATTTTGATATC 
GAGAAGCTTCITGTTACTGGAAGGAGAAAAAGTCATCAGACCTTATGAAAA^ 
ATGTGCAAGATTCACTGAGCTCTTTCTTATGACCATATAACCCATCT 

4 5 AAGCTTTGGAGCCTTGGAAAATTGGCTTTAGGAAAAGAGATCACCTGGC^ 

TAGG ATTAGGGTCAACAGTGATTTTTAAGTCTCACAAAGAAAA CAAGTTTGGT 
TTTCCAGCATAGATCCGGGCTGCCAAGAATGCIGGCTCCAGCT 
C!AGGTGACCTGAGCCAACCCCACCCACTGAGAAAACCAAGAGAAGTTTGCCr^ 
GACTCCATGTGTCGTGACTGACACCAQUVC(^GGTATGTCA 

5 0 ATTTCTTCCITAACTTTGCTTTCTGTGGCTAACAACT 

AGTTAGGATTG TGTCCTTTTGT AAAAATG AT AACCCT ACAGAGGTCC CTTT CAT AG CATT AACCTGTGGT CG TGT AAATGGG TACT 
GAGCCTTAGAGCCCATCTCTTTAGAAAGGAAATCATTATGCAATCAGA 

TAGGAGTAAGGGGAGAGTATAGTCTGTAGGAGTTATGCGACTTTTGAGGGTAACACAGGTAGCTGGGCATGCAAC 
TACrGGCTGCGCTCTCATTCAACGTATCAAGAGTCTTCTCT^ 
55 GATCTTAGATTCATCAGGTATGTTTGTATGCATATGTGTGC^TC 

CAGAG AG ACAG AG AG ACAG AGAG ACAGAG AGACAAAG ACATTGAG G CAGAAAGAGAGAG AT AG ATG AAG CAGAAAG AG AG AT AG AG 
ATATAGAGACACAGAAACAGAGAGAGACAGAGAGACAAAGACAAAGGAAATAGTAATA 
TCTCTCAATTTCTTGTGATTGTAGCCTGAAAACATCTAAAAATCACAGGAACAGG^ 
ATATAAAAATGTGAATTTACAGCAGTTTCATGTGCTATGAAAATTTATTATG 

6 0 TTCTTGGCTCTTAGTGGACCGGATTTTGCCTAGGAGAATAGTTT^ 

TGTGGACTCTGTCCTCCACTCGGGCTCTCTGACCTTGTAGCTGACTCCACAAGCCCCCTTT 
ACACTGTGAAATTAGTCACTGTGTGTCTCTTCCCAAAACA 
ACGGTCATAAACGTCATAGTGAGGACTGGAGCAAATGTTGCrrTGAAC^ 
AGGAAGTACTTCCGTTTGCCCATAAACAACTGTCTTAAGGAATTTGCTACATTCCTT 

6 5 TTGTTTAGCTTACTAACAGCCAATCTTTCTTTCCGACTGGCCCT 

CAAGCAAATGAAGACCACTGTTTTAACTCAGATACAGGCGGTGGGGG 
TGGTGTCJAGTGTTTCCTTTGCAAACTTTGTTGAAATGTTAT^ 

AACTGGGTCATGAGGTCTCTTGCCTTATGATTGTCTGCAGTGTGGAGAATGGGTAAGT^ 

7 0 GTTTGAAAAACTTCAGAACCATTAGACAAGTAAAAACTT^ 

ACTTGTTTTGTTTATGTTGGTGTTGTGTTTGGCATATGTTTTGGT^ 

GG CCAGGATAAAGGCACAAAG ACTTTGGCTCAGAGCCACCACAGAAGAACTGTGAG AGAG CCATAATCCAGGCGTCT 
TCCATATATGAAGTCTCC^.AAATCAACTAATATTGCTTACACTAATATTGATACCAT^ 

CAG TGATG CAGTGTGGGGCCTATTAAAT AT AGTT ATCAAAACTTTGAGTTG CAAG TGG CAGaAAC CAAATG CAAGATGAGAC AT AT 
7 5 AAGGAAACGGCCACATCAGCTTTAACTTTGGAACCAGCCAT 
GCGTTGACATCATGCATCATGTGGAGAGACACA

AGGAAGAGATTTTATCTCCACATCTATCTCAGTAGACAAGATTTTCTGTCCri^ 

TGGAATGGAAAAGACCCTTTTTGGGGTGGATTTTTAGATTCCCAGTGA^ 

AAGGGGGGGGGTAGATGCAAGGCTAAAAC-CAGTGCCTGCACCAACTGATT^ 
5 AAGCATCGTAGAACCCACCCCATGAGTGTTGAGGTGATTCTGGATAATTCTTTCCTC^ 

GTGAAGAC CTGAAAT AGTACAAAAG ACGATGAATGG^ TG AG AG TTTGGT AT AAAACTCAG CT CTCTTTTTG CT C 

G CT ACACT CTTTGATTT CT ACCTTTTGTTTCT ACCAAG AATGAT AT ACCAAGG AG ACATTT ACTT AGCTC 

GAGTTCACAGCACCCAAAGCTGTAGGAAAAAAAAATGTATTCITTA^ 

AGCAGGCTGCAGTGGCTGTGGCTCCTGCAGAGTCTTACTTCCACACAGTGACA 
1 0 AAGTTGGAATGCIAAATCCCITGAAGCCAGGGCCAGAGTCACT 

TTTAGCACAACCCACAGGTCTTCTGAATACTCAAAAGCATGAAGCT 

TAC^TGTTAGTTAGACACCTATAATCCCAACACAGACACCT^ 

ATTAAGGTTTIGTTCCTGCTCCCTTCTCCACrCCTCCTCCTCCCrrCTTLi 

AGTTTAAACTACCCCTAAATTCAAAATCCTCTTGCTTCAGCCTTCAAAGTACTAG^ 
1 5 TGTCAGTTCAGCTGTTTGACrnTAACTGGCACATACCA 

ATAGAATAACTGGCTCTACAGTGTATTGCCTCAGTTCTGTGGTAAGAATGTCTr 

ATATTCCCCTCTTACAACTGAAATCTTGTTCCCTC 

CTCTTG CTCT AT ACGAGTAGTAG TTTTAGG TTCT ACAT AG AAATGAGGTCATG TATTTGTCTTCCAGTGTCTGG CTT ATTTCACAT 
AACACTCTGGAGT CATT CTTGTT AT CCAAG TCTTTCTG T AACTG AAAAG T ATCTCACTGG G CCTGTGT ACCACATTTTT ATT AG C C 
2 0 ATG C AG CCATGATGAAT AATCAAGCT AC^TTTTTGATACAGCAAA CAACATGGG AT ATCT CTTTG CCATATTG A 

CTTT ATTT ATTTTTTT AT ACACAACT AGT AGTC^ TGT CT ACTTTTAGG TG TCTGAGG AAACTC CAT 

AGTGTTTTTCCTAGTGGTTGTACTAATTTACATTTCTACC^ 
ATAAATTTTATCTTTTCAGAGTAGTGGTTCTAGCAATGAGGTTGTATCT 
ATGTTAACATTTTTAAATGCACATAATAGGCACGATTGTTT^ 

2 5 CATACCCTTGTTATTAGATTGCTTTGTTCTGTATGTATTTTGAGCATTAAGCCCTCCTCT 

TTCTATCCTTTCATTGTTCACTGGCACATTTTGAAGAGATGTGCTGTGAC 
TATTGGAGACATACACTCTTCAGAGGCACACAGAGAGACAGGAGAGC^ 
ATTTAGGCCAGGTTGGTGCTCTAGATAAAGGTAAGAGGAAGCTC 
CAGTTTGAACTGCCAAGTTGTATGGAAAGAGATGAGA 

3 0 AATAAACATTTGTAGTTTACTTTGTTCTTGACT 

TTGAAGTATGGTTTTAAAACTGTCTTAAACTAAAGCCATGAACTTTTAT^ 

CAT ATG T ATTTTTCAG AAT ACT AAG ATAT AT AAT AAT AT AATACACACACAC AC ACACJ^CACAT AT AT ATAT AG ACTGTTTC 

ACAATGAATTCTTTTCTGTATCATTTTATAAATC^TTATGTGTCCAG^AATCA 

GTTCCTTATTAATTTCATAATTTAACA.CATTGAGCTAATTGGTTTGCTCA 

3 5 AAGAGGAGAGCAGCTAAAGCTGGATGGTGCAGTAAGATGAACCITGGTCCTC 

AGACTCTCCAGCCATTC CITTCATTACrTTGGACCTCGGTCC C 

AAAAACAGCGTCTGCACTCCAC^TGTCCCCGATTACTAATGACCTCCACTGGGAGCAGTTTAGTTAAACAA^ 

TGTATTCTAATTTATGTACTTGTTCATGTTCTTCTTTGCTATAAATGATTGCTGACCTT 

TTGAAACCAATG AGCTT AATGTG TT AAAACTGTG AAATT AAT AAGG AAC TTTTG TT ACATGTGT CT C CAAAAAG CTTT ATTCTGTT 

4 0 TCCTGGGCACACAACGATCTGAGCATCAGAAATGAGCAGAATAAAGCTAGACACTG 

ATTAAAAATATACTCCAATGTCTTTTTCATATTTGAAAGGTACAACCTCAGA 

GGATCTTATCTAGCCCAGGTTGGCCTC^AACTC^TTATGTAGCCAAGAGT 

GGATTATAGGCATATGTCTCCATGATCCATAGCAGATCCTTTAAAATAATGGC^ 

OVC^ATGACAGTGTGAGACTAGATAACCAGATAACCCTAAACTCTAGACACTGGTATGTCTGTGTACCCTAAATT 

4 5 TGGTGCAAGACGTCAGAATCCTTCTTTCCTCAGATGAGAACATTGATGTCTTCIATGTCT 

TATGAATATCTCCCTGGGAACTGTTGGCTCTATTAGTAGGGAATATATATTCACATGTT^ 
GGAAGACTGTAGCTATTTGAAGCTATCCATGCACTCTATTAGAAGTGGGAATTTA 
GTATGTGACTGG CTTTG AACAAAAAATAATAATAATTAAAGCACACACACACT^ 

G G AACAAC CACAGCACAT ATGGCATG AATGTTGCTTG AG AC CTG AG T AT AATCCT T AGGTC TT G AGCAG AATG AG ACAG CTGACTT 

5 0 CCTGTAGCCGGAAGAAGAATGAGTGCTTGCTGTGTACAATGCTCAGGAAGGGCTGA 

AAGACAATTACAAAGGAAGCAACGAAATCTGAACAAATCAGAGACTCTCTAGAAGAT 
TGTCAACCACTTC CTCT AC CTG CTCT CTCT AT ATATTTTTATC 
TCCCTATACCCTCCCCCCCCCCTGCCCTGCrCCCCrACCCACCCACTCCCACTTCr^ 
ATAAAGTTTGCAAGACCAAGGGGCCTCTCTTCCGAGTGATGGCCGACTAGGCCATCT^ 

5 5 CTCTTGAATTGGAGGACTCTCTGCTAATTGTCTTGTCTATACCTGATAGCCACCCTTCACTAAGCA 

TAAAAGACAAACAGCAAGTTTATGGAAGACGTGTAACCTAAGGCTGGGATTTTCCAGGA^ 
TGGCTTGCCTCCATTAGCAAATGTACACATCACGTGAGCATGTAGACCCCAACTC^CTTCT 

AGTGAGAAAGTAATGATACTTGCAAACAAAGCATCCACGTGTTACATTATCTCCCACACACACTACAAACCACAC^ 

6 0 ACTTAAAACCAGCCCTCAGGCTTAATTAACAAGAAGTAA^ 

AT AT ATTCATTT ATTC CCAGAG CTGT AT AACTTCAAAATTAAACT AATT AATT CCTTG T ATTGTCAAG CA 
TTCCTGTGACCGCCTGTTCCTAGCATGCTTGTTCTTAAATGTGTAATGATATAAAGTCTTA 
CTTTCATGCTAATTATCTGAGTGATGGACAATGTGTTCCtATCTCCTAACT 
ATGGCATGAGTGCCAGCTGTTTTTGAAACATCXJATCTGTCATCTCACAAC^ 

6 5 GTGTAACTAGGTCTTAAAGTACCCTAAGTTTCCTTGAGGCACAGAC^ 

TAATTGAACATGAAAAGTTGCTGCACCCCAGTGAATCCAGTTTATAA 

TCGTTATGGAGAAAAGGCCATTCAGATTAAGOUU^CT 

GGAAATAGTAATTTTTCCACCTTCCCTTGATCn'TTATGATTTGTG 

CAGGATGATAGCAAACTGCGTGCACCAATGTGTCCATGTCTCTC^CTACTTTATAACAA^ 

7 0 aatgatcattttattatactcatgaattatgtaggtcaggaaatcagacac^ 

tctagggtctcagctgtgaacagtcaaagactigc^gaactcgaat 
ttcttgg caaac^ cttttttg aacgt atg cttt ctg 

acgtgagatcttagagcagatgcacctgaagcato;tgcacgtgcacataagcgggcc^ 
agcaatggttcacctggatcagggaagcacttga.crta 

7 5 ATTGACTCXJ CCG TG ATT ACTGGT CAGCAACAGCTG CTG TGGAACTGTTGCAGGGGCAAAGG CAG TGTTTAGGGTCACAT AGT AATT 
TAAATATCACCT CTCAGGTAGGACCTACATCACTTCTGCCTGCATTTTAAGG CTTGTGGAGCCACGCCTAGATTCATCACGCAAGG
GTGGTGTCAACTGTGTACTCAGAAGOAAAAGAGAACTACCAATATT^ 
CCATGATGAAATXn^XIATGl'lTCrUTCAAAGCrAGTTTTTGT 
TATCCTTGATGGGCAATGCACGTAACATAAC^TGCTC^ 
5 TTTCGGTGTATGGCCTTTTCAATTATTTCTGTCITTATCTCCCG 

CCCTGATGTCATAGTCATCACATCGGTACCCTTTTGTAATCTGTGATATTGTATAGAGTGTGTAG 

AGGT AGAT ACT ATT AATTGAAGTG T CCCGATT AATTTAATT ATI"! CI '1 AAATCATTTCAAACAAGAGTT ATCTTACATCAAACTCT 
GATAG AT AGAATGGTTCGTT CTT ACTTTCATTT AAAATGT AT AT AGAT ATT AAAAGTACAAG T AGTGGAT AAATCCAGGTAT AAAG 
OVGCTTATTTrrCGAAAGCAGAGCCTCTGGCT^ 
1 0 TTATATTTTTGTCTATAATCAGAAAATCATAGTAACAAATTGTCAAACAA 
AGCTCTGGGACAGGCTCAGAGGTGACAGGGTGGATTTAAT^ 

T AACAACTTT AAATGCCCACCCITC CTTT CGACCTC CAT ACT AG CTGGCCTGGGG ACT CT 

GAGGATTCCCCTGTCTCTGTATCCCATCTITGCTGTAGTAATGCT 

Tt^CTTCTCATGGGTCCTAGGATCCATTCTTCAGAGTTGTAAAGCAAGGG 
1 5 TOTCTTCACAGGTAGCTCTGGCTATCTAGAAAAT^ 

GTGAAACTCTCTATGGATAACTTGGTAAAAGATCAAGGCAC^^ 

AAGAGATAGAACCAAATGAACCCATTX3GAAGCATAGACTAGAAATGCAGTAATATC^ 

GGCAAAAGCCATACAGCAGCATGGCTAAGCTT(nTCATCAACA 

GACAACTACTTGTCCAGTTTTCTGTCCTGGCACATCTATC 
2 0 TGCTAAAACAGGTACCTCTTCCCTAGGTCTGTCAAACATGGTCAATCAGT^ 

CAGiAAGAAGTTAACATAGGTTCTATCAAAACCACACACAA 

GAAATGTCAGGTCTTTTGAGGTCTCAGTCAGATGTAAAACAGACTAGATACTGATTGT 
CATTTGTCTCTCAGGAATCrTAGCTACATTGTGGCAGACAATGA 

CACCTCT CAT AG C C CAATG CTAT ATTTCTTGT AC C CT CCCAGAACTTTGAAGG CTG TCATGGT CACTGTTCTAT AG TAG TG AAGAG 

2 5 ACAATGTGAC CAAGG CAACTATT ATATGAAAGAAATCATTTAATTGGGAGCTTGCATACAGCTT^ CCATGATTAT 

CATGGTGGGAAGC^TGGCAGCAGGCATAGCAGGCATGGTACTGAAGAAGAAGTTAA^ 
ACCTCAAAGCCTCCCTCAAGTGACATACTTCCriTCAACAAGGCCATATCT^ 
GTGTTGACTAAGCATTCAAGTATACGAGCCAATC^AAGCCATTCCTATTCAAACC^ 
AAAAGATTATGTATATGTGTACATTTTCATAAATATGTCrrTACATATTrAAAACTCT 

3 0 AACATGCAGAGGGTCTATAATACACrTTGGCrAGCTTGTGTTCAAATGACATTTTGTTGT^ 

CTTGGAAATATCAAATCCCTACAGAACTATTAATGAAAAATAACTATCTC^^ 
TAGAAAAGAAAGGGAACTTTTTCTGGGGTGCATCTGTAAAATTT^ 

ACAGCAAAAG ATAAGGTTX3AGCTCACAGGACT<MTCAACCATAGGGATTCT AT CATCAAAGCCCCAA 

GACCCTCCAGCCTCTCCCCTCAGCCTCAGTTCTTGCGCGCGCCCGACT^ 
3 5 CACGTTTATTGGGAGAGCTTGATTGCAGAGGCGAAGAGACCCTGAGCCC^ 

GTCT CAT ACC CGG ATTGGTT ATGCACT ACG C CTCATTTG CAT GTT CCT CAT CTG ATTGG CT ACT CTCT CTCT CTCTCT CT CT CT CT 

CTCTCTCTCTCTGTACCrCACAG AGCCTCATT ATCATACCT CATGTCTCACATCTGATTGGTT AT ACTTCTCAAAGCCT CA 

TTATTATGCCCAGGCCAGGCAGTGTCTTTGCAAAAAACm 

GGCC^GCAGTAGTCAGTGCCACTCTGCAACC^CACATGNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNWTNNNNNNNNNNNNNNNNNNN^ 

NHNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 
45 NNNNNNNNNNNNTJNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNKNI^ CGTGCAGCTTCTACCTCATTT AA 

AACTTGCTAAGACCACATGTATCTGATCrACATATATTCCCTGTTAT^ 

CATCCCTTCTCAATAGTTACTGCTAGCCAATTCCCATTGAATGCCTATC 

AATCAGTTGAAAAAGTCACTTCGTAACTTTTCCACTATCCTAA 
5 0 TAGTCTGTTCACAGTAGC!AGCACTAGAGAAAATCTTATGCAGTTGTTATGAA 

AATTTGAGGAAGTCTCTCTTCACCAACAGAGTTTAAATGCCC 

AGTTTCTAACTCTTCACTGGACAAGCTGTCTTATTCCC!AACrCTTCCTTAGC^ 

TCAACATGTGTGATACTCATGACTACTTACCATTGAGAGAGTAAGTGGCAGCCTAGCCAGTCC^ 

TGATGAGCGATGCCCTTAATCACCTAAATTCCAAAACTATTACACTTCCCATGCA 

5 5 TACCT CAGAGGTCTGGGCCCTTTAGCTCCCTCAGTG AC AAAGCAATG ACIACTA^ CTTTCAAACT 

AAAAACAACT ATAG ACAGT AG AAACAATGG CCATCATGACTTTT CTTT CCTCTT AGGTGAAGGAAC CAG AGG CTG AT AAAACTT AA 

ACCCCTACAATAGAAATTAAATAAAACAAATTTAGTGAGTCACTATGGCC^ 

GTCACGATGCITTCATTACACAGAGATTTTAGGGACATC^ 

TXXSTGAAAGGCTTAACTATTTATTTTCCTACCTCAGTATTA^ 

6 0 TAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATGAGTTCm 

AAGTTCCAAAGTACTGTGCTCTTTTATGGACTTTCAATCAGTCT 
GTTAGGAAAATGCAAATTTAATAAAGGCTTTGGCCATTAGGAGCGTGTGTGTCT^ 
TCTGTACAGTAGAGTAGAGGGCTCAGAGAAAGCAATtaGAAATTCACAGAAAGA 
AGGAACTTAAAGAGTGTtnTTAGTAGGAATAGGCACCTGC 

6 5 GAAGGAGTTTCTTCTGAGTTGGCTCAGTTX3TGCACTGACCG 

GCTTCTCCTCTGTATTGTATGCTCCTTTCATTAAATCACCCrCG 
AAAAAAAAAAAAAAAAACOUVAAAAACAGGTATTTCTGTGCAATCAGAA^ 

AAATCAATTTTCAG CACCCACTGG AT AG GG CTGATGTG CGAACT AGCTTTG CGT AGCCCATCGTCTT CTTTCCGAGATG 
AGGACTGGCATTTTAT CATGAATACAAAGAGGATGAGCATATTCTCCTAAT AT ATTCATTACTTC CCTCT CC G AACTGG T AAATAA 

7 0 TATACATATGGTAATTTGAGAGACATAGTGTTCCCCATTGCTCCTTTTTAT^ 

AAATCAAAGGCAAAATTACCTOVAAATTTGATTTTGGAGTCCTT 
TTTCTGTTCTGACCTGTATTATTCAATAGCATGA^ 

CCATCTATTTATTCATTAACAAGCACCrATTAAGTACCTACrCTGTCCCAGAC^ 
GGAAGCCTATC^CCCAGGCCCATTCTTGTTTTCTCGAGTTTTCTAATCATT^ 
7 5 AAGATAGGAAAGACTGTGGAGTGGATGCCGTAAACAGAACTAAATGAATA^ 
GAGCATGTGAGTCTAATTAGCAGGCATAAACTGATATGCTAAACCAAA^
AGACTGGACCTGAAAGAATTGGATTTTTAGATGGGAGTATGAAGG^^ 
GATCTTTTACTCTGC^CGGACTTGATTGACTTGCCA 
TTGTATACTTAAAAGTGTTCATTGACCTACTGCGTTAGATGAATTGGTC^ 
5 TTGAATTTGTGTTCAGATGAATGGACAATGGGAGTAAATCAGTGTATGGAAGCCCCAGCT^ 
AAGAAGACAAATGGGTGGAAGATGTCTCTATAAATATACATGGTTGTTAGGGCAAAC^ 
NNNNNNNNNNTOJNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
1 0 TACATGAAGATATGACAGTCTGGTTTCCATTTATATAATTGTTTT^ 

GTCCTGATTGACCTGAACTCTGGGTTTCAACAGTCCTGATACCCTCAGCCTTCAGAGCAAGCT 
ACTCrCTCTCrCTCTCTCTCTCrCTCTCTCTCTCTCTCT 

GAATCTCTTTTAAAGCATACCTGATAATGATGCTGATATTGGTTAAAATAATGTAACAAATGCT 
TACCTGTCATTTAATGCTCTGAGATGGTGTGGGGATATAGGGCGCTGTTATCGTTCCATTTT^ 
15 GAG AT AAAGTAACTCACTT ATTTCCAT AAAG CAAGT AAGTG GTG C CACT AGAGT AT AAAAT CAAACAACAT AGTTAGAAACACT AT 
CCTTATCTGTCTTTAGTGTTTATTTCTT^ 

AATTTCCACAAGCAAGCACCTTGAGCTGAAACATTCGGGGCTGTTTTAAACT 

TTGTG AAGGAGATTTT CAAG TGATTTCCTCT CAGG CTT CATTT AGAGGCATTGAATAAAGT AAT AGCAC CTTTG AG AATG CACCTT 
GACATCTTAGTGTTGGGCACCTGTAATGATAGCAACGTG 
20 GAGGGGGG AG AGAGGAAGAG AGAGAAGAAAG ATT C CT ACTCTGG CAGACT AACTG TIT CTl'T AACT ATCACT ATT AAAT ACACTAA 
ATTATTCTTCAGCTCCTCCTTCACrGTGGATCCATGCCTTTAAAACAAGC^ 
ATACACCTTTAJGCATTCTGACCCAGGATTCTCAATTGATTGCCACT 
CAATATGCATTAGCATAATTAGAATTATCTAGGGATX3TCAGATCATCTTAAGCACCT 
AATTAGCATTTGTGAAAGGTCTCTGGTTTTCAAGTGTCTTGCAAGACT 

2 5 GCTGCTTGTCTTTAGGTGTTCAGTGAACGGGAAAGATAAACATT^ 

AAAGTAGAGCCACGGATACAGCAAGCCCCATCACACAGATGAACACAGATAGCCCT^ 
TCTCTATTCCAAAAGAACTTCACAATCTCCATAACTACGTTTGGCATT^ 

AGACAGTGGCTCGGGCTCTCATGATGAATGTAGGGTAATACAGCAGCCCAGAGCCATGGCACCTAAGTATGA 
CACTTGGATATGATCCCTAAGGGACAGGATAAGGACACTAGATCAAATT^ 

3 0 TAACACCCAGCCCTGCATCTGCATCTGTTTAATTAGTCTTTTCT 

AAATAGAAAAATAGTAATTTTTCATGGAGTGAGGGCAGGAAATAAATC 

TATAATTAGCTTTCCTCAATCTC!AGTGTCTTCATCCTAAAAGGGCTTACTTTTGGTAAAG 

GGAGTCCAGTGAAAGAGCAGCAGGCAAAAGAACATGACAGCT 

GTGTCTACGTTATCCAAAATGCCACAAAACAATTGATGATAAAACTGGTGGATGGACATATAGTCT 

3 5 AGTACTCAAAGAACTTAAGAAAAGGGGAATAAGGAGGTTTGCCTAGAGTAGAGTC 

CATTTATTAAGTTGAACAGAACAAATAAGAAGCTATCCATCTCTATAAAAAATCAAATA 
TTTGCTCCTAACTGTAATACATACAATAGTGTATTATTTCCTTTTACCCAGATAGATTrCCT 
GAACTAAAGGTTAATTTAAAAAAAAAAAACCCTCAGGGGTAAATTATTACTATTAGAGGAG 
TATTATTTTTCCAAATTAATTTTTAAAATAGTATTTTCATGTATT 

4 0 AATGG ACTACCAAG AAAAAAGT ATACATGT AAT ATTCAAT AT ACAAATT AGCTT CA T AAAT CTCTCTTTTG AGTCATT ATTT C C A 

CTAAAATGTCTCTTACATTGAAATTATTCTATGAAATATGAAATTTAGA^ 

GTAACGCTTIX3TCTCCITTTCATATTTGAAGTG CAGAAACGC CTTT AACATCCT AGGAAAGCCATCTGGATGTCATGATCTGGCTT 
T CTTIGTTCGGAT AAT ATACT AGATTATTTTC'i'lT CTT CTG AACAT CTT AAACT CCT AG ATG AG TT1TTC CTTCAATTTT AAATG C 
TTTTGAAGCAGGACTCATTTTGTTTCGAGCATTTCTTGGAATTA 

4 5 TGGACTACCTTTAAACTCTTTCAAAAGTAAACCAT 

CCCTGCCCCCTGTGGG'nTTTlTTTrCTTGGTAATAAC^ 
ACGAAAAGCAACTTGGGAATGAAAGGGTAGCAATAAGATTACAGGTCT 
AACTCAAACAGTGC^GAACCTAGAAGAATATOCTGATGCAGGAGCCATAGA 
GTTCTACCTGCCTTCTTAGAACACCTAGGACCATCAGCCCAGCCATCAC^ 

5 0 T AAT CAAGAAAATGCACTACACAAACTTGCT CATG AG C CTAG CT CCTGT AG ACATTTTCT CAATTG AG AATT CATCTTCCCAAATG 

AAAATGACTCTAGCTTGTGTCAGGCTGTTATAAAACTCACC^ 
GAATCTCCAATCATGCTGCTATCCTTTTAAGAATTTGTAACT 
ACCCTCTTACATCCTAAGCAGAATGAAGCATGAA 
A<^GACACTTCTCAGCCATGAATCTTACTGGC 

5 5 TGTCCTCTTCCCCACACATTCTCAATCTTTTCTCA 

GACTGCCATTGACTXrTGGGAAAATATTAGTCTTGT 
TTTCAAGTGCATGGTrAAGTACAATCACACCATTATGACAATCCCC^^ 

ATCT CT ATG CT ACACT AACCAGT CCTTTCAAAAAATT C CAAG TCGT AAGT CT AAG ATT AGG CAAG ACAT AG CCTTTGTG CT CTT AA 
ACACTTTTCCAAAATATATTCATTTTAAATTTGTATGTATAAAAATGAATAGCTAT 

6 0 AATGATTCTACTGATTGGCTTCCAGGAGCATCTGTGTG 

CTCTTCTTCTCTTGCTTTGCT ATTTAATGTTT CTTTGAAGT CAAAG ACT AG CTCTT CATG AT ATG CAT CCAAG TGTG CACTT AAAT 
CATAGAACTTATTCTTAGCAAGGAAGTATGTAGTGAAGCACCTGGTCAAATGa 
GCTGAATTTCTAGGAATAGCCAACAAAGTGGGAGTCCTGIATAGGTGAG 
CTCAACTCTTCTTTATTTTTGGATCCTATGTAAAATTTTCTATAAAGAAAA^ 

6 5 ATCTCGAGGAACGTTTTGTCTCATTATTGAGATTCATTGGAA 

TAAACCAACCACCGACTCTCTCCAAGCCTCTITTCrT^ 
AGAAGATTTAAAAGCAGTGAGCAGTCTGCTGCCAGCCTCCTCACCCTC 

CAGGAAGC^GGTGACAGAGGATGAGGTAGTCCAGAGCAGAGACAGGAATATAACAGAAACTGGATCT 
ATCTAGGTTGGACGTAATGACATTGGGAGCTGTGACTGATGGGTTCTCAACC^ 

7 0 ATCTAACAATGACTAAGATGGTTTCCCAAGAGAATTTGATACTGAAATCTCACTCCCTCT 

TTirCTTCTCCCTCTCCCCATCATCCTCCTCCCTCTTCCTCCACCTTCC 
TGTTCGAATATTAGTTAGGCGCGCTTTCTAGCACTGCGGTACATCATCAGCCCTG^ 

TGGCA'fGT ACG TG ATCAAGG CACAGCCACCATG^TCCATGGG TTCTGTT AAAT ACCT CACT CTGAACATGACAAAAC CTCGATAT A 
ACAACATTAACAGCTTAACACGTCACTCGTC 
7 5 TCATTGATTTCCTCTTCGTCTGGTAACCTGGAAATTCTC 
TTCATCCCITCTTCATAACTTTTTCCTTTCACTGACTTCACTAAGACAAGATTCTTTT G TTTl CI TCACTAACTAGTAAATTGCTT
CTTTCCCAACTCTTACTCTTAAAAGCAACAG 

ttgtatttgtgtagacatggarggattgttctt^ 
agatagttagggggactggactgtgtaacatcaatcccaagaaatacctt^ 
5 t agg aggtgtctt agtt ctattg ctg agaagagacactat aaccaagg caag tctt attt aaaaaaaaaaag cattt aattgggg c 
tggctcacagtttcag aggtttagtc cat ccatt atcat catgacagaaag cact cagg cagacacgat actgg agaagt aactca 
gagttctacatctagatcctgcatgcagcac-ggagagagacacaggacc^ 

tgattaatttt ctccaaccaggct at acct actccaaccagg ccacacct actc ca^ ctactc caacaaag ccac 

atgtcctaatgctactccctaagcattcaagcatatgatcctatgggagatgttctt 
1 0 ccctataaaggagtagggctttcaggccaaacttacttt 

aacaggttgtttttat aaatttcaac cactggcttgct atctt at cactcaaccttttaccaatgag cag attt caaccaaaacaa 

CATTTTTTTTTGATTAGTTGAGTGAAATCTCAATTCATTGCATCT 
TTCCCATCAAGGCAGCCTTTTGCTATCCTGTAGCCAAAATAATAAATACTGGTTGTT 
TTTTAAATTATTCTGATACAGCTTTGGATGGAGAGTGAATAAAAA^ 
1 5 CAAGAATAAAGTGGTCTAAACAACCCTTACACACTTGCT 
CTCTTGATTTACTCTTTATTCTCATGACTGAAGAACTC^ 
CAATGGGGACTGTTTCTGAGATATCCAGCTTACTGGCATATTCT 
CAAGTAACTCTGTCTTTGAGTAGTTTGCAGATTTATCACTTGTTG^ 
GGTTTTATAGGAAAAGCCTAGTTGTTTCCAAAAGCAAGGCCAATTAATAGAG 

2 0 AAATG ATCTT AAAGTGAAGTTAATTAGGAGTGCAGAGAGCCAG AG ATGATAAATTCAG^ 

GGGCCTGTGTCATCTCCAGTCCTAAAGAGACCGGCTGCTGAAGAATGAGACTCAGAAAAATG^ 
TCCATGTCAC^TGCACATGTCACATGGCCTTTCATGACCTATATTAAAGT^ 
ATATGTGTGTAGGTTTATATATGCATGTATATATAAATGAAGGGAGATTTTrTT 
TAAATTCAATCTGGAGACCTAGGAATGACAATGATGTAACTCATTCTACATCT 
25 TTAGAGGCCAAAACCTAAGGTACAGAGTAGAGACTGGGGGAGTCTA^ 
CAAAATGAATCTGACATCCACCAGACTTTTAGTTTTTCT^ 

AAAAAGAAAAAAGAAGAGAAAAGAAAACAGGAAAAACCATTCTCCTTTTGAGTCAAACT 

GATACATCAGAAATAGTATCTTACCAGCTATOUUW3TACCTTTTACCCTAGTCAA 

ATGTCCCTGTCCATAGAAGACAGGCITAGCTGGATAATACCCA^ 

3 0 AGCCTATCAAGCAGTCTGGGCCATGCAAAAATCACX3AAGCCTCA 

AAGTTCTCAATT AT AT ACCAGTG CTCT ATG AATG CAAAAAT AAGGCTTGGAAC ACT CTGC CTTG ACCAGTTT AAG CAT CAACAG AA 
G CTGAT CGCTGG ACATG CCTT C^TCACAACTT AC ACA.TT AACI^ CT AATTCC AATTTGTTTT CT ATTTTG A 
ATGGCAAAAAi irrCAACTGGGTCiCTACTTGAi i i l ATCAAGAAATTGATGGITCTGTTTGGTATAAAAGTAi i riGGrTCl TTT 
TCCAGAATATGGTCATCATTACTAGAGATATAAATTGAGCTAAATAGGAGTAAAGTATCATGTCTG 

3 5 GATAAAATAAAATGTGGCTAGATCACATAAGATGCTGTTTGGGCTGG 

TCAGGGACCTAGGGGAGACAGATATGGGTAACATTGACTGTTCrrGGG 

GTAGTGTGTGTGTGAGTGAGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTGTG 

TTTCCTGAAGCCTTGAGAGATTTAAATCTCTTCACTGAGAGTGTCTCT^ 

GT ACT ATG ACTG C C CTCTTGG TTCATCTTT CCATTG CT ACTGC ACAAGT CC CAG AATTTTG AAT ATCGTTGGGT CACCATTTGTC C 

4 0 TATAGGCTTCTGTAACCTCCTTAATCATTTGTATATAATCTAGTGTGTGTTCATTACA 

CTGTAGATAGATTGGGCTTTCTCATATTCTTCCCTCTACCCTCCTTGT^ 

TTTGTTTCnTGTTTGTACATATGAATTTAAAACTGTGAATTT 

TGTGTTTCCATCCATTTTCCTGAAAATGATAATACTTAAATTTTTATGGCT 

T AT CAATTT AC CT ATTGAAAAGT ACTT AT ACTGCTTTT ATAACT CGG CT ATTC CT^ CATTTTTCT AATTCTTTCT AAT AG ACA 

4 5 TCCTATTAAATAAAACTGAACAAGGTATTGTACCAATGAACT 

TTTGGAAGGAAAATAGATGCTGCCCTCGAGTAGGGTGCAX^ 
TGGTCCATCAGTAAGTCAGAGTTGTCTATAGAACACATGTTTATAACCCT^ 

T AAGGTTGAT AT AGT AAAGTTG CCACAATT AGGTGG CIT AAAAAAACAG AAACTT ATTTTCT CCAGGTT CCAG AGTCGAAC^^ 
AGAATCTAGGTTTATTTGGTACTGTGCTCCAATTGAAACCCTAGATAGAACCCATTCT^ 

5 0 AACrTGCCTTGTCATTTCTCAGCTGAAAGCTGCATCCTCCAATCTTCATCTTCATATGTGA 

CACACCATGAGATTAAGGGCACACCCTATTTCAGGATGACTTCATCT 
ACATAGAAGGGCTTCACCATCTCTTTTTGGAAGAAGCAACTCAAACCA 

CATTTGAATCTTTTAT CAT AGCT ATAGTTCCTAATTTT ACAAAAATGAAGGG CT AAAGTTT ACTT C CTG CCATGC CTGGGAAATAA 
TATTCAGACAACATGTTCCAGGACAATGTCTAGGATCTTTAGAAACACATACTCT 

5 5 GAGAGTCACAATAGAGACATCGGATGTAAGCTTCTTCTGAAAAGAATTAATGATGC^ 

C^CTCCTACCTAAAGTCCTTCCACTCCATCTGAATCACTGAAAGCTTTA 
AACCAAATCAGGACAAACCAACAACCCAATCAGAAGTATTT^ 

T TTTCTG CAATG AGGG AG AAAT AGTG AGG ACCTTTTGT ATCTCT AGG T AAAT ATT AAAT CTG CCAG AAG AGCCAGTT AC C CAACAG 
GT AGTCAGCTCACAGG T CAGTG CTCATTGG AAG ATGTT CAG AG C CAGAAACCAATC CCAG CAGGG AT C CT ATG CAAATCT AG CAAG 

6 0 ACCTTGGCTTTCATCTTCTATTTCCTGAAACAAGCACCCAAGTGTTAACTGGCT 

TTAAGATTCTGGCTGATTTACATCTGCCAAAAAGGGGT 
TTTTCGATTTT C ATT CCCAACCTG ATG CTTG ACACTCT 

TCATTAGCACAGAATCAGCTAAGAAAGAACTAGGATAAAGATGGAAGAAGGGGAAATGAATATCATAAGACTAA^ 
AATATGAATTACAATGATTAAGG ATT AGCATTAGAAAGTATGTGGGT^TCACAAAG CTACTGGAATCTTTTTAGACTTTAGAG CTC 

6 5 TCACCTTTCTTGAAGTCACTTGAGTTTGCCATCTTGAGATCTTGACTATT 

TGTAAGAGGTGTGAGAAGGTTGCATGATTGCTTGTGACTCTTTTGTTG 

gaatgaatggaacaatcaccatatttgctgaataaatgtctttgtcca 
gttcacttcctacaagtctggacatgggaagtccaaju3ctgaaagg 

7 0 tccactactcaltatgcagacatccctcrcatacctcatcgactctta^ 

ggtggtttgtgggttatgagac<;acaaacacgggaagaacctagtga 
gtttcttcaataaactgagcaaatgagtaacagtaagaaaggaatagtacc 

GGAGTGCACACGACCCCACCCCTTATCTrrCCTTGCCAGTTT 
AGAAGTCACAGAGTTGCAAG<jT"TATGTCCTCAGTCA^ 
7 5 ACAAGGCAATACTGGTAATAACCAAAACTGAGGCAGATATGGTTTGG^ 
CITGGATGCCAACTTCTCATTTGTCAAATGTTTGATGGGTTTAGTCATC
TCTITACTTATTGTC^TCTATTCAAAACAGGTCGGCTAAGC^^ 

AGG CT AGTTCATTTTTT ACT ATCTATTGTT AT AAATTCATG^ C CCATCAAATGC CTTGAT 

CCCACTCATGCTCATGCTACTCTGCTACCCACCGCAGCCTCTTTTCTTG^ 
5 TACCTTCATTGTGATAAAGTGTACATGCTTAAAGCTCATGTTTAAAATGT 
AAACTGTCATAACAATCTAATTTTAGCACATATCACCAAGT^ 
CACTCCCGTTAGCATACAATACTCAGTCTCTGGCAATCTCTAAGTCACTTTGC^ 

ACTG CCT AG AT CAT AAAAT ATTTCCCCT ATATTTTTCT ACAG T AAATTGAACATTG AAGTT ACAT ATTAAAGTCTTGATCCATT^ 

GAATTGATTTTTTTTTTTTTTCGAGACAGGGTT^ 
1 0 TGAAATCCGCCTGCCTCTGCCTCCCAAGTGCTGGGAT^ 

TGGGAGATACAGATTTAATTTCATTCTTCAC^TGTAAATACCAGTTT^ 

CCAGCTTTGTCAAAGATTAGATGACTGGAGCTGTGT^ 

ITTGCTATGATATCAATAGCAAAAGTAAATTCrATTAGTGTTTATG 

CTTAATGGTAACAGCATCAATAAGATTCCGATGTCTATATCTG 
1 5 GCTT CCCATTG TAAT AAAACAT ATCTAAAATGTACAG ATG AG TTTGGTTTTGTTTATG^ 

CAGCTGAQ3TGTCTGTAACGGAATGTTCTGCCAGTCT 

TTAACTTCATCTTCTCACAGGCTGAGTATAAATGAGGAGACAC^ 

ATTCTTCCIAGAATCACTATTCAGGATGGTCTTCCITAAG 

GTTCAGGAGGCCTCTTGTGGATGTTTCTTGTCT^ 
2 0 TGACTTTGTTCTTATTTCCTAGGACTATGCTGTCAGTGATT^^ 

TATTGTGTACCAAAAAGAAAGAACAACAACCCAAAAAGTTGGTTAGAATTAG 

TATTTATACAACTTTCAGATGGAGCCCCTCTGTTGGCCCATGC^ 

GGATAC^CTACAACAACATGCTCAAAGAAAAAGAAACCTTGGTTT^ 

TTGAGTGAGTTGTATCATTAAGCAATGATGTCACTAGAACTCATA 

2 5 AAAACCAGGAACCCCCAGCACAGTGAGTGCACGGACT CATTGCTGTCAG CTGAGGAATTGCTCTTGTGTCCCTGCTACTGGAGCTC 

TGACTTCGTGATCTAAGATGGCCATCCCCCTC^CC^CTCATCCATCAGCAGC^ 
CTGAAGTACAGTGCGTGTCATATGATGAGTTCAACTACCTCTTCGAA 
AGGAGAAGTCACTGTTGTTGACACAAGCTATTAATTTACTGCTCTGA 
GTTCAAGGGACAGAGAGAAATATTGTTTAGATGGTCCCTAATGAGACAGAAGT^ 

3 0 ACTGCTCCCTAAACAGGCTCCCTTATTAACCAGCATAA 

GATACTAAGACGAGAAAGTATTTACTCTCTGATACTTCCAGGTAAAGAGGGGATGTTCT 
TTCTCTTAGTTAATTAGAAGAGTATGTTCTCTCCTGGTGTACGGCATT^ 
ACCTCCTCG<^AGGATTTAAAGAACAGGACACAGGTGACIATTT 
TCGAGGTGAGAGTTTCCTGCGTCTGGAAACGGAATCTAGTCCTTCACACCC^ 

3 5 AGCCTCACATATTTAAAAAAAAAAAAAAACAATTCCGGATTTTATGTG^ 

G AAAT AAGAATGG ATTGATTTGCATT CTT CT ACATG CT AACCAC CAG TTG AAAATG CT AT CTTTTTTTCCACTGG AT AGTTTT AG C 

TTCTTTGTCAAAGATCACGTGACOVAAGTGTGTGGGTTCATTTTGGGGTCTTCAATTCT 

GCACCAGTACCATGGAGTTTTTATCACTATTGCTCTGTAAT^ 

GTTG AGAAT AGTTTTTGCT ATC CrGGGTTTTTTG TT ATT CCAATTAATTTGCAAATTG CT CTGTCT AACTCTATG AAGAATTGAGT 

4 0 TAGAATTTTAATGTGTGAGTCTCIAGCCTCACAAATTTTTTAA 

TCACTTAGATGCTAGGTTTTTTTCTTTT^ 

TCCCCCACCOTCTCCCCCACCCACCCACTTCCACITCTTGGCCCT^ 
ATATTAGTAGGATGGCCCTGGTTTGCGAATGCTCCACACTTCtACCTC 
CAAAGCCATTCCGTTCGGGTCCACTGCTGGTGGTCACCT 

4 5 CCAACTTCCAAAGAACCACATTTGTGCTGCAGTTATTCATTGGACT^ 

CTCTCTCTCTCTCTCTCTCTACCTCrCrCTCT 
CTCTGAATTGTAATATTCCATGTTGGTCAATCAAGGCTCCCrrc 
TCTATTTATTTTATATCATTCTCAAAAATATTTAAAGTAACTCGCTTCCAATATG 
GGTTATGACAATAAGAAAACATATAACCCTTTTGGTATTTCCCTGTTACA 

5 0 TGAGAATACCAACCITTCTAGATGCTCCTGGTAAAGCTAGC^ 

CACAGG CTGGT AAG ACiTTCTTG AGT AGCAGTGTGAT AGTCT AC CTGTGTT CTC AACTTTGGTTT C C AAAG AAATT C CCATCAT AA 

TCTAGTGAAACTCCTAGCAGTGGAGGGTATGGAGCCTGAAGTTGCCTCTTCCTCTA 

GATAACCCACCCACAAAACCTTGGGCCCAAAATTTATCCTGC^ 

TT CT CAAC CAACG ACTGG CTGGCTTG AAATCCATT CCGTGGG CAAAAAC C AAT C C CTG AT ACT ACT AATGGTCAG AACCTGTTATG 

5 5 CTTGCAGACAGGAACCTAGCATGGCTGTCCrCTGAAAGGCTCC^ 

GT^AGAACTGAGGAATCTTATGGAAGAGTTCGGGGAGAAACTAAGGGCCTC 

TCAACTAACCTGGACCCTCAGGGG CTCCCAG AGAATTAATCACCAACCAATGAGCAAG CTTGGGTTGGACCTAGGTCCCCTGAACA 
TCCTTAGCAGGTGTGCAGCTTG^CTTCATGAGAGTCTTCCATCAACT 

G ATCCTGTTTCCCT AGCTGG CTGCCTTGTATGGCCTCATTGG AGGAGAG AATGTG CCTTGTCCTGAAATGGTTTG ATGTGTCAGGT 

6 0 GGGCCTCACTGTTCTCAGAGGAGAAGGGGACAGAGGAAGGAGGGGAGGAGCTGTGTGAGG 

AGAATGTAAAGGGAATAAATAAATAAATTAGTGGAAAAATGTTTATCAAGAATAGTGTACACAGCCACAAT^ 

CCATGAATTTAAAAAAGCAATGGGGTTATGTGATAGGGATTGAAGGAAGGAAAAGGGAGGAAGA 

AAAATGCTCTAAAATTTATTTAAATGTAAAAAAAACAGTAGGGG 

TCIATGAATCCATACAAAGAATAGGAGACTATATGAATTCAAGAATTTAGGTTGGATGTGAGTAGT^ 

6 5 CCAGCAGATT ATTG CAAAAATATC CAAACATTGGAG CAAAAGTTCT AAGTG CAGATT AAATTG T AAGTTT CTCCAAT ATT CT AG AA 

CC!AGAGTATTCCAAATACTGCCTTCrTATGAGTTCTGTTTATC^ 

TTAAAAATTT'ATTCTAGTTCTGGCCAGTAACAGAAAAGTA^ 
GCTGCCACATAATTTCCACTGGTTTGAGAAGACAGTAATTTGGGTGGATCCTT 

7 0 GTGTCTGTG TTGGAAAGCAAAGGGTAG AGTCT AACAAGAAGCAACTAC^ 

TACCTTTGGGTGATAAAGCCATAGGGCTCTCACCAGCTGTGAGAAAGAGTATTTTATCCT^ 
AAATATAAGGCAGGGGGTCCACTCTGTTCTCTCAGGCCTTTGAATA 
ACAACACATAGAGTTCTTGAGAAGAACATGTTGTCTTCCT 
7 5 TT CCTCTG ACAAGTTGCATCTAG CTCT CAT AATTCTTT AAT AT AACA'rITT AArTTT 
GTCCTGTATTTACATCTCCTTTCTCTCCCTCAATCCTTC
AAATTGTGTTTGTATACAACCTCCTGAGTTCGTTTAGTGTTTCTTTTATGTA 

CTTGTGAAATTTTTCCTACCCACATTGGCATGTAGACTGACCTTGTCATTAT 
5 TTCATGAGAGCAGCATCCCTGGCATGTCTAGAAGATACTATCTAGCAGCAGGTATTTTGG 
CTCATCTGCAATAATCAAACCTCAGGTCTAAAGATTGTATTGTAG^ 
TGAATTTTGACCAACTGTGGATCCCTGTAATAGCCTCTTGGATGGCATCT^ 
AGTATAAAACAACACAAACCATTGTCATGACCCTTGGTTGC 
TCCAAACATACTTCAGATGAGGATATGGTGGCATCAAGCTGGGACTGACCTGA^ 
1 0 CTAGAAGTAATATGCAAATTGCTAAGGAAGAAAAGAAATAAATATTCCTTCCCAGCTAGAAGGCCT 
GCCTGCCAAGGGACCTACAGTGGGGTAATAGTGAAACTATTATCTTGGAGTCATCAACAG 

T ACTAGAAGGAAACT CAT AC CTGAT ACTGGAAGCCT AATGAACACATGG CT AGAC CT CAT AAG AGAAC CTACTATGATTTTTTTTC 

TAAATCAATAAATTATATAATTATTTTCTrAAGTGAATA^ 

ATGTCATTCCrrCCCTCTTCCCCTTAACACTGTGGTTCTCAACCT 
1 5 GGAAAATGTGACTATTTACTGTATGAACCATAACAGAAGCAA 

GGTCACAACAACAG^GAGGAATTGTTTTAAGGTTOVCAGC 

TCCACTCCCCATCCACTTTCAACTTCIAAGTTCACCTTTTACT^ 

GGGAGAGACCATACACTGAAGCATGCTCCACCTATCAAAC^CCACACC^ 

AACTATGAATAGCTCTTCAGGCTGTGATGGGAGTTCATGAACCCTGCT 
2 0 TCTCTGAGCTGCrGGGAGCrCAGGAGTGGGGTGGTCATGTCTTCTCCAGAAGGCACl 

C CCTT ACAATCCTT CTTTGCTGGTT CCTGGAACTGGG AGAAGC AT AT AG CTGCTG CTTTT AAAACT CT AAAAGG CAAGAAAATT CA 

GTTAAAAACAATGGAGAACTTTGGTTGAAGCCAGGATAGGTTTATTTC 

GAGTGGGAATGTGTAATTGAAGACGGCGAGTAACTATTTATCATAAGCTAAAATTTAATATAAAG 
CATCCTAAAGGGCTAAGCACTATGTAGTGCTAGATCCTTGAGTAGAGAACAATAAA 

2 5 CTCAGATATTACCCAAATACGTTTCAGATAATCAATAACTGTTGGCTGAGTGAATATCT 

GCTCATTTTCCATGATGACrrnTACAGTTTTrCTCC^ 

GAAGCTTGGTACTTATGAAGATTAGACAAAATCGTGTGTTCAAGTGCTCATC 

ACTAGTGATTTTCTTCTTTTTTCTAGTTATTCriT^ 

CCCTAATCCACTAATGAAAAGAGATCTGGAGTTGCAATCAGTTCCTACTCT 

3 0 GTGCTGG CAC CAAT AAAATAT ATG CAAAATCCT AAAT ATTTTTTCACTC CTTCCT 

TTCTCCTTCTCTCITAATCAGCCCAGTGAAGGTTTTGTAATT^ 
GGATTTGCCTCTGCCACATATGGATCAAAATTTCCAAACTGC^ 
GGTCTGATGTTCTTTGTGTTTCTGTATTTCAGGAACAGAAGGGGGG 
35 CTCTGTTAGTGCTCTATGTGACTGGCTGCCCCACTGAATCTGTTGGCTCT 
AAGATTATGCCAC!AAATGATTTCTTGGATGCATGTATGTGTTCITCC 
ACCTGTTGCCAGAATGCATGAATCCTCTCTGATTTTCTGAGTATATCCGAAAA 
CITrGGTAAATTCATCAGTTACAATGACACCAGCAGCTCTGGATAGAC^ 

GAG CTTG C CATG CAG AAT CT CGTCTGTT AGTT ACAT AAAGACCAAAACTG TGAGCTCTG TG CTC CTG AAAAGATAAAATT CTGG AC 

4 0 TTGCTTGCTTCATGACACACTATGCAGACCGTGTCTCCTTTTATATTGAGAGGCAAAATATTGAA 

TAATAAAGGCACTAACAAAATTACACTTTCCACTCTGTGTTTTATGCCATACTCCT 

TTT ACTCAATTT ACTCAAGTGTGTGGGTGGGGGTGGGGGGGGTCATTGTTTC CTTGAATGGTTG AA 1 ' 1 AAAAACAAATTT ACTCA 
ATG T ATTTT AT CAT ATTT CT C CCTC CC CATT CCTCC C7ATAACC CTCTTT CCT AAC CAACCT ACTT CACTTTTTTCTTGCTT AAAA 
AAAAAAAAAAAAAGG T ACATTTTGTGTGG T CCACCT CTTC CTG AG CATGGGG CCTTC CCTGGACTGTGGTCG ATATG CCCAGTG TC 

4 5 ATTGCATTCATCGAAAAGAAGTGATGTTTCCTTTCCCTCAATGACTG 

ACCCATTTTTCTTTCCCCATACACTGATTTTGTCCTGTTCGAACTTG^ 
TGAGCATCAGTCCTGTTATGTCTGAGAAACGCTGTTTCCTTGAAGTCG 
ATTCCTGAGCCTTGAGGAGAGAGGTGTTATGCAGAGATTCCATTTAGGACTGAGTGCT^ 
CCACTTGCGGGTCTCTGAGTTAATTGCCATCTACTGAAAGAAGTTCTCTGATC 

5 0 CAGTCATTTATTTTGTTAGAGGAGTTATTCTGTTGCTATGTTC^^ 

AACCTCATT AACAGTG CTG ACATGTGTT C CATCT CAT AG AGT AAC C CAGAAAC CTT AC CAAACGAG T AGTTGG TT ACG C CAAT AG C 
ACCTATGCTACTATTGCATCACTGAGTGTATCTTGAAGACAGATTGCTACTACAGCTCTCAGG^ 
AACTGCTTTTCTTCTCCAGTAGCTTGTATTATATCATCACCCAGCACGATG 
TACCAGGTTGGTTTCTCCATGTTCAATACCATAAGTGTGTGGTGTCTTCAGTAATAATCT 

5 5 CAAT AT CATTGG CAAT ACTTTG T AATATTT AAGGGT GGTCTGTGGAACTG TTTTTG CCAATGACTCAAAAAGATAT AAC CACT CCC 

AGGTATAGGTTTTATGGTAACATATGATGTCCAGTTGAGACATTGTCTCCCCCATCCTATGATAACT 
ATACGCATATATTTTAGGAAGCATCTACAGGAATAGGCTTCTGTATGGCTTTTCAA 
TGGTCCCTTCTCTACTCTGTCCACCTATCACCCTCTCCATTTAATCCTCCTATTCTAAT^ 
AGTTATTTCCCATTCATGGTCCCTAGAAGATCACCTCCTCCTTCATCCTAGCTCTCTA 

6 0 ATTGAAACACATATATCTAAAAGCTAACACTGGCATATGAGGGAAAACATTCAATATTTGTCT 

CTG GGTGATTGTTTCT AGGG C CAT CCATTTGC CTG C AGATTT AAT AATTTCATTTTT CTTG AAAG^CAG ATAAAATTTT ATTG TAT 
AAATG T AC CT CATTT ACATT AT C C AATTT ACCAG ACGGT AG ACTTCAAGG CCTTTTC CAATTT CTC C 
TATGAGCAGGGGGCCCTTTTGTATATGAACAAGAGTTTTATAGATGGATCTTGAGATA 
CCGTACTGACTTCCATGGTGGCTATATAAATTTC 

6 5 TCTGCTGC03TTTGTTTTATTTATCTTGCCCATCTTGACT 

TGATTAAATATATTGGAAAATCTTGTTTCTCACTCAGTTGTGTATTATui 

GGG AT ATTTG L"iri ATT AATGTTGAGGGTTTTlT'ITT'nT'r ACTT L~i'i 1 GTAT ATTG TAG AT ACTT ATCCT CTG T AAGAT AGAT AA 

TTAGTAAAGATTTTTTCCTATTCTGTAGCTGATGCTTTACTCAAATAAT 

TTCCACCTATTAACTGTTGGTTTTAAACCTTGTGCTATCCAGATCCAGGGCCAGAAAGTCT 

7 0 TATTCCTTAAGTTCTCTTCTGTCATTTTCACTGTATT^ 

AGCAATAGATATAGATATAGGTCTGTTTTTTTTTCTTCAGCATC 

TGTGTTTTTAGGCTTCITTGTAAAAAAAAAAAAA 

ATTCGTGCATTTTCAATGTCTCTGATTTTATGCCAATGCTATGCT 

G AACCT ACAAAAAATGAAC^T AT ATGTG CATGAC^T AC CAAGATT AAT AAAACAACTT AAACAG ACT CAT AACCC 

TTAGTGAAATGCAAGAGTAATTAAAAATCTCCCACGTAAATGAACTCCAATGCCAAATGG
t<^tgaagaactaacm:caatactcctcaaattattc

TCATACAGTCACAGATAAGCTTGTCTTTTAAGAGAGAGCAATGAATGAGACTATGCA 

G AAATCTGTCACAACTCATCAACGTGAAAAAG AAATGAACAAGAA CTCAGTG AAAGG CTGAACCATGGAGG 

AATAATATGGCTGGAGAGACAGACCACCACTAAAACTAAGTCACAGGAAAGGTAGTO 
ATTTCCCATTCTCTACCTCATGCCCCATGCCTCTAC^

AAGGCAAGAGAGCCTAGCAGTCAGAACCATCAAACCCAGAACAGAAAAAGAATAAGC^ 

GGCATGC CAGTCAT CTG AT ACAT ATTGAATGAAT AATAG AGGTG CAAATGATG AGAGC CCAAGACCCAG CATGCAGT AAAGG CTTT 

AGCGGCTGCTGAAACTACAATTCCCTTGCTGTCTCTGTGCTCCGAGTCT 

CAGTGCAGCCTCTGACCTGTGTCTTGTTGCCTCTGTGACTTTCT 
1 0 AGTGTTGATGGGAACAGGGAGGAG<^TGGGTTGTTCCGAACAGAG 

T AAAACACATTCT AGAAT ATTT AAG AGA'rrCiT l'T'l' L'TTI ATL'TTTr IT CTT1T1T AAAGAAAATTTGGTGC C ACTC CAC CCT ATG 

GAGAAAATTTGGTGACTTACTTCTAGGAAGCTTGCCCCAGACATCTGTGTCATAAGCT 

ACGTCTTGTGGATGAACATCITTTCTATTCAAATAGCTTTCATCCTATT 

CAGAGCAGCAATGTTGACACTGCAGTGACTGTGCAT^ 
1 5 GGTCACCTTCTCCTCCITAGCTGCATTAGCAAGAACATCCAGCACTAGCT 

ATGCTAAGAAATTATTTGAAAGCCTTTTTGGCTGGT^ 

TAGCTATGAATTTATGTTCATAAATAAAAGACATTTTAGGG^ 

TGCATATATGTGTATATGGCCCTTGTTTATGCATGTAGAGGCT^ 

TAATCTAGCTATATTTGGCTGGCCTAGGTGGCCAGCTTGCTACAGGGGT 
2 0 CATAGGTAGGCCACCATGGTTCCCCAGGATGTACATGAGTGCTCK5AGATCTGAAA 

ATCCACTAAACAGTCATCCTAGGCCTAAAAACCAGGTTTATAAGGCAGTTTATGCCATATAGACAC 

GAAAGAGTTCCITTCTATCTAAGATCAGCATCTCCAACTCTTATCTACTTT 

TCTGGAATTTCATCCAAAAGCCTAGAGTCAGAAATGCTAGATGGGATAAGATTGAGGCTC 

AAGACAAAAGCCCTCATCCGACATTTTACTGAGTCTTC 

2 5 CTGATGAAGGATCTAACTTGAGCTTCACATAAAAGTGAGTCTCACTCAGACAAGT^ 

AGGGCCTCCAATATATTCCTTATACTTTAATTCCAGTTGGGACAGCT 
TAATATTAATTTCATGACACACACACATATATACATATATATA 

CACACACACACACACACACAT AT ATATGATGAATGCCCCATTCTTAGG AGAAT AAACTC^ 
CGTAACTATGACTTACTCTCTGATGACCTGGGGAAAATGTAATTTTTGITACATA 

3 0 AATTCGTGTGTTTGTGTGCAATACGTGGGATTTATTTTTACAACTT^ 

AAACAAGTAAGTCCAGCTTTTCTAATGCTCATCTATGCTCTGCAACACAGGAAGCT 

AAGGTAAGGAGGGTTTTACCGAGTGCCGACATAATTGCCTCCCAT 

GATTCTCAAAGAGCATCATCTCAAACCATCAATACTCACACAGAGCAGTCCACTATCCTCAA 

3 5 CTGTTTCAGAAAACCTGATGCTCGTCTCCTTTTCTTGTTTTTCTCAAAAGCTrTTCATGGTTC 

ATTTTTATTTCCTCTTTATACTTGTCTGTACTTGGGAGGATTTTTT^ 
ATTGCATCTTCTTTTATTTTTTCAACCAGCTCAATACCAATTCTC 

CACTACTTGGGCCTGATCCGTGCCAGCCTATATCACAATACTTATATGTGGTGGGCTCTCAGTTTTCCT 

GG AGCC CTGGG AGC CAG AAC CT GC CAG CAAATAGGTCATGG AG AAAT CAC CCACACAAAG ATTG CCT CTG CAT AT AC CAC AG TCAA 

4 0 AGATCTAGGCTACTTAAGAAGATAGCAGATTGCTTTTGGTCTTTGAGACCTT^ 

ATT AAG AT AGAAAACTGACT ATT ATT ATT AG CTGAAGAAAAATCT AAG TGAAACGTTCCTAAGATTT CTTTG AG CCACCTTG TGAT 
GGATGAATAGTGATTCTCCTTTAAGGGCCATCTTTTCACTCCAGTC 
TCAAGACCTTCTGAGTCAAAGATCTTTTTGTCACAAAGACT 
CAAACCCAGCTTCAACAACCAACAGTGTCrrAGATGTTGCTTAAGGTCCTGGGAAG 

4 5 AATGTGTTCAGACACTGCCTGATGTCTCCTAAGTGAAACAAAAATCCCTTCT^ 

TGCCAAGACCTTCTGTTTTTTTCATTTAAGTGGATTGAAA 

CCAT AAACACTTTGGGTAT AAGAAATCTCTG TGGCATT AATG TCATTTTCTAG ACAAT C CATT AACAT AAGG ACTG GTGCGT CAAT 

CCAGCAAGAGTTCTATGCATTTCrTCCTACTTTCTTGGC^ 

AGAGTCAAAACATTATGCTAAGTTCTTTAGTAATCITTTGCCATGTGGAT^ 

5 0 CAAACACAAGAGAAATTTGCCnSCAAATTA CrTTTU ' l ' lTTT A Cl ' ru-l ATGTGTA 

GTTACCTCTGACTGACTTTTCAAATCTTTTAAACATCGAGGCATAG 
ACAGCAGGTGAAGTTTTGGTAACAGAGAGATGTTCC^GTGGTTC 

ACATAGAT AT CTACTCTGCTTG£TCATCTCTAAGTACCCTTGGGGCCTTTCATCC CTCTGGTTG ACCTTCTT CCCTAGCATCT ACC 
CTAGTCTCATGTCCTCCACCCTTTACCCAACCTGGGATGTGGTATGCAAAGATTATACTCATGCCAGTGGATATC 

5 5 AGGCAGAAACTTGAAGGCAGTTAAGACAAGTCAAGACACACRGCAACACAGCT 

ATGTTNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNhHTONNNNNNNNNNN^ 

CAAAGCAAAGC^CCGCCCTACCTGGCACCATACATCCTTITACAAGAACTCTG 

TG CAGCTCCTG TCCAT ACACTTGT GACTTT AAATCTTG ACAGAT CTG TG AG AT AT AAAG ACACAGAATAATC CAAG AGCACTGTGG 

6 0 AGGAGTTGTCATTCACTAGGTGTCTTTTGATGCTCATCTAAAGGTCTTC^^ 

CAT CCAATT CTTGC CACTTGGG CITCTACTTTGACT AAAAT ATGG T ATC ATTCAAAATCTG TGTG CTAATCTTGGCATAT AG ACAT 
GTAGCTCGCTGATAGATTTCTTGCITAGTATGTGTGATACT 

GTTGCCTATGAGGGTTAATACCTTAGCTTATGCATCATATTTGCTTATGAGTTGGACrT^ 

TTGGAACTCAATGGAGACTTAGi n TGTTTTTGTT i i rGTl'riGTTTTGTTTTGTTT"rGTAGTAGAC^GCTGAGTGCACTGACCC 

6 5 TCATTCACTATGCCACTAGCCCCAAATACCAATTTAAATATAAGCCTGTTGAAAT^ 

TTGAAT CCACT CC CAATGTT CT AT AG CTG CACATGCTTAGTGGT C ATGTT ATTGGG CAGGG AATATCAAGAAT ACTT CCATT ACAA 
CATAAGTTTCTATAAGAACAAGCTGACCATCCC^TGAGGAGCTCrGG 
GGGAGAGAGATGACAGAACAGGATTCTAAAAAGAAAGTGAAAGG 
ATACCCAACCATGAACAAATGTACAACTACTCTACCACAAAACACTAGGA 

7 0 ACCCATTTrCTCCCCTCTCTAGTCTTTGGAGTCTGTGAACTCATT 

ATCCTAAGCTGTGCTCACACATTGAAGCCAATGAACTCCCAGT 

TTT AG AG CT AGGATT ATT CAAACOULAGAAAACAAT ACAAGG CAAAGG T ACC ATCT CT CTGGG^ ACTTTTG ACTCT AT AACAAG T C 
CAAGGAACACAGGGGTTTCCAGGACAATTATTGTCCCTGTAATGGAGCAGATAGTGTG^ 
TTATCCTTACCTAGCACATTGTCAACAAATAGATTACTTCCACAAA^ 
AGAAGAGCAGATGTGGCCCTCTGTACrrTACATACAAAAGAGTGAA
TGGTGAAATTCAGGACCAACAATAGAAAACAGCCACCTAACCTGTCT^

AGCACCTGTTCAGGTCCCTAAGTCTGT(^TCTCCAAGGCTATGACCTCCTCATAGATAATGCTGCAAA 
CCCTGTACAGAGTCCCCAGCAAGCrCACTGATGGTACCTCATCTACTCAAACTGC^ 
TTCTGTTGCTGCCCTTCTCAGAAGACCCC1TCTTTCAAGCA 
TGTATACACTAGGTCTACCGTCTATCTTCCTGGCAA

ACTCTTAATCATGGCAGATTTTATTTCTCTAGAAAACrrGAGTTAG 
TATGATATATTCTCACACTTGAATGAACTCATCATTCATGATGTCTAA 
CGATATTTTCCTTTAGCATGCAAAGATATGCAGAGTAGATATAGCCTTTATGTC^ 
CTAAAACAGTGAAGAGGAAGTAAGATCTAAGGCTGTTTAGATAAAATAATATTGA 
1 0 AAATGTTTCCCAACTAGCTGACACATCCAGCATTGTTTGGCTTGGAAAC^ 

TACACATACAGAAAAGCCAAGTTAGCCAGGCTATGCACACTCTAAAAGAACAGCACTG 

CAG CTGGAGTTATCACAGAGAAAGGAGCTTCAGTAGAGGAACTGCCTCCATGA CATTTT AGTG AT CAAGGG 
GGAAAGGCCCCITGTGGGTGGGACCATCTCTGGGCTTCTTAGTT^ 

AGTAATGAACACCCCTCCATGGCCTCTGC^TCAGCTCCTGCTTCCTGACCTGCTTGAGTTCCAG 
1 5 CAGCAGT ATGGAAGTGTAAG CCGAATAAACCCTTTCCTCCCCAACTTCCTT CAGG AAT AGAAAC CCT 

GACTAAGACAGGCACCAACTCCACAGCACCCTGGAACCTCAGCGTGTTGACTCTGTAGAG 
CAGTAGGAAGTCTTACTAGCTATGCAATCTCAGGGCIAAAA^ 

CAGTTGGACTG CTTGAACACACTTT C^CATTTG CATTTG AACCAG AG GAAAG CTTT AC CATC C CT CT AAGG CTCCAACT AGG CG T 
G C AATCTTTG CCATT CCCATGC ATCTGAGGAGTTGGT CTTTG ACATGG CTGT AAACATGT CAAACAAT ACAT ATT CACTCAG AATT 
2 0 TCATCTGTTCTTG ACACACT CAAG ACTT ACG TG TGGCTCTT CAT AC CT AG CAATG CAC C C CTTG CCTTCCATT ACAG CAG AACCCC 
AGTGTTTCTCAGTCX3GCATCAACCTTTCTCTTCATCTAAGCCA 

TGCTATCTGCCCAAGGCAAATGGGCCACGCTCTGGAGATTCTACTTCCTCCCAGGGCT 
GCATCTTTTTGTGGCATGACCCCAGAGAGGATACITGGAGC^GCAA 

ATGTAG CACTTTCCTAAGAAATTCTC-GGAAAATCAGTTTT^ CAGGTCATGAT AACTTCAAGGAACCAGCGTCAG 

2 5 GATGAACGGTAGAAGGGGGGCTCCTCTCCCTCTITCrTTCAGC^ 

GGGATGGCCATTCCTAAGGCTGCAGTCTGGACAAAAAAAAAAAAAAGATTACAATGGCCCTG 

ATCCCCTGGGGCTGGATGCCTGATGCCTGACTGGACTCTGCATAAAAACAAA 

ATCTGCCAGGCTCATTGGGTGGATAGGACTGTGTCAAAGACAAGA^ 

AAG CAG CGTCAT CT C CT CAACAAGG T CAG ATCATTT ATGACAAT AGG CATG ACAC CT ACAG AG CATTTTTTTCCT ACACTTG T CAT 

3 0 ttatgtgctttactggagactctttgagagatggtatgtg 

cttacatctaactatacatagctagatgaaaagatgggtggatga^ 

nnnnnnnnnnnnnnnnmwnnnnnnnnl^^ 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnn1w 

gggtggatggctggctgggtc^atggctgggtggatggctcxx;tgaatggctgg^ 

3 5 atgggtggttggatgggtgtatggatgggtggatggatggatggatgactggatgggtggattgatgggtgtc 

ggatggatgaatagttgaacatttaaatcactaacatgc^tcaatctacctgttttctaa 
tctttatatattcactcctttccattcccacccaac^gtctacaatgaaagtat^ 
ggttgttacctctctgactggttggtaccttttctgccc^gaaggttgcttctgaagaagaacaaga 
aagcaagtaattatcttggtagatatagttgcccagatgtttgaatgcagaagtaagtctga 

4 0 aaagtgtgactcagaaaagtagagacaggttgtgctcctaaggg 

ttatgaggtgggagagatgtgtaggctttctgtgttcctc^cccagtctgtgaacctacaat 
ctcctagtgctttgtattacaagtctgttgagccagaggaaaatttacacagagaatctcac^ 
tctgaataagatggcctggtccitctcggggcccctgaagaa 
aatggcaaataatgggttcctggatgttttaaatgcactcagacaca^ 

4 5 aactgagcagccagtgggcatgagagctgttgaaatccttcactctagct^ 

catggaggttggttctcagccactgctttactcattgccagagacacagaggtcc^ 

tgaatgtcaggttitccttagcgagctggttttgttgcattaatgatw 

tttttaittcgttacattagaaaagtagtcattacccactttagaataatattccctc 

c atgaatctgtttcgt atccactgt cttg agcac ccgtg caaag acatttgtg ag acatt aggatcag g catgtct acttc ctg tat 

5 0 ctaattattcctgcacttcacctctc^aagtcattgaaattttaatca 

atgctatgatctggttaagcagccctttaggatctggtgtttagattgt^ 
ggtggacrctctttatrctaatgttttgtgcacctaggttcactcaggttaaagttct 

ccgt att cttgag acttaacat atg ag aaaagtt aaac c aatgt acact caaaccacaaat cc aaatg t attt at acactcag t aa 
taccattctgaaac ctccag c^ttg aaaaaaaaaaaaaaaaaaaaaaaaaaag agt aaaaat 

5 5 tgaaaattaaaatt ccatt aaaataaacattgtcatactatctatt^ 

aatattgtggtgttaatagcctttttaaagtcccaaatccttgctaacttacctct 
acaggttatttcctcagaaccatttactgaatacccatag^ 

aaag aatt aat caatgag cttct agtct agc ctg aag ag ct act ag agtg ggg aattg cataaag at aagtcaaaatcattatcag 
g act accctgtacaacag ctttgtg agacctaaagtgtctg ctggatgggagatggg acag ag aatcaggaggtatctggtctttc 

6 0 cattggatgaccatgaacctatctttgctgcttctaaaaatactcatatt^ 

tcattttgggcaaagacttattttaacagtcta 

aaatcaatagagctttgtggtgaattaaatacaaaaggaggggctgactatacaact^ 
ttagggaagtgtgagggaaaataaggaggggggggggcgcaaatattgccactgt^ 

ATCTAACGTGTGGTGGCCAGAGAACCCTTCAGCTTTGTCAAACTACAAAATCCTTG 

6 5 TGAGTGTAGTTTCCAGATGGGTGCCTGGTGAGAACGAAGGCCACCTACTGGAAGCTGTGACACTTCAATACT 

TTCATGTGTAAGCTCTACCATGGACrCAGAACAGTGGTGCT 
C^TGACCTGCTTCTCTTCrCATTTTAATCTGATGGATACCCAGGTTCCCATGTC 

AGTGGGGAGTGTAATG CCATCTTTTAGACC ATAAAGGG CATGCAGAGGTTTTTTCCCAAACACCCAGGTATGTGTGAGCCCTGCCA 
CTTAAGGACACCTAAGAAAAGCATCTCAGGCTATTATAGTTTCAAATAAATTG 

CTGGTG CCTGCCAGGCT ATGGACTGAAATCCTTAIGGGTGATTCC CAAGT AG AGTCTTTGTGGAAAACAGGAAAATTGAACT AACC 
TTTGAGTATTGACATCCCAAACTATTATCCCTTTCATAGTTTCAGAGAGTAGTCGTTA 
GAAACAGGCAGTCATTTTATTCTTATAAAGCAAAGAGTAA 
GTGTAGATGTTTACCTAGGCCTGTTCTACTTCATTGACCTGGGAAAACT 

TAAATAGCAGTGTAATGGAGGACCTAAGAAATACAGAGATACAG
CGACATTAAGGATGCTCTGITTTTTITITAATTGATAG^^
ATTTTGATCACATTCCCTCTCCCATATCCTCCCAACCAATCTATCTAT^ 
AC CT ATCT AT CCAT CT ATC CATCT ATCTCT ATCTCT AT CTCT ATCT 
ATCCAGTTTTCCTTGGCCAACAACTCTTTGACAGGGGACCTGTTT^ 
CAGCTCAACTCTCTGGAACACAAATGAGATCCAGAAAGCAAACC^

TTTCATCTTAGCTTTCCAACTTTG 

GCAAAAACAGCTATTAACTATTAAATTAAAATTTTTAGGGAG^ 
AGCTAACAAGAAAATAGCTAGGCACTGAGGATCCTTGTGTCC^ 
1 0 ATAGCATCCTGTGACCATG CCACT CACTAT AACTGATATGTT CACAAATG AT ATCAGTGTT CTGATT AAG AAGTCAT AGAAGT AAA 

T AAT ATTT ACATTTGT AT CCAATT CATT AAAAT ATTTTTT ATCTGAGAACATG CTTTACAT AGCT AG ATACAT AG CTACCCTGTG A 
TGTGTTTTCATTGAAAAATACACATTTTTrAAAAATCATTT^ 

AATTCCACACATTC CTT CAATTTTCAATAATT AT ATTTCTT ATG TAATTTGG CTATTAG CCAG ATG C C CTGG AGCCAAATCTCATT 
GTATGTTTTAGTACATCTGTTGGCGCCAAATCTCATTGTATGTTTTAGTAC 
1 5 GG T ATTGGAGGAATGCACAAG ACCTTAGT CACTTGAAG TGCAT C TT AAT AGG AAG AAAG AATTCCCTG ATT CAAGGGT ACATG TTA 

TTGCTCCTCTCCTTACCCCCGTGGATCnXiAGACTGTGAGAAGGGGTGAAAATATT 

GTCCGAGTTTG TAACTT AGTGGT AG ATTGCT CACTT CAAAATCCTG ACG CT CT AGGTTC AAACCCTGGCACT ACAAAGAAG GG AGA 
AAATGGTAAAGCAACCAGTC^TTCAGCAAGTGTTACTC 
ACAAAATGACTTTGATAATAGTCTTCAC^GTGACACTGAATCAGCAC^ 
2 0 GATTGAATACTGTAGTTATCACCCAAAGTAAGCAGTAATAGCCAGCCATT^ 

CCAAG CATTT CCGATACTCAGTCTCTAGCT ACAAAG CTT ACT AACTGGATTG TTACTT CTCAG CC CT AT AAAGTTCAGT AGG TTCT 
GAGTGGAATTCTAGAGCTTCCCTTCTGTTCCATGATCT 

GACAAGGATG TGGAGG GG CAG AGTG T ATTT ATGGAAAGAGCTTT AAAGG T CAAGACT AG AC CAAAAAAAAAAAAAAAAATGT AG CC 
TTTCTCCCCATTCTCCAATGCCACAGCTCAATCATCTGACCCCATGTAA 

2 5 GAGAAAGGAAGAATAGACCCAAACTGTGGCAC^TTCTGGATCCTTTTCCTTCT 

GGTAAAGAAAAGTACAACnTAGAAAGAAGAGATAATTTTGACATATTCTTGG 

GATGTAGGTACAGCTCATCCTCAGTGCTCTAAAGATATTCCO^TCAGACACCTGGCTAT'ri TCCCATTGATGCTCAAAGGAGGCCA 

TGGGTTACTCAAGGCCACACAGTCACTGTGAAAAGGGAAGAACTGGACCTCCCT 

CAGATTTTCTCAAGTCTTTGGTTTCCTTTCAGTGCATT^ 

3 0 GGGAGTCAGTAAACCGATAATTAATCTTTTTTATTTT 

ACACACAAAAATAC^TAAGCTTGTTACCAACAGTAATTTCTGTGTGCTCA ll' CTGTTTAA 

TTAAACCGCATGCACACATACACATACAGCTCTGTAAAACAG 

ACAGTGGCCrCTTGGTAGTGACAGCTTGCCATAGAAACAAACT 

AAAAACTCTTTTTATAAGAGG CAG CAGGCTGACTCTGGTGCTG CTGAG ATT AT CTGGGCTC CTCTGGG CT CTGGAG AACTTTTTG C 

3 5 CTTCTTGTTTTGCCACCCAATTGAGGATTTATTATCTTATACTAAATAGCTCCATAACTTAG 

GTGACTGTTTCTGGATGGATCGATCCAGCO^AAGTATCACCCAG 

TGCAGTGTGTTCCCTTTCCAACATAATGACTCAGTATGCCTTATAATTATACATGCTTCCCCATGACTCGG 
GGGGACAATAATCCACTTAGCCTGTCCC^CCACCACCTGACCTTCATTCATCGAAGT 
GCCTCTCTTTCTCTTTrTCrCrGTTTACCTCTAAAT 
40 AATGTCCCTTCCTGGG CCCTACATCCAG AG ACATATGACATT AGG AGATAGTGTAT ACAGAAAAGTCTCT 

G AATGTC T ACTACT AT AAAAGAAG AACTACTATTGG C CT C ACACAGT ACC CAGCATTAG AG C CACCTTTGGGAAG TTGGTTTG AG C 
ATAGAGACCCTTATCAAAACAAGGGAAAACAGCCTTGGAGAA^ 
TCATACCAGGTACATTTTGTATGTCTAAGATGGGGCTCCTT^ 
ATGCGATAAATATAOU^TTCTGGTGTTCAACTCCTAAGAT 

4 5 TGTCTGAGCTGCAGCACCCTTATCTGTGAGATGAGAATAATAAGGG 

GGAAGCCATATAAAGCACAAAGATTACAAGTTCAAAGCTTGCCTTGGACACAGGGT 
GGACTATCTCAAATATAGTTt^AAAGTGGAGTGGAGTTATACCT 

ACCTTACTACCAGGGGGAGGAGGAAGAGAAAAAGAAGAAGGGGAAGGTAGGGGAAGAGGTATAGGGAGAGAGAGGAGGAGCAGAAA 
AAAG AAG AGGAAGAG ACAG AAAAGGAGG AAGGT AGGGTTGGGTT CAGACT C C CACTCTTGTAT ATT ATGT ATACAT AAC CT A 

5 0 TACACATCTTCCTGTACACTTTAAATCATTCCAAGATTATTTATGATGCCTAATATAAGC 

ACCCTATTTAGAGAATGACAAAGAGTCATATTTC^TTACAATGCAAATGCTT^ 

CAGATGCAGAAGCTCTGGATGTGGAGCGCCCAACCATCTTCCGAA 

GGGAGTGTGAATCGTGGTTTGAAAGCCACTGCAG^CTGTGGAG 

5 5 GCCCTGGACAC^AGAATTAACATTTACCCAATTCCTATTCTTTCCT^ 

GCAGTAAAAAGAGGCAGCAGGGACTCTAAGTTGAGTGAGTTAGTGTAAT^ 

TCTACATGTTAAAATCACCCTAGTGCAGAAGTGAATACAGC^CTGAACTGTAATT^ 

TGGGACACTACTCTGATGTTGGACCAACCTGTGCCAGCTCCCATTTGGACCGCACTGTCTCTGCT 

AATGCAGCTGTCTGCATACACAGCACATG<3CCGTACACAGCACAG CTTCCTCTCAGGAACCTCTTCAG T ' l ' l ' lT GAAACAGGAAG AG 

6 0 AATGGGGTATGCAAACTCAGAAGTGAATGTTTTAAAGTGTCTCCATAGCTATTGCTAGTAG 

CTAAACCACTGAGCAGTTATCCAAATCXSCAGCCAACTTAGGATGGTGCACAGGGAGGACAGATC^ 

T AAATTACCACCCTGTGGTGT AAG CATCTTTGCG CT AG ATTCATTTT AAT AAAAG AAATG CAGATC C CTT ATTC CT AGC TGGAG TT 

ATTGCTCTGAAAGCCAACGTACAACAATCTTAATCTTCAGCATC^TGTGAAACAG 

GAAGATTATGTCTTrTGTTTTCAAAATT AAATG CTAG 

6 5 TGTGAGGTTCTTACCCAGTTATTTCACCCTCTGGCCATTTAG 

GGATGTACCTAGCAGTGCATAACACTTAATAGATAAATTTATTT^ 
GACAGAAATGTTAGAAATGGATTTAGGTTAGTGGAAGAATGTATGGCTC^CAGTAC^ 
GGTGAGCTTTATGTGTG CAGGG CTTTCTTATGAG<K5TCTTGAGTGCTACCAGAGG<3AGAGTCT 
TGTTCTCTGCCACATTGCCCTACAACGTTAAATGTACAGGTAGCAAGCGCTAACT 

7 0 GTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTTTGGTT^ 

TAGTGTTAGGGAGGGTTGGTGATAAATTGGTGGGGAGGAAAAGGGGGCATCT^ 
CTCAAACAGCITTTAAAGTTGTCCAAAAACCAAAAACAACCCT 

AATATAAATGAAATAAAACAAAAATAAATTATGTGAAACTACATACAACAACCAGAATG 
AJlCTG^ATACfiC^CATTCATTAAATGCTTATGATGCGATTCT 
AGGAGGAGXjCACTGGTTGGATTTCTGGGCTACTCTTTGTGA
AACTTATTGCAAAATACCTAAGATTTATTCTTCTT

T AAAAAGATC CTT AC^CATT CTGC CGAGCTT ACTGGGAACA^ CAAAT AG AGAG AC CACAGTG A 
CAAAAGAAAAAAAAAAGCAGCATGGAGAAGAAAACTGGAAGTATCT 

5 CT ACATCAGAAAAG CT AACATCAGCCAGTGAT CTGGTGAG AGTTCAGATCTGTC T CT CCT 

ATTATTTGGGATGCTTTCCACAATGTGGATOSGAGG^ 

GACAGCAATAAAGAGGGGCTTCTGGAAGGAGTGGGTCTAAGTAGGACCACTC 
AGTTTTCTTGGCAT 


MOUSE SEQUENCE - mRNA 

ATGTCATCTGCACTTGCTTACATGCTGCTTGTCCT 

CAAATGTCGTrCCCCTGAOVAGGAAA(^TTCACCTGCTGGTGG 

CATAC^CAAAGAAGGAGAGAAAAACACCTATGAATC 
1 5 TACACTTCCATATGGAAAATATACATCATCACAGTA^ 

TGTGACTTACATTGTTGAACCAGAGCCTCCTCGGAACCrGACTTTAGAAGTG^ 

TAAAATGGTTGCCACCTACCATAACTGATGTAAAAACTGGT^ 

GATGAGTGG<1AGATCCACTTCACAGGTCATCAAACACAATTTAAGGT^ 

TCGCTGOUVGCCAGACC^TGGATACTGGAGTAGATGGGGCCAGGAGAAATCr^ 
2 0 CMCTGTGTGGATCATTGTGGCCGTTCTCTCTGCTGTCATCTGTTTGATTATGGTCT 

ATGACCTGCATCTTTCCAC CAGTT CCG GGGC CAAAAAT AAAAGG ATTTG AT ACT CAT CTG CT AG AG AAGC^a CAAGTCTGAAG AACT 

GCTGAGTGCCTTGGGGTGCCAAGACTTTCCCCCCACTTCTGACT^ 

AGGACGAGCGGCTAATGCCATCCCATTCCAAAGAGTATCCGGGTCAAGGTC 

GGTCATGGAAGCTATGACAGCCATTCTCTTTTGTCT 

2 5 GATCACTGAGAAGCCAGAGAATCCTGAGGCAAATATTCCTCCCACCCCAAATCCCCAAAATAACACC 

CATCCAAATCTACAACATGGCCTTTACCACCTGGCCAACACAC 
GCTGGAAGTCCTGGAGATACACTGGACTCTTTCTTGGACAAA^ 
AGAAGTGGCTGTGCAAGAAGGGGCCAAAAGCTTCCCTTCTGACAAAC^ 

TTGTCTATGCTAAACCCCCAGATTACGTGGAGATTCACAAAGTCAACAAAGACG^AGTGCTAT 

3 0 AACCACCAGACAGAAAACCCTC^GTTCCTGAAACCAGTAAGGAGTATCCCAAGGTATCT 

GTTAGTGCCAGACTCACGAGCCCAGAACACAGCGTTGTTCGAGGAATCAGCCAAGAAGGTTCC^^ 

CTGAGAAAG ATCTGG CCAG CTTTACTGCAACCTCAAGCAACTG CAGACTCCAACTGGGCAGGCTGGATT ACCTGGATCCT ACGTGC 
TTCATGCACTCCTTTCACTGAGAGC 

MOUSE SEQUENCE - CODING

ATGTCATCTGCACirGCTTACATGCTGCTTGTCCrTCAGCAT 

CAAATCTCX5TTCCCCTGACAAGGAAACATTCACCTGCTGGTGGAATCCTGGGTCAG 

CATACAGCAAAGAAGGAGAGAAAAACACCTATGAATGTCCAGACTAGAAAACCAGTGGCCC 

TACACTTCCATATGGAAAATATACATCATCACAGTAAATGCCACGAACGAA^ 

4 0 TGTGACTTAC^TTGTTGAACCAGAGCCTCCrCGGAACCTGACTTTAGAAGTG 

TAAAATGGTTGCCACCTACCATAACTGATGTAAAAACTGGTTGGTTTACAA 
GATGAGTGGGAGATCCACTTCACAGGTCATCAAACACAATTTAAG^ 

TC G CTGCAAG C CAG AC CATGG AT ACTGGAG T AGATGGGG CCA.GG AG AAATCT ATTGAAAT ACCAAATG ACTTC AC CTTG AAAGACA 
CAACTGTGTGGATCATTGTGGCCGTTCTCrrCTGCTGTCATCTGTTT 

4 5 ATGACCTGCATCTTTCCACCAGTTCCGGGGCCAAAAATAAAAGGATTTGATACT 

GCTGAGTGCCTTGGGGTGCCAAGACTTTCCCCCCACTTCTGACTGTGAGGACT^ 
AGGACGAGCGGCTAATGCCATCCCATTCCAAAGAGTATCCGGGTCAAGGTG 
G<3TCATGGAAGCTATGACAGCCATTCTCTTTTGTCTGAAAAGTGTGAGGAGCCCCAGGCCTA 
GATCACTGAGAAGCCAGAGAATCCTGAGGCAAATATTCCTCCCACCCCAAATCCCCAAAATAAC^ 

5 0 CATCCAAATCTACAAC ATGG CC TTT AC CACCTGGCC AACACACG CG CAG AT CTC CTT ACCACAGCATTGC CGATGTGTG CAAG CT A 

GCTGGAAGTCCTGGAGATACACTGGACTCTTTCTTGGACAAAGC^GAGGAAAATGTTCT 
AGAAGTGGCTGTGCAAGAAGGGGCCAAAAGCTTCCCTTCTGACAAACAAAACAC^ 
TTGTCTATGCTAAACCCCCAGATTACGTGGAGATTCACAAAGTCAACAAAGACGGAGTGCTATC^ 
AACCACCAGACAGAAAACCCTGC<5GTTCCTGAAACCAGTAAGGAGTATGCCAAGGTA 

5 5 GTTAGTGCCAGACTC!ACGAGCCCAGAACACAGCGTTGTTCG 

CTGAGAAAGATCTGGCCAGCTTTACTGCAACCTCAAGCAACTGCAGACT 
TTCATGCACTCCTTTCACTGA 

HUMAN SEQUENCE - GENOMIC 

6 0 CITCCCQMKSGACACATATTGAATAGGCCTGCAATGGGGCCTTG^ 

AGCTATGTTTGAGAATTATGGCTCTAAAGCAAAGCTCTCCAAGAAAGAAGTACATACCTAGGTA 

AAGTATGAAAATAATATAGTAGTTATTTGGGGTATTTTTTCTAAGAAAGAAACT 

CCTTCACTGAGTTCTCAAATTAGCTCAACACAAGTTCTG^ 

CACC-GGGGTTGCCATGGTACCTCCACTGCTTGATTGCCATTTATGTGCTACAGGTTT 

6 5 AGACCTTTAGAATATGCAACTTGACAAAATAGGGAGTGAGTATGAAAATGTCT 

ACGAAGTAGTAAAAAAGTTGTTGGATTTAAAAATGAGTTGTCCATTTCC^ 
CAAATG^IATGATGCTGAGTGAAAGAATGCTACATACTGTACAATTCTACTTA 
GAGAACAGATTAATGGTTGCCAGGGGACAGC^TAGGGACAGGGTTTGGCTACGC^ 
GCGGGGCGGTGGGGTGTGCGGGAGTAATGAAACCATTCTACATCTACATTGTGATGGTTA 

7 0 CCTAGAGATATACACTAATGAGTATGAATTTTATAGCAAGTTACAC^ 

CAAAAAGAAAATTTCTACCACCTTTCATGCAGTATATTTTTCA^ 

TAGAACC AGATCrGCATATCACTGTGTCTCAAAGTGTT AAG CCAAC^ TT T AAAAAAAGCAT ACTTAATCACATT ATT 

CTCATTAAAATTTTGATAATTAATAATTTCAGAACTAAA 
ATTTTGACTTTTATTTTAGATTCGGGAGTACGTGTGCAGGTTTGTT 
TGATCACCTCATCC^TGAGCGCAOT^
TTGTCTATTGTTCCTATCTTTACGTCCATGCATACTCAAAGTTTAGCTC

GTTCCTGCATTAATTTGCTTAGGATAATGGCCT 

GTAGTATTCCATGGTCAAATTTTTGTTTCATCTTATTTCATTCTTT^ 

TAT AT ATAATTT ATGACT AAAAATAC ATATG TATTTGGAAAGGGATGCT AAAAATG ATTTT ACTG CT AAGATGATAAACAG AAAAC 
TTTGGAGGTCACTTCTCTAGAAAATAACTCTTCAAAATTTAA 

GATGCAG CAGGAAGT CTCT CACCAGAT CAAATGCTGG CAC CTTGATCTTGGACTTCC CAGT CTCCAGG ACTGTGAGAAATAAATTT 

ATGTTCTTTATAAATTAAAGAAAAAATACAAAGTCTGATTCACAAAG 

ATGATTAATCTAATAAGTTGAAATCATTTGTTTTGGTTTAAA 

AT AGACTAGGCCTAGGGTTAAAATGATAATTAGT ATGATGCCAGTG CACCTGGCCCTAAGACTATACTTTG AATAG CAGGACTCAG 



TGGGTTTGGGAGACATGAATTCTTATCAGTCAATAGGCC^GCTCr 
CTCATAAAAGCTGATTTTACriTAGGTCACTCCrTATCCACCAGCGTGG^ 

AGTGC CTTAT ATTT ACACATGGTTT AT AATT AACCAAATGT CAACTCAT AT CAG C CT CACAATG ACCTTTGGAGGT AGAACAAAG C 
AGATGTATTATAAACCCATTTtSCAAGTGCTACATGACAGAAGCAAGCACAGTC 
1 5 TTTAAGAAGAAAGTACCATGG AGTCTGAAG ATCAAGAATGGG TAAGGTTTTT AG ATGAAGGGT GGG AAAT AACATTC CAAGAAG AG 

AGGATGGATTCAAACACACTGTAGATATGTATTGAGCTACTACTACTCT 
ACTITATTCTGTGGGAAGTAGATGCCATTAAAAATTTTTAATTCAG^ 

TGATGTAGGCAAGACTGGTGAAGATGAGTCCAGCCAGGGGGGCTATTGAAGAGGAAGAAGTAAAA 
GGCAATAACAGTGGAAGGGACAAAAATATTTAAGGACTTCATTCCTC 
2 0 TGCACATTGTGCAGGTTAGTTACATATGTATACATGTGCCATGCT 

ATCTCCCGATGCTATCCCTCCCCCCTCCCCCCACCCCACAACAGTCCCCAGAGTGTGATATTCCCCTTCCT^ 
CATTGTTCAGrrCCCACCTATGAGTGAGAATATGCGGTGTTTGGTTTTTTGTTCTTGCGA^ 
ATTTCATCCATGTCCCTACAAAGGACATGAACTCATC AT1TTT1A TGGCT 
TTAATCCAGAAGGACTTCATTCCTAArTGGATGTAGAGTGTAACAGAGAGAACA 

2 5 AGCTTTTATATCTCAAGGCCCATAAACAAGTGCATGCCCTTTGACCTAG 

TTCCCAAAGGCAACT ri^TTTTTCTACACTTCCCATAAGT ATGTTT ATT ACAG CATTATTT AT AAG AGT AAAAATTTGGGAACAATC 
T AAATG TTCAT AAT AT AGAATG ATT ACATAAATTATTGTG CAAAAAG ATT AT AATG AT A^ 
TTTTAAAAAGATATAAATAGATATATTTCCTGGATGGGAAGAATTAATGCCCT 
ATGTAATGTAATGCCAAAAAAAGATTTTGAATTGAACAAATTGACCCTAAAATTTTATGA 
30 G ACAATTT CAAAAGGAAAGTGACAGTGTGCTTTC CACTAACAGG T ATTGT ACTGT^ TTACAGT AAACAAAAGT CT AT AT . 
TTTTGGCATATAAATAGGCATCTCAAAGAAACAGAGTAGTGTCCCGAATTCATATAAATATATATC^ 
TGACACTAGAAATCAGTGGAGAAATTATGAATTTTTCTATAAGGGGTAAAATTAA 
CCTTATGTAATAGTTAAAGATCTATTAAAAAAAAAACAATTTAACAGAGTTAATTGCAAAGAAGCAATA 
GTAACAGAAAATAATTGACAGATTTGAGTACATGAAAATTAAAAACT 

3 5 GCCTGAC AGGC ACAAATGTTTG CT AT ACATTG AACAAAGTG TT AG TATGTTT AAT AAT AAAAG AG ATG C C AAAAATT AATTTT AAA 

AAAT ATGG AGGCCAGGCACGGTGG CTCACG CCTOT AATCCCAGCACTTTGGGAGGCTGAGGTTGGTGGATCATGAGGTCAGGAG AT 

TG AG AC C ATC CTGG CT AACACGGTG AAACC CCGT CTCT ACT AAAAAT ACAAAAAATT AG C CGGG TGTGG TGG CAGG CG C CTGT AGT 

CCCAGCTACTCAGGAGGCTGAGGCAGGAGAATAGCGTGAACCTGGGAGGCAGAGCTTGCAGTGAGCCGAGAT^ 

C CAGCCTGGG CAACAGTG CAAGATTCCATCTG AGAAAAAAAAT AT AT AT AT AT ACGGG C CAGG TGTGG TGG CTCACAACTG TAATC 

4 0 CCAGCACTTTGGGAGGCCGAGGTGGGCGAATCAGGAGGTCAGGAGTTCAAGACCAG 

TAAAAACAGAAAGAAATGCTCGTCGTGGTGGTGGGCACCTGTAATCCCAGCTGCT 
CTGG AAAGTGGAGATTGCAGTGAACCGAGATCATGCCACTGCACTCCAGCCC^ 
ACAGGCATAAGATTGAACAGCCACAACATAAGAGAAAAAGTCAAAATATAATAAGCACATC 
CACTGTGAATTAGAACAACAATGAGCTATCATATTTTGCTTATCAGACTAGCAAGATAA 

4 5 ATGTGAGTAAACATCCITGGTAAGAGTGGACACTGAAACATTTGTGGATAAW 

TACCCTGTGGCTCAGTAATCTCATTTCTAGGGCTTTC^^ 
TTCATTCTCTTTGAGCCAACCAAGTCATTTATTTTTTATTTTA 
CAATCTGTTGCCCATGCTGGAGTGCCATGGCACTATCTCGGCTCCCTGCAACCT 
CAGCCTCCCTAGCAGCTGGGACTACAGGCGTGCACCACCACTCCTGGCTA^ 

5 0 GTTGGGCAAGCTGGTGTGGAACTCCTGACCTCAGGTGATCTGCCTG 

GCCAACCTGCCTGCCAATCAC^CCATTCTAAGTATCTATCCTAAAGTCCATGCAAA 
AAGCTGGAATTACTAAAGACCTAACAATGGAAGAATGGTAGAATATATTACAAACTATGTAACC^ 
CTATACATAGTTACCTGGAATATGTGATTTCATTAAGTTTAAAAA 
ATTTGCATGTGCATACAGAGACATGACTATATACACATGTAAAAAAG^ 

5 5 ACAGGATTAGATGAAGGAAGGAGGATGTGTGATTTGGAAGTACTATATTTTACTGTATAGATAA^ 

GTGCTAGGATTTGGGGTAGTTTTTTAATCTTTGGATATTTTTAGC^ 
CACATATGGACATGCCATTCTTCTGAGGATGCTTAAGT 

TCAGCAGGAAGTAGCCAGAAAGAGTCCTTGCCCAAAACCCCCTAACAGCAGTTAGT 
GAGTTATCAAGAAATTATTTTAGGCAGATAGAGAGTAAAAGGGGTCCTTGGGAA^ 

6 0 TTCTTGTAAAACCCCGGCITTTAGAGCCAGGCGGGCAACCriTTGATATGCAAA 

TGCCGATTCCAGGGACCTTClTGCCCTrGCCCCACATGTTCCTGGCAAC^ 
CATGGCGCCCCGCATTTGCATATTAAAAGGCTAGAGTGGGAGGGCCAGCTCTTTCATGGG 
CCAATCCCCTGAGCCCTAGGCAAATCAAACACCGCCTCCTCCAGCCTCT 
CTTACTCTTTTGGCTTTGGAGCGCCCCCCTCCCTCTGTCTCTGTACAGGGAAGCTTC 

6 5 acccttcttgcctattaacctctccactccttaaaaccaaaaaaaaaaaaaa 

aagtagcaatgataaaactccttccctatttx5agaaggagga 
gacagaggggaggctcccgccagaggatggcagaggcctggaagccaaggcgtga 
cagtgctgcagagaggcctggagggccctcccagagagcagccagtgagtggtagtgatgaaagccaggtgcaaaga^ 
agccattttccaccctgcctgtgcatttttgtaaaaacagagaagtca^ 

7 0 agtgac^cccagacacagtacattttaaacactcaggtgattcttcttca 

gagggaagtaagtaaggaaagctggtgtgcgtaagtcattctttcactaagtttggcaggtaa^ 
aagccatcaaaagggtcaaagaagatgtagaagttgtaggttgagataaagaa 
aaggtagaggaaaggggaaggaaagaaaggaactrgatgtttctagactt^ 
ttattattcatttgactcgatttttga 
7 5 agtgccagtgctgggctcttcctctgcatcaggctgagaagcaaggtcttggagga^ 




TAGGGAGTCAGCTTGCAAAGGACGAACACGTCCGCT 
TG AAG CTGG AGCACATCAGTGCATAGAGAGAAGGAGATAGTCAAAGA^^ 
GCGGATTTCATCATCTCAGGAAAGCTGGGGCAAGGTAACTCACTGACAATGACT 
CACAACAGAGCAGAAAGCCAGCCCIGCCCTTCTTACGAGTGAGGTA 
5 CAGACGAGCCAAG CTG CCTG CAGCTGAGTGGG AAC CACAAGG AGG AAAAG CCAAGT CCCTG C CTTCT AAGAG CTG ACAG T CTGG AG 

TTACCCAGAGTGATCTTCTATTTCACTAACTGTTAAAATAG 

CAAAGGTCATATAGAAATATTCTGCCACTGAGGATCTCATGTCGCCCAGTGGTTCTGAAAATAAATAGAA^ 

GACACCCATGAATGAAAGT AAATATACCCTAATCATCACAGGAAGCATGTX3 CGGTCTG CACGTGAAAGATGCATCTCCCATCAGCT 
AACAGAACCAGG TGGAGT CT ATGTTTT AATAAGG ACGTGTT ATTG ATCGGG AAGT ATT AGGTTATT ATG CCT AGG TAGTTG CTTCT 
1 0 GGGATTCCAGTCCAAAAGGATCACCAAAGCATCATGGGATTT^ 

ATTTCCCACTTATCCAGAGTAGCAGTCAAATCAAAGGGGAAAGCCTC^ 
CCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTCAGGAGGCCAAGGT 

TGG CT AACAGGGTGAAACCTCATCTCT ACT AAAAAT ACAAAAAACT AG CCGGAC^ CG CCTGT AGTCCCACCT ACTC 

GGGAGGCTGAAGCAGGACAATTGCTTGAATCCAC^AGGCAGAG^ 
1 5 GACAGAGCGAGACTCTGTCTCAAAAAAACAAACAAAAACAAAC^ 

TG G C C CTCTGGGTTGTGT AACAATCGG ATGCATT AATT ATT ATGG CATCAAATGACACCAG CAACATACAT AGTCAG AAAT AACT A 
TTTGATTCAGG CAG GTG CT ATT ATTTG TTTCAGCACTGTT AAATG TGGGGTTTTTGT^ 
GTTTCAAGAAGAAGAAAAGTCAGAACAACATGCAGCCCATTAGAAAAG^ 

T ATTT AGGCAC CG C CAATG TTTATT AT ATTCAT ATGG CA.GAGAAT AAACAC AGCACA.CT AGAAG TTTGCG CTTTGGGT ATT AAAAA 
2 0 AAAAAAAAAGCAAAAC CTT CTT AGG TCACTGACT AT CTT CTTTTG CT C CTTT CT AT CTG ACT CCT CAGG CAAGG TG CAATT CAAAT 

CCCTGGAGAACCTCTCAAATATAGCTGCTGTTCAATGCTACTCA 
GGCTGCCTGGAAAAATATCATCTTTGAGGTCACTCAACTCACTAAGT 
GTCTCAATGAAAGATATGTGGCCCTAATCATGCAAAACCGATCTGGGGGGAATCAGTTC 
CTCCCCCAACTACCTCCGTCGCCTTCTAAAACTC^CAGGCTCTGGACGTT^ 

2 5 AAGGAACATAAATGTTGCAACACTGACTCCTCCTCTCATGA^ 

C(^C03GGCTGGGGACCGGGGAGCGGAGGCTGAAATCCCCAGACGCCGGTTTT 

TC C CTCTTT CTGGATTTT ACCG ACCGTT CG CGAAACAGCTTTC CACACAATGGAGCTT CATGTCCTCGTG CAGG AAGT ACTCATCG 

ACTGATGTGGCAGACTTTGCTCCCTGGCAAAAGTGAGTACCCATX3GCGAGCTGAGGCCGCCTCCGACGGCGC 

GCGGGCGGCCX3CTCTCCGGGATGCC<KjTTCGAGGCCCGCGTGCTGAAGG 

3 0 CGCCAGCCAGGAGCC^VGAGTC^GCAGCCGGCGGGGAAGCCCC^ATCCGCCCGCCTCCCGCCCGGATC 

AAAAAATACTCTTAAGGTGCTTTGCCTCCACCACTGCCCCCAACCCCAG 
CAAAGGCATCTTCACCTGTATCCTTGAGCATGAGTTTCGGCGTTGAT^ 

AAAAGAAAAGAAAAAAAGAGCGGGGAGGGGGAAGGGGCGC7VGACAGGTGATGTGATGAAAGCCGGACTCGA 
AGCAGCCGCCGCCGCAGCAGCCTCGCGGGCATCCGGCGGGGAAAGTTTAGGGTTAGCAATGCTGACCTGAAAGG 

3 5 CAAAAGTC^CC^GGGTAGGGGCAGATGAGCAGCCTCCGGGCCAGA^ 

GC C CGGG AGG AAG CTTCG AAGGAGAGC CCG CTGG AGAAGGG AGGAGGGAAGGG AAG AGGAGTTGG T^ CAGGCAAACT CAGGT 

AACTGGAGGGC CGGCCCG AGTGCT ACGGCAGGGG AAGGCAAGGGCTGGAGTTCTGGCTCTTTGAGT AGAAAACT CCTTTTAGTG 
ACCTCTACCTGTATCCTGGGGAAGCAACCCACACTGCCCTGAAAACAAGCAACGTAGGGGATTCTATGACTGGACTTGAA 
TGCAAAACACTATTATTAGTCTGGAGGGATCACTTTGTCTTCATTTG 

4 0 TTCACTTGACCAGGGAAGTGCTGCTAATCCCAATCCTGACCATCAGCTGTGCAAACTTGG^ 

CGATTTTTTTTCTGCAAAGTTTAGGGAATTAATTAATC 

TGAGGTGGAAGTGCTGAATAGTGCAAGAGAGGAGCAGATGAAAATATTAATTTCAGTGGTTCT 

TTTTGAGAACTAGCTGTGAGTCCTCTTCCTGGGGGAAAATGCATATGGC^TGTACATA 

GTGGATACTCCCTGAAAGCCATGCCTGAATCCCAGGTGACAATCCCTTATTCCT 

4 5 TGCTCCTGGTTGGGCTGACTGTGGGAGTAATCCACAGCCITCT 

CTGTCATGGGTGAGCTCCCCACATGCTCTCTCCAGAGCTCTACC^ 

TG ACTCTCCT AAGG CG ACAGTCCTT ACCCCT AG AATTTTCAGT C CCACCAAC C CTGG AG AG CCATCTAAAG AGGAT CG CTGTCTG A 
ATCAGAGTGCTCTTCAGGTCACCTAGCTGGATGG CAGGGAGGCCAAGG CTTTGTTTGGAAATGAAG CAGTCTTTGCTCCCTG AGGC 
ATAGTATGAGTAGGCAGCCATATTCTTTTGAACCTCIGTACTTTGC 

5 0 TGTTCCACTAGCAGTGCCAGCAAGGCACATCAGAAAACACAC^ 

ACACTGAGCAAGCCTCTGAGTTTTCCTGAGCTTCACTTTTCCTGTCAGCACAATGAAAATA^ 
GCATGGTTGTTGGGAGAATGGAAATACTATGAATCATGCCTAGCCCATAGTCAATGATAAACTTT^ 

CAT AAAT AAGTG TT CAAT AGT CT CATG CTTTT AAG AACAG CTT ACT CT CT CTCTC CT ATGGGGCTT ACATT AG T AG AAGAATG AGT 
AAAATTAGTTCTGGTCTAGTTATGCACAAAAAGGAGTGGTGGCCTCAGAATACAC^GTCAGACTGGCT 

5 5 CAAGGAGAGCACTATGTATTGCCGGCAAGTAAAGTATGTCCAGACCGGCGTGGATACTTGACCTGTCTCT 

(^TAATCTTGAGTACAAAACAGCTAGTGCTCTGCCTGCAGCT 

TTCAGAAACTGTAGAGAGAAATATCTCCATTGTCATCTTTCACAGTTTGGCACA 

ATGGGGTCCTGGTGCCTCCATTGCAAGTGGTCAGATGATGTAGCAATGAAAGA^ 

ATCCACAGAGTAGCCAAAGATCTATCAGTATACTTTGGTGAGGTACTCCTGCATGCAAAGATAGCAATAT^ 

6 0 CTGAGCCTTTAAGATGTGCTAAGCACTTTCACACATTGCCTCAATTAATTCTCAT^ 

AAGCTTTACCAGTAAAGAAACTGAAACATC^AAATGTACATACACCCCC^ 
CAGAACCAGGATCCAAGCCCAGGTCTGTCTGACTTTGGC^CCCACTCTTT^ 

GG CACTTG CATCAG CCAACCGG ATT AAGATTCTT CT AAAAGG CAGGAATAG C CAAATCT AGTGGGTGCTTTT AAGTGGTG TG AACA 
TGTG CTTGGCTTGTGATAC ATATTAACGGGTAG CAACAACAAGATCACCTATGGG AGG ATAAT ATTCCTGACCCTTCCCAAGCCCA 

6 5 GCCATTAGACTCAAGATTGGCAATATCCAACATTAGCAGGTGAGGCTGGCGCT 

AAACAG AGCTCT GG AATGTT CATT CT CT ACCACCTTT AG AATCAGAACACCACTT ACCTTT AC CTC CCTGTGCTGAG CACGTTTT C 
ATTCATTTATTCGACAAGCATTTGCTGAGCTCTGTTTTGTGCTGGGC^ 

AG CCCTTGTGG AGCTTCTATGCCAGAG AG AGTG ACAG AAAACAG T CGGGT GAACAAAT AC AT AAAT AAAAT CATT T CAGATACTG A 
TAAATGCTATGAAGAAAATAGAGGCTGAAGGCTACAGAGGGACAGGGCGTGG 

7 0 CCATACAAAGGATACCCTTGAACTTAAACCCGCATTTTCAAAAAGGGAGCCAGC 

CAGAACTCTC^GAGAAAGTTCTTAAATTGGAGTAAATGAAGCAGTAAAGCTTAAAGTCTGAAGATGG^ 
TGTATATAGGGGGAATTATTATTTGAATGCCTACTATACTATTCTAAGCT 
TGAGTTAGCTATTTTTGTTTTTATTTTACAGTTGAAGAAACCAAGAG 
GTCCAAGGTTATGAAGCTGGTATTTGAATCCCTTAGTCTATTTC^CTCCAAAGCCCA 
TTGAGATAATGTCTTTTGGAGAAAGTAAGGAAGCCGATAGAAG
TTTGTATTTGATGTCCCAAATTCCAGTTAGTGCACCTACAACTGAACT

GCACTTTG AAATGTGGCTATTTCAT ATTGAGATG T ACTGTAAATGT ACAACGC^CC C CAGATTTTT AAG AC CT AGT ATTCC CC CAAA 
AAAGTAAACTGTCTTGGCCAGGTGTGGTGGCTCATGCCTGTA^ 

C^GGAATTCGAGACCAGCCTGGGGAACACGGTGAAACCCCGTCTCTATTAAAAATGCAAAAA^ 
CTGCAATCCCAGCTACTTGGGAGGCTGAGGCAGGAAAGTCTC^

TTGCACTCCAGCCTGGGTGAAAGAGTGAGACTCCGTCTCAAACAAAAA 

TGTGTGTTGAAATGATAATATTTTAGGTATACTAGTTTAAATAAAATATA 

TAATAGAAATGTTTAAATTACATATGTGACTCACATGATATTTC 

TCCACTATACCATGTCACTTTGGGATCTGATAAAATAGACTCAGTC 

AT CTCAAGAGGCTTTCAAACGTCCTACAATGAAAGGGTCATTTTAA CTAGT A 

CATTGTCTTTGTAGTTTTGGGCACTTTGTAAAATCAGACGAGGAGAC^ 
AGTAATCTCATGCAOTAATAAATGTTTCTCCCrGTCCTCTTG 
1 5 CTACTCCAGGGGCATCACAAAAGACCACCTTTGGTTTC'riTCTrri'GTC 
AAAGATATATTTTTTTTCCTGCAGTTTCATTTATTTCAA 
AGCAAAGACCTGGATTCTGCTTTTGTGTACTCTACTGGTTTGCA 
ACTTCCTGTAGTAAGTTTGGTGTTGATGAGATAATGCCACAAAAAATGCTAAT 

TG ATTGAT AAC CTGAATACAAATGGC ATCTCTAGAC ATGTG CCT G CAGGG AGGAGTTG CCTTT ATGT AG CTGG AG CAAAACATT AT 
2 0 GTAAGATTCTCI^GGAATAACATTTGTTCATTTCCACTTGAAA 

AATCTTAGTGTATCTGTCACTGGTAGACTGAGTTCTTTCTCTAGTCT 
CAAGCTGGTTTTATTCACTGGCCAGAAATGGAAAATGAGAGACCTCTT 
GCAGGATTGTGATGAGTTAGATGTGGGGCGGGGAGCTATGTAAACACATGCAGGGAGAAAG 
ATGTTTCTCCATGCGTCCTATGGAGAGAAAATGGCGGCAGTTTTCTTCT 

2 5 GACCTAAACGGAACAAGGGGTAGCCAGTTCTGGTTTGCACTGGTTTCCTGC^ 

CTGCGACATCCCAGGCCATCTGGAGTTCTTTAAGCAAGAGTATCTATAGATA 

ATG AGCTTTCCCAGCrATTrCTCTAGGGCTGCCCTGGTAATATTT CTACATCAACAGCAACACATATTTTTCAGCATTGTGTAAG C 
CAGG AAATCT ATG CT AG ACT CCACGGGGAAAAT AGAAG AAT AAAACATGGTTCTTG CT CTG G AGG AG CAT AGTT CCT AGTTGGGAA 
GGTGGACAGAAACATAAAAGACCAGTAGCATGTAAAGAACATTTTCT 

3 0 TGGAGGGCAGC^TGCCCATAACCTCTAAGGTATATTAAGAAAGACAAAGTAGAAAAT 

CAAGGAGAGAAGGAGAGAGCCACCTCACATAAGAAAAAATGCAO 
GATGTGGAAAAGAACAGTTGGCrTAATCCAGTGGCTCCAGATCTGAGCT^ 
AAATGCCTCAGCCCC^CrCCAGAGCTTCCATTTAATTAGTTTAAGACTG 
TTGTTTTTGTTTCTGTCTCAATTTCCCAGAATATTCTAATGTGAAGTCAGAGT^ 

3 5 ATCATAGTC^TGGGACATAAAATGAAAGAAGTAATCTTTTAAAGGAGATTAAATGGACAACC^ 

AAACTG T AT AT ACA.TG CAC!ATTTGT ATGT AGT CTTTT AAATT CAC AG AT AATT C CT CG TGGT G ATTCTG T CTACTTGGATG ATGTG 
GCTGAAAGCCTTTGTAACCAGCTGCCTACTCTTGTCCTCTGTGTACTTT 

CAATGGG CATCTCACAGGAGCTTGTTTCTACCAGGCTTTTTC CCCAC 
AGTAGCTATTTTATAAGCATCTGCAATCITCCCTCACCTCTCCAAC^ 

4 0 CCAGGAGACTAGCTGGTGATGAAGGTGAAATGAGAATTCTCAGGGAGGTGAAATTAGAATTTCCAGCTACTGAATTCC^ 

TATT AAACT CC C AAAGTGAGTCT ACC CACTCTC CTGTTGG TGTT AG AATG TT ATTT CTT ACATGAATG AATCAGTGT CACAG ATCA 
GTAGGTGAAGATGACTGGTTTCCCTAAGCTGCCAACACTGTGATTATCTGAGTGTCAATGACCCATAGGCAG^ 
AGAGCCCCGTCCCTGATGCCTGGGAGCTCTCAGTCTAGTGAAGTTTGGCCTTC 
AAGTGAAGACTCTGGTTGCTATTCCACTGCTCCTAGCCCCACACTTTACC^CTGTTCTGATTTCC 
45 CTGAAAGGATATCAGCITTTrGCTACACCCCrGA^ 

GGATGGAGTCTTGCTCTGTCTCCAGGCTGGAGTG^GTGGCGCGATCTCGGCTCACCGC^CCTCTGC 

TCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGC^CGCGCCACCAAGCCC^ 

GTTTCACCATGTTGGCCAGGATGGTCTCGATCTCTTGACCTCCTGATCTGCCCGCCT 

ATG AG C CATTG CACCCGG CCAACAG CT ACATTTT AAAG AAT CCTTTAT ATGT GTTGGGAG CTTCTGGGTT CACTG CC C ACG GTG AG 

5 0 ACTAGGGGTGGGGACGGAGGTTCTTCTGAAGGACAGTGCGGGGAAAAAGCCAGAGT 

ATAGGTCAATTTGTACCATTCCCAAATGGTTGGGTTTT CTTTT CTCTGTCT 

GCTTTAG CAATAG CAAAAAGCAAGCCTCTTAAACAGTGTTG CTCCACACTG ATTGCACXITG AGAATCACCT AGGG AGCTTTAGAAA 
CAACT AT AG ACTCACTG AAT CAAT ATTCTGGG AAGGGG CCTGG ACATT CGTATTT ATTT ACAG TTCT C CAGG TTT AT AATGTG CAT 
CCAGGGCTAAGAGCTACTACTC^TCAGCTATGATCCAAATCTTAATTAC 

5 5 CATGGCCCAGAAAACCATCAAACTATATTTTTGTTGAGTAGCAATTT^ 

TTT AAGTTTGCT ACTT CT ATAGAG AGACCTCACIACTCAGTAGCCTCATCACTAAAATTTTCTGTGTG C AATTTAGAAATCACATTG 
CCAGAAGCTTGCTAATTAATTATGAGACTGAAAGATAAAGGTTAAGCT 
TTAATCTTTAGTTAACCAAGACTTGCTTTCTTGTCTCAGCCATTTTAA^ 
AATATAATTACAGCAGTAGTCTCCAGGGACTGTCATTGG^TAC^CT 

6 0 ACACCCTCTC^ACCCTGGAAATCATTTTGGTTGCAGAAGCTACIT 

GAAAAAGCCAATTCCAAGTCATCCTAAATGTGGAAGAATCTGTGATTTTGAGAATGATTCTGGAATCAGTTA 
AAGTCTGATACACCGATCCTCAGAAC^TTGTATTAATAACAAAAAGAAACCAGGTTTT^ 
GGACAGAAGGTGATGTAGCATTTTGTTTTCTGGGGTGCTGAGATTGTGGGATTAGGAGAAAATAAG 
ACGATGGACAGAGGCCTCCTGTTAGTCAACTATCATGGGCATCAGAATTTCrT^ 

6 5 TTTGTTCC^^GGACCCIATTATAGGAGCTGGAGATAGAAGGATGAAGT^ 

AGATATCCACACTAAAAAOUVACAATAAATAAAACCC^ 

GAGTTTGCTGAAGGCTGCTTTCAATAGCTTGTACAAGTTGCACTGTCTGAGGAGGTGCCC 
AGAAAAGCATGCCGAGAGCTCXSC^GAAAGAAGGCTTCAGGCAGA 

TGTT C1AGG AAACAGAAAGG AG ACCIATTGTGG CTGG ACCCGAGG T AAG AAG CAATGAGGG AAAGGTGGG AAAAGAG CAAGGAGGGC C 

7 0 CG AGG TCATGGG ATGG AGTT AG^CTGTT AAGTG CTTTGAG AG CCTAACATC CT AACT CATTGGGAGG TTTTCAGCAGGGT AGG AAC 

ATGGTTGGCATTGCCTTTGAGAGCTCTTCCCAGCTTCTGTGTGG^ 
GATCGGAAGTACAAGCAAGAGGGGAGGGTGGATCATGCCAAGGTGACAGCAA^ 

TTTGGAGGT ATGG CCAGCATG GCTTTCAG ATGGAGGGGAC CAGCG AAAGTG CT GGAAAAC CT AGTCCTG CTCAAGATG ATTGTTC C 
TG CC CTACC CTTGTTTT AGACCCCTTGTTTTG AACT AAAACAGAAT AATAT 
7 5 TAACTTCTTTAAGCAGAACAAAACAAATCCCTGCCTGGGGAG 
TCCTGTCTCGTAAfiACTATTCTGATGGCTTTTGTTGTTTTTGATTACTGTGCTTTTCX*
ACTAXTITTATCTCTTTTCTGAGTGTTTTGACACTTTGGCT 

CTACAGTTGATCAGTTTCTCCATATAAGTGCTAATATAATCGCAAGCATTTAGTG 
GAATTTCATGTTTTAACTCATTTAGTCCITCCAGCCACCCTTTATA 
5 AAACTGAGGCAAAAAAAGAGGTTAAGTCATATGCCCAAGATCATACAGCTGGTAAGTAG 
CTGTACAGCTCTGGGTTCCTAGTAACTATGTTGTGCCACTTTTCTATGT 
CCTGCAAAAACCCCAAGGTACCACTAGGAGACACTAAAACT 

T AGAGG CTGAATGAT CCAACC CC CCT CCAGG AAGGCCTCACATCT AAAC CACCTG C CATCTGCCAAGGTGGGAGGAT AAACCAC CC 

CCCACCCCGCCGCCAACAGGGCACTGGCTGGTGAAAGTTGCCAATACCAGAGGA 
1 0 CTCTTTCTGCCTACTATTCATAGTATCTGTGTTGTACCCACCCACCTG 

AAGCCATGAAGCAAATTCTAATTGCTCCCCACGGCCATCTGAGGCTGGCCTTTC 

GTCATATTAATTTATTTGTTACCACATTATTTGCTTTAAAATTCiGGTGG 

GCCTAGGAAACACAATATTTCATTCTGTCTTCTCTGCTTTTCACT 

TTTCTCAATAACTAGCGTGAAAGTAACAGACCTTCCTTCCTTCCTAATGATA^ 
1 5 ATAAGGAAATGCATTAAGATTTTAACTTAAGTAAGTGA^ 

TATATTATATGTTTAAACTAGTTATATATTTTAATTATATAAGATGCAA1TT1-1UTGCT 

GAGATTAACATGCGTTCATTG<3AAGTGTTGAATTTA 

TTATTTTAAAATATTATATGATATGTGTCAGGTATTTCTTACGGTTGACTTCCAAATT^ 

ACAT AT AGTTGCAGT AAACT AT ACT C AATTTTGGAAG ATTG CAGTGG CAACATCAGTGAATT AAAGTTTGAAAGCT ATTTCAAAT C 
2 0 TTTGCATATTCTCCAAATAACTCAGAATCCTACAAAAA^ 
TCTTTATTGGGATGATTTTGGTTAAATATTTAGCAGAA^ 
AAGTCATTTAAAGATCAGAGCAATTAATGGCAGTTATGTGATCACAGGAAT 
GTTTTATTTTTCTCTTAGCAACTTTTCTTAT^ 

TAAA TT TTTATTTATTTTTATGCATGGTTTAAAATTCATTTAGTTCA 

2 5 TCCCCCTCTCCTGTCCTCCAGCCTrTACAAAAATATACCATATATATTAATATGGGCT 

CAAGATGCTATGACTGTGTGAAGCCGGAGCCTACAGACTATTGCT 

GAGCTTC CTTTCT AAACTCAAGTGTAT ATTT AATGTGG CAG T ATTTTCAG AGAAAAGT AT AGAAG AAC 

AGATGAAATCACACAAGATGCACACATCTGGGTATTGGTGGGCATGTACTT^ 

AAGTATCAGGTTTAAGAGTCCTAGGATCTAAGCCTGCACGGGTTCAGTCTTCT 

3 0 CCCCCAGTGTTGGCATGTACACCCACTTTGAGGAGGCTGGAGGTATAAG^ 

AGAAGTTTCTCAACAGAAAAGGATTGCATGGGTTGGATGTCCAGAAGGATACGTGGAGT^ 

TTGCTAAG ACTTGGGAAGAGG ATGAGGGAAGGCAAAAC CTG TC AC CT AAAGGGGAATTCTAGAATGTACTT ACTACACT CATG AC A 

TTACCCTACCACCITCATTTATCACCTTGTGTCAAACATAGACTTCT^ 

ACTATGACTTATTCTCCTTTGTTTACCCTGCCCCAGTGCCTAACAG 

3 5 TTGAAACTGAATATTTCAAATAATCACAGATAAGAGATGGTTAAAAACCAAGTCCTTCT 

TTGTCCATGTCTCCTAAAGAAATGAAAGTTTTCTAATTACCTATATAGGTCTGG^ 

G TGTTGACCT ATGT C C CAGG TTG TCC ATGG TCATC ACACACT AATG ACCACTTG T ATGT ATTT CAT CT AG AGG AT CACACAG AG TG 
GTATGCTAGGCCCAAGGAAT AAGTTTAC CTGCTTATTTCACTTCTCCACTGTCATCCTACTGTGTTTCAGGAAGAATTTCT TTTAC 
CTCTGCTTACTCTACTAAGATGCCCCACATAAGCCCCCAACTCCTTTTCTTGTCCCCCTCCC^ 

4 0 CAGACCCCTTCCTCCTCCAAACrGAACCTTCCTGAGAATATACCAAAGAACT 

TGAGCACTTCCAGAATTTGCATTTTAATAGTGGTTC^ 

TTGTTGATATCCAGCTAAGTGAAAAGAAAGTCCTCAGTTCCAGAGTTCTTACTTCTATTT 
TTTTTTTGAGACAGAGTCnTTTTCTGTCACCCAGGCTGCAGTGCAGTGGCGCAATCT 

4 5 TCAAGCGATTCTCTTGCCrCAGCCTCCTGAGTAGCTGGAACAGATGTGCGTCACC^ 

TTTTTTAGTAGAGACAGGATTTCACCATGTTGACCAAGCTGGTCTCGAACTCCTGCCTATCC^ 
TGCAATTACAGGC^TGCGCCACCGTGCCCGGCCCTTACTTCTATTTCTAAATAGCAATC 
TTGCCCTAGGGGTGAGGGTCTTTTGTACATCTTGCAACTGAGAACTTTTGCT 
TTTTCCAGGAGTGAAGCrGTGGAGCAAGTGAGTTACCrCTCTCTCT 

5 0 TC CAGGGTTG CTGGGAGAAAAAGACAAG ACAAGGT ATGTACAACACTTGG CAC CAAGG AAGCACACAATT CATGGGAGTAATT ACT 

ATCATCCTCACCAAGAGTTCATGATTTACTGGGGGACACCAACACATGTACAGTC^ 

GTGTTGAGTCTGAGCTCAGTAGAAATTTATGAGCCTCAAAGGC^ 

TGTACTTCACCAACAAACCAGCAACATGCTTCTGGTTGGCAGCCCT 

TAACTGCTT AC CAC ATG ACAG CCAAT AAG ACAAGATATTGGGG CAAGG AAG ATGACT1TTTTT CAGGG AG C CAG CAAT CCT AG AAG 

5 5 ATGGCAGACTCATGTACTAAAGAACCATCrTAAGTTGATAGATTGTTAGGCTC Cr 

GGTTAAGGTCCACAGTTGACTGGTGACCACAGGCATCTGGGCATCAGCAGGGATCTAGGAGGTT 

CAACTATTTTCCAGATTGACCTTGTTTCTATAAATTTTTAAGA^ 

TCTGTTTC7VAAAGAGAGATGGTCATTAGTTCTGAAGCTAAACT 

AATGAGCAAAGGCAGTTAGCCTGTGAGGTTAGAAGCAAGATGGAGTCAGCTATGTGATATT^ 

6 0 TACAAGAGGAATACATTTCTTCTTTTTGCTGTGTTTGCTTCAA 

TCTCTTGAACCTCAGGCTCTATCTTTGTAACAATGGAGAAATCAATGC 

GGTTGAGAAACTTCTCATCTATAGTTGATCCTCAATGCGTTTGCCCCTTTTCCTGTGGC^ 

AGTGCCTC^CACATGGAGGAAAGGCAATGGAATGATGACAAATA^ 

CAGTTTCAGGG CCACTGT CCTTG C CAAAT ACT ATCTC AAG AGCT AGGTTATG AGC CTTTGG AAC CCAG ATCTGAT AAG CATGG AGG 

6 5 AAAACTCCTCAAGGGAGATGAAGCTAGGAGATTAGCAGAATATACTTTCT 

GAGCCAGCTGGGCAGTGTACAATTCCCAGAAATAAGGAAAGGGTAGAAGCACCTGTTTT^ 

AAAGCTCAG AAAT AT ATCAG CACCATTAG CAAGAGGCAC!AACGGGG AAGGCTGGAAATGTGCTGAGG AA^ 

ACGCATAGCTACCTG CTGGAGAAG AGAGTCTGCAGACACACCTCAG CCTGGACAAAGGCCTCC AGGACCGGGAGCTGTGCTCTGCA 
GGAACTGGGCCC CAGG CACTGAGGACTGAAAACCAGCCCAACAAAG CAT ACCTGTTTCC^ 

7 0 TTTGTGACrrC^CTAAACCACTTGGAAGGCCAAACAA 

GACTTGAGTAGCTGTGTGCCACTTGTCTGGGGTGGCTGCACCAAAAGGAACAGTGAG^ 

AGTTTTGGTTCrGTCTATTTACTTCTAATTCCTCAAATATTATTG 

CTGTGTAATAAGAATTTAATAATGAAATTCCCAGAAAAAAAAACTCAGATA^ 

CTTGAGTTG T ATTGTTAAGG TGAGAAT CCATAATGTAAAGGG AAT AATGTAAAGGGAATG CTTTGAAATCAAAGG CAT AATCCAAG 
7 5 TGTGATAACCTAAAAGGTTGTGGATTTCTTTTAAAGTAGAAATTACATGTGG 
CTTATGGATTCTCGCTGTGTTGTCACTAAGACCGATTATAT^
GTATTTCATGGK3GATCTCCTTTTACAAAGTGGTCGTTC 

GATAGGATATGATTATTGTACAATGTGATAGGTAGCTCTGATTATTCAAGCAGCTAGGG 
ACTCTGGAATCTTTCTTCTGTGTAAGCAGGCAGCTGCTGTTGA 
5 TTCATTGGGAGT AGGGGAATGACG CTG AT CACTGTT AATGACAGTTG AAAATG AAATG CTTTAATGAATCTCAT ACTTGGACTTGG 
CTGCTGAGC CAGAGC C C!TGT ACTTGAAGAGCTT AATT ATGATGC CT CAAAATTT AGAT CTTGACC CTGAACTG CTATGACTTCCTC 
CCACCCTCAGTTTTGAGAGAAACTTTAAATTCXSCTAG 
TCCAAACAATCGAATAGGCITCTATATTATTATTTTCATTrAGGCACT 
GTATACAAATTAATTTCATACCGGGGGACTTCACTGGGGGGAAAAAAGGGAAGTCCCT 
1 0 TTCTGAGTAAAGCACTGCTAATCTCTTTGGTGT 

CAACTAATCTGATGACATCATTTTGTACTCAGAGTTTCATTTCCCAA 

AAAGG CATCAAAAACAG CTAAAT ATAACCAAAAACCTGCT AACT CACCAGCTTT CT AAAATTCAGG AAGT ATCTTTAACTTGATTG 

CTGAATCTCCAAAGACATGAIXIATCAGATGACTGGTTAATGGGTATTT^ 

GCCAATT"GTATACCrrACGAAGAGGTATTATGTAGAAGAACITATATTCA 
1 5 TAATGTG TT CT AGACCGGAGTCAGCAATACTTTTCAT AAAGAGC CAG AT AGT AAATATTTCT CTTTTTGGG CCAT AT AG CCTC 

TGTCACAATTCCTCAACTCTGCTCTTGTCACACAAAAGCAGCCATAGA 

AACTAGACTCTCTTAATGCCAAGTATTTGCTGGAATAACACAAATTAATTTATC 

CAATATTACATTAATATTTTATGAATCTGGTATCATATTATACATGAATTTATCT 

AATATTAATACTACATCATGAATCTGTTATCAAGTCATATACAGATTTCTTAAAAAAT^ 
2 0 CATCAATAAG CAAGTATCTTAAGTTAACTATCAGGTTAT AT AGCAGATCCTCTCC CCCTTTTGTAACT 

TTTACATTATTATGGTATTGGGAAGTAAGTCTATTT^^ 

TCTTTGCAGACAGAATAGTTCAAATCTTC^TGATTTC 

ATCACTGTCTCTTCAGGTCTTGTGAGTCCTTGAAGTGC 

CCTAATGATGCCCCATTGTACCTTTTGACTATAGCAAGGAATAAACAAAGTCAGAAGTGTC 

2 5 GCAGGGGATTTAGGGG AG AG AAAAATGGACT ACAGGAAAAG AAAAC^ CA 

TTCCGTAATTCTTTAAACCTTAATTGTAACCGCAGTTAAGCTTTCCCCT 

ATAGAAATTATCCTGCAGTAACTTAGAGCCAATATCTCTAGTTAGGATGAGCGCAGTGGCTCA 

GAGGCTCAGGCAGGTGGATTGCTTCC^CCCAAGTAGTTTG 

TAAAAAATT AGACGAG CATGGTGTCACGCAC CTGTTG TCCT AG CT ACTTGGGGG CACT GAGGGGAG AGAAT CACTTGAAC CTGTAA 

3 0 GGTCAAGGCTGCAGTGAGTCATAATCACGCCACTGCACrCCAGCCTGGATAACAG AGTTAGACCTGTCTCAAAAAAAAAGAGGAAG 

AGATCTCTGGTTAGAGTTTGTCTCTGATGGATACTAACCATC 

TCTTGGATATAATTTAGCATAATGAGCTAAATAACTCTA 

ATCTACCCTTGCACTCAATTCTCTCCTCITCTCTCCTG^ 

GAACTGTGTGTCATTGAAAGTCTTGTACAGAAGTTCACATTAAAATATATCATA^ 

3 5 AAAACAAAGATCTCTGAAACCCTAAGGCACACTGTGCTGGA^ 

GCTTCTAATATATTGAGGAAAGAGATCTCAATCAAATTCAATTGCATTTATAAGTA 

TTCTGTAGGAGAATAACCCTGTCAAAGTGATGACTGATTTAAGATCAAAATGGGAAGGTAAAAT 

CTGAGGTGGGAGGGAGCATGCCAATGTAGGGAAATGAAGAAACCCAGTGTGTATGAGCCAAGCTGAA 

GGAG AATGAGAGAGACC AGTCCCCAAG CTCTCAAGGAGCAAGAGGAAGCCTTTTCGGCATTTGAAG TGGAGGG ATGGCATG ATCTC 

4 0 GTGCGTAGTTTTTAAAAGAACCACTCAGGCTACTCTTTTGAGAATATTGTG^ 

GGTTAAGGCAGTTGATAAGAGTATAAACAATGATGGTAGCTTACCTC^ 
CCATATACCATCATGGCACAGTGAATACGTTTTCCTTCTGGATCGTGGAAATGAATATTC 
CAATGACTTTTAATACTGTGGGATCrCTGGCACATACTTCAGAAGATGAGAAAGGAG 
TAGACATAATTTTCATTAACATCTACTTGGTATTTCATCr 

4 5 ACTGCrCAGTTCrGAACAGTGGGTCTTAAGTACrCTTTGGATTTCTTTGAAG 

CATTTTG CCC CAG AAAAAATGT AGTCATTT CTTTG AT AAGTGAAAT CTAAG AAAGG CTT ATTT CATTTGGTACATT AAAGAGATGG 
AAGAAAGTAACTTTACATAAGAATATAAATGCCAGCTGTCCAGCATATTACAGAATAGA 
AATGGATTCTTATATAGGCTTAAAATTTTTATAATTTT^ 
ATCACCCAGAAGAAAAAAAAAACACTCATACATGGTGAAATATTTTCCT 

5 0 CATATTTTTCAAATTCTTTCTACTTAGTCTTATATGATAG 

CTTGTTTTTTAAGCTTGTATAACATTCCGTCATATGGATGTACTGATTTTG 
TTTCTTTGACTACTAGGTATAATTAATTTTTTCATACACT 
ATTAAGATTTCrrATTTGACTCrCAACATTTTCTTAATTTTTTTATCATTAAT 
ACCCTTTTCATCTTAAAGATAGCAATCTATTTTCCATAATACTTGTTATAAATATT^ 

5 5 GTTTAAAGTATTTTTGACAAATAAATAAGTTTTTGTGGCITGAAGTTTCT 

ATTCAACACATCCTTATTGAGTCTTTTGTGTTCAGCACTAAGCGAGACACAAGAG 
TGCCACTTAGAGATTACAATTTAATGAGGGGATAGACAAACAAAACCAGGTCZA 

GAAATACAGGTGGGAAGAAGTACCTGATCTGGACCTGGGGG AT AAGGG AAGG CTTCTGG AAGGAAACAAC CTTCTAG CTGAGAATG 
AAGGAAAATGGCAACATGAATTTAGAGAGAGGAAACAGTATGTGTGAAAGTGT^ 

6 0 CCCCAGAACATG^AGTGTGTGTGTGTGTGTGTGTCTGTGTGCGCGCGCGTGGGTGTGTGAAGGAGCAGGGC^ 

TGTGAGCAGGAGGCCACTGGGGAGGCTGO^CRGTCCGAGAGAAAGACAATGATGTTGGGCAGAGGATGTGTTGAATAACT 
CTTTTCACTGGACAAGCAATGCTTCAGTGTATATTAAGTTCTTATTCATAGCATACCGTCTAT^ 
ATTGCTATATCAATTCTTGGGCCAACACCACACTGGTTTTGAGTTTAA 
ACTTTTCATTTTGAAAATTGATTTACCTATTCTTGCCAATG 
65 CATACGTGTGTGTATATGCATGTGTGTAAATACATGTATATATGTGTATGTGTGTTTC^ 
TAAGATGTTTTTATTCAAGTTGCTGTCAGTAAATATTTTATATTACATGACT 
CACACAATCTCCCATTGGTAATTTGATTGGGAATC 
GAAGCATTGAATGTTTTCCCACTTACTCAATTCrTTTATACrr 
TTATTGACGGGGTATTCTTTTTTCTTITATAATTTTA^ 

7 0 ACTGGACTACAGGATAAATATTAATCATGTATATTTACTTTTTATCTGAT^ 

GTTAATTATAAAAGTTTATAATCATGGTGTCITTAAATAATGATAATTGTTCrC 

G TCTT ACAGCATTGAT CGT AACTT CT AAT ACT AAAT AACAATGAT AAT AGTG AG CGTCCTG TCTT AAT ATCAATTTCAAGAAG AAC 
ACCTCTTACATAACATTTTTAAAAGTGTGTTATGTGGAAGCTAATAAAATAAACrrTG 
CCAGGCATTCTTAATTCATATTAACATATTAACATCA^ 
7 5 AAATGTATTTGACTATGGAAGTATTTTTCCCTCATTTAGAATTTTACATC^ 
Tri'CITTTTITAGAAAAATAA'JTTTTCA^

ACTTTTA ArrnT TTCAATAATTGAGAAGCTCGTCACC L ' Tl ' Ll 1 1 i 1 ATTTCTACAAGTGGTTACACTTAAAGTTTCCAAGACATT 
TCTATCCATATCTTATACTTAGCAAAGTCAAAGACATGACGATAC^ 
AACCCCATCAAAAGAAATTTAATCCATTTTTACTTCCTTCCAACCTTTCCT 
5 ATGTTACTGTCAGTAAATATTTTATATTATATGACTTCTCTTTTGTGAATTATGra 
CATTCCTGAGCAGTTCTATGGTTATCTGATTTCTGTGTCCATAACTATTG 

TT AATTTT AAT ATCTT AAG ATTCG C CACCTTTT ATC CCATTT AT C CTTTTAGGGTGT AT AATGTAGGTCATCTAOlTTTTGAG AAG 
TGCCTGATTCTGTTT ACT AT AGTGT CAGTTIT AC CC CTT ACTGC CTT CAGTGGACAAATTAGT ACT ATTATTTCATTTCTGG CTC C 
TCTAC^CCCCTTTTAATCTAACCC^CACT 
1 0 TTTCTATTTCAGTTGAAGATGATAAACAGTTTTCTGACATTGAC^ 
ACTGTCTAAGCTACTTATTTTTCATGATT'ACTCGT 

TATGTAATGTGTI'Crr AAACCTGTTCAAT ATGTACTGGG CT CT AAATGTC CTGGCCT ACAGCTGGAGGGGAGAATG ACATGT C CT A 

GTT CAATATCAG CCT CTAACAGG AATG CTACTGATGT AGTTTTCTGCACTG AGAT AGGATCTr CITTI'CITC CATTGCCT ATTCT C 

CTGGAGGCAAAATAGAGAACATAAAAAGTCATCATCTTTGGTATGTTCTTCTT^ 
1 5 ATAGTCCTTACTTGCTAGATGTGTGTTCCAGGATTCAGTTCACCACTCCCC^ 

GTTGAGAATTTTCATACAGATATAGAAGGGATGCTGTGCTTC 

GGCAAAGAATGAACTGTCTACTTACTTTCTTTGAAGAAGCCA^ 

AATTGTGGTTTACAGTGACATGGGTAATGTTAGTCATAACATGTCTTCCAA^ 

TTGGTATTATCTAGAATTTTCCTTCTCCCTCCrCCCCCACCrCrT^ 
2 0 TCTAGCATCTTCATAAGTCCTTTCCGGTl"ri , AAGA , ri'l"l'TATCTTATAri i rATATATATATTACATATATTTCCTGTGAATCTTG 

GCTGAATTT CTCATTT AATAAAG TCCAAGTTAGT ATG AAT AAAT AACA GTTTC CCGATTTTACACACAT ACAG AAATTCAAACAC 

ATAATGAACTCTCTGGCAGCCTTTAGCATTTGTAAGTGCTCTCTCACCAGGAAAAT^ 

AG AAGGGTG TCAGGAAACTTATT AT ATTTGTGTCACT'lT CTT CAAAGGG TGATTAAG T ATATGT CTCCAAAGATGTGATGTT ATCA 
TCTTTGAGTATCACTTGAATTCAAGTCAATATTTATTGAGACCCCATGTTGT^ 

2 5 ATGCATAAGACATGATCTTTTTCAGTGGGTAA 

CT CTTTGGAATTTGTGAAG ATTCTT CCAGGTTGTTT AAAAAAT CTTTTGATTCTCACAAATGT Al'l'I'IT'l' CT ATTTCTT AG AATT A 
TGTTCTTTATGGCTACCAGGAAATCCTTTGAAACTGGAATTAGATTGCATTAT^ 
TGTCTTTGAAGAATAATGCTAAATAArcCAGACACTAAGGCTGAAT 
CTGCTTATTAGAAATCCGAAAGCCCTGCTCCCTTAGGAAAGCGGCCCATC^ 

3 0 ATCTCAACTTCTCACrrTGAGTATTATATACTCTCAGGCCAGGATACTT^ 

CTTGTGTGTGTTTTTATGTTTTTTTTTTCATTTTAT^ 

GAATACAACGGCGCAACATTCTGATCTTCTTTTGAGAGAAAAGTTTCACACAA 

ATGAAAACATTGTCCCATACAAGAATATTTCTAAGAAACACACAT^ 

ATTAGAGCTGTGTTGTTTTTGTTGTTC^TTTGTTTTTGTTT^ 

3 5 TTTCTCGATCTTGATTTGCAACACCAAGGTTACCAAACAAA 

TTAGTATAGCAATGCTGAGCTTTTGGTAATGGCTCAAAATT 
AACCATATTCAAGAAATAATTCITATTAATACTTTGCTTCTTTCCT 

GCAAAGCAGTCTTCGATGAGCACAGAGACCCATCAGATCTGTATTAACACCTCCAACCACTGGAGTT^ 
ACCCCTCTCCTTCAGGCCACAGATAGGAAGAATGACATTGGGTCCAAAGTTCTG^ 

4 0 ATTTCAAATTCAGGGTAATAATTCCAACrATCITTCCCTGGCCATGATAGAAAGCT 

TCCATCCAGTTGGTCTTTCTAATTACACACTCAACACAATGTTCTG 
TAACCAGATATTTAAAGGTACCTAAAGGAGTTTAGCTATATTTTGCCAAGATCCT 
TCTATTGCCTATAGAAAAGGACAGCTCAGTCTTTCTGAAAGAATTGTTCTCAAACAAGA 
GTGGACTGGTAATTGGTGTTCCrTTCAGAGCCAGTTGGA 

4 5 ACAAGATTTAC CCAG ACACAGCTG ACT ATGAG AAGTCTTCAGGGATTT ACAAC CTGT CT AAGAAC^ CCAATGGGGG ACT CACT 

TTGCAAC1ACCTCCCTATTTGATTGTGAGAGG ACAG ATACTCT ACT 
TAATTGCCACTTGAACGTTTTTCCAGGCTTTCAAAGGAATGTAAGAA 
ATTATATTACTGGCAATAAAGTATATTATTACTTTACTTTCAGATGCCCTCACAT 
GGTC^CCATTCTTAAGAGACTTAACTAAAAGACACTAAT^ 

5 0 CCAGGCTCTATACCAGGCACCAATATGAAAAGX3TGAGTATTCT 

GCAGCCATGGCCAGAAGGGGGAGCAGTTAATGAACAAGTAGGCAGAGACTAGGGAGGGGACT 

AGGGTAAGTTGTACATCCCTGATATTTTGGGGAATTGCAAATAGTCT 

GGCTGAAAATGTAGGTTGGTGT<ZAAATTGTAAAAAGCCTTGAATACTGTGCTGA^ 

GACTCAGGTCTCTTGACTCAAAGGTCATG' lTrr TT CTCTGCTAG ATTG CACTT AT CCTTC CAAATGCTAGTGGACACCACACTG AA 

5 5 TTTGATCAAAGATCTCTCAATGGAAATATGTAAATTTCCTGTAAGAAGCA^ 

GATAAGGAAACTCAAGTATAAAGAACTTGAGTAACTTGTCCAAGGTCACACAGCT 
AGTCACTAGTCCTAACTACAACAATTTAGCCTCTCTTTATCTAGAGGTCrT^ 

GGTGAGT AAAG ATTCCCTT C CAGAAT AGGTTTGAT AAATTG CCAAGTCTG AAACTG AAAAATT AAT^ C CATGAT AACAG AGGTT T 
TGGGGTCAAGGAGTGAAAGGAATTCATCCCATCCTGTCTCTAAATALT'l' 1 1' IT' CTTTACCAATTT ATGGGAATTTG AG ACCAG AT A 

6 0 T CTGGAACCAAGCAG C CTCAC CCTATTAAGGG AAAG AAAAAG AACAG AAG AAAAGAGGAGAGGAGAAG AG AAG AGAAGAG^ 

AAGTATCTGCAAAAATATTAAAGGATTTGACCGAGATCTCTGAGATGCTCCAAAG^ 
GCTGTAAATCT AAG AGAATT ATG ATGTGCCTGGT AAAT ATTGGTTGTTTTGTT^ 
GGCATGTAGGATGAAGTCACAGCAGGACAGACAGGTCCTGCAAGTGTTTCTTTAGATGCAT 
CTTAAGGCTGTCTTCTrTACTTTACACTTACCAAAGGGGTAAAATATC^ 

6 5 ACATCACTTGAATGTTGTGGTTTTGCAAATATACTATGTTTTAAAAGCTATAGCCCT 

TGGAATTTC^CATCAAAGATTTATTGTGAAGGGGGAGGGG 
CAACTCTGCATGCATTATTGCrTCTAATCTTCATAATTT^ 

GGGGG CTT AAGTGACTTG C C CAAAGTT ACACTlG CTTGGT AG AAACAG AATTTAC CACAGAACTTGTGTGTT AAG CCAAAT AATTTT 
CTAGTTCCATACAACTAAAAATTGGAAGGAAAAAGTAGAGCCTTATGAATAG 

7 0 GAAGTACTCTGTAAAGGAAAGATCTGGGTTTGAATTCTAAC 

TTTAAACTGTTGGATATTTATAAAATGGGACTGGTGCTATTTGGCCTGATATCTTACAGGC^ 

GTAGGGTCTCTCTGACCTGGAAAGCCCTATATAAAGACATGGTACCATTC^ 

GACACCTCATACCAATACCCAGAAACCCTAGGCTCTTCCTTCAATCCCATAC^ 

AAAG CCTTG TTCTTGGGG AGT ATATCC^TC%VL^GT ATCTCT ATTT CGTTG C AGTTT C^VCATT AAGT AAAATTAGAGTG C C CTT CAC 
7 5 GAAGAATATATAATTGAAAGCCCTCCCACTGACTGCTGGACAAAG 
AGAGTTTTTGTATTCTGAGAGTGACrAGCJtoCTTTCT
GTAGCCACX^TAGAATTGGTTTGCTTTTGTGTCCAAA 

TGGTGTATAGGACAGGCGTGTTAACTATATTATTTCTAAACCACAAGATAAGTCTG 
AAGAAATGAGGCTCTGCGAGACATGCGAGAAAGAACA 
5 CTCTTCTTTCTTGGCAATCATCAGAGCCACTCAAGGCrTGG^ 

CACCCACACTTTGGAGGGG ACAAGATTGAGAT CAACCCCATTT C CT ACTCTTTT CTTT ATG ACAGTTT ATCAGAG CATTTTTTCCT 
TATATTCACTCAGTCCTGAGTTTCTAITTAGTTGTGATAATAGATACATTT^ 
TCTATGGAGGATAAGGGAGTAACAAAATGTTATGGAAATTGAAATATATAAATTATTAATTTT 
TCACTTTATTTGGGGATATCAAGTAGGTTATGTGGTTACTGTAGTACT 
1 0 GTACAACTTAAGGGGGAAATATATTTTAGATGATGTTTTTTTT^ 

TAAAGAATTTCTCCCACATTTCACTCCCTTTAAGGTAAATTGATAGCTGTGAAA^ 

T AGGAG AGAGTG AGAATTATT AAT ATTCTCAGC CATGTAGCAT CAT AAAGT AGATCACAGCTG TTTCTTT CAGAAAATGT CATTTG 
ATTTTCGATCAGATGTATGGTCCATCTCACAATCTAACCGCTTCCACATCTGGAA 
CGGCCTGGCTCAGGGGGAGAAAAGAACTAGGGCCTGCATCGAATT 
1 5 TCCTT AT AATT AAGAAG AG CT CATAGATTTGGGAACTG CATGAGC CTCTGTAAC CTTCTGG TGAGG GTG AG AAGCTCAGAACTGCA 
TTTCTATGCATTTTTCCCTAGTCTTTCCTGC^ 

GCCGGATCC^GAGAGCCAGAAATCCATCTTCAGCTCCTITGAACTTGGCT 
AGGCTTCACX3GAGAGCAGATCAGCTGGCTGTAGG<SAG<M 
AGAAAGTTGAGGACATGTGGCTGAAACACTGTCAGAGCTGC^ 
2 0 GACACTAAGTGCTTTCTAGTGGTGGAAAAGGAGCAGAGAGAAAAAGAA 
CTTTCAGCAGCTAACATATTTTTCTtrrTTAGAAGATTGOT 
TAAATCTCCGGTTTGCTAAATAGCAGAGCGAAGAAACTAAGGTCAAG 

AGAAGATACTGGAAATG CCAAT AT CCAGT CAATG AAATAAACAG AGG CT ACTGTAGGCATC CC CAGAAGCTTGCCTTTTGTCCAGT 
GCCATGGGAACCACAGAAGGCACTCAGCTTTGCAGCTTCACT 

2 5 AAAGTGCTCCATTTGTTTACTTTGGCAAGTGGAAGATGACITTC 

GTAGGAAAAGGGAGACTTCCTGGGGCAAGGCTTCTCAATCrTCCrcC 
AAAGGATGATGAACTGTGTCAGCAGTGTTAGCAAATGGCTGAAGTAGAACGGA 
CTGAGGGAGGCTGAAAGATATTGTAAAAGAGATTGAAGGGCTGGACTGGCTGGAGGAAT 
CTCCAGGACTGCCCTGCCTTTGTCAGGACAGCCATCAGAGGCGGTTT^ 

3 0 AGTCTTACCTCTCCTCATGCAATTGCATG^ 

GAAAATGCTGAATCTCATCCTCTCAGAGATCTGCTGCAGTCAGCGACTATACTGAGGTC 
GGGAGGCTTCTTTGTGGAGTGGGTCTCAGCCACAGCCTGAGGGTCCTCT 
CTCCACCCAAACTTAGGCCTGTAGGTCCATCAATTA 
GATGGAAGCAGCTGAGCTGAGGAGTGCTGGCACCTGTGCAAATCC^ 

3 5 GCACCAATCGGGGGTCACTCCGAGATCTGAGGAATGAGGGA^ 

GCCAGAAGGAAAGGGGTTGAACCTGTAGTTGATTAAGATTCAGAGAAAGAAAAAACAAAAACAGAAAACG 

CTCCCAATGAG CCTGTGACCT CCCCAAAATCCTATAG CCAAACACACGGGTGGATCTGGCCTCACTTTCCCATTCTCCCACTTGTG 

ACTTITGATTCCACATCATAGCTAAGGTCCTCGGCAAATTGGATTAAACTTTG^ 

AAAAAGTTGGGGACAGGAGAAAGTGGTTTCCAAGAACCTACACCrrCTCTACCCTGAAATGAC^ 

4 0 TCATATCTTGAAATCCTAACCCCCAATATAATGG AGCCTTTGGGAGGCTT CGCTTACAAAAGGGTC CCA 

GAGAGCTCTCTCACCTTTTTCCACCATGTGAGGTTACAACAGGAACT 
TGACCATGCTGCCAACCTGATCrCAGACrrCCAGCCrCCAGAACTGTGAGAAA 

AAG CACTTTATTAT AGGAG CCTGAACTGATTAAGACACTC CTGAG AACG ATTCCTT CGGGG AAGGTCTTG ATT AG CCTTCACCT AG 
CCCAGTTTGCAAACACACTGGAGTGTCCTGCCCATGGCAATGTTCATGTAG 

4 5 AGCAATCTTCCCAAGGAGGCAGTTCCTACTAGCTAACTT^ 

TATGTTTGTTTTCCTCCTAATCCCCTGGCTGATGGAGTCAGGCTTT^ 

CAAAACACACCCTTCTCTATTGATGCGAGATTCTGAGC^ 

TTATTCCCTTTGCTTTTTCTTTGTGTTATTTCCCCT 

CTGGAAAGCCTAGTGTCACCTCCCTGCCTTTTCACAGCCTATAGGATTGAGTCCAAACT^ 

5 0 GG TTT AGTACAGG ACTG CT AAACACAOTITTCAGAATGAACC^ CT CCTGGC CTTT ATCATTG C CAATATTT CT CTTT ATTTT ATT 

TTATTTTATTTTATTTAGGGAGAGCCGATATTTCATAAGAGTTGATGGACTGCrTTGCCTG^ 
TCAAAATCGTTTTCATTTTTTTGCTTAGAGCCTGTTAAAGACTGAGGCCCTCT 

G ACACATGTG AG C AAGTTG TCACCTG ACCCGAAG CT C CAGCCCCTGGAAC CAACTGAAACAAAATGT ATCCTTG CATTTTCCAG CA 
AGGAATTGCCTTGC CT ACAAAAAT ACTCTCT ACTTT AAT ATGCAG ATCAATT ATT AAC AGT AGG CT G T AC CTTT CAC CAGGTTGT A 

5 5 AAAGGATGACTGCGTAGAGCCACAGCATTCTCCTTGTCAGTGTAGGAAATGCAGAGGA 

GACACCTAATAAAGATGAGGAAGGCTCCAGCCTGAAGCGTGGGTGAC^TGTCCAGCTGCCCAAAG 
TCCCCTTCCTCTTTTTGCTGTCAGTTCTGATTCTCTCCTTTCAATCTTTCT 

TTGGGG CATCAGTGG TGGT AAAAATGGTCTT ATGGG CAGGG CCATG TTG AAT AAGAGATG AG CATGG AG ACAG CG T ATTAACTT AA 
AAGG CCAACACCG AG GCTCATTTG AG AC CGT AGG AATTC CAGGG AG ACAG TGTTGTT CAGG CATG GGAT CGAAG ATC AGG AGGG CG 

6 0 G CTCAAT AACACGCTGG C C CGGT CACATGACAG AC CAGTCCTTGG AG TG AACTGGG AG ACAAGGAGGCT CTTTGGC ATGGAGCTG 

AGAACATCTCTAGATCGCACTGGAAGGCAGAGAGAGGCCAGACAGGGAACTCT 
CAGGGTGAGTTGAGATGATTTTTTTAACAAACTTCATTTTTTGCAGCAGTCTT 
CACTTCCCACATACCCGTTTTCCCCCACAGAGTTTCTTCT^ 
GATGTTGATATATCATTATTAACTAACAAGTCCAGAGTTTACATTAGGGT 

6 5 AAATGTATAATGACCTGTATCTATCATTACAGTATCATACAGAATAGTTTCACTTCCCTAAAAA 

AATCTCTGACAACTGCTGATCTTTrTACTCTCTCTATAOT 

GTCTTTTCTGACTGCTT CTTTCACTTAGCAATATGCTTTTAAGCTTCCTCCATGTTT^ CCGTTT 
TAGCACTC^GTGATACTCCATTGTATGGATGTACTGTAGTCTGTTTATCC^ 

GCAATGG AAAGGGTGG ATG AAGGG AAAACTGCAAGGAACACAGGACAAG ATAGAAG CAAG AG CAGG AAGG TTTTGG AG AAG TGGAA 

7 0 AAACAAG CT AGAGAGATAGGCAC^GAGGCCTAAXICrGGGGT AAGG CAGGGCCTTGTAACCAACGCTCAGTG AGAGAGGCAGCCCCT 

CACTGACCTGCACCAGGCCTTTCCTGATTTTATAGACAG^ 
GAGCCCAGCACTTTGTAGGCATCTAATAAATATTTATTACACCAATGAA^ 
GACACCAGAAGACAAATCGCCTAGCAGCCCCTCAGCCAGTCATCT 
CACGGGGCAGCATCAAGCTTGCCTGACACACACCTCCrGAGCCAGG^ 
7 5 TGCTCCGTGGACAGAACCAGAGCCGTGCTCCAGGGAAGCAGAA 



895 



WO 03/008583 



PCT/US01/51291



AAACX3CATTCTTGCCTCAGAGTAAGAATCCAGCT 

TGACCTCTATGGGACAAGAAGAGCAAGCAGGGTGATTTGGGTGCTAAGAAATAGCTGTC 
CAGATGTTCGGAGAGAACATCTGTGCAACAATAGCAGTAGCT^ 

TGGGTGGTAGGGGGTAGCTCTCAGAGGGTCTGACAATTCACCTAAAGATTGCAGGACC 
AAGGCATGGTGTCCATTTTACTTTCTCTGGATGAAATGAAGACAGAGAATGAACCCACT 

CTGTTCATATCTTACCCGTGTTTGTG CTCACATGAGTTACACAGT CAGT ACAAGGGTAGAGAG CTCTTGAGAATTT CTT AC CT AAG 

GGAAAGAACTTGGGGGCTG CTTTG CT CAAAGGCT ATTTTTCAGTGTGGGG CTT AT CT AGAGACAT ATATTTTG CT C CTTGT CCCCA 

CTGCTGTTGGATAACTACGGGAGTTGAGGGTCACTATACACAGA(^ 
1 0 CTG C CTTG CC ACTT CG AAGT CTCCT AAATGG CATCTCTG AAG AAGAAGG AT ATGT CCACAGGG AG AAGGGC CCCATGG TGACTCTG 

TTGCTCCTAATGTGCCAGGCTCAGACTCTTCAAACCACAAGAAGTGGCCCAGCGAGTTC 

AAACCACACTCAGATCTCAGCACAGCATCAGTCTTGGGGACAGCTGGCCTTTCCT 

CCTGGATGTCTTCAGGGCAGTTCTTCCACACAGAGGACCCCAAATAAAAGCAGAAT^ 

TTTCTATGTGAAAGATACAGCATCTGTCTCTCACAAGAATCTG 
1 5 TTCCCATCTTATTTTCTTGTTCATTTGTGCTACATGGAAGGGA 

TGGAGAGAAAGAGAAGAGAAAGCTCTGGACTGTCTTCCTCTGTCTGGGAATGTGACT 

CTTTTTCTCCCTGACTTrCCAGAACCCTTTTTAGGCTTAGCT 

GGTACTTTTCAGCCATAGACATTGGAGCACATTTAAATATGCTGGAGT 

AGAGCTGAGGATGGCACn^GCTCATTCACAAGGAGCCCTGAAT^ 
2 0 GCTCTGCCCTAATTGTTTGTAATGTTGAACTATGTGTCATTCGTTCACTCATATATTCT 

TGT AACAGAGT AGGT CTTTGGCAAT ACAAAG ATGAATG AAACAT ACC C CT^ CACATGG 

ACCMTGATGGTCATGTCAAGCGTGATGAAGTCCATGGTCCIXSAGGGACACTTGGACCACG^ 

AACTGGGCGGATCTAGAAAGAATTCCTCTGGACTTGATTCCTGAACITGAAGG 

GTTCCTGGCAGAAGAAGCCCACrrcCAAGAGCAAAGGCACTGAGGAAGCCTGGTG 

2 5 ATCGTAAGGGTGCAGCCTAGGGAGAAGTGAGACCCGTGGTGGGGGGCAGAGCCAAATCTCTGTATCCAAA 

ATACAGTGTCTGTGTCTCACAAGAATCTGTTTTCTTCGGGTGTCTGTTAT^ 
CTTTGAGGCCAGGTAACAAGGCTGAGGCTAAGAAGAACTATGGAAGGTCT^ 
CAGAAGGAAAGTTCTGGCTGAAATCTGCAGAGCAGTTTGGAGGGGAGAAGAC^ 
ATAAAATAGGCTAGAAATGATGTTCCTCAGAAGTCCCTTTCTCCCTGCACACCTGGCCACCTCAG 

3 0 CAGGCTGGGGAGAGAGGTTTCTTGAAAATAACACATCTGGGCC 

CGAGGOSGGGGGATCACGAGGTCAGGAGATCGAGACCATCCCGGCTAAAACGGTGAAACCCCGTCTrcrACTAAAA^ 

TAG CCGGGCX3TAGTGGCGGGCGCCTGTAGTCCCAGCT ACTTGGG AGG^TTGAGGCAGGAG AATGGCG CGGAGCT 

TGCAGTGAGCCGAGATTGCGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAA 

AAAAAAAAAAAAAAAGAAAATAACACATCTGTTAATTGGATCATTTCTGTGTGGT 

3 5 ATAGTATAAAATCCCCATTATGTAATATTTCCTCCTGATAAAAG 

GCTCATTACCAGGTTCAAGGACCTGTCTGTGTG<3CTCTTCCCCAGAGCCTCTGTTGGAAAGAAGATT^ 
ATACACAGTGGTCAAGATGCTCTAGCACAGTGCAGCAGAATAAGATGGGCTCAGTAGGGAAAGATCA 
CTTATTCrTTTCTGATTATGATAGGATTATGTTG 
AACTTTTTTTITTTTTCCTGAGGCAAAGTCTCAGTCTGTCACCCA 

4 0 GCAACCTCCATCTCCCCAGTTCTAGCGATTCTCCTGCCTCAGACTTCCGAGTAGCTAGGATTACAGGCACCT 

CTAATTTTTGTATTTTTAGTAGAGACAGGGTTTTGCCACGTTGGCCAGGCT 
ATCAGCCTTCCGAAGTGCTGGGATTACAGGTGTGAGCCACCATGCCCAGCCAGAATTA 

TTCTGGAGCy^AACTGCCTGCTAAGTGTTCGCTGTGGGGCATTGGGGAAGAACTTAATCTL i n GTGCCTTGGTTTCCACATGTAAA 
ATGGGAT AATT ATATTGTCCCCCTGTACTGGTTTGCTGGGG CTG CCATAACAAAGT ACCACAACTGGGTGGCTTAAACAACAGAAA 
45 TGTGTTGTCTCATAGTGCTGGAAG CCGACTCACCTCAG AAG CATGG ACTCCCTT ACCT CCAAGTCCAAGGCAATGGGGCTCTCTCC 

CTCITGCCTCTTGAATTCCTGCATCTCATTTCACTGGGTAAACAGGGAGGCTCCTGTGCrC 
GCAGGATGTCTACTCTGATCCTGCTCCCCCAAAGCTGTC 
TTTCACTATTATTTCAAGCKSCrTGGGGTCATGGAAATC^TGAACT 
ACGAACCTAAGGCTGAAGAGGTCAGGGG<nTGCACCCAGTC^TAGAGCTAAAT^ 

5 0 ACTCCCACCTGGGACACATTTTATCAGTAATAGTGAACTCTCATCTGTTGATAGTGTTAGAAT^ 

ACTTTCTTTTTTTTTTTTTAAACGGAGTTTCTCTCT 

GCCTC CCGGGTTCAAGCTATTCTTCAGCCTCASCCTCCCTAGT AGCHX^ CATGCACCATGG CGCCTGGTTAATTTTT 

GTATTTTTAGTAAGGATGGGGTTTTGCCATGTTGGCCAGACTGGTCrrCAAACT 

CCAAAGTGCTGGGATCACAGGTGTAAGCCACCATGCCTGACCAGCACCCT^ 

5 5 TGTCGGTTAGCAGGGTTGGATCTCTTCTGGTACTGTGAGGAAGAATCTGTGCCATGGCTGTCTGCTAGCTTCT 

TCATCCTTAGTGTTACITGGCrTG CAGAAGCACCATCGATCT CTGCTTTCATCGTCGTGTGGTGCTG CCCCTGCATG CATGTATCT 
GTGT C CAAATTT CCCCTTTTT AT AAGAG CACCAATC AT ATTG AAAT AGGG CTCAC CCT CAT AATCTCATCTCAACT AATT AT AT CT 
GCAATAACCCTACITCCAAAAAACACCACGTTCTGAGGTCCTGG^ 
ATTCAACCCAAACATTACCTAATAAAGTTTTCAACAGTAAGTAAATGAGT^ 

6 0 TCAGAAAGCTCTCAGTAAACGTTACATTATGCTGGGTATAAATAAAAAGCTCC 

TTGTATTAGTGTGGCCTCGCCAGTAATTTACATCTTCTATCrTTTTTGGTTG 

AATAATACTCCT AGCTAAAAGGTGCAAGGAGT AG ATACTGTGGGTT CTG ATTTTTCTTTCCCGTTCTC CAAGCTGTGTACACAGAA 

CCATGCAAGTCKXCCCAACCAGGGAAAAACAACTGCCATTTCTTTC 

AAAGCAAACTTCTCCATGTCTGTCGCTCACTCAGTGGGCT^ 

6 5 AATGTGCAACGTGGGCTGATCCCAGGCTGCCCAAGGAGCAGCTGACGAGTGTAAAA 

TAAAAAATAAATAAGGTATGTTGATCTTTTATGTGATGATTAAACAAATC^ 

AAATATCAGAAGG CAAG CTGCTG AAATTAGCAAG AGCTG AACTCGAAAAG CCCG ACrTrCCAGGCACGGCTAGTTTCTTCCTAG CA 

AGTTCCTGACGGAGCTGGGTGGCTGCCCC^GGTC^CTGCAG 

CTGACA.CAATAGG CCAAAGTTAAATTG AGCTG CCAAGGGAAAATGATTGG^ 

7 0 AATTTGTGTCCACAGAGAGATTTTGATGGGGCTGACTTTGCCGCTGAC 

TCCAAGGCAATGGGGCTCTCTCCCTCTTGCCTCTTGAATTCCTGC^ 

GCAGGAAAACCCTGGACTGGGAAGCAGGATGTCrACTCrGA 

TTTCTGGGCTCAGTTTCCTTGATTTTCACTATTATTTGAAGGGC^ 

CTT CATAATCCATTTTGT AG AT AAGGAAC CT AAG^CTGAAG AGGTCAGG CACC CAGC CAT AG AG CT AAATGAGGGGCAAA 

75 GAACCTAGAJtCTCTGGCGTCl~rGACTCCCACCTG 
TCTCACATTTACTCCCCATAACAACC i"l CTTC1 1'CIT'l 1 TTTTTrrriTAAACGGAGTTTCTCTCTGTCACCCAGGCTGGAGTGCA

GTGGCGTGATCTCGGCTCACCGCAACCTrCTGCCTCC^ 

GGCATGCACCATGGCGCCTGGTTAATTTTTGTATTTTTAGTAAA^ 

GACCTCAAGTGGTCCACGTGCCTCAACCTCCCAAAGTGCTGGGATCACAGGTGTAAGCCACCATGC 
5 GTAGGCACTCTTATACCCATTTTACTGATCAGGAGACCGGCTTGCCCAGGATCCT 

CTGGTTrcGTGCGCTTCCTGGTACACACCAGCTGCCTCCAATCTCCCT^^ 

TGTAGGCTGGCACCAACCCCACCCAGCAAGAGCCAGGATCTG 

AGTAGTATTCATCTTCCTAGTGTAACCACTTATAACTACTTTTCAGTGAATCAAA 

TGCTTAATCAAGGGTGGAGCAAAATGAGTTAGCCAGGGCC^ 
1 0 TATTTACACTTGCTCCCCACCCTCTCCTTTGCCCCATTATCTrTTAGTTAAA 

AG CACCAG CAC CTCT CAGGGTCAT AGGGAAGAG CTGTTTCCTCTGGGT AAACTG AGG CCT CAAT AAGG CAT AAATG CCTT CT CTGG 

GTGAAGAAGATATGCCCTCTAAGATCTCTCCAGGGTGAATCTAGAATAAGTTGTAGTCT 

GACAGTAAGTCTTCATGAACATCAGTCAGCCTCATGCTGAGGGACAACCTCGAGGAGAGGGAATC 

CCCAATAGAAATAGCATGTAAGCTACAAATGTAACTT^ 
1 5 TTTTAATAGATCTTATTTGTTTCAATATATCCCAAATATTGTCCTTTCAATGT 

TT CC CTTTTTCAT ACAAAG T CTTTGAT ACTCAG TG TGT ATTTCACTCTG ACAGTG CACCTCAATGTGGTG CAAT AGGCG AGGGTCT 

ATGTGGTACATCCCGTGCCCAGTGTGC^CTTAAAGGCTGTAGATCCATATATGGCGACTGTCT 

GCTCAGTGGGTGTTTTGAGGTCCATGGTCTCTC^VGGTCCCGT^ 

2 0 TCCCCACAAGCAATCAATGGGCTGGTCTCAGACAAAGA 

ACCTGTAATTCTCAGCAGTTTGTGTGTATAGATAGAATTCCTC 
ATTATTAGCAAATTCACAGTGAACCAGAGCTAGGTCTTGCTGTAAGGCTC 
TGTCCCCACTGGGTTCCTCCTC^GGACCCCAGACAGATAGATACACAAACCCTG^ 
CTAGAACCTCAGGGAACTCATTTGGAAGCITTTTTTT^ 

2 5 GGGGAGAATAAAAGCATACTACCITAGAAGGGACTGGGGTC^TCT 

AAATAACTCTAAATAATGGTAAGCTCTTAGGAGACTTGATCTACTGGCTACATATACCC^CT 
TAGATGGAAGAATATGGAAATTGGTCCTTCTCAGTGTATGGACGAAGCATCAGAAA 
ACTAAAGACTCACTGGTCTGTGGGCTTCCTGCCCAAGGAGAGCTGAAAGCT 
CTAAAGTCCITCCAGCGGTTGAAACTGTGTTTATGCAAAACAAAACTTTCCT^ 

3 0 AGTCATGCCAAAATATTATGATTCTAAACCJUSTAACACCTATATT^ 

CATGTAAAATGCTT'AGAAGTCTACCACCTCATCTTACCTA 
GAGGAGGGGAAGAGGATAAGAAACAGGCAGAGTTGACCTTACAGGTAG^ 

ATG CACATGCCTG ATT CAG T CATT CT AG TTG AGT AGG ATTGT CTGGTTGGTTGTT ATT ATTTTGTT^ CTTT 

3 5 TGTTTTATAAGATGGAGTCTCACTCrGTCGCCTAGGCTGGAGTGCAGTGGCT 

GGGATCAAGCAATTCTCCTG CCTCAGCCTACCG AGTAGCTGGGG CTACAGGGGCATGCCACCACACCTG^CTAATGTTTTGTATTT 
TAGTAGAAATGGGGTTTCACCGTGTTGCCCAGGCTGGTCTCAAACT 

GCTAGG ATTACAGACGTGAG CCACCGTG CCTGGCCAGTTTTTATCTTGAAAG CTTAAGGGTCTCCCACTTAAATTCAAATACTCTT 
CTGAGGATAACCTTTTAATCAGCCAACTATCATGAATCTCAATTATCAGGAATT 

4 0 TCATCCCAGGAAT CTAAAAGAGCAAGGTCTT AT AGGTAAGGG CAAACTTT CTG ATGCCTTGG AATTCAGACCCT AAGTAGAGATGG 

TCATTTGACATCTAGTCTGATCGACTGTAGAGTTACTTCTATACAAATAAAT^ 

AATACTTTGAAAACTTTTAGGCTCTGAATGAATAACTTO 

TTCTCACAGATA(^CCAGTCTATAACCCTGAGGTTTCAACACTCC^ 

GCC CACATGTG CAT CTG CAG CCAACCACAAATGATCT AACAGG AC CAG ATTTC ACTGG AGCTCTGTG ACCCTT CTCCCAGTG TGAA 

4 5 CTGAAGGGAAAAAAACCTGGAGCCTAAGGGACTCCCTTTTCCGTGTCCTGAAGACCA 

TCATTGAG AGAGAG ACTTCAGGACCTCGGATCATGCCACACTCTCT CATACCAC AGG CTG CCAGCTTTGTTGGAG 

AACCTGAGACITAAGGCCAGGATATATCAAGAGCCCrGTACCCAGTTCTGTGATG 
CTCACTGGGTGTCCATTGGGGGGCGACAAGGAAACACrACTGATGAAA 
GAAGAGAAATTCTCRGAGGAACGTGGTGACTGTCTGTTCTTTAATGGACTGTCATATGA 

5 0 TCCAAATGACAAAACTAGGAGCAGTGAAATGAGCTTATAAGGTGGCCGGGTTTTTCT 

CACITGGCAGGGTAGTGCCTACTCCGCATTTGATGTGTTCACACCACGGCT 
ATTCTTTTCCTTGGGTGGGAGAGGGGCTAGAGAACGTCTCAGGTTGTTGCTAACTATGAGCATG 
GGTAAACAAGAAGAGTGTGTTTATGGTGTCTGTATTCCACAAACTATTATTTCCCACATT 
GAAAGAGGAATGCTAAGAAGGTTGCAGTATAATTTAAAAATACT^ 
55 TAAGAGCGTGGTTTGTGGAATCTGAGCCC^CAGGGGTrTAAATCCTGCCTCTGCCCTT 
CCTCACTTATTTAAGACTTCACTTTTCAGCCCTTGAGATCCCACCTGCT 
AAAGGCCTCCTGCTGTTGAGCAAAATGTTTTCCAGTCCTTTGTTT 
AACTGCATATGACAGAATGAGCTTATTTCATGCTCCCAGTAAATACAAGA^ 

TGAAATTGCTTCTTCTC ATTGCCTTCTTCTTTATTG CTTCTTTCT CTGTTGTTCCTTCCrTCTCITGTTTATAACACT AACAGTCA 

6 0 AATCTATCTGTGTGGAAATCTATCTACTATATAGAACGCTACATCACAATTTACAGAGGGGAAAA 

TTCACTGGAAGAGGCACTATTTGGCCACCTAACTCTGGGATTTAGGCTCAGGTAGAATCAGTG 
TTATGAGTTGTTCCTTATTTGACCTAGCACATATATAGATGGTGGTTAAATTTAGCAAAAT^ 
TGGAAGACTCTAAGTACTGATCTGGTCAGAGTACCATCAAAGCCCCTTGTTTGTC^ 
TCATGTAAAAAAAAAAATCATCAACAAATGCAGTGGGAGTTGGTAGTCATGAGTTTG^ 

6 5 TCAATCATGTGTGATAATTGGCCIAACTCTACCTAACAACAGTTGAAGACCTCC^ 

CAGAAGAACCTGGAGAAACATTCACATTTGTGTGGACCAA 
AGAGCCTGTGTTATTGGCACAGGTATTTCTCTCCTAAGAGCCTTTTCTC 

AAAAAGGG ATTTG CTGGG CT AAATT ATTTT AG CCT C CT CTTT ATGC CTT ACTTCACAAAG AATTG CAGTT AATCTG T AAAT GAG TT 
CTGATGTTATGAAAATTGGATATGGCAAATTGTGTTATTAAGTCT^ 

7 0 TCTGAGTGGCAGCTCTCACTTAATAGACATCTACCTAGAAATGTTTCTGA^ 

AAGCACACAGCACCTCTCCCCATAACCTTATTGAAATTAAGGGTTCCAAAT 

TGGCTCACCCITCTTTAGACTAAGAGCTAAACrCTAATGCCTAGTTAAT^ 

AAGTTTTAACCCCTAATGAGATTATCCTTGGTTAAATAAAT^ 

GGTTGCTAGATATTTAACTATACATACCAGGAAGCTATGAGGACAAATTTCCATTTGC^ 
7 5 GACTAAATGTG CTCCTCATT ACTTTTACTCAAG CAAT ATATGTTT AAATACCTACTATGT ACAACTG CTTTC 






AAAAACACITTCAATCTTGAATTGAGAAGGTAGGAGATCC^^ 
GACTGGGTAATTTATTAAAAAAAGAGGTGGATGAAGCTGGAAACCATCATC 
TTCTCACTCATAGGTGCGAATTGAACAATGAGAACACTTG^ 
GGAGAGTGGGGAGGGATAGCATTAGGAGATATACCTAATGTAAATGAC^ 
5 ACACGTGTAACAAACCTGCACGTTGTGCAC^TGTACCCTAGAATTTAAAGTA 

GAGGTTTAATTGGCTCACGGTTCTGCAGGCTATACAGGAAGCATGATACTGGCATCTC 
TTACAATCATGGCAGAAGGCAAAGGGGCAACAGGTATCTCAC^ 

TTTAAACAACTACATCTCATGAGAAGTCACTCACTATCACGAAGAGAGCACCAAGGGAA 
CAGGCCACC^ATCAGTATTGGTCCATGGCCTGTTAGAAACCA^ 
1 0 CATTTGTATTTACAGCTX5CTCCCCATCGCTCATTACTGCCTGAGCTG 

GGAGTGCAAACGCTATTGTGAACTGCACATGTGAGGGATCTAGGTTGCACACT 

TTGTCTCCCGTCACTCCCAAATGGGACCG CCT AGTTG CAGGAAAGCAAGCTCJUaGACTCCCACTGATTTTAC^ 
TATAATTATTTCATTAAATATTACAAGGTAATAATAATAGAAATAAAGTGC^ 
CACCCCCTCTCTGCCCCGTCCCAGTCCATGGAAAAATTGTCTTCCACAAA^ 
1 5 TGCT AAACCACT CAT AAGAAATCCAAACC CATG AT CCAGTCACCTC CCAC CAGGCCCCACCTCCAACACTGGGG ATT AT AATC CAA 
CATGAAATTTGGGTGGGG ACACACAT C CAAACTAT AT CAAT ACCTT CAT AAAAG CAATTAAAT AACTGT ACAGTG TGAT AGGCAAT 
AAATGTCAAATGTCAT ATG T ATGAGGAACATAGAG AGGTG CIXJIT AT C^ CAATGAGGC 
ACAGGTAGTAATGTACCTGCTCACTTGCTTATTGTTGTCCC^^ 

TCTATTACCACAACTGCCTTATTCGCGTGGCCTGTAGCTAGCAATC^TAAAAAATTACAGC 
2 0 TAAATACTGACAGGTGGGCAAAATTTTTAGATA 

GCGTGGCAAATATATCCTTTCATTTGCCCAGCAAACATTTGCAGATCCCTCACTATGTG 
GGATTGGTGTTTTTCTCTCTCAAAACTGGAGAGATTTTGAGATC 
GGAGTAGTTGGGATTCTGTGAGGGATAGGGCTCCITAGGTGTAGAGAGCCTTGAGAC 
AAAGAATTTATGTTAAGGAAGGTTAAAAAGTCACTCTAAAGGTCrC!A^ 

2 5 AAACTTTAGCCAGGTTACAAACACTTTAAGAAATCCTATCTCAAAACTTCAAG 

TCGAGGACTGGATTATATTTGCACTTTTTTGTTAA^ 
ATAAATATAACTTTTAAAAAATATGGTATTTCTAAAAAGGTTTGGTGCA 
ATCTTCTCTAACAGACTATACTTCCATTTCATAATTCAGTGAGGTAATGGG 
TGCTAGTTTTCATCTGTTCATTACATTTTATGTAATG^ 

3 0 AATGTTTCTATAGGAACATACAGTGTACATTITGCAGTTCrrCTACAC 

AGTTGCTTCATAC^CCATATGTCTATCTTCAGTCCCTATGA^ 

ATGACACATAAGAGTCAAGTATACTCAGCTCCTAAGAGCAAGTGAGAAGACAAAGAAGA 
ATC^TCTTTCCTGAAGGCCTAGTTTGCAACTGAGTCGCAAAAATTTGGTTTCAAA 

CAG TTT AACAATGGG CAAAGTG AT CTGAACAGACATTT CT CAAAAAATG AT AT AG AAATGGCC AGT AAG CACAAG AAAAG AT ACTC 

3 5 ATCATCATTACTAATTAGGGAAATGCAAATCAGAACGA 

AAAAAGACAGGCAATAACCAGTGTTGGCAAGGATGTGGACAATTTAGAACCCT 
GTCIATTTCAGAAAACATTCTGGCAGTTCCTCAAAAGGTTGAACA 
ATCCGCCAAACTGAAAACAGATACTCAAGCAAATACTTGTACATGAATGTTTAT^ 
AAACAATTCAAATATCCATCAATGGATAAATAGATAAACAAAACGTGGTAAT^ 

4 0 GAAGTACTGATACATACTAGAACATAGATGAACCTCAAAAATGTTATGGTAAGT^ 

TATGATTCCATGTGTATGATATGTCCAGAATAGGGAATTCCATACAGACAAAAAGTAGATGAGTGTTTGCCAG 

gtggggatgggg aatg act gg ctg atggtt agtttc ctt agg tgg t g aaaaaaatg tttt aaaattgattg t agt aatgg atc c ac 

agccttgtgaatatacqvaaaaccactgaactgtgtaggggtgggttgcccct 

gagagacttaggaaagaaaaagacacagagacaaagtatagaga^ 

4 5 ggatcccgccagcctctgagttcccttagtatttattgatcattcgtgggtgt^ 

acaattgtggggagagggtcagcagacaaacacgtgaacaaaggtctttgcatc^tagacac 
t^agatatgcatacacataaacatctcaatgctttacaaagcagtattgctgcccgcagg 

tt ccct atct cag t agatgg ag c atacaatcgggtttt at accaag acattccattg cc cagggacagg caggag acagatg c ctt 
cctcttgtctcaactgcaagaggcatgccitcctcitatactaatcct 

5 0 cggtcaggtctttcccttcccacgaggccgtatttcagactatcacatggggagaaacct^ 

gaggtccctgcggccttccgcagtttttgtgtccctgggtacttgagattagggagtggtgatgactcttaaggagcgtc 
tcaagcatctgtttaacaaagcacatcttgcaccgcccttaa 

cggggttggg<;gtaaggttatagattaacagaatctcaaggcagaagaatttttcttagtac^^ 

CTACTTCTTTCTACACAGACACAGTAACAATCTGATCTCTCTTG CTTTTC CCCACAGAACTGTACACTGTAAAAGGGTGAGTTTGT 

5 5 GGTATGTAAATTTATCTGAATTTTTAAAATAAAAAAGAAAGTTAAAAAATGTTAAACTTGGAA 

GATTAGTAACTACTGAAATTAGTCCTTGATTTTTGGAAAAGCCTTTAAAAAATG^ 
ATGAGTCACAGAAAATTGGX^GACAAATAGATGATAACAAGGGACAGTA 

CAGATAAACATC^GGGGCTGCATGTGCATCCGTATGTGTGTATATGTTTATTTACATCTAATATACACACATGAT 
ATTATAACATATAATAAATATATATTCATACTGGAAAAATTCTCAAAGAATATACCACCAAATGTTAACAATAGCTGTCCCTGTT^ 

6 0 AGTGGGATTTCAGGTGACTT AGATCCTTTCCCC CTTTGG CCTCTCAGTATCTTATAAC^TTGTGTGC^ , AAGATCATTGATAGTTGA 

ACAATAAGAAGAAGAAAGTGAGTGTGGAAGGATGTTGCTAGAGAAAGTGTATTCTTGATTATATGGAGGA^ 
G AAGTCTTCCTT ACGAAT AG CATC CTATTTGAG AAG ACCAGTCI^CACACTGGAAAAAATG CAT AGAG CAAGGTGGG CAT CACTTG 
CAGAACAGTTTGGCIACIAGGACTACCTCTGAAATATTAAGGCT^ CTTAGAAACTCTTCAGG CAAAGTTTACAAAA 

GGCCTTTTCCTCCITG CTT AGCAGATTCTCCCAAGGCATCCAC ATCT 

6 5 ATCTGTCTACCCATCATTGTATTAGTCTGTCCTCACGCTGCTTATAAAGACATACCTGAGACTGGGCAATT^ 

TCTAATGGACTTACAGTTCOVrATGGCTGGGGAAGCCT 

TGG CAG CAGGCAAAGAG AG AGAG CTTG TGCAGGGAAACT C CTG CTTTCAAAACT ATCIAGAT CTTGTGAGAATTATTCACT ATTA^ 
AGAACAG CATGGGAAAT AC C CGCCC CCATTATTT AATT ATTTCCCAC CAGGT CCCTC C CAC AACACGTG CGCATT ATGGGGGCTAC 
AAGATGAGATTTGGGTGGGGACACAGAGCCAAACCATATCAACTATCTATCTATCTATCC^ 

7 0 AATTGGGATACGTCTACAAGACAACTGATACCTAAAAGTATGGGAATTTCATTTGCTAA^ 

TATATTATACCTAACATGATCIT3AGCACATAATATTCT 

GTGAGCCAGAAGGAGTTGGTCCAAGTAGAGCAAGTCTCACAGGCTTTGTGGTACT 
CAGAAGCTGAAAGGCAGAGACAAGAAATTCATCCTGAAAGAGTTCACAAGGGTAAAG 

AGGAAT AAGG77 CTGG "CAT ACAACAGG AGC CAACC AC C C CAACT AC AC C CACTTG GC^GTCGC^C\AGTG<^ AGGGTTTCATTG A 
7 5 AAAACTAGGTAGAAGACAGGGAGTAATGGTGGATAAGGCA^ 
ACAGCAGTGTTGTTTGAAAGAGAAAGGCJira 
AATAGTGAGATTTCCAGCAGAGGAAATTAATAAACCAGAGCTC 

GG CTCACGCCTGT AATCCTAG CACTTTGGGAGGCCGAGGCGGGTGG ATTG CCTG AG CTCAGG AGTTCGAGA 
TGGTGAAACTCCGTCTCCACTAAAATACAAAAAATTAGCCGGGCGTGGCGGCCTACG 
5 GGCAGGAGAATTGCTAGAACCTGGGAGGC^GAGGTTG<^GTGAGCCGAGATCGCGCCA 
GACTCCGTCTCTTAGCAAAAAAAAAAAAAAACCGAACACCCAGTGTTCTCACr 
ACACAGGAAGGGGAACATCACACACCGGGGCCTGTTGTGGGGTGGGGGGAAGGA 
AAATGATGAGTTAATGGATGCAGCACACCAACATGGCACATTCATACATATGTAATAA^ 
AACTTAAAGTATAATAAAAAAAGTAAAAAGAAAAGAAAAGAAACAACAACAAAAAAAA^ 
1 0 AATAATTTATTTCTTTCACATTCATTTCACAGTTTGCTTACT 

ACATAAGAGGCAGGAAAGTITTGGTTGCCAAAAATAAAGAGAAAC^ 

AGAGG C CTTGGAGGAGGG CAAGG1TTTGCATTTTGTTTGCCAAT I'l 1 T 1' CCTT AAAT AAAGTGC C CTGGACTTCAGGTCAGTCTTG 
CCCCTTGAGCTGCTAAATGATTCTGCCAGATTCTTTCAG 
AAACTGACTCACTCCCCAGC^AACAAGATCAAGGCTGAGGA 
1 5 TTCCATTTTCTCCACCTGCCATTCTACCATCACAGTGAAGCTTTTCT 
GGAGATGTTGAATTTTATCAGCCAAGATGTATGTGGGTGTC 

GCAAGTGGGAGGAGAGTGATAGGAGAACGTTTGAATCAGGACAAAACGAGTGAACAATGTGGAGG 
GCTTTGCCAACCAGACAAGTTTTTCAAGGCrAATTTTTCT 

AACCAAAAAAAGAAAAAAATCTAAACCCAACAGCCTAAGTATCAGAAACCATCTTC^ 
2 0 TTT ATT ACATTTTCACAGAAC CATTT CCTGGG C CGG AACTCATTTCT C CTT AAAT AAAT ACT AT AAAATTTTCCTCAGAACAAATC 

AAG TC C CCTCT ATAACC CCAGAAGACAAAGTACAT Ct^CAGTAAAAAC^ TTG TCAGAGG TAG CATGG C 

CTGCTGGGAGGACAGGGAGCTCATGCTGCCGCATCAGTCGGGAGGAGGGATIT^ 
CAGGACAAACTGAGAAGGGCTTGACAATGGAACTCTTTACAAAGG 
GGGCTAGCAACTGTACGGTTCCAjGAGCCAACCCATGTCG^ 

2 5 TCTXn^GAAAGGGCCATCTC^AGGGAAGCAGTGACCre 

GCCTCACTCCTTCCCTTCCTCCAACCTCCTACCAAGCCTCCCTGTGGGTCAA^ 
TAC^TGCAGTCCACATAGTTCAGAGCAGGTTGCAGGTCAG^ 
TCCMCTCACX^CCCTGTCATGCCTGCAGTCACTGTAGTACAGACr^ 
CTGAGATAAAGAGCCTGCCITGAGTCTCACXSGGAC^ 

3 0 GTTTAGAGGTITTAGGGGCTCGTTTCACAAAGCTGCC 

AATAAATACATTTGCATTCAACATTGACAAAATACACACrTAAGTC 
TCAAAAAGTAAAC4GTTCTTTAAAAGATATCATCTAGAGTAGAATTAACX3GCTATTATTA 
CATTTTGAGGGATGAACTTAAGCAGGCCCCAGACTCACGTCTTCCATGTCCT 
AATTTATCCAAATACTTCCTGTTTCAGTTGGCAAATTCCACTGCCCCAA^ 

3 5 CCTAAATCCCCTCCCCATAGTGTCCTGGACCCCATTAAAATGCC^^ 

TACATCTGTTCTGAGTGGTCA.GTGCCCAAAACAGCCCAGTTGCT 

G ATGATGGTCCTCAACTGCAAAT AAAAAAACACAGGCCACTCTGT AGG AGCCATT A CTGGG C AGGCTC AAGTTT 

TAGGACTCAGCATGCTGAATAACACCTCTATCCAATGGTAAATTTCTCT 

GGGACTTTCAAGTTGTTACATAGCAGCAAGGGTTGCTGTCA 

4 0 TTGGAGTGTTTGCTGCAGTAATTATGGC^GGATTITTC 



CCGGAAAAGAGGAAACTAAATTCACCCTCTAATACTGATAAGCAATTC^ 

TACTTTGTCCGTTTGTTCAAT ACTTTATTCTT AGTG CCTATTACAGTGCCTGGTACATAGTTGGCCCTCAATATATGTTTATTTAA 
TGAACTGATTAAATAAACTAGACTCAGGCTCTGACCCTTGTGCTAAATCTGTGCTCACAATA 

4 5 TGAAGTCAGTTCATCTCTATGGAATTT ATAGG C AAAGG GGCTTTCTTTTTTTAGAGG T AGGTTCTTGTTCT GTCAC C CAGACTAG A 

GTGCATTTTGCATTTTGAGTGCAGTGGTGTCTGCCAGTCAGACTAGAGTGCAG 

TCT CGGCTC AAACAATCCTCCCACCTC AGCTTCCTC ACTAGCAGACTA CACCTGGCTAATTTTTT AATTT 

TATTTTTTTATTTTTGTAGAGACAGACX3TTTGCCACATTGCCCA 

TATTTCTATTTTGAAGCAAACAAAATGATGCAAATGAATAAAGAGTCCT 

5 0 TCCAGACTTCATAGTGGCCCCCT/rTGCTTAAGGCATATGATAAACTTCTAATC 

TTATTTTCCTCAGACTCAAATTTTGCAATCAAAATGCACJVGTGTACAC^ 
CATTTCAATTTGTGTGATTTCTTACATGATGGATTGTGGGATGACAA 
TTGCATTTITATAAACCTCCATtlAAACAGCTITACT^ 
AAAGAAATACTCCCTCCATCACAGAGACCX3AGCCTCATATGA 

5 5 GGAGAGCCACCTC AGGTT ATTTG CATTATAG AATGCT AAACG CAGACT ATGCAAG AAACCAG CCACCT AG AGAG ATCTATATAGAA 

GTAGAGAAACrrrCCATACCACCAGCAATTTTGGCTTTCCAATAAATA 
GGGTAAGGAACAAGATAAAAGCTAACCCTTGTGGGTGrTGGTAAATCAATAATTAAT^ 

G AATCATTG CT AAAATTCTTGT AATG ACT ACACAG ACT CC CAAG ATTG CAAT AGT CAAACAACACAAACIX3TTGAAG TC ATG AAGG 
G ATATG CAACATGG CTCCTTGGTTAACCTTTTGGT AGGGATC CAGTTTCCACCATC ATGAAGTCCTGTTTTCAATATTAATTTACA 

6 0 TCATTAGAATGCCTGCTGTTCCATCAGCATTTTCTTGAG 

TCATATTTCATTTTGACCTAACTATTATACCCAGATATTTATAA 

GAACTTG AAT AT CT CAT AAAAT AAGG TCTTTC CX3C71AGC C ACACAT ATCT AG AT ATGG ATAAT AGAG CTAAAT AATCAGG TGT CG A 
GT CCAATACACTG T CAAT CT ATCHTTG AC AAGCATCCAGTTC CTTGC CTTG AGTCATT CAGTT CTTCG TTGTTCTTG CTGG AATG A 
AAGCCTATACAAAACCTAGCATTAATGAAGTTAATATGAAACTAATTAGCTGCA^ 

6 5 TAATGAATGATTCTCMTTAAGCACTTCAGAAAAGTATTTC 

GTTTGAGGTGTGGGATCCAGATTCACTTATAACCTAATCAGGAAATGAACAGGATC 
TATCAAATCAATAAAGAAAAATGTAAATAAAGTTAGAACrTGTAGTA 
CCTGCACCATAAATAAAACAAGATCTTCAAACITCCCAGGGCTATT^ 
CTCAGCCTATTCATTGCTCCTTTTCXjAATGAAAT^ 

7 0 CCTTAGACCCACGGGTTTATTAAAAGTATATTGCTTATTTTTAAATATTTAG 

AATTCAGTTTCTCTGTGCTCAGAATGCATGTTCTCTATGA 
TGGCCTTGGTGAATATTCCATGAGTACTTAAAAGGAAC^TACATTCTACA^ 
TTAAGTCATTTCTTAAT CATGTTGAG AAAT CAT CT ATX5CC CTT ACT 
CX3TTAACATCTTCTATGATTCTGTATTTGTCTATTTCTGCTTTTAG 
75 riAGAUiAATCrrAGATCnTTATTTTCCTGAT 
TGCATTAAGTTTTAATTTATCTGATATTATG^ 
TCTTTATTTTCAACCTTCTTGOTCCTrATTAG 
TACCATCTTAGTATTTCTTTTTCTGTTGTCTC 

AATGACAATTTTT AAAAT ATTTTCATTT ATCTTCTTTGTTAC CTT ATT AATTAGG C CTCTGTTG T AATT CACCT AGTGCT AACCCA 
5 GGGATTATAGCATCCATCTTTCACTCATTACAGTCTACTTA^ 

ACTGTATGCGTTTTAATTCTACATATATTATTAAAATTATAAGACATTTT^ 

CCTATATTTACTGTrTTCAATGAACTTTCTTTTTTC^ 

TTAATATTTCTTTTAGTTCAGAGCTGCTGGTATGTATTTTCT 

ATTATTGACTTTCAAAACTCAGTCATATTGGCAGCCCCTAGCCCAGTGA 
1 0 CATCTATTCTTGCTCCAAAAATTGGCAAATGCCTCAAGAAGAAATGAAGAG 

TTTCTGTGGGAT CHTGGAT CC CTCTCATTGAC CAAT ATCHTT AAAGAGTT AG ATGAT AGAT AGAT AGAT AGAT AG AT AG ATAGAT A 

GATAGATAGATAGATGATAGACAAACAGATTGATGTTATTTTTAGCAGCAG CATT AGTGTG AAT AAG CT ACTCCATTTT AGCTAGA 

GGTAGAAGT AATTT CITCTTAG AGAAAAAGTTTG TAT AGAATTGAATTT AC CAT CAT AGCCATT CTTGAATGTCAGG CCAATTGAA 

CTCCT AAG ACT CT AGCAT AT ACAAAAAACAGAAACCACT ATTT CTT ATCAT ATG TTTAAGGGTGTG T ATTTGG AAC CAATTT AG CC 
1 5 ACAAACTTCCAGATGATGCTGACATACCAATAATTACTATT 

AT ATGAGCTGTTGCTCTT ATTG CAAGGATTG T AAG AGTTCTG CAG TGTGGCCATT AT ATGTGTGCCTCA,CAACTTGTATT CATTGT 

AAGCTTCCTTTGAGAATTTTTATCTAAGAA 

AGATTGAAAGGAGAATATTGAATGCTGATTCTCTGAACATCTGCAATAATGTAT^ 
CAACCATCCCAGCTAATTAAATTACCTGGTTAACTCTGTGGACT^ 
2 0 AATAAAGATTACACAGTTGAAAAAACAAGATATTCTCCAGTTATG^ 

ACAAAATCCAGTTGTCACATTAAACAAGCCATGCTATCCAAAGCAGAGTTGATT^ 
TGTGAAAGGGCTATAACZACGTATCAGACAATCTAGAATTGACCACTAGGGGAC^ 
TAGTTTCAAGTACTATTCTGAGGCAAAACAAACAAAAAAGAATGATTATG^ 
GTTCTGTCTCAAACCTTGGCCTCAACAATCAAAGCTCTGTATGAT^ 

2 5 ATGGACCTATGTGTAGTTAAGTTGTGTGTAGGAATATGAGGATAGAAACTATTACTATCAACCAAAT^ 

CAAAGATTGTCTGAAAGAGCTTGTCCATGGAGACTAACCCT^ 

TCACCAGCAGAATTCAGTTCTGCTGAAATAGTCATGGAACAGT^ 

GTTTATAAGACAAATAAGACTATCCCTGGCATATAA 

GTAAACTCTGGAGCTACCCCAAGGTAAGATCGTACTACTGTAAATAAGCCCGAATCACTAGA 

3 0 GCATTATTTGTTTATGACCTAAGAAAATAAGTCCATAGAAATAAATTG 

TCTCAAATCAAATAGGTGGTTCTACTTCCTGAG 

CAATTTTTGAATGG TG T AACTTG CT AT CTCCTTG CCT ATTGGTACAATG CAGAGAAACTGTT CAAAGGTGAG CCATTCGGTAAAGA 
ACGATAGAGTTAATCAGAGAGTCTCTGTTATGCAGCTACATGGAATGCAGGTC 

TTGTTTGTTTTTGTTCCT ATAGTAGGATG CTGCACTG ATTTGGG CAGGGCAAAGTGGTGGTTGATGGAAG AGCAGAACACCTGGGA 

3 5 AAGTCATCCAGAGTAATTTTTGATCCCAAGAAAAGATCAGCAGAATTCTCCTTGAGTCT 

AGGTAAAAACTAATTCAGGAAATGTGCAGTTATAAAACTTTAATAAGTCACCCATTAACTC 
GGGACTCTTTATTTCCCTCTCATTCGTAAGTGGACCATCCTCAA 
CTCTCTTTCAATCTTATCAAATCTTATTAGCAAATCTTATCCCTGAA^ 
TCACCAGTTCCACCGCTACCATCCTAATCCTGGCATCAACTCTGAACTGGACT 

4 0 TTTTTTTTCCATTGTCTCTCTCCATTTCATTGTACA 

TGTGGCTTTTCAGTGTACTTAGAAGAAAATCCACACTTGTAGACCCTGATC 

AGACTCCATTTCT CATAT ATTG CAGCCATG CTTG CCTTTTC AACATCTCAAG CAAT CG AAGTG CTTT CT CG ATTTG AG ACTTTTTG 
TGCTTGCTATTTTCATTGCCCGGAATGCCTTTTCT^ 

TTGTATAAGTTTCCATTCTTTATATAGTCTTTGAATAGTAATCAT 
GGACAGTTATTTATATCTAATCCAATCTGAAGAATTATTCCAAGCA 
TTTACTTCTATCTCAAACCTCAGCTTCAATAAGCAGGCTTTCAGCCACATG 
CTAGTCATATAGTGTGTCTACCTTAACCTGGTACCTTTACTTGGGTTGCATTTCATC^ 

5 0 GTCTCTAGTACACTGCTrCTACATAGTCAGCAATCAGTGAA^ 

TAGGGCCAGAGAAAGTTGATGTTTAAAAAGTTAGAAAATGTAGGCACTTGAGAAGGATTAAGAAAT^ 

TTTATCATTGAAATCTATTATTCAATTTACATGTACAATTTCAGCAATACATAT^ 

ACTTACCATGTCTCTGACCCTTTGCCTrGCAAATTCCACTCTGATAATA 

ACTCCTTCCACTCATGTT CTAATGTAAG CCTTTCTTTT ATGCAGCCTG AATGCACAATTCACTTGTCATTATCTCGCATCCTAG CT 

5 5 GCTCCTCATGAAATGGCATTCCTCTGGATrTG CTGAGTCCTAATGTTCAGG CAAAGT AAATCCCACATCATTCTGGCAGGCTTT CA 

GGAAGACTTAGGGACTCGTTGTCTGCTACTGCAAATAACAGCAA 

CCTTGTCGCACGGCCATGAAAAATTAGGCTCACAGACGATT^ 

TAGAAACAGGGGCCCTCTCCAAAGCCAGAGTCCCTGCTAGAGTACTTCCCACCTCIA 

AGGAGGGGCCAGACTCCTCCCCTCTGCCAGCAG AG CAAACTTCTGTGGCTCCACCCCAGTGTGTATTCCTCCCAGTGCX3 CAAGTCG 

6 0 GTTGGAGTTTT CTGGG AACCG CCTTC CCAC CTGG CTGT CT CAAAAGG ACTC C CAG TTG CAG AGG T ATG TGTGATTCAACAAGATCC 

ACTGGCAGAAAATAGAGGCAAATTAACTCAAGGAGTCCATTGGAATATATCAA 
CTAAAACAGTGAATCATTTTACTTACAAGGGGTCACAATGGAGACAGGATAGAGGAAA 
GTGGAGAGGTAGACCTTGGATCCAGTTCTGCAGCCAGTGAGCTGCTCACTCAG^ 
CATCTGGGGTATGAAACAACTGGACCAGCTTATCTCTGAGATCCCGCTCA^ 

6 5 AGACTCCAGTCCCAAG TAG CAAT CTCTTCGGAGGGCA^ 

GT AAAATT CATTCTT ATG CTACTTTTGT ATT CC CACTTATTTTGAATGGAAAATCCTTG T ACAT ATTTCCTCAAATTATGGAT AAA 
AAGTAT ACAAAAT ACACCCTAGAGG C CAAGAAG CT C CTTTTGGT AG AAAG AATG TTG AG CCCT ATGAATTTGAG CT ACTCAAACTT 
TGTGGTTTGTTTTGGGAGCCAATCTTGAATATAAAATATT^ 

GTAATGCTAAGATAGATAGATTTTCTTGTGTGCCACACTTATGTTGTAACCAGAACTTAATTC^ 

7 0 TGTTACTGGAAAGGATAAGGGTTGGCAGGTCTTGTGACAAATGTT 

TTGGAAAAGCGGCAGCTATTTATTGTGACTATGAATCACGTCCACAGA 
TTTTTAAAGCCTTGGAAAATTGGCTTTAAAAAATATCACCTGACAAACCTCAAGC'T 
ACTAGGATCGACAGTGTTTTAAGGCTC^CAAGGAAAAGAATCTAACACTTGTAA 
GTGAATCTGGGCTGCCAAGCATACTGGCTCTAGCCAGATCTTCATTAGCCGATC 
7 5 TGCAAGCCAACACTACCCTCTGAGAAAACAAAAAGTAACTTGC 
TCTCCTTTTACTCGCACCATGATCAGATATTTTCAAGA 
TTCACTTTGCTTCTGTGATTATGGCCACAGTTTTGAAAATAAAGTCATGT^ 
ATAAAACAACAATGCTGTATTCTTTTGTAGAACAAGATCCrGCCrrAG^ 
GCTCCCAACCATCCAACCTATCCCTTTGCAACCAGGCTTTTGTGAGATCAA^ 
5 TCTAAAATAAGGTACTAOVGGAGGGAAGTAAGGACCGACAAGAGTTAAATCA 
CCAGGATGACAACTCTACTTGCTCITTCATTGTATCTTAT 
GAAAGAAAGATCTTTGTAGAGAAAGGGTCAGCAGATTATC 

GGCCTCTATAAAATGTAATTACCCAATGGGTTCTTCCTGCCCACTACACAAAGACCACAGCTCT 
CAAGGCCGGCCACGCCATGTGAGAGACAGAGTTATTACTCAAGTCAGTCTTATCGAA 
1 0 AGTTTAGGGGAAGAGGTGGGGGTCGCTAGGAAATGGGTGCTTTCCACTGATT 
GCTGTCCTCTTGAGCTGATTCACTTATAGGTGGGGCCCTAAGGGTGATT^ 
CCAACCATGTCAGAC^TGCAAAAAACCTGAAAACATATCTCAAAAjGGCCAAT 
GGAATTTGCATATCTTGTGACCAGCCTACACCTTAGCAGAATTCAGGCTCCTCTCCT 

T ACAAAGGTGGTTGACTTTTGGGGAGGGGCrr ATTATGATTTAAACCAT CTTCCAAAGTT ATCTTGGCCTCAG CCC 

1 5 AGGAGTAATTAAGGCAGCTTG ACCACT AAAGGTAAGAGCAGAGATGGCTAAATCAG ATCTCC^ CACTGC CATAATTTTCTCACTG 

ATACAATTT TTGCAAAGG CATTTT CAC C CACT AC CCAACTGTGT CATTGT AG CATGAAGG CAGC C AT AGAAAGG AATAGG CATGAC 
TGTTCCAGTGGAACTTTATGAATTAACACrcAAATGGAAATGTCAT 

TTTCAACT ATT CAAAAATGTCAAAATCATTTTTAGCTTGGTGGTGG ACTGGATTTGGTC CCTCTGTT 
ATAGAGTAACACATCTGAGCCCATGAGAGAATTGATCCTGTACCTTGGGTAATGCTCTTC 
20 . TCACrTTTTCCCCTGCAAGAGTGTTGTCACATTAGACTGTAAAATTGGATCACTT^ 

AAAAGTCATTCTAGGTATGACTAAAGTTCCAGGGCTAACTTAGAAAAGGACTGAA 
AGCCACAAATGCCAGTGATATTGTCCCAAGTATTGGTGGCTTGGGTTGCAC^ 
CTTGTTAACATAGCTTTAGTGGAAAATACAGAAAATGCCCCCAAACTCCATGCT 
AATTACTTGGGTTTCTATAAAAACAGCTGCCTCAAGGAATTTGCCACATT 

2 5 GGATAGGCCTGGTTTGTTTGCTAACTTAACATGCT^ 

ACACTGTTTATTTGTATCAGTAGAGTGGAGGX3CT 
TTTTTCTGACTTGGATGGGGCCAATGGGTCTAAGCC 
TTCCTATCTTC^TTGGAGTACCCTTAGAAGGAAAAGGAG 
AAATTGTCACAGTGGTGACTTTTGGCTC^AGCCAGTGCAGAATGA 

3 0 TCCATGAGGTCAGGAACCCTGATTCGTATTGCTTTTTCCATGCTGTAGAACATGA 

GTTACTTGTTATAGTAGTCATTGTTATCAAGACTTrTTGGTTGTAAGTGTCAG 
ATCTATAAATAAATATAATTAGCTCAAAAGTCTGAACTCACTGGATT 

TGAAAGACGGGCTCCAGCCTTTCCAGGCTCCTAAGTTCCTTGCAGCT 

3 5 TCTC CAT AAAAAG AAG C C ACCGTCCTTG CTTG AGT CATGTTCTC ACT CTTGG ACTT AGCAG AGTCCAGAGGGG CTTGGAT ACT CTC 

ACTTGGATACTCTCAATACGTGCCAGCAAGGGGCACATATTCACCCTTCATT^ 

TCATGGGGAGTAAGCAAAAGGAAGTTCTGAAAATGAAAGAATGTGGGACAGACATAAAACTCAAAGCTGTGC^ 

TTATTCTTGTGTACTTAATATGAATATGGCTTTTATAAACAGTAAGCAAATCAGAAGCCACCCT 

ACATTTCTGGATTCCTTCTTTATTCCCTGATAATCrrCTTTCCTTCCT^ 

4 0 GACCCTCTCCTGCTCTCAATACACACATGGTCTAGTTC^GGAAGACAACATATAAACAGCAC^ 

GTGATGCCTGCCCAGGGTGCTCGTGGTCTACCTTTCATGGACCAAAGACAACAGGAATAAGAGCATTCA 

GAGJU^T ATAGACAGTGCAAAGACTTGG ATG CCAGTGCTAACATC CCTGCCIGACCTGACTTTX3AGAAAAGGACCGCCATTTTCAGC 
ACAACCCACAGGTTCTCTAAATACTCAAAGTGTGAGGCTAGCAGATAA^ 

CACAG CAAG AAAGG AACAAATTTTT AATG T ATT AATTG AAAATCAGATT^ CAG CTGGG AAGAAGAT ATTAGCTGGAACACACAC 

4 5 AAAAAAGTATTTTTAAAGGTGCCC r'lTTAAAGGCTGCCAAAACTCACCCA 

AGACGGGAAATTTTAGTTCCGAGTATGAAAGAGAATTTTATTTATTTGTTTTT^ 
AGGGGATGAGGGTATGTGTGCTTAAAGTAGATGCCAGAGTGTCOT 

TGAATGAGGACCCACGGCCCTGTCCAGTGTGGACATGCCATAACCCTGTGCTGAGCITCCTCATCTGTG 
ATATGGCTVTAAGCAACCCCGAAGAGCCACAAAACTCTGCTCT^ 

5 0 CAACCAGGAAATGGTGGTCAGGATGGATACAGTAGCATCTATACCTCAGCCCCCTTGAAGTATCCTGATGTCCACT 

TCTAGGCCTTCC^TAGGAAGTC^GAGGATATTGTGCTGATATAAA 

GTGCATATTTCAGATATAAGATATTGCTCTC^CCTCAGCTACATACAGTT^ 

AAAGAATAGCCTCGTGTTGATGTATCTAGGGACAGAACTCTCAGGCAC<^ 

TC C CAAG AT ATCT CATTG CAT CTTTT AATTT AGCTCCT AATTCAAAGC CAAGCT CATGC^CTGTTTCAAGCTCTGGAC CATGAG CTT 

5 5 AGCTCTGAATTAGGAGCTAAATTAAGGATGCAACGAGACATCCT 

AAAAG CAGACCCACCCAGCATGTGTAATTCTTCCATGAGGTTTCATCTCACA.GTCTGCATGT CTTGCAAGTGGGCTTAGGGCCACA 
GGGGCAGGAGCTCTTGGTACTCAGAATGTTGTCTAGAAGTTTGGAGG 
GTAAAGGACTTGCAGACATGATAACTTTGAGGACAATATCTGA 
AACAGGAATAGCAAGTTCATAAGCCCTX*ACCTCAC*GAACCrc 

6 0 GCACAAGTGTGAGGAAAGTGJTTGGTAGjGATOTGATTTCAGAGGGATGCCT 

ATAACTTTGACTTTTACTTTGAATGAGATGGTAAGCA 

TCTGGCTCCTGTATTAAAAATAAACTGAAGGGCAAGGAGAGAGGTGGAGAGACCCAT^ 

CAAATATCACTTAGACCTCGGTGGTGCAAGTAAGGAAAATGAGAGGAAATTGCATTTATAAA 

CAGGATTCATCATTGGATTGGATTGGAGTAAATGAGAGAGAGAGGGAAATCAAG^ 

6 5 GTTTTTTTTTTTTATCTTGAGCAACTGGAGAGATGGAGA 

CTTAT AAACTG T ATT ACAGGTGG CAATCAAAAT AAG CACCTGG CTGATTTGAAAT ATGGTTT AAAAAAAATT ACAAAGT AA 
ATGAACTTGTATTGTTCTGCCCATGCAAACAATCTGTAAAACTCT^ 
AGAGAACTGTTTGTGAAATCATGAATTATCTTTTGTTTGAATTT^ 
CCTTTTTTCTAAAGGTGAGAGCCTAAAACTACCATGTTGCAATGG 

7 0 T ATT AGCTTTGGAATCTCCGGT AACT C CTTTAATC CTTTG AATTCTCACTTT CT ATGTCTTAAAGGTG ATGG CTG CCCATTT CATC 

AACTGTGAAATGGCATCCAAGTCCCAGGTATCGTTAAGAACCA 

G TTTT ACAAGGGTATT AT AATT CAT ACC CTT ATTCATTGTTCTTCCTTG CTCCGAATAGT CC^ CTG TGG C 

TTATGTGATOJlAGAGGTGAAACAAGTTAGCTCAATTGGTTATAT^ 

75 GTGT AG ATCT ATCACTGTT ATTGAG AAAGCAGTT CAAAAAACAAG CATTCC CG C^TGTT ACT AAC^TGTTTTTAT AAAA.GCTGT 






ATGTATATTTATTCATATGATTTAATATTTTTTATCTTCAGTTTTTATTTTAAG 
GTTACACAGGTAAACATGTGCCGTGGTGGTTTGCTGCACAGATCCT^ 



TCATTCAGCTCCCACTAATAAGTGAAAACATGTAGTGTTTGGTTTTCTGTTCCTC 
5 TCCATCCATGTCCCrGAAGAAGACATGATCrCGTTTT^^ 

TCCAGTCTATTATTAATGGGCATTTGGCTTGATTCCATGTATTTGCTATTG 
TCTTTATAATAGAATGGTTTATATTCCTTTGGGTATATACTCGGTAAGGGAATTGCT 

TTTGAGGAATTGCCACACTGTCTTCC^TAGTGG . 
AACCT CACT AGAGTCAGT AAT ATri' C1TCAAATATGAAGG ACITGA CCTCAG ATC CTTTATGATG^ 
1 0 AAGTTAAAAAATAGGAAGGACAGTGTGATACTAAATAACT 

GAGAGATCATTTTCAGAGCCTCCAAGAGTTTCTATGCCTTGAATTATTTGAGCATCCAGTGC 
TAAATATAAAACAAGAAGTCAGATACCTTTGCCTCTGACT^ 

CTACCT AGAAACT ATTCCTT C CATTCAGTT AGG AAAT AT ATTCAAGT AT ATTTT ATTT AAT CTTAAACTTTGG AATCC CATTTTGG 
TAAGCTGCTGTTATTTGAAGGTGTTGACGTATCCrCTGGGATCT^ 
1 5 TGTGCGATTGGTTTTGAAATCTGTCCAGTGGGCTCTTTCrGGAGGC 

AAACTGGGATGACATTTGGGGTTGGGATCCTTCTCAGGACCAAATGGGCACC 

TGTGTGTGTGTGTGTACAATGGTGGGGAACAGTTGGCAGAGCAATGAGCAA 
ACCCTGAGCAAATCAAAGAATTTCTTAGAATGCAACACTAGCTGTC 
2 0 TTGTTATATAATACCAATCATTTCAGTGTCACTGACGAAAGCCCAGA^ 
TTGTGAGGAATAAGGCCTTGAGGATGAGGGTCTAAATCTCIW 
TCTTTTGAGAGAGACAATCCCAGACGAATTGGCCTGGGGGGCAACATGAGAAGGCAA 
AGAATCCAAATTGTAGTTCTCTCAGCAAGTTATCAAGAAGCAAATTAACTGCTAGATTT^ 
TAGCTTTGTGTTGGAAGGATCGCTACTAATCATCTCTTTTATTAAAATATGAT 

2 5 ATACCAGGCACTTTGTATACATGATTTGCCTTATTCAGAATAACCCTGAAGGATCAATATTATT 

AACTAAGGCAT ACAAAAG TT ATTTG CTCAAGGT CAAACAG CATGG T AAG AGG AACTGC AATTTCATTC C AGGT CTT ACGGATAC CA 
AGTCCGATGCCTGCCAGCITGCACCTGTTAAGTCATGTGATACTTAGGAAATGTACACATCACCCC^ 
CTTCCCTTCATTTTGCCTGGATTGTTGCTGTGAGTGAGAAATAAATGACATC 
GTTAGTTCCTTGCTCTGGATTGTTTTTCTCTTTTGTTGGATATTTATTTTGACT^ 

CITAG ACAAT ACTTGT ACT AGTGTGGGTTCATGTTGTCCATTTGT CTTT AGAGGTGTAGAACCTTGAAAGTCAG CT AATTAATCAC 
T ATT ACTGTTGTG T ACAG T ATTGG CAAT CTC CT ATG CTAAAATIT^TTG TGATGATTTG AAGGTGGGAGGGGG CAAT ATTT ACT AA 
GTCTCT CAACCT ACATGG CAATATCTGAAAC CTGCTTATAATGT AGT AATAGCTT AGGCACCACCTATTTCTGAAACATAAATCAT 
CCCACAATATAAGCCTCCCAGTTTTATTCATGCGTCTTACCATGTATAAATATCTTTTAATA 

3 5 GTTATTTGTAGCATAAATCTGCCTGGATAAAATAGTAAAAATTGCTATGCCCC^ 

AATGAGACATTGGAAATGTGGTTTATAGTTAAGGAGAATGGATCACACGGATTATAC 
ACTAAGATATTAACACCAAAAAACTGAAAATAGTATTTTTCATCTTACTTTGTTATGTA 
TTCTATTATGATGGGATAAATTACGGTGTGGTAACAAACAAGCCCAACCTGTTTGTTCCGGT^ 
ACCCCAAAATITGTTTAAAACAGCTATCATTTGATTGTGCTCACAAGTTCTC 
40 ATGTCTGTTGTGTAATGTCTGGGTTCTCAGCTGTGAACACTCAAATGGCT 
ATGTTTGGTGCCTGGCTGGGATGACTCAGATTAGTTGGGGCTC^^ 

TCTATGGTGGCTGGATTCTGAGAGGGAGCATGCAGAGAAAAGAATTCCAAGACAATGGATGCTCTAAGAGAAGCT 
TCTGATCTAGTCTTGGTGGTCATGCAGCATCACTTTTGCCACCCAGATCAA^ 

AGAGTGTCAAATTCACIACAGAAGTTGTATGOIAGCT ATTCTTGGT CATCTTG CTCATCTATG AAATAT ACCCTCCTCCATGCATTG 

4 5 ACTTACAACAAGGGTTT A TTTCTTGCTCAAAGTCTG CTGTGAATCTAGGCAATTCTCCAGAG CAGCTGTT C TTT TG rTGGCTTAGC 

AATCCAAGCTGCTGCTTCCATCTTACAGCTTTTCCATATCA^ 
GGATCACACACTACTCTTAGAAACCACCAGCTGGAAGGGACTTATATCACTC 
AAATTCAGTAGAGCAAGCATGGCCCTCCrrGTGTACCCAGAAAGGAAGGAGAGCT AG AAATAT 
ATAGCCTCAGAGATATTGCCTCGTGGTGAAACCCTATAATTCCCCTCAATTCTGATTTTTG 

5 0 TCCTTCATCCATCCAACAAAGATGTTTTATATGCCTACTATTAG 

CCGCCAAAACACCATATAATTCACTCTAGAAAACTAAAATTTCAGATGCT^ 

CAG CTCCG ACTTTG TC CTG T CTC CTCT AAGG CATGTGAAATAAGATTTTTTTTTCCTATTCTTACCTCCTATGGTCCCATTTTATG 

GTTITCATGCACTACTGTCAGTCIAAATATTTATCATATGTCrTTCAATATC^ 

GCCITAACTTTATGACCGACTATTAAGCACGCAGCTGTA 

5 5 CTGTCTTATAAATTAAATCATTTTAATCCATCnTAGAGTTATCATGCATAATTAGTC^ 

TAAAGGAAATTTAGATATCAAGGCTGCAAAGAATGCGTAAGTCTAGGCAGAAGGCTCTTATTT^ 

CATTTCAACAATG AC C G CAAC AAAAAC CATCAGG AG CTG TTTGG GGCCTGTTT GT GCTTT AGTCTT CAATT AG AAAAAAATG TGTT 

TCTGATGGAAAACCATAGAACAATCOTCACTCTGTGGTAAGAAAGAATAAAGAGAAA 

GAATACTATTTTAGACTAGGTAGTGGCCGGGCGTGGTGGCTCACGCCTGTAATC^ 

6 0 ACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACATGGAGAAACCCT 

GCGCATGCCTGTAATCCCAGCTACTCAGGCGCACTGGCTCACGCCTGTACTTGGGAGGCTGAGGCAGGAGAAT 

GAGGCAGAGGTTGCAGTGAGCTGAGATTGCGCCATTGCACTCCAGCCTGGGCAACAAGA^ 

AAGACTAGGTAGTTATCTTCCrGGGTGGTTTTGGCTATGTAGAA 

TTGAAAT ATC CCACTG AAAAGGTCT AAATT ATTGAT AT CITGGG TCAAG TT CTG ACTTG AAGG C AAAT AGT AGG AAAT ACAATGG C 

6 5 ATTTATTGGAGGCACACACTACAAATATTGTAGTATCTTTAAGATGT^ 

GTCACTAAGACTCTCTAGCAGAGACCCTTACCTTTTGAAGTGCA^ 
CTTAGTTGTCTGCTACTCTATTGCAGTATCATATGTGTGAGCTGCATGAAAACT 
TGGAGCACAATTTTATTAATAACAATATTATTATTATCAAAGGTTTTAATT^ 
CTAGGTCAGCCAGTTAACCCrCAGATCTGGAGTACCACATTTGGCTGAGT 

7 0 AACTGATAATAGACATTCTGATAACTTTAACTCCCACAGTAATAAATCTTGGGTC 

CTTACTAATTATACTAAAATAGTTACTAAGCAACTCGACATTTTGATTT^ 
TTGGTGCTTTAGAGTCCAAGGCACTATTTTACAGCCTAAAGCTGTATTTCT 
AGGGATCATCTGTCCTCAAAAATATAAGAAATTCTGTAAATGTGTATTTTTG 
CTATATATACTTCCAAGACCCCTTAGAACTTACACGGTACCTGTAATTT 
7 5 ATACATTTAAAACTAGGAATTACCTTGTTTGCAATA^ 
TGAACTATTATAAATGGCTTCAAAGATCCTTTCGTTTCTA 

GTGCTCATCTACAGTTATTGAATCCTTGCAACAATCGTATCAGGTAAACGCTGTTATTCTTCT 
TTCTAAAATGAAAGTTTGTCCAAGTTAACATGACCAATTAGGAGAGCT^ 
TCTCTTCCATCCTTGCCCCTTCTCAGCTCACATAGTAGCTTTCAG^ 
5 TTCCITAGGAGAGGCTAGAAACTGAATCTrCTACCCCCTTTATTGGC^ 

ATACTCCCAAGTTCAACCCAAATTGAATGGTTGGAATATTGTTGGTTTGTA 

CAAAGTGAAGTGGAG ATTG AAG CAG TTTTCTAGATAATT AAGG CAAAAAT AACTGCAATl"ri"l M IT AAAAG ACAGTTTGAAATATTT 
AGAAGGGTAGATTGCAAAATTATCCAACTTAAGGTAGTCATATTCAGGGA 

AAATGAGT ATAAAACTTAAACCT ATGATTAATT C C CTGTTG TTTTTCT AT CTGT ATTAT ACAT AT AAATATATTTT AT AT ATTGT C 
1 0 CTCCTGAGATTTCAAGAGCAATTITCTTCTAATTCTT 

CTGTGACAATCTTGTTCTCCATTTGATTGCATGAAAAATACC^ 

TTTGCTGCAT ATCATTGC CT ATG CITCTTGAG CACT AG AAT AT CATG T ATAAACCAAGGCCTTTCTTCTG AAAATT AAT 

AGTACTTOMTCTCTTCTATATTCTGGTTCTTT^ 

TTTTGAGGCCCTGATTTTTCACAAAACATTTTTCT 
1 5 C CATT AC CCTTTGG CCTCCAGTT AAG CTCTGTCCTG CAAGACAT AAATTG T CTGATTATCAACT CCT ACTTTG AGTG AAAAATGGT 

TG AATG AAATG AAT ATG CAGTTCATTTCAACAAAGCAAGAAATTTGCTGTTTAAAATTG TG TTTTTATAATT CTTG AGATGG AAAT 

ATAGCAGACCTCTGTGAATCTGAGAAGGCATTTCATGCT^ 

GGAAGCAATGATACTAGAGTATGCAGCAAATCTCAGGGAGATTTCAGTGGTAC^ 

ATTAAATCTTTGCTGTCCTTTCTGGCrrGGTTTGATATTT 
2 0 ACAAAATGGACT^CCTCTAATGGCrTGGTAOATGGGAT 

GGTAGATTCCTGTTCCCAACCCAATTAGATAACTCCACTTCCATCT 

AAAGAAGCCTTCCTTCAAAAAGGTTT AG AGGG CT AAAT AAG AGC^ CTGATTG CCTGCGT AA 

TTAAACTGTCACTCC^GCAAAAGCATTTCTCTCCAAAA 

2 5 TCCATCCTCTCCTGCAGAGAGAGGAATTAAGTCTCCACATTTCTAAACCATT 

TTCCTATTCTCAGAAGGAGTTATAACGATGACAATGATGC^^ 
GGCAGCCACCCATGTACTGAACACAATAGTGCnTTGGAAAGA^ 
AGGAACTTAATCTCCAGTtrrGATAAAACGATGCACACATTACAAGGA 
ACGTACAAATCTGCATTGCAATTGATTTATTTC<5ATGCCTG 

3 0 TTTTATCTATATATACTCTCTTTCATATCCCCCArc 

TCAAATGCCTACAAGGACTC^TTGGTAGAACCAC^TTCT^ 
ATTrGAGGGTCATGATGAAGTTGTACTAGGATTTATGGAGGT^ 
ATGTCGTCTATTGGGAACAGTCCTGCTTAAAGCAATTAGCAGGGACA 
CAAATACCCTCTGCATTAGCCATGTTTTGAAAAACTC 

3 5 TC^CAGTTTTATAAAGAAGTTTCTAACCAGAGAAGAAAGGTGTTATTTTGAAATA 

ACATTTGTTGAGGGCCTGACATGAGCTAGGCACATCrGCAAAGTTTTAG 
ATGCGGGCAAAGCAAGCACATTTTTTAAGTTGCAATAATGCAACAATG 
CATGGAAATCCTCAAGCCAAGTTCAAAGAGTCCTCTAATGTATGGCATAC 
AAAGGCAACCTCCAAAGATTCATCCAATCCTAAC^TTTCATAGAGTAGGGACT 

4 0 GGTGTGCAGTAGACATAAATTAGGGGCCGTCCACTTTTTCTTGCAATG 

ATCT CTT CTT GT CTTCATTTTGTT CTTCTTTT ATT AAATCACCCGC AGG ACAACCGG ATG CT C CAG CAATAGATTAT AG AAAAAAC 
AGATGGGTATTTCTGTGCAGTCAGAAGGCTCCTGTGCCCTTTTCCAGCATAAAACG 

AACCAGG CTGATGTGAG AACT AGCTTTCCACAGTT CAT AAT ATT CACT CCGGG ATGATTTGACAGGG CrGATAT/TTT ATCAC G AAT 
ACAAAGGAGATGAGCACCATTCTCCTAATATATTCATTACTTGTCrCACCACCCTGGTAAATAATAT 

4 5 ACACAGTGTTCCCATTGCTCTTTTTTATTCCCAGAACTCT^ 

GCACCTGTAGTCCCAGCTACTCAAGAGACAGAGGCTCTAGACC^ 
CTTTAAAAAAAATTGCCTAAAAATTTGGCCAGGTGCGGTGGCCCATGCCTGTA^ 

ATCACTTG AGG C CAC^G AGTTTG AG ACCATCCTGGC CAACATG GCT AAAC C CTG T CT CT ACTAAAAAT ACAAAAAAAAAAAAAAATT 
AGCCAGGCATGGTGGTGTGCACCTGTAATCCCAGCTACTTC^AGGCTGAGGCATAAGAATCACTTG 

5 0 GCAGTGAGCCGAGATTGTGCCACTGCTGTATTCCAGTCT 

AAATAAAAAATTACTTAAAAATTTGATTTTACATTCCTCCTATTCT 

TCTATGTTTACCTGCATCTCTAGC^AACAAAGCCCCTTTCACTTGTTCT^ 

TCATTCATTTGTTCATTTACAAATATCTATTAAGCACCTACTGTGTGTCAA 

5 5 GCATCATACAAAACAAAAAAGAATGCAAAACTGGTGCCTTAAATAG 

AATGTCTACAGAACACATGATTTCAATTAACAACAGTGAGTAAACGAATGATCT^ 

TTTTGTCG TTTTTT AAAAATTTTGTTTG TTTTT CT CAT ACTCAT ATT ATTCT AAGGCC CATTG ATTTTAG AAAG AAAATCT GTTTT 
ATAAACTTAGCATATTAAAAG 

HUMAN SEQUENCE - mRNA 

GGAGGCTCAAATCCCCAC^CGCCGGTTTTCTC^CTCGGCTTTCTGCTTACT 

GTTCGCGAAACAG CTTTCCACACAATGGAGCTTCATGTCCTCGTG CZAGGAAGTACTCATCGACTCATCTGGCAGACTT^GCTCCCT 
GACAAAACTAAAGAACTCTCCTATTCATGXIAGGCGAACACTGAGGATGCTT^ 

TCCTG CAGCJAG AGAAGGCAGCCAACATGAAGGAAAATGTGG CAT CTGCAACCGTTTT CACTCTCCT ACTTTTTCTCAACACCTG C 

6 5 CTTCTGAATGGACAGTTACCTCCTGGAAAACCTGAGATCITTAA^ 

G C CTC'GGACAC^TGG AGG ACTTCCT ACCAATTATT CACTG ACTT ACCACAG GGAAGG AG AGACACTCATG CATG AATGTCCAG ACT 
ACATAACCGGTGGCCCCAACTCCTGCCACTTTGGCAAGCAGTACACCTCCATGTGGAGGACATA 

GGAAGTAAAACAGCCAGAAGACAGAAAACCCTACCTGTGGATTAAATGOTCTCC^ 

7 0 TCACGCTCCTGTATGAAATTCGATTAAAACCCGA 

ATTCTCAGCCTACATCCAGGACAGAAATACCTTGTCCAGGTTC^ 
GACCTTCATTCAGATACCTAGTGACTTCACOVTC^ 
TGATTATTGTCTGGGCAGTGCCTTTGAAGGGCTATAGC^ 
TTO3ATGCTCATCTGTTGGAGAAGG<5CAAGTCTGAAGAACTAC^ 
75 TG AGGACTTG CTGGTGGAG1 A'rrT AGAAGT AG ATGA7 AGTGAGGAC CAG CAT CT AATGT CACTCCATT CAATL^G AACACC CAAGT C 
AAGGTATGAAACCCACATACCTGGATCCTGACACTGACTC 

GAGGAACCCOIGGCCAATCCCTCCACATTCTATGATCCTGAGGTCATTGAGAAGCCAGAG 
CCCCCAGTGCATAAGCATGGAAGGCAAAATCCCCTATTTTCATGCTGGTGGATCCA 
GCCAGCACAACCCCAGATCCTCTTACCACAATATTACTGATGTGTGTGAGCTC 
5 TTGAATGAAGCAGGTAAAGATGCTTTAAAATCCTCTCAAACCATTAAGTCTAGAG 
AGAAAGCTTCCATTCTGAGACTGACCAGGATACGCCCTGGCTGCTC 

ATT ATGTGG AGATT CACAAGGTCAACAAAGATGGTGCATT ATCMTTGCT ACCAAAAC^ CCCAAG AAG 

CCCGGGACTCCTGAGAACAATAAGGAGTATGCCAAGGTGTCCGGGGTCATGGATA^ 

TGCTAAAAACGTGGCTTGCITTGAAGAATCAGCCAAAGAGGCCC 
1 0 ACTTCACTGCAACATCAAGCAAGTGCAGGCTCCAGCTG 

TGAT AG CTTGACT AATGGAATGATTGGTT AAAATGTG A1T1TTIT I CAGG T AACACT ACAG AGT ACGTGAAATG CT CAAGAATGT A 

GTCAGACTGACACTACTAAAGCTCCCAGCTCCTTTCATGCTCCAT^ 

CAAATCATTATGTTTCCTAACTGTGATTTGTAGATTTACT^ 

ACTGCTTAGTATTCTTGAGGGACAATGCCAATAGGTATATCCTCTGGAAAAGG 
1 5 AAATTGTCAAAATTGTTTACCATAGAAAGATGACAAAAGAAAATTTTCCACATAG 

AACTGCATAACCTTTACACTCCTCGTCCATTTTATTAGGATTACCCAAATATAAC 

TTGTTTACATAAGTTCCTATACCTTACTGACACATTGCTGATATGa^G 

HUMAN SEQUENCE - CODING 
2 0 ATCAAGGAAAATGTGGCATCTGCAACCGTTTTCACTCT^ 

AAAACCTGAGATCTTTAAATGTCGTTCTCCCAATAAGGAAACATTCACCTGCT 
CCAATTATTCACTGACTTACCACAGGGAAGGAGAGACACTCATGCATGA^ 
CACTTTGGCAAGCAGTACACCTCCATGTGGAGGACATACATCATGATGGT 
TGAACTTTATGTGGACGTGACTTACATAGTTCAGCCAGACCCTCCTT^ 

2 5 AACCCTACCTCTGGATTAAATGGTCTCCACCTACCCTGATTGACTTA 

AAACCCGAGAAAGCAGCTGAGTGGGAGATCCATTTTGCTGGGCAGCAAACAGAGTTTAAGATTCT 

ATACCTTGTCCAGGTTCGCTGCAAACCAGACCATGGATACTGGAGTGCATG^ 

TCACCATGAATGATACAACaSTGTGGATCTCTGTGGCTGTCCTTTOTGCT 

AAGGGCT ATAG C^TGGTG ACCTCCIATCTTTCCGCCAGTTCCTGGG CCAAAAATAAAAGG ATTTGATGCTCATCTGTTGG AG AAGGG 

3 0 CAAGTCTGAAGAACTACTGAGTGCCTTGGGATGCCAAGACITTCCrCCC^ 

AAGTAGATGATAGTGAGGACCAGCATCTAATGTCAGTCCATT 
CCTGACIACTGACTCAGGCCGGGGGAGCTGTGACAGCCCTTCCCTT^ 
ATTCTATGATCCTGAGGTCATTGAGAAGCCAGAGAATCCTGAAACAACCCACACCT 
AAATCCCCTATTTTCATGCTGGTGGATCCAAATGTTCAACATGGCCCTTACCACAGCCCAGCCAG 

3 5 CAC AAT ATT ACTGATGTGTGTGAGCTGGCTGTGGGC CCTG CAGGTGCAC CGG CC ACTCT GTTG AATG AAGC AGG T AAAGATGCTTT 

AAAATCCTCTCAAACCATTAAGTCTAGAGAAGAGGGAAAGGCAACCCAGCAGAG 

AGGATACGCCCTGGCTGCTGCCCCAGGAGAAAACCCCCTTTGGCrrCCGCTAAACCCTTGGATTAT 

AAAGATGGTGCATTATCATTGCTACCAAAACAGAGAGAGAACAGCGG 

GTATGCCAAGGTGTCCGGGGTCATGG ATAACAACATCCTGGTG TTGGTGCCAG ATCC^CATGCTAAAAACGTGG CnTGCTTTG AAG 

4 0 AATCAGCCAAAGAGGCCCCACGATCACTTGAACAGAATCAAGCTGAGAAAGCCCTC 

AGGCTCCAGCTGGGTGGTTTGGATTACCTGGATCCCGCATGTTTTACACACTCCTT^ 
Table 43 

MOUSE NOMENCLATURE 
ICSGNM Top1 
Celera mCG15357 

HUMAN NOMENCLATURE 
HGNC TOP1 
Celera hCG38802 

MOUSE SEQUENCE - GENOMIC 

AGCCAGTCTTGTAAGCCTITGTTCITrCTTGGGG 

CAGAGGCTGCAGCTTGCTCTGACrCTCCAAAAATACGCTCTGCTGCCT 

GTCCCTGGACAACTTTTGTTAATTGACCAGTGGTTTCACCTGCTCGG 

TGGGCATAAGCTCCCCAATGGTAGTTTGTCTGATCAGCAGAATAAGCTCCCTG^ 
1 5 CTCTGGAGTGAAAGCAGCTGCGTTTGCTCGGGCCTrGCGCT 

TACTAGGGAGAGCGGCnTTTACAANNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TCTTTAATTTGTAAGTATTCTACCGGAAG CGTGGACACG CCCACCCTCG^XlTCCTTC!AAAGACTGGAAATGCCrGTTGTATTTTCC 
TTTGTTCCTCTCTGGGAATGAATGAGTCTGCJGCACTGCCTCTCrGCGC^ 
2 0 GCGCATTGCCTCTCTGCGCATTGCTGACGCACTACGCAGGGCGGGGGCT 
CCAATCAGCAAACTGGCCTTCGCCAGCCGCTTTTGGCT^ 
CATAATGAGCrGCTTTTTCCTCCCAGTCTGTTTCCTCAGAGGAGAAT^ 

AG CC CATCTAG CG ATG AGCACAGG CTC CTTTT CCTAGAG ACCTC CG CTAATTGATCT 1'T CI' 1T1TTTC C (nTTTTCCTCTTTTT CT 
TCTAATCTCrCTCCAGGTATTCCTACCTAACCTTAACTTTTTCT 

2 5 T ACCAT A1TT CCrTTTTGTTC CT ACTCTCTCT CCCCACTTT ACTTCTG AT AGCTTGTC CTGAATTTCCTCT AGAATTTTT AG CCCT 

ATCTTAATCACTTGATAACATGTGAAAAGGAACAAAAGGGCTCCTAAC^ 

TT CTG AT AG ACTGTCTTGAATTTCCTT AG AAAG TTT AAG ATCAG ACTT AC CT CGTAAAG CTGT ACTCACTGGT ACT CT CGTT CC C C 

AGCTGAAAAGTTCTGAATTCATACAGTTGAATCCTTCTTAACAGT 

GTTCTGGAATGAGGGATCTTCCTTGCGCCAGTCCCGAGTTTTTTCT 

3 0 CACGACCGGCCAGGAAGAACACCACAGACCAGAATCTTCTGCGGC^ 

AGCAAGAGAGAGAGAAAACGAAACCCCGTCCCTTTTTTAGGAGAGTTA 

G C C CATCGG CCG AGTTG ACGT CACGGGGAAGGCAGAGCACATGG AG TGG AGAACCACCCTCGGCAT ATGCG CA^ 
CCACTTAGAACACAGCTGT CAGCGCCATCTTGTAACGGCG AATGTGGGCGCGGCT CCCAACAAGCTTAT ATGCT CTCTCTC 
TCTTTCTTCCCCAAGGGAGTGTGAGGCTTCTCTTCCTATCACCAGCCTT^ 

3 5 TC^CCCACCCCCACCCCCTTTTTTTTTCaUVGACAGGGCT^ 

GTGTCCTTGAACACGGAGATCTCCCTGCCTCTGCCTTCTGAGTGCTGGAATTAAAGGTGTGTGCCACCACTACT 
T1 W 1TTTTTTTTTTTGAGAC^GGGTTTCCITTATGTACCCCT 

AGAGATCTTGGTAAACTCCACAGAGGGAGTAGCTACAAATGGGGTGTGAAGAAGAGGACCAAGGAT 
GTTTGAGACTGAGAACTGGGCAAGGTCAACAGAAGAAATAAGG 

4 0 ATTATCAAGTAGACCCGAGAGAGAGAGGCTGTCTGTGGGAGTCAACTTAGGAACTATAAAGCT 

ACATGCTGTTGATAAGCTGGATGGGTAACCCAGCTCGTTGGATGTTGCCTCAGAAGTAT 

AGAJVGCTGGACGTGGCGGAACACGGAATGCCAGAGC CTCCTCTGCTTCCTGTCTGG CCATCCCCTTTCCAGATATCTCACATATGG 
AGAAGGGCACAGTGACCTAAAGACAGCITGGGAGATCAGGAAGGTAGTC^ 

CATCCITG CCCAT AAAAAGAAGGG AGCACTAGAAACCATT C C CAG TG ATGAAAAAG ACG AAACAAG ACAAACT CG CAGAAGC ACAT 

4 5 AGCATAGGAAAGAGCTCAAAGTAAGTAGGTAGTTTTGGGGTTTGT^ 

GGCGCACGCCTTTAATCCCAGCACITGGGAGGCAGAGGCAGACAGAT^ 
CAGGACAGCCAGGGCTACACAGAGAAACCCTATCTTGAAAAACAACAACAAAAAAAGAC^ 

CCT AGCATTC ACTATGTAGACCAGGCTGATCTAG AATT AACAGAGAT CCTCCTGCGTCTAC CTGCCATATCCTGAG ATTAAAGTCC 
TGATGCCTGGTTTCTAAAAACX3TATTTTT AAAGAAGTCAGGAGGG CACCAGGGACAGGAGAGCAAGTGACGCTCCTG CTACAGGGA 

5 0 CTTCTGTAGGAAAAGATGTCCCTTGTAATAACCAAGCTCCT 

AGGGAAACGGGAAAGGTGGTGCTTTCAAAATCTGACCCTC<5CTTAGCGCT 

CTCTCTCTCTCTCTCTCTCTCTCTCTCT CT CT CrrCTCTCTCTCrCTCCCTCTCITCCCTCCTTCGTTCCTGAGGAGAAACGCTGM 

TCAGCGCAGGGCTTCTCTAGCTCCCAGAATTTTGGGGTGTTT^ 

TGCTTCTCTGGCCAGAAGCAGTTAGACCCAAACAGCAGCGGGATAATGACCTCCTG^ 

5 5 GGCTG^TTGTCACACCCAACCACTCTGTTCCAATCTAGATGGGTCAAAGTCCTCT 

TCCAACTAGCCTGCTATGGTCAGTATGTCTCCCAGCTAGCCTCCCCTTCTCCATTGTCAGCTCCCCGGGACTGTCCGCT 
CrCTGGTACTCAATGGTTTTCAGATATCATTTAAACCTAGCTCACCTTG 
TCCCAGCCTCAATCTATGCCACAGCCTTACACTGACTTCATCTGTCATCATGATCTAAAGTC 
TGGCCATTTGCCAGCITTTCTTGTACTATCTTCTCT^ 

6 0 GGTTCCCCCCACCCCCATATTGTCCCTCCAAAATTTCCTAATGAGTTGGATG 

GTTTGTCACCTCTCCCCTTATTGTTCCTTGTATCTTAGGGTAACAGTTGAGGAATGTG 

TCCAAAATGAAGATCAGATCTTGTTTGTATCTGTATTTACACAGCTTAGCAT AGGA CTATCTACCT CAT ACT ATCAACT AGTCTC 

ACATTCCTTGGGGAGTAATGGTGGAATGACTAAATAATCrTATCACAACAGGGGTGTCT^ 

GCCTGTAATCTCAGTACTTTGGGTGGTTGAGGCAGGATGAGAAAGATAA 

6 5 CC CAC CCC CCAAC C CC CAG CCAC ACGG CAGG CT CCT AAAT CTCAG CT ACTCATTCTG CT ATT CAAGGCTTGGGT CTTTCCCACCGG 

CCrTTCCTTTTATTCAGACAATTTTCAACTTTlGCT 

ATCAACAGCATGAACGTTTTCCATGGATCACCAAGCAGGTTCTATTTTGAGAGTGTC 
GCAAGGCATTATGGCAAGGGGAAAACTGCCAGAAAACTGTAGTCA^ 

ATGCCAGTAGTAAAT C CAG CACTT AGGAG ATT CAGGGCAAATG ATCCAG AG TT CAAGTTTGGCCTCAGCT AT AT AATGAGTTCAAG 

7 0 CC CAACCTGGG CTTCATGAAACCCT ATCTCAAAGTT ACAAAGT AA^ CCTTTAAT 

CCCAGCACTCAGGAGGCAGAGGCAGGCGGATTTCTGAGTTCGAGGCCAGCCTGGT^ 
ATACAGAGAAACCCTGTCTCGAAAAAACAAACAAAAAACAAACT^ 
AG^KIATATCTGGGTCCTGAACCATGAAAATCCCAGAGTGGAAC^ 
ATC^AGTTGAACATCAGTCACATGACCCAACATAGCT 
7 5 C^CATTGCAATGAC^CAGAATTCACAGACCTGAACTTCATTAAGGTOT 
AACCTCTCCCAGCTTCCCCTGGCCTTGACAAAATX3CTGCT
GTTACTTTCTTCTCAATTCTAAAGCCTCTAGAAATTCCAT 
CCACAGCACATTTGTGGCTGTGCATATGGTCCCTCC 

GCATCATCACAAGTGTTTAATTCTGGAAGACAACAGTTAGACCAGACTTTAATAAA^ 
5 CATTCATT C CAAAGGAG CC CAAGCCTTAAG CAGCAG ATAGG AGATGGGGG TGAGGAG AGGGGAAAGAGG AG AGAAG AAAG AAG AG A 

GGGGGGGGAAAAGATTTTTCTAGGCTTAAAGATATTTTGGAGGGATGGTC 

G AAGG CAG AGGCAGAGGCAGAGGCAG AGG CAG AGGCAG AGG CAG AGG C AG AGG CAG AGG CAGAGGCAGAG G CAGAGG CAGAGG CAG 
AGGCAGAGGCAGAGGCAGAGGCAGAGGCAGAGGCGCAGAGGCAGAGGCTCAG^ 
AGCCGGGTCCATAGACTAACTCAAAACAAATAAACAAACAAACGAATCTCCAAAA^ 
1 0 GAAGAGGAGGAGAAAAGAAGGAAAAAGAGAGACAGACAGAGAGAGAGA 

GTTTAAGCAGGAGCTGGAGAGCTAGCTCAGTAGTTTAGAGCATGGCTACTCTT 

TTTGTAATTCCTVGTTCCAGGGGATCTGGCACCCCTCACACAGATATGCAGGCAGGCAAAACACCAATGC^ 
ATTTTTAAAAAGTATTTTGGAGACACTGCAGACTTAAGCACCATACA^ 
AAAGCTCTGTITAACTTTAAATCATAATGATGTATGTATTTGTGTGTGGAGTGCrCC 
1 5 CCTTTT AC CAGTCATG CTCTGGAG ACAGG CATACAT CATCATCT AGTT ACTTTATAAAGG CTG AGGTG ATC CGTCAGTGAAT AGTC 

TTGAATTAAAACAGTATTTTCCATACCTGATATATTTGCTTTTGACCT^ 

TTGG ATT Trrri' AAAAAATAAAAGCAGTCTCATT ATGT AAC C CTGGTT AAC CTGAAACTCACTATGCT AGG CTGGCCT CAAACTCA 
CAAAGATCCCTTTGTTTCTGCCTGTGGAGTAGTGAGTTAATTAAAGGT 

TTAATCACCCAACATCTTAAAAAAAAACCCTTGAACTTAAACAGTTATTACTGTGGGTTTTATC 
2 0 ATACACATTGAAATAGATATAAAGCAGTAAAAAATGTAAAAGT 

TGTGTGGGATCATTGGTTGGAGAAGAAAGAGTGGCCATGTGGACATCACT^ 
GATGTIGGGTGGCCATGATTGGGCCTAGGATACCACCTTTGATrCATCATCCTGG 

GG TCTTCCTGGGAGGGAC CTGGAAAGCAAATGACGATGATCCCAAG TCAC CAGGGGCTGGAAG CT AAAGCAG AGG CACT ATTTTTC 
TTGAAGGCTGAATTATTTCAGCCTAACTGCAGTAAATGGGTGGGAGGCAGATGCTTT 

2 5 GGGAAAGCGG CTATTGAGGAGACACT AATGCAGG CAAAACTGGCTCTGG CAT ACAGCTC CTGGAATCAGG CAG 

ACCTAAACATCAAATGATACATATTATCTACAGGCCCTTAAACA 
TCGCTGTTGTCCCGTTGTCCAGATGAAGAAACTGG 
TCTGCTTAGGAAAGCAGAATTGAAAAATAGGCCAGTTGACTTC 
TTGATTGGACTCTTCTGTCAGAGGACCTGTTGATTCCrcG 

3 0 TCGAATTATATAAAGGAGACAGAAAAGAAGTCACTAGCTCCCATATGGGT^ 

GGCACATGGGTAOVGCATGCGGAAAGGGGCGTGCAGCACTTCAAGCTTG^ 
CCTGGAACTCACTCTGTAGATCAGGCTGGCCTCGAACTCAGA 
TGCCACCACTGCCCAGCTGGCTTGAGTTTTGAAATATTCACAAGA^ 
TCCGAATTTATTrATATGGGGGATTGAGCCAGACTGGAGATGAATTCAGCCTGGGATGT 

3 5 TAATCCCTCAAGACAGTTTGATAGTGGTOSATGGTCC^ 

CTAAATAANNNNNTOWNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNin^ 
NNNNNNMINNNNNNNNNNNNNN^ 
NNNNNNNNNNNlWNNNimNNNNNrWNNNN^ 

4 0 TGTCTCACAGTAGGCGCTTAATAAATGTTTATAAAAAGAACAAATGCAGTAGGGTACr^ 

TGGAGCTAAACCTGACTGAAAAATTAAAATAATTTTTAAAAATGGGTGTG 
AGCACGTGGGAGGCAGAGGCAGGAGGATTTCTGAGTTCGAGGCCAGCCTGG 

CAGAGAAACCCTGTCTCGAACACCCCCCCCCCTACAAAAAAAAAGAATAAATAATAAATAAATGGATGTGAAGT^ 
TGGACAAAGGCTGTGAGGAGACTCCAGCTAAATGCCTTACGGCTGCTGGTGAA 

4 5 AAACTTGTCTGG CTCT AG AG ATGC AGAAGCTGGAGACACGGTCCCTGGCCAGCTT AGCTGGGTCCCGC TO 

ACCACGTGGCCGCCCGGGTGTCCAAGGTCCAGAGGCXKJTGTCCACGCTGCGCCTGCGTGCT 
GAACGCCAGTCCCGTAGCAAGGCAGAAAGCAGAGAGTGGGCGGAGTTAAGTCCGTATC 
AAACCGGCTCTGTCTCCCGGACCTCTAAGAGACCCGCCGCAACCCTCAGC 
CATTTAAAAGCGGCCGCGCAGGCGCMTGAGGGGAAATGCGAACTTAGGCTGTTACA 

5 0 CCCGCCCGGCAGTCTGTCAGTCGCGTCGCrACCGCOSCTGCCTCCTCCACrc 

GTCTGTTCGTGCTTCrCAGGAGCTTCGTCCXSCCCGTTCAGGCCGT 

CGCAGCGTTCGCACGCCGGCCGACATGAGCGGGG ACCATCTC CACAACG ATTCCCAGGTG CGACCAGC CGGCCCCCGACTCGTCCC 
CAGCTGCTCGGGCCCGGGCCXX3CTCCGGCCCCGGCCAGGGCCCCGGTCCCAGCCCCAGCCGGCCGCG 
TCTGCX5ATGAGGX^CGGCCGGTTGGAGTCGCTTGGATGGCCGGGTCCCX3GGCGGA 
55 CAGCCCCGGGTAGGCCATTCGCGGGCCTCTGGGTTCCGGGGCGCCGGGTGGGCCCGAGCCCCGGCCCCCGGCCGCGra 
CGCCGCCCCGGGGCTTCATGTCCCGGTTGTTTCATCTTCCCTCTTTTCTTTCCTC^ 

GTGTGCCCCCCGCCCCGCCCCAAGTGTCCCCACACGCCGCTGCCTGCCTCCCCCGCGCCGGGCCGGAGCCGGGCAGAGGACAGACA 
TCGCGTCCCAGAGACTAAGTCCCCKKTCCTCGCTCACCGGCCCCATTGTTCCCATCGGGCCGC 

CTCAGCACCCCCAGATCACGGCCITCTCCAGAGGGGGC ACCGGAG ACCCCGCGTCGACCGTCGCTGGGTCT CGGG CGG CCTTTCTC 
60 CGCCGGGATTCTTCCCGGGAAAGTCGCCTTGTCGACGGTCGG^ 
CTGTGCCTGTGTCTTTCTCTCCCTCCCTCAGCTCCCT 

ACAGACCTCACCAAAGAAGGAAGGCTTAGGGACGGAATTATCTTTTATTTTTTCTTAATA 

TTCTGCTTCATGTTTGCTTGCrrTTCTTCTT 
TATGGGTrrrrrrTC r i » rc ^^ 

6 5 CTTG AAACT AAAG CCATC"I"ri"lGTGGAG'i M rCl"l"lGTAl"U"n'GGGATTGTTTTCCCAAGTTGTG CTG TTTG C AAAGAT AACTCC CTT 

TAAAAAAGTTTTCAGGTTGTCTCTGTGTCATCTAGTGTGAGTGT^ 
GTGACCAGAACTTCCTATTGTGGCAAGGTAGAGTTITTTG 
AGGTTGCTGAATTGCTCATTTAGACATGGTGTGCTGGCC^ 
AGTAGAGTGGCCAGCCCTGACTCCTGTCTTCTrCCTCATTCTTCTCAGGG 

7 0 TTTCCTCAC^TCCTTAGGCTCTTGAGCCCTTTCCAG^ 

GGGTGTCTCCCACCACCCCAGTAAGAAGCTCACATTTAATGGATTTTC 

GTCCATGGTACTGGCCTGCCCAACTGAAAGCCTTATGTTTATATT^ 
ACTCTGTCTAGGTAAAGCTTTTCTTC^JG^^ 
7 5 TTTGTTCAACTCA.GCCTATTGGGGGAGTCACATC^GTTAATTA 
AGCATAAAGGGCTTAATTTTATCTGGGGGTGGGGAAATTTCTTGGAGGTGACTGAGCAG

ATTG TCAGGGGTTGGATT ATTTCITGGG AGT C CATC CT ATCAGG CTT^ CAT AT AG AT AAG 
CTTCTTTGGAAAGGGGGTAAAGAAAGGATGTGGTAGTTCTGG^ 

T AT AGAAAATTT ATTTG CrTTTTAGTTGGG CCT AGT ATGT AAT AT ATTT AG CAATG AAAC AAAAT^ T CCTTTTT CCTCTTAAG 
5 TTT AACTTCCATTGTTCCCAT CT AAAT ACAAAAGAG CT ATTGAAT CT ATG CATG ATT ATGGG CTGGG T ATGTT ATCTTTG CATTTT 

TATAGAGTGAATTGAACTCCCTATGTTAACTTTTAGGATGT 

ACTGTTCAGGCTTGAATCTCATCTTACrCTTT^ 

TGTACTC^GTGAAGCCATGAACCTAGCCTCTCTGCAAACCGGATGATC 

C)AATTAAGAGGGAAAGGGAAAAATTGATAGCATGCCGGTTAC^ 
1 0 CTACCTTAACCTTCTAATGTTCTAGTGATTGTCTTTGTGGTTATTC 

TACACTGTTAGCTAGTGATTGATAAATATAAAAGACAATT^ 

TTTATAGTGAGAGGATATTTACTTAAACATGGTTTTAGGTGAG 

TTlX^TrCCTCAAATCTTCTAAGATAAAAAAGATTTC 

CATTTACAGCCTTTTGTGGGAAATTTAGAACATTCTGGGTCATACT 
1 5 TAAGATATAAGCAATGAGCTAGTCACnTCAGGAGCACAGCCCATACITTGTATT^ 

CAGT AG CACT AAT CAT AGTGAG CTTGAATT ATT ATT AT AGT AGTG CACAGGATG CCT AGCTGC A C AGCTGTT AACATT AGTTGAAG 

GTGCTGGTAATCCrrilCTGTTTCAAGAACn'CTAAGTTCCrc 

TAATTTGACAATCTGTAGATTATGAGTAGGTCTAATG 

TGTGCTTGCTGTGCATGAGTTATGATAGGCATTGAGGAGGCTCGTC 
2 0 .TCTGGAAATCTTTAGATTTAGTCTGGACGACTAATAGAAGTATTAGATTCC^^ 

TGAGGGACATTTTTGAAATTAGTAAACTGCTGATATCTTAGAGC^ 

AC71AGAAAATCTAGGGTTTCCTAAAGAAATCAGAAAAAAAAAAAAAA 

TAAGCCTTGAATCTATTTTTCrrTTGGTAAGCAGATAGTGTGGGACCT 

GGGCAGGGAAGTAGATTCTTTTTCCTGTCAGCATAAAAATGATGGATTT^ 

2 5 GAAGTGTGATAGCTGTTTGTATGAGTGACAGGAATCTTTATAGT 

TTTTTTTTTTGTTTTCAG TGTG CGTGGCATGGG AAAG TG AAAAAAG TCTGCTTTTTAAACTTGTAAATATGTTAGAAAAT^ 
TTATAGCAGCAGTTGTCTTATGTTGATTGTGAC^TGGAAAATAATTrGTGTC^ 
ACATATACTATAGATCTTCAATAACTTTTATCTAATGATGCTGAGTAAA^ 
AGAAGTGTGTTGTGATTTTTCTCCTATGAAACTATGATGGACTCTAATT^ 

3 0 CTTTGAATATAGGTATAGTTTACCITGAGACTA 

TTAAAAGAAAGAAAAGGAAGGAAGGAAGGAAGGAGGGAAGAAGGAGGGAAGAAAGAAAGAAGAAA 
ATTTAGAAGTGGGAGTCCTTGAATGAAATTCTTTCCTGTGGTTTTAAA 
TTGACAAATCAAAGTCATTTGGGCAAACAATTGAGAATT 
AATGGAGTACTGTGTTTAAGGGAAGATTTGAGATTCTATTTACCT 

3 5 TAAGTCTCAAGTATGAGAAGCACCATTCTATTCTGTGGCTTCAGCT 

GGAAGAATTAGCTTTriTT'CAAGCACCTTCTCTTCCTACCTCCT 
TGGGCCTGGGAACTTTAGAAGGGAATGAAGAAAGAAGGGTGGAGGG AAG CACT 
AAAGCAGAACCCTATTTAGATATTATAGTATTTTGTTACTGT^ 

ACATGGCATCTGCTCAGTATGTTCTCAGACAGTAGTGCTACTGTTATCTGGGGTAGTAATTCTA 

4 0 ATTGCCCACATCAATGTGAGTTAACTGTTACATATGTCTCAATTTTGGTAAGGAG^ 

TACTCAri'TTlAATTTTTTGCCCTGTTTTTAGATACATCCACTTT 

ATTCTTATATGCCTTTCTACCAGAATArrriACTTTATTTTATTTTATTTTTTTG 

CTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCTGGAACTCAGAAATCCGCCTGCCT 

AAGG CGTG CG CCAC CATG C CTGG C CCIT CT ACCAGAAT ATT AATC CAT ATTTT ATTT C 
45 CACITTTAATTAANNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

5 0 TGGATTTCATTATGT AGAATAAAGGTTTGG CTGATCTCAAACCTTAAACTCACAG AGAACTACTTGCCTCAACCT CCTGAGTGCTG 

AGACTAAAAGCATGGACCACCACTTTCTTGAGGCTTCCTATT'riTTCTAAGAGCCAAGAACT 
TTCX3AGGCCAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCA 

AAAAC CG CATGTTTITTCAAAG ACC CCAAAACATTTTT AAGTTGC A CTG AG AATT ATTT AATCCTT AT TAATT 
CTGTATCTTTCTTAAAGTGGCCAACTCAGCCGTCCTGAAATAGGGAAATGAGTATCCACGTAA^ 

5 5 CTTAGAAAGCCGAGGCAAGAGGATGATCAAATGAGTTCCATGTCAACCT 

AGGGAGGCAGTGGTGCTTTCAGACAAGACTGCAGCTACCAAAGCCCAA^ 
AAACCATCAGCTTTGCTGTCCTGCTGGCAGTTTTGGTGGTGTC 
CTACTATGTGCTCAGTCTGACACTGCACTGTGGGGTTGCCCAAGCTCATATAACIACT^ 
GTCTTCTCAGCCAGTTTTACCAACAAGGATAGGGCACCTCTACATGCAAAACT 

6 0 GAATATCTCTCTGCTCAGAGACAGCTACATTGCTCAGATACAAGGATTTAAGCCTTCTGG 

CTGTTC(CTTCTTTCATTCTCCCAGATCTCATTTCTCTC 
TCCTCATAAGGTACTACTGTACTACGACATGCCATAAACATTCCACT 
TTGTAATCTTCCTTCCAGAGTTGCTTCTATTAAAAATGGCAGTCAGTAC^ 
CTATAGGTTTGATCTCTAGTACTACAAAAACAACCCCTAAACTAAAGCn^ 

6 5 GTTGTGAAAAAGATCATGCAGTTAGAAACTTCCACAGTGGCTCATAAGTCTTG 

TATTTTTAAACTACCACTTAAAGGAAATAAAAGTCACTTAGGAA 

AATAAATCTTATACCATTGATCAAGAAGATCACTATAATAGGCAAAGACTATATAGAAT^ 

TCCATATGAAATTGTTGAATAAGATGAATTATCTGGTCATAAT 

CCTAGTTTTATTCTGTATAATTTGGCTGCTTAAGGATATATCCTCCCCCAACCT 

7 0 C CTTAAGGTTGTG AT ATCAAAT AATT CCT AATG AGG AAGGGGCTCTCT AG AG CAT AGG CAATTTGGAAGTTACTTTTT CTGTT ACC 

CAGGCCTTAAGAGCTCAGTTGATGGAAGTGAAGGAGGAC^GCCTGCAG 

ACATGTTC AGGTT AGATTAGT CT AC CTCCTGGG AAAGG AG AAGATTG CAATG AAGGG AT AT C CTTGTGAAAATGAGTCAT ACT ACA 
GAGACTTATGTAGGCAAGAATAAAGCATTCTGGGCTGGAG 

GAGTTCAAATCACAGCAACCCCATGGTGGCTCACAACCATCCGTAACAAGATCTGATGCCCT 
7 5 ACAGTGT ACTT ACAT AT AAT AAAT AGAT CTTT AAAAAAAAAAAGAATAAAG CATTCT AACTTT AAG T ATTG TCCTT AAAAG AT AAG 






WO 03/008583 



PCT/US01/51291



TTAGTACAAATAGTATTTGTCTTGCTCCTGAAGTAGACTO 

GGGACATTGTTCAGAAGCAGTCT ACTT AGTGGT AG CCCAC C CCTG TAGATTATG CTATGTGGG AAAGATG CTTCAGCTTCATGGGT 
GGGTCAGACAGAATTCTAGGTATGGAGTTAGGTAAGGCAAAGGGAT^ 
GTTTTGGTCCCAAGTTTCTAGATTAGCAAAATCCAGTTCTGTATTGTCCTCTA 
5 TTTrAACAAGGTAGAAAATATACCCCTAATATAACTACCTTGATCAAGAGCC 
TTAGAACCATAGAGTAGGTTTTCTTGAGGTATCTTTAGATGATCAGCT 

AG TG ATTTGGGGGGGGGGGGGTTTC CTAAGGGAATTTTTGG CTTCTT ACCAAATTTTTGG CATCAGTTTGTCTACAACCACAGGT C 

TCCTTAATTTCTTCTCnTGTCATTACTCATATTCTCTATTTTCCCAGT^ 

TGGTATCTAATCCCAACTGTGACATCCCATTGTGCCAC 
1 0 GGCTCCTTGGACCATCAAGGCCATCCAGTATAATGGGAGTCCCCAGGGC 

TACACAAAGGCATCTTTGGTGGTAGCAGAGCTCAAGTGCAG 

GAATATTGTGTCATGTCGGAGCATGAGAGACTGGCTGTAAGAACTCCATCTGTC 

GCTCATAGTTCATGCTCAGAATTTGGCTAGTGTAGCCTAC 

AACAGCTA(^TGTTGCAAGACACCAACATGAAAGGTAGTGCTGCCTCCCT 
1 5 GCTTTTCTTTGCCCTTTTTG CTCCCAATTAACAG CAGGGCATTTGTATGCCCACCATTGT AGTCCGTTTTAGCTTCTGAGTATAAC 

ACAGAGAACTAATAAAGTTATTAAACTGTAGGCACTATGCT 

CAGCCACGTGGTTCITTACCTGCAATCTGAGTCTTCTTACCCTGGCT 

GATCTGCTTGCCACCAGCCAGTCGTCCTGTCCTTCTCCAGGTCTCr 

AAC CCCTGACTCTG ATCAGAAG C CCCACCIT ATTCACTTCTG CTCAG CT AT AGGCTGAATT AG ATTT^ CAATCTGAGG TG 

2 0 ATAGAAAAGAGTTTTTACACATCTTTGAGACAGGA 

TCAGCATCTGAATACACTGTACAAAACTGTACAAAACCCCCTCCCCCAACAACCTTCGGTTTTGTTGA 
tGACTGATTGCTTTCTITAGACAGTACAACTTTGAATGTAGA 
TGGGGCTCTTGTAACTTCTCCAGAAATCTGAAGACCAGTGTGTATAGTGTGT^ 
ATTACTTTATTTTAGATAATCTCCTAATAGAAC 

2 5 CCAGTGCCAAACAACAGAAAATTTAAAGTCCTTTAGA 

CTATATTTGGCAAAGTAGXnTGGCTTAAATTTACACAATTTC^ 

TCATTTGAAAG TCT AAATTTCG CTTGGGCGTGGTGGCGCACG CCTTTAAT CCC AG CACT CAGGAGG CAGAGG CAGG CGGATTTCTG 

AGTTCGAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTGCACAGAGAAACCCTGTCT 

AAGAAAAGAAAAAAAGGAAGTCTTAAGTTTCTAGCATGT^ACCTATATCAAGTTTG 

3 0 AACACTACATATACATAAATTATAGAGAATCTGGTACATTTACTITTACGTAAGAGC^ 

GTCAGGGTTGTTTTCTCACGTGCATGCCTATGTTGAGTATGTGCTACAG 

TTAAT CT ACTG AG ACAAAATG AT AT AG CCTTGGCTG CTGAG ACCACTC ACAATAG ATATTTAAG ATCTTAACAACTTGAGTAAATT 

ATGTAGAGACCAGAAATAGAACCAGTTTCCTCTCAGTTTAGCTTTGAATATTATCAGGTTT^ 

AGAGGTGAAATGAATAGCAGAAATGTAATCAATTACTATTACT 

3 5 TTCCACTTTAGAAATTTAGGCTCAGATGTTAATTAGCTCACAGTTC 

ACTGTTTGATTGATCCCACAGATTGAGTTAAATCTGAAACAGAATG 

GGAGTAGTTAAATATCCATCAGACTGTGGTTCTGTT^TCATGGATCTTAAGTCGATTGATAAT 
ATTTATGTCTGTGTGTCCCTGAACTGGTGTTACAGACAGGTGTGAGCTC 
GGTTGGGGAGTt3GTGGTGGCTGTGGTTCCCTGAAAGAGCAGCCAGCACTCT 
40 CTTTTTTTTTTTTTTTTTTTT G<3TT TTTTTTTTTT^ 

TGTAGACCAGGCTGGCCTCGAACTCAGAAATCCGACTC 

CTTTGCGGTTrTTAAAGGCTTGTACCACCACTCCrGGA 

CCACATTTGCTATACAAGTACTCGGCTCCTGACCCACATCCCCAGCCCAGGATT^GCTTCAGAGTGAATT 

4 5 GCTTTTCGAGACAGGGTTT*ITCTTTATAGCCCTGGTTGTCCTGG 

TCCTGACTCTGCCTCCCX3 AGTGCTGGGATTAAAGGTGTG CG CCACCACACCCGGCT CAGAGTG AATTCTTTTTTCCTCTCCTTTGC 
AATTTCAGAATTCTTAAGAAGAGATTTTAGTCAGGAACTCCTTGTGTCTTCT 
TTTGAATTCTCAATGAGAACTATTGTAAAATAACTGCTTTTGTAT 
TGAGAGTAAACTAACACTTTTATT'AAAGTATGAATTGAGACAGT^GGAAAGAGGGA 

5 0 ATGACAATAAGCTTTCAGGCTTGTTAGTTCCrGGGGGGGAGATTC^ 

TTATCACCAGAGGCCAGTTTGTTATCTGTCCTAAACAAAAACAGAATTTTAAAGC^ 

TG TTG ATTTTG T AT AAACTCCAG CTTTGGGGAT GTGGTT CT AAG AC GAAGATGCAGGGTTT CAGT AAG CTTTAT AATAATGT CTTC 
TTTTTTCTTTTGTCTTTCITTCTTCCrTTTTTT^ 

GAACTCACTCTGTAGACCAGGCTGGCCTCGAACTCAGAAATCCACCTGC 

5 5 ACCACACTCGGCTATGTTTTCTTTTAAAAAATTAAGTC 

CTGGAAAGCIAAACCTAAGTCCTTGTACATGGGAGGCAAATCCTGTAGCACTGAG 

T AATG C C CTGATCCTTT ACTC AGGATGTGGTGACCCCGATCAAAACATT ATTTTCATTC CT ACTTT AT AATTCT AATTTTC CTTTT 

TCAAATTTATAATTTATTGTAAAATATCTGATATGTAACCCTTATAAAAGGATTGTT 

TTGAGAACCACTXSCTAAAGCTCTTGTTTTTATGATTTTATGATGAATTTTTT^ 

6 0 CCTGGAATACATTATGTAGATGAGTGAAGCTGGCCTCCAACTCACCGAGATCTC 

GCACAGTGTATTGTCTTTTTTTGAGACAAGGCATTGTTAAATATTATA 
CCTCCTTAGATTAGATTTAAGGGCTATCACAAGTAGTTCCAGTGTTC^ 
ATAGAGATTTTCTATGACAAAAGTTGTACTGTGTGTGTACCTTCTC 
CTAC^CTGTTTACACCTGTAAATACTATATTTGAAGGA 

6 5 TCAAGAAAAGTTGGTGTAAAGAGTCrGGCrrCAAACATGGCT CCTTTGTTTTT 

ATGGAAGGAAGGAAAGGACTCTTTCTTATAGCITGATTTGGTTACTTTA 
ATGTTTGTTAATTCTTTAGTGTATCTTCrCC^ 
CCCAGCCCGTTTCCTCTTAGAAAGTGTAAGTGTGTTrGGATTCT 
TAGGTGGCCACCTACTAAATAATGCAAAC^TGAATAATGCAAACAGCAGAW 

7 0 CAACTCCTGAAAGATATTCTCTCATTTCTAAAGACTTTCA^ 

ATAGGTGCCTCCCACCrCATCTGGCATTTTATATGGGTTCTCTAGTTGTGGACT 
CCAGCCTGGAAATTCCTCAGACTTCTAAGTACAGGATTGTTAGAGAAG^ 

TGT ATTAG TTTGT CTTTG T C AG AATCTCT AAGT AT AGGGGAAT AAAACC C CIAAAC ATCTTTT ATCCAATGTTG C CTTATCAGG CAA 
75 GTATATATATATATATAtaTA^ 
AAATT AATAATG AAG AT AATT AAAT AATG ATTGCATTT C CT AGG CAG TTGTCAGT ATGTTTG TAATTTGTT AGAAATT AG CCAGGA
ATTCIXjTTGAAACTATTCTGTTGATTAGCTTACCAAA^ 

tgtttcctctaacaaagaggaacatttttactaaggacacagatatcaagt^ 
ttcccatgctgccttttgagtggtgctgctgagacagggtt^^ 
5 gtttaattatgattgttcatattctgtctcattcttt^ 

ttctctgtctcattcatl 1 itt ittl'l 1 ttttrtggctttt cc ag acaggg ttt ctctg tag ccctggccc ctggctgt cctggaa 
ctcttttgtagaccaggctgggccttgaactcagaaatctgcc 

ACGCCTGGTTTAAACTTTTTTTAAAAAATATGTTCC 

AAAGTACATTTT ACTT AATTTTG AT CAGT CATG AGTGGATG AAATTTTCAGCAATGAT AATAGTGGCT AG CTGCHG TTG CT CATTG 

ACAAGAACAGGGTTTCTCTGTAGCTCTGGCAACACTGGAGCT 
CTCTGATTCCTGAG CTCTG AGATTAAAGGTATGCTTCACCACTGCAC^ 
CAGAGTGCTTGATTCTTAATTTCCTCAGCrrCCT 
CTTCATAAAAGAATAGAAGTAATTTACAAGCACAAGTAAATGTAGG 
1 5 ATTATTCTTTGAAGGATGGTAATATAGAAATT'CGGTTATAATTCT^ 

TTTTG TGT ACTTAG TGTT AGT CTG C C^TT 1 !^ ATTGGTTCCC^ri'I ATCTGTCACTTGTT^rri CCl C1T AGT AAACTTG CTGTC CATT 
CTTCCTTTTTCTAACTCTAGAGTCTAGAGGTAGATACCTTGAAGG 

CAGAGGAGGGATTGAGCTAGTTATTTCATTAGACATTGTGGAATTCT 
ATTTAAAAAAATATAAAGCAAAATGCATATTTAGTAAGGAATAGCTATTACCTTTGGTC 
2 0 ATGGCCACATGTTAAGAGTCTGC^TGTTGCTGGCrCTTGTTTTAATGTCCATTre 

TCCATATATTCATTTACATGATTGGATTATAAAGGACAGAGCCCTTGTCTGATCAGTCT i i i GAACTTGAGAAA 

T ATAATGTTTGTAAATTAAATAGC CCTT AT AGT AT AAT CCTG CAG CTG CTAAT ACAAGGATGACTTTTG TTG TGTGCATT AACCAT 
TGAATCCTGTCTAATGCTAACTAGAAATTGCTCCTACAGTGAGAAAAAGA 
TTTATT T TTAAATGTGTGTATATCTGTGCCCTTGGAATCCAGAGATGAAATTA 

2 5 TAGGAACTCJUXCTCAGGACCTGGAAATTTATTTACTTAA^GCCCTTTT 

AAGATGTTGATTTTTCTAGTGTTTTCTATTTATTCATTTACT^ 

TTTGTTG CACATTT CTGTCTT CAAAG A CTTT AG CT AGCATTCTT CC TG TGGTTTCAT CT AG AGTTTT CI TITTTCTT TTTT ATCTT 

TCATATTTTATTTGCATTGATGTTTTGCCTGCATGTATGTATGTCTATGTGAGA 

TATGTGGGTGGTGGGATTTGAAACTGGGTCCTTCGGGAAGAGTCAGTATATTTAACT 

3 0 CAGAGTTTTCAAATGTTAAGAATTGAAACAATTATTGTTTTGTATGTGAGATT 

ATAGGATATTTAGTGAATAAGATTTTATGGTTAGCCAGGTGTGGTGGCCCA CTTGGGAGG CAGAG 

GCAGGTGGATTTCTGAGTTCCGAGGCCAGCCTGGTCTACATAGTGAGTTCCAGGACAGCCA^ 
AGTGAGTTTCTTTAAAAACCCAGTTGTGTTATGTGGATTTTTATGGTGCTATAGTACACAGTGGTTC 
TATCAAGTTTTTTACAGTTCITTTTGCATTTGATAGATACTTCGAGTGTCACAAATC 

3 5 TTTTGACAAATTCCAGGAGGTATACTTAATCTTTAAAGCCACTAATGTAT 

GTTCTGTGTGCTTTTACTGGGTTTTACCTTTTACTTACrc 
TATACTTTAGACAAATAAAGTCTTTGCTTTCTGTTTTACTAGGGCCTCT 
ACCTTCCCTCCTCCTCTATACTCTTTTTCRCAATAAGTTCTTAGAACT 
TCrrAATCTTAATCTTTTGCTATTACrGTCrTCTGGTTTCT 

4 0 TTTTTC CTTT AAG T ATG ATTTTTGGT CTTTT ACAAATTGAAG T AGCTG TCAT C CT AGAGT AGAACATTCTGACC CAGGG TT AACT A 

TTAAAATTTATCTTTTGGCACCTACTGTGGTGGCAAAGGTCTGTTTTCCCAGT 

TTCTATAGACAGCTTAGCAAGCCCTT'GTTTCAAAAACCCAAGGGCrGGGAATATT 

ACTTTGGATTTGACCCCCTAGCCCCAATTCCAAACAAAATT^ 

TTCTTCTTTTACATAATAAGGAAAAATAAAGAATGGAAAGGAAAATGCTTAGAGCTCATC 

4 5 AAGGATACACATGTGTAACTGCAGTTTGATTTCCTGTGTACCCATCATAACT 

TATCTTTATAATTCrrGATTCrAACrCrAATTrACTCAGA 

GCTGGGCCTAAAGGTGTGATCCACCACTGTTGGTAAAGGTTTTT^^ 
TTTCGAGATAGGGTTTCTCTKrTGTGGCCCTGGCTGTCCTGGAATTCACTTTGTAG 
50 CTGCCTCTGCCTCCCCAGTGCTGGGATTAAAGGCGTGTGCCACCACACCCAGCAGGGG 

CTAGTAGTATACAGGGCATCTATACAGTTAAGTTAGTAGATTATTAAGTTTTTTAAATGTTAAATGATTGTTTAACAACT 
GTGGC^VATGCTACrTGT ATAGTGATTTAAATCTATTGTATCATTTTTCCT 
TTGATTTATAGAGCTCAACAGACATTCATATCTCTTATTACA 

TGG AACACATGTT CTGGT CAGAAG ACAAT CCATGG AAGTCATTTTT CTG CTGTG T AGATTC CTGGGGATTGCACAT AATTGT CATG 

5 5 TTTGGCATCAGGGACTTTTCCAGCCTGAGCTGGAAAAGCCATAGAGATTTCAATTAAG 

TTTTGTTTTTAATTTAAAATCTCTCACAGCATCTGTAGCACAGTTTGA 

TTGG CCT AGATGTTTTGT CCATG G AATT ATTTAC CAGCAACCCTAATTTTCTCTGAGATAGTTAATTTTTACATTTTTAGATAGCA 
ITTGTTTTCAAAGGCTTGGGACCCGGTTTTAAATTCCATTTTGA 

GG AACT ACTCTG AATTT G T ATTTT AAAG TTTAGTG CTTTTTAGTGAC"l'l"l''rGC^TGTTTGTATTTGACAAAAATAAATTTG TTTG T 

6 0 TTATTT AG ACAGGGTCTCTCTGTAACCCTGGTTGTCCTGG AACT CT CTGTG TAG AC CAG^ 

TTIGTCTCCTGAGTGCTGGGATTAAAGGCGTGTACTACCAGGTCTGGC 
AAAAGTCGTTCATTATGGAAAATAAGATGAAGCACACTTAAGAACTCAAGTTCACAAATT^ 
TTTTAAAATGTATGTCTTCCAGTCTTTTTTGTATGTGGTCATG^ 
TCAGATAATTTCCTTGTCATGCCTTTGAACCAAAGCAGTAGATCT^ 

6 5 AT AAACAAAACAACACATG CTTT ATTT C AGGTTGT AG TC CTGGTGCATG CATG C ATTGTCATTTTT ATGC CATG AGGT AAAAGT CA 

CCAGTTTTATCCTGATTGACTCAGCATGTTAAGACTTTAGACTGGAGACTTAG 

TTG AGTTCAGT CTT CT AACACT AT ATG CTTTACTT ACTATTCTCATCT CTG TGGG ATGG AAAGTT CTTAG AACCAGGGCCTTG ATC 

CAG CAATACTTAATTTAT AT ATAATCTTATTG CTGTG AAACATGCTACT^ 

7 0 ATAC ATT ACATT CG TT AGT CTGGTTGAATGT AAG AAAGAATTCATTT AA 

GTTAAGAGCGCCGATTGCTCTTCCGAAGGTCCCGAGTTGAAATCCCAGCAA 
TGATGCCCTCTTCTGGAGTATCTGAGGACAGCTACAGTGTACTTAC^ 
AAAAAAAAAAGTTTTTCATAAAAATCGCATGTCACCTAGAGTAACAGTCATCTGAAGGCT 
Caj^ATAJVTTTCTTCACGTGGCT 
7 5 TTCTCACTAGAACAAGGAGAGCAAATGGGAAGTTGCAGTGCCTTTGTGAGCT 
AGTGCAAGAGACGGACGGTGCATGTGCATTTGCAGGTCAACATCAGATGTCTA

ATCTTATTACTTGTCCATGTGTATGTATGAGTATAGATGGCCCTGGAGGCCAGGAGAGGGTGTTAGATCT 
CAGGAGGTTGTGAGTTGCCCCTAACATGGATGCrrATGAACrGACCCCTCTC 
CTTTTCTCCAGCCTCTTTTCACCTTTr TlT rr r ' ri ' rr 
5 CTGTCTCTGTTTCCCCAGAACTGTGATATAGATAGGTACACAGCAATAAGCCAAAATCA 
TGTATTCTTAAAAAGGCACCACAGTAAATGAGAATAGAAAGGGA^ 

T ATCAAAGT ACTTT AT ATG TTTAAATATTTT CATAAGGCTT AGGATGTAGCITTGTGGAACTGGGTGGGGGTGAGAATGGGGG CGA 
GCAACTGATGTTTTGGTAAGCAAAACTCAAATCAA 
AAAACACCTTTAGGATTTATTTATTTATGAATGAATGACTGG^ 
1 0 GTTCTTCCAGAGAATGGGGTGTTGATTTCCAACACCCACATGGAGGCTCACGACCATATAC^ 
TCCTCTTCTGGCrrcCCAjGGCATTGCATACACATGGTACACAGA 
GTTTTTTrAAAAGCCrcTTCACTTTTATACTTTCT 

ACT ATCAT AAT ATATGTGG CCAAAAAT AG CAAT CCAG AAGT ATAT AAAGAA.CTT ACAGAGG CAGGTGGATCT CTGTG AGTCCAAGG 
ATAGCCTGGTCTAGACAGAGAAACCCTGTCCTGAAAAACCAGGGGGTAGGGGGAAGAACA 
1 5 GTCAAACAAGCAAATACTGTAGTGAACTTGCAATAGTTATTTAAAGGATGTGAATAGTC 
ATTTAATCACTAGAAGAAAATTAGGAATTAAATTCATAATGAGGGTGCTGGCT 
GTTTGGCTCTAACATTTTCAGTTCCAGAGAATCCAATACTCTTATGGC 

GCAGGCAAAACATTCATX5GGT AAAATTTTT AAAAAT AATTAAAAA^ CAGG CT AATT AT AT ATAT ATATAT ATT ATGTAT A 

TATAAATAAATAAAACATCCCCAACCAAAGCAAATTTTGCAAAAGAAAG 
2 0 TTTAGTCCGTTATCATGTTTAGGAGCATGGCAGCAGGAGGTAGGCATGGAGCTAGAGAGTAGCTGA 
CCTACTGTCCAAGAGGCTCCTGGGCCTGCTTTTGGCTTTTGAAAC 
TACAAGTTTCAAACAGTGCCACTACGATACTATGGGGGTGGGGGTGGGGTCATTC^ 
AGAGCAAAAATTAAGACAGTAAAGGCTGTTAGCCAGGCATTGTAATGT 

G CCAGC CTG ATGTACAT AGAGAATT ACATG CTCACCAGAG CTG CATAG CAAGAGTTTGC CTT AAAG AAAAACAAACAAACAAAGCC 

2 5 AACACTGGTAATTCTAGATGTTGCTAATGATTTGTAGTAGCTTGGAACTCTT 

CTTAGGAATATATAGTTGAGCAGTTTCCAATAAAGTTAAAAGTGAATTTTT^ 
AAGGTTCATCAACATGTGAGTTGTAAATTACAGTACTTCCATGTAATGGTTTTCACT 
TAGTGGGACAGTGCTTGCTTATTATTTATAAGTTACTGGATTTGAGGCAAG^ 
AGGATGTGCTTTTATGGTCACTTTGCCCTTTGACTGACTTCTGTTTCTAAAG 

3 0, ATGATTCCTTGTGATTGACTTGTCTCTTATGATTGCAGATTCTCACAA^ 

AGGAGCACAAGAAGGATAAGGATAAGGACCGGGAAAAGTCTAAGCATAGCAACAGGTAAGAAT^ 

GGG AACTTT CGGGATAG CATTTG AAATGT AAAT AAAG AAAAT AT CTAAT AAAGAT AAAT AAATAATT AAAAAAGAATTG AT CAAT A 

GGGAGTTAGCATCCTTGGTGTCCATGGATAGGTATCAGCTTGAGGTGAGTAGCAAAGTAAGAGACTATATCTGTTGTAGTTA 

AGAGACTCCTTTAAGAATTAATGCCTCACTTGGTAAGT 

3 5 ATTTCTTTCTTTTTTTTTTTTTT^ 

AGGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCCCGAGTGCTAGGATTAAAGGCGTGTGCCACCA 

ATTGTTTCTTAAAGCTATATTTATATTTAAAATTTATATTTTAAAAC^ 

AGAGAAAATTCCTATCCTTTTTTAATAATTGTGTTTTGrrrTACC^ 

TGGAGCTAGCTCTGTAGATTTGAGTGCTAGGATTAAAGGTGTGTGCCACCACCACCCAGCCAGT 

4 0 AAAACTAGTTTAATTTTCTCTrAGGATATTTCTTCTCAAA CATTTCTTACA 

ACTTAGTTT ATG TTTTATTCTTAATATTGTTTTTTCC CATCT CGT AG TCATTT CTTACAACTTAGTTTATGTTTTATTCTTAATAT 
TAATCTCAGCACTTGAGTCAAGTTTTGCTTTTTATGCACCCTTCTTCT 
TTGAGAGCAAAGAACACTGCCAACAATAAAGAAACCCATTTCATGATAATACAAAG 
GAGCCTCAAAATATGCACACACAGAAACTATTAGAATTTTGAAGAGAATTCATGATTGTAGTTAA 

4 5 AATAATTGTTATTCGTGCATGTAAGAAATTTCCAAGGCTAAGGGAGGGTTAAATAACAGA 

AC ACCTGTATGT AACCTTAGCATTCTGAGCCTGAGCACGGTGGATAACC AGAAG ATTGAGG CCAGATCTAGGCTACATGGTG CAAG 
ACACTAACTTACAGAAACAAACAAACAAACAAACAAACAAAAAAGTGTAG 

AGTGACACCAAAATCTTAGGACTGTGTTCAAGTAGGTGCTGAGATAACTCGTGCITTTAGCTAGTCTCT 
TACTTCTCCATGCTITTGTGTGTGTGTGTGTGGTGTGTGTGTGTGTGTGTGCGCG 

5 0 TTCCTTCTAATGTGCATAGTGCTTATCTAGGA 

ACTGTTAATTGAGCATTGTAAACTTCTTTTAAATTTCTTAGATA 

GGCTATTGTAG AGTACTCTTACTCTTCACTC CTTATTCCCCTTCTCCTATTAGAG ATTG AACGTAGGG CCTTAAATGCTAGTCAAC 

AATACCACAATATCCCTGACCCTTTTTTAGATTGATAGAGATTGGGTCC 

GGCCCAGGCAGGCCTGGAAACTACTTCATCITCCCAAGCACTACTGTA 

5 5 AAT AACTTT CT AACT ATGT CAT ATT ACT AATTT CTG TGGTCTC CTG AAT ATATT ATTTCCATAG T AATT ACCAGGTTTGG CTGTT A 

GGAAT AATACTG CT ATGAAC ATTGAT ATTTGAACACACATTTG AAGTT CTGTTCT C CTT CAGGTG TGTACATCT AAGAG TG 

GAGGTATTGGACCATGTCAGTCGTTGCTTTCTTTTTCCn^ 

TCTACAGCTGAGCTACATCTGT AACCCTT AACCCGTTTT ACCCTG CTTGTTATG ATGGCCTAGCTT ATGTATAATG CAAT AGAAAA 
ACAAAGAGGTTTCGATTTCTTGGGTTTTTTTTAACTTATTTT^ 

6 0 GACATGTTTTTACAGGTCTGATCAAGACATGCTGATTTTAATGT^ 

ATTTAAAACAAGCTTTTCCAAAATGATTTCATGTTCrCTTACATATTTCACCAACCACA^ 
C!AG CAT AAAG CAAAAGTGTATCTAAGGACCTAGGGACTAGTTTCTTOT ACT 
ATTGTCAGAACTATTACAATAATTCAATAGCCTCTGTCTTGGGCGTCCT^ 

TCAC CCAAAACC C1TTTTCT CCAGTGCTTCACAGGAGGG CC CAGG AGT CAAT AAAT GT AAAGAT CTTATGAAGTG CT CTACAAATG 

6 5 TGAAAATATATTTTGTCCATTTTTCATAATTACAGTTTTGAGT 

CTTTTTGACAGTGTCTAAAAATGTCTTAAATGCCAGCTTATGTTACTTCCTT^ 
GAAAAGCACTAATCACATTAAATTCTAACTTTAGGAAAACTTCAT^^ 
ACACTAATCAAATGAAATAATTACTTATATATTTTTAATTAGATTTATCTTA 
GTATGTGTACCACATGTATGCCTGGTACCCTCAAAGATCAGAAGAGGACA^ 

7 0 TGAACCACTATGTGGATGCTAGGAACTGAACCCAGGCCCTCTGCAAGAGCAG 

TTTTTTGATAATGTAGCTATAGTTATATGTTGGTGACTTACATGAATT^ 
ATTGGGAAACTGAAAAACTGCTTCCCrrAAGATCGGCCTATAGGCCAT^ 
CAGAGGCAGGTGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGT^ 
TCTCAACCCCCCCCCCCCCC^AAAaAAAGATCG^C^ 
7 5 AGAAGGCroVATCTACTGTGGCTGAGGTGGTAGTTGTATAAG 
TTCCTTTATGGCTT(^TGGCriTCTGCCTT

TTTTGAT CAG TGTTTTATCAGAACAA.TAGAAAG T AGTAAT ATTTT ACAG ATCAGAT ATTT AAAAAT ACACTTTG AAAT AAAAGGAC 
TGTTTTT CTAJGGTCTCAAGG<1ATTACTGGG CTIXST AGTGTGACCAGGT^ CTTT AATT 

TGTAATATTGAAAAG CACT AT AAAACAAAAGGTCAAGTT CTG C CACTCACAAACTGT AT C CT AATCT AAGCA.TCTT ACCTTTTTTG 
5 CCTCCTTTTCATTTTTGATTGATTTCCTGCTAAGTAATTTA 

ATCATGTAGATAAATTGCAACTGTCAGTTTITGTAGTG<JAATCT 
ACTTTTTAGTTATTTCTAGCTTTATAGTTGAGT^ 

GCAGATGTTGGTTATCAAACTTGTTGATTCTGGCTC'lT'I'l 1 1 1 i 1 AAAAAAAAAATCITTTTATTCATTTAATGTATATGAGTACA 
CTGTTG<^AT CTTCAG ACACACCAGAAGAG AGCAT CAG AT ACT ATT ACAGATGGTTGTG AGC CATCATGTGATTGCTGGG ATTTG A 
1 0 ACTCAGGACCTCTGAAAGAGGAGTCAGTGCTCTTAACTACTC 

GnTTCTCTGTGTAGCTCTGGCTGTTCTAAAACTAGTTCTATAGGCCAGG 

TCAAGTCOTCGGATTGAAGACATATATCACCACACCC^^ 

ACAGAAGGATGAAAG^^TAATGGCTAAAAAAAGAGGTATCTGGTG 

GACCAAGTTCCAAG C CAAACTG AG CT ATAACTGT CT CAAAT AT AATT AAAGG AG CCTTT CAG ATG AT AAG AAAAGCATG^^ 
1 5 TAAGGAATGTAGCTCCGTGTTAGTGTGTACCCCGTAGAATATACAAGCCCTATC 
ACACAAGACATGTTACTGTTGTAGCATGCATTTTGTTT 
GAAATGAGGTCTAAAATGTTGTCTTGAGATAAACTACTGCTGAA 

TTGGGCAGCTTTTTATAATAAGATAGGTAGCCACAAAAG^ 
2 0 GGAATAGAGGTTTAGAAAGTGCACTGGTTTGC^CATCTTTG 

TTCCAGTCCGCAAACGAAGGAATAAGTTATTCTATTTGACTTTTCTGTAATTGCA 
TCATTTTTTTTCAGACAGCAAGTTCAAACTAACCTAATAGAA 
TCTTGAAATAGTTACCCAGGCTGGATTTAATTAAGATCTTATGG^ 
AGCACCCrTAGGAAATACAACCCTGGGACTTAATTCATCTCTCT 

2 5 ATCATAATTGAAAGTTGTGTTCAATTTACCTTTATTTAATGCCTTCAGAGTGAG 

AAAG CGGGCAGTAAGGTGTTTCTTGTAAGTGGAAAG CAAAAGTTCTCAAAGAAGAGCGTG AATGGTT^TGTTGAACTCCAGAAAAT 
GATGGCTGTGTATTAATTCCTTATACCCAATTTATAC<5CCAAAATTTAT 
GGACCTCCACCCCAACCCTGCCCTTGTATATAGAGTTGGGTACTATCTGGTC 
CCTTGTGATTTTGCTTAGTAGCCTAATGATGATGGACATTCAGCAGCCG 

3 0 TATTGTCTC^AGACAACTTAAAGTTACCaSGAATTCTTTTATT^ 

ACACATTAATTTCTTTCCTTCTGTATTGTTAAAGCCTACTCATTCT 

CTGCTG CTTTGCAT AG TTGG CAAATATT AGTGG CTAAGTTCT CTT CT C AGAAG ATGAG CAGATT CTACCTAAT AGTGTTTGGTTTT 

TGTTTGCTTTGTTTGGGAAGAATAGAATAGAATAGAAGTAATAGAATGGTTA^ 

CTCCTCATTTTCCCTGGACAAGTTACTTAAGCATTCTGTGCTTTTAA 

3 5 AGGGTTTTGTTGTAGCAAGATGTGTATAATATTAGGTGGTAATATCGT^ 

CTTTAAATTTTGATG^CTTCCCATAGCCAACATCTAGCCTTCCT^ 

ATATTTTTTTTCTAAAGATTAATTACTTTTCTAGTGCAAAAGAGG^ 

CTTTCTCATGCCTTCCTTCTGCTTAAGGTCTTTAAAGAAA 

AGACCAGAAGTTTCAT AATTGTCACCT AAGAAATAG CATGG CTATTTTCTCCCACCATCTTTATCATTAGTTTCTTCTCCTCCT 

4 0 TCTTCCTCTAACTCTTCCTCCTCHTCCTCTTCTTCCTCC^ 

GT AT AAG TCAAG CATGCACATTATGAAAAAAAGGTTTT AAG AATTTTT ATT ATCCTT C^ 
AGAC!ATCATTATCTATCACCTCTTTGTCTTTCTTTCTGCGCTCrr 
TGTTGCGTGT ATT CCATTTAGGGCTATGGATATTGTCT ATTT ACT^ 
CTTACAGAGAAACATGAGTATTGAATCAAGGAGTCAACCTCTGTTCATAGCCATTCATT 

4 5 CATATCTGTGGTGT/TAATTTTATATTTCATCCAAAAGTGTAACCATTA 

AGTTCCTTCCTATGCrTGTTCCTAAGCATTTCTTTTATGTATT 
AGCCTCTGTTGTAGCCTTTGGTTGTTAAAACTAGCITTGT^ 
CCCAGGTGGTGCATGCCTTTAATCCGAGCCAGCACAAAC^AGGCAGAA 
ATAGTGAGTTTCAGGACAGCAC^GTTATGGGGAGAAACCCTGTCTTAATACC^ 

5 0 ATACAAACCTAGTTTATGATATTCCAAACAGATGGGTATAAAGGAGACTTAGTCT 

TTTACTGTTATTGGTACACAAGAGCtATTAATACTCCAGGAGGATCT^ 

GAGGAAGTAACIAATTTGTGTGTTTTGTTTTATTACCCTAGAG TAGT CTTGATATAC CATTACTACTTACTGATTCTTTTAGTTTGG 
TGTTCTAATGAATAACAAAACTGAAATCTAGAGACTCAGGTTTGCATCCT 
. TTAGTTTTCTAATAATTTTCATTAAGAAAGAGAATTGAGAAG 

5 5 TTAGCCATTTATATTTTCATAAGAAGTAATCACAAAGGTTTGATCTTT^ 

GTCAAATCITAGACTTAAGTAC^CACCATTTATATCTGAGGTTGAAGAGGTGGT^ 
GAGTACCCCATTAGAGTCACTTGCACCCACATGCCAGCTCACAGCT 

AAAT AC CAGGTG T G CATGTGG T ACAGACAT ACATG CAGG CAAAA CAC CCACATACAATAAACAAATATTTTTAAAAACX5 ATTT AAG 
TGGAAATGTAATAACAATGTACAGGTTTTCTAAACTTCTAAATGGAAATACTAAGTTTA^ 

6 0 TTTGCATTAGTTTTTTGTCTTGTTTTTTTGAGA 

CTCGCCTTGAACTTAGAAATCTGCCTCTGCCTCTGCCTCT 
CACCACrGCCTCCTTCCTTCCTTCCTTCCITCCTTACTT 

TTATTTATGTGATGAGT ACACTGT AACTGTCTTC AGACACACCG AAG AGAGCAG CATCAG ATCTCATTACAG ATGGTTGTGAGC CA 
CCATGTGGTTGCTGGGAATTGAGCTCAGGACCTCTGGAAGA^ 

6 5 ATACATITGTCTAAAAGAAAATAAAACCTCTTAGAATTATTACAGAATTGAGTAAAATAAG^ 

CAGGTGTGGCTTACACCTrrAATCTCCCAACAGGGGCCAGCAGGTCCCTCT 

T AGGCT AG C CAGG CTG CAT AT TAAGACC CAATCTCAAAG AGAAG CCAAAAAC C CAAAGCACACACACCATG CACACATG CT ACACA 

CACTAGTCTGTACATAATACTTTCTCAACATCAAAAAACAAG 

CAAAATGCCTGCGATTGTTTCTGGAGTTTATTTTTATGTAAAA 

7 0 ATAGGTTTCTATAACTATTACrATAGTACCTTTATAATAGAAGCrAAACATGGTA^ 

G ATAT ATAGTG AG T CTAGTTTT AGT CTTTG C ACT AT ATAAAGG ATTTTTTT ATACTG AT AGAAACACATCATGAAAAC^ 
TGGAATGATGTACTGTTGGTAAAATGTGGTCTCTGTGTGTGCCATTAGCATTCTGTGTGCCAT^ 
TGCTGTGGAAACTTCCCCTTTAATGTATAGCTCATATAAGTAACAGCTA^ 
AAACAATGCAAAGTGGGCCCTCCTTCCCrCCCATCTGTGATAGTGCTGGAATCCTC 
7 5 TTACCTTCTGGTTAATTTTCTAAGAATTTGTTTT^ 
TAGACCCGGCTGGCCTCGAACTCACATTTGCCTGCCTCTGCCCACCAAGTGTTGGGATCAAAGGTGTGCACT

TCAGTTTGTCTTTATAACCCAGATGCCrGTCTTAGTTATC 

TGTATTCTGTTGCTTGGCITAATTAAATGCCATTGCTTTTTGTCTT 

AG TGTTT AAAGGAAAATTCTTGGGTCTCTTG AG ATCATTG CCAGTGAGGCC AG CTTTTTCTG CT CT GAGTCAAACAAGTGG CAAAT 
5 GAG CTGTGACAAATCTT AT AGTC CTGACTGTTTCTAT^ 

CAGACCACTTGAGCTTGAGTTCTGTTTTGCTTAGCTGTG^ 

ATCCACATTGTTCTCCAAC^TGTGGTTTTTGCTGCTAGTCT 

TGTTGGGATCTGTCAGTAGTATTCTGGATGTGTACTTAAAATAACAAATAT^ 

TCCTCTCTGGTTAATCATTTAGATTG CATCCATTTTCTCATTTT CTTTT TCCCTTTGGATACT ATAGTAAGTACCAAAATAAACAT 
1 0 CTTCC CT AAGAGACATTTCTAGAAGATCTTATATGAACATGTATGATCATTTTAATGACT CCTTGTGATGTAATATCTTGCT 

TTCCAAAATAGAAAAATGTACTTTTACGGGGCTACTAATATGACACATACCT 

GACCTTGTCCTTCCTTTCAGGTGGGTCTTATTC 

GAGAAAGCTTTTAGTGTGTTGGGCCATTCAGTT^ 

TTGGATTGAGTCCTCTTACAGCAGTTGTGACTGTTGGAGATAG^ 
1 5 AGACTAAACCTACTGTATATTCTACrACCTrTTAAAATGCTTGCAGAGCrrA 

TCAGTAAGACCTAATCTGAAAGCTTAACTTGTCrrTIT^ 

ACTTTCTCTAACACTGAGCTTCTCACTTGTTCTT 

GTAGCTATTTGCAGACCTCCATTGCTGAACATCCCAGCCTAATTC^ 

i riA_ i i rrCCTCTGGGAGGTACTGAGATTTCCACCTGTGTGCCCTGAA 
2 0 GTGGGAACTCCGAATTTCCATGTAATAATTAGTCAACTATGAAGGC^ 

ATGTTCAGGATTCTAGCAACTTTATTT1TAAGGGACTTGATTGATTGCT 

T CTCATTTGTT CCATG AT AACCTCATT ATGGGT CATGAAG CAAGGTT ATGC CATGG AATCGTTT AT ATTTCCAGTTG TTCATGTGT 

TCCTAACACAGCAGTGCAGTTATACACTGGGTACATGTCTTTGTCTC 

TACCTAGGAGCTGAAGGGAGATCAATACAAACTCCTTATGACTTTTAGTGCTT^ 

2 5 TCCCCTCTGATACAGCTAGCAGTCTTTGAACCTTTTTTCAGGACTGTGTAA 

TGATTTTGTTGTTCTTACTCTAGGCTAACTTTTTATGTCTTATTTTTA^ 

AAAG AG AAAAC CAAACACAAAG ATGG CAG CTCAGAAAAACAT AAAGACAAACAT AAAG ACAGAGACAAGGAAAG ACGAAAG G AGGA 

AAAGGTACAGAACTTTGCCTTAAGGGGAAACAAGAGTTAAGATGTACTTTTATTTTGA 

TTTGTTATATCTACCCTTGACCCTGTAGCAGACTTCAAAATTAGCAGGTAGTGCT 

3 0 CTTGCAAT GT ACT CATT AAACT CCTCACT CCCCAAACAAAAAAGATTTT AG ATC CTG TCTG AAG CT ATG AT AAC C AAATT AT 

TTAACACCATGTCITTAACAATGCAATAAGTATGTTTATTTTATACTA 
TTTAGGGAAAGTTTTGTTTTTGGAAAACCTAAAGCTTTAT^ 

TATTCTAGCACTTGGGAGGTGGGGAGATAGGGCGAGTAGAAACTTAAG^ 

3 5 ^rc^NNNNNNNNNNNNNNN^ 

TTTAAAGTGAAAAACAAATTTGTAGGTGGGCAATCATACACT 
CCCACTGTGAGGCTCGCAGCAAATTTTTGAGTAGTGGATATATTTATTGGGGA^ 
TTGCTGTTTTGAGGAAGAACITTCTCGATTAATTTCACCAGAATTGTCAAAAA^ 
TTCAGTTTTGCTAACAAGAAGAGTCAGTCCAGGATCAAGATGCTCCAAAATAGCCTG^ 

4 0 TGACTGTGGTTATCAGCACAGGACACGAGCAGTATTGAGCCTGTCIA^ 

GAAGACTACCATCCATTAATGGTTATTAGAGGAAGTTGAGACATTTTCTCAGGTGGTATAGT 
TCCTATAAGCAATTCTGATTCAACTCAGTGGGACATCCAAAAAGCAGGAAGAAAAGACAACT 
GAACGGGGAGTGAGTAGATCCGAGTACCTCATATACTTCCATATACC^ 
CAAATTTAAAATCTCC(2AAATTGATGGTAATCTTC"1TGAATAAGA 

4 5 TACTGAGAGGCTGTGGTTTTACCTTGCAGATT AG AG CTGCTGG 

TAAGACCTTTTGCTACAGCAGCTTTTTTTGGGAGGGGGGTGTGTGACTCT 
ACTAGCCACCTTGACAATTCAAAAAAAGTCAGTATTAAGAGCACTTGACCT 
GAGACTGAACAGCATTTATCCAGAAATCTACCAGAAATGGAGAAGGCTATCAGAATGAGG l ' 1 ' 1 ' 1 ' 1 
TTTCTCTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCT 

5 0 GATTAAAGGTGTGTGCCACGACGCCCGGCCAGAATGAGATTTTAAAAGTTGAGGTATAT^ 

AT ATATAT AG CTT ATGGG CTTIT ATGGT ATTTAC CTCACT ATTT CCC AAT ATTT CAAGAG AAACT C CTTGGT ACT AATT ATGGAAT 
TTCTTAGTCTT AAATT AATTCCTCCrrTGTAATATAAGACATACGTAAACCCTTATTG<3A 
ACCTATAGGCTGTTTTTACACTCTGCAGTCCATCTGTTCTCTTTTCCTCAr 
TAGATTGAGAAGACCATTTAAGATC!ACATTGTATATTCTGATGT(rTTCCATGGAACCAC 

5 5' AGCATTTGCCTTATTGACTGTAACAGAGTTTTAAGAGCTT^ 

AGGTGCCTTTTATAGGGATATTTTACAAATGGTGTAAGAAAGCAGAATAATAAAGTTGAGTAAGCATT 
ATTCTGTTAAACAATAAAATGAGGCTCACGATGTCTATTCTTTGCCAAGAGATCCTC 

GT CATG AATTCACTT AATGACTTGGGGG ATT AG AAGG ACAGG CG TGTGTG CTTTG C C CCC GATTGTCGTTGCTGG C CTG CTTGT CA 
GCAGTGAGTCCTGTAGAGAATCCAGATGCTTCACTTCAACTTCATCTTCGAAGT^ 

6 0 ACTTAGGATGGGGCATTAACTTTGAACCATTGGATGTTGTAGTGTCTTCT 

GCTGTAGAAGTGGCATTTAGGAATTGGGGAAAATGTGCTGGAATAAGAAGAGCTAATCTTTTTAAAC 
CAAACTGGTTTTAATGCTTTTAGAAAGAAAGGAGAATACATTATCT^ 
TACTTACACTTCAGATCCACCACITCATGAATGGCAGCTTTATTTTGGGAG^ 
AATTTGAAGAGATXH'AACAAGCCAACCTAGGAAAGAACATTCCAAGGTTTCT 
65 CCCTTCCCCTTCCCCCATCATCATCATC^CTACCACCACCTCCTCCT^ 

AAGATAGCTGACATACACGCTCCCCTATCTCATTATTAGTAAATGGCAGAGTCTTCTGAAG 
AAACCAAGACGAACCTCCACTCCTGCTCCTCCTCCTTAATTGTTGG^ 
AAAAAGAAAAAATTAGATATTTAAATTTTACGTGTATGAGTAAATGTTTTGCTCATAT^ 
GTAAAAATGAAGTGTTGGATATCCTGGAGCTAGTTATAGGTGATGGTGAGCTTCCT 

7 0 CAAAAACAAGTGCTCTATAAGTGCTGAGCCATCTCTCCAGACCCCCAGTT^ 

TGTACTCTGGCTCATAAATTCTCTAGATAGATAGAGATTTTGCAAAGG 

CT AT C CCGTT CTATTTT CATGAAAAGATTITTTTI'TT AAATGTATATG ACTATTTT AG CCTCA CATG C CTG 

TGCCTAGTCACTAAAGAAGATCCCCTGGAACTTGGGTCTATGCTTTCTG 
TCCATCTCCTTTCATGCCACATTTTTAAAAATTGAGGGTTTTTTC 
7 5 AAATAACACAATTTT ATTCTTTATGCCTGAAT AAAATT C CATTGTG'i'AT AT G C CAT ATTTTCTTT AT C CATTCAT CT ATTG ATT AG 
CATCT ACACTG GTTCTGGATTT AT ACT ATT ATGG ATTT ACAGAGAGAT C CATG TTTT ATGG TGT ACTG ACAAAG CTTCTCTTGGG C
ATATACCTAGGAGTAGTATAGCTGGGTCATGTTGTGGAAGACCTCTATAGGATTTTCAT 
TTGCATTTTITTTTCACATCACCTTTTATT 
GGGGTGCTCCTITACTGGCITGCCrrcTCATGGCTTGCT 
5 ACACAATGGACTGGGTTCTCTCACATCAATCACTAATTAAAATGCCCT 

TCCCTTCCTTTCCAATAACTGTAGCTTGTGTCACTCAGGTGGACATTAAACT 
TTCTGACAATTTAGTAATCCTTAACTAGCTGCTTTGTTCT 
ATTTTGCTCCTCCAAAAGAGGATATTAAGCCATTGAAGAGGCnTC^ 
TGGAACATATAAAGAAATTTCAATTTTGAAATAATGrTTGTTGCT 
1 0 CAAtZAAATGAATGGCTAATGGACGTGTGATCTGTGGGTGGATAGTAGAAT^ 
TTTTATAGGAATATGOATGGATrTAAAATATGTAATAGCAAGCAAGGTCATA 
GAATTCTACCAATACTTTGAAGGAAGAGAATAGGAAAGGCTAAGTAATTGAAACA^ 
GGATATAAAGCTAATTGTTTTTCTACTTCTGTTTTTATC 

GGTAGATAAGGGCATAAACTACATTCAGTATTGAAAATGTAAAATTGAACAAAGCT 
1 5 AT ATAAATG TT CATTG AACT AG AT CTGTT AG TTTTAG T ATGGT AATTTGGAAGAGTTT AT AT AT ACATGTGTGT ATATAT AT AT AT 
GTATATATAAATTTATAAAAAATTATAAAATTATAAGAAGA 

TITlGriXjTTGTTTGlTTG Ti'lGTTTG CTCTGT AT AGCTCT GG CTG CCCTCGGAACTCAG AAATCAG CCT 
GCCTCTGCCTCTCAAGTGCTGGGATTAAAGGCATGCIAACAC 
CCAGGCATGATGCCACCGCCTTTAATCCTATTACTCAAGAGGCAGAG 
2 0 AACAGGGTCT AGGACTGCCAGGG CT ACACAG AGATAC CTTG TCTTGAAAAAAT AAACAAAAOVAAACAG AAGGG AAAA 
AAGAATAAATATGTGTATATCTATAATTAATAATAATATATTAATTGGCATTTGTGGTTAATGGACT 
TGCTAAGCAAATTAATTTATGGGTTATGTGTAATGTGTTTTTT 
TATTTCAAAATAGAATTAGTGACGGGTAAACTATTTTAGATCCTTGGGAGTT^ 
TTAGGTTAGGGATATATAGCTCAGTTGGTAGAATGCTTATCTACCACACAAAAAGCCA 

2 5 CATACAAACCTCATATACCAACATTTGGGAAGTGGAGGCAAGAA 

GCTAACTTGGGCTACATGAGACrCTGCTrCAATTCTTTTTCCT^ 

AG AG AT AGTG AAGTTGATATTGTTATTT AAGG CTTACATTT ACCTCCAATG AT AATG T ATT C CATTTTACTAGTTTTGAGTATT AT 

ATTATATAATTATATAATACTTATTCTATTCTATGAGATAGTCACAGGATTTAAGATGGAAT 

AAAGTACTTACTTTACAAGCATGAGGACCTGAGTTTGATCCAGAGCTCACATACAAAT^ 

3 0 TGGAAAGGCAGAGACCGGAGACTCCCTAGGACTGCTGGCAATTTAGC^ 

GTCTCAATGGAGCTTGAGAATGTTCTTTAGGAGGACAGTACAAGT 

CAAGTTTGTTTGCGAGCATCCATATTAGGCAGCTCTTTATAGCCTCT 
TATATTATGTCATGATCCTGAACATTTATATGTATACCATATGCATGCCTGGTACCT 

3 5 TTAG AACTGGAGTT ATAG ATTGCTATGAGCT ATCCCATGTGGATGCTG ATAACTAAG CTGAGGTTCTCTG CAAGAGCAG CCAGT AG 

TCTTACCAGCTTGAACJVTCTCCCCAGACCTTAATCTCGTCTTCACTACCAT^ 
ATGGACACTTTTCAAAGAGTTGGGAGATCCTAATTCTATAGGTGAGATCTCTTGAGTTGTAGT^ 
CTCCTTTCTAGTGCTGATTATAAACCTAAGAAAATTAAGACAGAAGATATCAAGAAGGAGAAGAAAAGAAA^ 
GGTTAGTAAAAGTTTTTGAGGCATAOVTTTATTTGTATAAAATAGCCTTA 

4 0 ATGCACATGTGGAAGTCAGAGGACAGCTTTGAGAAGTCGATTCTCCT 

TAACTGCTGCATCATCATCTCTTATGTCCTAAAATAGATCTTGAGGCCAA^ 
TAGAGTGGGGAAATGTTTTGGAAACAAGTTGTGGTTGTGCTG^ 

AGTCTATTCATAGCCTGCGTTGATGTTGATAAAGTATGCCACCTCTAGGTATGCAGAAAAATGTTC^ 

TTCTGTCTTAACAATATTTCATG ACCAATG AAGCATGCTCTTTCCT CTCTAATTTAATCAGTGCCCTC 

4 5 TTGGTTGTAGGATGAAATCTAAGCACTGGAGTACCTTTCCTCAGTCTGGCCT^ 

TGAGCTCACCTTGCTCTGGC CATACrGGCTGCCTCTGTTTGTTTTTCAC AC CAGATTTTGTACTTCAGATTTCGGTGTGTAGTACA 

TGTTGrrCTTCTGTCTGGAACACACATTAT^ 

CTTTTTTC(3UK5GATCCTATTTGGTGCTTGTGGTAGAT^^ 

TTTACrTGCAAGAATTTTTTTTAAAAAAAT 

5 0 TTAATCAATGTTGTGGACAAAATAGGCTTAAATGTGGTTAGGT^ 

TTTTGAGATTGTTATCTCAGGGTGAGTATGCTGACACACACCT CACTCAGAAGG C AGAGGCAGGAAT ACATG AGTT 

TX5CGTCTACAAAGACCTTGTATTCAGGAGGCACAGAGAAAGTAATTTGATCAGTT 
TTGAAGACTGAGATGGGAGAATTATGAGATCAAAGTCrAGCCTGGGCTACACAGTGACAACr 
GATATGCAGGCTGGAGAGATGGCTCAGTGGTTAAGAGCACCGACTGCTCITCCAAAGGTCC 

5 5 GGTGGCTCACAACCATCCGTGATGAGATCTGACTCCCTCTTCTGGTGCATCT 

AAATCTTTAAAAAAAAAAGATTTTAAAATTTTTGTATATGAGACCCGCTTGT 

ACTATAGGTATTTTTAGACTCGAGCCACCAACAGAGTATTTGAATGCTCTTC^ 

AATGCAAT AG CAT AG CATTTG CTTTTTTCAT C C TT AAAATAAT ACTT AAAGACCTT CAAGTTT ATT ACT ACITTG ATT ACCATCTT 

6 0 ATTACAGACTCACAGGATATTCTCrAGCCATTTTCATTCATCATATTCTGTTGG 

GGTTTTATTTTGTTCAGCAGCIAGTTATAT 

GAAACCAAATAGTTTCCATTCCATAAACCATCTTTClATTGCr 

AGATCAAAGTTAGGTTCTTGAGAAGAAAGTAGATTCTCTGGAGTCTGC 

TTTGGATTTGTCTTACTCTrGGGTTTGAAGC r n T i TTTTTT TTTG I I 1TTAGAGTTACACACCCACGGTTGACCTCAAACTCAAA 
S 5 aSATCCCTTTGTCTCTGTCTCCTGAATACTAAGATTGAAGGCATATGCTATCCCCAGCCTATTTTG 
GGCAGTCTGTCTTTGAACTCCTGTGGCTAAGTTACTACTGTTTAGAC^ 
GAACTTCATCCCCTCACTCCCTTCCTTCCAATTCCTGTCCTACCTCrCT 
AGCCCACTTAGTATTTTCTrAGAACATGGGTGTGGTGTC 

7 0 CCTGGCTGTCCTGGCCTCAAACTCAAGAGGTCCACCTGOT 

CTGACTTITTTTTTTTTTTTTTGGTATGCTTT^ 

AACAGGTGTGCACTATCATGCTTGACTGGTGTGi rriGI'I'lU'l^'lGTll' rTl'llTTTl rTTITACTTTGCCACrGCGAGCAGTTCC 
CAGCTGAGAATACrCTTAACTTTGTGTATATAAGCAATAGTAACTGAATC^ 

CTTGT ATTTcrrJ. T u~f ATACTTTG TTAT ATTTAAAGAGATT AAAG ATGAACACACTTTAAAAT AAATGAAG AAAAT ACCCTC 
7 5 TACTGTTAGTTCCCCAGGCITTCCCTTGTGAAATTCCTAAATAATTGTT^ 




WO 03/008583 PCT/US01/51291 



AAATATCAATAAATTTTTCCCTTGATTTCGGCX3A 

TCTT C CT AGTGGCTTCCAAAAACAG CTT C CAGGCGTGT AAAC AATC CTGATTT AAG TGCTTT CTG r'r'l"rr'riX3TAGGATGGT AAAC 
TTAAAAAGCCCAAGAATAAAGATAAAGATAAAAAAGTTGCTGAGCCAGATAATAAGAAAAAGAAGCCGAAAAAG 
AAGTGGAAATGGTAACATCTCAGCACTGGGTTAAGATGTCAGCACTTTCACTGTC 
5 GGTrAAlTrCTTGACATTTGATGGAGAAGTACTTACAGACTCl''1 1 1 "GAAG AACAG ATAAGTTTT CAG ATCCT AG T AATTT ACCAG C 

TCTAAATCTAGTGGATCTCAAAACTAGTTTCTTCTCATTGAATCGGATTG^ 
TTCATTCATTCATTCATTCATTCATTCAATCACTTCAAA 
AATGTTTAAATCAGGCTAAGTCTACCTATCTCATAATACATTCTATCTTTATC 

TTATTATTGCTCTTAGAAATGTATTCACTATGTAAAGGGTTCri 1 1 1 1 111 T T AATCTATAAATCATTTTCATAAAACTGCCAATTT 
1 0 GCATAGGTTTTCAGGTCTATTTGTCTTTTTTCCTCTCTCAA 
ATAAGCTTAAAGTCTACTAGAACACCTGCTACTCTTC 
CTCTTCTGGACAAACATCATCTTTTTCCCCAAATAATCCATC^ 

TAATTCTTT ACCAAAAGACrTGGAGAATGG C CAGTTCAT CAC CT ATG CT AG G TTG AAGTTTGAAGTCTTTGGG CAAAGAAAAATGT 
TAAAAAAGAAGCTAGGGATAAATAATATTGTATAGGAATTACTGTTG^ 
1 5 ATCCAATACATACATGCATACATACATAATTAAGAAACC 

TCATTCT AGACAGCTTCC AGAAAT CTCT AG TTTT AGATTTTT AT CTACAAAT L"l" IT Ari'l" 1 TCATAAATT AT AT AGTCTG TTTTAA 

GATACCTGCTAACACCAGGAGACCTTGACTTGAGAAAGTAGTAGGCCT 

TTTCCCCTTAGCCTCCCAAGTTACTGGAACTGTAAGCCAATCCC^ 

AACT CITT AT AGGA1T IT r AACTTC"I"GAGCTA'l u rrr UT ATTCATTITT AG ACTGTT AAACTTT1TGT Ci i T CCZAAGGAAACTTTTT 
2 0 TGTTTTTTTTTAAAT AAAACTG CT AC CTAACTTG AATTT AT AAG CCT AC CCTTCTTT AGTTGGTG CTGG TG TTGG ACTG TTAG AAG 
GCCAACACTACAGTT1TATTTGTATGGTAGTACCGGTCIX3TTTAAACTCAA 
AAATGGTAAGGAAGTTGTTATTTGTCCTGAAGAAAAAAAACATGTTGAATTT^ 
TTCAAAACAATGGAGTTGATGACAAGAATATAAAGGGAATGGTATTCTGAATCACT 

TGAGTCACTTGTAGGTGCTTTCACATGCTTGCAAACA'l 1'fTTCTCTGATTTGGAGAGTAGTAAATTTGAAGGTTAGGTAGTTTCTA 

2 5 TCTGGGCATTTGACATCAGCTAGTATAATAACGTTTTATCTGTAAAGTGGTTGTTGAT^ 

TTGTTTTTCGAGACAGGGTTTCTCTCTAGAGCTCTGGCT 

CCGCCTGCCTCTGCCTCCTGAGTGCTGGATTAAGGGTGTGCCACCACGCCCGGCTTATTCC 

AAATAATACTTCTTTCTTACAATGTAATTTAGATGTATATTTAC^ 

ACATCCATTAAGAGGCAACAAAAAGAAGTTTTGGTGGGACT 

3 0 AGGGAGGTACGTCTACCAATACTC^GCTTTCCTTTGTCCTTACTC 

TAGCCTCCCAAGrrACTGGAACTGTCAGCCAATCCCATTATGTCCAATATGGAATTCT^ 
TTTTTAACTTATGAGCTATTTTCATTATGAGTTACCTCTTTT^ 

TTTTGTTTTTTAAATTTTGATGTAACCCAACTTATGTGGTTTTGTTGCTGTTACTTTTATATT 
CAAATTCAGTGTGATGAAGCCATT'CCCTCTTATTTTCTCCTAA^ 

3 5 CTGATTCATATAGAGTTAATTTTTGTATATAGTATAAAGGACTTGCCC 

TCATCTCTAACCATAACATGATGATAATTCCTCTTTAAAATGACTTC 
TACAGCTITAAACGGATTCATTTATCTCrrACCCTGTGATTCTGAATGTTTAA 
AGCAGAACAAGGCAGAACTATCCCAGCACTCAGGAGACAGGCIAGATCA^ 
ACAGCCAGGACTACACAGAGAAACCCTGTCTCAAAAGACAACAACTACAA^ 

4 0 GTGTAGAAGTAGCAGTAATAAGTTTGAGAAATTTAAAAGTAGTCT 

GTAAGATCTGTTTATC^TGTAAAGGGTGTGTTAGAGAC^^ 

ACTGG ATCT CATTG GTG AATTTAT AAAAAAT AAAGTGAC ACAAAAAAAATCAAT CTGATGTT AT AG TT CT AT ACAAG ATCAGTTTT 
ATAAAGATAATTGCTATAATATTTATTGAACTTTCACIATCCAAAACT 
TTTAATAATTCTTTGTATATATTTACAGTTAATTGTTTTTATATGCTT 
45 TATTATTATATGTAAGTAGACTGTAGCTGTTTTCAGACACACCAGA 

ATGTGGTTGCTGGGATTTGAACTCAGGACCTTCGGAAGAGCAGTCGGGTGCT 

CTT AATTTTT AAAAAAT AAAATG TTTTAGAAAGATATTTATTCTG CAG TTT ATTT AGATTTTTGTCAAG TAT CTACGGG ATAGTTC 
CCTGT ATTTCAGAAGTG CCIT AAACTTAGTGCACCTCC^TTAGAATCTCCTCAGTC 

gtccaggtcttactatgtagccttggctgtcccagaactctaccttctgctc<;tctcagaa 

5 0 ccagtctggttgtgctttgaattttcaaatagggctgctttcttct 

ttttgcatgtaccactgtggcatattttatatataaatagt^ 

atctctaaattg aaactgttgaaatg aaattttttt aatt ac cat att cttt cagtttg cttg aaattg ctg ct cttttcagtg ag 

acacagcctttgaatgggcaccacaaattgattagttaa 

cacttaatgagcttgtgccttgtctttgaaaggtgggaagaagaaot 

5 5 agggccrgtcttcgctccaccatatgagcctctgccagagagtgtcaagt^ 

tttt'ggcraagaatggtttgtttctattcagggtt ag aaaatg cctatattg cacagtcctagctagttatttcact 

ttgttttattgtctatgtgtggcttgggaaaaggaatctcaggtgtttaattcagaa^ 
aaagggggctcttaatgtcttatctgtagtctaaaaaggtttat 
attttgagtgtgtaactcaataggaagctgttctgtattgagttctcatttaaata^ 

6 0 catg ctg ctt atcact ag aaattg t att ag caaagg att agtggcc atcatt cagctt at agg t aag aaacct aagct atgtcact 

ctaagtagtgcaactgtgcctttgacaggtgaattttaaagaaac^ 

ctaactgtxstagccctgcctggcctgtaacrtgataatatagatcagcr^ 

ggc^catgccaccttatcrggccrccttgggtctcti^ 

ggttg ag gg at ct g agag t agcctctcacaatccttcgactgactg tgt at caggtagtaatg tcc aactg actcctt cattct aa 

6 5 gagttctgtccagagatgctagagttcgttctcaaactctctatgtatttataattcaccaggttaat^ 

ggtg ttt atgaatcaaataaatgt attt act aacagggagagacactggacac 
tatct ctttttttttt t tttt^^ 

agaagaagac-ggcgtcagattttgttacagatggttgtgagccaccatgtggttgctg 
cagtcgggtgctcttacccactgagccatctcaccagccccttg^ 

7 0 TGCCTGTGGGTTTAGGATGAGACGGGTTTATTAGCATCCAATAATTGAAATCTC 

CATCCTTAAGATATTTTATTTCTCTCTCrCCCTCCCTCTTTTTTTTT^ 
GCTGGCTTTGATCTCAGACrGCCTGCTTGTCT 

CTGATACCTTATTTTTACAAACACACCCCCACATACraTC^TTCrCCAAAAT 
ACATGTTTTGGGAAATGGATGTTCAACTAATATTTTCGAAAGGTTCGGAAATGTT^ 
7 5 TCCCAGATAGATAGAGACTACTCATTT:AATeTATGTAGGAA^ 
ATTAGCTAACCAGTTTGACCAGAAACTGTAAGACACCCTACACACTAjGATC^ 
CTTCCTAGTGAAGAGAAAAGGCCTTAAGACCTTGAAAAAGACAATTGCTC 

CAGCTCGCTGGAAAAG CCTCACTGTGAAAGCCAAGGTGATGACGAT AGAAAGT AG f TTTT 1 1 1 I TT I 1 TTTTTTTT ATTGTGGGGT 
TTCAAGGAAACTAAGGCCTGTGGTCCCrACCrCTAGTTTCCTTTTCT 
5 ACTCCTCCAAATAAGTCAAGAGTGTTTGACTTACTAGTGCAAACTGATAACT CTTG CCC 

TGGACTTCCCCCTGGCCGCCTCTGCACAGGACATGAGTTGGCTG^ 

CTCTGACT CAACAGri"l'L , l"l'l"ri^TTTCCGGG AGCT CAACTTTTC C CACAAAG T ACT ACGATG ATTGCTGGGAACCT ACACACC CT 

CCACCACTCCGGTGCTATAGCTGTTACTTAATCCrrCCACAGCCCATTTA 

GCCACTGCAGTGCAAAGGCTCCCCACCCAAAATAAGAACACTA 
1 0 CATTCCTTAAAATTTCCTTATCTATTTTT A l'rrrri ' GTTT 

TCCTTTCCCAGCCITCCGAATGCTACAATTATAGGCTACTGCT^ 

TCCCCTCCATTATATTTGAATCATGTTGCTGTTTTGCTGGCAACT 

AAAAAATCCTTTTCCAACACJVGAATCTTGAGTGTGCTGA^ 

TGATAATGTACCTGTAGCACAAGTTTGTTTTGTGATGTAT^ 
1 5 GATGAGATGTCTCTCAATTATTAGAATGAGTAGGCTTCT^ 

TATTAGTACTTACATAGTTGTAAGTGTGAGATATTAATCTCATTTAGAATC 

ATAATATTATAAAGTTGATTTTTTTTTTTTTGAGAAAAGAT 

GATTAAGCTTTTAAGTACTTTTCTTTCACTGTCAGTTATGATCT 

TAG TGG CTTTC CTTGTGAACCACAGGe"i"i x-iU"iGGACAGTei"i"i ACATATACTTACTTG CTG CCTAAC CCACCCTCACCTCATG CT 

2 0 GTGGAATGATTAGACTTTCCTGAAAGAAGTGAGTTCCT 

GAATACACTAGTGCTCCAAAAGTGCTTTTCTGGCTGGGAGGCT 
AGGATGTTTTTTTAGAACCACATCAAAGTAGC^CACTTG 

G AATGGATGAGTC CGTG CCAG CTAGCCTCTAT AT AT AT AT AT AG C AGTGAACAAAG AG ACCTTAT CT CAAAATAAGATGG AAGG CA 
AGGACTGACACCTAAGGCTCTGACCTTCCATGGGTACTGTTGTACACATACAACCC 
25 TCTAGGTATAGGACTGAGGCCCAAAACTTGGTATTTATAACAAACTTGCITGTGATG^ 
AGGTTCTAGAGGAAGGTCTTATTTACGCTTTTTATGTGGGA 

ACCCACGTAACrTTGAAGAAGAAAGATG CCTATTTCTTTTCCCCACAATTC Ci r T C l'TCCACTCCCTGCCCCTATCCTTTCCTC CC 

CTTCTTGAACAGAGGCTGCTTAGCrTAGCAGTGGCAGGAG 

GACCCrGCTTGCTGAAAGGAAAAGGAAGAGTGGCTGGGCCTTCCCCAGGTCAT 

TCTGTTX3TTGTTATCACTGTGTGTCAGCAGACCTTTGTAC^ 

AGAAGTAGCTACATTCTTTGCAAAAATGCTTGACCACGAATATACTACT 
GAAAGGTATTGTGGCCCATGTGTTCATATCTTAATCATTTA^ 

3 5 TTGTTTAAAAAATGCAAAGTATAGTTTAATGTATATCTTTTT^ 

CCTTCTCAGCTT^CATGTGACTGCAGGAAGGAGCTTATCAGATTAGGTTAAAC^^ 
ATT'GTTTCTGrCAGGGACTGTCCTCTGATGCCTTTACA 
GTTTTCAATTTGGATAATTTTGAAAAGAGGTTGAATTAAAATTCACC^ 
TGCATGGGAATAAGAAAGTATTTCTCTTTATTGCTAAAGTAATTGATTAATATAGGTCT 

4 0 TTGTGTTTTTGTCAAAGATCGCTAAAAAATGAGTAATTTTCTTTGATAAATGACTAATGA 

TAACAGAGCAGCACCACTCCCCAGAAGCAATGCTTATTGAGGTTGGCTTCCCr^ 
GAATTTTGAAGAGACAGGAAGACTTATTAATATATTTAATTTTCTTAACCT 
AATACGATTACC^CCTAAGCAAATGCGACTTTACACAGATGAGCCAGTATTTCAAA^ 
CAAGGAAGAAAAATTGGTATGATCGTCTGCTAAACTAGAAAATCTTGGATA^ 

4 5 CAACAGTGAATTCATTACTTTTATATTTTTCTC 

TTGCATTAGACTTCTTTAAGTTCATTTTATCT^ 

ACACTTTCACCATCAAATGCATCTTTTATGGACTTATCCCCTACCACAAAATA 
AAAAGGGAAAAAAATTGCTTTATACCCITTTGTGAGTCTGTGTTTl'GCATTCAATCCTT 

5 0 CTTTCT AAGGCTT CACATTT AAAAAT ACCT AGG AAGGG CTCAGTTG^ T ACAGGG CTTTCCT AATG TGTG TGG AAC CATG ATTTTG CT 

TCTCAGCACTGCATAAAACCATGTGTAGTGATGGTACATGCT^ 
CATTCTCAGTTACATAGAGAGTTTGAAGAC^TCTGGGGTACATGAGACCAGATACr^ 
TTrATAAGTCTATACCCTTTGTAATGAAGATTAAGTGTTGAAGCACTAGTTCACT 
CAAAAGTTGAGAATCTTATTTGTGTAAAGTCAGTTATTCTTGATAGATCT^ 

G AG AT AAAAATATGT AATT CTT AG CATGT AG G AGTTTG AAT C CATTTTGTAT ACTTC CTTGG T CT ATTTT AGAAAAT CAAAG AAGA 
AAATGAAAAGTTATTGAAAGAATACGGCTTTTGTGTAATGGATAATCATAGAG^ 
GGCTTTTCCGAGGCCGAGGGAACCACCCCAAGATGGGTATGCTGAAGAGAAGGAT 
AAGTGAGCACAAAACTCTTTAGGGGGGAGAAGTGGGTGCAGGGCTATTTCCT 

6 0 GAGAGCGGTGCAGCGCTTGCATGGGCAGGTCTCCCAAGTTTGTTCAG 

TTCATTTCATTTTCTTTCTGAATGGTTATGTTCATAGAGACGCAAAGGTTC 
CGACATGATAACAAGGTTACTTGGCTGGTCTCCTGGACAGAGAATATCC^ 
ACXjAATCAAGGTAAGCATAAATCTAGGCCACATCCATTGGTGGAGCTAATATAGTAATCTC^ 
TGCTTTCTAGACTCATTTTAACAAATGATACTCATTTCAACAAAT^ 

6 5 TACCTAGTAAACTCTTATTAAAGATGTTTTACATTTGTGTCCrrATAGTAATCT 

TGAGAAATGTAGATACTATTTTTCTCTAAATATAGAGACAACTATAGGACCT 
ATACTTTGTCTCATCTGAGTAGGTTCCTTTAGGTTGTGATTGAAATATTGCT 

7 0 TTCCATCAGACTTTATGGATATATGAACTAGAAATAAAAGAGAC^ 

AGCCTATAGAACTGTTTTCCATTGAAGAGTCTAAAGCTAAAGAAACCCTTGTTCCTAAGG 
TCAC^GATGAAAATGCTATGTAAATTTACATGCTTTCT 

ATACCATCTGGATACTTTCTTCAAATAAATTGGAAAATAGTATCTGTTTCAGAATAAGA 
CCTTrATTCCrrrCT T CCATGAGATAGTTrACAGAACTACAGAAGTGGGTAA^ 
7 5 TGGX5CAAATTCTGTAAGGATCCAGTTCTGTTAGAAGAAAGGTAGTC 
GAClTTri ' AAAGGAAATT AGATT AAAATGAGCTCTTT AAAG AGG TCTT CTAAAT AG AC CCAGGTG CTT ATGG ACAAAAG AGGAAGG 
TAAGTTGGTCTACAGGGATGTGACAAGTACCACACAGTGTCAAGTAGGG 
AAAAAAGACCTGAGTTCACCTCTCCAGCACCCACATAAAGCTACAC^^ 
AGGAGCTCCTGGCTGATTGTGGTCTTCAGTTTCTTAATCAGGTGG^ 
5 TTTTTCGTGATTGCCGGTGTTTGATATTCCTAGCGCCCATACATACAACAGTATGAG^ 
GTATGCCTACATC^TACATACATACATACATGCATACAT^ 

CACAACTACCAAGTAGTAGTGGTTCTATATATAATATTATTAATATACTAAATGTCACATTAATGG 
TCACCACAGTCAGTATATTGTCJU3TCTGTGATGAGTATTATTTTA 
TTACAGGCATATATTCACAAATAAATTTAGTCTGTGGGAATTAAAAAA 
1 0 TTCTTAGAATGTAATTTAGATCTCCAATCTCATGCrTTCCCATGAC^^ 

CAAGAAGACGGTCTTCTGTTCCGGGAAGAGAACCCTTrAAGGACTTTGAAT^ 

AAGATT AGCAAGTT ATT ACACCAAT AAATAT AAACT ACGAG AAATG CTGCTG CAT ACCT 

CAGTGCTCAATCTTAGCTCAGTTTCAGGAAAGGCTTT^ 

CATTTGCCACTGGAAAGAAAAACAAAGGGAATTAGCATTTC^ 
1 5 TCACGTGTTTGGAGAACAGAGTGCTTTGAGCAGTAAGAGAAA 

AACITTTTAAATGAGTCTAAAAGGAAGCTACCGACCTAGACAGACGTA^ 

GTTTTGACTACTACTTGGTGTATGTAGCCTGTATGTATTGAACAaVAA 

CAAGAAACTGTAGTGATCAAGGCAGGTGATAGO^GT^ 

CTATAGGCAAAATCACAGGAAACTGTAGTTCTCACAAGAGTC 
2 0 GAGG TG CIT AGTCT AC<3 CAGT AATAACCAG AT AACTGCAT AGT AAGGATTTTC CG T AAAC 

TTGAAAATCACCCCATG ATAGAATGCTTGACTTG ATACAACTGGGA CAAG AGTAGTCAT CACAGGTCTGG AAAGGTGG 

AACTGCTC^CAACCAGGGGTGAGGAGGCAGTGCCTGTGGCCT 

GAAT AG CAGTGG CTITCAGAGGG AAG AATATGATAGC ACCATGAATAGTGT ACTG CTACCAGAGAATTATTCCATT CTT AAAT AAAG 
ACATAGGTTO^AGCrTTAGGATTTTTTGGTGGGAGGGGGCATGGTC 

2 5 ATGT GTGAGT ATTTTGCCTG CAT AATG CATG CAT ACTG TTTT ATGT CCCTTGTG TTTG CAG AGT C CAAT AAAAGGT ATCAGATCC C 

CAGAAC CAGAGGTAGTT AGGG ACAG TGG CAAG CT CTTCATGGGTG CTAC^ AACCAAATCTACTT CCTTTTT AAC^^CAG AGAGTG C 

ACTTAGTCTCTCAAC^ rm ' T CTAGGTCTt^ r rrrc r T T 'rA^ 

CAGATCTTAGCACTTTGTCCAATATACTCAAATCAGTTAATTAAAAAA^ 

TTAAGAGCACTGACTGCTCTTCCAGAGTTCCTGAGTTCAAAT CCCACCAATCAC ATGGTGG CTC ACAAC CACCTGTAATGAG ATCT 

3 0 GAAGACAGCT ACAG TGTACTTACAT AT AAT AAAT AAAT AAAT AAAT CTT AAAAAAAAAAT ACT 

CCTAATACTTGGC^GGCAGAAGAGTCGCTTTGAGCTGGGCATC^ 
GCAGATTTCT<3AGTTCGAGGCCAGCCTC<3TCTACAAAGTC^ 

AAAGAAAGAAAGAAAGAAAG AAAG AAAGAAAGAAAG AAAG AAAG AAAGAAAGAAAGAA^ 
GAGTGTGTGTCTCGCTTTX3AGTCTAAGGCTATCCTGG 

3 5 TT TTTTTI i i 1 CCCTAGATTCTG AG AGATTTCACATGTTC CATCAAGTGTrACCATGC AAATATACACATGGAGGATTTATGGGTG 

GCACTAACTGTTCATAAAACCAAAAACTTGAAGTAGTAGGGGGG 
TTATATAAATGTATGAACATTTCAAAAGTAAAGATAAAATATGTGGCAATTCAAATT 
TGCTGGTAACTCCAGCACTGGGGAGATGGGAAGGGATCCCTGGG 
CC^TGAGAGACCCTATTTOUKSTACAAGCCTCTGAGCAACCAG 

4 0 TGGCATCTTCCTGAGTACAGGTCCTGTGAAGAGTTAAAA 

ATCTACATTTAAAAGAAGTATGGTGGCCACTGGTTTATAACT^ 

GGGAAAGAGGAAAAGGAGTCAGAGACrCCTACAATATCTACCTTTGTT T ^ 

TGATGTAGTAATAGAATTTGAGGGAGCACATTTCTGCCAGTGTCAGGAAGGTACT^ 

AGACCTAGGTGGGTATTT1TTAGAAGCAGTCCT AAAGGGTTTTG AGTCTATTAT CATTTTAT AGGTTCTG ATCAG CAGCGCTGTAA 

4 5 GCCACCAGGATAAGCGTATTTCGGAACTAGCCCTTGCCTGCT^ 

TT CT AG AG AAAACTTTG ATTTGTT CCAGG AGG CCAATCATGT CAAC^AAACACT CTTT AGTGGTTT CTT CTAT AAATT ACTT ACTT 

GACTTTCTCAAACTTTGTCTGCCTGACCACAACCAATGCTTATC 

GTATTTTAGTGAGTCTCTCAGGTGTCTTGGTGGTGGTTTATGTAAGAAATTGTGGTA 

CAGG AGG AT AG CTTGT ATTG T ACATTTTCTAATTT ATTCT CTTO CAG AGAGAAATTTT CAG C 

5 0 TTCCTGAAAGTTATTCAACCCCITGTTTAA 

CT CCCT AG C ACAT CCAAAAAAG CATT ACTTT CTG AGCATTTG TT AAGAT CTGG AAG T AAG C CTTTTT ATTC CATG CAC CC CCATGC 
TTTCATGTCGTCCTATGC^TTAGGTCCTTTTGTCTTAC^TGGAAAACAATATACAAA 
TAGGATCTTTAAATCrrCATGTCn3TGTAGGGGCATGTGCGTTTGAGT 
GTCACAGGTCACCTCTGATATGCGTATTGAGCACAAGTCTTCTGGAAGAGAGC^ 

5 5 CCCCAGATTCTTAGGGGATTTTCACCTTGTAAAATTCTGCCTTGTGGAATTTT 

GCTTTCAAGTCCTTTTCTACTCAGGGTGGTAACIAGAAATCTTTT 
TTTAGTATTTATATATTTTACATAATGTACAACTCCAC^ 

TGACT CAG TTGAAATAGTGGTGTCCATATATGCAT ACAT AGGGGCTGTCCAATC^GGGTG 

AAAAGAGACCACTACAAATAC^CACrAGCAGCT ATCAAATG CT AGG AGTTGGGGTTT ATGTATCC CTCCCC 

6 0 ATTTCATGATGG AATGTTGATACTGTGCAAGCAACCAGAGCAGTTCCTGAGTG CAGTGTGG CCCT ACCATGTTTAGAAG ACACCTT 

C^jCCCCAGTCAAGTTTGTACCTCTTTAAGTCTTTCCATCACCTTTTCCGA^ 

GGGGTTCAATTCATG CCTG AATATTTCACAG ACATTCTGTT'CTCTGT AATTTG ACCAGTCG CAGCC 

TG CACAACTCTT'CTGATGAGG ACCAAGTCTGGGGGTGTACAGATATGTGTTTAG^ CCTGGTGCCATGTCCATTTAGC AAA 

GT CTCAT CAGTGCTTTTCTT AACCCATCAGACCTGTG AGTTCCCCAGCT CACGTG 

6 5 TGACATTTTCATGGGCTTTAGGATCAGATCT 

GGTTGGTGTACACCTTTAATCCTAACATTTC 
TCTAGGACAGTTAGGGCTATATAGAGAAACCCTGTCTTGAAA 

CAGTCAAATCAG AGACCATTTTAAATG CAGAAGTGAGAAAATGGTG AAT AATTACG CCTATTCATGG AAGCTTTTXjTCTTCCTTGG 
T AATT ACATGTTG TTG TCTTGGGTTCTGTTT CTTTG AT AGGTCCCTG AAAC CG CT ATC^CTGTCTTG AATTT AATGG AATTCAGTG 

7 0 CTAGGAGAGAACTGGAGCATGCATTCCACAGCAAC^CT 

AAGTCAGTTKKMGGCATGCTTGGTGGATCTCACAAGAGTG 

ACGGAGTCAGTAGAATAGCAAGTATTCTTGTCATGTTACATGCCCCAGACTTr 

AAATCAGGTTCCTGTTTTGGGGTTCAGAGCTCTA^ 

ACACTGCGCGGCGTCTGAAGAAGTGTGTGGACAAGATCCGAAATCAGTATCGG 
75 CAGAGAGCTGTAGCACTGTACTTCATTGATAAGGTGGGGACATGTTCCTTTAT 
WITGTCTACAGTATAGGGATCAGGAATGTATAT 

TAGAAAGAAAGAGGCATGTACAGTATGTTGGCTATTGAGCAGTCTCTATAA 
GTTTGTTTGTTTGCTTGTTTTTGGTTTTTCGAGACAAGG 
GCIX3GGCTCAAACTCAGAAATCTTCCTGCCTCTGCCTCCCAAGTGCT 
5 TTTTTTAATAACTGTGATGCTCAAGATTTTTGACCATA^ 

TTT ATGTGAAATTAAATCG AAAG TTG CACAAAC CACTGGTT AACT AGT AG CATTG AATTCTGGTGGTT ATTTCTCATTTCATT CTT 
TT AG AAAATG ACAATT AT AAC CTAT ACAATTCTTAAAAAAGTG TGATTT ATG AT AATTG CAGTTTG AAAG ATAGTTTTTTT AGTG C 
TCAGATTTTTTTCTTCATTTTACACTTAAATT^ 
TCATGAAAATATGGTCrCATTTGTTGCCTTGTTT^ 

TCCCTCCCTCCTCCTTCCTCCTCCTCCTCCTTCTCTCCCTTT 
TATTTGTTACTATATTTAAGTACACTGTAGCTGTCITCAGACCTC 
ATGTGGTTGCTGGGCTCTTAACAGCTGAGCGATCTCTCrAGCCCTC 
ATGTTCATTCCATCTACAAAATTGGGGGCTTCTGGTTTAGAGTGTCATTGAGATT 
1 5 TTCTGAACAACTCTGTTCCCrrrCTTGCACAGCTTGCTCTGAGAGCAGGC 

GGCTGCTGTTCACTTAGAGTGGAACACATCAATTTACACCCAGAATTGGATGG 

GAAGGACTCAATCAGATACTATAACAAAGTCCCAGTTGAGAAACGAGTAAGTTCATGTGGCCTC 

TTTTTACATGTTTGTTTATTCAGCAAATATTTITTGAGGGCTriT 

AAAGAAAGAAAAGAGATAATATTTGGTATATGGTGTGCCTCATTTCATGCCTGCCCTGCAACCT 
2 0 TGGCCATCTCTGGAAGCAGTTTATAACAGAGTCTGCAGAGAGTTAGCCTTTATA 

TTT AGTG ATTTAAGCAT CACAGCATCTGCTGATCACTTGTT ATCTTGTGG TTT^ CCAG TC CAAGC C CCAACTGAATAA 

AATGGTAATCTTTCAGTTCTCAGTCTCTGAGCTCATCTCTGTAGACCCATGAGATAGAC^ 

GATAACAGTGGGTGCCAGTTCAATCGAGTAACTAGGGTAAGCATAGGTTGGCTATAAGGGAGTC 

GC CAGGATG TTTGATG CT AACAAACAT CCT ATTTGAGTT ACAATAT AT AC CACAAATGGCTCAG AACT T ATTT ATG 

2 5 TAAATTT AC ACATG CAG AACA&GAATTTGGGT AT AAT AT AT AT AGGTTGT CTTT CT TGA 

GTTTGAAATTATTTCTAAAAAATTTTAAAATAATAATTGGACTTGTTTTA 
GCAGCCTGAGGAOSATCTTTTTGATCGACTTAATGTGAGTAAACACAGGAC^ 
GGGTCAAGAGTTGTCAAAGGGTATTTGGACTGTCCATTCAGTCTAGATTC 
CACTCAGTCATCTGTAACCAGCAGCAGACCCTCCCTCCTCCCACC^ 

3 0 GAGGGAGCTGAGCTTTCCATGGATCTTCCCTCTTCCTTTCATCCTTTCATGC^ 

CTAAGGCTTTCACTTGCTTTCTTTACTGGGAGAGATATTTATAAGAAAG 
TTATAGAAAGGCACTATGCAATCCTTCCTGTATTCIGGAATTATTT^ 
ACTGGCAGGATGGTGGCAGCACTGGCACAAGGACTCCTAGTATTCACT 
ATATCCrCTTTGTTGTCGTGTTTAATGCAGACTGGTATTCTAAATAAACATCT^ 

3 5 GTTCCGTACCTACAATGCCTCCATGACACTACAGCAGCAGCTTAAAGAGCT 

TGC^ATGGTGCCAGTTTCCTTCTCTTCATTCTTTCCCGCCAACCT 

TT CCCACTG CTG CTGC AAACT CT AGATT AATCTCATAG AATACT AAC AAG AG CTTTG CAT CT GGTCT AAAAATTTTT ATGTTTTAT 

CTGTGTTACTTACAAGCTCAGAGGGTATAGGCAGTGGTGTTAAATTGGGGAGAATAGAGTTGTCA 

AGTAAAT LTT ' lTTC Tl XrnT CT Trr 

4 0 ATTTAAAACAAACTCTCACIACTGTAGCTAAAGGCTGGGC^ 

CCACAGCTGAC^CTGTAGGCAACCCTTCTGCTCCTGCCACTTGAATTTCC^ 
AGCCAGAGATGGTATGCTGGAGATCAGTAATTAGGATTGGTTTAAACAGTGGCT 
ACTTC^AAGCAGAGGCAAACAAATCTCTTGAGTTTGAAGCrrAGC 
ACCCCATCCCCACTCCTACCCCTCCACAAAAAAGAAAGAAAGTCAGATCCCTAGA^ 

4 5 TCTCTGCATGAGCAGCAAGTATTTTTTAACCACTGAGCCTTCT 

TCCTGT AGTCATIT CTTCTAG CACAG CTG GAG CATTTG CT ACAT CTG CCT AG T AAGTGGG AG ACAGGG AAT AAG AAAAG CAGT AT A 
AAGGTCTGGGGGCTTTGGGTTTTrGTTTTTTGTTTTGGTTTC 

ACTGTCTTCAGACATACCAGAAGAGGATATTGGATTCCATTACAGATGACTATGAACCACTG 
CAACCATCTGGAAAGCAGTCAGTGCTCTTAACAGCTGAGCCATCTCTCCAGGCCCAAC^ 

5 0 AAATATAGATTTCCCTGTAATGTAGACAAGCTGGGCAGTAGTTATAGCT 

CATAATGAGACCTGGACACAAAATCCAGTAAGGGTAAAGAAAAATATAAAAGAGAG^ 
TTCT^GGTTTATGTAGAAATTTACTGCCTTTGACrTCAAATGTTGA 
TCTAGAATGCATTTCATGTGTATATTCTTGGTATACAAACTGACCCAGTTAC^ 
AGCGAAGATTCTATCTTATAACCGTtSCCAATCGAGCTGTTGCAATTCTTTGT/^ 

5 5 AGTCAATGATGAACTTGCAGTCTAAGGTATGGTGGGTCCAA 

TGGAAATAATTTGATAGTATCATTGTTTTTTTTTCCTAAAATAT^TGTGCTGGAGACA 

TGAAGTGCTAACTTTGACATACCACTGAATTGCTAGTTGTTGTCCTTGTTATAC^ 

CTTGCClTGTGTGC^GCATGCrrCTGGGCTCC^TCCCrAGC^TGACA 

6 0 AAGATACACCTCAGTGGCAGAACCCACCTTTGCTTGTCTACCATATGTGAAGCCCrc 

AGGGAGAGGAATCCTGTTACATTGTTTGAAGTTTAAAAGATGGTGAAGCAATC^ 
TTTACATCGGTTGAGGTGTXjTCAGATATGACTGTGTCAGGTTGTGCCCTATT 
TCTCTCGGCACTGGCTGCTTAACTATGCATTTAAATCCACAGTGAACATGTTAAA^ 
TAATACACACACTCTGTGGCTTAGGAATAAGTATCCTCATTTAAGATCTTGAGGTGATCCT 

6 5 ATAATAAAAGTGACATTTCCAGAGTTCTGCAACACTGCAATC 

CAAGTTATriCTTTGGATTCTTGCTCACAGTTAAAGAGGA 

TT AAT C CAAATTT AT AGAATTCT AT ACTTGGACAATTTACTTTT AG AAACT CTTTG CAATG AAACTTAACACAT ATTTT CAGG CT 

TAGAATATTCCCCTATTGAGATTTCTGCAAC^TGAGAACACAGGCATAATGTATATTAATCT 

TACCCCTGGCCTGAAAGGATCTCCAGATCCACAGGGAGTGAATCTCAAATCT 

7 0 GCTAAGAACCCCACAGACTGCCTTTTGAGCATGAGCCTTTGCTTTGCCCT 

CATGTGTTGTCATTCCAAGCCCAGAATAAATAAAGTATGAGTACCAGGTGCT 

GAGGAGGTCATTTGTTGCTCTCATTTGAAACCACTGTTTGAGCAGCTCGGGCTGTCT 

TTCTCCAGTTCTGCCCACAAAGTAAACACACGCCCAG 

GTTTAGCATTCACAGAAATGGTT ACG ACTGTGArrTG CTG TGTTTCT ACTCT AAGAAG C CTCTG CCT AGTTGTGG CTCCAATGCTT 
7 5 TTGTGTGCAAAGTCAGTTCTAGTAC^AACATAGATAGTGG 
AATTGTTTCT AATCTTGGGCTG CATTTTATGTGGGTCCCCTACCCCCAC CCCCGG AAGGGTAGAGGGTGGCAATGGGTGTGGTGTT 
TTTCTAACAAATTTTAGGCTCAGTCAACAACAACCCAA 
TAGGTCAGCAGGGTGATTATGGGGATAGGATTCITTAGAATCC^^ 
GCTGTTTCTAAGCAACACCCAAAGCAGAATTTAGGTTTTGTGGGTTCAA 
5 CTTGTAGAT ATGGATTC CAT AAG ATTTC CTCTGTGGAGGG AAAACTT CAATTGATG CTT ATC CJ.T CT CA 

CTTTAGACTCCTTTACTTTCTAAAATAGTAAGCCAAGTTGAAGC 
GATATCCTCTTTAGTATATCCTTTCCCAAAGCTGGCAATTCTr^ 

CTGGTAGTCTCAGAAGGAATGCTAAAGGCCCAGAGTGGTGGCCCACGCCTATAATCCC^ 

CACTGATCACTGAGTTTAAGGCCIAGCTTAGTCTACA 
1 0 GTTTCTCTGTATAGCCCTGGCTGTCCTGGAACTCAACT 

CCCAAGTGCTGGGATTAAAGGTGTGCACCACCGGCTCCAGTCTCTTTCTAACGAACT 

TAAAGTCTGGCA.TAGGATTAATTTGAAAATAGCTGTCATCTGGGTCCTCCTCTG 

TACAACCAGATTTCCAGGTTACTCTACAGAACCTTATATTGTTCACCGAAGAACG 

TTCTGCAGTCACTGTCCACCATTTTCATAAAGCCTTTTTG 
1 5 TTCTTTCCCTCTCCAGGTCATTTTATTCCTAACTGCACTGCCGTTTTTCT 

G CAG ATG CT CGAAGGG ACCTGAAAAGTG CT AAGGCTGATGCCAAAGT CATG AAGGATG CAAAG ACCAAG AAGT ATG T ACCTGACAC 

TGTACCAAGGGTTGGATAAGGGAGAACAGTTGGCTVCCATC^ 

CTTGGCTTCCTAGGATAAACCACCATTCTCAGCTGTTT^ 

ATATAGGATACTTGTTTGTTCTTGTCCAGAGGTCAGTCnrAATCTAT 
2 0 AGGCTGT ACAGAGACTAGAAG AGCAGTTG ATGAAGCTGGAAGTTCAAG CCACAGATCGAGAGG AGAACAAACAAATTG CCTTGGGG 

ACCTCCAAACTCAATTATCTGGACCCTAGGATCACAGTGGCnTGGTAAGTCTTAGGCC^ 

AAAGGCAGAAAGTCTCTTGGATATTGCCAGCTCCITCATGCCGTG^ 

TGCTXaAGGTACATGGAATATCCTAGGTGCCCCATGGGCACATTAGTATAGTAGCCCATGCCTGC^ 

AATG AGCTGATAAAAT AAAG T C CTTTG AAT ATTTTTGTG CACCAGAAAGCTCTGG AC CCCTGCT AT^ CTAG AC 

2 5 TAATCTCCriTGTTGTCTCCATATTTATTCGTTTCTTTTTTCT i'rCl'l'TCl'l'TCTTTCTTriTrrrriT'riACAGGTGCAAAAAGT 

GGGGGGT C C CAATTGAGAAG ATTT ACAACAAAACCCAG AGGGAGAAGTTTGCTTGC^ 

TTTTGACCAGTCTCGGCAGGGGTTCTGTGAAAAGGAACAGTGTGGTTTGGGGAAGATGGATAAACTGAGCCT 

CCCGAGGGAGAGGCCACAAGTCTTAACAAACCAACATCTTTGCAAAAAGATAG 

TTGTCCTATGGACAACTTATTTAAAAATATTTCAGATATCAAAATTCT 

3 0 AGAGAGCAAGTGGATGG^AATTTGTCAGCATCCTGCCAGGCAAAT^ 

GTGAAGTCTGATTATATTGGTGCGTTTTTACAGTTAGGGTTTTGCAATAA 
CTCCTCTCCCCrrATTTCAGGAATTTAAAATTAAGTAGAACAAAACAAAAACC 
TGGGAATCAGTTTTCATTAAACTTGAAGCAGTTGTGACTTCGGCAGTC 
GAAAATATTTCCTGACTTGAGTGTTCCTTTTTAAATGTGAATTTT^ 

3 5 TGATCTT AAAGATCGTGTAG CTTGGGGTTGGGGAGGGATGAAGGGCGAGTG AGT CTAAGGATAATGTAATAATCAGTG ACTGAAAC 

(^TTTTCCC^TCATCCTTTGTTCTGACCATTCTTTGTACCCTTTAAGAGATCC^ 

GAAAGATGTTATTGTATAAAAACTTCCACAGGTCAATAAATTTAGAGGAAAATGAGCATTrGGTCCAAGA 
CAAAACAAAAAAGGAAAAATAATCAAGATTTTAGGGCTTTTATT^ 

AAAGGAAAATTTT AATG TGAAG ACTITTT1TTTG CT AT AAT CATT AGTTTT AG AGGCAT TG TT AGTT AG TGTG TG TG CAG AG T C CA 

4 0 TT CCCACATCTTTCCTGAAGTATCTTCT A irrri A 

ACTTAACGGAAACTTAGCGTCTGCCTGTTTGTGAATGAGGCCCCAGGCCAGA 

GTGGGATAAAGAGTATCAGTTTCCATCCCTAAAGACAGCCCCAGAAAGCAAGGGAA 
CCCAGTGTGGGCAGTGCCTACTGAGTGGGACAGCCTTGTTTATTTGAGAAACAATCGTAGAGAGCT 

4 5 AACAGGATAGCTAGCTTATATGGAGAAGCCTCCAGACGGAATGAAGATTCTG^ 

GAGAAAGTGAACTCCAGACTAAAGGGAAGGTCCGAATGGGCCACCAATTGAAGGGCCC^ 

TATTCAGAACCGCCTTTCTTAGTGGTCTGCGTTATCACTGATGCTGGGACCACAGCAGCAAACT 

TTCAATTGGCTATTGAAGGCAGCACTTACTCCTGAGAATAAAAAC^ 

5 0 GGTGTTTTTGAATTCTGTAGTGTTAGAGTTAGCCAGACA^ 

TGGTAGCTGGTAAGAAATGAACCTTAGTTTTAGGACCTGGTAACAA 

GCGGGCCTCTTG AGG AAAGCCACAGAGCACACATGAATCTCAGCTGATCCT CTG CCAAG ATTGGGGG 

AGCTGATGAGTTTTAG AGGT AT ATAAGGCTTGCG CTT AGAGAATGGG AGTAAACCTGCCATC CTTG ATG ACCCAG CACATCTCAAT 

5 5 CACCAAGCTACACATCATCCACACTCTTGCCATCTGTGCAGCAAGTGCACT 

ATTTAAAAAGTGAAAAGAGACACTTCACACACACACACACACACACAC^ 
CCACAACCTCTCTTCAGTTTCACAATCAAGTAAGTACCTGCGATGAGTCA 
TGTCTACCACAACCCTCTCCCTGGGTTCTAACCCTGGTCACCAGTGACC^ 
TTGCCTAAGCTCTTTTTGGGTGGAATTGAATGCTGTTACTTTACATC 

6 0 GCTGACAAGTTCATCCATCATCATGGGTCAGTTTTAGATCACTTTTCATCCCTC^ 

TG CAT CCTCCACTGGTCTCAGGTTT ATTCTCTAAACATGTCrTATGGATGTTTTA 

TTGTAAAAGTGCCTCTCTGGCTCCTGTAATGTCGCCCAGTTTCACTCCT 

TGTAGCTGCCATCGTCCTTCTCATTTCATGTAGCCAATGCAGTC^ 

CAGG ATATCCACAG AAACCTCTCGAGG AOLACTCAGGACTCGCTGTTCT CTGGGTCTGGGTTTCCTGTGTGTGAAAAG C 

65 ACCTCACTGAAC CAGG CAG ACTCAGCCCTTGGCCCTTCCTTCCAAGCrCAT ATCTAGTAATTTGTC^ 

AATACTTGTACCAC^TATAGGCATTTTCCCAA ITITIA 
TCAGCCCTTGCAGATCCCTCCTATAAGACATTTTCAATC^ 
GACCrC^GTCAAGCTAGTTTTGGATGCITATGAGTAGTCCCCTGA 

CTTTCCCCACAACCCTAAGGAAG CAG CCTTTTTGAAATTGCCAT ATAGATTTTTTAGGTTAAATAAT^ 

7 0 TTAAAAAATCTTTCAX3AGAAAATATTCCTGGAAAACAGTGCTCATCAGTGCAGAGGCTGGGGCT 

GCCCGTGACCCTGGGGTCTGCCCATAACACTATAAAACCACCACCACCAAAACCGTCAATGTAATCAGTTCCCAAA^ 

GGCTCACTATACTAACTGTCATGCCTGTACTTACATTGGAAAATCCTTATAGCCGGG 

CCTTGGGAGGCAGAGGCAGGCAGACCTCTGAGTTCGAGGCCAGCCTGGATGTT^ 

CCAGTGAC CC CAGATCTG CCCTAT AT ACTG T CTGGGT AGAAG TT ACAAGGAGT AT ATTTTTCAAACT CAAACTTCACACGT AAG AT 
7 5 GGGTGCATCrTAGGGCTAGAGAGATGCTTAAGGGGTTAAGAACACTTGTTACTCT^ 
T ACATGGTGG CTCACAGCCAAC CAAGT AACAATTC CAGGTG ATC CAGT AAT CTCT CCTG ACCACTGTG CAC CGCKSCACACAACTGG 
TGTGCATGCGTATACACAGGCAAAGCACTCATACACAAAGTAATTTTTAA^ 

T AAG TCTGGG CTTT CAGAAGCAG TGGGGACXIATGTGCTTTG AT AACATTG^ ACTCAAGG 
CTGGAGAGATGGCTCAGCAGATAAGAGTCCTGAATTCCCAGCA^ 
5 CTCTTCTGGTGTGCCTCAAGJVCAGCAACAGTGTACTCATCTAT 

AGATTCATGCAAAGTTTACTGATAGGACTGAATGTAGGGAATCAA 
CXSTGCCACTGJ^CACATACTTGGCATTTTAATCATTCA 
TGGGGTGACCTTATTGTGAAAGGAATAGTTACCTCATTTGGAGAGG 
TTCCCATAATCACTAAAGAATCTAGATAAGGTCCTTCTTGCTCCCTGTGC 
1 0 CTTGCTGTGTTCCTGCCCTGCTGAGATTCTCTAGAAGGCACAGTAGGGATGTGTGCT 

AGAGAGCTAATGGTAACAGTCAGTAGTGAGGCAGCCAGGGCTGTGGACACAAGAGGT^ 
TATGGAGCTTTTGTTCTACAAGACAGGGTTTCTCTGTGTACCCT^ 

ACTCATAAAGATCTGCCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGTGTGCCACCACCACCCAGCAGGTATO 
1 5 GAGTAAAAACAAACAAAACCCCrrAAAGTrCCTTTAAGGGACTGAGAATGCAACTGAC^ 

TCAGAAGGGATGGGGCCCTAGATAGGTTTGCAGGTC^GAGGACAAGGGCITATA 

TGCTTGCTGCATGTACAGITAACAGCACLAGGAAATA^ 

AGTCTirGTCTCCCTCTCTGAACGGAACATCACTATC 

GTTAGCACTTGGTACTAGCCrcCTAAGAATGCCCTCTGTTATTTTAAT 
2 0 AATTCCAGGAAAATGTTATTCCCTTTCCACAGAACrAGTTAGCAAAC^ 

CAGCTGCCTCTTAATCGCAGCAGCTCCATTACCACTCAACCA.TACrGTG^ 

AGC^TTTTTGTCCTITTCTGTTTTAAG 

TGGGAGGCTGGGGAGGCCTCAAGTTTAAGGCCAGCCTGGGTTCTACAAGCAAGA 
AGCGTACCTGCTTCCTGTTCAAGGAAAGACTTGTCATCACTTCT 

2 5 CACAAAGAGCCAGGAGGCTGAGGATGAACGTTTTGCAGAACAATGAACATCCATTCCAGGAT 

GAAGTTGTCTGAGGCTGTTCGCACTCCCATCTCCTTCCTGTTGCTTC 

TACCCACTAACCTATCAGAGACAGCACCCTAAAACAAATTCCGACCTAGCCGGGAAGCAC^GTAGCT 

AGGCTGAGTTGCTGAGTCAGGAGAATAGCTAAACTCAAGATGAGCCTGG^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNKN^^ 

3 0 CTT AGG AT ACC CAAGAT AT AAGAT AC AATTTG CT AAACG CATGAAATTCAAGAAG AACGAAG ACCAAAGTGTGGACACTTTG C CCC 

TTCTTAGAAATGGGAAC^AAACACTCATGGAAGGAGTTACAGAGACAAAAT^ 

TTGCCATATGCAGGGACCCATCCCATAATCAGCTTCCAAACGTTGACACCATTGCATACACTAGCAAGATT^ 
AGATATAGCTGTCTCTTGTG AG ACTATGCCGGGG CCTAGCAAACACAG AAGTGG ATGATCACAGTCAG CTATTGGATGGATCACAC 
GGCCCC CAATGG AGG AG CT AG AGAAAGT ACCCAAGGAGCT AAAGGG ATCTG C AAC CCT AT AG GTGGAACAAC^ C 

3 5 AGTACCCCXjGJ^CTCTTGTCTCTAGCTGCATATGTATCAAAAGATGGCCTAGTCAG 

tgcaaactttatatgccccagtacac^kksaaccccagggccaaaaagggggagtgggtgtgta 
ggacttttgggatagcattgaaaatgtaaacgaggaaaatacctaataaaaaataaattaa 
ttaccactgttctaagtagttattaaacgcaatatgagatcaggcccacagg 
caggaacaggcttggtataagctaaggttgaagcatgagtgttgcctgaaataagcccc!aat 

4 0 aggcaagtgccttgtgtgggggctagagcaccrrcccagacagcacaaatgtat^ 

tatgctgagctgggggaagccavtactggctacaccctgtgtgtgccctacttttc 

aaggcctaggaagctaggaagaggaaatgaggccctgtttcccaattat 

ggagcccctgggggtgg<x;gtggggtcttgttacagacacaaaag 

t agaatttt aaagcatactgtagcctct agg ctgctggagttgg c acag aaagctcaac^acgattacaacattttaagagggtgt 

4 5 ctcctx5agagaaccggccagaccaggcaggcagggaaaggccctgttcagctcca 

ccattcccctccactgagtaaaaaccccagggcctttgcacatgctgttattcatacctc 

tcaaaat acatatgtccatctctttacaccatttgg cctctcttg aagtcct ac ag acataagac acagctc ctcttacttaatga 

ttggccagctagccaatggccaccaggattacctcrrcacccagtgggctctct^ 

acatagcctcacttcaagtcacggggcrctgaaaccttgattagaagataattw 

5 0 gagtttccgcctacatcagaggcctgtcactaggtg^ 

ggtatacagaattcggacgttcttgagaccacccrcaagggaaat ag cg catctgtcact atttgcctggcaaagcgaaggtggcc 
caggtgggtgggttgtccaaacccatacatcacaaattct^ 
gggacttcrcacctctgaaatgcagatgctgtccagagctgcagggtagctgtccaa 
gcaggatcaagggatgccctcgggtttacccgtgctgttcagagaacggci^^ 

5 5 agacaggtgtcatgtctg<*agaatagaggcaaagtccgcggtg^ 

tgggccacgaagctgtgctcatttgaagcacagtgggaattc^ 

gatggajggcjvagacccaagaggatagtgtacacaga^ cc ccaaagg 

caggagggaagagcctcagtggattacttagggatgagggagagaagaaaaaaggttct 

ggagttcactgccatatagagaaggtgtagcgggtgaaaggggccatgtgatgaggatggcaagcmggctgtggcgcagatgacg 

6 0 AGATGCCTGGGTCGGGAGGTCAGGGGAGACCCAGGATTGGGGTC AC CTGTGTCTX5CX3 CAG AGGGGAAGCCACCCTGCAACTGGCCC 

AGCACTCAGTCCAGAGGAAAATGAGGCAGAGGACAAACCAGAGCTTCGGA 
AGGGCAGCGGACCACGCGAGAGGCCTCGAAGGCC^CCGGACCCGCGTCCGAGAGTCT 
CTCCCCAGAGGCCACACTCCyATCCCACCCTAGAACCCGTCTGTCTGCT 
GGCAGGCGTGCTGTCIACTCACGGCCrGTTAGCC^TCCACGAGTGCGCCCCT 

6 5 CGCCACCGCGGGTGTGAGGAGGCCAGGCTGGCGCGGCrCCCT 

GGGGGTCX3CGCAGCCCCCTCGCTCCCTGGGAGGCX3TGCACACTGCCGCGGCGGGTCC 
NNNNNNNNNNNNNNNNNNNNNNNNNNNh^^ 
tO^WNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NKNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNI^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

7 5 CAAACACITGGGCAAACTTTCGGGCCTCGCGCCTGGCGCCCCGTCTCCGCCCAG 
GGGCATCCGGGTCTTGGATCACCTGATAGGAC^ 

GAGTCCTCACTGGTTGGACAAGTGGAGTGGGATTCCGGCCCTTAGCATCGGGCGGC^ 
GGGACCCCCTCATCCAACCTGAGAACCTGGGGAACCGACAAGATCrrCCTGCCCACTGCCATTTCT 
AAACTCCTAAGAGCTCCGGGATGGGCTTATTGGCXSCAAGAACCTTTGGJ^ 
5 CTGGTTGTGGAGCAGAGCGGGCCTACTCATCTCCCTCTTCrrrTGGGAGGATGGGGTA 

TCCCCATGTCCCGTGGATGGTT 1 1 1 1 1 ' TT l TriTITlTriTr GCCCCAGATCTGAATTTTGGAGGTCCA 

MOUSE SEQUENCE - mRNA 

ATGAGTGGGGACCATCTCCACAACGATTCCGAGATCGAAGCGGATT^ 
1 0 AGATCGAGAACACCGGCATAAGGAGCACAAGAAGGATAAGGATAAGGACCGGGAAAAGTCTAAGC^ 
ATTCTGAAAAGAAACACAAAGAGAAAGAGAAAACCAAACACAAAGATGGCAGCTCAGAAAA^ 
GACAAGGAAAGACCGAAGGAGGAAAAGATTAGAGCTGCTGGGGATGCAAAAATAAAGAAGGAA 

GCCACGAATT AAAGATG AAC CTG AGG ATGATGG CTATTTTG XT C CTCCAAAAGAGGAT ATT AAG CCATTGAAG AGG CTTCGAG ATG 

AAGATGATGCTGATTATAAACCTAAGAAAATTAAGACAGAAGATATCAAGGAGAAGAAAAGA 
1 5 AAATTAAAAAAG CCC AAGAAT AAAGAT AAAGAT AAAAAAGTTGCTG AG CCAGAT AAT AAG^ CCG AAAAAGG AAGAGGA 

AGAGAAGTGGAAATGGTGGGAAGAAGAACGTTATCCAGAAGGCATCAAATGGAAA CCT AG AGCATAAAGGG CCTGTCTTCGCTC 

CACCATATGAGCCTCTGCCAGAGAGTGTCAAGTTITACTATGATGGTAAAGTTATGAAGCTGAGTC 

ACATTCTTTGCAAAAATGCTTGACCACGAATATACTACTAAGGAAATATTCTGGAAAAA'r 

GACTAATGACGAAAAAAATACGATTACCAACCrAAGCAAATGCGACTTTACACAGATGAGCCAGTATTTC 
2 0 CTCGGAAACAGATGAGCAAGGAAGAAAAATTGAAAATCAAAGA 

GATAATCATAGAGAGCGGATTGCCAACITCAAGATAGAGCCTCCGG^ 

GCTGAAGAGAAGGATCATGCCTGAGGACATCATCATCAACTGTAGCAAAGACX3CAAAGGTTCCTTCT 
GGAAAGAAGTCCGACATGATAACAAGGTTACTTGGCTGGTCTCC 

AATCCCAGTTCACGAATCAAGGGTGAGAAAGACTGGCAGAAGTATGAGACTGCGCGGCGTCTGAAGAA^ 

2 5 AAATCAGT ATCGGG AAG ACTGG AAGTC CAAAG AG ATGAAGGTTCGTCAGAG AG CTGT AGCACTGTACTTCATTGATAAG CTTGCTC 

TGAGAGCAGGCAATGAGAAGGAAGAAGGAGAGACGGCAGACACTGTGGGCTGCTGTTCACrTA 
CCAGAATTGGATGTTCAGGAGTATGTGGTGGAATTTGACTTTCCTTGGAAGGA 
GAAACGAGTTTTTAAAAACTTACAACTGTTTATGGAGAACAAGCAGCCT 
TAAATAAACATCTTCAGGATCTGATGGAGGGCTTGACTGCGAA 
30 CTTAAAGAGCTCACAGCCCCTGATGAGAATGTACCAGCGAAGATTCTATCTTATAACCGTGCCAATCGAGC^ 
TAACCACCAGAGGGCGCCACCAAAGACCTTTGAGAAGTC^TGATGAACTTGCAGTCTAAGA 
CAGATGCTCX5AAGGGACCTGAAAAGTGCTAAGGCTGATGCCAAAGTCATGAAGGATGCA 
AAGAAGGCTGTACAGAGACTAGAAGAGCAGTTGATGAAGCTGGAAGTTCAAGCCACAGATC 
GGGGACCTCCAAACTCAATTATCTGGACCCTAGGATCACAGTGGCTTGGTGCAAAAAGTGG^ 

3 5 ACAAAACCCAGAGGGAGAAGTTTGCTTGGGCCATTGATATGACTTATGAGGACTACGAGTTTTGACCAGTCTCG^ 

TGAAAAGGAACAGTGTGGTTTGGGGAAGATGG ATAAACTGAG CCTCACTTGCCCTCACACCCGAGGGAGAGGC CACAAGTCTTAAC 
AAACCAACATCTTTGCAAAAAGATAGACCTGGAGATACTATAAGGGAGAGCTGAGCCAGTT 
TATTTCAGATATCAAAATTCTAGCTGTATGATTTGTTTTGAATTTTGT^ 
GCATCCTGCCAGGCAAATTCAGTTTTGCTGCAATGTTTGGATT 

4 0 TTT ACAGTT AGGGTT TTG CAAT AACTTCT AT ATTTT AAT AGAAACAAATTC CT AAACTC C CT CCTCTC C CCTATTT CAGGAATTT A 

AAATTAAGTAGAACAAAACAAAAACCCAGCGCACCTATTACAGCTGTCATTCTATCGTTAT^ 

AGCAGTTGTGACATTGGCAGTGTTTTGGTTCAGACACCTATTCACAGAAAAGC^ 

CTTTTTAAATGTGAATTTTTATTTCGTTTTAATTATTTTAAAATA 

GTTGGGGAGGGATGAAGGGCGAGTGAGTCTAAGGATAATGTAATAATCAGTGACTGAAACCATTTTCCCA^ 

4 5 CCATTCTTTGTACCCTTTAAGAGATCCATCTTTTCTTTGGA 

ACAGGTCAATAAATTTAGAGGAAAATGAGCATTTGGTCCAAGAAAAACAAAAACCAAAACAAA^ 
ATTTTAGGGCTTTTATTTTTTCTTTTGTAATTGTGTAAAAAATGGAAAAAAAAACATAAAAA 
TTTTTTGCTATAATGATTAGTTTTAGAGGCATTGTTAGTTAGTGT^ 
TATTTTTATCATGAATTC 


MOUSE SEQUENCE - CODING 

ATGAGTGGGGACCATCTCCACAACGATTCCCAGATCGAAGCGGATTTCCGATTGAATGATTCT 

AGATCGAGAACACCGGCATAAGGAGCACAAGAAGGATAAGGATAAGGACCGGGA 

ATTCTGAAAAGAAACACAAAGAGAAAGAGAAAACCAAACACAAAGATGGCAGC^ 

5 5 GACAAGGAAAGACCGAAGGAGGAAAAGATTAGAGCTGCTGGGGATGCAAAAATAAAGAAGGA 

GCCACGAATTAAAGATGAACCTGAGGATGATGGCTArTTTGTTCCTCCAAAAGAGGATAITAAG 
AAGATGATG CTG ATT ATAAAC CTT AAG AAAATT AAG ACAG AAGAT ATGAAG^ 

AAATTAAAAAAGCCCAAGAATAAAGATAAAGATAAAAAAGTTGCTGAGCCAGATAATAAGAAAAAGAAGCCGAAAAAGGAAGAGGA 
ACAG AAGTGG AAATGGTGGGAAG AAGAACGTTATCCAGAAGG CATCAAATGGAAATTCCTAG AGCATAAAGGG CCTGTCTTCGCTC 

6 0 CACGATATGAGCCTCTGCCAGAGAGTGTCAAGTTTTACTATGATGGTAAA^ 

ACATTCTTTGCAAAAATGCTTGACCACGAATATACTACTAAGGAAAT 

GACTAATGACGAAAAAAATACGATTACCAACCTAAGCAAATGCGACTTTACACAGATGAGCCAGTA 

CTCGGAAACAGATGAGCAAGGAAGAAAAATTGAAAATCAAAGAAGA 

GATAATCATAGAGAGCGGATTGCC^CTTCAAGATAGAGCCTCCGC^ 

6 5 GCTGAAGAGAAGGATCATGCCTGAGGACATCATCATCAACTGTAGCAAAGACGCAAA^ 

GG AAAG AAGT C CG ACATGAT AACAAGGTT ACTTGG CTGGTCT CCTG<5 ACAGA^ CATG CTG 

AAATCAGTATCGGGAAGACTGGAAGTCCAAAGAGATGAAGGTTCGTCAGA 
TGAGAGCAGGCAATGAGAAGGAAGAAGGAGAGACGGCAGACACTGTGGGCTGCT 

7 0 CCAGAATTGGATGTT(ZAGGAGTATGTGGTGGAATTTGAC 

GAAACG AG TTTTT AAAAACTT AGAACTGTTT ATGGAG A^ CTTTTTGAT CG ACTT AAT ACTGGTATTC 

TAAATAAACATCTTCAGGATCTCATGGAGGGC!TTGACTGCGAAGGTGTTCC^ 

CTT AAAGAGCTCACAGCCCCTG ATG AGAATG T ACCAGCGAAGATT CT ATCTT AT AACCGTGC CAATCGAGCTGTTG CAATT CTTTG 
TAACCACCAGAGGGCGCCACCAAAGACCTTTGAGAAGTCAAT^ 
7 5 CAGATGCTCGAAGGGACCTGAAAAGTGCTAAGGCTGATGCCAAAGTCATGAAG^ 
AAGAAGGCTGTACAGAGACTAGAAGAGCAGTTGATGAAGCTGGAAGTTCAAGCCA

GGGGACCTCCAAACTCAATTATCTGGACCCTAGGATCACAGTGGCTTGGTC 

ACAAAACCCAGAGGGAGAAGTTTGCTTGGGCCATTGATATGACITATGA 

HUMAN SEQUENCE - GENOMIC

TTCAAACTCCTGGGCTCAGGTAATCCCCGCACTTCAGTCTCC^ 

TTGAAAAACATTTTTGCGGGGAGAGATGAGGTCTTGCTATCITGCCTTGTCT 

CCTTGGTCTCCCAAAGCATTGAGATTACAG<nt5TGAGCCACCATGCCCA^ 

CITT ClTTTTTT r rT , rTTTTTrGACAGCGTCTTGCTCTGTTGCCCAGG CTGG AGTGCAATGGTGCAATCTCAG CTCACTGCAACCTC 
1 0 TGCCTCCCAGGTTTAAGCAATTATAGGTGTGTGCCACCAC^CCCAGCT 

TGTGTGTGTGTATTTTTAGTAGAGACGGGATTTCGCCA^ 

CTCAGCCTCCCAAGGTGCTAGGATTACAGGCATGAGCCACCATGCCT 

CTCCATTGCTCATAGCAACrCTCAGTGACTTCCCAGTGCTATGAAGATAAACT 

CTTGCTTACCTCTCCAGCrrTCATCTCrrCACTACT 
1 5 TAAACCCTGCTCTTTCCAGTTTCCAAATTTTTGCCCATCTC 

TCTCCCCAACCITTTACCrGGCTGACrrCTGTTTCCATTTTGAG 

TCTGTAATTCTGGATGGCAGTTCTTCTAAAGGGCTCCTACCGCAAAC^ 

GACAATTACTTGTTCCCTTCTCTGTCTCACCCACCAGACCTGAGCT^ 

ATTCCTAATACTAGCTTAGTACCTGGCTTATAGTTGATTGTGTGTGTGTGTGTGTGTGT 
2 0 GCAGGG AAGCCAAAG AAGGGTCAGGATG CAATG AT AAC CTGGGG AAGGAGG GACAATTACAAAGG AAAGTCTTG GGCAG ATTC CAT 

ACAGGTAG T AGCCACCAATGGG ATGTGGAGG ACAGGATGAAGGGTTAACTCAAGAOT CTT CAAGGTT CTGAGCCTGAG TGACTGGG 

AGGAATAGAGCTATTCACTAAGCAGAGACCATAGAAGCGAAGGGC^ 

GGGGTCATCAAGTGGACACATGAAATAGGCATATGGGTAAATATTTCTATGCT 

TAAGAAGCTACCTGGCCAGGCACGGTGGCTCACGCCTGTAATCTCAGC^ 

2 5 GGAGTTTGAGACCAGCCTGGCCAACATGGTAAAATCTCGTCTCrACTAAAAATA 

GTAATCCCAGCTACTCGG^AGGCTGAGGCAGGAGAATTACTTGAACCrGGGA 
CACTCCAGCCTGGGCAACAGAGTGAGACTCCGTCTCAAGATAAAAACAAGG 
GTCATGATATAATCTTCGGGATCC^TTAAATTGTGCTGAAAGAAGAGAG 
CCAGGAGAATTAATGGTCATTGGATCCAAATGCAAAGGACCTCAA^ 

3 0 GAGAAGAATAGGCTACTCTACTTGGAAACTAGGAAGTCCCTGATTAGAATA^ 

AAAGCAGAAAAGGCAGCACATGTAGACTGTCCrrTCTGAATCITTCAA^ 
GAAGCAGGTTAGGGGTTCTAAGACTGGGCrCCATACACACACTGCTTAAGTTCAAATCAT 
CITTGGGCAAGTTAACTTACTTAACTTTCCTAAGTCT 
TTAT^GCAATGATTATATGAAAATGTATGTGAAGCACTCAGCACAGGGCAG 

3 5 ATTTTAACAGGGTCAAG<3AGGAAAACTGGGAGGAGGAGAAGTTGCCATGCAG^ 

GAGGAGGTGGAAGGGAGGGGACrGAGGACTTGGAGGGCTTGTGGAGAGGGAGGATGGGGAAGTAGGC Cri ' ri 

GGCTAAAGGATCACCTGCTCGTTACTGAATGAAGCAGCTACTTCT 

GCTTCTCTCACTCCTAGAATCTTCCAGTGTTTCCTGGAGCCTGGTCTOT 

GGAGGCAGCTGGACTCAGG CAGCCGCAGG ATGTCCCAG CAGCACACATTGGGAGAGGGGGATTCAGAGAG AGCTG CTTCACAACTT 

4 6 CTCTTTGGGCTATACCCTTGACACAGGCCAAATCCTTCTGTCCCCGTCAC^ 

G ATCTT CC^TG ATTAAGGTTGCTrCTGAAACACAATGGTCTCCCTGGACCTCCG AG CTAATCTGCCTATTTCCAGTG TCAGTTCCA 
CCAAACTGTTCCCCCTCTGCTGCCTGTGATTGTTCrCATACCrGTTTAAAAC 

AACTGGG AGGG C CTCTC CCACCTG CC CAG CCTACATTGGTT AT AT CTTCACTTTT AC CCTCTGCT C CAT C CTG ACCAAAG CT AT GG 
CTCTCCATATATTGCCTCTCTTCTTGCAATATCTTTTCTTCT 

4 5 ACTAGAAACCTTTCCTAAT ATTGCCCCC!ACCACCACTGGTG AGTTGGATGTG CTCCTTTACTCAAG AGTTCCCACAG CAACTGGTG 

GCTGGCCTCCTCTTCCCTATTATTCCTTATACCTTAGTC 

CCAAAATGAGGATTAGGTCTTATAAATAGCTGTATCTCTAGGGCTTAGTATAGGACCTACCAACCT 

ATTTGCCTTCATTCCTCAGGGAATACTGGTTGAATGAATAAATGTCCTCATTTAGGA 

CAAGTGGACTACTITrTCTTGCCCCAAGCCACATCGCCAGG 

5 0 AGTTCCCAGCTTTCTTATTGATACTGACC ' X I TTT 1TTTT1T TT TTTT GAGATGGAGTTTCCCTCTCGTTGCCCAGGCTGGAGTGCA 

ATGGCGCAATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGCAATTCTCCT^ 
GGCGTGCAGCACCACGCCTGGCTAATTTTGTGGTTTTAGTAGAGAGGGGTTTCTCCATGT^ 
CCTCAGGTGATCCTCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCG^ 
ATTCCACACATCTCAACTGTTGCTGAGCTGrrTCATTCTAGGAGGCAAAGAGCTTTTCT^ 

5 5 TCTCCTTTCTTGAGCACCCAGCAGTTTCTTTTCCAAGGG 

AGCTGCTAGAAGACATCACTGTCAAAGAATTGGCAAAGTCAAGAATGTGCCAGCATCCAAGGG 
TGTTCCACAGCATGATAGCCCCAGAGGGGGTGCCAAAAGGATGACAAAAGCCTTGTGAGTGCCCAGGAA 
CCAACCCATCAGAGTGCAATTAAGCAAAAGGGGGAATCAATTTAAAGGTGCCAGAACAGGAAC^ 
AGTAGGAACGAGGGTCCTGAACTTCATTCATCAGGGTCCTCTCTCTCTCTTTTTTAA^ 

6 0 GAGTGCAGTGGCACGATCTTGGCTCATTG(nX3CCTCAGCCT 

GCACACACCACTATGCCCAGCTAATTTTTGTTTGTAGAGACAGCATTTTGCC^ 

AAATGATCCAGCCGCCTCAGG CTCCCAAAGTGTT AGG CTTACAGGTGTGAG CCATCTGGCGGGAGGG ACTTCTATCTCTTATCTCT 
TCTCITCCTCCACCCACTITCTCCrATATCCrCCCC 

CTCCCTAATCTACATACrACAGATCTATCCACAAGAAGACAGATGGAACCACrrCTTTTCTTCT 

6 5 TAACCTGAGTGGCTCAGCTTGGATCAAGTGCCCATCCTGGTCCAAACAACTGT^ 

CTCCATAATCTTGCAGTCCGGGGAAAGTGTGCAGATGGATGGGTCCCTGAA 

AGTTCTGGGAGACAAGAGTTGGGCCAGGTTGATAACTAGCCTAAGAGAGTCGGAATCAGATGAATAAGA^ 

AACCTAGATCAAGAGTCTTGAGGTTCAGTCATTTCAAAAAGCTCAAGTCTCA 

GTATCTGTTCTTACCAGTTTAATATCTGATACATCCTCTATCTGAGGATAATATAATAAATA^ 

7 0 TAGGAGCTTGTTCTGT CCACTCTACACATTGGCCTGGATTGCAGTACCT CCAGGAATGGTGCACCCTCTCTACCACCCC 

CCCAAGTCTCAAGAGGTAGTCTGGAAAAAAGTGTTTTTGGATTTACCT 
CAGGCTGGAGTGCAGTGGCGCCATCTTGGCTCACTGTAACCTCTGCCTCCA 
TAGCTGGGATTACAGGTGTGTGCCAACCATGCTCAGCTAATT^ 
AGGTCTCAAAGTCCTGACCTCAGGTGATCTGCCTCCCTTO 
7 5 CTAATTTTTGTAATTTTAGTAGAGACGAGTTTC 
GCCCTCGGCCTCCCAAAGTGCTGGCACTACAGGCATGAGCCACC
CATGTCCCTCTTTTCAAGAGTCACGTAAGTATTTACATATGGA^ 

AT AT AAT AATGT ATGT ATTTGTATGTGG AG CT CTT CTTTT AGTGG ATT ATCT AT AAAC CG T CCACTGTT CCCAATG CAT ACTTTGG 

AGACAGACCTACACTACCAGTTTTCACCTGCTAGTTATTITTATAAGGGTTGGGATC 
5 CAGTGTTTCCCAAACCTC^TTTATTTGCCTATA^ 

TTTATGTAAATCAATGACTTTTTTAAAAAGC^ 

TACATATCTATTAGAATTAAAATGCTTGTCCATGCACT 

GTTAGGAGTCAGGAGTGGTCTCTGTAAGACAGAGTAAGTCAGTCAACCTCTC 

CAGCCTGTGACAAGACTGCCCGGATATTTCCATGTCTGTGCACGGGGATG 
1 0 ATGCCTACCTAGTCTTCTTTGC^GGGGCCTGGAAAGCAAATGCAGGCTC 

GAGGCAAAGGTGGGCAGATCACCAGGTCAGGAGTTTGAGACCAGCCTGACC^ 

AAATTAGCTGGGTGTGGTGGTACGTGGCTGTAATCCCAGCTAC^ 

AGGTTGCAGTAAGCCCAGATGGCACCACTGCACTCCAGCCT 

AAGCAAATGCAGATTCTCCCT^X^AAACGGGGGGTTGGGAGAGTGGA^ 
1 5 AATTATCTTGGACTCACTGCAGCAAGCAGATGCCT 

ACCAACAGCAAGGCAAAACTGGCTCTGGCAGCTTGCTTCAGAGA 

AAGACTCAGCACTTACTAGGTGCCTGCCCTrCAAACCATTATTTTTTC 

TGAC^AAACAGGGACTCAGAGAAGTTAAACAACTTGCCTGATATCACACAGGTAAC^ 

GATTTGACTTCCAAATGTGTGCTCCTAACCAAAGCAGGGTAACCTGCTTTGATTTGCCC^ 
2 0 GG CCTGTTGCTGGGTG CTGGGAATTCAGTGGT AGG CAAGATGG ACAAG ATTC C CC C CT CT CA CAGAC CTTGCTTG CT AT AGGGG AA 

ATAGACAGGAAACCAATTACTACATTTGTAACn^AATAACAAGTGTCATAAGT 

TTATTTTTAT'rrGTlTATTTATTTTTTGAGACTCACTCG^i^i\ii i<3CCCAGGCTGGGGTACAGTGGT<3CGGTCTCGGCTCACTGC 

AACCTCTGCCTCCCAGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTA^ 

AATTTTTGTATTTTTAGTAGAGACAGGGTTTCAC(^TGTTGGCCAGGCTG 

2 5 TGACCTCCCAAAGTGCTGTGATTA(^GGCGTTAGCCACCGCGCCTGGCCTATAAGGCACCCTTGGTCAG 

AGTAGTGCGCCGGAGAGTCACTTTGAAAAAGTGATATTTAGGTTGATCTTGA^ 

TAAACATCTGATGGTAAACATTAGATGTGATAAAGATACTGATGGTCCACTGGATC 

TCATAGATATTATTTCACAGATAACACTTAGGATGAAGG 

TGGTAATAGATGTGGCAATATCCACAAAGGAGTAATGGTTAGATTAGCATATTGAGAAGATCACT^ 

3 0 TTTACGAGAGGG CCAAGAATGGAAGCAG CAATCCCTGTT AG AAG TGT AG TGTTTTGGAG TG CGGTG'i 1 1 TTTGTTGGTTTGTTTTT 

CCACCTCAGCCTCCCGAGTAGCCGGGACTACAC^CACATGCCACCACAC^ 

CTGCCATGTTGCCCAGGCTGGTCTTGAACTCCTGGGTTCAAGTGATCTGCCAGCCT 

TGAGA(^CCACCCCCAGCCAGCACATTGTATTAATCCAAGCAGGA 

3 5 GATGAAGCAAAGAGATGCAATCAGCGGGACTAGTGGGAAATGAGGGAAAGGAAGGTGTCAGGA^ 

AGTTCAGTTCATTCCCAGAGAAGCTTGAGAGGTATCCAACTATACATGCAAGGTAGCTGG 
TTTGACTTTTGTAATACAGGCGACCTCACAGAGCTATTGTGAGAATTCAGAGCT 
TCTATTCCCCATTTGCCCCGTCCTCCCAGCATGCCAGGGATACAGGGGCGGATAG 
TTCCCCCTTCCCTCAGCACAATTCCTGGGCTCATAGCGGATGCTCAATAAGTATTTGTAGA^ 

4 0 GTGACCCAGGGCTACATGTACTCGGCGGAAAACTGGGGTTAAACCGGACCCTTGGTTGGTTGAAA 

GTGGGCX3TGAAATAATCCAGAAGTTCGAGGGCCTGGACAAAGGCT 

TAGCAAGAAAGGAGGCGCTGGGGAGACCTCTGGGGTCCCAGCTGCCAAGGTACTCGCCGCT 

CAGCTCTGCCCAAGATTGCTGGGTCCCGCCGCGGCCTTTAAAGGrc^ 

GCCCGGGTGGCGCCTGCGCACTGCCrAGCAGAGGGGCTGGGGTTCCGAGAAAAAGCGTCT 
4 5 TGGG ACGCCGTGGGAGGAGTCGGCTCCTTATGCAAATCACAG CGGAGCGCGCACGGTCCGGAGGCGGGGCTTG CG ATG CAAAG ACA 

GGTCCGTCTGGCGAACAGCGAGGGGGCtX3GCOGOUVCCCTCTGCCT 

GCGGCCGCGCAGGCGCAGTGAGCCCAAATGCGAACTTAGGCTGTTACACAACTGCTGG^ 

CAGGCAGCGTCGCCGCCGTGGTAGCAGCCTCAGCCGTTTCTGGAGTCTCGGGCCC^ 

AGCCTCCGGAGTCCCTGTCCGCCCXXyVCAGGCCGGTTCGCCGTCrGCX3TCrCCCCCACGCCGCCT 
50 CTCCGGGCCGACATGAGTGGGGACCACCTCCACAACGATTC 

GGCaSTCCCGCX3ACCCCC<^rcCAGGCCCCGACCC^ 

CTGCCGGAGTAGATCGGCTCGCTAGGCCGCGAGCGAGGC^CGCGAAGTTACAGTTCGAGGCAGG^ 
CCCCGGGTCCCCGTCCTCCCCGGGGGCGC^GG«3TGAGCCAGACC^^ 
TGTTTGATATTCTCTCCTTTTCTTTTTCCAGATCGAAGC^ 
55 CCCCCC^GCCGCOKK:CGCCTCCCCCGCGCCCTGCCGGTGCCGGGCAGAGGACAGACATGGTOTCCC^ 

AAGAGGGGACAACGGAGACCCCGTGTCGTCCGCCACCGG^CCrCGGGCG^TCTTTCCGGGCCGGGATTCCT 
TTGTCGACGGTCGGGACTTAGTCTCTGCGCCATTTTCTTTTTCTCTCTCCT 
CTCC^CTCTTTCCTTGAGCTGCCCAGAATGAGCTTTCTGCAGAGCAA^ 
6 0 GGGGGACTGAATTATTATrTTTTAAACTTTATTTTG 

TGAAAGATTTCCTTAAGCAACTGTTAATTTTTTTTTTGATAGGCT 
TTGTGGAGTTCTTTATATTTCCAGGTTGTTTTTCCCAA^ 

AGTTCTCTGTGC^TCACCCAATGTAAATGTTGGATTTTTAAAAAATTATGTTAGTCACT 

CAATG CAGTG TTTTTCTGGAAACG CAGTTT AGTG T ACTTGTTGG AAGG AAAGATT GTTGAG CAT CTCTTG TGG ACCTGGTGTG TTG 

6 5 GCCnTTTCGTTTGGGTTGTAAGGAGGAAAAAAAGATTTTATTTATTGAAGCAAAGAGTGTCCAGT 

ATTCTTTTTCACGAGCCACTGTTTTTCAC^IAATCAAGAACT 

CAGCCAAATCAAGG CATTTATTCTTACTGTATTTCCACTTCAGTT ACCCAGGGAAGAGTCCTCCAAT A C AG 

GCTTTCGAACAACCCTGTTGAAGGAGAAGTTACCTTCTGTAGTGTGCTGC^A^ 

CAATTGAGTTTGGGTTCCGGGAACCTCTCCAATGGGAGAAGTTGATGAG 

7 0 TAGCCTCCTCTACTGAGAGCAACCTTATGTTCAGAGTTTGAAGTGGGGCA 

GGTTTAAAGTATGGCATTTCTTGAGCACCAGATATGTGCCAACCACTGTGCTTC^ 
CCTTGCTCTCAATTCAGTCTATTGCtfSGGAGACATAATC 
TGAAGTATAGGGGAGTTAATTTTGTCTCvAGGGTGGGGAAATTTCTCAGA^ 
GCGCAGAGAATTGTCAGGGOTTGGAGTGTTTCTTAACTACGTTTAA 
7 5 TG CAGTTT AG AAAG C CACAGT AT AG ACATGTTT (HTTGGGATACAGG AG CAAAGAAAGGATG TG CkGGTGAGGATG T7L* GAAGTCC 
TTTTGATCTTGCTAGAGGAAGGTGGGATCTTT

AATGAAACAAAGTTTAGCCCTTTCTTGTGAGAAAGTTAACTTACATGGTTCTCA 

GG TTATTGATCTTT ACTG ACATTTTTAT AGAGGCTGAAG T AATCTG CG T ATG TTG ACITTT AG AATGTGTTGGCTCACATTT AAT A 

GCAATAATAGCTAATATTTATTGAATACTTACTGCCAGGC^TGAATCTCAG 
5 TGGAAGGACCCTTGTGAGTTAGGTCCTGTTATTTCTGTTTTGTAGATGAGGAAACT 

AGGGCTGTGAGTTGGCCACTCTAAATCCCATGAGCTGAGCCACTATGCTAGAGTGATACT 

AAGCCACAAATCTCTATTTTCTITACAGAGCTGAAAAAGGAGAGAAGG 

TGCTTTTTGAAGAGAATCTGCCACCATAAAGTTCTGACATTGGTGTAAT^ 

GTAACTATK5CTTGCTATCTCAAGTAATACATTC 
1 0 ATCAATGTTTAATTATGCCCAGACATGTCATTATTGAACTTTGTATTAAGAGATTG^ 

ATCTTACAGATTAACTGTACAAGTTCATATATTAAAATCAAACG 

TCCCCCCCGTCCCCCCACTTTGGAGCTCATTCrrTCGCAATATTTAAAATTGTAGCT 

ATACTCAACTGCTGGGAGACTCTGTTATTTATATTTAAAATGT^ 

TAAGTGAAAAGACTTCCTAAGTATTGGTAGAACTAACrGTAGTGGGGCTGAGTC^ 
1 5 TCTGTAGAATTGATAATGGTAATACCAGGTTAAAGTGTTAGCGATTCTTCTG 

CTATGGAGATAGGGAGTTCTCCCCCZATTTGACCCCCTTAGCRAAGCAC^ 

ATATAGGATCATGXSTTTGACATCATCTTGTGCTGATTCTGGTTGGCT^ 

TGTTTCTGATGCT AAT AG AT AG TTGT ACTT CTGT CTG TTGGGT ATGGGTTGGG ATGGGCGGTAGCAT AG CTG ATTG CGGTGGG GG A 
ATTTCAGATAGCTGATGATGTKX3CTATCTGAAAATTGCC 
2 0 GCAGC^TCCTCAGACTCAATCAGGAAATACTTAAGCATTATGCT 

AGGCACCITTTTAGAATTAGTGAGCCGCTAGGATTTTTATACTATGCTTTGA 
TTAGTTTACCAGCATATCAGATCTCACAAACAAATTTrAGGCTCGG 

CT AAAAG AATCAGAAAT ACG TTGTGGCTTGAAAGGTG CAAACT AAATAAT AT AGCT AGCAGT CTTTTC 
TTTGGTTCAGTAGATAGAGTAAAATGGAAAATAGGAAACCAAGGTCAGTCCCCAAATCT 

2 5 AAAAAAAAAAAGGAATTG AGTCTGACrrriT CC CCTTG CTGT CAGC AT AAAGATGGGGAATTGAGAGAGTTGGAAAG ACATT CATT 

AGTTTGTT CAGATTATT AAAT AACATG T CTT AAAGTTT AT ATTC CAAAATTGGGCTG TGAGG TTTTTTTG TTTTG CTTGTTTTCAG 
TGCATGTGTGGGGTGGGGAGGTGCAAAAAAATGTTTGCCTTCTAATATACAGCTTTGTTGTAT^ 
CCCTGCTGCTAATGGTATGGAAAATGATTCTGTTGTGCTTTCTGAATACATATCATTTTAGA^ 
GTTATTTCATATTAGTTGGACATGAATGCATATTTATATGACACATCATC 

3 0 GTAAAACTGATATTCTTCAAATCCAAAGTCGTTTCCTCAGAAGTGTGTTTTGTGAGGGA 

CGAACTCCAGTAAACTGTGTTTGCTTGGGAAAATGAATTCAAATGTAATTAGTTTGAATGAAGC^ 
ATTTTGACATTGGACCTGGAGGTTTAAAAAAAAAAACCCTATTGAAAGAATAGT 
TGAGTGAATTTTCTGTGTTGTATGGTTTAGATTATCTGACTATCCCTTAATTATTTTGAA^ 
TTTAGGCAAACAGTTGACACATGGATTTCTTAAGGTAAAAAGTTTAG 

3 5 TGTTTAAAGGAAGCTTTCAGATTGTGTTTAAACTTCGAGTAGTAATATT^ 

ATCTCAAGTTTGAGCAGCAGCTTTCCATCCCCATGGCTCCAGGCCAATAGCTCCAG 

AATAATT AG CATTTTG CCAACCCC C AC C CTC CCTG C CACCTG CCT ATCT AGATGTGG ACAGT ATG TG CTGTGTTG AT AG CAT AGGG 

CCTGGGAAGGGTGGAGAGGGATGTAAATGAGGGCAGC^TCXIACAAAGTTTGATTG 

TGTTAGATATTGATGGTATTTTGGTGGCGTTATGAGAAATGGCTGATAGTTTGCTGAAATG 

4 0 AACATTTTCTGTGTGTATAGGAAAGATCCTAGATATTTTTCACACTGGGAGAGAAGTGATGTTAC 

GGAGTATATAGATCGTAATATCAACrTTGGCTATCTGTATCGAAGTTTGTAT^ 
ACXsTTTAGTAAGAAAGAAATTCCAAGGAAGGTAGGCAACTTTTAATTTTAA 

GGGCAGAC AGGGTTTCTGTTTGTTG CCCGGGCTGGAGCGCAGTAGTGTG ATTG CAGCTCACTGCAACCT CTGCCTC CTGGGCTCAA 
GCG ATCCTCCTACCTCAC CCTC CCAAGTAGCTGG AACTACAGGTGCATGTCAC CACGCCGGG CTAATTTTTGCATTTTTTGTAGAG 

4 5 ACGAGGTTTCTCGCCATGTTGCCCAGGCTGGTCTTGAACCCCTGGGCT 

CATGAGCCACTGTGCCTGACCCTTAATTTCTTTTTGGTAGGTATTCATT^ 

TACCrGTCTGCTAAATTTTTCTGAAATGTGTAATAAATGTTTTAATTTA 

GGGAGGCATCTCTACCAGGCAATTAATTCCCCTCTGAGCCTTTGAAATTAACATT^ 

ACAC AGTTCATT AGCATT CCT AAG AAACAC CT AGGT AAG CTCC AACAAAAG ATGTT AAAG CAAT CAG TATTTTT AGGG ACTT CAG A 

5 0 T CTC CTGTATTTGTTGTTGT AAATG AT G ATGTTGGG AAAAGAATTT AACAG ATCC CTGGGTATT CTG ACGTTTC 

GAACGAAACCGACCCAGCCAGCTTGTGGTCCCATTATTCTTTATATTGTTTTTCTT 
ACTTTTCTTGGCTATTGGAGTTTATAGAAAAAAAGTTCTTrc 

TAG T AG AAGTC C AAATGTT AATCTTGCT ATCAT AT ATG T ATG AAT G CCCAGATTT AT AAAG CTTTGT AGGT AT ATT ACTGATTG TG 
CAG*GAGACTAATCTAGGAGGCTTCAGGATCATTGTCATTTGTGGAT 

5 5 GGTAGCCCCTTTGTTTCCTTGTAGTTCACTATGTGAGCTCTTGCCA^ 

TATTAGGCACCACATTTTTATGTGCATTAAATCTGAATTGAATAAAGATAATATAGATTGCATA^ 
CCAGGAAAGGAAACG rn 'TT'rrCTTTTTACCCTTTATGCCTTrrCATGTCCC 
GGAAACCCCACTCAGGCTGTAATTGGTAAGCTTCTAGTACAGATGTCAGAACCTGCCCTTGATT 
GACCCTTTGTATGTGGAATGTTAGGTCTAGGGGAAGAGT^ 

6 0 TTCCGAAAAATCGAGATAAGTCCCTGTTCTTCCTATGAAACATGATCTGACCACTGA^ 

AAT ATTT ATG TTT CACAAACTGTTG AG AATT AG AT AAG T AT ATG AAGG TG CCT CTCTGTC CT CATTGT AAAATGT AAACTTTG ATG 
AGACAGCCTCTATTCTCAACTGGATCTTTATTTTGAATGTTAGGTTTGAAATAGTGATTGATGTTGG 
TATTCCCTGTTAAAGTTTTAAATCTTCTTGACATTTACACAGTGA^ 
GAATCAGCAGAGCCTACACTGATGTCTTCCAGTTACAAAG^ 

6 5 AGAAGGATCCAATTGTATATCTAGTTCCTA ClT TTCTTT' r ' lTi ' T ri 1 1 1 1 J T TTGAGATGGAGTCTCACTCTGTTACCCAGGCTGG 

AGTGCAGTGGCGCAATCrCGGCrCACTGCAAGCTCCGCCTCCCAGGT^ 

ACT ACAGG CG CCCG C CAC CACG CCTGGCTAA'ri.'T'r 1" 1'GT ATTTTT AGT AG AT ACAGGGTTT CAC CGTGTT AG CCAGGATGGT CT CG 

ATCTCCTGACCTCGTGATCCACCAGCCTCGGCCTCCCAAAGTC 

CTTTTCTAAGAATGTTTAC^GCAAAGTGGTCATTC 

7 0 TATTGATGTGACTAATTTCTCAACCATTTAATAGA^ 

AAATGCCAAGTTCATTATTTGTTAATTTTTTAAAACCACAAAATTCAATATG 
TTTACTTCTCCCCCAAAACGTTTTTAAATTAAACAGAGAACTCTCAAAGCACCTGAGAATTA 
AGTTG CTT AAAGTG AC CAACTTCGTTTTCTTTG AAAT AGT AATT 
TGTGAGAATTTAGTACCAaaTGCCGTGTTAACAGTTATGCGTGAGT.AT 
7 5 CAGTAGAAATTTCAGTAATGGCTCATAAGTCTAGCTAATGCrcAATATTC 
AAAGGAAATAAACAAAAGTCATTTAAGGATGTGTTCACTAGTGTTCATTAGGTTTAAGCCAAA
CCATTCAGCAAAAGGATTATCACAGAAGCATC^ 

TTGTATAGAAATTATGGAATAGGGCCTAATTTTAAAGAGTCTTCAAAGATAAACATATCAGG 
GGAAAAAAATCTGGGAAGATGAGTATTTGTTAATGCTClXn'AGCTC 
5 TGGATTTAAAAGTCTTCAAAGCTTTGGTCTCTTTrTCTTTCGTTTTCCT 
CCCTCTACATCCCCACCACCCCATAATAGAGACTTAAA^ 
TGAACAAGTTAATATCAG^CCTTAGTTCTTCACCTATAAAGTAGGTT^ 
AGAGGATTACTTTATGAAAGGTGAAGCTTAGAACAGAATTTTTAGTTCCTTG 

T CAGAAATTGCTTT ATT AGGCTTTTGT ACAAAGGG AACATCT CTTT AAATTTG CACATAT AT CT ATTGTG AAATTTT AA 

1 0 CCAAGCTGGAGGGGTCACTAGGGAGCATTGAGTATGATACTATTGCTGTAAATCATAGAAAGGCT 
GCTITGACTTTCCGGCAACTTCAGTTGTCCAGTGACAA^ 
TGGATTAAAAATCTTTGGCGCAGGCCTGGAGGCTCATGCCT^ 

C CCAGG AGTTCAAGATCAG CCTGGG CAACATGGG AAAAACCTGTCT CAACAAAAAAAAT ACAAAAATTAG CCAGG TGTGTTGGT GC 

ATGCCTGTAGTCCCAGCTACTTAGAAGGCTGAGGTGGAAG^ 
1 5 TGCCACTGTACCTACCCCGTTTTCGGCAACAGAGTG 

T AACTTT CT C CTTTCACACTTGGG ATGGTG CAAT ATTTGGCAT CT ACTT AGGAAG CTAAAATTTT CACGTTT AATTGT AAATGGAA 

AAGTTTAAGTTGCCATATTCATTCATACAAACATTCAGAAAGTGAA 

CCTATCAGGATGAACTTTGAAATTAGTACGTTGTA^ 

AAGGCTGCTGTCATACAACATGAATGGGGACTGGAGGTACrTT 
2 0 CCATTTAGTCCTAATCGTGAGTTTTTCCCCCCITAAAGAAATTGGAGACCT 

TTTCTTAAAATGTTACAAAAAATGGTTTTTTT^ 

AAGTTCTGTGTATTTATTTCCTGTTATCTTTTTGTATCAGACATCCATGT^ 
TC^TAATGGCTCATGCCTGTAATTCCACCATTTTAGGAG 

TAACACGGCGAAACCCCGTCTCTACTAAAAATACAAAAAACTAGCCGGGCGTGGTGGCGGGC 

2 5 GCTGAGGCAGGAGAATGGCATGAACCCAGGAGGCGGAGCTTGCAGTGAGCCGAGATT 

AGCGAGACTCCATCTCAAAAAAAAAAGAAAAATAGTTTTCA^ 
ATTCTGTGTGAGAAATCATGTGACAATAGCTAATTCAAGTATTTGAAATAA 
ATTTAAAATCACCATCTTCACATATTrGAAGAACTAAAATCrTAGAGAATAAACT 
AAACTATTGAAATATTATTGCCCCTTCCTAATCCATGAAGTATCT^ 

3 0 AAAGATGCAGAGTACATAGTGGTATTACTTAGAGTGGTTTGAAT 

TTAATCATACAGTCAGTGCATTTTTGCAGGAATGGATGCAATGTAAG 
GCTTCTTAGGAATTG ATTGTGG CTCAGGATCTATTT 
AACACAAATTTGTTAGTCrTTATAACTACT 

TGLAGAAGTCCCTAAGAAGTGCATTTATTGGAAATTTGATTTTTAAAAAAAAC 

3 5 GCCTGGTGGCGCATGCCTGTAGTCCTAGATACTCAGGAGGCTGAAGCAGGAGAATCACTTC 

AGCa3AGATGGCACCACTGCACTTCAGCCTGGGAGACAGCGAGGCATCGTCrCAAAA 
TTCTGTTTAAGTAGTATTTCITATACCCTGGAATGAATGAATC 
CAATACTTCTCCATTTTACTACATATTTTTAGGAAGGCACT 
TTGAGTATTGATGGCATAGTCACAGATACTCCATTTCCTTTCTAGG 

4 0 GT AACCAl titi CrCTG ACTTG CATTGAGu ; i in GTTATCC 1 CT rTT AAT ATTTCATTCTGTAGTAGTCTGT ATTT ATGTGTTTTC 

CTTATTAAGAAGTGGCCAGCACTTrcGG AGGCCIAAAGTGGGTGAATTG CTTTAGCCCAGGAGTTCT AAAGCAG CCTGGGCAACATG 
TCGGGACCCCGTCTCT ACAAAAAAATAACAAAAATT AGCCAAGCATGG TGGTGTG CACCTACAGTCCCAG CT ACTTGGGAGGCTG A 
GGAAGGAGGATCAGTTGAGCCTAGGAGGTTGAGGCTGCAGGG AG CCATGTTTG CCCTACTGTGCCCCAG AACG AG ACCCTGTCTCA 

4 5 GTGCAAACTCCITTCrCCCCCCAGCTGGTTGAAGATCAAGTGGGAGA 

AAGCTGATATGCCTAAAAGTCACCTGGGGATCATTGGTGAAAATACATATTCCTACT 
AAGGTACTGAAAATCTGCATTTTTAATGTATCTTCAACAA^ 

AG AAGTTGATCTATCTG AACTTT ACT ATCTTATTTGGCTAAAGATCT AT ATTTTGCT^ 
GGTATAAAGCTTACAGGGTGATGATGATGATGATGATGATGATTATTATTATTATTTGGAGACAG 

5 0 CTGGGGTGCAGTGACACGATCTGGGCTTGCrcCAACCCCCACCTCCT 

TGGG ATT AC AGG CAC AC CAC CACCACACCTGG CT AATTTTTGT ATTTTT AGTG^ AG ACGG AGTTTTG CC ATGTTGG CCACG ATGGT 
CCCAAACTTCTGGCCTCAAGTGATCCGGCTGCCTCAGCCTCCCAAAGTGCTTAGATTACAG 
ATTACAC^ATATTTTTAAAGGACCAGAACAGTATTTTTTAATATATGTTTATAAAAGCTTGATACTTCACT 
TACCTTTCAGTGAGCATTTACCAAATAGCAGATATAGTTAACTGCCTTTGCAGCTGTT^ 

5 5 CTCCT ACTTCAGTGTTT AACAAGC CATTT AATT CTGATGTT AC 1 T TT 1 CGGATTGGGTG ACAAG CAG AG CACAATGAG CAGGGTTT 

GTTTTC^TACTAATACTAGGATGGTTAAATCAGGACrTTAGAAA 
GGAGCTCACTGCCTTTAGATTAAATTAGTTCGGGACTTCAGCTCCT 

AT AGTTG ACG CTTGAACAACACG AG<M TT AGGG ATG CTG ACT CCT ATG C AGTCAAAAATTCG AG TT AT AACTTTTG ACTCCTCGAA 
AACTTAACTAATAC^CTACTGTTGACCAGAAGCCTTACTGATAACAGAAACAGTCAAT^ 

6 0 TATATACTATGATCTGACAATAAAGTAAGCTAGAGAAAAGGAAATATTAAG 

ATTAAGTGAAAATGGATCATCATAAAAGCCTTCATCCTCATTGTCTT 
CTGTCTCAGGAGTAGCAGAGGAGAAAGAAAACCCTCATGTAAGTGGACTCAAGCATT^ 
TGGTTCTCAATGTGTGGTTTCCTGATCAGCAGAACTGCATTC 
CCTACTGACTCTTAAACrCTGGGAGTAGGGTCTTAGCCATTT^ 

6 5 TTTCAGAACCAATAGAATAGAGCAAGATACCAGTTTTGTAGTAATTCA^ 

CATGTATGGAGTTCATTACTTGTGCTTTCTTACAGTTGTGAAGTTGGAG 

AAAATTTAAGGAGCAGTTATATGATTAACACCAGAAAGGTTTCT^ 

GTTCAGTCTTATTCrrTTCTACAGCTAGTGGCCCAGAGAATGGCCTGCGCATTTCAAGC 

7 0 TTAGTGTGTTTGGAGGCCCCTGACCATTGGAAACAAAC^CAGTTTAGGCTGAGCTGTGGATCrc 

TCACTCTCACTCTTCCCAG^CATTAAGCGTTCCTTAATGGAAGCAAA^ 

CTGGCCTGAG AAG ATACATTTTCAGGTTAGAC^lAGTCC'riTT T CCCTTTTAAGTTCTGCCTCTTGGGAAAGAAGATGCATTTGTAG 
TGCAGGGATGTCCTTGTG AAAATGTGACTTGTT AT AGAG CAG T ATTTGAAATGCAAGTG CCCAG CCGGG CGTGGTGG CTCACG CCT 
GTAATCCCIAGCACTrrxSGGAGGCCGAGGCAGGCGGAT 
7 5 CAT CTCT ACT AAAAATACAAAATT AG CCAGG CGTGGTGG CAGACACCTGTAATCCCAGCT 
GCTTGAACCCTGGAGGCGGAGGTTGCAGTO^CCAAGA
AAAAAAAGAAAAAAAGAAATGCAATTGCCCATCTTAAGGGTC^ 
TGTTGTCTTCTATTAAGAGAGCAAAGGTAGTTAGTAAA^^ 
CTCAGTTTTGTCTTATGCTTCGGCAGACTAATAACTT 
5 GAGGTTCTGTGCAGAAGCAGGCTGCTTTGAGATAGCGAGCT^ 

GAAAAGTCCTTGACTGGGCTGCAATAGGCAGAGTGGAGAGTGCTAGGAAGCA 
TTTTTGTCATGATGGAAATGTCCTATATTTGTGCTG 
TGAGAGAGAGAGAGAGAGAOUlGAGTCTTGCTCTGTOVTTaVGGCTGGAG 
CTCCCAGGCTCAGGTGATTD3ACCCTCAGCCTCCCGATAGCTAGGATTACAGGCGTGGG 
1 0 TTTAGTAGAGACGGGGTTTTGCCATGTTGACCAGGTTTGTGTCAAACT 

GTGCTGGGATTACAAGCATGAGCCACTGCACCTGGCCTATATGAGGTTTTGAGCCCCCT^ 
AACTTAATTATTTTATTTTATTTTATTTTAA1TTAAATATAAACT 

TTAGGTAAGACCCAAGAGATTGTGGTTTAGATTCTGCTTGGAATATGAGTGATTGGGGCT 
GTGTGAATTTGATGTTATGAGTTTGAATAAC7VGCCTTCCTTCAGTATGTCTACATTC 
1 5 AGAGGACTTACAACTGGTAAGTAAAGTTGTGTTTACCAAAGCAGTTG 

GCTAAACAGGTGAAAATTTCAGGTCCTTTGGATGAAGTAAGTTCTCATTTTA 
GCGTAGTTGACAAAACAGATGAACTTATATTTATGTACACCriTTATGGATTA 
TTCATTTGAAATTGTAAGTTTCTAGCATGTTGCCT 

TCAATATCCTACTATAAGTGGTAGAGAATCAAAGATTGTAGACTACCAAATAGCACC 
2 0 T ATAGTAAAAGCAGATCAG AACT CAGCTG CTGT ATG CCAT CTGGGG TT ATCTTCTCACTTG CACGCTCATGACCAGTGT AGGCTG C 
ATCTTATAAGT AATCTGGCT CACAGTT AGCT AGACAT AT ATT AGT CTGATGG^ C CGATTGTAG CTTTGAC CACTATG AG 

GACCACTGTCATATAAGGTCTTAGATACCAACTTGAGTAAATTGCTTAAGGACTGGGAATAGATCCAGATTCCT 
TCAAGTTGTGATCAGTTTTCXSTGGGATGCAAGGAGAAAATGA 

CTAGGCAGCGTCTTAAGAATTCTAAGCTTTCATTTATCAATCCTGAGGTAGTTAACATTA 

2 5 AGTTTACTTTTGAAG AAATTGAAT CTC ACGG TTT AAGTAAAGTAACTAG CT CAAGG CT AT AT AGTT AT AACT AAG CCACAG AG CAG 

GGAGGCTAATCTAGGCCTACCTCATGCCACAGCCTGAGTTCTTAGTTGTGATACAAAA 
AGTTGCCTCCTCAGAATTCTGGGCTGTGGTGGAATTACCCATCAGACTGG^ 
AGACTAAATGCAGGCTCTGCTGCACTTTAATGTAATTCTTTCCCTATTAA 
ATAGTTTATTTTTTAGCAGTTCAGCTTGTGCTTGGACCCAGTTG^ 

3 0 GATCTTrTTGTT AACT AAGGATT AACT AG AAACCTTATTTTATAAG 

AGGTGGTTAGCATTTTGTGAACAAGTATGATGTTGTTCCCAC^ 
AAGCAGAATTAGCAGTGGGCCAGTCATTC^CCTCTCTTC^^ 

AACCACTTTCAAGTCAAGTTGATTGGCCATTCATGAGCTGTGTGCAGGGAGAGAAGA 
GGTGTGTTAGACXXX5AGTTGTAGGTAAGTCAGGGAGGATATTCTT 

3 5 GACATTGCGGATACTTTCTAGTTAGTACTAGCGCAGTGTTCreCAAA 

TTTGTATTTTTAAATTTAGGCATGTGCCACATAACAAGATTTCTGTCAACA^ 

G AAGGTGGTATTTTT ACTG CCTCTTTTCTGTGTTGCAG AT ACGTT CAG ATACACAAAT ACTTAACZACT 

GT ACKH'AACATG CTGT ACAGGTTTGT AGCCT AGG AG CAACIAG^ T AGGCT AT ACCAT CT AG 
GTTTGTGTAAGCACACTATGATGTTK5CATGACAAAATCGCCT 

4 0 AGGAGAGTATTACTTCATGACACAGGAAAATCATTTGAAAT^ 

G CTCATTCATTT ACATGT AATCTGTGGT AG TTTTTG TG CT ACTGTGG CAG AATTGAATACTTGCAG TGG AACTTGAATGGG CT ACA 

AGGTGTATTTAGTACCTGGCCCTTCACAGAAAAAGTTTGCTGACCTGATTGAATGT 

ACTCTTGGTATGTAGCCTCTAGCTCCCAAAGGCATCTGAGCCCCACT 

CACCCATT AG C CTC CAG ACAACGAAG AAAACTTCT ATTTT AAAG AG CTTGACT AAC C C CCTC CT T ATT ATCTAAGTT AG AG TAG CT 

4 5 GTGCCACCCCTTAAGTAGCTTTTCCTTCTTCCTTTCAAATAGG 

TTGTAGAGACTCAATAGTCTAGGCTATAAATTAAAAAAAAAAAAAAAAAATTCCCC 
AAAGGAACAGAGAGACCCAATTGGAATAGAAACrrGAAAATAAAATCAGCAGCACATGJ^ 
TAATATTGCTCAGAAGACTCAGGAATAAGAAATAATAATGCAGGGTTTCrTTCAAAATA 
TGGAGAGCTTGTGTTAAAATATCTGAGGATGATTCCAAGTACCAGGGCTT 

5 0 ATTTAAAATTrrCCTTTTACCTAAATCCACATGGGCCAAGACAAT^ 

CAAT AAAGGTCATTTT AT AG AT CAGAAAACTG AGAC CCAAAG CT GAG AAAT AAG TTATCT AAAAC CC CACCAGG ACATTG CATT AA 

AGGAGTTGCTCTAAGCGTGTCTTCTTACTATTAGAAAGTGAAATTGACTTTTAAATCT 

ACAATGAACAGTCTGTTCTGGCTCTAATTGCX3CCTAGAATCT 

GAAAGACAGTTTGTCGGTCCTATCACTGAAGTCTGAATTATTTTTTTGATTTTTAAATATAATTAAT^ 

5 5 CCAGAGGAAGATAAGAAGATGAACTTGAATTTAAGGACCATAATGGAAGGAGTGGGATGGGAAAC^GGCT 

GGTTGTGGAGAGACAGATTAAGGTAAGGAGAGAAGGTTGTAAAATAAGACAGGAAGGGGT^ 
TGGTGTTGTAAAGCCATAAAAGAAATTTAAATGCGGAAGTGGCCTGGAAGAGAAAACATTCTCT 
TGTGTCAGTTTGAGGCGTAAAGTAGACGAATAGCCTCCTTCTTGTGTGTCTTTTCCATGGCAAA 
CACTAATCTCIACACAGCTGCrrGATGTTTAAGAACCTAATTCTTGAA 

6 0 GGGCACTTTGGAAATTGTAGTGACAGGTTTTAGAAATTTGAAGTTGTC 

CTGAGTCTTTTAATGCAGTGCACTTCTAAGTTTCCGATGGTATTTC 
AAATAATTGGCAACTTTCATTTAATTCAGTITGACCCTGTCTTGAACT 
TAACATTAGAC^CTGATTCK^AACTTCATGGAGGCAAGTCCAGTGAAAC^ 
CAAAGTTACTCAGTCAGCAGCTGCTACTGTTGGGATGGCGTGGTCAGGTGC^TATTATGCT 

6 5 GACTCAGTTTTGGGGCTGAGTTGTATTTCAACACTT 

TCCTTAAGGACAGAAGAACTATTTATACTTGTCrcCCATCTTCTTA^ 
TTGAGGTCCCTAACACXXXSAATCCCATAGTTTCAATGATGAACAGGTCC^^ 
GGGAGAATAAATGGTCATGTGGACTTTAAAAATTGGAAATGTTACTA 
CAGAGTTTCACTCTrGTTGCTTAGGCTGX3AGTGCAA 

7 0 TCCTGCCTCAGCCTCCCAAGTAGCIX3GGATTACAGGTGTGCATCACCATGCCCGGCT 

TTTCACCGTGTT^XJTCAGG CTGGTCTTG AACTC CTGAACTCAGG TG AT CCACCTG CCTTGG CCTCCCAAAGTG CTGGGATTACAGG 
CATGAGCCACTGCTCCCAGCCCrC^TTCATCAGTTTTTTAAC^^ 
GAATTCCAGAGCTCTTTC^CCTTTAGCCAGAAACAGTCTCTC^ 
GGAAATt5TTCTCCTTGAGAATTGCTGTAAAGTAGTGATAAGGACT 
7 5 GCTCACAGACAGGGTGTCCTGGATTGAAACTGGACCAACGTTGGTATAGTCT 
TCTCCCCTAAC^GTTGGGGAATACTTGTTTTTA

CTTTAGTTTTGCATTAAGTTCCTGGGAGATTCACACACCCCTGATATTTACTGCA^ 
CCAGCTTGTTATCTGTCCTAGACAGAAGCCATCre 
ATTTTrrTAAAACTTCTGCTTTTGGGGATGTGATATGGGTTTGAAAG 
5 CAAGAGTTTTAATAATGAGGTGATAAATATTTTCTITAAATAATTTATTCAC 

TTTCCTT ATGCG CTAGTTGT AT ATTTG T ATGTCAG AGAAGAG CAGAGATT AT CTTTGACAGAAGTT AT ATTGTGAGTGT CTTTTTG 

GAGAAGGTAGGTTTTGATTGTGTAACTATGCCACACCTCCATTCCCCATA 

ATCATATTATACAGCTCCCCTAAGTCTCAAATCTATCATCAAAAGGCAGTGTAA 

1 0 TAAGATCATGGAATAGTTGATAAAACAGTCAGTATTCTGTCCTGATG^ 

GCTTGGATTGAGTTTAGAGACACCAGATAGAATAAGATTGGCAACACT 

GCTTCTGCATGGGGAATGCTATGGCGTTAACAGTGC 

GAGTAGGCCACAGTTTACTTCCTTTTrCATAAGTGCTTTTCT^ 

AGTTG'l 1XXSCTCGGTGGGCATTGTGAATGGGTATGGTTGCTGCCTCAAATGGA. 
1 5 TTGAGTAGGCTGACTTCAGTTCTATGTAAATTAATCAA 

TTATAATTTCTTGGAGATTAGCCAGGAATTCTGTrACAA 

ACTGAAATTTTCTGAGCCCAAGTTTCll'CATCTGTAAGATAA^ 

GATTTGAAGCTCTTTACCACTGATTTATGCTACCITCTCA 

GATAGAACAAGG ATTTTGAATAAGATATTTTTCACTGTT CTT CCTTCAG AGGTCCCTAAAGGGGA 

2 0 CCTCTGTGCTGCTCTGCAGTGCCTCTATCACATTAGGGTTTGGTC^ 
GTATTTTACAGACTTGGTTATTTCTTCCCCTCATGTAATTTGTC 

T ATATG CCTG ATGTG ACATTGATTTGGCCACTTGGGGTTGAC CTTGTACTTGGTTTCCATATCTGCAGCTTCATTGTCAi 1 T it3GG 

TTTAGGCCAGCTTTCAGGAACnTTTTAAOU3TTGGA 

TTGTTTCCTTTCTATCTAAGCAGAGAACTAGGACTCTAGAGTTACAAGA^ 

2 5 AAAGGACCAGGCATCGTGGCTCACTCCTGTAATCCCAGCACTTTGGGAGGCCA 

GACCAGCCTGGTAAACATGCTGAAACCCTATCTCTACAAAAATACAA 
CTACTCGGGAGCCTGAGGCAGGAGAATCTCTTGAACCTGGGAGGCGGAGGTTGC^ 
CTGGACAGTAGAGTGAGACTCrrGTCTAAACAAACAACAAAAAAAAACTCT 
TCTGGGATTTTATTTTTTAATATGCTCCTCAAAACAGTCAACTCTCC 

3 0 CTCATTCAGGAGAGGATCAATTTTTCAGCAATGATAG 

CCAAGAATGCCCTTTCACCTTCCTTTGTCTTCCCAACT 

GCCTATGTTAACTCCTAACTGCCACTCTTGGATCCAGGGTTTrTATTTCATCTTGTTC 
TTATATTACTCTTCACTTTGAAACAAGGAGGCAGTTAACCTAGATTCAGTA 

CAT AAATT AAG CAATCTT ACTG CCTT ACTG CGCTT ACTGAAAAAT AG C AG CACAG CAG AAAAAT AG AAAATCTTG CT AT CATG T AA 

3 5 GAG ATG TGGAAGT AATGGTG CC CACATGCTTCATTTGG AAAATG TCT ATT AC CTTT AT AAAG AGTTTGT ATAGTTT CT AAAT AC AG 

GAGCATT'ATGATGCGATAGAAACTGTTACTTTAATGTTAATTGACGTTCGAAAA 
TCTTAACCCTTTTGCTATTCCATTGATCTGCCATTCTACATTCTGCCT 

T AACTTTG TTGT CAT C CACTT ATTTCTCT AG AT ACACCTTGGAGTGTTTTTTCAGG CTG AATTGTT AGG C ATGGG ATTG CTGGGTT 
TAAAGAGGGAGTGTGTGAGTTTGTTTGATTTTGCCAAATTCTTCCTTCATGGGGG 

4 0 GATGGTAAAATGAGGCAAAAAGTATCTTAGATTTGAGGAAATCGTTACCACCCAGAGTTGTGAAACT 

ACTGC CTTG CACAAAAT AT AC ATTT AGTAAAGG AT ACTXIAT ATGGTTTGG CTGTGT C CCC ACC CAAATCTCATGTTGAATTGT AG C 
TCT CAT AAT CTT CCTGTGTTG TGGG AGGGACC CGGTGGGAGG TG ATTGAATCACC CGGGCAGTT ACCTC CATGCTGTT CTTG TGAT 
AGTGAGTTCTCATGAGGTCCGATGGTTTTATAAGGGGCTTCCCCIACCCCX3CC 
' T/TGCTGCCATCATTTGAAGAAGGACGTGTTTGCTTCCCCTTCCACCATGATTGT 

4 5 CTGTGAGTCAATTAAACCTCTTTATAAATTACC<ZAGTCTTGGGTATGTCT 

T AGAT AC CCATT ATT AT CTTTTGTGAACTGGG ACT ATTTCTCT AT CGTAAGA^ 
AGCACGCTAGCATGGTGCTGGCACCCGTATTAGTGCCCTTTTCTCTTGTGTC 

ATT AC CTT AATGG ATTGT AAACTCCTTGAGGG CAG AGTC CTTGTCTGGTG AG T CTTT AG CAGG AAG CTTTGGACTT GAAT AT ATTT 
GGTTTTAAATTGAATTAGTCTTCAGAATATATTTCrcAAAC^ 

5 0 AGCCCTCAACTCCTGTGTAATGCTACCTAACATCTATTCCTACAGTAG 

TTCTTTACTTAAAAGCrnTTTAAACCCCAGGGAATACGTTAGGT 
TTATCTGTCCTTAGGGAGTAACTCCAGTTAAATTACAGTGAAAGGAGTTAGTTGCCA^ 

ATTG CAGGGGG CTT CCT C CCATGTCTTCATCT AGAATTTTCACT AGGAATT AAAAT ATTTGTT ACT CATT AGTTGGCCTG 

AATTCACAGTGAATTAGGTTTTCTGATGATTTAATGTGCACTGGTTTGCATCT 
5 5 TTTTGCACTTAACAGGTGCrTGAAGTGGCIAGGGTCCTTCCT 

GAAAAGGTATGTGCTGAAAGTAATCTTOUVGGCCACAGTATGTTCTCT 
GTGCCCTTATTGGCTTTACTTTTTCACITACTGGCATCTCT^ 
CAATATGGCTCAGGCTC^TCC^TCATTTCTCCAGTATTTTTCACCATCTTT 
TTGTTCCCCCAGTCACCTGGACGTAAAGACTTTCTCTGACTCTTTATTACACTGCATTTATTAC 

5 0 CTGGGTTCTACTTCCATTTrCCCAAATGTGCTTT^ 

G CTT C AGAAACCTCTAACT ATTT ATTTTG CTTTT ACTTTTATCT CTT AC T ATG CCTT AACAC AAAC C CTCT CTTGGAACT AGG C CA 
GTCTTCTTGCCATGCCCTCACATAATAACACAA 

GTTTCAAGCATACACATAGAGAAATATAATGAACTCCC^TGTGTCTGTC^ 
ArmGTCTATCCCCGTATCCCTTCGTGCTCCCC 

6 5 GGAAGTACTTCAGTATTCATCTTTAGAGATTCTTTAATCTGTCAGTTTACCTACTGCAGAATATCT 

GG TT ATTT AGTTTGTTT AAATGCTTTTGG CGTTCAGAAGGCATC CCATT CTTTTAGTGG TT AG 1 1 rr i"i GTACTG ACCT CAATTT 
GCCTCCCTATCATTTCTCAGCAGTAATATAAATTTATATTTACTC7VGAAATTACAAATTTACTGAG 
TAATAGT(^TTAAGATATTTGAAGAAGTTGCCATGTAATACACTTTTTGACACAC 
TTTTATTTGTGTTTTAGAGAGAGGGTCTTGCT 

7 0 GTGCCCARGCAGTCTTCCCACCTCACCCTCAGAGCAGCCAGTATTACAGGTGCACCAAGATGCC 

TCGT AG AG ACAAGGT CTTGCTATGTTGCC CAGG CTGGTCTCAAACT CCTGG C CTCAAGT AAG CCTC C CAC CTTGG CCTCCCAAAGT 
GTTGGGATTGTAAAC^TGAGCCACCCTGCTCAGCCATTCCTTTAAAATAGGAATACTTG 
TTTCTATCAATGAGATTAATGAGCCIACAGCCCCXSAGTTTC^ 
AAAACrTGGGGACCTGGCTTAAAATCAATTT^ 
7 5 GATCCCAGCACCAGACTTTGGGAGGCTGAGGCAGATGGATTGCCT^ 
CCCCATCTCTACTAAAATACAAAAAATTAGCCGGGCAT^

AATTG CTTGAAC C CGGGAGG CTGAGGTTGCAGTGAG CCG AG ATCAC ACC ACTGCACTCCAG CCTGGGTGAC AG AGTG CAC CTGT AG 
TCTCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGG 
TCCAGCCTGGGTGACAGAGTGAGACTCTGTCTCCAAAAAAAATAAAAAAAAA 
5 TGGAATTACTTTCAATTT ATCCTTT ACTT AATTTC CTTTTT AAGTTTAG AG CTTTTT AGTGATTTAAGCAAATTTTCCTATGT ACA 
CT AGT ATTTG TCAAGATAAATTTTT AAAG CTTGGT CCATTTTTT ACT AT AAAAGTAATTGTGGGGAAAACT AGAAAACAAG ATGAA 
GTATATGCACAAAAACTCAAGTTACrCATAAATTGACTACCCT^ 
CTTCATATGTATTTATGCGTTTCTTGAATTATACACAGAGTAGTT^ 
AGCAGATCTTGATGTTCTGAAGCTTGTTTATTG 
1 0 GTGCTATTACATGTATTGTCATTATTATGATTTTAGCCATAAGTGTT^ 

AACTGGGGCCACTAACCTTGTGGGAATCAACTTAGCATlTrAAAATTATAGACT 

ATGCTGCATXXIAAG ATTGAGTTCAAATCCT ACTTCTG ACA C CAT CTCTG TAG G ATGGGAATTTCTT A 

CAGTTAGTACTACGGTCATGCAGATTTCTATTTTCCCAGGGCCT 
CTTTTGCTGTGAGCCAGTCATG<K3ATGAAGACACATTGCCTATGCAT^ 
1 5 AATAGCTTAAAACAACAGACACTTATTTTCTCATGTAGTTCCTC 

TTACCATCTGTAGATTGCAGTCAGTATGTCAGCCAGGGCTGCAGTCATT^ 

ATGGCTCATTCACATGGCTAATTXSGCAGGAAGCrTAAGT^ 

CAGTGTGTCA.CTGGCCTCCCCCAGAGCAAGGAGAGAGCAGGAAGGAAGATCTA^ 

C CTCACAATT CCACTTCAT CTTG TTTGTT AGG AGTG AGTCACAGTG T CC AG CCCACACT CAAG GGGAATTAGG CT ATACCT 

2 0 CTTG AAAAGCGGAG T AACAAACAAT CTT AT AGT CAAGATTGTGTTTGTG TAAGGTT AGAAT AG AGGGTC CGGAAAT AGACCTATAA 

TTATATGACCGCTCGAAAACGCCAAGACAATTTAATGGGGAAAGGGCC^ 
GAAACATGGAATCAACCTCAATTTTCATTAATGGTAGACTGGATTAAAAAAAAGAATGTG 
CAGCCATAAGAAGAACGAGATTtSTGTCCTTTGCAGGAACATC^ATGG 
CAGAAAACCAAGTATTGCATGTTCTCACTTATAAGTGGGAGC 

2 5 CAGGGGCCTTTTGGAGGGTGGAC^TGAGAGGAGGGAGAAAATCAGGAAACATAATTAGT 

TGAAATAATCTGTACAACAAACCTCCATGACACGAGTTTACCTA 

AAAAAATGGTGTTAG AACAACTTG ACX3TCTTTATTGATT AG AAAAAAAT^ CT AGAAAA 

TAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATAAGGTGGTGGTGGTC 

CAAGGGCTGTGGCAGGAGGATTGCTTGAGCCCAGGAGTTCGAGAGGAGCCTGCGTA^ 

AC CCTAAAAG AG AAAATTG ATGAAG TGGACTTG ATCAAAATTT AAAACTTTTGCTCTTTGA^ 

AGAAAATGATGGG GAGAAAAT ATTCACAG TT CGT AT AG CT AACAAAAAATTGGT AATCCAGAATGT AT AAAG AG CT CTT AC AACT A 
GGTAGTAAGAAGACCAAGAAAGATTTGAGCAGATACTTCACCAAAGAAGGT^ 

GTCATT AAACATCATTT AATCATCAGAAATATGTG AATTGAA^ CTAGAATGG CTAAAAT 

3 5 TAAAGACTGATAGC^GATATTGGTGAAAATTTGTAGCAATTGGAATGCTAATC 

GGCAATTTC CTGT AAAG TT AAACAT ACATTTTTT AT ATT ACC CAGT AACTTTTCT C CT AAGT ATGTAC CCAAG AGAAATGAAAAC A 
TGTCTATAGAAATATTTGTATATGAATGTTTGTAGCAGATAAATAA^ 

ATGGG TAAG TT ACAGT AT AT CCACACAATGT AACACC ACTCAGCAAT AAAATGG AA TG GAT 

GAATGTCIACAAATGTTATGCTGAGAGAAAGACATCAGACACAAAAAGAATAAATCCT 

4 0 ATCTAGACAAATCTAATATACAGCACGATCAGATCAGTGCTTCCTTTGGGCAAGGGGTG^ 

ACGAGGGAAACTTTCTGAGATGATGGACTTGTTCrAAATCrTTATTGTAGTGGGG^ 

TTATTACAGTTA riTL ' n 'AATATTTGTAGGGGATTGGTTCCA 

GTTCAAGTTTTTTTGTATATAAAATGGCACACTATTTGCATATAA 

ACTTAGATTACCTAATACAGTGTAACTATATAAATAGTTGTTATATTGTGGGGGGATTTGTATTTTTT^ 

4 5 TATTTTTATTTTTCCTTGAATATTTTCAGTCT^ 

TTTATTGTTTAGACATTACACCTTATTAACATTCATTTAAAAACATCT 
CTTTAGAGTCAGTGTTCAATACTATCCCITTGTTATCATTTATAGGTCAC^ 
ACTGTGGCTTCTTTCCACAGAATAATTGGCTCTCATTTCCTTATCCTCC 
AATGGGAGCTTTGTCTGGGCTTCTTTGTGAAAGCTTTTTTTTTC^ 

5 0 TCATTGTACTTCTTGACCAGCTTGTCAAGGTAGGCATACAGA 

CAGATTCTCATAAACACAAAGATAAACACAAAGATCGAGAACACCGGC^ 
AAGCATAGC^CAGGTAAGGGTGGAATCAAGCAAGTCCCTCATCATTCAGCAGTGGGTTC 
GTCTTAACTTGAGCTACATGTAAAGATAGCAAAGTAAGTAGAAACTGTATT^ 
AGGCCTTAAATGGGATGCCATTGTTCAGATCAAGATGTATAAAGCAA^ 

5 5 TACAGTTGTTCACTAAGATAGGTAAGGATTCATTGTGACACATGATGGCATTTA^ 

GGTAACTGGATTGGAGACCTG AI T T T T XTTC^ 

GTCTGTTAGCCCCTGAAATTATATGCAAATGGTTCAGTATATGAGATTTCC 

GAGGAGTTCGGCTCCCACRAAAGCCTACAATCACCAATATGAATGAAACrCAATAGTC 

GATC' l ' lTCTlT TGTTCATTrTGCrCCC A l'n'r rU 'lGGTGTAGGAGGCAAGTGGT^ 

6 0 TT CCATT CTGATTT AAAG ACAGTTGTTGG TG AGAAAG CCAT ATGGT ATTCC CATTC CCAT AG TGGC CTGAG CT AGGTT AGGG AATG 

TAAAGATCTGTCACACATTAGAACAAGTGTCTCCATTGTAGAGTTCTTAC^ 
AGTTAAAGTGAGAATTGCTTTCTAAGTCTGTT CTTTTT^ 

TCTTTGGGAATCATGTATCATGCCTGTC^CCTTTTTTCCCTATCCCTCACTTATCTCT^ 
CATTATTCTGGT^AAATTTAATCTCAGTAGTTGAATCTACTCTACAT 

6 5 CCGTACAGCAAAGAATATTGCCAAGGATAAAGAAGTTCATT^ 

ATGTTTATCCACTAACGTCACAGCCTCAAAATACATGAAGTAAATCTGAT^ 
TCAGAGATTTTACTATCTTTCTCAATAATTTTTAGTCCAAATA 
CTTGATCTAATCGACATTTATAGGTTTTTCCCATCCAACAATATTAGAAT^ 
TTTTTTTCAC^TTTTATTTTAGATTC^^ 
70 . TACAAATGATCCTGTCACCCAC^TACTGAGCATAGAACCCAACAATTAGTTTTTCAACCC 
TCCAGTGTCTATTATTGCCATCITTATCTCCATGAGCACC 

TT CTGTTCTTG CCTT AATTTGCTCAGG AT AAT AGC CT CCAGTTG CAT CCATGTTG CT ACAAAGG ACATG ATT CAGTT CITTTTTT A 

TGACCGTATAGTATTCOJTG^TAGATATGTACAACATTTTCTTTATCC^ 

TTCCCTATTGAGAACAGAGCTGCAGTGAACATGTGTGTACATC 

7 5 AAGTAATGGGATTGCTGGATCAGAAGGTAfiTTGTTTTTITAAGrr 
AATTTACATTCTCATCAACAGTGTATAAGTGTTCCCTTTTTTCTGCAGCCTTG

CATTGTGGCTGGTGTGAGATGATATCTCATTGTGGTTTTGATTTGCATTTCT 

ATTTGTTGGCCATTTCTATGTCATCTTTTGAGA 

TTGTTGAATCATTTAGGTTCCTTAT AG ATTCTGGAT ATTAT AC CTGTTAG ATO CATTCTGTAA 
5 GTTGTCTGTTAACTCTGTTGATAGTTTAATTAGATCCCATTTGTC^ 
TGAAATCTTTGCTCGTTCCTATGTCCAGAATGGTATTC 
GTATTTGATTCATCTTGAATTGATTTTTGTATGTGGTGTAAGGAAGGGGTC 
CCTAGCATCATTTATTGAATAGGGAGTCTTTTCC^ 

GGTTTT ATTTCTGGGTTCTCTGTTCTGTTCCATTTGGTCTATG TGTCTGTTT1TGTACCATTACCATG CTGTTAGTGTTTCTGTAC 
1 0 CATTACCATGATGTTAGAATACATTCTTTTTAAGTC 

ACCATCTTCTTTGGCTTTAATGCTTCCATGCCT 

AACAGTTTTTGGCrGTCrTTTCTCTGCTACCTG^ 

TCAGTTCCTTAAACATAGCATGTTTTCTGATGCCGCTAAGCCTT^ 

TGTCATTCITTTAGTTGCAGCTCAGACTTGACriTGTC 
1 5 GCTCTCTGGTGCCTrGAATATATTCCTAGTGTAATAATTATTATATTTAATTGTGTCTTTAC 

TAAAGTCCCTGAAAGCAGAATTGGGATCITTTTC 

TTTTTTGAGACGGAGTCTTACTCTGTTGCCCAGGCTGGAGTGCAGTGGTG 
CAAGTGATTCTCCTGCCTCTGCCTCCCAAGTAGCTGGGATTAAGGTGCGTGCCCCG^ 
GAGACAGGGTTTCACCATGTTGACTAGGCTGGTCTCAAACTCCTGACCT 
2 0 G ATT ACAGGTGTGAG CCAC CACGCC CAGCCTTTTGT ATT CATTTTGCATC CTT CAGG CTT AGT ATGG T ACCTGGCAG AT AG CTGG C 

ACTTACTAGTCTATATTTATTGAATTAATGAGTGGATGATCTATGGGTACA 
AATCTTCTATAGGATAGTTATTAAGTCACTTTGGCTTATCTCATTTTTCACACAA 

T ACTGTTTT AAAGTAG CATTT AGT ACATTGACAATAGT ATGCAACTGTCAC CA CT ATCT CCAG AACTTTT CAT CATC CCAG T CTTT 
GAATTCACTAAATAATAACITTTCATTCnXSTTCCCCTGTCCCAATATCTTT 

2 5 TTATCTTTATGAATTTGCCTATTCTAAGTACTTCATAT 

CTTAATATGTTTTCAAG^TTTATCCGTATTGTAGCGTATCAGAAATTCATTCTTT 
TGTAATCACATTTAAAAAAATCCATCTGTTGATGGACACTTG 

TTTCATTCTCCGGGGTGTATACCTAGGAGTAGAATTGCTAGATCATGTCCAACAATGGGTGGCTGCCTG 
TAATTGATGTGGTGTGAAAAAAAAAATTAGATAACCTTTGACCGTGAACAGTATCATCT 

3 0 CAGATGAAGAATGTTGATTTTTATATTACTAGGCTGTAAGAAGAAATTCCAAA 

CATAAGCTTTTGAAAACTTATCTTCTCTA^ 
GAGACATATAAGTTTGGCCCTGTCCTGTTTTACAAAAATCTTCT 
TCATCCTTtH'ACATCCATTTAAGTCTCGTGATCTTGGCTAGGTTACTTA^ 
ACATGCTTCACAGGGGAACCATGGGAGTCAATAAATATAGAAGCTCITC^ 

3 5 TTTTACATCTATTTAAAAGCATACTTTTCCAAGATCATA 

TCTTTTCrrTTTTTTTTTTTTTTTTGAGACAGTCTTGCT 

CTCCGCCTCCCGGGTT<^TGCCATTCTCCn , GCCTCyU3CCrCCCGAGTAGCTGGGA 

TTITTATA TlTrTA GTAGAGACGGGGTTTTACTGTGTTAA 

CTCCCAAAGTCnrGGGATTACAGGCGTGAGCaVCCGCGCCrGGCCAC^ 

TTATTCAC ATGAGT AAATTTTAACAGTTTAATCAG ATGAT CTGTTGTAAGCAAACTTTTAAGTACI'T CTTGTTTTG ATGTCT ACTG 
ACACAGTACCTCTTTGCAACAAAGCAAGGATATTGGTTATATTCCAGTAAA^ 
TTTGTCACCCAGGCTGGAGTGCAGTGGTGTGATCTCGGCTCACTGC^ 
CCTCCCGAGTAGCTGGGACTTACAGGCGCCrGCCACCACGCCGAGCTAA l^ 

4 5 TGGCCAGGCTGGTCTCGAACTCCTGGCCTTGTGATCCTCCTGCCTCAGCCTCCCAA 

ACCCGGCCTCCAGTAAAATTTTATATACACTGTGATCCnTTTAAAGTAC 

TCTTCTGTAGCAGATTAAAGAAATCTGGGGGATTATTTAATCACATCTGAACCCCAGAAT 

AAAATGCTTTCATTTAITCCIAATCCTTGCTAATCAATTCT 

GTGTTTCTGTGTCCCTTCCTTCTCTGCTAGATATTAGCACCAATCATTTGTCATCAG^ 

5 0 ATTTACTGTCCTCTCCAGTAGCAATACTGCAGAC 

ATCTTCTCTTCTCTTTtSTGTGGACAGGAGTCTCAGGTCACATGCT 

GAG AC CAAGGATC CTG AG T CCC CACT ACATT ATTCCTT C CAG TG AAAAAT ACCTC CCT GTGT AAAAAG AGTT AGTTTTG TAG CT AA 
ACTGTCTGTTTAAATACAAGGACCCCTTACTTGTATGCATA 

5 5 GCCTTTATCTCTCTTTGTCCCCAAATCCCTGAAATCTT^ 

CATCrCTCTCTTTCTCTTGTCTGGCCTGTAACATG^ 
ATCTCTTGACACTGCTTCTTTCTGTTAGCGTTAATATTTTGCTCCTC^ 
ACTGCTAAGCTTGGGCGCGGTGGTGTGATTATAGCTCACrGCAGCCrCCCTGGGCC 
TAGCTGGGACAATAGGCATGCACCACCATGCTTGGTTAATTAACTTT^ 

6 0 AGGGTGGTCTGGAACTTCTGGGCTCCCACATTGGCATCCCATAGTGTTG^G^ 

TAAAAAATACATAAATAATCTCAAAAGGAATACTCTTCTAGATGCCAAGGAATTACGTAGCTTGTAGGAT 

CTTAAAGGGTT ACATTCTT ATC CATTGTTT AT CTTT AATAGT AG AT CTCTTGT AT AAACAGAAT ATT AT^ CAATTTGGGAG 
GGGGG CTTGATTTTAGAAAATAAGAGAATCGTGTGAAATGAAATTCCTCCTAGTTTT 
ACCTC^CATTCACACTGGAGTGACAAATTTCTGGAAAGCAAGGG 

6 5 CAGCCATATTGGCAGTGCTATTCTGTACTGTCAGTGTGAGAAACATCTGAGAATAAGGATGCT^ 

TTCCAAATCCTACCTGAATTGTAATTGAGAGAATGTAGGATTTTAGGAATATGCA 

GGAGATTTITCTGAAATTAATAAAATTTAAAAGATATGTTTGCTGCTTTC 

CACATCACTATATCATCCTGTCTGTGGGAAATGGACAACTAACATGG^ 

TCAGTACCACCACC^TTCACATGGGAGTCAGTATGATATATTGAATAGCACACTAAGATAAAAG 

7 0 TTAT AAGCT ACGTCTCGTGATCCAAGGTGCTTAACTTCTCTGGGTC TCATTTTCTGTTTrGGTCAAGGGTrGATAACAGTACCTG C 

TTTATCAATCTTTCAAGTTTGTTTATGCACATGACAGCTCTTTG^ 
ATGATAATTACCGCAAAGCCTAGTATAATATCATGCAGATAAATTGGAGCTCTTGAGTT^ 

TTGAT AG CATTCAATTTGGAGC CCTG ATATTTTCAAT ATTATATTT AT C CCT ATCATAGT ACG CCAT AATCATTGTTTTT AAG ATT 
TCATGGTGGAGACAGCATTTGAGCTAGCCATTCAAAGATGGTTAAAA 
75 AGCAAAACATAGAGGCATTGTACTATATGCrriW 
GAGAC CCAAAAAAGTGGCTTG AGGCAAAACT ATT AAATG C CATTTCAAAAAGTTGG AACTTGTT ACTT ACACTAACT AAAAAT CGG
GGGG AAATGTTGGGACTCGATTCTGAGCn^CAAGAAGCCACTGGAA^ 

GACATGAAGGGTCT AAGAAAAAGGGTT C CTG TCTTGGAT AACTGGGTGG CTGTT CACG CT ATT AAT AG AG ACAAAG AACTT AAGGG 
AGGAATGGG AGTT AGAATGGG T ATCGG TTTTGC CATCTTTGGG CACATC CTTTT AT CCTCT CTGG CCTGAAAACAAAGGATT AAAT 
5 AATTTCT AGGAAATT ACATTT AC CTTG AAGTTTTG AT ACTT CT AT AATAT AAAACG CCC<5 AGT CCTTGTG ATTTTGTTT AG AG AT A 
TATAATCGGCATTGTAGTGTGTAATGATGATGGACATTAAGGAACCCTTGTT^ 

AGCTGCG CTCTTTT CCCAGAGG CAACTTTTGAAGTTACTTAAAT ATTTAATTT AACAAAT ATC^ CT ACTA 

TCCTTGAAAGGGTCATGAGTTTCCCTCCTAAACCAAGACTTT^ 
CATGTGACCTGCAGAGGTCrreCTGCCTTGAGGAGTAGACAGGTAGCT 
1 0 AG^CTCCTTCTACCTTTAACACTGCTTTTTTGC^ 

. CTGAGTTTAAATCC!TGTTTCTCTCTTTAGAGGTAGTTCCTTG 
AAATGAAGATACCAGTAGTTACCTCrAGTTTTGTTACCGATATCAAA 

ATCTGTTAAATACTTTTATGGGGGATAGAGXIACAGATT^ CAGTAT 
TTTGTTGTGGTAGCGGTGGTCTTTTTTCCTAAAGAAACAGTGCT^ 
1 5 CACCGTTTCTCCCACTCTTCTATATAAGTGCCTTAGAGAAACT 
ATAGACC^AAGGTTTTGCTGTCATCACTITA 

ATGGAGCCTCAGTCTATCGCCAGGCTGGAGTGCAGTGGCGTGATCT 
TCCTGCCTCAGCCTCCCCAGTAGCrrGGGATTACAGGCATGTGCCACCACGCCCA 
ATGAGCCACC^CGCCCAGCCAGATTA T TT TT TCTTTAGTGGAATT

AGGTGGCTTTGGATCTTTCCATTTGTTCTCnTTAGCGCT^ 

ACAGATGCAAACTATGAACCAAGAAGTCAGCATCTATGCCTAGCCAAT^ 

GGGGTAAATTCATATTTCATCCATAAGTGTGACCATTGCAGAAGT^ 

CCCAGACTAAACTTACTCTTAAAATATGTTTGTCCAACCTTTGGTTTCCTTCCT 

GAAGACAAAATGGGG AG ACAAAAAAGG CAACCCTG AAGGTTTTT ATCCTTT CAGGG TAT CT AG CATTTTT CTAATTT AGCCCT ACT 

GAAGTTTCTTCAGTAATTAATCATCTGTTTTTCAGCCTCTAACCT 

TGTGACITTGACATATTGTGTCTATTGTGTATGGAAAAGTAGCA 

CCAGCAAATGAACrTTCTGCATTGGTCAGAATTTCCATCATTTCACTGTT 

TCAAACATATCCTAAGCCTTTTGGAAAAAGACATACATGTTAGAAAATCT 



AGCGCCTTTGCTACTTCCTTCAGGCACTnTCCTAGGA 

GGTTATGGATGCCAC<5TTGGGCTCCAGGCACTCTGCrCTCCATTTGTG^ 

ATCACATTGTATTCAGAAGCCTGGTGGAGCTCTATAAACCCAAATTTCT 

CCTTAACCCTCAATCATCTGTATTCAGCTAGTATAAAAATGCAAATCTGCTCT 

GCTTCCTTTGGAAAGTCAGX1ATGATAGCTTCATCCCATTCTGGTTTTTGTGATTAATTGTG 

GCAGGAACTTTGAATTTCCTATATACATCTCTTTCTGGT^ 

CTCATTTGCCTTGTCATTTGTTGATTCCTTTTCCCTGATCT 

ACATGTACACnrTCTAGGTACCTATGTTGTGATTGCCAAAAGGCTCAGAAAG 

AATGCTTTCACTC^GAAAGAGAACGGGTCAC'GGGAAGGTGGAACT^ 

AGCCATTTATAGTTTCATGGGAAGACAATGAAAAAAAAAAAAAAGAAAGAAAGAAAAGAAAAGAAAAGA 
GTTTTTATCCTTTCAC^TATCCAAACATCTTTATTCT^ 
CTC^GACTCACCATGGCrrGTTTCTTAACACCAGTATACCTI^ 
TAGTGGAGGCTGCTCTGAACTTGGAGGTGG AAAT ACT AAGTTT^ 

TT AACCATTGTCT CCTG TTTGT AT CAG CATGAGGGG AT AG AT AAT AAAAC CTG CCT CACAGGAGTG TTGTGAGAATTGAGTG AAAT 
AATGTATGTGAAAATACCAAGTACAATGCTTGCTTAGAAAGTAGAATTT^ 
CAGGGGGAGGCAGTATATTCATCCATCAATATTATAGATGCAATTGTTCCCAC^ 
GACTGTATACCCTCrTACAGTTGATGCTTAATAAAACAAAACTACCATG^ 

GT AGT AT AAAAAC CAAGTGCCTGTGATT ACCT AT ACATT AACT CCTGAAG AATCAATTAT AATCTCA^ CAACCTT G CTCTTTG AC C 

ATT CATT CT AAAG CTTGG CGAAG TT AATG AG AACAG AT AAT AGG AGTGCT AATG CAATTGG AAG ACTTTTTTTTT 

TAAGGAAGGGATACATTTTTAAATAAAACATTGCATTTTGGTA^ 

TATTGATTCCAAGAACAAAAAGCATGGTGGAGAAATTTTGTAACCATGCAAATCC^ 

TTTTCCTCCTATAGAAACTTGTTATTTAAATGTGTGTCACATGTAA^ 

AAGCAAGGGAAAGTGGGCATTGTTTTGTTTTTTTTTTTCT 

GTTTCTCCCTGGCTAGATTTCTTAAAGTTTCTGCCTATTCATCACTCAAGTACCTATCTCAAGTTC 

GTGTAATAGCAGATGTTCTCCTGATCTAGATTTTGTTGCTCT 

ACTGTAGTCAAGTGTCCCTTTAGAACAAACTTCATGGGATGGACATA 

TGCATTGGTTAGTGCAGTAAGCAAGGCTCCTTAACTGCCTGTCCCAGTGTACTGAAAA^ 

AACTTGTGGTTTTTGCTTATTGCTTCTTTAGTCTTTGACCACATGCATGTTTTTACATAG 

TTCTGTCATAAACACTTTCCAAATCTGTTTGTGGTACTCAAATTATAGCT 

TCTTGTTAGGTGTATCCAATTTTGGGGTTTGTTTGTTCGTTTTTGT^ 

AAATAAACCTCri'TrTAAAATTAATTAATTAAl'l'l'l'l'lTGAGGCAGAGTCTCGCTCT^ 

TCTCGGCTCACTGCAAGCTCCGCCTCCTGGGTTCATGCCATTCTCCrrGCCT 

CACCACGCCCGGCTAATTTTTTGTATTTTTAGTAGAAACGGGGTTTCACTGTGTTAGCCGGGATGGTCT 

GATCCGCCTGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGCGTGAGC(^CCGCGCCCAGCCCAA 

TGTTATTTCCCTAAGATAAATTTCCAGGAGCTTGATCCCAGGGTAA 

ATATTTTGCITTTTAACCTTTAACAAAGAACATTCCTA 

CGTAAGTCCCATCTAGAAGTGACTAACTTGCCTAGTTAAAACTAGAAGGAAA 

CTGG CTTGG CTACATX3TGTGGTATGTTTCAAGTCTGGCACAAAGC 

ATCAG ATATGAATGCTGAAGTTG^ AGCTTTCATTATTCTTTTTGTG CTGGTAGAATX3TTGCAG AAAT AC^VCAGGG CTGGCTGTGTA 
TTCTTAGTCACTTTAAAAAAAATGTCCTC^TGTGTCTAGAATGATGGTTTCTGTT 
CTTAGGATATCTTGTGAAACGAATCCTTGTGGTCCCTACTCTGGGCT
TTCTGAAAAGAAACACAAAGAGAAGGAGAAGACCAAACIACAA 

ACAAC^IAAAAACG AAAAG AGG AAAAGGT ACAGAGTTTTCT AGT AC GC C ACC AAGG AACGTGO &TG CACTTTTGTTT ACCTTTGAAA 
AAACTATAAAAACTAAGCCTTTTACCACCATAT

AATGAATACATTGAGAATGCATTAGGGAAAATTAATTGTTT^ 

TTCTTCCOIAGTGAAAATTTAGTGATGCTGGTGTCTTTT^ 

TAGGCTCACCCAGCTGAGAATGAATAGTACTCrcTATAGTTTTAAAAAGAGAT 
5 GTACITGGTAAATTCAGATACTATGAAATAAGGAATATGGTGA^ 

. TCAGAGGCAACCGCTATACTGATTATGTTTTGTATTCTTCCATAGATACTCTC 

GTATATATGAAGATTTTTTTGAAAGATAAATTTTCTGACGAATCTAG^ 

TTGCCTAAGTAATAGAAGACAGGGCATAGACTACTTTTTTGTTTGT^ 

GGGCTAGTCTCGAACTCCTGGCCTCAAGCAGTCCTCCAACCTCATCTTCCCAAAGTGT^ 
1 0 TGGCCCAGTTTATATGTTTTGTTTGTTTC 

GTATTTATTGGGGAAAAGAGACCTGCCACCACTTAAAAGTAAAAAG 

GTTTC^CCCAGAAGAATTGTTAAAAACGAGTTTTGGACACATTTC 

CCAGCTTGTGATCATGATGTCCCAGGAACTGATGCTACATATTCTCAAAA^ 

GTTACTGTTGTTTTACTTTTCAGGTTCX3AGCCT 
1 5 TTTGCTGCTGCGTGGGCTTCCCTTTCTCTGGGTGGACAGC 

CACTGACCTGGCCATCTTGATGGTTCCCAGAAGCTCAGCAGTGAGAAGACTCGGTCT 

TATAGTGAGACTG AC CAG CACATTTT CCAGAAGGTT ATTTCG AAT AG AG AAGGTTGACA.GGAG AG AT^ 

ACTTGCATATCACAAAATTTGTATTTTAATGGTATACAGTGTA 

GCAGCCTCTTCACrAATTCCAGAACATTTTCATTTCTCTAA 
2 0 TCACTTAGTCTGATTTCATCATTAGTAATATTG 

TTTGGGTAGAATTATGTTATCTCTTTTCTAGAAGGAGAGGAAAATGCGCAAAATCT 

TATAATTTTGTTTTCCTCAGCCTCAATTTCCTAAAATAAGCCCAGAGGTCCCCTCTAG 

CATATTCTCACTTCACTGAATGACAGAAAGGATTTATTAGAAATGGTTGTAAGTC 

AGAGTTTTGCi^CCTTGAATCTTTGAACrTTCAGTTTC^ 

2 5 GACAAGGAGAAAAGCAGGTGCCCTCTTCAGGCATGTTTCATAATTGTTCT 

GCTGCAAGCTGCCCTAGGAATTTTGATTTCTGAGATAAAGTGACT 
TGGATTTTTATTAAACTGATATGAATAAATTTTTTGGACTTGGTGCATA 
TTAGGAACACAGCCTTCTTTGAGGCAGTGGTCTCTATGTTCACTTTGCT 
AATCCAGATCCTTCACTGGAATACAGTTTTCAGGTAAATTGTTTTAGGACr^ 

3 0 TTTGTCTTCAGTGTGACAAACTTAGGATCTAGCATTTACTTAACTAGTTG5GGTA 

GGTCTGTGTCAGTATTCAGG CAGG AGGGGGAACAGCTGT AG AAG CTGG CATTTGGTT AGGGTG CT AAAAT AAG I T C I'TTAAACATT 
CAAATTCTATCACAAGCAAGGGGACCCAGTATGCTTTTTTTG^ 
GTCGGGATTCTTAATCCTTGGATCCCTCCACAGGGGGCAGCTGCATGTAAGGAGG 
TTCTGAAGAGATGTGCCAGGCCAGGGAGCATCCCATGGCTGCAGTGATTGCTGGGCTCTATGGT 

3 5 GTTTTGGTTCAAGATGTTTGTCTCAAAGGATTGTCTTTTTATATTTATATTTT 

CTCTTCTGTATATAAAGTAAACCTAAGTTAGGACGTTGAACAATGAATAGATTTGGCAT^ 
ACACGTGCCCTACTTTTGTTGTAATGTTGTCAGCTTGGTTATTTTTCCATC 
TAGAACTTTAAGGTACTAAGTTGGGGAAATCAATGTTTATTCCTTTCCTTGTAAAAGCT 
AATCAGTTCAGATTGTGGGGTCAAATATATGGTAGGTTTCCATGTAAGGCAAGAAAACACCTTAGGTT^ 

4 0 TGCCATGCTGGCAAAGATAGCTAAGAAATAGTGAGAAAGAATTCTG 

CCATTCACCTGTACAATGAGGAGAAATACTATCTACTTCACAG 

GTACGTGGTTATCCTTATTTCTGTTAGCTTCTTTTCAACGAAATGACATAr^ 

CACTCTGACCAGCAATTTTTTTTCTCTTTAGTCCACCACAAAT 

GAGGATATAAAGCCATTAAAGAGACCTCGAGATGAGGATGAGTGAGTATTTTCTTAAAACTTT 

4 5 AGTTTAAAGAATAAATGTGATGTGTTTCnTTATAATACATATAGAAACTGCATTAATTGGCT^ 

AATGGACTGATTTCTCTGAACTATGAGAAGTAGAGTTTGCTAAACAAGTAGTATACCCAA^ 
TTATTACTGAAAAGGCAATAGATATTTATTGTAGAAATTTCAAATAATACAGAATTCATGAAGAATA 
AATTTCTTATTGCCTACCACAGATAACTTAAAGCAGTTTCTCTTAGACCTATTCCrGTCA 
TAAAAATAATAAAACTGCTCATAAAACGGTCGAGGCTTACATGTATCrrCTTTCCCCAG^ 

5 0 TTTTCACTCTTATATAGCTTACTATGCTATTGTTATAGTAGAAATTGGATGGAAAGGGAGAA 

GTATTCATTCAATATAGAATTTCTTTCCTCCACA^ 
AAACTTTCATCTTTTATTTTTATTTATTTTATTTTATT^ 

CCCCCCACCCTATTTTGCTTCCAGGCCGGTGTAAGTGGTTTGTTTTGTTCTATATAGGGCT 
TTCATTATTTTCTCTATTTCTGTTAGTCTTGACCGra^ 

5 5 AAGAGTTTGTAGGTCTCCTATGAGTGAGAACTCCTGAATCATAATTATGTTAACTGTGTATTCATG 

TAT AAAC CT AAG AAAATT AAAACAGAAG AT AC CAAG AAG G AG AAG AAAAG AAAACT AG AAG AAGAAGAGG TT AGT AAAGAG ACTT A 
C^TCCTTTGGGGCTTGAGTTTGGAAGTGGGAGTTTTCCAGTAAGCAACTTCTT 

TT AT AACATT AGGG AAAACAAATTT AAT AAG TGGTGG CTGTGG CACTGG T AG TTTT AAG AT CT CTGTG AATCC CCACAGTGTCTTA 
AGTTAACCACAACCAGCAGTGATTCTGCTCTCAGCCTTCCTGGGTGTCTGTCTACCCACAGCT 

6 0 AAAAACTATGTCATCTTCATATATACAGAGGCAGTGTTAACAAGAAACCACATTTTG^ 

/UVf ''j'TT'lTG ctg ATC ATTC ACTTT AGGT AACCATTC CT ATCT AA71G AATGGGT ACC CCTTTG AGATAT AATTG AC AC CCA 

AACTCTTTACTGCAGCTCCTCCCAATCTGGCCCCrrGCATCTTGTGTCT 
ACrTCACACTAGCCTCCTTTCTTTCTrTTCCACACC^ATT 
GGAACACACATGCTCCCTTCTTTATCCrTTTGGCCTATAACCTTCCT 

6 5 CTTGTTGATGCTTGTGCTAGATTGAAAATAGAATCATATC 

TAGCTCTTATGCTACATTTC^GGTTAGCCACTGTCATGG^ 

GTAAAGAGAATGATAGTGAAGGGGCAGGGAGTCTTAAGAGGATAATCTATCATAATAGAATC 
AATAGGTGCTTACCATGTGCCAGGCACTCTTCTATATATGTATCATT^ 

7 0 TATTATCATCTCCATTTTGCJ^GATGCAGAAATAGAGTATAGAAATATTT 

AAAGCCAG C ATTTGATC C CATG CAGT CTG ACTTCAGTG CCAG CACAC CAGCT ACT AT ATT AT C CT AC CTCC CCTGGG ACTTTG CTG 
ATGCTCTAATTCTGCAAGGAGGGAGAAGAAAGGTCTTTAATGAAATCATATG^^ 
ATGATCAGGTCCTGAATTTAGAAGGTTGCCGTTTTTTTGACAACT 
CAAGGAATTGAGAGTGAAATGTTAAGTAGTGA^ 
7 5 TAAAGGAATATAAAACTTGAGTTCrCCAAGTGTTTTTT'lTTnATGTTTATATT 
TAAACCATTGGGAAAAAAGAACAGTTGACTTCATTTGTTA^

GAAAGACTGAATTAATACAATCTTACTGTGGCAGGGACCCCAGATGGCAGAATCT 

TTTTGTTTTTTGTTTTTTTCTCTTGTTTTTC 

TACAATGTAGCCAGTAGTTTCCTGATGGCACCAGCTGGATGCCTGCTCAGAAGCT 
5 CTTTCCTCTGGTGTGTTTGACCCTAAATAGTAAACACTATCTG 

TATTTATATTCrCCGTGCCTGTTTTAAAATATGGATGGTACTGTAAGCTTTCCT CTl 
ATTATGAGGATCTAAAATGTTTATTCCAGTTATTCCCATAAGTCACATTCTGTTATT^ 
TTAATAACAGCTATATGACCTTGATCAAGTGACAAAATATGTTTCAAAGTCCT^ 
CCATACACTTTCCAGCTGTGCCACTCnTTAATTGATTTTAAG 
1 0 AAAGACTGAATAAGTAGTTAAATTATGTTGAATAGGCAGGACAA^ 

TTCAC I rGrTGAATATTACATTTTTGTTTGGGlTITGTGTGTATGTTTTGAGATTCATCT 

AGGTTTGATGCCCX5TTTATTGTGTCACTGTGATCAGCTCATTTA 

GGAATTGGTGTCATTTTAGAACCTATCAAATACTTGCTTACTGCA 

TC CACTAAATCAACTGGG AATTTTCTTTC ATCCTGAAT ACT AAAT AT CT AATG CAAAG C C CCGTCT AG CCGTT ATTTG ATTGT ACT 
1 5 GG AAGAGG TTT AAAGATGAG T AGGATTAAACATAAGTGAGG AAAAT ACACTCGG ATT ACT CT CT AGAG CAGT ACTTTT C CAAACTC 
CTTTGAGGTTAAAAAAAAAAAAGTTTCCCAGTCTCTTCCCCTGCAGAGTGT T ri TT AACTACTTTCTCAAGTAGTTGTTTGTCAGG 
AAATITTAGGAAATACTGCTGTAGAACAGAGCACCAGTACATTGTTATT 
GTTCTAATACTCTATGATTACAATTTTTCTTCCTCATGGCT 

TTT CTCACCATG TTTCT1TGT AGGATGGTAAATTGAAAAAAC C CAAGAAT AAAG AT AAAGAT AAAAAAGTTCCTGAGCCAGATAAC 
2 0 AAGAAAAAGAAGCCGAAGAAAGAAGAGGAACAGAAGTGGAAATGGTAACATCT 
CATATCCTTACGAAGCAAGAGTATTGAGTTATTAAAATTAATATCCTGAT^ 
TAACTITCTGAAGTAAGTTTTCCAACTTCTAGTGAAATCATTAACTC 
GTGAACAGAATGGACTGTAGAAGCAGACCTATGTATAGGTGGGAATTTAGC^ 
GGAAAGGACAGATTATTATCTAAGTCX5TGTTGGGAAAACTGCCTGTTCAACAACA 

2 5 TATTTTTATAATTCTGCAGTAGTGAAGATCCTTTGAAGCAAGACATAACCAGAGCCA 

GCTGATATTCCTAATATGAAAAAAAGCTCTTTTAAAACAAATATGAAAGGAATGATCT 

GTAAAATAGTTAATAAGCATCTAAAAATGTGATCAACTTCATAAGTAATCAAAGATGCAAAA 

TGCCTCTCAGATTGGGAAAGATTAATAAGTGAAAATATCTAGAGTTGCAAATGTA^ 

3 0 TG AAAATTCAGT CTTTATGAAGG ACAGTTT AG T CAT AGCTGGTTG CT CTTGAG AAGTCTATTTAGGTGGAGAACTTTTCT ACTTTT 

GCTGTTTAGAATCTTGAAATAAACATGTTTTTAAAAAATACT^ 
ATGATTTTGTGTTTATGATGAAAAAACTCAAATGATTGACAGATTTAAT^ 

CT CC CATTCT CATTTTC CTTTCCAG CTTT C CAG CTTTG AGGT AATC ACTG TTC AT AATTTG ATGT AT AGCCTTCCAAAC CTGTTT A 
TGTGCCTTTATGGCTTTTGTAATTTAGAACTTTGATGTCCACAGCACCTCT 

3 5 GCAGAGGGAAGGAGAGCCACAGTGGCAGAAATGTGGCCACACGGTGACIAGCT 

TAAATGGG AAG T AT AG AAAAGGAATAT CACAG AG CAG AG CAACC CTG C CT CTTT CT AGAGGT C AGAG TCAG AAAGT AAAAGG CCAG 
GCACAGTGGCTCGTATCTGTAATCCCAGCACTTTGGGAGGCCGAGGCTGGTGGATCACCTGAGG 
GCCAACATGGCGAAACCTCATCTCTGCTAAAATTACAAAAATTAGCCGGGCGTGGT^ 
AAGTTGAGGCAGGAGAATTGCTTGAACCTGGGAGGCGGAGGTTGCA^ 

4 0 AGAGCGAGACTCTGTCTCAAAAAAAAAAAAAAGAAAAAAAACCCCACATG 

GAACTAATACATTTTCAGTGATCTTGACCTTTAAGAAAATGATTTTTA 
GAGTTAAGAGAATGGTAGGTTAGTAATTCATCACTCACKSGT^ 

TTTCCTTATCAGGAGCTCCCTGCTGTCTCCCCTTTACTTCGGAGCTTTGGCTCTTT^ 
CCTTGTATGTGTAGAATTAGCTGCCCTTGAGGGGTTTAGGAGCCTAATTT^ 

4 5 ACATTCACT CTGAC AGACCATGACITTTGGTTTTT ATTCTTATTTTGGC TCTTTAACTAAAAGTTGT 

TTTTTGAGGAATGAGCAGACTTTGCTACAGCTTCTAACGCATGCATTTCTCAGG 

G CACAG TG CTTGAATGG CACACCG AGTTTTTGG TG TTG AAAAT ATC C CTCTT AACTG AT AG CT AGAGG CTGTGT AT C 

ATACTCTGAAGTCAGTTATCCAGTCCTCAGTGACTGCCACTGATGACCTAGTCGGGGAGTTAAATC 

TATTCCTTATGCTATCCTACTAAAAGAAGTACACACTTTACAAGCT 

5 0 AGACCTGTTTGTCTTTACCAGAAAAAACCACTTACTCTGATCTTTATCTCCCT 

GTCCCTCAGGGTCTCTTTGCTATATCACGTTATCTCCATAGAAACATTGCTA 
AATCCTGTTTATCCCAGATGTCCTGATTTTTAACTGTTGGCCATTCT 

CACTTTTGTGTCAGGGCTGACCTGGGAATATAAGTTGCCAGTTAATTCTCAACAGATGATTCT^ 
TTATTGCAGGTGTTAGGTGTGACTTTAAATGGAGTTTTCAATCTCTTAGCAAAC^ 

5 5 TTTTAAGGAATTAGTGTTGCAAGACCAAGACTCIATAAATATA^ 

ACATTTTGCTCAAACCAGAAAAACTTCCCTTAATTTCAAGGATTAGCATTCAGAAGC^ 
AAAAAAATTAAATATGAGTTTTATAGCATACTGTGAAGTGCTTGCTAACTC 

ACTTTCCTGTCTTT ATATTTCTAGCALI IT'TCCACCTTTCCCCCTTCAAGCAAATTTTATTATTAAATTGGTACCT 

G T CAAG CTG C CAAACKSGAAG CC ATTT CGATG TCT AAGT AC AT GTTTTTGTCT CGGGGGG CGG ATAT 

6 0 ATGCACTCATGGGGAGGCTAAAAGGACAGAACTGTCTGCTTTTATTTGTATAATAAT 

GAAACAGATATTCTGTCTTTCTGTCTAAACTGAAGTTTAGTCTTGTGAT 

TTGCTGTGGGTCTTCAGGATACACATGTTGAAATAGGAGTCGTAAAAACAGTCTGGAGTCTTGATCT 
G CGTGGTGGCTCACG CCTGT AATCCCAGCACTTTGGAAGGCCXSAAGCGGGCAGATCACGAGGTC^ 
TAACATGGTGAAACCCTGTCTCTGCTAAAAATACAAAAACAAAACTAG 

6 5 CAG G AGG CTGAGGCG CGAGAATGGTGTGAACCCTGG AGG TGG AG CTTG CAGTG AGCGGAG AT CACGC GACTGCACTC CAGCCTGGG 

CGACAGAGCGAGACTCTGTCTCAAAAAATAAATAAATAAATAAAAA 
TAAGGGAAAGGGTGGCGTTCGGAATCTCCTCACGGTTACTCTGTGACTGCTC 

ACAT AT AT AT AT ATTTTCT CT ATTTGG AGAGT AAT AG ACTGTAGG ATTGGGTGGTTT C CTG CATTG AG ATCCTT CT ATT AATAT AT 
C^TGACTCTGCCAATTTCATACCTGAAAAAGTGGTAGGTTAGTTAT 

7 0 TAAATATCTTTGTTTAAATTGTGTAAAATACATATGACATAACATTTATCATCTTAACC^ 

TAAGTACATTCATATTGTTGTGCACCATCACCATCTCCAGAACTCTACAT 
CCCACTTCTCTGTTGCCCCAGTTCTTTTTTTTTTTTTTTTT^ 

GGTGTGATCTCAGTTCA.CTG C AACCTTCG CCTCCTGGGTTCAAGCAATTCTCCTGCCTC AGCCTCCCAAGT AGCTGGGATTACAGG 
CTCCCACCACCATTCCCGGCTAATTTTTTATATTTTTGGTAGA 
75 ACCTCAGATGATCTGCCTOCCTTGGCCTCCCAAAGTGCTGGGAT^^ 
TGGTAAGCACCATTCTACTTTCTGTCTCTATGAATTTGA

TG ACTGACT CATTT CACTT AACAGAATGT CAACG CTCGT C CAT ATTGT AG CATG TTT CAGAATTTTCTG ACTTTT AAAGG CT AAAT 

AATATTCCATTGTTTGTCCTGCATTTTGTTTATTCATTCATCTT^ 

GTGCTGCTGTGAAACATTCGTGTACAAGTATTTGTTTGAATATTGAGTATAT^ 
5 TCTGTGCTTAACTTITTCAGGAACTGCCATACT 

GTGCTAGCTTCCCTACACCTTCACCAACACTTGTTATTTTTTGr 

TTITGATTGGTATTTTCCTAATGATTATTGATGTTGCATATTTTTC 

TGACCATTCAAGTTCTITGCCCCACCCCCACGTGTT^ 

AGTGGCGCAATCTTGGCTCACTGCAACATCTGCCTCCCAGGTTCAA^ 
1 0 AGGCGTGCACCACCACGCCTGGCTAATTTTTT^ 

CTGACCTCAAGTGATCCCTCCGCCCGCTTTGGCCTCCCAAAGTATTGG 

TAATTTTATC r n'TT T TTCCCCTCCACCTTAAGACTGAGACTTTGCCCA'r 

GTTTATATATTTTGTATGTTAACCCTTTATCAGATACATGATTTACAAATA 

TGAAGTGTCCTTTGATGCACAGAAGTTTTTAAGTTCAGTGAAG 
1 5 CAGGCTGGAGTGCAGTCACGTGATCTCGGCTCACTGCAACCTTCACCTCCC^ 

TAGCTGGGATTACAGGCGTATGCCCCCATGCCTGGATAATTTITGTATTrTTAGTAGAGAC 

GCTGGTCTCGAACTCTTGTCCTCAAGTGATCCGCCCA^ 

GCTGCAATTCATTTATTCTTTTTGTTACCTGTGTTT^ 

TCCCCTATGTTGTCITCTAAGAGTTTTATATTTTTGCT CATTTTG AATT AATTTTTGTGT AT AAT 
2 0 GTAAAGACCCAGTTTGATTATTTTGCCAACACTATTGTTGAAAAAACT 
AAATCATTTGACCATATATGTTGACTTTGGTTTO 
GTTTCTGAACTTTGTTTAAAATATTGGACTTGTTCC^ 

AGTAAATATGAGGGTAGTAAGTTAGTTATCTATTCGAGTAAAAGAAACTTGAGGCATAGCCAGTT^ 
GATCCTGGTCTATAAAATAAATTTAGATAAAATAGGGAAGTTTTTAATATGGACTG 

2 5 GTTAGATGTGATAATAGTTTTGTGGTTATGTGGAAGAATATTATTTTTA 

AAATCTGCAACCAACTTTGAAAATGGTTCXK5CCAAA 

GATGG AGTCTCACTCTGTTGCC CAGGCTGGAGTGCAGTGGCACAATCTrGGCTCACTGCAACCTCTG CCTC CTGGGTTCAAGCAAT 

TCTCGTGCCTCAGCCTCCTGAGCAGCTGGGACTACAGGTGCGCGCCACCATGCCCAG 

GTTTCACCACATTGCCCAGGCTCCTGACCTGGATCACGACCTCCTGACCTCGTGATCT^ 

3 0 ATT AC AGGCGT AT ATGG TGTTT ATTGT ATT ACT CATTTTTTTC TT ATG TT CGACCATTTT AAAAAT ACATTTTCAAAAAGTTGTTC 

CCACTGAGCITTAGATTCACTTATCAGTGTTTGCTAGATAGT^ 
AGCTATCATTTCCTAGTCGTCTTCCTITTATGTCCrTTCTCATAGGGTT 
AGGAGGCCTTTTCCTTACAAACAATGTAAACAATCC 
CCTCACTTGTTACTGAGCTCCTTCATTGTTGATCT 

3 5 CACCACAGTATTTCATCTTACTTGTGAGCGTTGGC 

GCCCAGAAGTC^TTCCAGAGCACTAATCAGTTGAGCGGATAATTATT^ 
GATCACTAAATGAGGCTGTGCTTTGTCTTTTAAAGGTC 
TAAAGGTCCAGTATTTGCCCCACCATATGAGCCTCTTCCAGAGAATGTCA 
GATTCTTGGCCAGGAAAAGAAATCrGCTTCTATTTAGGGATAGAAAACAA 

4 0 CATTTTGTTTTATTGCCATGCAATTTTGAGGAAGGGGCATCAGGTC 

TTTTGATCTGAACTTCATTTCTAGGGAGCAACTCre 
CAATATAACAATGCAATCGGACAAGCTGCTCATCACTAGA 

CTAGATAGGGCrGGTAGGCATCTGTCACCTATTCTCTGTAGCATAACACTATGCC^ 

4 5 TGCITTAAAGAAATGCCCTGCCTGGGTCTCTTAGA 

AAGGATTTGAGGGTAGCCTTTGACATGCCTCCTGTTGGCAGTGTAGCAGTATGTAGATGGTA^ 
AGTTTTATCCAGAAATGGTT AGAG CATTGTTCCTCAT ACCCTT ATGTGCTTATG ACTCACCAAAGTGGATCTGA 
GTTCTGGGGTGGGCTTCTTTTCCCTTTCCCCACC^ 
TTCCTGACAGTCCCCCTTCTCCrTTCCC(^CATTCCT 

5 0 ACCACCCCATTTCTCCACCATITTTCCCACCCTCAGCCCCCCACTGTTrT 

AAGTTCTGGG^TGGGATTGAGTACITCTGAACCAAGTC^ 
AGCATCAGTGGTGAACACTCAGGCATTCTGGTGCATCTCTCTTCCCAC 

ATTTATTAGCCAAGTGTGGTGGTGCACGCCTGTAGTCCCAGCT^ 
55 AGAGATTGCAGTGAGCCTAGATCGTGCCACTCCCCTCCAGCCTGTGTGACAGAGTGAG^ 
TAAATAAATAAAACATATTTATTGAATGTATTTGTGGTCAGACAC 

TCTTGGGGAGTTTGCAGCCTAACAGGACTCC ATAAT CTCAG AGG CAAATGCTTATGTTATGT AAACATTTTGACCTGTTGACC AGA 
GAGCCTAAGAGAGTTCTTTCAGCCCTCACGCTGGTCA 

GAAAGGGCCTAGAACCTTGAAGAGCACCATCTCAAAGCGTAGGTATTCCGTAGGCCTGGTCTGG 

6 0 CTCAGCTGGTAAAGCCTCACTTTGAGGTTCAAAGTGATGGG 

AGATAGGAAGCAGAGGCCTGGTCTAGTTCAC1AGGTTGCCATAGTCCTTTCCT 
GCITTAGGGCTCCAGCATTTTCTCAGCTGGTCCTCTGCrrGGGATTTCTTCT 
AAGAAGTCATATTTGACTTTACTTCAAACTGATACAGTAAGTGCTGCCTACTCT 
CTTGCCAGGGAGGTGC^GGATGGGCTAGCTGAAGCAGTGAATTACT 

6 5 AGTTTCCTTTTCCTGXSAGCTCAGCCTTTrCCACAAAGTGTTAACATT 

AAAACAGCCCAAAATCTCATTTTTTTTAA^ 
GTGC^GTGGCACAATCTTGGCTCACTGC^ 

TTAACAGGCACGTGCCACCACACCTAAGTTTTGTATTTTTAATA 

7 0 CTGACCTGAAGTGATCCTTGCACCGCCTCCTAAAGTGCTGG 

TGGTGTTAACAGTTTACATTTTTTTCTTGCCATTTTATTT 
ACATGTTACAGTTTTCTAAAGAAGTTCAGGGAATCITTTAA 
CTTGGGGGTTGATGGGGGCAGCTGCGCCACCTGTTGGCCTTTT 
TCAATATTGTGCTACAGTTCTCTACACCCCTGCAJ^ 
7 5 AATGGATTTCTGGTTAAATTTTAGTTTTCTATAAAGAATGTGGGCAGCCTTGGT 
TGGAATTATAGTTAACCTCATTTTAAGTATAGCCAA^

TTTTTAAGAAAGAATGGAAATTGGCACTAAGAGTACTGTCAAAAG CATTTGTTTGAAAAGTG5GATTAAATTCCCAAGTACTTTTCT 
TTCATTGCTrTATTAACTAAATCAGGTATATAAAACCCATAGGCCCTATTCAGA l 'l" 
AAACATCTGAGTCCCTTTAGACAGCTTITTGTTTATTTGA^ 
5 TGGCTCACTGCAACCCCTGCCrCCTGGGTTCAAGCGATTCTCATGCCT 
CACGCCCGGCTAATTTTTTATATTTTTACTA 

ATCTGCCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACT 

ATTGX3ATACACCAGGAACCCCAGTCATCCCTTGTGCTGTGGATGCATCCT 

TCCAGAGCAGTGTGTCGCAAACTCTAGTGTGCATCCAAGTCTTGTTAAAAATGCA^ 
1 0 GGGGCCTGGGATTCTGCATTTCTAAACTCCCACATGATGCCAATGCT 

AAG AAGG TCTT ATT AGG CTTTT CAT ATGGGAAAT CTGGG CTTGC CT AG T ATGGAT AT AT ATTTTTTT ATTCAT ATTC CAC C CAC CT 

AACTTTGAAGAAGAAAGATGGCT A r rrClTlT CCCCACAATTCC r iTCTTC 

CAGAGGCTGCTTAGCTTAGCAGTGGCAGGAGTCCCTCTGAGGATTTGA 

TGCTGAAAGGAAAAGGAAGAGTGACTGGGCCTCCCCCAGGTCATCCCTGTCCTGAGC^ 
1 5 T CTC CAGAGAATG ACCCAGGTAAAT ATATG CT AAAGAG AAGGAG AATTTG T A CAAGTAG AT AT AAATCAGTATGTTGT CTTTGTTG 

TTGTTCXTACTATGTGTCACCAGACCTTTATATACCATGAGAAGGCAA^ 

TGCTTAGAACATGAATACTATACCTCJUlTnTTGGAAC 

AAGTAGCTACGrrCTTTGCAAAAATGCTCGACCATGAATATACTACCAAG^ 

AAGGTACTGTAGCCCATGTGTTAATATCCTAGTACCriTGCAAAACAA 
2 0 TCCTGATGTAAAATTTGAGTTGTATGGATTTTGTTGTATT^ 

TCTGTCCAACAAGAGTACTTGTATATCTTTGCTATTGGCTCTCATGTGATG^ 

GAAAACCTCAACTCCAGTTGCGCACAC<JAAAGTTTGTCGCT 

GATGTCCCTCTCGCTGTTAATTCTTGCTCATTTATTTTAGCTACTTTCT 

TTTTTT AAAAT CTCATTATTCTC AATTTGGAT AATG TGG ACAG AGGC AG CAG T AAAGTTT ACTTCCTGAACTAG CTCAG AAAAAG C 

2 5 AGACTTAATAAATGAATAGAATAGGGGTAATGTTTCCAAGAGAGAATCAGT 

G CTGTGTGCCACGGAGT CT AAGAAAC CACAT AAG AATT CA1TTIT1"!' CGATGAT AAGTCATT TT AAAACATTTT AAT ATT AGT AAG 
TACATA'ITTTATAATAAAGTAATACAACTGTAi T riTTTGTTTTTTCAAAATTCATTAATTGAGATTGG CTACTTCCAGAAACCTT 
TGACTGAAAAAATGGTGCTTGGGTGTATTTGCAAAGAAACCCAAGGA 
ATTrrCl'TlTGACTAGGAAATGACTAATGAAGAG 

3 0 ATTTCAAAG CCCAGACGGAAG CTCGGAAACAG ATGAGCAAGGAAGAGAAACTGGT ACTACAG AA 

CTGGAATTGTGATTGGTTCATTTAACTTTCTTCTTGGT^ 
AAGACTTAGAGTTCTCAAAGTCTAAATTTTCTTAAAGGTT^ 

GTACAGGAATCTTGGTG CTTCAGTGACCTAAAAACCAGTC7VGTAACTCCAGCCAATAAT ATAGT AATTA CAGACCTGT 
CCCACAGTGGCTACTGGATATTAAGGATTTAAGACAGATGAAATCATATTGGCCATGTACCTCCCTGCT 

3 5 CTGGAATCTTGTGATGCTCATCCATCTCCATCTTCGAG^ 

TGTCATTTTTCTTCZTACAGGCTGCCTGTTTTTGTAAAAAAAAAAAAAAAAAATT^ 
ATTATCTACXX3CTGTGTTTATGCAACAATGACAGAGATGAGTAGTTGCAAAAGAGAGTAGTTG 

AG CCT AAAAT ATTTT CT AT CT AG AAAAAG TTG TCAGCCCCTGAACT AAAGG AATCCAGG CT AGAATTTCATTTTT AC CAATCGG AA 
GTCCAGAAGTTAAGGATTTTAAGGAATTCITTGCACTTCAAATTCCTGTA 

4 0 TT CAG ACCATTG T ATTG AT ATAACT CATTGTTTCATGGT ACTT ATTTG GT CACT AG TTT AAAAATT AATG CATT ACT 

TTAATTGCATCAGACTTTGTGTCTCAC^TAAATCTTATCCTCA^ 

G CATTTTG AAACTT AAAAGAT CACTTTGGAG CTTTG TTCTTAG AAT ATTG ATC CAAAG AT AG ATTTT AT AAACAT C CCCTTGT CAG 

AGCCTCTGCCAAGGACACCTGCTGCCAGCAAGGTCTCAATCTGTTCCCACCCA 

AAGTATATAAAACGCAGAGTCAATAGAGGGTAAGGAGGCTCAAAATCTCCCCCACCATCCCTCAC^ 

4 5 GT AT AGAG TGT AT AT AC CTG TAT AC CT ACAAAAAAATT AGTG ACAG AT CTGT CTG AGCTT AAAAACC CACGTTTTT AAGCTGTTT A 

ACrGCTGTTTAATTAAATGGCTTGAGCTATTCTTTTATGAGGCTCTATCAG 

GTAAAACAGTATGAATTTTTAAAGTACATTTAAAAACACCCAAATAACA 

TTTGAATCTAGTTGAAATCAATCCTTATAGGCAGATCTTTT^ 

TTC CTTT ATTTG AATTTATTTTTT AGTG ATTTTTCAGACT^ T CT C CCACAAG AG AT AAAATGCATT AG 

5 0 TAGAGAACAGCAATCTGAATCTATTTTGAGTACTITCTTGCTGTCTTCC^ 

ATATGGATTCTGTATTATOGATAACCACAAAGAGAGGATTGCTA^ 
ACCACCCCAAGATGGGCATGCTGAAGAGACGAATCATGCCCX3AGGATATAATCATCA^ 
TATGGGCCAAAAAGGGGGTGGAGGGCGTCCTTTATTGTACrre 
ACTTAAGAGCAGGAauSTATTTCATGCGTAGGTCTCCAGATGTTTTTG^ 

5 5 CCAAGACCCCCAGTGGATGCCTGCAACITCGACCCAATTGCTGTCACTTGGAA 

TAATGTCTTTTCCATGTTTATTAAGTACTTATCACATA^ 
GAC'TTTCTTTTTCCTTCTTCACAATTTCACAGATGGAAGGTC 
TATTAAGTCGATAATTTCCACCTTTTCATTTAAAGCACTCTGTGCAGCT 
TCACACTTTGGGGCCTTTATTAAGTAAAAGAAGGATTACTTGAACACAAGCAATGTA 

6 0 GCAGCTACTAGGTTACCCATGGGCAAGTGAGATATACAGCAT 

GAGATTTCATCATGCTCCTCAGAATGATGTGC^TT^ 
ATGACACTTGGCTC^AGGGTAAGTAAAACCATGCATAAGGTGGGACTACT 
TCAGCAGATAGGTCCACTTC^GGTCATGAAAGGTGAAATTATTCCTCACATCTTATTTCACT 
AGATGCCAAGGTTCCTTCTCCTCCTCCAGGACATAAGTGGAAAGAAGTCCGGCATGATAAC^ 

6 5 CAGAGAACATCCAAGGTTCCATTAAATAGATCATGCTTAA 

GTTCATGGTGCTGATAACCTTTTTTTGTTGAAATGTAACGTTCTCGTCCTCTAG 

TGAAAGAAGGGAAGTACTTTCATAGTTAGCATTATGGTGATCTTCCT 

TATTAATTCACAAATATTTACTGAAATCCCAAATGTGCCCCACACAGTA 

AGACGTAGGCCCTGCCTTCATATTGCTTAACGTCCAGTGGTCAGAATGTGTTGAAGTATGGTGATTGTAT^ 

7 0 GTCTTCTACrrCCTCACCTCTCTTGXjTTATGAGAAGGGAATG 

ACATACTTCTGG CTT C CTG CAGTCTCATCTGGGT AGGTTCACCAGGG C CAAATGAAATGTGTGTGAAAG CACATCAT AAAACTG T G 
TTACATACCTATGCACTGTTATTTATTGATAGTGTTATTGCA^ 

AGCTATGTATATCTAGCTTTGGAGAGTCCCACGAACrTCTTCCCTCCAACATACTATGW 

CAuTCAAGGAATT AGTT CT ATG TAGACTGTTAACTGTTTT ATGTG C a .GGAG AC CTGGG ACT AA AH ACCTT CTTG ATCCTCGTTCAT 
7 5 AGGTTATAACATCTGTTAAAATGTGAAAGCCACAGCTGAAAAGGCTATGTGAATAGGCAAGACT 
TAGCCCTGTGTTCTTCTGTATATCACTACTTTTCTAATGAAAAATACCTGTTTATATATTTTCT

TGGCTGGGTGTGGTGGCTCAAGCCTGTAATCCCAGCAC'ri TGGGAGGCCGAGATGGGCGGATCACCTGAGGTCAGGAGTTCGAGAC 
CAGCCTGGACAACATGGCGAAACCCTGTCTCTACTAAAAATACAAA^ 
ACTTGGGAGGCGGAGGCAGGAGAATCGCTTGAACCTGGGAGGCAGAGGTCGCAGTGA^ 
GGGCAAGAGACTTAGACTCTGTCTCAAATAAATAACTAAATACATAAATAAAAATAAA^ 

ATATGGCTACAAAAGCAGAAAAATAATAGTTTTAGCCCTTAAGGAACTTATAGG 

ATT AG AAGAAAGAT AAAAGT AAAATGCATT AG AT AAACAGAGGAG TT ATTTGCAG CACCTT AGAG ATCAACAAT AACTTTATG AAG 

AAAAGTAGGTTATGGTTGGGTCTTAAAAGTAGTATTTCAAACAAGAGAGAACTG 

GTATGTTGAGGATAGGGGTTGAGAAAAGGTGTGATAAAAAGAATTA 

AAATGTCACTAGTGGTAAATGGTATCAATGCATATTTTACCACAACTTAAAAATGATCAAAAT 

TTGCACATAAGTTAACATTreGATCTTCTAAGCATACCGCACATACATATACT 

TCTGGGAGGCCCAAAAATTTGCCCAAGATTCAGTGGCTAGTAACT 

TCTGCATGCTTTCCTGTGACATCCACTGCTAAATGTGGCCAGCTTTAAATC 

AACAGTGGAGTGTCTTGAACACAGCTTCACTTCAGCAAGATTTACAGGC^ 

TAAATGCTGCAAAGGAAAAAGTACAGGGTGCCATCTGAGAGTATACCAGGGTACCTGAT 

TTCTGAAGAGGTAACATTTAAAGGATAATGTAAAGAAAAAAT*TTTTTTAAATT 

GCTTCCTCTCCIGCCTCTTTCTTCCCTCAGAACGACCAAGCTCTTGCTCACCT 

TTAGAAACACCCATTCTCCTATTCTTTCCAGGGATACAATCCTATTCATCCTGTAAATCT 

GGCGATCCCTGAATCCTAGAGGCTGGATCAGGTACCCCGGTATAC^ 

GCAGTGTAAATCTACTGGTGTGATTTACATATAATGCCTGTAAATCTTGCTGAAATGAAGCTATC 
AAAGTCCTTTAAGGGCTTCAGTCTCTTTAACATG 

CATACGCTCTAAGAAGATTGAAGAGTTACTAGCTGTTGGATCrTGGGCAAGrrrrA 

CCTGGGATTTAAAGAGGGAAATGACACCAGATTTGTATTTTGGAAACTCGTT 

GACTGGATTGGACACAAAGAAGAAGTAGAGAGATCAATGGGAAGCTGTTG 

AATGATGGCAATGGGGATGGAGAGAGGTGACAGATACATAATGTATGCTGAGTACTTGTACAGGTAAGGAT 

GTTGGAGCCTCTCTGAGGGCTGAGATCCTTGTTAGATAGACCAGGTGAAATGCCTCAAGA^ 

GACAATATCCAGCTAACTGCTATACACAATAGACATTTGTGCAAGTTTCTGGATGAT^ 

AGAGTTAGACTCTGAACCTTATGCTCAGGAGAAAGTGCCTGACTTGAACTGCAATGGGATCCCAAT 

AAAAGTGGTGCCATTAGGAGAAAAAGAAATGATTTCTTGAGCTTGCTCTCAGTTCTCTTTr 

GTGAGATCITCAAAGGTTGGGGTTTGATAGTGCCTTGAATAATTT^ 

ATTT AAAAAATGT AGATACAAAT ATTAGGG GTTTTATTTTT AGTGAAA C ATTT CAAACAT ACAGGAAT AGAT AATT ATG T AATGAA 
CACT CGTATG T C CACCAT CT GG CTTTGTAAAAT CTT AAAATT ATGT CTT ATGTG CT CAATTG TTTT ATTT CAT AAAAG AT ACTG AT 



TTTGTATTTTTGCAGTATATTTATGTGTTCATAAACAATATGTAATTTTACAATATGTAACACACT 
CTTGTTTTTAGTTTACAATATGTTGTAAACTATTGTAAGCTAAAGAC^TATTGTACAAC 

TTTT AAATT AGGTATGTT ACTGGG AT CA1T CTG CAACTTGTTT ATT C CAGCTTTG ATTTTGTGGTTTT ATT ATCTT AACCT 

ACACTTTTAATTAATCCATTTTATTTGTTACATGGTATTCTATTATATCATAAAAA 

TCATTTGAGACCATGTCTTCGCTCTGTCACCCAGGCTGGAGTACAGTGGCGTGATCTTGGCT 

TCAAGTGGTTTTGGTGCCTCAGCCTCCTGAGTAGCTGGGATTATAGGCGTGT^ 

AGAGACGGGATTTCACCATGTTGGCCAGGCTGGTCTTGAACTGACC 

TTACAGGTGTTAGCCACCATACCCTGCCTCTATTCTCTTGTTAAGAGGCATTTAGCATGGTTATAC^ 
AGTGCTGGAAGAAACACATGTTTCITGTGTATATGAATGAAAATTTGTTTTATACAT 



TTTCCAAATGAGTAAACTGTTGTCACAGAACTATTTAGTTGTATTACCTCCTCTCTTGTATATCAGATATATCT 
GACTGTTTCTGGGCTGTCTGTCCTCTTTAATTAGTTCGTGTATCIGTTTC^ 

TAT AAAGTCT ATTGAG T AGGACAAGTTTGTTTC^TTCTT CAAAATTG CTTTGG CT ATTCTTGG C C CTCTGCT GTTT CAT ATT AACT 
TTCAGATAAACTTGTCAAATTCTAATGAAAACTGTTGATAAACTTCTTGA 

TGGAATTGCAATACATTTATAGATTAACGGAGAAAGATATTGACAATACAATTGAGTTTCCAAT^ 

CATT AATTCATGTCTTTTGAATG T ATCCACC AAT ATGGTTTTGT AATTTTCTT CTT ATTT 

TTAAGTGATCTTATAGTT ili ATTGCTAATGTGAAT 

TT AT AATTTTGGGGTG TTGG T CTTGT ATCTGG CAG CAG ATAC AAG AGG TG CT C AG AGTTG TTCAGGGTTG CTGAACTCTT AGTT CT 

AAAAGTGTCTGTG&TTTGGGGTTTCTATGTAGATAATTTAATTATCTATA 

TTTCTTTAAGTTTTAATTTTTATTTTTAAACACAATTATCCATAAAA 

TGTTTT ATTAATTGCT AT ACTC AGTG TTT CTTG T ATCTCAT AC CTTCTGG GGTTTCTTG TCTTGTTG AAAT^ 

TCTTTAGTGAAGACCCAACAGTGGCACTCACTCACCTTTGTTTACCTGAAAATTTCTTT 

TTTCTCCAGTGAAGGAAGTGTCTTATAGGGAAGATTCTGGTTTCACT 

TTTTAATCTGAGGACTAATGTATTTTATCCTACAGTATTACCAATCATTAT^ 

TCTTTTTTAGAGTCCTATTAGATGAGTGTTGACACTTTTCAATCT 

TTAGGTCTGATTTTTTAAGTTCAGGGGAATATTTCATTTTGGGTGAGTT 

TTAATCTACAAGTTATTCCATCTATAATTTATTTCAATTAC 

C^TTTTTGTTTTATAATTTCATGTTCTTTCT^ 

ATTGTTCTATAAAATGTTACCTGAAGTGAATTCATGTGCrrAATTACTGTTGGCAGTTTTTCTG 

AAATTTCTGTTTGTAAGCCCTTTTAACTGGGAGGTGTTTTT^ 

TTCATTA C ' riT T T GTCTTGGAGGGAGTCTCTTAGCCCAGCTCAGC 

TTGCTACATATTTGGAGCAAACATAATTAAAGTATGACTTACTGTG^ 
TTGAAGGGTGGTAACrGGTGGTTTTACTTCATGGAGGGCTTTAG^ 
CACATTTCTGCTCAGCCTCCAGAGATGTCTCCTATTCT 
CCCTCACATAATAGTGTATCATTGTATCCCATCTGTGCTCT 
CAATTTAAAAAATTAAAGGAATCATTCAGGGGTAAAAAGC^ 
CTTTGCCCCCTTTCTGGAAACrTGGATTGGCCTACTTGCAGT 
GGATTGGGATTTTTCiAGAACCACAATAACACTTCC^ 
TCCCAC^TTTGTTTTTGCTTTTCCCCCCAGGATTTTGATTAC^ 
TGCCTGGTGCCCGTAGGATCTGAAATAGCATTGGAATGACAGCTAGCCTTCTGGAACCTGCAGTTTCAAC
CCAGATAACCCAGGAGTTTCZTTATCCTGCATAATCTAGGAGTTTACAAAAAG 
ACTGCCAAG1TTCGAAGCATTACCTGCTCTCACAACCC 
ACCTTACAAACAGAATTTATCATCATGATTT lTCnT CAAATGCACT 
5 TTAGTC CTCTT AAAATACTCTCAGTGCCAAATACACACTTCTGCC 1 1TTTT CTCTTT AG^USTCT AGAATTCTGTGGGGTTCCAAAA 

ATGTATGTCCTTACAACCTTTTATATCTAACCAC^ 
TCGGAAGCTCTCATTTTGGTGAGTTCTATTAGAGAAAATA 

TTGGTATAAAATTCAAAACGTTTTAAATCTATTTATTAGTAAGACTAACATAGACCT 
TAGGTGATATCATGGTTGAATATGACCTTAGAGTAGATTCCTTAGGACAGAAAAACACCAACAAGAGAA^ 
1 0 CTTCATTAAAATTAAAAATTTATGTTCAACAAAAGACACTGCTAAAAT 

CATGTTTATCTGAAAGGGCTTATATCCAGACTATACAAAGAATTCCTGTAGCTCCATAATA 

AT AAAAAG AGCAAAAGACTTG AACATAT ACT AT ACAAAAG AAG AT ACAGG AATGGTC AAT AAAT AAG CAAATGAAAGAGTG CT CG A 
CATCACTAATTATCACTGGAGTGAAGATTTAACCACAGGGAGATACG 

AACATTTAT CCATGAAAAAACTTGT ACAAGAATGTTCAT AGCAGTGCTCTATTCATAAGA^ CAGAAT 
1 5 AGAT AACACATT ATGGT AT AGTCAT AG AAT AT ACT AT ATTT ATCAG C)AGTT AAGA^ TATTCAACAA CATGG A 

TGAATCTTAAAAACATGGTrAAGGGCCAGGTGCGGTTGGTGCATGCCTGTAAT 

ACTTCAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGCGAA 

CGCAC^CCTGTAATCTCAGCTATTC^GGAGGCTGAGACATGGGAATCGCTTGAACCTC 

TGCATCACTGTACTCCAGCCTGGTGACAGAGClSAGACGCrGTCTCAGAGGG^ 
2 0 ATATTGTATGATTCTATTTATATAATCTAGACTAGGCAAAATTTATCCATGATGAT 

CTAGAAAACAGAATCCAATTTGCCTTCTCACCCCCAACTTTGTAATC^ 

ATC-GAT AGCAGCT AT ATT AG AG AGACAAT AT AG TGGGCCT ACTAGTTTT AAAAAGTGAT CC^ T CATCTACAATCAG TT A 

AATTAGATTTTTACCCATTACTGAAGTGTGATTGAATTAG 

2 5 TGTCTTCTTGAGATGGTTTTGTGAGACCTTGCCT 

CAGGGCATTGAAC^CrGCAGGGACAAGC^TACTTCTTGGAGGCCAGCCATCT 
ACCTCCATCTCAGCTTATTTGAGAGAATCCTAGATCCTTGGTGCAGAAGCCCAGCC^ 

TCCTGTGAATGAQ3TGCGGGTTGGGTTTTTCTAAATCT CGTTT ACCTG ACTCCT ACCTT ACATTCCAGTGG TG TG A CT AAGG AACT 
GAGCAAACGTAACCCTAGGGGACTTTGAJGACAGGCTTTGGTAGTCCATCT 

3 0 GGGT ACTCT ACTGTTGAAG GTG AGCCTTTTG TGTCCAATT^ 

CTTGTGCATGAGGACAGCTrATATCTCACCTTlTCTTATT 

G TGTCAC CAACT ACT AAATCAC C CAC CCATG AT AGCAG GT AGTT ACTGAAACTCTTGT AG CC AAACAAG TG CAT AACACTGGT AC A 

CATC^TGTGAGAGTGCAGATGACTCCTTAGCTTAACTGTGTA^ 

ATTCTTGCrrCrAGGTATCTCATGCTTCTGCACACCCAGAAGGCTGGTTC 

3 5 TTGATAGGGTCATGCTCTGTTGCCCAGGCTGAAAGGCTGGTTATCTTCACC^ 

C^TAGTCAGGAACCAGGCATGTGTGGGCAGAAATCACTACC^CTTTCCTTTG 
GATTCTTAAATTCCTTCrCTATGTTTATGTCATCTAGTCTC 
TTTTTTCCCCTTCTCACTTGTCATAATTGGCTCTTGCTTGTGAGTCr 
TGTT^AGATTCAGTTTTCACATGGTTTTAGTAACCCTTCATTGGGTTTAG 

4 0 TTTTT AAAGG CTT C C CAG AAACAT AG TTT CTCTC AATCTCATCAAG AAATG CTTTT AAATG CAGAAGTT AGAAAATGG TG AAT AAT 

TATGGCTGTTCCTC^AAGTCATTGTGTCCTTAGTAATTACATGTTGT^ 

T AATTT AATGGAATT CACTTAGATTT AATGGAATTCAGTC CTGG T AG AAAACTG CT ATTGGGT CACAG AG TTGG CAG C AAGG CTGT 

GAAATTGATGTGGGAATGGC^TGATTATGTGCTTTGTGTTGACAGAAGTTAGTTTTTGGGAC 

AACATAAACATTTTAGAGAATTAGAGCTAGAAACTTTCTTCT 

4 5 AATGTGCATTCTTAAC^GGTTAAATGTGCCAAACTCTCTCT^ 

CTTTTGTTCAAAGTCTTGCrTTGTCAC CCIATGGTG^AGTGCAGTGGCGTGATCTTOGCT CACTGCAACCTCTGCCTCCTG CGTTCA 

AGCAATTCTTCTGTCrTAGCCTCCCTATTAGCTAGCTGGGATTATAGGCTTGrc 

TTACTACACTGTCCCAAAGTAATCTACATTTGGGTTTGGGTCT 

GG CTGAAAAAATGTGTGX5AGAAGATCCGGAACCAGTATCGAG CCAAAGAGATGAAAGTCCGGCAG AG AGCTGT A 

5 0 G CCCTGT ACTTCATCG ACAAGGTGAGAG CAT CTTCCCAT CGGCATTG T CT AGTGTTG AG CTT AACAAAGC^AGTTTCTGCTCTGCC 

CCAGGCCCTGTGCCACATACTGTATATCACAACTCAACATACATATATTTGTATGTAAAACTGAATCAAA 
GCTTAGCCTTACTGTGAGC^GTAGATTCTGATCTTTTTTTAACTCTAT^ 
GTTTAAATAAACCJVTGCAGGTATCACCIATTATATAGTTTTAAAG^ 
GAAAAAAAAGTTCATGTTTTTCCCCCAGTCTTGTATTTTAACATA^ 

5 5 TGATTGCCCAGAGTAGCAGCACTCCCTCATTTGCTGCCTTATTTAA 

AATAGAAAGAAAAGGAGCATGAATATTTAACATCCCTATAGATTTCATAG 

TTCCCAGG CCATG CTGACTCATGCTGTGGCTTCAAACAGAATTGGGAG^ CCT AG AAATTGCATAAGGGTCA 

AAAGG CTGGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTCXXJAGGCCAAGGC^ 
GGCCAATGTGGTGAAACCCCATCTCTACTAAAAATACAAAAA 

6 0 CCCAGCTACTTCGGAGGCTGAGGCAGGAGAATGGCATGAACCCGGGA 

CTAGCCTC^CGACAGAGCGAGACTGTCTCAAAAAAAAAAAAAAAAAAAAC^ 

TTCATGCTCATCTTTTCTTTCTTTCCTGGGCAGCTTGCT 

GGGCTCCTGCTCACTTCGTGTGGAGCACATCAATCTACACCCAGAGT^ 

GGAAGGACTCCATCAGATACTATAACAAGGTCCCTGTTGAGAAACGAGTAAGTTAATGTACCTC 

6 5 TATTCAAa^CATGGGTTGACTGCTTTTTTCT 

G C AC ATGC CTGT AGACCC AG CT ATTCAGAAGGCTG AGACAGGAGGAT CACTGG AGAC CAAAAGTTTG AGGCTGT AGTGTGCT GTG A 
TCACACCTGTGAATAGCCACTGTACTCCAGCCTGGGCAAAATAGTISAGACCTTGCCTCTCATAT^ 
CTTGAAAGAGGGGAGAGGTATTATTTGATATCTTGTGTACCTCCm 
GTGGCCTTCTCCTATAGCAGTATTCAATC-GAGCATGCTGTAGAATAGGCCCTTAGAGC^ 

7 0 TAGTATCCCCTGGTCATGTtXSTACATTTGTCACTT^ 

TTAGTCCCCAGATCTCTGAGCCCATCACTGAAGAGGGTACTATAGCTCTGTAGTCCCAACCT 

GAAC CAGAG ACTT ACAGTGTG CAGGAT C CT AATAAT CCAG CCCACCAG C^GTC CrTCTTCTTATAAAGACACCTCTTCTCAT AAGG A 
CACCATAATCAGTAATCAGGTTTTGCTGGCTGTGGAAGTACTCT 

GCTGATACTGTGTAAATGTAAGGGAATATCTATTTTTAGAACATACACACTGAAATGTTTAG^ 
TAAATGTACTTTAGXSACCATATTATAAGTAAGTACTT^^
AGAACAAACAATTATGTATATATATCTGTAAATATACAGATGGAGAGAGTGAGCATACAT^

T AAAATGTT AACAGTGAATCTCXX5TGACGG ATGTATGCG TC CTTTAAGTTTGGGG TT A 

TTTCTAAAGTTAGGATTriTTTCAAGAGTAATAATTAGGATtCAC^ 

AGAACAAGCAGCCC 


HUMAN SEQUENCE - mRNA 
CAAATGCGAACITAGGCTGTTACACAACTGCT 
CAGCCTCAGCCGTTTCTGGAGTCTCGGGCCCACAGTCACCGCC^ 
CGCACAAGGCCGGTCGCCGTCTGCGTCTCCCCCACGCCGCCTC^ 
1 0 C CAC CTC CACAACGATTCC CJVGATrcAAGCGG ATTTCCGA CGAGAAC 

ACCGGCACAAAGAACACAAGAAGGAGAAGGACCC^GGAAAAGTCCAAGC^ 
AAAGAGAAGGAGAAGACGAAACACAAAGATGGAAGCTCAGAAAAG 

AGAGGAAAAGGTTCGAGCCTCTGGGGATGCAAAAATAAAGAAGGAGAAGGAAAATGGCTTCTCT 
AACCTGAAGATGATGGCTATTTTGTTCCTCCTAAAGAGGATATAAAGCCATTA 

1 5 AAAC CT AAG AAAATT AAAACAGAAG AT ACCAAGAAG GAGAAGAAAAGAAAACTAG AAG AAG AAGAGGATGGTAAATTGAAAAAACC 

CAAGAAT AAAGATAAAGAT AAAAAAGTTCCTCAG CCAGAT AAC^ CG AAG AAAGAAG AGGAACAGAAGTGG AAAT 
GGTGGGAAGAAGAGCXSCTATCCTGAAGGCATCAAGTGGA CCCCACCATATGAG CCT 

CTT C CAG AGAATGT CAAGTTTTATT ATG ATGGT AAAGTCATGAAG CTG AG CC CC AAAG CAG AGG AAGT AGCT ACG TT CTTTGCAAA 
AATGCTCGACCATGAATATACTACCAAGGAAATATTTAGGAAAAATTTCTTTAAAGA 

2 0 AGAATATTATCACCAACCTAAGCAAATGTGATTTTACCCAC^TGAGC 

AGCAAGGAAGAGAAACTGAAAATCAAAGAGGAGAATGAAAAATTACTGAA^ 
GAGGATTGCrAACTTCAAGATAGAGCCTCCreGACrrilCCGTGGCCGCGGCAACCA 
TCATGCCCGAGGATATAATCATCAACTGTAGCAAAGATGCCAAGGTTCCTTCTCCTCCTCCAGGACATAA 
CATGATAACAAGGTTACTTGGCTGGTTTCCTGGACAGAGAACATCCAAGGTTCCATTAAATACATC^ 

2 5 AATCAAGGGTGAGAAGGACTGGCAGAAATACGAGACTGCTCGGCGGCTGAAAAAATGTGTGG^ 

AAGACTGGAAGTCCAAAGAGATGAAAGTCCGGCAGAGAGCTGTAGCCCTGTACTTCATCGACAAGCT 
GAAAAGGAGGAAGGAGAAACAGCGGACACTGTGGGCTGCTGCTCACTTCGTGTC 
TCAGGAATATGTGGTAGAGTTTGACTTCCTCGGGAAGGACTCCATCAGATACTATAACAAGGT 
AGAACCTACAACTATTTATGGAGAACAAGCAGCCCGAGGATGAT^ 

3 0 CAGG ATCTCATGGAGGGCTTGACAGCCAAGGT ATTC CGT AC AT ACAATG C CT CCAT CACG CT ACAG CAGCAG CT AAAAGAACTG AC 

AGCCCCGGATGAGAACATCCCAGCGAAGATCCTTTCTTATAACCGTGCC^TCGAGCTGTTGCAATT 

CACCACCAAAAACTTTTGAGAAGTCTATGATGAACTTGCAAACTAAGATT 

GACCTGAAAAGTGCTAAGGCTC^TGCCAAGGTCATGAAGGATGCAAAGAC^ 

G AG ACTGGAGG AACAGTTGATGAAG CTGG AAGTTCAAG C CACAG ACCG AG AGGAAAAT AAACAG ATTGCCCTGGGAACCTCCAAAC 
• 35 TCAATTATCTGGACCCTAGGATCACAGTGGCTTGGTGCAAGAAGTGGGGTGTCCCA 
GAGAAGTTTGCCTGGGCCATTGACATGGCTGATGAAGACTATGAGTTTTAGCCA 
AGTGTGGTTTGGGAAAGATGGATAAACTGAGCCTCACTTGCCCTCCT 

TCTTTGCGAAAAGATAAACCTGGAGATATTATAAGGGAGAGCTGAGCCAGTTGTCCTATC^ACAACTTATT^ 
TATCAAAATTCTAGCTGTATG Al " lTGl w r iTGAATTTTGTTTTT A 

4 0 CAGGCAAATTCACTGTTTCACTGAAATGTTTGGAT^ 

TAGGGTTTTGCAATAACTTCrATATTTTAATAGAAATAAATTCCT^ 
AGTAGAACAAAAAACCCAGCGCACCTGTTAGAGTCGTCACTCTCTATT^ 
TGGCTTTGGCAGTGTTI^GGTTCAGACACCTGTTCACAGAA^ 
AATGTGAATTrTAriTCTTTTTAATTATTTTAAAATATTTAAACCTTTTTCTTC 

4 5 GGGATGAAGGGCGAGTGAATCTAAGGATAATGAAATAATCAGTGACTGAAACCATTTTCCC^ 

TGTACCCTTTAAGATATCCATCTTTTTCTTTTTAACCCTAATCTTTC^ 
ATAAACTTAGAGGAAAATGAGTATTTGGTCCAAAAAAAGGAAAAATAATCAAGATTTTAGGG 
TGTAAAAAATGGAAAAAAACATAAAAAGCAGAATTTTAATGTGAAGA 
GTTTAGTGTGTGTGCAGAGTCCATTTCCCACATCTTTCCTCAAGTATCT^ 
GGTTATTTAAAAATAAATTCCATCAACTTAATGGAAACTTAA

HUMAN SEQUENCE - CODING 

ATGAGTGGGGACCACCTCCACAACGATTCCCAGATCGAAGCGGATTTCCGATTGAATGATTCTCATAAAC^ 

AG AT CGAGAACACCGGCACAAAGAACAC AAGAAGGAG AAGG ACCG^3G AAAAG T CCAAG CAT AG C AAC AG TG AACAT AAAG ATTCTG 

5 5 AAAAGAAACACAAAGAGAAGGAGAAGACCAAACACAAAGATCX3 

GAAAAACGAAAAGAGGAAAAGGTTCGAGCCTCTGGGGATGCAAAAATAAAGAAGGAGAAGGAAAATGGCT^ 
AATTAAAGATGAACCTGAAGATGATGGCTArrTTGTTCCTCCTAAAG 

ATGCTGATTATAAACCTAAGAAAATTAAAACAGAAGATACCAAGAAGGAGAAGAAAAGAAAA 
TTGAAAAAACCCAAGAATAAAGATAAAGATAAAAAAGTTCCTGAGCCAGATAACAAGAAAAAGAAG 

6 0 GAAGTGGAAATGGTGGGAAGAAGAGCGCTATCCTGAAGGCATCAAGTGGAAATTCCTAGAACATAA^ 

CATATCOVGCCTCTTCCAGAGAATGTCAAGTTTTATTATGATGGTAAAGTCATGAAGCTGAGCC 

TTCITTGCAAAAATGCTCGACCATGAATATACTACCAAGGAAATATTTAGGA 

TAATGAAGAGAAGAATATTATCACCAACCTAAGCAAATGTGATTTTACCCAGATGAGCCAGTATT^ 

GGAAACAGATG AG CAAC^AAGAG AAACTGAAAATCAAAG AGG AG AATG AAAAATT ACTGAAAGAAT ATGGATTCTGT ATT ATGG AT 

6 5 AACCACAAAGAGAGGATTGCTAACTTCAAGATAGAGCCTCCTGGACTT^ 

AAGAAGTCCGGCATGATAACAAGGTTACTTGGCTGGTTTCCTGGACAGAGAACATCCAAG^ 

cctagttcacgaatcaagggtgagaaggactggcagaaatacgagactgctcg^ 

ccagtatcgagaag actggaagtccaaagagatgaaagtccgg cagagagctgtagccctgtacttcatcgacaag cttg ct ctg a 

7 0 gagcaggcaatgaaaaggaggaaggag aaacag cggacactgtgggctg ctg ctcacttcgtgtggag cacatcaatct acaccca 

gagttggatggtcac<;aatatgtggtagagtttgacitcctcgggaaggact 
acgagtttttaagaacctacaactatttatggagaacaagcag 
ataagcatcttolgk3atctcatcgag<3gctt^ 
aaagaactcacagccccggatnagaacatcccagcgaagatccm 
7 5 ccatcagagggcaccaccaaaaacttttgagaagtctatgatgaacttgc^ 






WO 03/008583 PCT/US01/51291 

ATGCCCCMAGAGACCTGAAAAGTGCTAAGGCTGATGC 

AAGGCTGTT CAGAGACTGGAGG AACAGTTG ATGAAG CTGG AAGTTCAAG CCACAGACCGAGAGGAAAAT AAACAG ATTG C C CTGGG 

AACCTCCAAACTCAATTATCTGGACCCTAGGATCACAG 

AAACCCAG CGGGAGAAGTTTGCCTGGGCCATTGACATGGCTGATCAAGACT 
Table 44

MOUSE NOMENCLATURE 
ICSGNM Dusp10
Celera mCG15978 


HUMAN NOMENCLATURE 
HGNC DUSP10 
Celera hCG25031

MOUSE SEQUENCE - GENOMIC

GAGAGAGATCAGTTCTTTGTCTCAAGCAGACCTTTGG 

TCAAAGT CTGTCAATTCAAATGTTAAT CTCATCCCCAAACAT C CAC ACACAG AAGCAAG CTGAG TCACAT CTG CT CACACATCTGC 

CCCTGGCGGGCCTGTTAACACCGTCTGGGGGGAAGATGGCACTGGCTTCAACTG^ 

CrGCAGTGrrrGATTTTGGACTTCTAGGTTGGGGATAGAGTGTCTCTGA^ 
1 5 TCAATTTCCCTAGAGGTTTCTGGCCGTTTTTGTCT 

ACACTTGCCCTAAAAACCCACCTTCCTGAGTGTCTGTTGAGTACCATCG CACTGAGTGTCAGAT ACAAG AATG CGT AACCACCAAG 

GACTCAATAGACCGGTCCATCCTTCrCCAGGCCCCGTGATAGTCAGTTTGACTCAAGACAGATC^ 

ACTTGTATQ3ACTCCTTGCCTTCTGCATTCTCACATATATCCTTT 

CCAGTGCTAGTCAGCTTTCCTTTGTTCCTGTGACTTGATGCCTGAGA^ 
2 0 TTGACTCTACCTTGTTCCTGGGCCTGCGGTTACCTCA^ 

GGJUVCTGTCACAAAACACTTCTTTT AATGGC CCACT CCT AGT AACCT ACAGACTC CAACT AGACTCCACAC CAACAAG TTT CTC C C 

ACCTTCAGGCAATACCTTAAACTTCTAAGGAATAAGAGAGTTAATCTGGATAA 

ClAAAGCTCrrGGCCCTCCAAGATGAGAAAACAAAAGAAA^ 

AGCTAAGGACAACATCCACAAAATCACACAGAGGATGAACTACACAGCACAGGATGCTCGCGGGCAGTTGGTCT^ 

2 5 ACTTGCTGGTTCTGCTGTTTTCTGGCTGAATGCGC 

GCTACTTGATCTCTGATGCTAAGGTGGAGATTTAAATGTC^ 

GGAAGTCCCATCAAACTTCTTCTCCACACCCTAGATCTTTAGAATC^ CTTAGTCAGGGTTTCT ATTCCTG CACAAACA 

TCATG ACCAAGAAG CAAGTTGGGG AGGAAAGGGTTT ATTCGG CTT ACACTT C CAT ACTGCTG TTCAT CACCAAGG AAG TCAGG A CT 
GGAACTCAAGCAGGTCAGAAAGCAGGAGCCGATGCAGAGGCCATGGAGGGATGT^ 

3 0 GCCTGCTCTCTTATAGAACCCAAGACTACC^GCCCAGAGATGGTCCC^ 

AAATGCCTTAGAGTTGGATCTCATGGAGGCATTTCCTCAACTGAAGCTCC^ 

CZAGAACT AG CCAG T ACAGGTAGGTTG ATAACTTG CTTGT AAAG AC AACAG AG CACATTG AGG CAT ACTGGGACATCCT AAG AG TCT 
GCTATATCrTGCATCCOVAGCAAGAGTTGCrrAAGATTACATTTGCAAATTTGGAG 

CCCTAGTACACATCAAGTCCCTTCAGACAAGATG CATCCTCCCCCACTG AGGTCAGACAAAGCAGTC CATTTAGG AG CACAGG ATC 

3 5 CATAGGC!AGGCTG^CTGGTAGGCAGGCAACAGACrAAGAGACAACCTCT 

CTGCTACATATGTGCAGAGGGCCTAGGTCAAGTCAGTGGTCACACTCT 
GTTAGTTGACTCTGTTGGTCTTCCTGTAGAGTCCCTGTCCTTTT 
TATTGTTTGGCTGTGGGTCTCTACATCTCTTTCATCTCTTTTTGGCTGCT 
CTATCTGCAAGCATAACTGAGTATCATTAGTAGTGTCAGGGACT^ 

4 0 GGTTGGGT ATTCCTTCAGTCTCTGCT CTAT CTTTGTCACTG CAT ATCTTGTAGACAGGACACATTTTG^GTCAAAGGTTTTGTGGG 

TGGGTTGCTGTCCTTCTCCCTCCACTGAGAGTCTTGTCTAGCT 

AGTTTCAGTTAGAGTCAGCCCCATAGACCTTCTOTAGCCrCTCCCCATCCCAGGTCTCTGGTATGTC 

CCCCCCACC CTTCCTTCTCCCTCTCCTTTG CTCCACTTCCTGTTCGCT CATTTGTGTATTTATGGTTAACTATGTGAGCTCTCCAC 
TTTCTGTTCCTGCTAAGCCTGTGGCTTGCTrCCATGCCTCTCT^ 

4 5 AATGAACTCCACrCTTCrGTAAGTTCCCTCCGTCATGCTGTTGGAT 

GTGTGTACACATGTTCACATATGGGTATACTCATTCCTCAGG 

AAGCCCAGAGGAGGGCAGTGGATTCCCCTGGGACTGAAGTTAGGATGGTTGTAAGCCACACT 
GTTCTCTGGAAGAGCAGCCAGAGTTCTTCCTCTTTGGTCTCT 

5 0 GGGTCTCTCACTATCTCTCCIAGCTCACTGCTTCAGTTAGACAC^ 

CTTTACACAGTAAGTGCTTTCACCCACTGAG CCATTTCCT CAG CT CTCAATT ACGATTCIT AATTrrrriTl"ri' AAAAAGAAT ATGT 

GTGTCTTTGTGGGAGTATGCGCATGTGAGTAAAGGTGGTCTTGGAGGTCAGAAGAAGGTAC 

TTTGGTCATGAGCTAATGAAAGTAAATAAAACTTGAGACATGTGATTCATGTAATTTATGTCAAA 

AG CTTTAAAACCTGGGGCTGAAAACAT AG CAGAACAGGCCAGGACATG CCCGGG CAGGCC CGTTGTT ACATGT CCTG ACTGG CCT A 

5 5 GTGCCTCCCTATCTCCTGCCCTTCTGACAGTCTGTTGATGTTTAAATGGACCA^ 

CCCACCCCTTTTCTATAAAATCCCCTAGCTTCCAAG^ CTCGTGGTCG AAACCACTGTCTCCTG CGTGAG ATACGTTTCGAGCCAG A 

GCTCCGCCATTATGGCrCCACCATGTGGTCCACACCTCTGTCTCCTGCGGGAGATATGTCTCAGC 

ACCTC!ATGCTTTTACATCAAGATGGTCGTCTGTTCGTGATTCTTGGGTGCATGCCG 

GGTTCTTTCACT AC CACGTGTAGGTGCTAGGGA CAG AACTCAGGCCCT CTG C AAG AACAG CAAG CAT ACT^ 

6 0 TGCTTCAGTTCCCTCATCTATGTTTTATGTAAACCTTATACATATAACCTTAAAGATTTATT^ 

CTTCAGACACACCAGAAGIAGGX3AATCAGATTCCATTACAGATGGTTGTGAGCCAAC 

TCTGGAAGAG CAGTCAGTGAGTGCTTTTAACCGCCAAGCCAT CT CT ATTTTTTT AAT ACTTTTT ATT AGATATTTTCCTCATTTAC 

ATTTCCAATGCTATCTCAAAAGTCCCCCATACCCTCCTTCCCACTCCTCTACCCA 

CTGTACTGGGGCATATAAAGTTTGCAAGTCCAATGGGCCTCTCTTTCC^ 

6 5 GCTAGAGTCAAGGGCTCCGGGGTACTGGTTAGTTCATGATGTTGTTCTACCTATAG 

TTTCTCTAGCTCCTCCATTGGGGGCCCTCTGATCCATCCAATAGCTGACTGTC 

TAGTCTCACAAGAGAGAGCTCTATCTGGGTCCTTTCAGCAAAATCTTGCTAGTATAT 

ATGGGATGGAACCCTGGATATGGCAGGCTCTAGATGGTCCATCCTT^ 

GGGTGTTTTGTTCCCAATTCTAAGGAGGGGCAAAGTGGTCTTTGTTCTT CTTGAGTTTCATG CG'IT 1' AGCAAATTGTATCTTATAT 

7 0 CTTGGGTATCCrAAGTTTCTGGGCTAATATCCACTTAT 

CAGGATGATACCCTCCAGGTCCATCCATTTGCCTAGGAATTTCATAAACTCATTG 
AAATGTACCACATTTTCTGTATCCATTCCTCTGTTGAGGGGCATC 
ATGAACATAGTGGAGCATGTGTCC'1''1'C'1TACCGGTTAGAACATCTTCTGG 
CTGCATTTAGAGATTTCCATTGGTAACATGATATTGGTG 
T AAGAT CACCTG CAG CTC CG AGC CAGACCTTGG CAGTG ACTT CT AAGG TT G AAA CTCAGTGGTTC GG CTTGTTGGGGGGG CAGG AG
GAATGTGGACAGCGTTTAGTGTGGCAACACATCGTACTCAT^^ 
ATCAAGACCGTTGGGCCTACCAGAATGCTCTTCTGACAAAGAGCAGC^ 
GCAGGAGGAAAGACAGCnTCAGAGAAAGGGTCTTGCTG^ 
TCTGTGCC^CCAATGGAGGTCAGGCTTGAGTTT^




TACAGCTGGTCrCCACTCCCAGTCACAAGGCCCCACTTCCTTCTAGT^ 

GTAGAGTGGTTATCTTCAACATCCAAGTGGCTTCAGATGAAAGCAGCra 
1 0 AGTAAATAAATCATCTTAGOCGCTCTCCTAGTCACATGTGAACTAAAAGATGAGTGAAA^ 

TACTATATCAAAAAACATAAGCAACACAGACATAATATCCTTTATGTC 

TGATGTGTCTTGGTTCCAACCAATCACATTTCAGGTGTTCAAAGTCACATGT 

CCTTACAGGTAGTAGTGAAGATAAACACTCCnX3CCTCTAGGGCTACAACTTACT 

TTCAGTCTCAGCAGTTAAAAACAGAACTTCACTTCCCAGAGTTCTTGAGTCTGAGG 
1 5 GCCTCTGCTTCCAAGATGCACTGTCACCGTGTAACTGGGAAGAAATG^ 

CCCTTCCCAGTTCC^TCTGCAAAGGCCCTGCCCTCATGGCATTACCACCTCAGACA 

GGACTGAAGTTCAACATGCGAGTGGACCTTCGTGGCCTCTTTCTCCAGG^ 

TGAAGTATTTCCCAGGCGTACrCCTGGGACTTCATCTCCTACTTCCTCC"i i^CTATTCCCCAGTGCCTTCCCTATTTGCTGATTGA 

2 0 AAGACCTCACCATTCTGCCACCAAGCACCT 

CAC7UVTT AGCTTTGTCCIT CAGCTCTAACACACAGC^ CAGTAG CAACAAAG T CAC 

AGACCAGCATAAGTGCACTACX3TGGCCCAAGGGCAGTAAA 
CCTGGGTATCCGATTGTGTTCATGGCCCITTGCTGACTGACTGATC 
TTTAC^TTATGACTCCAAATTAGTAATTGAAATCACCTCCCTTGXSCACAAAAGCAT 

2 5 GAATCTGACTTCTTTGAGTATAGCTACrCAAACTGCTGCCT 

ATTACCATGCATATGTTAATGACATGGAAATTACTCAATGCTTCGGGTT^ 
CAGCCTGGATGACAGACTTTTAGTTAACAGAATTACTATAACTTATTAGTC^ 
ATTTCTTTCrrCACATTGCCGGCATCAGGCTTTGTGAATCACAATTTACAGGTTG 
AAAGCAAACACAACCTTCCCAAAGCCTTCGGTCAATAGAGAACCTCGCTCTACTATAC 

3 0 TCTTTTT C GGAG CAAGG ACGG G AGTTTG CTGTTT ACAAATGT ACACAGTG ATG AACACAG CAAATGTGG CTGGACAG ACAAG CATC 

AGACCTAATACCTAGAACTCAAACCGTAAATGGGAACCCCAGCATACATAC^ 
AAAGTGCTCTGGGAAATTCTTTTCTGGGTTCCACGGAAAATGCGCACAGCCT 
JAGATCTAAGGACCAAAGGGTGCGACCTGGACACTCGTGTGCAGACATTCGGGTGT 
ATGGGGCTAAAGAAAATCCTTTTCCAACCCCAGTGTGCACTT 

3 5 AGCTAGTCAGCGAAGGTGATGATGGTTGGACACCTTATGATCCTAGATCAACAG CAATGGTG AATGTCTGGGG C CT AAAGCAAACG 

CCTCCCATCATTAAGGAC^TCrGAACCTGCCACGGAGCTGATAATGTTTTATTCCACT 

TCTGAGGGGGTAGGGAC^TATAAAAAGCTCTATTGCTTTAGGATTCTTTAACGCAA 

GGTCGCTAGC^GGCTTGCTC^GCTTCCCACTTCTCTTCACCGGTG 

CGG GG AATCGGG AG AG CTGG CGTGGG CCCTTCCATG CT C CT CAAG CG C CG CT C ATTCTGG CTG TT CAACCCGAG CT CTGGCT CCTT 
40 CCCGCCGGCGCTTGCTGTTCCAGGCTGGCGCTCCTTGGCTCGCCGAGGGCCCT^ 

GGGGGCGGATGGACTGCC^CTCCAAAACATCGCACAC^GGGAGGTGGCCGCGGGAGAAGCGGGGA 
TGCCTGX!K3CrGCGX3GX3CTTGTG£^ 

CGGCCCCCAGTGGCTCCTCTCACAGAGCCAGGCGTCCTCCTCCGAGTAGCTCTCCGGAGTGCCGCCTT 
CCTGAGAACAGGTTGGG<5ACGGCGGAGGAGTGACAGGGTCACGGAAGTAGGGGAAC^ 

4 5 GAGCAGTAGGGCAGCCGGAGGGGCAGCGACATCTGCGGGCCGGGCCAAGCGGGGCTGC 

TCCCGCGCGGCGCAGCCACAGCCATGGAGGGACCCGGCTTCGAGTCGTCT 
AGTCAAAAGGCGGAATGGAGACCCCGCGAGGCTGATGTCCTCAACAGGGGGCGGG 
CGACCAGAGCAAGCCATTGACACTTCAGTCCCAGGAAGGGTTAACCGCGTCCTGCT 
TGCTCCAAGCGGGCCCGGGCTAGCACTTTTAAGTTTT^ 

5 0 CCACTAGCX^TCCGAGCCC^CCTCCCTCCCAAGGCGAGGCCTCGCCCCTTTCTCCGCCCTCCT 

CA.CTGTGATGTAGCGGGCGGCAGCCTCTGGGTCTTCATTGGCGTATCGCGCCGTCGCTCGGAGGCAACCCC 
TACATTTACGCTGTAGCAAAAGAGAAAGTAACTCTCTGTTGCTGCT 

AGCGC!ATAGCTGGAGCG<^CGCCAGATGGTAGAGGGCTACGCTTATTGATGAAGTAAGTGGAA 

5 5 AGAGAGACAGAGAGACAC^GAGAC^AGAGACAGAGACAGTGTCAGAGAGACAGAGACGGACCTACGTCAGGTAAGGAAGATCCT 

AGGTTT AGTCTTTTGG CTCTG CTTG C CCTGGG ATG AAGAGGGG AAATG AG TTTTGG TTTGG TT AAG ATG CTTT ATGGCAGTTTT CA 
AAGCCCAAGGCTTTGCTCATGATGCTCAGTGGTGGCACGGAATAGGGATGCTACGTGCCCCAG 
AGCACTTGGTTTTTCTTCAGTCCCTGTTAGTGGTGGAAGGAAGATT 
AGGTTT CCCGGGCCAGACTGGCTITGGAGCTATCrG CTG CAAGG ATG CAG A 

6 0 CACAGTGAAGAGGTACCGGTCCCAGCCTGTGTCTCTGGAAAGTCXjTTCCCGGGTTCrCCT 

ATGTCCCCTTACCAGGTAGCAAAATAGCAAAGACTTTGACTGCATTCATGTTCCAGCTGGGACACAGT^ 
TACTCGCTCTGAAAAAAAAAATACTACGCAAGGGAGAGGGAAGGAGGGAGGGAGGGGAGGGGAGGGGA 
(^GAAGCTTTGATCCTTTGCTCCCAACCAAAGGTGGGTGAGAGAAACCC 
CTCCTGGGTGCGCGTGTGTGGATTTTGCTGGTGACACTTTGGCTCGGTCT 

6 5 CTTGATGTTTTTCCAGTCAGCTCTTAAAGTTCGTTTAG^ 

ATTGCTATGTTATAGGGTC^CCTTGTGCCTACCGGG 
CCCGTGGGCTCTAGGTCTX^GCTTCCCAATTAGCCTAACTG 

TCCAAGAAGAATATCAGTATAGGGTTGAAACATT TTT TACAAAGTTGCCACTCCAGTGGAGATGCAAAGTCTGTCTGTATGACCAG 
ATGAAACACATGCTCTTTTTTGCTTAGGGGCACAGCATAGAGA^^ 

7 0 AGCCTC^GGGTGC^TTCCTAGATCTGACTCCTTGGA 

TGTGTGTGTCTGTGTGTGTGTGGTAATGGCTTGATTTGATTTC 
TCAAAGGACCCACCAGAATGGAAAGGAAGATTAGCTCCTAGCCATGGAG^ 
AGGTTAACATCCAGCCCCCATGTAAATATAGCTAGCTGCCAAGCATCTCCCT 
TCrCTTTGCCTTGGCrCTGGGOAAAGAJ^ 
CTCTGCACAGGAGAGGAGGCTCAGAATAAATCTAAG
CACTAAGGCTGAACTTGGATGCATGTCTCTAACATTATGTGAC

TTTG ACTT AGCAT CAG AATGGATGGGG CCC CTCAGGT ATG CATC CTCTCTG AACAG ATCAAAG TGAATTTTAG CCTTTGAT AGTAT 
CACAACCT AGGG CACCTAGAAGG ACTTTGACTTTGCTCAG AG AT ATG CAGAGGTCTTCCCCC<5GGGATTGGATGAGACTGGCTTTG 
TTTTCATGTXSACTTTATTTTTTTCTTTCTrCCCCT^ 
CTCCATCTCCTTTAGACXlACAGGGTAGTAGTGGCACTATCrAGGCCTC 



GT ATATGCCCTCATCCAGCGGCTCTGCCCGCTCCCTGAATTCCGG ATGCAGCAG CACT AGCTG CTGCACTGTGGCAACCTACGACA 
AGGATCATCAGGCCCAGACTCAAGCCATTGCTGCAGGCACTG 

1 0 AGCCAGt^TCAAGATCATCTACCCCAATGACCTGGCAAAGAAG 

TCATCATCGATTGCAGGCCTTTCATGGAGTACAACAAGAGTCACATCC^ 
CGGCGAAGGCTGCAGCAGGGCAAGATCACTGTCTTAGACT^ 

CAAAG AAATTAT AGTTT ATGATG AG AATACCAATGAG C CG AG C CGTG TG ACAC GftTCCCAGCCACITCACAT AG TCCT CGAGT CCC 

TCAAGAGAGAAGGCAAAGAACCCCTGGTATTGAAAGGTAATGACTCTCCCT^ 
1 5 AAAT AATGTCCTGATCAGCTAGGTTT CC CTATGAGATTCTCTTACAAGTGG CAG CCTT CTTCT AACCCCAAGGCAC CATGAACCAA 

AATAAAATAGCCTGTCTTCTTGTCTCTACTrCGAACTTTTr 

ATATAAGATCTGAGTCCCAGCTTTGATCTGGCTGCCACTTTGGGTTGCCT 

TTGTACAACCAAGATGGTATCTCAAGCCATGCCTTCTCTGGTAGCTCTGTTT 

TCAAGGCCTGACAAATGTCGATGGATCAGAGGAAGCTGCGTCTACAGG 
2 0 CCTGATTGTCTTGATGTTCTTATTGCTTCTAGTCATCT^ 

TTCTCCAAATGCATCCATTCCTCTGAGGAAGCTGCAGGGGATAGAGATGTGAACCGCCTACA^ 

CAAGAGTT AGCTTCTG T CT CTT CC CTAGAAGGAGTCAT ACATCATTTTC C C CTAG TG T AGAAT AACTG ATGACTGGACACCCC C CC 

CCCCCAAAAAAAGGAATAGATGACCCAGTCTTCTCTGAGCTTCTAACATAC^ 

CATCTTTGTCC1TTTGGGCTTTGAGCATTACT 

2 5 AATTTTGAGCCATTGTGTG CATGGTGGGTGGGGTGGGGTGGGGTGG AGTC CAG AAAAAAT AAAGGAC CTGGGTAGTCATCTGTCCA 

CCTCACTGAAATACTGACTCTTTTCTAAACATTGA 

TGTC^GATCACTGTATGGAACAATGCTACTTAACAATCCTCATTAAAAAAA 
AAATTATTTTGTATTATAATTATGTTGAGTTTT^ 

AACTATTTAATTTATCTTCCCTTC^CCCCATCCCTTGCCCCCACCCCACGTGAACATCGTCAT^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

AG CCTGTTGCACTTCCCTTGGTCC CATCTTGTCTTT CTTTTCACAG CTTG AGAAAACACCATCT ACAAATCCCCACACCTGTAACCA 
ATAGCTTTGTAGGCATCTTTTCTGCCTCCTGAAGATAGGATAGGTTTGGTA 
AGTAAGCAGACAACGTAGTTTATGATGGCTCTAAACTCXjAACACAGGGCTGATTACGATGC^ 
TGGAGCTTGGTGGGTAGAGGGATGGCCCATCAGTTGCTGAGAAATGTCTCTCITGGC 

3 5 TACTCTTCTCCOlTAGGTAC^TCTGAGGATCCCTT3CAGCCTGGG 

GGGCTGCTGACCTCACCACC^GGCGCCCTGGGCAGCrGCGCT 

GGAAGACG C!TCTGGTGGTAGAGGTTG<^TT ATTTGTAGTTTTCGGGGGGGTTGTT CCTT 
• ACAT AGTTCAGGAGTAAAAGGTCTGGTCX3TCTACTTTGTTGTCATT ATT AAGTGGTACC^ TG AGGAATGG CCTAAAATCAGG A 
TCAAGTTAGGGAAGAGCTAAAGGAGATGGGACATGGAGTCCCCTGTTAGCCTCTTCTC^ 

4 0 GCCCTTGAGGTAGAGATGAGAGGTGCrTTTCrCTCAACAAGTATGGTGTGCAG 

CATCCTGGGCTGGCTTCTCTTCTGAGGTTTATTTTAGGATT 

GGTACCATTTTGGAGTGAGTGAATTTTGTAGGGAGCCTGTGTCTCCT 

AAGTCCCCAGGCCTTCTTGGTTTATTTTCAGAGATTTTTTT^ 

TTTGTAAACAC CGGTTGCT AC C CTC C CGTT CCAACTTTT ATG TC C CTCAGTTTCAATG CCACGGTCTC CA'l'T TI'TGG AG CTGCCTG 

4 5 TCGGGCCCGGTGGCGGCTGGGTTGCCATTTTAGTTAA^ 

ATTAAGTTACTGTTCTAATTTATAGTCCCCGCCCCGCTGTGACCTA 

GAAGGATATAGAGCACCAG^CCCITGTTTCrGGTCAACCTCCTCTGAGTGAGCCACATGAAT 

AAAGGATTGTAACCTTTCCCAACCAGTGTTGATTGTTGTGATCATCTGTAACTTGCT 

CCTTCTAAGACGAGAGGTGCrGTTCTGTTGTCACATCTGTCCTTr 

5 0 ATGTGGGAAGGGCATGTTTTCTAC^^CTCTTTCTTGGTAGCCGTG 

TTTATTCCCAACCAGATTAAGTATATGCTTCACCTATTGGCCTTCCCTTCCTTGT^ 

CTACTGTTTAGAGTTAGTGTTATATATTTAATTGAGCTGGGAGTATTTTACTAGAGTGGTG 

CAAGTTAAACATGTAGTGTCATATACCTGCCCAGGAGTGTTCTCCAAATGTGGAATGCCA 

5 5 ACTTGCTGCATGCTCTCACATGCAATGGTAGACTGTCTGCTCT 

GTTGTGGGGGT AGGGG TAAGG ATGAAAAAGACGGT CAGGGAAG TAG CTTTG AAAGGTTGATG CAG AGGGG AGAAG T CAGG CAAAGG 
AGTGAGATTTACACTAAAAACTTCCCCCTGATCTCCACTGTCCAT^ 
GTTGAACTATTTTCACTGTAAAGACGGCATGAAAGTAAAAATATtSTAAGTT 
CTCTTAATTCCCGAATAAAGAGCAAATGTGGTCTAACAGTTTCT 

6 0 CCGACTCTCACGACAG CGAATCGCTACGCCAGGJUU^CTAG AGACCCTGGC AAAGTCTCCTTTGTCTG CTTTCTTCTGGGGGAGTAA 

T AT ATC CGTCTTTG CTTTG TCTGCTCTTGT AAG ACACATCT AC ACTG (ZAAACTT ATTTT ATT CT^ C CAAAT AG A 

TCTX5TAAGAAAAAG(^GAGAAATGTAGGGAGAACTGCAGATCAGCAGGCCTGAG 

CAGGCTGCCTAAGTTTG CCTAAGCCTITAGTTCTCTGGCTTCCTCACTGGTCACAAGAAGG CGCAGCIACrGGTGTTGCACTTGGG A 
TGTTCACTCAGGAATTCAATAAGCGGAGGCTTGCAGAGTGTTCGGCAGAGCACCCTCTCCACCGAG 
6 5 TTGAAATGTGTCAGTCAAAAGGTAGTTTGTCCTCTTTGCTCCCAGGG^ 



AAACATGTTAGGTCAAGGGGCTGAGTAATTTGGGCATACCCTAGAATTATTCATTTGT^ 
ATCTGTCCAGCTAATGTCCTGGATAGGATAACAAAAGTCTAAAACTT^ 
TTGGAAGGAGGATTGTAGTCTGACTATAAAAGGGTAGGTGG AAGGAGG AGGGG TGTGGGCT 
7 0 AGCKK3AGTGTGAGGGAAGAAGAGGAGCTGATGCTTGCTCATTTGAGTGGGT 

CTGCCCTCCTTCCTTCATTGCAATCCAAGTGAAAGACAAAGGGACTTGAGCCCGCCCCTTTCC 

T C CT C^T AC<JT AGCTTTTC CATG ACACT AC AGTG CAT AT AG CCCCT ATGGCTTTG AGATG CTT CG CTG CTT AAACAG AGGGGAAGG 
GCAAATTGGTCTGAACCCAGGACTCTGGATCATTACCAGGTCAGATCrT 

GCTGGGTTGTCAAAG AT C AC TT CAGGAAACTCTTG AC AOTGTRG TGGGT AG TAGTCACGGATTG CAAAC CG AGAAACTGACAGGAA 
7 5 AG CAGTCCTCCTCGTT ACATTTCT AAAGCACATTTCTCCAGG AGGCA CTTAG 
TCTGTTTGTGCCAACCCATCAGGTGGTCTGA1X3AGATCACGAGGGCAGAAGAGGTTCCTC
TTTCTCTATATCTTGTTGGCTGTGATCAGCAGTTATCTCCATTCTCC^GA 
CACGACAGTGCI^AGGGAACTTTATCCACTTTGGTACCAAAtAACTAAC^ 
GGTGACTGTGGAGTTCAGGCCAGCCTGGAACTTGTAACGGTCATCTTTCCTCTGCT 
AGCTTAAGGATAGAAATTCTGATCCTCAGAGCGGTCTTCTAA^
ATTGACACTCTCTGGNNNNNNNNNNNNNNNNNNNNCCAAAAG^ 
GATTTTGAAAAGCTCCCCAAATGATTCTCCTGCACAGTGA 

AAATGAGGCCTGGAGAGGTTTATGACT AATCCAAGGT C ACACG CGTCATTAATTTCAG AGCCTTGGT CCTTTGTGCTAAAATAGGT 
CTTGTTAGGGCTATAGACTGATGTATGTGTGGTACATTATTACCTCAGAGCCT 
1 0 T ACACGT AG TCATTTTGAAT ACT AG T ACCTTTTTGG TTTT CAGGTTTCT ATT CAGAACAACAACAAT AAAAT AGACTT AAGTC AAA 
GGATTTTGGAAAGCCGTCCCTGATGGTCGAGATTTTAC 

TTCTTAGGAACCTAATATTTTAAGGGAGAGCTAAAAAGTAATGCCAAGGTACATCC^ 
TGGAAAGTACTTGCTCTGCAGGTGCAAGAGCCTGGGCTTGATCCCCAGAGCCAC 
GATGCTAACAGCTCGTTGGCAGGTAGCATCATGGAGTACAAACTGGTO 
15 AGTCCCXSTCTGTGTGTGGTGTGTGAAGCACTGTGTGAGTCTGGAGACTGTGTTCTGTC^ 
ATCCCCCCACCCCCACCCCCACCCCCACCCCCAGCTTGTTTAGAGAATGGCTTATC^ 

TG TG CCATTTTT CTCTGCTTT CTTTTTCAAGAAT GTTTCTAGTTTT CT CGACGTTCACT AAGGG C CT ACAGAATTT CTTTCCAGGG 
CTGCAGTAGGCAAAGGAACGATGTATCATTTTGAATGGAAAGGGAAATATCACATC 
GGGCCCCCCTGTGTAGCAGAGTt}GAACTTTGTCATTTGAGCCTCAACT 
2 0 AATTTCACCTCGTGTCCCTTGTtlAGGAAATGGGCAAAATGGGC^ 
GTTTGGGCCTGTTTTGTTTTATCTTTTAATTTTTTAAA 

AACAGTTCTCATTGGTTCCAAAGCCTTTGCTCTTTCCCCTATAGGATGAAATTTCTCTGTATG 
AGAAGATGCTCCAGCCAGAAGCCAGGAAGGGTTTCTTGGTACTCACAGTTTAAA 

GGGTTCGATGGT GTCTG CAG CCAGGAAGCAG AGAGAGATGCTTGCTTT CTCG CTTTTCT CCTGGGGCCCAGCCCCACAGGAACGG C 

2 5 G ATAC CCACATTG CAT CTTTT CAACT T AAC CG AG T CAAG T AATCCTT CACT CG CATG CCTGG AC ACTTTT ATTTC CTTTTG ATTTT 

AGGTCCCTTGATAACCCCTCTTTGGTAAATTTCCATTC^ 

AAAAATT AAAGAGGTGAACAAACAATGAATGG CCATTT ATT^ C CT CTTGTTG ATGG CTGAG C CCCTG C CTTAAAACAAAC 

TTTTATTGTAAAGTAAATGAAGGC^CATTTTGCACT 

TAGATCTTTTCATCCAGAACTCAAGAACAAAATCTCCATGT^ 

3 0 CAGTTAGCAAATCTGTATTCCCCTTTCACCC^CAAGACCCGTATTTTATGTAGC^ 

TAGGAAATCCGCTGTGCCTGCTTCAAACTTACACAGG 

GCTGAAATTCCCAGTGGAAAGCAGGCTAAGGGAAGAAGGTTTTATTCTGGCTTAGGGT 

AAGGCC^TGAGGACGCTTACTCTTGTCTGGGCTCACCAGACTTTCT 

GACCCTTACAACTTTTTGCAGCATTCTTATAGACACATCAGAGGTATGACT 

3 5 GAGAGACCCTACCATAGGTCTGTCTCTTGTCAGCTGGGCATCCAAACACATCCCTAGCAAACC^ 

AAAGTAAAAGTGCCTTTACAAAGAGAAAGTTGGTGTAGATTCAATACATTATCTGACT 
CGCCTCACAGCCCCACCCCAGTCTT'GAATTTCATGGAAAAAC 

T CTTGTCTTACCCAGGGAGTAGAATCCCACTGGGTATCTG CTGCCCACTCCT AAT ATCAAGT ATGTAGAAGAAACACACCTGATCC 
AGGGATGGTGTCAGGCCCATGTGACTTTACCTTGTGTCTC 

4 0 TT CT ATTTT ATTCATTCCATGGAAGGAAT CATTTCACAC C CITTGAAT CAT GG CCCTG AGCTC CACT ACCAAC CACTGCACTAAAT 

AGTACCAGATGTTGCTACTGGAAAGAGAGAAGAAGAAAAAAAAAATCAGATGGGAGGAGGG^ 
GTGGCAGGAGGGAGGCAGCAC^AGGAGCXSCAGTCGTAACCTGGGGATC^ 

AGTTCCAGACTG CTTTCACTGGTTCCCTCTTGAAACCGAGAGC AGACTT CATGGAGG ACAGTC ACTGAAGAAAAACGG ACCATTTT 
CTTC^TTTTGTACTTCCATGCTTATAAAGAAATCACTTTTTTTTTAATGC^ 

4 5 CAATCTGATATTTTAATGCACCATTCCAGTCCTTGACTTCAGGAAAAA 

ATGTCTGCTTGTTGAAGGAGCACCTGGCCCACTAGCGGAGGAC 
TCTAAGCACCTTTCAGGCTGATCACATTGCTATTTAAAGATGGGC 
CCATTAAATACAACTTCCCTTCCCCCAAGTCCTTGGCTGTCTTCAAGTGAAAATTC 
GTCCTTAAAGCTGACTTTGCAAGTGATCT^ 

5 0 AGTTTTGGGTTTTCTGACTGTTTTGTTTTTAGTTTAATTC^ 

GAAGCATTTCCGTACTGTCAGTGTCTTGTGTCATCAGAATACCGGTGGCTTAGAATCAGAGATTCAG^ 
GTGAACCCC^GCATCTTATGATGGAGCTTGTTGTTAGTATATATTTAAATGTCTGGAAC 
TGTATTAATCTATCCCACTCTCCGTAGTGGGAGCTAGATGCAGGAGAAGAAAGGCAGGTGTAAATCT 
AAGATGTGCTTGGTCCCGGTGACTCCATCCATGAGTGCTCACATCTCT 

5 5 TTACACATTTCCGATCCTTCCTGGCTGTCTCGTAGGCCT^ 

CACCATCC^TCTGTTGCCTGGTTGCTGGGTTTTGACCTCTCCTTTCCTGTTCTG 
GGTAGGTAGTTAAGTGACCTCAGGAGCCTGTTATAGGGAGTAGCATGCTCTTAGGCAGGATCA 
AATTCGGAAACTCAAAAGCCTAAC^TATCGCCTCACCCCAAACIATCTCAGGTCCCT 
CCCCCACTCCAAGTCTTCTTTTTTCCATTCTCTGTCTTTTTG 
60 AGGATGTCCTTGTGTTTCGACGACGTCCTGTCTACCGAGGCCCCAGTCATGC^ 

AATCTGTGATTCTGCCTTGCCTTATAACATACTCGTCCTTCCTATTATAAATAGACTrr 
ACTCGGAGACAAGATTCCAGGGC^GATTGATGACTTGGAATTTTGATCTTC 
CCTCAGTTCCCCCAGCTGTGAGAATGGAGATAATGACGATGTTCACGTGGGGTGGGGGCAAGGG 
TAAATAGTTGCTTATTTAAGTACTATAAATGATGTCTC^CCTCT 

6 5 AAATAATTATAGTAATAAGCCATCTACTTCTTCTTCTTTTTTTT^ 

CAGTGAGGTGGACAC^TGACTACCCAGGTCCTTTATTTTTTCTGGATTGGTGCT 

CAAAGAAAACTGCAGAAAATTGAGTGCTAGTAAAAGCTGTTAATATACACTTACT 

AAAAAAAAAAGATGTACGAGCCCTGCCCTGCCCACCCTC^TTTCA 

7 0 GCTT TGCAG AGT CTCAT AT AG AAACTTGAG CTG ATTTTCACTTTGTG CTGCAGGGT CGGTC CCTTG CITC CTGCATGGTT ATT ACA 

GATTTTCTTCTTCAATGCTGTCATTTGTCTAGTGGCCACATTCTGCT 
GGATCrTAAACCCGAGCAAGGTGTGGGGGGGTCCCTG 
CTC<SCTGTCAGCCCGTTGGGAGTTATCTTTTCTGTTC 
GAGTTAAGGGAAGTCCCTGTCCITGACC^ 
GGCTGTGATCCTGGGTTTGTTCITTGGTTGGGGCACGATGA
AGATTAGCCAGGGCTACAAACAATCCACTCTCTGAATCA
GTGTGTGTGTGTGATGTGTATGTGATATATATATAATATATAACA^ 
TCTCACTTTCCACCATGTGGCTCTC^GATCTCACXSTTGCTGAGCT^ 
CCTTGATAAAACACACTAAAACATCAGTACCAACAGAAAAAAGCTAGACCCTC 
TGTCCCATTTTTAGGTCCACTAAGGTCCCCAAAATCTTTAAGCAAACTCAG
ACTCCCACCAGTCCTTTCCTGOU^GGTAAAACACCCTTTAAATAGGA^ 

T AAAGT ACAT ATTGGGTTGAAATT ATTAT AATT AATTTTG CATGTCAT AT AAAAGT CCACTATG CCTGCX3 CTT ATT C C ATGGATCT 

CTTT ATTCI^ AGC CAAGGGAGAGCT AG ACGCCCAGTCCACTG CTTT CGTGG AACT AGGGAGATGCC CAG ATG ATGATG C 

CTGGCAGGCTCCCCACAAGTAGGGGATGTGACTGTGAAGTACCACCATCAAC^ 
1 0 TGTGTGTGTGTG TGTGTGTGTGTGTGTGTGTGTGTGAT AAAGACCAAAAG CCAACATTG CATGTCT ATCTGTACT CTCCT ATAGTT 

T^ATACATGTAAGTATGTNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNI^^ 

NKNNNNNNNNNNNNNNNNNNNNNNNNN^ 
1 5 G CGG TCTCC CCAG CCCAC CTTCCACAT CTT CTAT AAAACG TTTGGAAAACGTTCTG AGGCCTG AGAACC AGATC 

CCAAATCATGACCTTGACATTCATCTCCGAGGGACCTCTGCTTTCC^ 

TTCCTGGCTGGCTGGGACTGACTGGTGGTGGCTGGTGGCTCrrGACATTAAC 

AGGTCCCAGCTCCITCCTTCCTTCAGCTTTTCTTCCTTA^ 

ATGGTGTCTTCCAGGGGTCTTrcCCCATCAAAAGCCATTATCCGTGGTGGCTTTGTT^ 
2 0 GATGTACAAACTTGGGAACAGAACTCTTGTCAGATAATGGGGTGGAGTGTTCT 
GAGACAGCTC^CTTTCTGGTGCAATGCAGCTTGTAGAACCM 

CTTG CACAT ACZAAAC ACATGCT CCTTTCTGTT AACATGG CCAG AT AG T AATG ACTTTGG CACTCAGCAGGGACT CCTTG AATGGGG 

ACATCACCTTTACCTGAGCTGTTGTCTCCTCGTAGCTATGCCCTGCCT 

CATACTTATCAC^TTTCAGAAACCCTTCTTGGG 

2 5 TTATTAGTGTTAATTTTGGCCTGTTTCCCITTACATC 

CATTTCrATCCAGTGAGGCCAATCrTTGTATAATGATAAGTTATC^ 
ACAGAATCACTTGGAGTCACACCGAAGTCGCCTCAAGTCTAGGCTGACT 
TTGATTTTTTTTTTCTGCCCrCTGTCCCTGAGTTGTGGGAGATGATCT 
AGCTGCTCCTGCAGAGCCCCTTGGCTCACACCTGAGTTTATGTTAATGAGATGACT 

3 0 GCCAGCCCTGTGACTGGCCCGGAAGCTTGGAATGTAGCCAGCCTC^ 

GGGGCTGATGATGAATCAAGTCCAGTTGCACCTAGTGAAACATGATCCTGAA^ 
GCATCCTCCTTGCACACAGGGACATGCCAGCAGGGTAAATAGAGTGTCTTC^ 

GAAT ACT AGGG C CT ACGTTGTG CAGCTTT CATTTTG CTGG C C CTGG CTTG CAT CCTTT AT AG TGAAG CCATTAATT AGCACAAACA 
AAAGTGCTTTCCTGAGTTCAGCGAATC^TCCTAGTGGATTCTTGAACCCCGGGTATTATGGGGAAGCAAAGA^ 

3 5 AGTCAGAGGTCTCAGAGGCCTGAGAGACACCACACTTACAACT^ 

AAGAGTCAATCTTGGAGTCXSCTGTCAGAGCTGTGTCAGAGCTGCATTGCAGTA^ 
GGCCCKX3TGGTGAGAGAACACCGTAGCACTTGGTAAGATAGCAAGACTTGTTTCCCT 

GC CTTCACTCTAGTCITACAAAATGATGTGGTCrrGGG AAG ACTG CTTTAGACTGCAAGGCTTCO^GTC^GCZAGGAAGACCTGTG 
ATATTTCTAGGCCCAAGACCTTCCCACCTTTGGAGGCCCCCTCTATTTCAACTC 

4 0 ACATGCrGGCCTCTTTCTTACAAGAAATTCTTGAATATTG^ 

CTAAGTAAGGGATTTGCATTACTTGGTCATTTGCATTGTTAGGATTCATACTTGTGGCG 
GTCATATCOlACAGCGCCTTTTCTGTGGAC!ACCCCTGTTTGGATATTTTAG 
TTTTGCTTTCTCCTTCCTGGCTTAAAATAGACAAGGTCCCCCCACCCC 
CAGACTAGTCCAGTCCAGCAAAGCCTTCTTAGGTCACCAAGCT^ 

4 5 GGAAAATGGGTTTCTAAGAAAAGAAAACTGTCATTACTTGGTGGC 

GAGTTGTGAAATCTCCTGAAATCTGTTTAACCTGGTCTGGCCCTGGGCTCTCTGCACT 
AAGTTTAATGGTGGCGATAGTCCAGCCAAATAACTGACAGTAGCTTTCACTGAGGGTGTCAAGCA^ 
GGTGAGGATGGAATGTCAGCCrCAGTTTCTAAAAGAGATAGAACAGATGCTTAGAACTGACT 
CCCTGAGGTACTTCCAAATGTTTGCCTGTGGACTACAGGGTTCAGAGT^ 

5 0 AGAGAGCGAAGGATGTAAGAATCCCCTTCAGGAGAGAGGAAACTCTCG 

TGTGTGTGTGTGTGTGTGTGTTCCTGTGCTTACCnKSTATATATCCAGCGAGCCCCTTGGAT^ 

CTCCACTTCTTTGATAATGTGAGTTTAAGTGACCrTGAAATGTAAATAACT 

5 5 GGACTCGAGAATGCCTGATTGTAGAGCTGGTATTGTGGGAGAAACAGAGCCACTGGGT 

GATTG TT G AT C CT ATTT AG ATGG AAGGGCTG CTG CC AGGT CT AAGGGCGAAACCAG AC CTTTCTT ACX3 CAT ATG C CTGGG ACAGAG 

CTCTCTATCCCACTGTGAGCACTTCTTGAATGCATGCTGACCT 

GTCCCTCTTGACCATTTTTCCTCACAACTCTTTTTCAT^ 

GTGAGAGGAGGGACAGGCTAGAGGCCGCCAAATGTGACAGATCTTATTCTCAACCTCACCTATGGGCCTATT^ 

6 0 TCCAGCAACTGTTCTTTGCCrCTTCCTCAGCCTGGTAC 

GGTGTCATCCAGGCGAGTCTCCCAGTGTTCTCAGAACATTCTGGCATTGATACTTCGCT^ 
TCATCCCTTCTTTTTCTCTCCAATTAGATTTATCTTTGGGGACTATTTTTAATCT 
ATAATTAATGGCT CTTT CTGG CTCATTCTGATGTTCATCTCATC^ 
CAGCCTGAATGAATACAATTTCTCTGAATAACrCATTTAGTAGTGGTTCATTTA^ 
65 TCT ACTTT ACCCAG AAGT AAACTAGGATCTAAAGATGC CAAACAACCTTCCGGAGCCTGTACCGCAGAGG CGC AGGGCAGCCTGCC 

ACCATTTCCCTGAGCAGACATGCCTTGCCACCTAATAGCACCTTTGC^

ATTAGGTGTTTTCCTCATTTACATTTCAATGGCTATCCCCAAAAATCCCCCATACCCTCCCCCC 

GCCATCTTTTGATACATATGCAGCTAGAGTCAAGAGCTCTGG 

7 0 ATCCCTTTAGCTCCrrGGGTACTTTCTCTAGCTCCATTGGGG^ 

TGTTTT3CTAGGCCCCGGCATAGTCTCACAAGAGTCAGCTATAT 
CAGCGTTTGG AAG CTGATT ATGG AATGGATCCCTGGATATGGTCCATCCT 
TCCATGGGTGTTTTGTTCCCATTTCTAAGGAGGGGCAAAGTGGTCT^ 

tatatcttgggtatcctaagtttctGguctaatat^ 

7 5 jCACTCAGGATGATGCCCTCCIAGGTCCATCCATTTGGCTAG 
TGTGTAAATGTACCACATCTTCTCTATCCATTCCTCT
CTGCTATGAACATAGTGGAGCATGTGTCCTTCTTACn^TTGGAACATCTT 
TCTGGTAGTACTATGTCCAATTTTCTCAGGAACCGTC^ 
GAGGAGTGTTCCTCTTTTTCCACATCCTCTGGACTCCATT^ 
5 AGTGGCAGATGTTCTGATATGTGGTCAAGGAGTTAGCTGAGAGAATACTCATAATATCAGGA 

TTTCCCGACATGGTGATACTGCAAAGTTTGATACCTCATAGGAGGTCAA 
GTGGAAACCTGACAGCAGAGTTGTTATTTAAAAAAAAAATGATGAGAAA^ 

TCTGTC C C C CCTATTTCCCATCCCAGCTG TGATG CCACCTGTCATG ATGTG ACACAG CCAAGGTGACCTTCACCAAAGTT AG CAC C 
1 0 ATACTGTAAGAGCTTTAGTCTCTGAAACTAGGAAGTAAACAAATCTGTCTTTGTATC 
AAACAGATTCCTGC^CCCTCCCCATATAGAATAATCCTGTGGAGGTTTCTTCTACT 
GCATGTCCTAACATTTCACTTGGAAATCATGTAAGATTCTCCCAGGCCAAAGGCT 
TGGTTGCAAGTCCCAGGGTGTCACTGCACTTCTGACCAACTAGCTGGCAATAAAAG 

TTTGCCATGAC AG CTCACAGAAATTGGAGAAACATTTAGTGAATTTGCITAATATAAAGGATATTAT CACATGAACAG 
1 5 GCAGACAGCAG AGCCATGTAAGACGATGTGCAGGTCTTGTGTCATATGTAGG CATGCTATCCCTCCAGTCACTTCCACATGG CCCA 

GCCATCTGGAAACTCACCAAACL"lU^l'l'Cl'lT , rGAGTTTCTATGGT 
TAGGTGAGCAATTCACTTTCATCTTCTCTAGATCAA 

TTCTGGTATCACTTCCATCTGGAAGGTGTT AAGGGCTCCCAGACTGCCACTC AC CTTGTTGG CATACAAACAGTCACTCTTATCTC 
TTTGGAGATTGCCAC^GGTCATCTGGGTGTGCGTTG 
2 0 ACAGAGAGCCTTGGAATAGATCATTGATGTCTCACACC^ 

TGTAGTAGGGCTTTGTTGAGCTTAAGGAGTGCTCGAGGGGTGG^^ 

CCAAGAGGTTGGCAATAAATGTCATAAGCAAGAAGCCGAGTCCCTTTAGTGTGGAAAGAAGAGCTACT 

AGGAGGTCTTCCCCCCGCTGTACCATGTTGGATATGGCCTTCAGGGAATGTGCCTAGGAAGTCCGACGGTGCCA 

AAACAGGGTCAGGTTTCTGTTCTTCCCTCCTGGGCTGCAGGGATGGTTACTG 

2 5 TAATCTATGATCCCACTCCAGACCAGCAGGCTGTATTGGTTGGACCATAATGTGGTCTACCAGATC 

CCGTTACTATATCCTGGCACTACAACAACCACCTCCAGAGGTTTCCAGACCCTTTCTCCT 
CCTGGTTTCAGTTTGACCTTGCAGAGCTCAGCCATTGCGGGTGATrrGACT^ 

CTTCCAGGCAGTTGATGGTTAATATACTTATCGTTGCTAACTATTCTTGCCAAATATAGACTTATAGCT 

GAAG T AGAAGCTGTTG CX3 ATAT ATTTTTTG CGGACTGAAAACATTGG CT AAAG AAAG C ATT C CAAATTGTGATAGG AG CAGTGG CG 

3 0 CTTCTCCCATCTCGCTTCCTGAATT CCCCAGGCCAAAGC CCTTTGACCTGCAG ATCATTACAAAATCCATGGCTGT AAACGCTGTT 

GTTTGAGTGTGGGGTTTCCTTCAAAACAAGTTGAGTGGCTTACGGATT^ 
CCCAACCCCAGTCTCCATTGTAACCCTCGTCTTTTCCrCCCCTCTCCT 
TGTTTTTTTTTTTTTGTTTTTGAGACAGGGTTTTTCT 
GAACTCAGAAATCCGACTGCCTCTGCCTTCCAAGTGCTGGGATTAAAGGT^ 

3 5 CTTTCTCTGAATATAATTTCCAGTGGGAATGCTTCATGGTAGAAGAATGTAGTCTTTTGAAGCTTGC 

TTATTGGATTTTACTTAAATCCCATTGGGAAACATATTTTCGCTGTG 
GGTCCTGGGGTGGAGGGATACATGGATAGTGAAGTAAGTTTTTCTTTTGTGCCATGCGCT 
GCTTAGCTATGTTTGATTATTTAGGCTTCTTCTGCCTGAAAAGTAAAGAATATTTGCCCTG 
TAGAGTATGTGGCTCACTCTTGGGAAAAATATGACTTTGTTTAATAGCACTA 

4 0 GCCATCTC^GTGTGTCCCCCCATCCCTTCCCCTCTTGATTAACCAGAGAATGTAGGGTTTCCCAAACACACA 

AAAACAGTT CTTGG CT CAGAAAG AT AC CTCAGGGTGAGCTTG AG C CAGGT CT TGT CCATAATG AGGTTG AGGG ATTCTTCTGAGAA 
AGGATAACTTTTGTGAGTGCrcCCGGCGAAGAGAAGGTTCTACATGCT 
CACCCCCACTACCATCTCAACCTTGGACCATATCTC^CTT^ 
CTTAGGTAGCCCACAAGTTCTGGCATGACTCTTGAATAGCAGATATTCT 

4 5 AAATAAAATGTAAAGTGGCTCTATATTAGCTTGCTTGGGTTACGTATT'AGTCAGT 

ATACTTTATAAAGAAAATATTGGTTGCTTTCGTGGTTTTAGAAGTCA 

GTG AGGG CTTTGCCATGG ATT ACGTGGATAATATTTCAGCTGTGGAAGT ATGTG CCAAAGAG 

GAGAC ACTCAGGG C CT AG ACCTG CT CT C ATTGG T ACT C ACATGGGTC AT AT AAG AAATGTG TTCAC C C CTT CCAAG C CCAGCAC CT 
GTTGACCTT ACGAT CT CC CACTATG CCC CACCTGTT AAAGGTT CT AACAT CTGT CAACAT CACCAC C CT ACAAGG CTCTT AG TATT 

5 0 TGAACTTTTGATTCCTTGCTCAAAGAACATCCAAACCACAG 

AGGAACTAGCTCTCCCACAGCTAAAGGGCCCAAACTCTAAGACTTGTAAGATTGG^ 
CTCTCACCCGCTCCTTCACCCAACCTATTTCTTGTTGTTTGTGC 

GAATAGGGGAAAGGGGTGAAGCAGGAGGCAGAAGCCAGCTGCAGGTGCATGGGAAGGAGCCCGGGGCT 

ACTGATTTGTG CAAAGCCTAAGCCGGTTCCTGGCACGGG ACT CAAAGATCTT ACAGTTTGAATCCGGTT CACATAAAAGTCAT AAG 
55 CACG AGAAAGCAGG CC CAG AG CG CTTTAATTGTCATCGTTGTGGAAAGTTACATGACTTGTCATCAGCTGTTGCITCTGGCAGTGCT 

TT CTGGACT AATGT AAAT AAATGAG CTAACAGGTTT AAAG AAG CAG C CTGT ATTT AC AT ACTTGTG AAGT AG C CT CAAACAAT CAG 
TCTGTG CTATT CAAACACACACACAAACACACACACAC ACACACAC ACACG C ACG CACACACACG CACAC ACACACACACACACAC 
ACACACACACACACACACACACACACACACACACACGAAATGTGTGGCT 

AAGGTTATGATGAGTAGTGTGAGTAGTGGAAGCCCTGCTTTTCCCCTTCATGGGGTTTTGACCTC 

6 0 CAGGCTCTCCTCAGCAGGCTGAAGTTTAATCTTCAGCCATTTCTATAG 

GCTTCAGAGGAGGGGAGAGTGAAGTGGCAGGGGCAGAAAGGAATCCCACACCCTCTGATAAAACACT 

GAAGTCTTC CAAAGGAAG CT AG ATTTTC ATTTTTTTT AAAAAAAT C AT AAGGTC CTTTTG ACT CAG TTTG ACATTTTGTAGATAAA 
T ATTT AGGCAAG CAT ATTTTGGGTGTGTGTGT AT ATGT AGCT CTG T ATG TG TGTGTG TG TG AAGG TCAAAGGTTGACATTTTG AAC 
ATTTTGTGTCrGATTCAATTACTCCACATTACTGAGACAC^GTCTCTTG 

6 5 TAGCCAGGGAACTCTGGGGACCCTCCTGTCTCTGTCTGCCCAGTGCTGG 

TAGGGAATCCAAACTCAGGCCCTCATGCTTGCAGAGTAAGGACTGTGTAGTGACCAAGCCATCACT 
TTTAAAAGCACATTCCTCTGGAGGGTACTAATGATGCTATCAAAGGAAACAGCT 
CCTCTAACTGCTCGTGGATGAGTCCATTGCACCACAGATTTAGAAAGAA^ 
AGTTACTGGTTTGGAGTCTCTC^GTGCCTTCATAGTCTGGGATAGCTTCT 

7 0 CAGCATGATTCZAGGCAGAGGAGTGGTAGGCACCAGGAGGGAAGCAG 

CCTGTAAGCATCTATGGCCTGTCATATTCTTTTGTCCAACACAAGAATC 
TTCTGGT AT AGGTTTACCAGATTTTTCTCCCTTTACCTCrrCT AAATGTG 
AGAGAAATGACCTTTAGACCGAGGTCCAAAGATAGTTTGTGATGGGTTTTGT1TT 

G CTGG CCT ACAC CTTTCTAT AT.AACTGAGG CTCG CTTTGAAG TCTTGATCCTCCT AC CT C CAGGCT CC CCATTGTGAGGGCAAATA 
7 5 TT AGGTAAACATG C CTGGCCATACATGG CATTTTTGAAGTGG CCT CT ACCT AGGATTTG T CACT AAAGTAGACAGGG AGAGAGAGG 
TTTACTTT ATGTTTT AGTGTTCACAAATX^TTTTCATAAACTC AAAC CAT AGTTCCCT ACTT CTTCTGTGG CTCCTG TGG TACCAT
CTCACCAGGTTCCATGCCCCCTGTGGGAGACCCATGTGCAAGGG^ 
GCTATTAACCTGGAAATATAAACATGAAGTGATAAATTCAAAACTAGAATATTCTA^ 
CATGCATCTGGTTGCTAAAACTTTTACAGAGTAGTTC^ 
5 TAAATACTTTGTATATGTAAATAGTGTATTTTTGTTTACCCTTTCT 

TGGCTCAGGCAGAACTTGAACTCACTCTGTAGCA 

ATTGTAAGTATATGCCACTATACCCAGTTTATGTGCTGTCGGGTATTC 

ACCTGAGCCATCTCCCTACCCtlATCATAATCTTATTTCTGGAATTTCT 
1 0 ATACACACATAC T T C lTTCATATAGTCTGAGaU^ 

CTTTCCTnX3TTTCTCGGAAACAAACAGTCAGACAGA 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNbTOWNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNmW^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

AAACCACTGGGACTCAGCCCCTTCCCCCTCCTTTCCTTTC 

GTC C CTCTT CTCT CTTG CCAAT AT AG CTTCCCACAGGG CTGG ATC CT AAGAT AATTT AAT CC ATG CCT CCTTGTACT AAACTTG AA 
GTCACTTGTACTGTCAAATGGCCACTCTCATCTATGGGGTATGTGGA 

2 0 GCTAGGTCTTATTCTTTCGCCTTATCTTTGTGCCT 

ATAAAATAATATTTATTATATGCCAATAATTCTGTCGTTTGTGAGCTATATTATATAGAACT 

G^TATGTCCAAGTTACTGATATGGGAACTGAGATGCGGGTATACTAAAT^ 

ACCAGACCGCGTTTTACTACAGACTTCCTCCATCCAGTCCCTATATTCACT 

GAAAG CAAGGCTGTCACTAGTCGTCATCAGCATTCTTGCAG CAAGCAGTAAG AGTGCC CTTTTAGAGT AGGCAACACCCCCACCCT 

2 5 CCTGTAGATACTAATTCTATCTGCTGACAGTGTTCTCCTTACACTTCT 

AAGGTTCCTCTGTAAAACACCGGCGTTTCCTCCTTATGGGrTAGGCATGTGAATGTT 
GATCCCAGAAGGCACTGCGGCAGCTAGCAAGGACTGGGGCGGAAGTGTGGCT 
TTCCAGAGGTTGCCATTGAACTCATTtGGTCTCTGCAGATt^ 
TTGTATGTCTAAGGCTTTTCTTTTCTGGTTACTAATAAAGCGTTCATCATGTAC 

3 0 AACTTATTTTGTTCTTAGAATAGTATTGCATAGGCACTATTTTCTACTC^ 

GCCTGCTTAGGATCTCATGGCAAGGGTTTATCAGGGCCGGCTCA 

GTCATGGTGCTGATGAGGAGTATAATGGGGTGGGATGTATCGGATGCCATGTGATGTATCT 

ACCCCAGCCCTGTGCATAGGGTGTATTATGAATGATTATGTATTATGTAAGAC^ 

AACAAGTTCCTCGGAGGCTGAGATCTCACTGGAGGCTGGGCCCATGTGTCTCCACCCATC^ 

3 5 CCAACAATGTGTTAATGATGCTTTTATTGGGGTGTGTGT^ 

TGGCCAGTGCGGTACC AGG AATCTCCCCATCT CTTC CTCCCCAGTG CACAT ATC ACAGACATAT ACTATTGTCCCTTGCnTTCrCA 
GGGACACTGGGGATTQVAACCCTGATCCTCATGCTTGTGAA^ 
TGTAGTATTCTTCn'GTTATCAGGCCAAACAAAATTACCCCCrrCCCCT 
TGACCTCATGAGCTCTACTCTrACCTTCTGCTGCTCCCTVATCCCATACTCAC 

4 0 CTCTGAAATATCCCTGTCAGATACTGCTATTTCACGCIAG 

AGCACTCTIX3TGTGAGGACGGACGCCTGACATGAGCAGACAGAGCTC 

CATATAATAACACACATCATTTCACATAGATAC^CACACAGCACCCCA^ 
TGATCCACCCrrTCCCCCATGTTTTTTCCCCCCACTAGGCZAAAT^ 

4 5 CAGAACACAG1TGTACCCAAGGCCTTAAACAAAGTAGTT 

GTCATTGTGTGTCrTTGTGTGAGTGCTATGGCAACTGGGCAGGT« 
ACTGAATTTTAAAAAGCTCTCTGCTGAGTTGTTrTGTGTCT 
AAACTAGCTGCTTTAAGACAAATTACAGCTCGAACAC^ 
TTAGCTTCCCAGTCCTTCCTCCCATCTTACATGGAAAAAAAA^ 

5 0 CCCCTGGCAGGATCAGCATGGGGCTGTTTACTGGGGGCTTTGTG^ 

TGGAGTTTGCCACAGTGGCAGTGGGTAGTTTTGCACTCTAGTTGTTAAGTGTTCACT 
TTTTCTTTGTAGTTGGAAGGCTTCCTTTGGGCTAGCCACCCGGCTATAATG^ 
TAATTACAGAAATGAATTCAAATGAAGTCAAAGCTCTGACATG 
ACTGGTAGACTAGACACAGGATGGCCCTCCCTACCAAGTGGACTGTGAATTGTTGCT 

5 5 TGTTATACAGCTGGCTGTGTGCATCGGTTCGGGGACACACTGTGTTCAGGTCACCTGCTCT 

TTGAAAGGATGTGGTTTCAGTTCAGAAGCTGGTGGACAAGTGAACTTCCTC 

TTTGTGCTTTTAGCCTTAGGCTTTGGGACACTCAAGACCTCTGTTACT 

CCITTTCTAGAAGACCCCACGTGTCCCACACTCCrT^ 

ATTTGAG CTCCACAAT CCTT CAGGTTTGTTTGTGTAT AAATATGTGT AGCATAAGT AACA CCGTGCATTCCCTTAAC 

6 0 CGTTCITTTTTGTATGGCTAG<3CATTTTTTTTCCAGAAAGT^ 

TTTGATTTTATAGTTATTCCTTTAGTGGGTACATAACATCCCATGGATGTAriT 
TTCTTTCTTTTTAAAAAACirACTGTTGTTGAGTGTTTGTTTG 

GCTAGCTrTCTGGAGCCAGCrCTCAGCTTCCACCATTAGACAGATGGAGTTG CACAG CTGT 

GCTTGTGCGAGTTCCTTGAGTGACTCTGTACTrTATAO 
6 5 TCAAACCTTTAAGAGTTCCAAACTa^AGCCTTGAAGTTAGAGCGGAGCCTTGAGCT 
TCACTGTTAATGGGTTAGTGTACCCACAATCCTCTAAAA 

GTCAAGATCTACAAATGTCAAGGACATGTATGGAAGAAGTGAAAATGTAGATCCTG 



AAGGCTGGGGGTTGGGAATATACTAAACACTTTATCTGTGTTTATCTCCATCTCGG^ 
7 0 CTCAGGTCAGTAGTGCAAGCCGAACACTAGCCTGTGTAACCTGGCTTTTCCACAAGCCCTC 
GACTTGTCCTTCTGTAAAGTGAGTCATAGAGGGGACATGACATGTGTGCACAGTGCCA^ 
GATATCATCCCTGATTTTCTCAGCTCCOVAAGTACGAGT^ 
TCAGTTGATGAAATTACGGGGTGCTGTGTGGAACTTGTTCCCAG^ 

GAAGCTGGCTCrGGGTATCCAAACAGGCTTCTGG CTCTTGTGTTTr ATTTT AGAAATGTTAAAGTACGCACATAG CCATAGTGACA 
7 5 CC CATTTCCATGGGAAAG CCG AC ATTTTTT AG CC CTTCCCCTCT CCT CC C Ui ACTGTGGG CTTCACTCATATGTG ATTTCTGTC TT 
TTCCTTCTAGGGGGACTCACTAGTTTTAAACAG^

GGGGGGTGGTGCATCTGCGGCCTCGAGCATGCTACCTCAGTCrGTCCCC^ 
TCCTGCCCTTCCTGTTCCTCGGCAATGAGCAGGATC 
GTCACCACGCACCTTCCTCTGTACCATTATGAGAAAGGCCTCTTCAACT 
5 CCTGCGGCAGTACTTTGAAGAGGCCTTCGAGTTCATCGGTAACTATCTCTC^ 
CTCTTCCCTCTACITCTGGTTTTGATATAAGGTCTGAA^ 
AACAAAACCTACCTAATGCATAGTCCTTCCTGTTAGTCTCAGAA 
CATCATTGAAACACCTGAAAGGTTGTCTTAAACGGAGGAATTATCTTGACTTATGAT 
TTCTTC^CCTGAGGTGAGACAGAACATGGAAGGGAAGGA^ 
1 0 GCCAACTGTCATCAGTGAGGCCTGT AGAGT ACTGGTCT ACCTCATGGCC CTCGG CAGCAGTGTTTCACTCCTCACTTCCACATACC 

CTGACCCTCCATGGCCATCATCTGGAAGGCACGGTTCCCAAGTAG CTCAGAATAGAG AG CAC CAG AAAG TG TTG AGTTTCAT ATTA 
AAAGCTTTGGATGCCTGTGAGCACTGGACAACAGTTTTTGTGA 

G AATGCAGGTCAATGAT AAGGTAGAGG AGAAGGGAG TAAAG AG ACTTGTCCCTT AG CTTT AATCACC CACACTGGCTG CTTCCCAG 

TAGACAAAGAAACAAACAGGATGTAACTATCTCTCTCACCAGTCT 
1 5 GGAAAGTCTCCCTTTGTGAACACTAAATGACCTCCCACACCAAGGCCGT^ 

CCATGTTCTGTGTCACCCCATGAAGGCTACTGACTTTTTGGCAAATAGGAAGGGAGGGACCGGGAC^ 

ACAGTGTCTGACCTACTTCCTCAGGCAAGCCTCTACCTTCTAACATTTTAGAAACCTT^^ 

AGCATTTGACACATGAGTCCGTGGGACCATTTCACATTCATATTAAAGCAATAATCATCCT 

AGTCCCTATCCTCCATCCTCCCGTTAACAAAGACGCCATGGCGGAGGTTG^ 
2 0 AGAGTGCAGAACTGCTCAGAGCTATTCCTGCTGCCTAGTCTATGTCCTCCAAGCCCTG

CACCTTAATTATTCCTAAACCTTCACATAGCAAATTGK3T 

TGAGTTATAGTGAGTCACAGTGTATGGCTTATTCACTTGAGGTTGAGGCAAAGGCAGCATCGAGAAAGG 
TTCTTCAGAAGCAGGCCATGCTGTGGTAAGTTTCT^ 

TGTCATCTTCCACCCAAGTGATCTCAGTTCTTAAACACTGAGTGACAAGGACTGATG 

2 5 TCCTTTGCCAGTTTGTCTCCTGCTGAGAGAAGCAGGAAAGATCAACCCCCCCCCCATAGG^ 

TAGGCTCTAACCTCAGAAGGAAGCAATCACCAGCTCAATGCTGAAGTC 
GATAACTAAGACAAGAAGAGOVGGTGGCAGATGTTCAACATCATTT 

AATGCTCTACACATTAGCACCTCAAGTTGTCACAGGTCTCTGTCTCAGGAAGAAAGCATCCATGG 
AAAAAATGTCACAAGTTTAGCAGCCACCAGGGTGATGGGACTTTTTTTTGGCAAGTTGGCAGCT 

3 0 CAGCTTGTCCTTCATAGCTCGGCCATCCCATGGACTCTCTGGCAGGACAGCCAGAGCGGTCT 

TTCCATGTCAGAAATTCTAGTGTCAGGAAGGAACTGGGGACTGGGAGGAGACACAATGGGGAAACT 
GACCTGAGTTTGAATCCCATAGAATCCATGGAAAGTCAGATGCAAGAGT^ 

gaggcagagcacccctagacactcacaggacagctagctiggcaatctaca 
ggcaaggccccacattgacctccacgtgtgtgtcacggcatg 

3 5 ccatgactctttcccgtgtttaccatgcgtgcccc^ctgattagtc 

t aaaatctg agccaagccttgtgtcctg agcaag aaatg cccgt caccagggcgatgaa ctgttataa 

agaagagtcccacctgcttaatttcaggtggaggaagttgggaatttgttttccatttagcaagt^ 

aagagattacagccacagtgcttacattccttctggattattcaggcttacctcct 

gcaacattt atttag cctg cagaacaactcagg atcacgtgggtgggcatggttttgtt ct ctgattgg ataaactgaaattttat 

4 0 agtgtggctttcagggggcacacaacaaggttcacagcataa™ 

tagagtttaacaatgccagtttaaccccagaatcaaatgacttcatctcc^ 

ttatccgagcaggaagactgaagttctgggggggtcaaaggactcacaaggtcactcagctagtg 

ctactcagggtgcacctcactcattctcacccaccatgtc 

ttcctgaagcattactcttgggtctctcaaagcaagcttgtctacagactggtgagagagatagt^ 

4 5 NNNNNNNNNNNN CAG CCAGTGTGGGTG ATT AAAG CTAAGGAACAAGTCT CTTT ACT CCCTT CTC CT CTACCTT ATCATTGAC CTG C 

ATTCCTCTGGGGACACrTCCCTGGGGGATGTGTTTTAATCTAGAAATGTCACAAi^ 
GCTTTTAATATGAAACTCAACACTTTCTGGTGCTCTCTATTCTGAGCTAC^ 

GTCAGGGT ATGTGGAAGTGAGG AGTGAAAC ACTGCTGCCGAGGG CCATGAGGTAGACC AGTACTCTACAGG CCTCACTG ATGACAG 
TTGGCATCTCTGATGAGAAGCTGAGTATAGGAACTTTCCA^ 
50 TGCATTAAATGCTAACTCTTATAGAGAACAATCTGTAGGATAAAGGTC 

ATG TGGAAGAT ATTG CT AATAGCACTG CATGT ATCATGT AGTCAT AT AG CCTGG AAAG TGGT ATGAACATTGTT AAT CAG AAGAC C 
TTTTTCACATGCCCATTTCTGTGCATGGCATAATCTCCATTCCTT 
TTAAGAAAATAGAAAAGGAACAAGGTTGTCCAGAAAGAGGTCCGATAACTATGGGGTG 
CCTGTTGTAGGTATAGCTTCTGCCTTCCCCATCTCCTGTGACATGTGCTTTTCCCACCCATGACTT 

5 5 TGTAGAGCTGTAGGGGACTGGGCAGATCGrrCAACAGTGAAGCACrCATGA 

CTCACGTAAAAGCCACATAGCTATAGCAGCTTGCCTGGAATCCCATG^CACRAGAGACTGAGATGGG^ 

CTATCCTGACCAGCAGAAATAATGAGCTCCAGATCCAGCAAGAGATGCTCTGTCGTG 

GATACGGAACATGGTCTTCCACAGACACACGTGAACATGCATCAGCACT 

CACACACATATACATACACACACATACCCTTCCCCCCTCAAAACAGTAAAGGGCCAATGAGATGGCT 

6 0 CTTGCTGCCAAGACTGACACCCTGCATCCAGTTTCTACTAATTCATAT^ 

GACCTCTCTATGCACACAGTGGCATGTGGACCAACCCTGATTTTTCT1TTTTTAATAA 
ATTTCTGATTGTGGCCCAACTCTATAGCTCGAGATTTGAACAT 
TATGCAAAAGGAACCACTGTTGACTCAGGCAGAAGTTAAGCTTCCTCTC 
GGAAGCTCACCAGTGTGGGAAGGGCCTTCTCATCCACrGCCAGGCCGGCGTGTCCCGATCCG 

6 5 TGAAGCACACACGGATGACCATGACTGACGCTTACAAATTCGTCAAA 

GGGCAGTTGCTGGAATTTGAGGAAGACCTAAACAACGGCGTGACGCCAAGAATCCTTAC^ 
TGTGTGACAACGGGCAGGACGGAAAGGGCTGTGCTCTCTCAGGAGACGAAAAGGAGGGAAGGTGGATTCT 

TTCCTTTCCT TT TCTTTTCrCTTCT ^ ^ IT TTCT 1 C V 1 1 1 CTTTCTCTTTCTTTCTCTCTrriTTTTTA 

AGGTTGGGAGTGTTTGTGAAAGTAAACAAACTTGTCTAAAGACTCTATTTTTAATAAGTG 

7 0 ATGAGACCCACTTCTCTTAAACTGGTATTGCAGTTAGGTTAAAGA 

AAAATATTATAAAACTTGTTTCTTCCCAGTTTCCCTTGTAAGCTTAT^ 
CCGAAGATACCACCCTTCAATTACAAAGAACCAAAAG 
AGCAAACCCTGCCAAACCGTGCCCAGTTGTGCTCACTGGTC 
CTTCTCTGGGGAGAw»GGCTAGTGTCG<nTTT 
7 5 TTGTTTGTTTGTTTGTTTGTTTGTTTGGCAGAA^ 
GTGTTTGGAGTGTGCAGCTGAGCK3AGJU3CTGGCAG

CCCTCCCTCCTTTCTCCTTCTCCCCCCCCCC ITT CTTC CTC AT CC C C"l"i"l"r ATTTGT CTTTCrrTrG ATTTG ATT CTGGTT ACAGT 
GCCATACACCTTGTTACATATGTATATCAGAATCTAAAAAAACAAAACA^ 
CAAAAAAAAAATATCCTTGGTGTGTTTTTGTTGATGGTGTAAGAAT^ 
5 CATTCrTTGTACGTGCCTX5TCTTCTCTGGTTAGGCTCATACCT 

GCGGGAGGGACCGCTCACGGAGGGAGGCTAAGGATCCCAGCACTGATGCGGTCCT 
AGCAGACTCATGATTGAGCAGCTTCCGCCAGACCTGATAACCAA^ 
TGGTAACTACATGTAAATATCAGTGACAATCrrGCCTTGTATGGTTT^ 
TTTTAGGGACATTCACCCTATAGTAACAATAGTACAAGCTCCAAA^ 
1 0 GCTCTGTGTGCACACGGACTTTGACAGTGCTGGCT 
GGTTTTGTACGTTTTATTAGAGAATATCTATCTAG 

GAGTAAGCGCTGAAGAATTCTGGCATGTCTGGCTCCCAGTGACTACATTTGACAT 

TTACACCAGAAATATTTTGACATTG AT T TCTTT ATTCAAATAG CTTGCCTGCCAAGAGTCATTrAAAT l 'l'l'l'ClTTCATT AGAAAA 

GTGGAAATTGAACATAATAGGAACTXSAGAGCTAGCTGGAAT^ 
1 5 CTT1TTTTGGCATTTCCGTACTGTCAACAGT 

CAGTGACTTGGGATGCCTGGCAGGCTTGTCATTTTCTAAGTCTAA 

CTCCCCTCCCTCCTCT I ' C X' IT CTCTT^ 

TTTGTGTGTTTGGAAGTTTATGAGCTTAGAGGAGAA 

TGAGAGGGACrCCAGTCTTTTCTCCTAATCTGCTCTGAGTCACTTCACCT^ 
2 0 CCCACTGGATCAGCAGAGCCAAGGCTCTGGAAAGCAGAGTCCC 

GGGGTTGGGAAGTGACrrGTGTCTTCTGAGAATGCACAGTCTATCC^COT 

GTGAGCTGGTGGAATGAATCCAGAGTCTATCTAATGCCCAGTAGAGTAGGCGTCACTGCOT 

gattgtccacatttatttcaatagctgatagttgcttttccagtgtcaatgagtctag 
tggtgacatttggaccaaggtagatggagtggagagcgaggacggaagctcacaagaato 

2 5 tcttaggctgcagccatcactgatttctactgactggg 

tagtgtgaatgtccctaaaaagtctcatgtgagctagtgtcaatt^ 
aaagctaaagagaataaacagggggaaatgggaaagtagagtaacgagag 
tgaaaactgagacatagagaagcagtgttgatctctttgggtcttctgaggggta^ 
cttaactttcaccttatttlsaaacctgtctcrtggacatgaa 

3 0 tgttctggggcagggcacagtagaagatatcgaagctctggtactc^ 

ccagttttacccacctgccctttggaggcagcagggacrga 

CATCAATGGTGTGAAATAACCAGGCTTTGGAAACATTGGAAGACTCACTGGCAGAGTATO 

TCTGTGTCAATAAGGCCATGGGAGAGAAAAGCTGCACATGAAATAGAGGGGTC 

ACTGGACGGTTGAGGAGGAGAACAGGGGAGAGCX5TAAGAGCTGAGCTCA 

3 5 GCAAAAGCATGCTCACrATAATTCTCCCCATTGCCCTTCTAAAATGGGATGGCTG 

AACTTGATGCAAATTTCCrCTCAGACTTAGTAACCCAATGTGAAACAGGCAAGGGAA^ 
GATTTTGTCATCGACGTCATCGCGTTTGGAACCTCCCTTTCTCA 
TCATGGTTCTTTGCCTTCTGAATGGCCCAGACCTTGACAACTGGGCTTGT^ 
AAGGCAACTACAAAGCACAGCAATCTAAGCGACACTAGCATT 

4 0 TGACAATCACAACCATGTCAGTCAGTATAGTAAAATATATTTTTGCT^ 

GTGTGTGTGTGTGTGTGTGTGTGTGTAGTATGTGTATGTGAGTTCAG^ 
AGCTGGAGTAACAGGCAGTTATGAGCTGGCTAATATGGGTGCTGGGA^ 

AATGTCACT AG CCT ATGT CTG ATTCTT AGTGT AAGG CACCCAAGG AAGTTTT AAGTG TCAACTT CT AG CACAATGG ATCCAT AT AG 
CATCCCCCAGTAGAGGTGTCTTTACCTTAGGAACTAAAACTTCTACATTCCAG 

4 5 AATGAC!ATTTGTAAGAGACGT/rTGTCCTGTTTCT 

TGGTGCTGCTATGTATTCAAATGATAAATACTGGCCCCAAGATCTGGTTTATACCCCCCAGCTGATCCTAACATCA 

TGAAAAC CITGCTCC CCAATTTAAAAT ATTGGTT CAGT AAAAATGG CTACAGT C AG TT ACTGAGG AG AATAGAAAG AGG CAG CGTT 

TCCATTTACTGGGGCTGGGGGGTAGCAGGTAGGAGGGAAAATGAGTAAGAAAATC 

TGTGATGGACCAGGAGCAGGTGCCTGAGTGAATAGTCCCCCTCCCCCGCAAAGGACAGATTGCT 

5 0 GACTCAAACAGGAAGTATTTGGGGAGAGGTGATGGGGAAGTAGCCAGTCTAGCATTTGAAAAATAGATTAGGGGTAATCCCCCACT 

AATTGCGCAAGAGCTGATTAAATAAATCCATAGGACTCTGCCTCAGTTACTGGCAGGCTGAGTGGGTCATATTAATAACT 
GTTTATTAAACTCCATTGACAAGGAGAAACAACAACATATACTGACTGATGACTCTGACTTTA^ 
TAGCCCTGTAACATATTTCCATTCGTCCAGCATAGCAGTAGACTTGTTAGAACTCTCTGCCTTTCT 
AACATGGTAGGACTAATGAACATGAATCTGTTTCCATACACAATTCACCACAAAGGAGGTTCCT 

GTCAGTTTCTATGAAATGAGAAGAAACTTGCCCTCCAAGATAGAA 
TGGTTTGCTAGTCCAGGAAATGAGTGCTGCCTCATTTGGAGTTGGTTGGACATGTA 

GAGTAGACAATCATGTGCCTAATTGCAACCCCATGTTGACCAGCATGGGGACTTTGATCAGAAGATCATTGA 
AGGCTGACAGAATGGCTAAGCTTATCTAGTCTTAAGATACTTCTATCCTAAATCC 

6 0 ACTAGGAGAAACTGATTCCTTGCATTCGTTTCAAAACTTTCCAATCTGATTTACTCTC 

AATGAAAACTGTCACTGCTGCCCATAAAGCAATGTAGGTTCATATTGTTGCTCCTCTCTC 
AGATGCAAAGTTCTGCCTACTGGAATGTCTTTCCTTTACCTCTTTTTCT 

TCACTCTGGATGACAAGCTAGTATGCAAAGCTTTCTTTAAACAGGGTTAATl'1'Gl'T'lTTCAATGACTAGTCCCCA 
TGACTCTAGAGGTATAGGTTCTAAGACATATTGGAGCAGACAATGGGTCTGTGGGTCAGTGCTTATTGATTGGC^ 

6 5 GAGTGTATTCC ATGCC CATTGATAATGAAATG ATACTGTTCATGCT CTATGT AG ATACATGCTGGGCTCAAT AATC CTTAGCTCAT 

AATGGGAAATTTGGGTTATATTGTAACTTTATGGTTCCAGATCAAGCTTCGAGTATCA 

CTG AAAAAACAAAACAAACCAAC CAAACAAAACTACTG C CAT ACT ATTG CTCCTGAATTCT AAAT ATCT ATTCTGG CATCTATTTT 

GTGATTTGCTTACAGGGAAATGGCTCCATGCAGTATTGCTGTCACACAGAGAGACCT 

TGTTAAAACATCrAACATGGTAACCTTGAGCCGCAATAGAGACTCCATAGTTCTC 

7 0 CACAGTAAATGAATTAATTTTCCTAAATCTAAAA 

TTGTTACITTGTCCTTAAAGAAGAT^TATTAGCATACTTATTTTAAAAA 
ATTTATTTATTATATGTAAGTACACTGTAGCTGTCTTCAGACACTCCAGAAGAG 

CACCATGTGGTTG CTGGG ATTTGAACrCCGGACCTTCGGAAG AG CAGTCGGGTGCTCTTACCCACTGAGCCATCTCACCAGCCCCT 
TGTTAT ATTT ATATGAGT ACACTGT AACTC C CT ATT AT AG ATGGTTGTGAGCCAC 

7 5 CATGTG ATTG CTGGGAAT TGAACT CAGGACC CCTCGAAG AGCACT CAGTGCTTTT AG CCAT CTCfCTAG CT CCCT ACT AGCATTCT 
TAAAAC CACT AGACGTCT AT ACATTCACTGGTGAT AAGCC CCTG AG TGTTT CAG AATG TG TTT CT CAACCTTTTCATTACT AAGT A
GAAGATAGATAGGTTAAGAGCCAAGCTAAGCTCATTGAGAAAGAGATGACTCAGGTACCCATGGTCT 
TCTCAGCAGCATATCTACATCAGTTATTTCATTGTGCrr 
GGATACACACCTACGATCCCTGTGTACn'CAATTATGATACAGGGATATCAA 
5 CAATATTATTCATTCTGAAGGCAAACTGCTTCTAATGGCCTTAAC^ 
TGTAGGATTAGAAGGTGATATAGGCTGGCTAGCGAGGCT 
ATGGGAGTAACTATATACTGTAAATACAGCCATCTGGATTT 
TATACAACAGCCATAGAGAGCTAAACATTTGCTTTACCAATA^ 

CAAACCATGT CTGGT AT AACATCTGT AACAGAAACCAACAACTGAAAG ACAT CTTT AAAACCT ACTGTTC CTT AAATTCAT CTCTT 
1 0 TTTTTGTGTAGGACAAATTATTGCAATAAAGGAGGAACATGTA^ 

ACTAATCTGTTCCATTCTTTAAGAGATCAGCATTGCTCT^ 

AAACCTAGTGGCATAAAAGAATTCAAATTGTTGGGTTTGTTAATC 

ATAAACTCAGAACCAGAAGTGAGGCAGCTTGTTGGGGCTGGAGAATC 

GAATTCAGCCAGGGTrGTCCACTAAATACCTTGGATCCTCTCCTCGAGGATGCT^ 
1 5 ACTGAGATG AAAGTTGCAAAGCTTCT AAAATCGT AT AAACACGG ATG TCT 

TACAAGCTCAAGTTCAAATTCAAGGAGAAAGCAATTGGATTCTACCT 

ATCACACTGTAGCCCCCTTTGCACAACCAAATCAGTATGTGAAAAATAGCT 

AGGCAATGACATCAGCTTCCTCCAAATATTGCTTCAAATAGTGTTGTAATTCAAAG 

CTCCCAAACCAAGCCCTGAGGATAAGGATAAAGATCACAAGGGGAAGGTCG CATCAGCTGATGACTG CCGCCCTCCTGCTTCTG AT 

2 0 GGTATGCCAGCTCTCAAGGCCACACGACTCACAGTCTTGTCTCA 

TG AT CT AAGAGGCATGCG AT ACT CCAT AATTCGTGCCCAAATC C CTGGG CAT CGTTGTGACCTTT AG CTTG CTAT ATTTCAACGAG 
AAAGATGAATGCACCCAAAGACAGACAGACAGACAGACAAACAAA^ 

GTG ATGC CT AAGG ATTTGGG AAC CAAAAT AT AGGTCTGC CATT GG CTTTTGG ATTG ATCCCTTGC CG T CACTGGG AGAT AACTGTC 
AGATGTGTCTTCTGGCGGGGGGGAGGGTGTCTCAATTTTTTTTTA^ 
2 5 TTTGTCTTGTTTTGTTTTTTGTTTTTGTTTT 

AATAAAACTGGCAAGGGAGAGTAGAACniAGGCCAAGATGGAGGGAAAGGG^ 

CAATGTAGAAAGTGAAAGAATTATCCTAGTGAGGGTGGGAGCTGTTGCATTTGTTCATCAA 

ATGCCAGTCCCAGTACTTCCAGTCTTCITAGAGGGAAGCCACA 

CGTCCTAGGAATCATAAGTGATACAAAAATGAACTGAACATGAGTGTCTGGGCTATAGGCCIT 

3 0 CCCTTCACTTCTGACCACTTTCCCAGTCCCATTGAAAAGGATAAGGAG 

CAGATCTTGTCTGCrrCTTTCCTTTCTTCTTGAACT 

GT CCTGTCTTTTTCACCCTCTCA (JTrX '' l 'C T T TTT T ATTG AAAG AGTTAAACTTG CTGAAAACCCAGTGGG CTTCTGAAGAGTCTCT 
AGTCTCTAGTGTTGCCTTCCTACGTTCCCACCCTAGGCCrCAGATTCTTCAGTTTCTCACTC^ 
GG CCGGTGTG CTTTCGGT AACITAG ATGTCTGTTTGGCAATAT 
3 5 AGAAAAAAACACTCTAGGTTTGGTACTAAACTCAGACCGAAGTCrc 
GATTGATGATTACTTTAAGAAATGATTAAATTATAATTATTTCCTC^ 
ATGAAAACCCCAGTCAATGATCTTTTCTTCATTATCACGTT 



MOUSE SEQUENCE - mRNA 
4 0 AAAGTAACTCTGTGTTGCTGCTGGGGATCAATGAAGCTGAGTGAATGGGGGCGCACTG 
GATGGTAGAGGGCrACGCTTATTGATGAAACrCITGACTTCITGAATT 
ACGACAGGGTAGTAGTGXSC^CTATCTAGGCCTGTCCGACCTCAGGATCTC^ 
AGCCCAGGTAGTGGTAGCCACGCTCCTGTCCTTGCCGCCGCCGTTGTGACCCTCAAGGCTC 

CAGCGGCT CTGCCCGCTCCCTG AATTGCGGATGCAG CAGCACTAGCTGCTGCACTGTGGCAAC CTACG ACAAGGATCATCAGGCCC 

4 5 AGACTCAAGCCATTGCTGCAGGCACTGCCACCACCGCCATAGGAACCTCGACCACCTGCCCTGCT 

GAAAACACAGGCTCTGTGTTAAGTCCATCGGGTGGGGTGGGCAGCCCTGTGTC^ 
CATCTACCCCAATGACCTGGCAAAGAAGATGACCAAATGCAGCAAGAGTCACCTC 
GGCCTTTCATGGAGTACAACAAGAGTCACATCCAAGGAGCTGTCCACATTAACrGTGCC^ 
CAGGGCAAGATCACTGTCTTAGACTTGATTTCCTGTAGGGAAGGCAAAGACT 

5 0 TTATGATGAGAATACCAATGAGCCGAGCCGTGTGACACCATCCCAGCCACTTCACATAGTCCTrc 

AAGAACCCCTGGTATTGAAAGGGGGACTCAGTAGTTTTAAACAGAACCATC^ 
TGCCGGGAGGTGGGGGGTGGTGCATCTGCGGCCTCTGAGCATGCTACCTCAGTCTGTCCCCACC^ 
GCTAACGCCCATCCTGCCCTTCCTGTTCCTCGGCAATGAGCAGGA 
ATGTCATCAACXH'CACCACGCACCTTCCTCTGTACCATTATGAGAAAGGCCTCTTC^ 

5 5 AACAAACAGAACCTGCGGCGGTACTTTGAAGAGGCCTTCGAGTTCATCGAGGAAGCT 

AATTCGTCAAAGGCAAACGACCAATTATTTCCCCGAACCTCAAC^ 
GGCGTGACGCCAAGAATCCTTACACCAAAGCTCATGGGCATGGAGACAGTTGTGTG^ 
TCTCAGGAGACGAAAAGGAGGGAAGGTGGATTCTAGCTTGTCGCTCTTC1 1 1'1'CCT 

6 0 TTCTTTTCTTTTTCT T C TTT rCTrrCTCTTTCTCT 

TAAAGACTCTATTTTTAATAAGTGTAAGAAGACTGTAACTTTTGATGCGAATGAGAC 
GGTTAAAGAGGTCTTTTTTGTTTTGTTTTGTTTTTTTAAGCCA^ 


MOUSE SEQUENCE - CODING 

6 5 ATGCCTCC^TCTCCTTTAGACGACAGGGTAGTAGTGGCACTATCrrAGGCCTC 

AAGCTACCTTGGCTCTGCCAGCCCAGGTAGTGGTAGCCACGCTCCrGTCCTTGCCGCCGCCGTTGT^ 
TGACGTATATGCCCTCATCCAGCGGCTCTGCCCGCTTCCCTGAATTGC^ 

GACAAGGATCATCAGGCCC AGACT CAAGCCATTG CTGCAGGC ACTGCCACCACCGCCATAGG AACCTCGACCACCTGC CCTGCTAA 

7 0 AGCTAGCCAGCATCAAGATCATCTACCCCAATGACCTGGCAAAGAAGATGA 

CCCGTCATCATCGATTGCAGGCCTTTCATGGAGTACAACAAGAGTCACATCCAAGGA 

CAGCCGGCGAAGGCTGCMCAGGGCAAGATCACTGTCTTAGACTTGATTTCCT 

TTTCCAAAGAAATTATAGTTTATGATGAGAATACCAATGA^ 

TCC CTTG AAG AGAG AAGG CAA^_G AACCCCTGGT ATTGAAAGGGGG ACTCAG T AG TTTT AAACAGAAC CATGG AAACCTCTGTGACAA 
75 CTCCCTCCAGCTCCAAGAGTGCCX3GG AGGTGGGGGGTGGTGCATCTGCGG CCT CGAG CATGCTACCTCAGTCTGTCCCCACCACCC
CTGACATCGAGAACGCAGAGCTAACGCCCATCCTGCCCTTCCTGTTCCTC
CAGAGGCTCAACATCGGCTATGTCATGAACGTCACCACGCA^ 
GCTGCCAGCCACAGACAGCAACAAACTVGAACCTGCGGCGGTACTTTGA^ 
GGAAGGGCCTTCTCATCCACTGCCAGGCCGGCGTGTCCCGATCCGCCACCATC^ 
5 ACCATGACTGACGCTTACAAATTCGTCAAAGGCAAACGACCAATTATTTCCCCG 

TG AGG AAGACCTAAACAACX^CGTGACXJ CCAAGAAT C CTT ACAC CAAAGCTCATGGGCATGGAG ACAGTTGTGTGA 

HUMAN SEQUENCE - GENOMIC 

T ACATTTTT AGT AATACAGGACAGG ATTG CAGGACACTTGGT ACAATTCT CAT CTTTT AGG CT ATG CTTTGCT CTGT CC ACCAAAG 
1 0 GGGAAAAAGGGTGTATTAGGATTCTCTAGAGAAACAGAACCAACAGGATGTGTATGTGTATAAATGG 

GG CT GATG CAAT ATGGAGGCTTGGCAAGTT CAAAAT^ C CAGGAAAGATC CAATGTTGC AG 

TTAAAGTCTGAAGGCTGACTGGCAGAATTCCCATTGCTCAGGGGAAGTC^ 

GAGGCCCACCCAAATTATGGGGGGTAATCrcCTTTAClTAAAGTCCAC 

CAAACAAACACCTTC^CAGAAACAAGCAGAATAACACTT^ 
1 5 CAGAAAGGAAGATTCCACTAGCTTAAACATAOUU^ 

AAAGCAGTTGXSCGCCTGTCTAAGCCAGTGCAGCCrT^ 

GAGGTGGTC^^CCCAGTCTCTGCTGAAGTCAATCAGAATGAGTTTTAGTAATA 
AC^CCCCTCTGTAACCAGATCCTATTCTATTTAATAGCITGTGAAA 
2 0 CTGCTCTTTC^CTTCTTTCTGCCCAAAGTT AC CC 

AAACAAAAC^AAAAACCCAGTCAGCCCACn^TTC"^ 
GTTTGATGAGTTAAAATGTCTGCCrTTAACTCCTCTAATCCA 
GAAAAGGGAGTCCAAATCTATTGGCATTTCTATTCTGCAGAATATAGA^ 
ATTTAAGGTTTAGAATGGCGGTTCATAGAGTTGCrrCAGTTTTCCT 

2 5 GCTAAGAGTCCTTGTGAATTATACATCTTAGTTTACTACTGTAGAAAAATG^ 

GCACTGGCAGATGAATGAAGCACAGCGCCAGCTCTCTGGCTACTTTCT 
GTGACCTGCAAGTTAATGGCAAAAATAACATTTGCTTCTTTTCT 

GATTTTTCTCTTACCTGTTGAATAACTCGAATTGACTAGACCTGACCTGATTTGGATTAATTTGAAT 
GTGGTACTGTGGGCTCAGTATTTAGCATAGCATAGACACTTAAATTCA 

3 0 GGCCCTTGCTGGTCATGAGTATAATTACAAAGTGTTTGC^ 

TGTXaAAATCCAGGTGGACTTGC^CCCATTCACAGAGCATTAGAAA 
AGTAGTGGCATTTCCTTTCAGAGCACAACCACCAGGAAGACGAAGAACTCACT 
TTCTGAAGAAAGCrATTTATTGGCATCAGTCACATGCTAGAGAAAATAAACACTG^ 
GGAAATGCATGCCTTTGAGAATTACAAGAAGGAAGATGTTGAACG^ 

3 5 AAGAAGAGGAATTTTTTCTAAACATTGAAGACAATCATA^ 

ATAATGAGATGGATTATAATGAGAAAGATGCTGATGTTTTCTAAATTCTATGGGAAAGA^ 
TATGATCACAAC^AAACAAAACAAGTGGTGTGTAAACAAOTTAGCTCCCC^ 

TCAAAAGAGT AGTG AATTTAGGG ACTTTGGAGGTTTCT AAGTTTCCAGCT CCAAAC CAGTAGCT AG AAAG CTATC 

ATCAC^TGGACCAGGGTATGCCTCTAGAAAAATGAATAATTCAAAC^TACGTTCTT 

4 0 AATTCATATGTTGAAATCCTAAGGCCCAAGGTGATCATGGGGGCA 

CrAGAGAGAACCCrCACACTTTCCACCATGTGAGGACACAGCCAGA^ 

CCTGATCATGCTGGCACCTTGATCTTGGACTTTCAGCCTCCAGAACTGTGAGA 

TATGATATTTTGTTATAGCAGTCTGAATGGAC!AAAGACAGTATGTTTAAAAAACATTCT 

CTCACTG AACCG AAG CTT AATG AGTTCAAACTACGTGCTAG CTC CCGGGCC ACCGGCTAGGGATGGGACGGCAAAC^ 

4 5 TTITCTAATAGAAGTGTCATAGGCTCTTCCTTGAATTTACCAGTCCAATCCGCAGTAGCAAC 

AATATTTGAGACTGTT CTGGAAATAAAGTGG AGGTCTGGATCCATTCCCCAATGTGG AGGCTG CGGTTGT AAGAGTG AG ACCACAT 
GGAGATTCACGTCATGATGGACTTAGAAAAGTTCAGAAGAAAACAAAAGAAAAAA^ 

ATATACTGCAGTCTGGAGACTGAGGGGGACATGATAGAAGTTTATAAAATCACAACAAGAGTGAATGAAATAACACA 
AAAGCAGATGCTCTGGTGCCAC^TTGTCTGGGTTTAAGTGTTGACTCTGCTGCTT^ 

5 0 CCCCATCTGTATTTCTGTTTTCTCATCTTTAAATTAAGCTAATAAAAAA^ 

AGTGAGTTAATGCATGTAAATCACTTACAG CAGTGCTT AGCACAGAACAAAACACTTAGTAATATGTT AC CTATG ATTATTTTTTC 

CCAAATTCTAGLATCTTTAAAATCAGTTCAGTTAATATTCAATTG 

ACTTCTTATATGTAGAAGTGGTACAGTCrGAACCTACTAGGAGATGCTG 

5 5 CCTGGGGTTGAGAAACTGGCGCTGACAGCATATCGTAATG^ 

CTTTGATGTAGCAACACATAATAATAACAGGCCATCCGCCACAG^ 

GTGTGTTATTTTCATAAAGAAAATAAGTTTACATCCAT 

GCAGATGTTTGTGGGGTTACAAATAGGGGAGCAGTGAAGGGGAAGAAAA 

AGCATATTAATTCCCCTAACCAGCCAGTCACCrCTCTGTGATCAGTAGAGGCCAGAGTGAACCIT 

6 0 CATGCTTTGGCCATGCCATCCAGTGAAATTGCTTTACA 

AGAGGATGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCATG 
TAGCCTGACCAACATC^TGAAACCCCGTCTCTACTAAAAATACAAAAGTTAGCTAGG 
TACTCAGGAGGCTAAGGCAGAATTGCTTGATCTCAGGAGGCGTGGGTTGCAGTGAGCCAAGAT 
GCGGCAGAGTAAGACTCCATCTCAAAAACAAAAAAAACAAAAAAOU^CAAACAAAA 

6 5 CAACCAGGATCCTTTCITTAACrGAAATTTTACTTTGGAAGCCCATG 

TCAAGTGTG AAGG TGTGGAGG ACAAGATTGTGGT CACAG AGGCTCCACCAGTT CACCC CTGCTCCCATTTCTATTG CCTCCCACTT 
GCACTGATGTCTTTTACCCTTTGTATATAAAGTTTTCAAGATCCAATGT^ 
CCACACTTCGAGTGGCCGACAC^TGTGTCAAATGACCACCACA.CTG^ 
TCCTGAGTCACAAGCACACCTGCCCATGAATTTTGTAGGAAAGAACTGC^ 

7 0 GCACATGGCCCTGCCTCTGTTTCCATCATTACAGTAGCCACTGTCT^ 

GGAAGTTGTAAGGAACAGAAATTTATTGCTCACATG 

TGCCTGGTGAGGGTCCAATCTCTGATTTCAAGATAGCACCTTGTTGCTGTGTCCCCACAG 
CCTGAAGCCCTTTTGTAATGGCCCTAATCCCTTCCATGAGGCCTCTGCCCTCATGACTTAACCACCT 
AATAGTGTCACACTC^GATTAAGTTTCAAC^TGTATTT^ 
7 5 CTTATCAGCCCTTCCCTAGGCCTGGAGAGTGTTGCTGGATGTTTT^ 
TTTCCTATTCCTCAGCTTATTCCCCACTCCTTCTGGTATAATA
CTCTGCCTTAATTCrAGTACCrCCAGAAATCTATCTTGACCTTGAG^ 
ACTTAACCATTTAAGTGCACAAAAAAGAAGTAATTTAATTAATACATTTGCT 
TAAGTTATTGTCATCrTTTGACTTTAATACATAGTACTTCAG 
5 ATATGCTCAATACAATCATGTTACAACATCCAGACTGAATGAAGAT 

CTG CTGCTGGTT ATTGATAT ACAGATCAG ATTTG T AGC CATTGG CTGACT AG CATT ACGAATTT AAGTCTCAGGG CATT AT ATTT C 
AGTGT AGT ATTTTCACATTT ATCGCTCAAAAACAGG AAT AAAAT CACTT CTT CTTGT CCAAAAG ATGTT CTG AAG AGT AGTG AATG 
GAAAAAAATTCCATCTATAATCTGATTTTTTTCACCAGTC^ 
TACAACITTGCITATATTCATAATCTGGTAATTATTCCACA 
1 0 GTTTCAGCCTGGATGGCTTAGATTCTTAGTCAACTAAATTATAATAATCTACT 
ACAAGAGTTACATTTCTTCTTGAAATTACTTTTCCATT^ 

AT CT AT ACTTTGGG CT^ AGATGTTGAGG CATT AGAAATGAATTCATCATTT AT AG AT CAT AT AT ACCTTT AAAGAT AGGCAC CTT A 

TTTCCT AAAGCAAGTAAATT AAT TT T TTT T T CCACTGAATGCCITTTATCAGCAGGGGATCTTTTTTCCCCCCTGAG CGCTTG AAT 

GAGTTCCCTTTTTCTTCTGAGTTGGGGTATTACCATTCT 
1 5 GTCAGAATCAGAACTT AG AACT CT CAAACT CTGAGTT AATGAAAAATTTTCC CAT AATTGTCT ATC CTTT 

CAGCCTTCCAGCTCCAGATAGATCAAGGGTAGAACCTCTGGGAAGTTCTT^ 

TCCTGACCATAACCTGTTGATCTTCAAACATTTAAGAACCAAA 

ACGTGTGTGTTGGTGTGTGGATGGAGAGAGGTGTAAACCAAAAATAAAATTCTA 

CGCCTTCCCAGGCAACCCTCTGAACGGACrrAATTTGTCAACCA^ 
2 0 AAGGGATGTGAGGGTGGGACGTGCCCCATTCATTATATATCAACCAACGGACTAATTTCT 
CTGAAAGACTC^TTCAC^CCATGAGGGGAAGTGAGGGTGGGATGAGCCCTGTO

ACATTTTACAACCTATTCTCTCTGAACCTGCTGCCTGGAGACrr^ 

AACCCATACGTTCCTTTCTTTGATCTCAGGTCnTTA 

CTGAAAGCCCACCCXSCTCCTTAGAGTTGTCCAGCCTTTTTG^ 

2 5 GCCTCCCTAAAATGTATAAAACCAAGCTtaGACCCCAACCACCTTGGGCACGTGCT 

TGGTCACTCATATTTGGCTCAGAATAAATCTCTTCAAATATTTTACAGA 
CTCTCCTATGTAAGACTATCGGATGTGTACGTGATGCCCGTCGTAACACTGGTCT 
TGTAAGTGGAGGTGCCCAAGGGGATTTCTCTTCACAGCTCATTGAGC^ 
TCAGATCTGCCCTGAAAAOVATTACATGAATATGCCCCTGCTACTCAGTATACTACT 

3 0 CCCCATCC^U^TCAGGTAAGGGAGAAAAACGATAGCCCAACATTCCCTAGTGCTCATAATGTATTA 

CCCTTCrCTCACAGATAAGAAAAGAAAAAGCTCTACTTCTTT'AGGGTTCT 

CATCT C CACTTCTTTTTG T ATCCCAGGG CAACTTTC CTGCAGTTTC C CT AATG AG AAT ATTTTC CGGACT AGG CAAGGGGCGG CTT 

GCCCCGCCCCCTACTCTTCTCCCTGAACAGGNNNNNNNNN 

bTONNNNNNNNNNNNNNNNNTOW^ 

3 5 CTCCTCCGGCCCCATTCAATCTAGCTATTTGCACTCGGCTCT 

CCGGTCCTGGCCTCGGCAGGGGCCGTGCGGGACCG AGGGGTGG CGG CCG AGGCATGTGGGGCGGGTXMGTTGCGCGCTGCTGGCAC 
GGCGCGGGGACCCAGCGCGCCAGGCGGGAAAGGGGGGCGGCGG CGGG CGGGGACCC AGACCCGGAGGGGCAGCGGGCGCGTAGCGG 
GCCCCAGGACGCCOSGCCAGGGGATTGGCCrCTGCACCGCTGCTCAGGG 
CTCCTCCGAGCAGCCCTCAGTGGCGCCGtfXTTAGTCCTCT^ 

4 0 CGGGG CGCGGAAGTAGGTGAGCAGCAGGGCGGCGCG CGGCGGTCAGGCCGGG AC AGCCAG AGGGGCAGCGACATCTGCGGGCX3GG C 

GC1TCCCGGGCGGAG CAGGGCCCTCCX3GAGG ACG CAGGCCGCGTCG AGG CTGAAGGGCCTCC CTGTCCGGGACGGTCAGCTGCCAA 
GTTCGGGCCCTTAGCGCGTGGCCCCC^CCGCGGCTCGGAGCGGAAGCGTCGCCCCTGGCTGCGCGCT 

G AAGACCCGGCGAGGTCCGCGAATTTTGG CTTAGCGTTTGTCAAGTC AC CTCGAG CGACTCGGAG AGAACCCGGCTTTCCCACTAG 
C^ACGGTGATCGCAGTCTCTTTCCTTTAGAAAGTTACTCCG^ 

4 5 TTTGTCCCCGGGTGAGGGAGTGTTGGAGG ATCCCGTCCAGTGGGAAG AG CCAGTTTTAGCGGAG CAGCGCGTCCAGCCCGCTGACA 

CTTCAGCCTGGTAAAGGGTTAACTGCATCCrGCCGGTTC 

CTCITTTAAGTTTTTGATTGGTTGCCAATGACATCACCGC^ 

CTTAAAGAGATGAGGCCACXJCCTTCTITACCCGCCCCCTAGGCT 

CCGGATTCTTATTGGCCCAACCCGCCGTCGGTCGGCGGCGAGTCCGCCCCCTCAGAACCTTACATTTACA 

5 0 AGTAACTATGTTGCTGCTGGAGATCAATGAAGCCGAGTGAATGGGGG 

GGTGGAGGAATACACTTATTTATGAAGTAAGTGGAAGTCACCGCGACT 

GATGCGATGCTGACGGAGGAGCTACAGCAGGTAGGGTTATAAGGTTCCGGCCGTTGGCTCCTGCTACCCT 
CAGTCTACTTGGATTTGGGGOSAAGAAAAAATGAGGTTTGTTCGG^ 

TCTCTCCAGCTCTTGCCTAGTGGTGGAAGGGAGAGGGGATAGCTCAGTATTTCTCTG^ 
TCACGGGCAGTATGTACGCCCGACCCTCTGCTGCAGGGATCCAGGTCTCTCTATACACTCCGTGTGTC 
TCTGTGTGGAGGTGGGGGTCCCAGCGCGTGTCTCTCTGAACACGCTTATGCGTCCCTCTCAGA^ 
CGTGTCCTACCAGGTAACACAATAGCAAAGACTTTC^CCGCTTGGTCCAACCGGGAA^ 

6 0 TCTTCAGAAAAAATACTACGCAAGGGAAGGGGGGTTGACTCCTTCGCrCCC^ 

AGATCTGCCGACGTGATGGGTGCGTGCCCCGTGTGCGCGCGTGAGTGTGGATTCCGCCTGTC^ 
TGGTGACGGTCXSTGTGTAGACX^ATGAGAATGGTGACGGTCGTGTGTAGACGGATGATA 
AGTTCGTTTAGGTGAGCGTCTCTTGGCGTGGGGTTTGGCGTCTTTGCTATGTG 
TGAGTTGTAGAGTGCCCCTGTGCCTACAGGAGGAAGGAGCTGTCTTTTTGCA 

6 5 ACAATTGACTGTCCCTTTCAACGAGTACAGGCTAATT^ 

TGGGATTGAAACCTTATACAACGTGGCTGCCTCCCATTGAGAGAAGCTCTAGGAT^ 

TGGGGTTTCCAGGTTTCCAGATCTGGCTCCATC^CrrTGAGATT^ 

TT AATTT ATAAGAGGCCAAAAGACAACATGTGAGGTTTTAGTATGACTATGTTGTGCT^ 

7 0 GGCAAGAATAGCTCCAAGTTAAAGAGATTCACTTAAGGGGATTTGCAGA 

ACCTTTCTGTGAACCATGCCAGTGCTCTTGGTTACCCCTTTGC 
GGACAAATGGGTCTITCTCAGATGTTTGCAGTGTACTTCn^ 
TCTGTCTTCAGCTTTTCTAATTATTTGTGCTTATACCTTCTCTACAGC 
CAT'rTACAACATTACATCCAAATTT^ 
7 5 TAAAAATGAATTTCAAGTTTTGATAATGGTATGATAAAGGGCAATTAGGACGACTTTGCTCAGAG 
GGATTGGATGAGACTGCCTTTTGTGTCACGTGACTTTA
TTGAATTGCCAGTTTTCAGCCTCCTCATGCCTCCGTCTCCTTTAGACGAC^ 
CAGGATCTCAACCTTTGTTTAGACTCTAGTTACCTTGGCTC^ 
CGTTGTGTCCCTCAAGGCTGCGAATCTGACGTATATGCCCTCATCCAGCGGCTCT 
5 CCAGCTGCTGCACTGTGGCAACCTACGACAAGGACAATCAGGCCCA^ 

GGAACCTCTACCACCTGCCCTGCTAACCAGATGGTCAACAATAATGAGAATACAGGCTCT 

CCCTGTGTCAGGGACCCCCAAG CAG CTAGC CAGCATCAAAATAATCTA C CCCAATG ACTTGGCAA^ CA 
AGAGTCACCTGCCGAGTCAGGGCCCTGTCATCATTGACTGCAGGC^ 
GACATTAACTGTGCCGATAAGATCAGCCGGCGGAGACTTG 
1 0 CAAGGACTCTTTCAAGAGGATCTTTTCCAAAGAAATTATAGTTTATGATGAGAAT^ 
AGCCACTTCACATAGTCCTCGAGTCCCTGAAGAGAGAAGGCAAAGAACCTCTGGTC 
AGOICGGTCCTGATTAGCCGGGTTCACTTTGGGT^ 

AGACITT ATTCTT CT CrrCCCTT CT ATG CTCriT ATTGl^'U'lX^TTT AGT AAGAAAAACCCAG CAG CACTGAACTGGAAGCCAGAA 

GATCTGGGTTCTACCCTTCATTTGGGTTCTAATTTGCATTGTGCAAGTCATTrCAC 
1 5 AAAGGGGCTITATACACACCAGATGGTATCTCAAGTCATGCCTGGTCTGCTA 

TTGT CT CTCAGACTT AACGAATGTTG ATGG AACAG AGG AAGG AG CAACT ATG GTG AACT AGAAGTAAACCT ACAGGTACCAG TTCT 

TACTGTGGGAAATTCTTTCTCTTTCTCACCTTTCTGACCA^ 

GCTTCTTATTCTAAGGCCCCATATAAAGTATAAGTTCACCAATTACATCC^ 

TTTATGAAACACTCCCCAACCCATTGTTTTTTCTGGCCT 
2 0 GGTCACTTGCCCCAGTATACTGAATGCAGAATAACTGAGATTAGCAGAACTGAGCT 

CGGAGCACCTTCCATACCGTGTTGCCTAACCTTTCCAAGTAGCCTGCATCCTGTGCCCAGCCT 

TACTGACTTTTGGGGCACTGTACATCACTCTCATTCGGCATCT 

TCTCCTCTTGTGTTCTGAGCCAAGGTACGCAGTTTCTGGGTCACATCTCCAAC 

CTGAGTTTGAACACATTGCATATTTAAATTGCAATAG^ 

2 5 CCATTCTCCTTCTTGTCTTCCCAGTGGGGCATT 

CAATCTCCTCTTCCCACCCTCCGCTCTCCTCAAACITGCCCrCATG^ 
GTGGGCTTTGCTGGGAAGGAGAACATGAAAGAATTGTATGTAGAGAGTTCCAGAAGCTT 
TGGTGGGTTTTCTTGAGGTCACAAATATGTTCAGATTTCCTCCATATAGCT 
ACTTTGAGTGATTCCCAGGATATTTACCCTGCTATTGTCTTACCATAATCTTGATGTGTCT 

3 0 ACCGAGGAATTTCCCATTTGTAATAGTTTGTCTTGGGGAGCAAAGAGGAGGATAGCCAATGAT 

CGCAGAGCAGGTGTTCATGATAAAAAGCAAGTCTTTTGTGTCrTCATGATCTGAAGA 
AGACCTCCTAATCATGGGGATGATTACAAGGAGTTGC^GACCCT^ 
CCATCAGTCATTGAGAAATGAATCTCCTGGGTGCTACTTGTGAAAGTG 
TCTGGACAGCCTAGACTTCTTCCTCAGTGTCAGAGTAAGATTGCCT 

3 5 GGCGCCCTGGGCAGCCACTGCGAGGCCTCGTCATTTTCGAGT^ 

AGAGGTTAGATTATTTGT AGTTTTGGGGGCCTTTTCTT CAAATATCTGGGAATTT CATTGAAAATATGTAGTTCAGG AGTGAAGGG 
GTCTATATTCTACTTTATAATTATCATTATCATTTTGGTATCITTTGGGAGAATG 
GGAAAGGAATGGGTCAATATTAGAATGCCCTTCGTTATCCTTTCAACTTGGTAAACCAGAG 
GAAAGACGGGGTGGGCATTCCCCAAAGGCATGACCAGCAGAC1TCTCTCAGATTCTTGAG 

4 0 GCTTCTCCTCTCAGATTTATTTGAGTAATCCCCTCCCTTGGTGATAAGATTGTGTTTCAG 

AAGGAGTAATTTTTGCAGGGAAGTTGGTGGGAGGGGGGCATAGTTAGA^ 
TTCAGAGATTTCAGTACAATCTTGGGTAATTTATGGCTGGCCC 
TCCCTGTCCAAGATCCTTGTCCCCCTGTCTCCATGCCATACT 
GCCATTTTAGTAATGCTGTGTCAGGTTGGGAAGAAACCGATCAGAGGCAT^ 

4 5 AGTGCCCTCCCTG TTGTG ACCTATGCTAGAATCTATGTTTTAGTCTCATCTTCTAAAGAAT AAATGGCA C 

TTCAACCTGGTGAAAATGATCCAAATGAATAAATATTTAGGGATTTGTGAGGTCTTTGAAAGAA 
GATTTTTAATCAACTAAAACTTGCTTTCTAGTCAAACTGAGTAAGAAAGC^ 
TTTTTGCTATAAAGTTTTTCAGGCTCCCTTTCTTTTCAGTAAAGCTGGGT 
AAATAGTCTACTGCCCCTTTTTCTTAAAGACAGTATATGCCTTCAAAAATGTTA 

5 0 AAAAGTGTGTAATGAACATCCATACACCTATCAACCAGATTAAACAATTATATTT^ 

ATTTAAAAATATAGATAATGTTTTAATGTATTTAAGTATGTTTAATGTGTAAATTAATC 
TTTATAGTATACTATGAACATTTTCACTCGTAAACACTTGAGTGTCATGTACCC A TTT 
TGTGACTGACTGGCAGTCAGGTCTACAGAGGAGCTTCCCTCCCTGGGAGTAGGAGGGCT 
CCCCCTTCCTCCCTGGTATGTCAGAGCCCATGCCTCTACTTGCTCTATGCTGTTGTAAAACAAG^ 

5 5 TTATAGGACTGCTCTGTACCAAAGGCCTTTACTGGGGGAATGTGAGAAGAAGAGGG 

GATTTGATCATGCAGGCAGTTAGGCCTTTATACTATAAAACTTCATCACTCC^ 

CT AG TTGAAAAGAATTCTG AAAT ATACT CTTATTGG CAAT AG AGTT CT ATGACAAAG TGGAT GG CT AGGCTCAGAAATC AG AG AC C 

TGTGAAAAATATTCTGTTTCTGCCCTTTTT^CGGGGGTGGGGTG 

TGAGGATTTTAAAAAATACCTGAAATAAAGAC!ATATCTACATTCATAAACG 

6 0 AGGCTAAAATGTCAAACATTAGCCTrGTAGCAAATGATGGAGAAATCAAGGCAGAACTTCGGATCT 

TTGGCTCCCCTCTAACTAGGTATGTGCCCTTGGGGAAGGAACTGAATCCTTCAGTTCCTTA 
ATAACAATAGCTATCTCAAAGTGTTAGTTTCTGAAATAAA 

TAAGCACTCCATAAATATGGTAGAATGTTTTTATTGATATTGATTGAAAGGAAATGA 
CTAAGGAGTACAAATAACAAGCTAGGTATAGTAGAGTGTGTGTGTTGCT 

6 5 AAGTTGCATAGAGTTGGAGTGC(IAAGTAATTGCAAAAACATATTGG 

TGTTTTAGGGAAATTTGTAAAAAC!ATGATAATAATTCTGTCCAAATAAT 
TTTTGTTGCCrCTCATCATCTCATCAGGAGGATTTATTTT 

GGGTGCGCTGGTTCCTGAGCCTAGGGTTGTGTTGAGTTAGAATGTGAAGGAAAAAGGAGGGAAGGA 
NNNNNNNNNNN ^^ 

7 0 CTTAC^CCTGGGTTCAAATCTAGGC^TITCCATTGACCAACTGTAT 

TCTTCATTGTAAAACAGGGACAGCTCCAGCATCCTAGGTTGAAACAAGGTGATACAAA 
ATCAAGCCTCTGGACCCTGCTCTCCTACTCTCATGCACrAATG 
GGGCAAGTaSTTCTAAGCAC^GTACTGTGGGAGCCTGTTAAGTATGAATACT 
TAATCATCCAGCTTGTCATGGACAAGTTCAAGAAACTCCT 
7 5 GG GAAGGAAACTGAATAGG T AAAG CACTTTG CCTATTTCTGTTTTCAAAG CACGTG TG TAT AGGAG AAAGG AGAAGGG AGAATGGT 
G AAT AT G T ACACTTTTCT A CTTGGT CCTTT GGG CAAAC C C AT CTG AAGGT CTGACAAAATT ACAGG AG C AGAAAAACTTTCCGTC A
CATACT CCTG GTG CATACAC CCTCAGGGACT AAG TGGGAGCTTTTG ACCAGT AG CAAAAAT CTAG TTC CTT AAACCCTG CATT ATC 
AGAAGCCAGAAGCTCCAAGCCCCAGCATTGCTGAAGAGCAGTTTGGGCATG^ 
TCCAAGTTGGTTGAAAAAGATAGACTACATATTAAGGAGAGAAATTCTAGrrCCAGGAGAGGT 
5 GAATCCCAGCTTTGAGAAGAGCAAGGAACTTTAGAGGGCCCCTAATCCAGTAGTT^ 

AAGGTGATTTATAACCTATGATGCCCAGTTCGTGTGTCCAGAAAGCCTGATTAAATTG^ 
TTTTTTAAAGCTCCCAAAGTGATTCTAAOIGGCA 
GGAATGGGGCTTGTCCAAGGTCACATGCATGATTAATTTTAGAGATT^ 
GATTCAATAGATATCAACAGACTGAGGTATTCX5TATGTATCATTATTTA 
10 ATTTTTTG AGT ACTGATG CCTTTTTTCTTG AATTTTG AG ATTTGAGTT CAGA^ 

CAGCAGTTCATGGTCTATATATTTACCTGGATATTAGAATATGGACCAAATATTAATGT^ 
TAGTGGTTTAAGCTGTTAACCTAGTGGTTTCAGCTGTTAACAGCTATCAAAACAG^ 

TAGGGACTTTTTGGTAGG TGG TCTC CACAGCTGT AGAAACTTGTGGGCAGAGG AGATT CATTTG TTTGTTCAAT AAAT ATTGTTGA 

ATGCCTTTGGTGTGCT AGGCACT AATCTGCAGG CTGGGCAATCTG CAT CAC AATG CTT ATCATTCAAATG CAAG AAAGACAAC AT A 
1 5 TATGGATGGGACCAGTCTTCCAGATCCTTTCCAT 

TTATTTAGGGAGTGTTTCATCITCACGAAATTTTGCT 

AAATAGCCAGCCGGGTGTGGTGGCTCATGCCTCTAATCCCAGCACTT^ 

GAGACCATCCTGGCCAACATGGTGAAACCCOSTCTCTACTAAAAATACAAAA^ 

CAGCTACTCGGGAGGCTGAGGCAGGAGAATTGCrTGAACCCGGGAGG 
2 0 AGCCTGGCGACAGTGAGACTCTGTCGCAAAAAAGAAAAAAAA^^ 

GGGCCAATAGGATTTCTTT C CAATGT CTTGATCTGAAAAGAAATGATTC ATTGGTTTTT AATGA^ 

TGTGCCTCIAAAAGCTCAGGGCrrGACAAGTCTCCT 

TGATGATAGGAATAAGGGCTTCGTTTATTTTCACTGCATGCTCT 

AT ATTTCAG AGGCAAGG TTTTGG C CTG CTTT AAAAAAAAAAAATGTTTGCAAG TACAATT AAAAACCAG T AT AAGGG ACAGGG G TG 

2 5 GGATGAAAACCTGTCTCTAAGATTACGAAGCCTGCGTTATTTC 

ATTTTCATTGATTATGTTTCTATTATATATACTGTAGACTCTATATTCAC^ 

AGAAAATG AGTATG AC CTCTAG CCTG AATTTCATT AT AAAAG ATTT AAAAATAT ACATTTT A 

ATTATGAAGTCTTTGATTCTTTAGATGTTTTCATCCATAACCCAAGA^ 

ATCATCT ATT ATT ACTT AAAAG TTGG AG TT ACAT AT AG AT AT AT AGAT AT AT AG AG AGTT AT AG AT AT ATGTAATTTTTTTT CATT 

3 0 CCAGTCCCCCACCCGAGCAAC^CCCATATTTTATGTAACTTAA 

AATGAGCTGTGCAGGCCTAGTACTTTCATATAAGATAAAGGCTCTTACCTATTCTCTGAATCT 

GAAAGAAAAGGATTTGTGGATGCATCAAATTATCTGCATTGCCCTTAATGTTTTATC 

CCCAACCCTGGCCCCCGACrTGAATTTCATAGAACACAAAACAAAGCGCAACTCTTCCT 

TCTTG C CTT ACCACTTCAGAATGGAC CACCATCAGG CATCTGTTGTC CATTG CACACAT CAAGT AAACAG ATT AAACTG CCCGAGC 

3 5 CAGGGGGTCGCATCAGGCCTGTCCTCCCTTTGAGGAGCTTAAACCTCTCGTGTTATA^ 

CCTGAGGTAAGGAGGCAGTCACCGTTTTGCATC^GAACATCACTATCCGGTT 

AATCTCACATTCTTTGTAT CACAGCCCTGAG CTTCACT ACCATTG AGCTAAAT ATT ACC AG CTGTTGCT ACTGGCAAGAGAG AAGC 

AAAAAACAAATTGAAAGGAGAAAGAGGAGGGGTGGGGGAGTGGACAGAAAAGAATAACATG 

CAACATATCTTCTAGCTCCCAGACCTGAGTTCTGGGCTCCAGTATTGAAGACTC 

4 0 TTCCTGTCTTAAAAGAATAAGACTAGACACAAACTTCTAAGAGGGCCTTCCCT 

TAATTTCTATTTTTATTAAAATGCTATGTTTTTATGATAAGACTTTAA 
ATCTGCCAGCAGAACATCAGTTAATCACATAGAACTACTCTG^CTCTGTGACCAACCT 
CCAGATAGTTTGGAAGAAAAACAAGTTAAAAGCTATTGAATGCTTTCTAGATGCTGGGCATT 
TAGTTCTCAGAGAAGCTCTGTGAACGTCATTTTATATCCTGACTTTTATAGCTCAGG 

4 5 CCTTG TTATT AACT AGTAAATG AGAGAAT AAGGATTG AAAT C C AG TT CTT ATG A1TTCAAAG C C CAAATT CTTTTCAGGCTGG CC A 

TATCTATTTGCAAGTCTCATTCCTTATGGTGGCTTTAGTTGTGCTTTACCT 
CCCATTTTTCTTGTATATCCTCAAATAAAAATTCCATTGCTAAGGTGAGTAGCCTATTCT 
ACTAACCTTGCCTCAAATTATATCCCAGCTTTTGGATACTTTAAACTTTGGT^ 
AAACAAACTTTTGAAAACTGTGATAATTCTAACATATATGAAGTATT^ 
50 ATGATCCCTTTCCAATTTACA.TACGAGGGACTGAAA.CTCGGAGAGGTTCAGGCACTTGTCCAGTGTTACT 
GATCCTGGGTGTAGCTCTGGTATACTTCTGCTCAGTTTGCTTTTGATATCT 
ATAGAAGGATGATATTTCCTGCTITTGCACACATTAATAAACCCACTTTCTGGA 

T AC AAGT ACAAAG TT AGGT AAAAAG AACAAG AAAG AAT CTGTGACG AG T ATG ATG ACT ACAT CCTAAAG ACT CT AATG C CAAAGGT 
ATTTGATTGTTTATGCTTTATATTCATTATCTGTCCTATTAGTGTGGATGATAATGAAAACTATAGATTG 

5 5 TGAAGAAAGAATCTCTAGTAGGTTATGATTGCCCTTTGGGAAGGTTAGT^ 

ACTGTTTAGTTTCTCTCCTCTCCTTTCCTAATCTTTCTTATCTCTGTTCTTCCCTT 

GTAATCTGGAGGTCTGGCAAGGACAGACCCTGGGAAAAATAATAATGAACACATTTGAAGGACAGCAC<5ATGGACTCCT 
CTGTGTTAGGGGCAGCATTCATAGAGGCAGAATCTGTCTTCTGACAGAGCACCCAGCTTCACACAGTGAA^ 

CTCAGAT ATCAACT AT AAGGTTTCG T CACT AAG TT CATTTT ATG CTG CT AAAACAG AAT ACCTG AGATTG AG T AATTTACAAAG AA 

6 0 CCATAGGTTATTTGGCTTATGGTTCTGGAAGCTGTGAAGTGCAAGAGCAGGGTG 

GCTATCACATGATGAGCAACTGTGTAAGACAGTCAGAAAATGGAGGCTAAGAGTTCATCCTTTTTATTAGGACCCACCCCTGTGAT 
GAT CCACT CCATGGAATGGC CATGCAACT ATGGTCTTAATCC ACTCTTG AG AG TGG AG CC CT C ATGA CCT AAT CT CCTCTT AAAGG 
CCCCATCTTTTAATACTGCTACAGTGACAATTAAGTTTCCAATACAT^ 

ACCCCAAACACTCAGGTCCTCC^GTGAGAGGGGAAGGGAGGGGGACATC^GTGATCAAGGCAGTAGAATTACT 

6 5 AATTTAAAAATCTGAGCATTTTTTATTCTGAAATATAAAAGTAATCAATTTTAAGAAAGC^ 

T CACTG ATCAT AAGTT CTCACATTT ACATG G CAAGGTG AGTTTT CTG AAC CACAAC AACCC CAGGATG ACTTT AG^ 

AAC CACAGAATTATAGTCCATTG CACAGCAAGCATTCACTTACTCTATGTTGAATTAGAAAGGAAAGACATTTTT CAAAGAACATG 

GTGTTACAGAACTGTGAGAGATGTTATCAAGCATTTGATCTAACCATCCCATT^ 

GGATATAAATAGCTATTAGCAGTACTGGCATAGGATTCATGTTTCCTAACTCTCAGTT^ 

7 0 T AATCCTTTGG ATT AG T ATCTGAGTTGTTG TT AAATAAG CAAAT CTTG TGGGTTCAAGT ATT C AG CCAGG TCATTCCAT AATTTTT 

CTCATTACCTTTCTTAC^AGCGTATATAGTGACTAATTGAATGAGCCGGCTGATG 

TTCTTGCGAAGGGAGTATAGTTTGGAGGTTAAGGGCACAGATTCTC^ 

TTATAATTTTGGACAAATTCACTTTTCTCTGCCTCTGT^ 

CAATTG AG ARTTAGAT AACATATG T AGAAGTG CTTTGAACAAT AACG AATG CAT ACTAG CT AACATTTATTG AAAATCAACCAT AA 
7 5 TTATAATATTCAATCTCTACAATAATAATTCATAATCTGCTTATATTCA^ 









GATGTCACCACTAATGCAGTACAAGTAGGAGATACTTT^ 

G TT TC TT TTGGGGTGGATAGATAACTACATAGGG TTTTTTTTTTI T ' 1 ITTr CCTGAAATAGTGCTGTGCTG AAAAATTATATGCT A 

TTTCTATGTAAGGAAATCTTATAAAAGTCTTAGCAGAAACACTTTATG 

AAAAACATAATATTACTAATCCCGCCCTGACTGGCCTAGTTTCATTTTTACA 
5 TTTTCAGGGTTT CTCACAT AGT AACTGAAATTCAITI"!' CACTTT AT ATTG CATG T ACTTTTG CTT CCTGTG TTG TGAT CAT AGATT 

TTCTTCTTTAATGCTGTAACCTGTCTTATTACCACATTTTACT 

GTATGACAAGAGAAACCTCTTGAAGGAAACCCTTGTTT^ 

TGGGTACTAGCTGCCATTTTGGGTGCTAGTTGCCAGTTTGA 

TTCTTTCCTGAGTGTAAGATCCAAATACCTATCAAAGTTGTGAGTrT^ 
1 0 GATCCTGAATGTGTGTGTTTACCACATGTCCTGTAAGGCACATATTGC^^ 

AGGAAGCACCACTCAGCGCTATTAAAAATCAGAAGCTATTCTGGAAGTGAATG 

TAAATATGAGTCACTCCCAGTTTTACACTGTAGATAAAACACTCTGAAAGGGA 

TGCTGAGTAGATGTGTTTGTGTCTCCCTTCCTCTCAACTTT^ 

TTAAGCAAACACAGGCAGCAAAAACACATCAGAATGGAAAG 
1 5 TTAAATAGGAGATAATTCTTTGACAGATAATTAGCTTGTC 

TTGCATATTATGTAAAAATCTATTCATGTACACTTACTAGGTGAATCTCIT 

TCTGCTCCAGTGAATAGAAAGAACTGGAGGAATGCTGTGGTTATGGAGTCCAGGAAGCCCT 

AATGTGGT AT ATCATT ATT AAAAGACACAG AG ACTT CAT ATT AAT AT ATGCATTT A CTTTT AGG C 
CTGAGAATGAGATGCTACAAAGTTGCCTAAATAATTACCTTGAA^ 
2 0 GAGCTTTTTCCAGGGAGTGGAGTGAGTGATTGGCTGGGTGCAGCT 

GGCACAGAACAGTTGCTCAATGACTCAGTGAGTCAAAGCTCCTCTCT 

TCCTTCAGGCATCCCCATGTATCTGCGCACTGCTTTTGGGGTGTCTT^ 

TCATATTAAAAOSTGC^CTGGGAGATGTTGGAGTGCTGAA^ 

CAAGTTGAAGTG T TTl"! 1"! 1 CCTGAAAAT AGAGTGG CT ATTTG ATGTT CC ATGGG AG CC GAATGACTTCCTGGTGT AGTGTGGCTT 

2 5 ATAGCACCGTAGATTTTAGTCATTGGTCACTACCCCTCCIATGTGCA^ 

AACATTTGT AG AATGGGAACAATT ATGAGACTGAG CATT AACTCATTT AATGGGG AT AG CAAT AGTATT AAAT CT AT CTG CAG CGT 

CCAATATAGTAGCCTTGCTGCAAGCCAAACGTGGCCATTGAGCACTTGAAATGGGGCCAGTC 

AAAATATATACCRGATTTGAAGACTTTATATAGAAAATAATGTAAAATATTTCATT 

G ATATTTGCACATGTT AAGAAAAAT AT ATT AG C^tfVT ATT TT AC CTGTTTGTTTTT ACTTTCTTGAATG CAGCTACTAG AAAAAAAA 
30 AATGCTTGTTATTATAGTAGCCTGAATTCCATGTCTATCAAATGTTGCCGATCTC 
TTTTTAGATGAGGCAATTGAGGCTGAGAGAGGTTAGGTAAGTTC 

TTTCTAAACCCTTGGAACTCCCCGAGTGTTAGGAGTGTCTTTGTTATGCATGATC 
ATGCGATCACTCAGGATAAGAGTTTATCATCAGAAAGACATCCACATGATTAGAAGGTCA^ 

3 5 AGCAGAAGAGAGTGGGACTGAAGATTGAGTTCAATCACAAGGCCAATAATTTAATTATGCCT 

CTGGACACCAAAGCTCAGTTAAACTTCCTAGTGAACACATTG 
CTAGAAGCTCCATATGCAGGACCCTCCCAGATATTGCCCTGTATGTC 
AAAACTATCATCATAAAGATAGGGCTTGTGTTCTGTGAGTCATTCTAGTGAATTATCAAG 
TTTATAACCAGTTGGCTAGAAGTGTGGGTGGCCTGG^AACCCCT 

4 0 AGACTGTGGCCTTAACTTGGGGAGTCTGCACTGACTCTGCTTAGTC^ 

CTTTTATTTGGATGTATGTATTAGGAGAACAGCAGCTGAGAGAGCTAAGTGCT^ 
TCCTTGGTGACAATGGCCATTTGCATTCTAGACTTGGAAA^ 
GGAGGCAGGCAGGCCTCTGAGACCTTCATACTCAAAACCCTTCACGCTGTAA 
ATGTTAGCGGTGGTGACAAACATATGTGTTGGCCCCTCITTGACACCAGGGATTCT 

4 5 TGGGAAGCTGACACAGGGTGATACTATGGGTAAGGAGTATTTCGATTTTTTGTTC 

ACGCTTTTGTGGTATGTCCTATTACATTTCATAAACIAAGGTATTTAGCrTO 
TTTTAAGCCAAGATCCATGTGTTTTATTTATTTTGGTTTAGTTTCT 
CCTGGTATCCACTACTCT^CTGTTTCCTGCTCAGGCCTGGCCCATGTTCT 
GACTCCTATGAGTATCTCTTGGGGCCACCTTAGAATGCAGATTTCTA^ 

5 0 GTTTCCGTCACCAGTGAAAACTAGGAATAAAACAAGATTTTGGAATGTATAAAAGAAGCCT 

TGTTTTTTCCrAACATGGGCCAGCCCTGGGCACTTTTCATC^ 

CTAGGG ACAGTT*CACACCAGATAACT ATGAGGCAGTGTTCTCTG CCACTCAT^ 

GGGGAGAGCAAATTCATCAGAACCAACTGAAGAGGTACAGAACATGAGGCATTCACTTCT 

CCAGAATACCAG AGGCTTCAAGG CCC1TTITCTACCTTTCAAGAGCGTTAT AG CGTGCTATTGCTCTTTGAGAGATCTAGGGCTGT 

5 5 AACAGCCCTTATTTTTAGAAAAGAGAGAACTCCTGTGTTTGAAGAAATAGCT 

AAGTTCAAGTAACTTGCCCAAAGGTGCACAGTTATTAAGAGGGATAAGAGGAACTCCAAAT^ 

TCCCTTCAAAAGGAGATAGAAGAAACAGGGCCACCCGGGGTTGGTGAAGATACTGTCAGAATCATCT 

AAGTGAGAGGGTCAGCAAGCTCTTGAGAGGGCCCACCTAGATCTTAGCTTGTGTATACCCTATACTGAGCACTGTGC 

ATATTT AGTG CTTAATAAATGTATATTAAACTG AATTGAGGG CCCCTCCTG CTGCTCAGTG ACATCC CTCAGGGTCAGTTTCCCTC 

6 0 CTGAGCCTTCTTTTCCCTITTCATGACTACCTTCCATG 

GGAGAGAGGAGAAAGGGGTAAGAGGCCACCAAGTGTGATTTGCCTTGGATTTCAGCCCT^ 
TAAGGGAGGCCCAGCACCCCTGTAAATTGCCTCTTCCTCATAAGTC^ 
TAAATCCTTGTGCTGAATTCACCTGGCGAGTCCTGCACCTGACT 
GTGTGTTCTGAAAATGTTTCITCTTCCACT 

6 5 TGGGGACT A lTriTTT CAATCAAATCT CG CAAATC CAT AG CAT AATG CTGAACAGCAT AT AAT AGATGCTTTACAATAAACTG GTC 

CTATCTCATCTCATTCCAGTGTTCAATGAATCTAAAAATATTTAATGAGCA 
ACTTAGAATTTCTATTAATAACTACAATGTAGTAGCTGATT^ 

TCCATTTCAGGGTACAAAGAGGTGAAACAG CTTTGTCAATATCATACTGGAAAGTTGTGGAGACAGT CTGGCCCTAGTGCACCAAA 
CCATTACACTCTATTTCCTCTAAGAGAAAGAACATCT 

7 0 ATAAATAC^TAACTTCAGATAGCGATAAGTGTTCTGAAGACAATGAAACTCTTAA 

AGGGTAGCTGTAAATTGAATGTCAGGGAAGACTTTTCTGAGAACAC^ 

AAAATAAATTTTTATCCTACCATACTTTCX3GAACACCTCT 

AGCAGAATGTCITGCAACTTAATTTTGACTCT 

C CACCITCC CCACACTGTTTAT A7\JVTCACT ATTT CTG CLAACCCCCTC CCTGAGT CCCATT AATTTTCT AGG ACAGTT CG CAGAACT 
7 5 CAGGGAAATGTGTTTACACATTTATTATAAAAGATATTAC2AAAGAATATA 
TGGGGGCTTCATTACATAGGCATGATTAATTACAT

GGTTGGGGGATGCGGCTGAATGTCCCAATCCTCTAATTAAGCTGTTTTAATTCT^ 
GGCCCCCAGCCACTAGTCATCTCATTGGCATTCAGAAAACACTCA 
GAACCAGGGGCAGAGATCAAATATATATTTTTTCATATCACAGTATCACACT 
5 AACAGCAAATGCAAAGGCCTGGAAGTGGGAAGO^GAGCACCCA 

TTCATTCATGTCATATCGTACCATATCATATCATATATATGGTTTATCAAGGCTATATTGAAGTTATAGAAC^ 

AAACTTAAAGATTACCCTAAGTCCTGATTGTATAGGGGTGGCAAAAGAGGTCA 

AGGAWSGGCTGCCCTCTGAGTCCrrTGCTGTCAGGGCTTTTC 

AGTCTGATGGTGCCAGATACGGGATGCATCCCAGGTCAGCGCTTCATTCTGCGTGCCCC^ 
1 0 TCCTTCACTTAGCCTCAAGTCCATGTTGACAATATGTAATTCCAGCCCACCT 
TTTCCCCCTAATCTCAGAAGGTGGTCACCCAGGCAGG^^ 

GTGCATGGAGACCTCA.CCTTGCTTCAGGTGGACCTGTCAAAGCTCAGCCCTCATGGT 
CATGTTTAAGGAAAGCCCATGCCTTTTCTTTCTGCAAGTTGATCC^ 

AT CCAG CTCAGTGACTTG AGG AAGTTGTTG CT ATG CCTTTTGTG AAGTG AAAACAT CGGTTC^VGG AAAAC AATTCAAA^ 
1 5 CTGACATTTGATCCTTTCCCCTACCACTTCCTGAATTCCCTAAGCCAGA^ 

AT ACATACTGTTTGTTGGACATGGGGGTTrT ATT CAAAG CAAACTCAG ATG CAGCATTTTTGGCTGGGCACTTCCAT ACATGATTT 

TAGCAATGGAACCCTATCCAGGCnTCACCTCTGTAACTTTGTCTT 

AGAGGACCTTTTGTGAATGTAACTTCCAATGGGAATGTTT 

GAAGATTTATTTAATTGGACTTTACTTAAATCCCATTGGGAA^ 
2 0 AAGATAAATGTTGGTCCTCGGGGTGGATGGTTGAGCAATAGTTTTCCTTTTGTACAGTGG^ 

GCTTTCCTCTGTTTGTTTGTTTAGCAATGCTTCATTATT^ 

GACTTAATGCAGAATATCTGGTTGAGCTCTCGGGAGAAATATGGCTTTGTGT^ 

TCTGCAGCTTCCCAGGATGATCTCATXX3ATTTTTCCCCCTCCTTTAAT 

GTGAGCAAAAAAGGTGGAGCCCTAACATGGTATAGAGTAGCCTCCCCAGTACGCAGAATGTCTCC^ 

2 5 AGAAGGATGCCACATGGAGOTCATGTGCCAGTGCTGATCTGAAAGGGCATATATCT 

GTGGGCTTCTGT^GTGCATGTAGCTGGTTCATGGAACTAAGGTTCTACAAGG 

CCTTTCAGCACACCCCCACCCCCTGCTGCCCCCGCCCCCACCCCCACCACCCTGCCACCCCC^T^ 

CTTGGCTTTAAGGTATGACACCAAGTTGGGTTGGCTCTCAGGGGCT 

AAAGTAAGTATTATACGTTGTATTTCTAAAAGTAAGCTACACTGGGAATTACAAGTGGCACT 

3 0 AAATGCCGTAGACTGGATAGCITAAACAACAGAAAGTTATTTTCTTATAGT^ 




AAAGGAAAGGGTATGAATTTTGCCCAGGATTTGGAGTGTATGTAAATTACACCAT^ 

3 5 TCTCTCTGTTTCTTTTCCCCTCCCCCCTCTTTTCTTTTCAAACACATATT 

TCCCACTGAACTCAGAATAAAACCTGCCTAAGAGGCCCTGTGAGGAGGGTCCCTGCCrCCCTTC 
CACCCGCCrCTTGCCCAGTCATTGTGCCrCCCCACATTTGGCTCC 

AGTGTGGTGTTCCCTCCCTATGTTCTCCTGGATTCCGTCCCCrCTCATACCTCAGAGAGGCTGT 
TCCCCTCTCATAGCACCCAGGAACTGGACAGCGTCra 

4 0 TGGTTGTCATATGTTTCCTCCTGATGGAGTGGGGAGGGGCAGGGAAGGGGCCTC^ 

AATG1TTGGGGAGGGTTTGATCTTAAATACAAGAGATTATAACAGTGAAAAATGTGTTAATGTAACT 

ACCAGAAAGAAAACTGAACTTGCAAGTTTTCAG^ 

CCCAGACTGGATCTTGGTAGGCACCCTTCTTAGCCATTCACTTGATTGAAGCCC^ 

4 5 TCTTATTTTCTTTAGTTGGCATTTTTGTTAGTGAGAATGGAGTTCAATAGAAGTAGA 

TACACAGGCTGGGAGGAAGGTCTGTGGATGGAGGGGACAACTGGAACCACCAGAAACATCT 
GGATTAAGGGAGAGCAGAAGGCACTTCCCTGCCCCTCTrGGCAGAGGGACrGACTTC 
GTGGTTCTTACTGAAACCTTTGTGAGACAGAATGGGGAGGGTGAGT^ 
GTTTGGCTGGCAGGGATGAGTGCTCAATCACTCTTCCCAGCCCACCTTTGAC^^ 

5 0 TTCGTTTGAAATTCAAGCATCAAAATGTATTCCAGGTAGAAGTCT 

GAGAGACAG AG AAGG TGT AGGG AG AGGGAGAGAAGGGT AG AAG AGGGG AGAT GG CAGGG AC7VGGG AGAG AAG AGGG AGAG<X3G AGG 
GGAGGGGAGGGAGGAGGGATCAGGGTAGAGGAGCAAGAGTTGGGGAGGAGAGGTGAGGGAGAGAAGAGTGGAGAAGGGGGCA 

AGGGG CTTCCTGG CT CT G GGAACT CTTGAG ATCTTAACAGTT CAAATC CTG CT CACATAAAAGTCAT CTG CTCATT AAAAT AACAC 

5 5 AAAGCTGTTTAATTGTTGTCTTTGTGGAAAATTACATGACTGTCAT 

TAAATGAGGTAACAG ATTTAAAG AAACAGGCTGT ATTTACATACTT crGAGGTATCCTCTCAAAACAATCTGCCTGTGTT ATT CAA 
ATACACACAAATTAAACTTGTGGTAGCTCAAACATTAGTTATTCAT^ 

ACTGCACCTCCCTTCAGGGGGTAGGAACCAGAGGCTGACTTTTCTGTGTGG ACTG AG CTACTCC AGCCGG CGG AAATCCAATCCCC 
AGCTCTTTTCTATACCAATTCTGCCCTGAGGCAACTTCTGAGC 

6 0 CTG AC AAAAACTTTC CAGGG TCAG CATTTCTTGG AAGGC CTC C AATT AAACTTT ATTTT CAATTTT C AT AAAT CACAATT CCT AAA 

TCIATTGGATGTTTCTGACTCAGTTTACTTTTTTGAGGTA 

CAAAG GTTTTATCAAAC AAGG AAGG AAAC3AGT AT ATTCGAAAAT AT AAGAAAJ^ 

GACTGGAAAATGTAGAAGTATATGGAACAAAAAATGGAGAAGAAAGCACACTGCT 

GACTGTTTTAGTTCrTGGTAAGTGTCATTGTCAAGTATTAGCTTCTTTCTGGAATGCT 

6 5 GAATGGCACATCATCTGAGAGAAACAATAGTAACTATTGAAGTGGCTACCTTC 

AC CAAGAAAATTT AGGAC^T AG ACACACAAGAGGAGTTT AGAAGCIAG AGGT CAAAAGAAAGGG AAAGG AAAA CAG CT C 

GCTCTTTAGTGAGAGAGAGGTGACTTCCCAGAGGAAAAAGGCCGACTGGTC 

GG CAGTATCTGATTTACATAGGGCTCACAGATTGGTTTGATCAGGTGTGAGGTTTACACAGCACT 

CCCTAATCTTATTATGCAAATGAACTCTCCCCTTGCCTGGCGCCATCTTGTCTGCT 

7 0 GAAGATGGAGCCGCCATTTTGAACATGTCTAGT^ 

GTCTGTGTCTGCAGCTCAACTTTACAGGCTGCTCTCTGTTGA 
GGACnTCTGTACCCTCACTATCTGCCTAAGTAATTTCTTCTTAACTCCT 
CTCAAGCAATGTCTATTTCATGTAAGCCTCAGTATTCTATATTATGTTGTT^ 
CTCAAAATTCAGGCAAATTCTGGTATCAAC Vm 
7 5 CTAAATAAAAATGAACAACTTTTATTATTGGAAGATAGTTTTGG^ 
AGGAAAACACCAGGAAGATCTTTTTATTTTATTAAGTTTTTC

TTTCCCTGTAGCTCCTATTGGTATCTAACATATCAGCAGTCACCTGTCTTCATGTGACTTC 

TTTAAATAAGACTAGTCCTGCTCTCCAAGGACCTAAATTCTATTAAAGCrC^ 

TAAAAGAATATGTATTAATAATGCAGTTATAGTTGTAGAATCAGGCTGCATGTATA 

TGTACGTCCATTTCAAAAATTTCTAGAAGTATGCATATATTTTTAG^ 
TAAAAAATGTCTTTTTTTTCAAGGCTTCTGAAAATCGTAGACATAAAGAACT 



TAGCAGAGCAGAGCTACAGTGGGCTGTCCTCATGGCTCCTGCTTTGTCCT 
1 0 GCAGGATGGTCTCCTTTTCCTACTGCTCTGAAGCAACTGTACCAGCATCAA 
TAGTCTCTGGTCATCnTCCTGCATCTGACTACCTCCAGTGCT 
GGGTTTGAATGCCTTCTTAAAATACCAC^GATGGCTATTCT 

C CT ATTTTATAGTT ATGGAAAGAT AGCCCAGAAT ACTTO^AT AACT^ CAAAAAGTCACACACTTGG CATGTGG CAGAC CCACTAT 
TCTAAATCCAAGGCAAAGCT Cr 'l T T' r ' rri " A 'T'i' , r i Tr I l ' U w lTl ' rr GCTGCTTTATATATCATCCACTGT Cl 'T ll 'CTAGAGTTGGCTT 
1 5 TTAAGATATATCTGTAGATAACTGTGTCTCCTTCACACAGGAAAAACCCATGAATGGAGGGA 
CCTAAAAGACTTAATAATATAGCAAGCCAAATAATGTGGATGGGTACTCAA 
TCTGAAGTTTTATGTATATGTGTGTATTTTAAGCTTGTGATACT 

AGCCAGAC CACACTGGTTTGAGGGAAG CAGT CAAATG ACGTG AT C AT AAG AAGACTT CTCTCTG ACTCAT AG TCAGC CT CCT AAAT 
CTCTTTCTTCCCCTGAGTAAGATAAATACTTCTTTATCTCTTGTTT 
2 0 TCTCTGCCTCTGTTTGATAGGGCCTCCCTTTACTCTTCTTGGGAAAGAAATA 
AAGTAATACTTCCTTGGGTATTTTACAGTAGGAATGATTTTCATCT^ 
CTGTAAAATACCCAATCACATCTATGGAGTATAAAGAACATTTCATAGAAAACAAGAA^ 

GTACCCACCACCACCCCACATGTTTCTC CTGTT AAGAT AGGTGAT ATTCTG C C CTG AAACTT ATG AAGTGACT CTGACCCTGT CTT 
TGAAGAAGATTTCAGCCTTGGCAGCCTGAGATCAGCTCCAGTTCATC 

2 5 CTTGAGTGGCrrCTGATGTTCTGTTTCCATGGAGTATA^ 

AG CAATGTCTCTCCCACTTATAATAAAACACCTAACTCrGATGTAGCATTTTATCC^ 

TATATTACAAGTCACAGATATGTAAACTGAGATGCAGAGACATCAATAACTTGCTCAAGGTTACAAGGTA 

AGTTTTAACTCAAGCAATCTGGCTCCAGTCTATGATCTTAACCACT 

3 0 CC CTGG TCTTTGG ACATT ACTTCTGTTTGCTG AC CTT CAGTG AG AGTGTT CCTT AAAT J rrCTGGAAAC AGCTCAU"l"l"i CT AG CTT A 

GCATAAGACCAAAAAGGGCTGTCCACCITTAGAGTATAAAAATCTCr^ 
CACAACCCAGCGCATTGCTTCTGAATCACATTACAGAGGACTCTT 

GAAATGCATCTATAAAGAGAAGTTTCCCACTAAAACTAGAATCCATTGCTTCAGTAGGACTGTCA 
AGAAACTGGCAGGCAGTATTTCCACATATAGTTCTTC^ 

3 5 CCATACTCATTTTGGCAGAGAACCTACTTTTGACATAAGCTTTGA^ 

CCCAGCATTGTTGCTCATGGCTACAATATGTTCXSC^CTAATAACTGCAGGCT 

AAATTAGAGG AAAGGAAGAAGG AGGGAAG AACAGTTTT ATTTT CAAAAT C CAATT CTG TGGAAAAACCAG AAAT G CAAAGACAGT G 
TTTGTTAGGAAAGCTAGGACCAACAAAGTGAAATCAGATATATT^ 
AGTATGTTTTGCAAACTTTAAAACTGCTCGGATACCTTGGTTCCT 
40 GGAACATTAGTrrACTTATATAGAGTCTTTATATTAAGCAGA' r T'l r TTT TTT rT T AGATCITTAAAATGGTAGTTTCCCAAAGTGA 

TGAAAAGAAAAAGCACTTACAATGTGCTGGACATTGTTTTAAGGTACT^ 
TAC^TACTATTGTTATCCCTGTTTTGTATACAGGGAAACTGAGGA^ 
TTTGGCTGTGCTGGGGATTCAAACCCAAGAAGTCCAGCCT 
GACTATGCGTGTGCATAGATCTGTTGGAATTTAGAACAAAC^TGATGATACT 

4 5 TCGTGTATCAAAACCTTTACATAGGCTTCCTGAACTATGArc 

TGAGGCTTAATGAGGGAACCTGACTTGCACAGGGTCTCACAGCTGCTAGGAGG 

TCTGACCTTTGCTCTTAACCTC CATGCTACACTGCTC CTGCTTTGTTTGTTAATTATCATGTTCTAATATTCnTAAATAGTAATAA 

ACZATAACATCATCAGGGTTCCTCATTAAGTGATCAGTCACTTTGGCCTCTG^ 

CTGGAAAAGAAAAACCAGATTAGCTTTACTCTTCCCCCTCCCCTCGA 

5 0 CCTTCCrGCAGACTTCTCTGGAAAAAGCCACCACrGTGAGCTTCCACT 

TTTTAACTGCAGTACAGAGCTGTGCATAAGCCTTATGGAAAACTGATC 

TATTTCTTCAGACCCCTCAAGGAGAATTTCCTT G TT r ACCTTATATGTTTAAATT 

CTCCCACCTCCAAATATTATAAGAACAGAGTTCCTACATGGCTrAAACATACA 

5 5 GAAGGCCCATGCCACCTGTTAGCATTACATCATTGGCT^ 

GTATTGTTTGCACTCAAAGAGCTGATATCATGCCAACTGATGTC^ 
GGGAGCCTCCAGCTCATGTCATTGAGAAAGGAGGTGTAGAGGCAG 
TTTTCTAGCATGTGrrTGGT T TT' nTT TTTT Trr TT^ 

AGAGAGGTTTTTCCCCTGCTTTTAGAGCCTTCTTTTGAATTTGTTTTATTG 

6 0 ACTGCCTTTTTTG7LAGAAAAAAGTGAATTAGGTTCCCA 

AGATTTTAACTCTTCAGAATCCCTTTTTGTGGAGGCTGTGATGATGACAATG 
CrC^TGAAGTTATTTTCCATATTTTCTTTCTTCT 

GAATAAACTATCATATGTAAGGGCAGAGGTTAATTCTAGAAATCAACTGAAATTCCT 
CTAGGGTAGGTTTTGACTTCTTACTTTCAATGAGTAAAGTGAACATAATATT 

6 5 TTAAGGAAAATATGATTTCTCTCITTTACACAGCCAGCTCCGGGTCCAT^ 

AGACTTCCTGTCCTTATTGCAACATAACCGTCTCITGTTCATGCTTTGGGTTTTCTG 
ACAAGCCATCTGTCTTTGAGGGTAGATGTTGTGTTCTO^ 

G AAAG CCAG AGTG TAT CTAGTT CTGTTTGAGT AGGG TT AGAGCT CAGTTGG CACT AAT AATGT CAATCCAG ATTTT ATAGCACTT A 

7 0 CATGTATATTATTATAATTTGAGCTTTGTAATAATCCTTTGGGTTTATCTGTGCATGCCATT 

TTTAAACAATAAACTATGATCCGTAAATTCCCTAATTTAAACAATTGT^ 
TTTTACITAGCITTTAAAAAATTTAATATTAAACCAAAATGTTTT^ 

TACATAATACTCCGTGGAATGAATGTAATAA CnTn 1 1 L 1 1 1 1 ATTTTTGTGTTTTACTTGGCCTTTTTAAAATATATATAAAAC 
AGATAACATTGTAGTGAACATCTCC»TATATCTAGCTAACTATCTT^ 
7 5 ATTGCACTTCTTCAGGGGAAGGTTTGCATTTTAAAAATG^ 
G TGGGAGGCAGAGCAGAG CCG T AAG CTCAAAAG CATGC ATTCACACAT CCAAAAGAGGTTTCTTG ATGGGTTT ATG TG CCCACAAG

CCTTGGAAAGTCGAGGAGTAAGCAACCGCGCTGCTCTCACATTCTGTTCT 

GTGGAGAATATCAAAAAAGATTATAAGAGCAGTCCTTCACACCCCACCCTGGCTTGTCTT^ 

C CTGTGGGCAATCATAT AAAT AAT ATAAATTAAAAT AGTGAATG C CTG CTGAGTT CT ACTGTGTG CCAGGAG ATGTGCT AAGTATT 
5 TT ACACGT ATTCATTTATCTCAGG CT CACAATTGCACT CAAAGGGAG ACACCATCAC C CTCAGTTT ACAGATGGTGGAACTGAAG C 

ACAGAGATAAATGAACTTAAGTAATAGTGTCAGGAA^ 

TGCTTGTAACCACTCTGCTGTATTGGGAGCAAACTTGTCCCTCTGGGT 

TGCTCAGATTCCACCCGAGGAGGTCATCTGTGCLTrr 

CTTTGAGGCAGGTCACATTTGCAGCTGAGGAAGATTGCCT 
1 0 AAAGAGCACATTTCCAAAGTGTTC CATG CAGTGAT ACT CC C CG AAG CTTGT CITGGGTCT CCAAATGG ACTTTT AGTTTTT^^ 

GTTTTG TTTT ATTTTT ATTTTTATTTTT^ CATGAGCTACAATGCAAAC 

TCATTCTAATGGGGAGGTGGCATTCTCTGCCAGCTCTGAAATCTCA^ 

GTAGTTTTAAGCAGAACCATGAAAACCTCTGTGACAACT 

GCCTC^GCTTCCTACCTCAGCCCATCCCCACCA^ 
1 5 TGGClAATGAGCAGGATt!;CTCAG<lACCTGGACACCATGCA 

TCTACCACTATGAGAAAGGCCTGTTCAACTACAAGCGG CTGCCAGCCACTGACAG CAACAAG CAG AACCTG CGG CAG T ACTTTGAA 

GAGGCTrTTGAGTTCATTGGTAACTATCTCCCATAGGG AATTTTCATCCGCTG CCTTCCTT TTCC CTCCAG CTCTTGCTTTGATAT 

CAAGTTCAAGGTTTATTCCTGTGTTGGAAAAGAAAAGCTC 

CTCCTAAGTCATAGCCTGTCGAGTGTATCCAAGCCATGAGGATGGAT^ 
2 0 CTAGAAATCCTC^TCCCTCAAGTTCTTCAGTGAAATAGTGGTAGGGGTTAGCAT 

AGTAAGGAATCGGAGGTO^GAGCTGGTGCTACTGCTACTTGATCTACTA^ 

ATGACACCCCTTCCAAACMACCTATGCCTTCCGGTCAC^ 

TCCTTTGACAGTCTGCCCCAAACTCATTGAACTAGGAAAGATCAGGTTC 

ACAGCACG CTTTG ATCCCCAGAAGGAAGCAACTATTAGCCT AATCCTAAT CTA 

2 5 TAAGACTCAAAATATCGCCGGGCGCGGTGGCTCATGCCTGTAATCCC^^ 

CAGGAGTTTGAGACCAGCCTGACCAACATGGAGAAACTCCATCTCTACTAAA^ 
CCTCTAATCCCAGCTACTC^GGAGGCTGAGGCAGGAGAATCGCTTGAGCCT 

ACTG CACTCCAGCCTG<KKAACAACAGTGAAACTCTGTCTCAAAAA CACCTGC 
AATATAGCAAAACTCAACATTTATTTTTTT 

3 0 ATTACAGCAGGTTGTTATTCCAGGAAGAATACAGTTAAGTC 

CGTCAGCACAGGACCATGGAATATTtSAGTCTGAAAGGAAATTCAGGGTGATC^ 

GTCATAGGAATTCTAGTGTCCAAGAAGGGGCCAGAATCCAGGGTTTTTGACTCCCAA^ 

CAATGTTTATGATGCCCTATTAAGTAATTCCCAACTGGCAACACTGAGGA 

3 5 GAGCCAAACCTGGAG CG CTG AGG AAGAAAAATTTCTCAACAAG CAT ATGAGGAACITCTGGTTG AT ACATG ATGTGATGTTATGAC 

TGAGATTCCATCTGACTTAACTCCAGTTGGAGGAAGTTAGGAAATT 
AGAGGATTTCAACTAAAGCTCCTTCTCACATTCCTTATAGATTATTCAG^ 
AAAACCAATAACAAATAGCATTCAGCCCGAAGAACAACTCCAT 
ATTGGATAAACTCAAATCTTACAGTCTTGCTTTAGTAAAAACACAGCAG^ 

4 0 ATAAGCTGAGCAGCATCTAAGTTTAGTGTTTATACCAGTTTATT^ 

CAGGCACTCAGAAGCTCATTCTCTCACCACCATGCITTCrGCTCTTGACT 

AAAAAG AGAGATG CTT ACTTTT CTTG AAAT ACTGTTG ACT ATG AGAAG TAAACTT CAT CTGT AAATTTTT AAGATGAAAATTT ACA 

TCAGGTTTTTTTTCCTTGGTCCACTGGTGGGTAGCC^ 

CCTACCrcTTTATCTAATTGGTGTTCCTTCAAGGAAGCTTTCTTAGGGATG 

4 5 AGGTAGTGCCCACATGCTATTGTC CTG AG CTTTT AATATATTTG AAAACTT AGCAAGTCTTCCTCTTTCAAT^ 

AGCCATTGCTGGGATTCCTCCATCTTCTGCATTATTGCCACACTG^ 
GGTGACACCCTCGGCCTTGTATACTACTAGCCITTAGTTAACTTTGCAAC^ 
TCCTAGTTAAGCTTTATTCCTTTAACCrrTAATCAGGGTTTA 
GCAATTCTATAGAATAAAGAAAGTGTTTTTTCTCTATTACA 

5 0 CGAAAATTGGAATTAACATCCTTAATCCAAAGAGGAACTGCATATATATATATATACAC^ 

CATATACATATATATGTGTGAGTGTATATTTTCATGTGTGTGTATAATTTTTCCCCTTCCTCT 
TTTCTTTCAGTAGTGTAATTCCAGAAAAAAAGAGAACAAGACATTTG 
AGCTTGTTCAACGTGAAAGTTAACTGTGTTATAGGTCTGGCTTCCGGCCACT 
AATTCTTTCATTTAATCTTCTCTTTAAATGTAGATGTATAGCnTCTTGTTCG 

5 5 TGACATAAGTATACGTAATGAACTAGTGGATTATACrTGGCTTTAAAACT 

GCAAAGTTGAGATGAAAGTTrCTCTCTCCGTGCCTCrCTCTTCCATCTTCCCI^ 
AGCTCACCAGTGTGGGAAGGGGCTTCTCATCCACTGCCAGGCTGGGGTGTCC 

AGCACACT CGGATG AC CATGACTG ATG CTTAT AAATTTGTCAAAGG CAAACGACC AATT ATCTCC C CAAAC CTT AACTT CATGGGG 
(IAGTTGCTAGAGTTCGAGGAAGACCTAAACAACGGTGTGACACCGAGAATCCTTACAC 

6 0 GTGACAATGGTCTGGATGGAAAGGATTGCTGCTCTCCATTAGGAGACAATGAGGAA 

TTTTTTTTTTGTAGTTCGGAGTAAGTTTGTGAATGGAA^ 

ACTTTTGATGCCATTGAGATTC1ACCTCCCACAAACTGACAAATTAAGGA 

TATAATACAACTTGTTTCTCCCCCrrTT'CCTTTTAAGCTA 

AGATACTACACACTTTAAACCAATT AAAAAG AACCAAAAGTAAATAGAAAAGACATTGAATCACC^ 

6 5 CTGTCCACACAGAAAACAAAAACC(ZAACCAAACC!AAGCCCTGTTGTG 

TCTAACIAATCCTTCAGGCATTCTTTAAGGAGAAAAAGGATACCTT^ 
CITCTCTGGGTTTAAGAGATTTT 1 TTTGAAATAGTGAG 
GTTTTAAGTGTGCAAACAAGTTAGAGCTGGCAGCTGAATGATAGAC^ 
CAACAATATATCAAATACTTTCCTTCCCACCITTTTCCTTTT^ 

7 0 TTGTTACATATGTATATCAGAATGTAAGAAAAAAAAATTTATTTAAAAATATTTTTCGCAAAA 

CTGTTGATTGTGTAAAAATTTATTTTCATTATATAAATGAAACTGGTT 
TCAGGTTCAAGCCTTTCTTAAGCTCTGGTAACTTGCA^ 

TGAGGAAGGGAACATATCCAGTTTTCTCACTGTGAAACAGAGTTAGCAAGGAAGTGGAAC^ 
CTGTCCCTTATATTCTCACAATGCTGTCATCCTCTCCT^ 
7 5 CCATCTG CCTTTGCGCCTTGGTGACCAAATCCCTGCCAGATTGATTGGTG^ CT AGTGG CTATTTGAT AAA 
TACTTAGGACAATCTTCCftTTTAGGG ATCTTGATGTTCCrCCCCCCACCCCCCCACCCCATTGTCA T TT

GGTGACATTAACCATGTAATGCGTTGCAGCAAATTGCT^ 

GTACAGCTTCCTTATAACACTGGGTTAATGCATAGCTGGGTTCCCAATTCACTGTTGAT 

AATGCAAAT ATTTT CTGTGTTTC^TT AAATT AAAAG T CTAG TGGG ACTTG G C CAT AAGG AAGATTT ATGG C AAACTGAAATTTG C C 
5 TGGGGTGAATAAAAGAGTTGAAGAATCTG^CATGTTGGATTCTA 

GAAAATGTTAGGAAATATTTTTGACATTTAAAAAAATAGCTTTTCTACCAAAA^ 
GACTGAATTTAACAGGAATTAACAGCTGTTTTAGATTATTI^ 

GGTAATTT AG CAG ACATTT CT"rrriX3TTATTTT AAGTCTAAGTGTT AAATTTCTGATAG CATGTAAGG TAT ATTGC CTCCCTCTGT 

CCTCTCCCTTCCCTTCCCTCTCCCTTCCCTTCCCTCTCCCTTC^ 
CCATTTTCCCACTITGCCTCATTCCCTCTTTGCC^ 
TGCCTCTTTCCCTCTTTCCCrrCTCT Cm ^ ^ 
ATGGAAATGATTTATGTTTTTGGAAGTTTATGATTTTGAGAGA 
1 5 CCAAAGATTCTGAGAGGGACTCAGGTCTTTTCTACTAATCTC 

ACTTCATGAAAATAATACAGATTAACCAAGCCATAGCTCTAGAAACCAGACTCCACATGTTGCTGGGC 

GCT ATAAGGCTG CCCTAGG AAATAATTGTGTCTCCCCAAGAAAGCCAAGTC CCTGACCAACG CATGAACCATTCTCAG AATCCAGC 
AACGGTTGTTTTATATTAAATGAATCTAATTACTITAGTAG 
TTAGGTTGAACAAAACAGTAGCTTTAGGTTTCTATATATATTACATTTAT^ 
2 0 ACCATGATAGGGAGTTTTCATTTCTTGAAAGAGTT^ 

AAGCTGACAAAAAACATGTACAAGGGGAAATATTCAAGAAAAAATGTACACAACCACACAATCA 

CATGGCCTTTCGTAATACGGGAAACAGGAGAAAATGGGAGGCTAAAATGAAAAGAAAAA 

CCTGAATTACACTCCAGAAGCTGACTAAGATAGAATGCTAATAAATTGCTCAACTAGATATTCTGAT 

CAAGTTAAAT AG AGATCAATCAATGATGG CCTT CACATTATTTGGGG ACAGG CT CTTG TAAAT AGGTCACAACACATTTT ATTTTA 

2 5 TGCTTTTTTCCCCAAACATGAAATATCAGGAAATATAGTTGCTGT^ 

TGGAGGT GAGATG AAGGGTGGT AG AAAATG ACT ATTTT AG CACTGGC ACC CAGAAGG ATGGG T CAT AATTCAG AAG TG CCC CAAAT 
CACACCTATCTGTAGAAGACTTCCTGCCTTTrTGAGGTGGTGGAATATGAACACC^ 
CGTGACAACCTTTGTCAJGTGTTTGATACAAGGTGCATGTATCGGTGTTTAATAAG^ 
TATTCCAC^ATTAGACCTCAGGCAAGTGTGGGATCTGGTGAAGAAGTCAATGGGAGTGTGTTGCATCT 

3 0 ACTCTCCGGGTCAGCACAGCCACCAGTAACAAAA 

GAACC CACAT CAG CCTTTTACCAACCTCAATGGGGG T G ACTT CCAG AAGG ATCT AGTG CCCT CTG CCATG G AG CTG AACATGT AC C 

TAGCATAGGACTTGGCAAAGCTGAAGACAGAGCTTTGGGAGCTT 

AGGGTAAATGTGTGCTTGCCATCATAGAGTTTGGTGCCCCCTTCAAATGGTTGCTG 

ATTTCCCCTAAGGCTAACCTTAACTCAGTGTGAATCAGGAGAGGAAATTCTGGAAAATATAGCTCCTGATT^ 

3 5 TGTAGAAAGCTACCAAAGTCCATCCCTTGCCAACTTGGCATCCATGTCCACTTCT 

TT AG CAAAACGATG CTTCT ATGT AGT ATG ATGCAACT AT TTCT ACAG ATGAAAAC AC ATTT AAAAG ACG ACTGGGTG C AGTG A 
CTCACACCAGTAATCCCAGCACTTTAGGAGGCTGAGGTGGGTGGATCACTTGAGG 

GTGAAAC CTG T CT CT ACGAAAAAT AC AAAAATT AGAC AGG CATAGTGGTG CATG C CTGT AAT C CCAGCT ACT CT AAT CACT TG AAC 
CCAAGAGGCAGAGGTTGCAGCGAGCCGAGATTGCACCACTGCACCAGCCTGGGTGATAGAGTGAGATGATGCCACA 

4 0 AAAAAAAGATACATAGCCTCATAGGTTACTTTGTCCATT^ 

CCTGAACCITGTAAATATGATCnTATTT^GAAATACGGT^ 

CTAATTCAGTGACTGGTGTACTTATAAGAGGAGGGGAATTTGAACACAGAC^ 

GGACAAAC^IAACATCAAAGATTACCAGCAACCATCIAGAGGCTACAAGA 

4 5 GGAAGTGACCCCACTATCACATTGATTTAGGACTTTTCGCATACAG 

TTTGTGGTACTTTGTTATAGCAGCCCTAGGGAACTAATACAAAGACT 

AAAAAAGAATTGGTTAATATACAACCAAATATATTTATATCAAAAAATAAATATTCATAATTCT^ 

G ATCTTGTGGT CATAG CTGGTATTTATAACTTC C TT C TTT CACTACC CATTCCATAT ACTTGTGTCCTCAG CAAGCACATCACCTA 
GTCATGGTTTCTTGCCTGTTGGGGTGATGTCIAGTTTTCATTTTTGA 

5 0 TTCCACTTACTTTGTTCGTTACAGCAACAAAAGTAATTACAAG ACTT GAG AGT 

TT CTT ACTCCT AT AATCTGGTAGTAAT CCAATTTCCCCAG TCAT C C CAG C CAATACACCAAT ATCTGACTTTG CTTGTTG ATT CTT 

TAGCATAAGGAGCCCAAAGTCGGCAGCAGGGAGTATCAACTT^ 

CTCCCTTGKXiJGACCAGAACITCTA^ 

ACAGTAAGAGGAGCCCTTTCCATTTrTCCCCCTTGATTCCTGGATCT 

5 5 TTGATCATTTAGAGAATTTACTGCATCTTTCAGGACATTA 

TAAAAGAAGAATTCCACCTTTCTACTGCTTTAGAGTGATGAATAACAAGTTAAGACTAATG 

TCTTCATTTGCIX5CAAAGT AAGTTCCTTGGTCAAAAG CAAAG CACTGTGG AATAACATGAAGGGAAATAAGATATAAGAATTCTTT 
AAG TCCACAGATACTG ATTTT GGCAAAGGTATTACAGGTAGGGAAGACAAAT CCCT AT AAA 
AAAATACTGCCCTTTTCAGGCCriTGTGAGTAGAAGTTCATGTTGCT 

6 0 TGTTCACGAGCTCATTGGTCAAGAACAGAAGAAACTGGAAAGAAAAACCT 

TTAAGATTCTTACATGCTGAGGTTGAGCTTTGATCAACATTCAC^ 
GATTTTC CTGT CACT AATTTTCCAGTCATGTTT CTTCT AAGTCCCTTAT^ 
GTAGATTTATACCTCTC^CTGTCTTTCCTTTGAGGCAAAATGAACAATTAGGTACA 
TTTCTTCCCCACACCACCAGTATAGGGCCACAATGAT^ 

6 5 TAAACAATGCCAAATATTTTTCAGTCCACTGGTCACTGAGTGCTTCCCATG 

ACAAATAGACACCRTGGGTAACTGGGTCACTTGCTTATTTAATATACTTGTG 
TGAATGAACAACTTAGAGGCTTCCATTTGATCATTCATGCACAACT^ 
CAGTTCAGGTTGCATTATAACTTGATGGCCCATC^TCAAGCTTCC 
AGAAGAATATCTGAATTAAAAGAAGAATTTTTCTGAAGAAGTTGTTATAGTTT^ 

7 0 CIATCT ATCAGGTT AATGG CITCATG CAAG ATTCCTGCCT CACCATGGAATCT ACTGGAT CT AGATGT AA 

GAGCAGTTTGTATGGCAGCCTTGTATTGTAGAGGCTTTTCTTGTTTTGGACCTC 

TAAATGGATTGTAGTAGCATGCCCAATGAAATATATATTCTCTTCAA 

ATAGTAG^AGAGGTCAGATATAGCAACCTGTOVTC^TGTTCATGAAAGGAA 

AATTT CLAGT A ACATTCACTGG AT ATAAAGTG AATTTTGTGG AATTTATC^ CAAGT AT CTT AACATCATAT CTAG 
75 AACTAGTTACTTTATCCTATTTACCAGGTACAGTAATCATGATATAATATCATC 
TGGATAGACAATTATGGTCCCTGTGGACTAGATTACAACACAGGGCTGTAAAGTTGAAATAGCCCTGAACT
CATTAGTAACCCTGTTGGCTGGAAGCAAATTGCTTCTGATAGTATATACC^ 

ATAAAT ATTTT AG AATTTG G AGGACAT ATGGTCTCTG C CAT AATT ACTG AAAT CTG CTGTTGCAG CAT AAGAGCAG CCAT AC CT AA 
TATGTAAATAGAGAAATGTAACTGTGTTCTAATAAAACTTTATTTATAAAA^ 
5 TTGCTAATTTCTGGAATTTATTAACAGX3TATAGAGAAA 

TTGATTTGTTCGATCAATGAAACTACAGTTGGAACAACAGCT 
CATTCTTCAAAATTCATCTGTGTTCTGCATAGGATAAATCAGCAAAATA 
TGTTTCAGTTCCTGTTGGTGGCTCTAATCTTCATACTCCTTTAAGAATT 
ATTGCTX^TAATAAACTACTACACAACTTAGGGACTTAAAACAATAAC^ 
1 0 ATAGGGACAGTTCATCTCTGATAAACTCATGCAGATAAGAAATG^ 
AGATAGCTTGCTCAAATGACTGGCAAGTGCTTCTTATAGTTG^ 
CTCCA<^TGGACCTCTTCATAGGGATCCTTGGGCTTCC^ 
GATGAAAGTTGTAAGTCITCTAAATCACrrA 

ATGCAAGCCCAGATCCAAGGGTTGGGTAATTAGACTTCACrTCraU^ 
1 5 ACGATAGTTCACTTTGCACAACAAAGCCAAAACTAGGAGACTGACAAAGGTTACT 

ATAT CG AATTT CACAG AT CT ACTAAACCACTGGGGG CT CCACTGAG C AGT ATTTCAAATGG ATT ACACT CTACCAAGT CAGGAAGA 

TAAGTAAGAGAGGTAGAGAAAAAGGCCCAAAGAAAAGAACAGAACACAAGTAT^ 

ATTTCAAGGAAGTCATCAGAAAGACATTTCCTTTCTCCTAATCT 

TGCTAGGCAGCACTGCACCTTAGCATTTATCCATTTATTT 
2 0 AACCATGGCAAGTGTTTATTAAGAAGACTCTACCCTTGAG 

ATACTTGGTCTTCCCTATAACACTTACGTCAGTGTCTTGGTTGTC 

TACTCCATTGGCrn-rGACATCTAAAACAGTTGTCAACTTATAAAACTCCAAAATA 

TGGGATGGTATCAAAAAGCTCTCCTCTACCTCGTAAGGTTGTCACTGCAGACAGTACTTCAAGG^ 

GCTAATTTAAAGCTGTCACTCCTTATTTTGGATGGGAGGGAGAA 

2 5 ATGGTGGAAATAGCAGCAAAATAATTATTATTACAGTCGTAATGCTCTGCATTTATATGG 

CTTTT CTG AATATCACCT ACAAATT CAAAAACAATT ATG AAGAAG TCAGAGT AAAGCAT AATT AT AC CAG AAATTT ACT CTG AAAT 

GTAATTCAATACTCTAATACAATTTTACTCTCAAATATAATTCAGTGAAAAGT^ 

TTGGA TTTTTTTTTTTTTTTTTTTTTTra 

TCACTCCAAGCTCCACCTCCCGGGTTCACGCCATTCTCCTGCGTCAG 


HUMAN SEQUENCE - mRNA 

CCACGCGTCCGCAATGAAGCCGAGTGAATGGGGGCTGAATGTGCGAGTCCATAGCTGAAGAGGAGC^ 

ACTTATTTATGAAACTGTCTTGAGTTCTTCTTGAATTGCCAGTTTTCAGCCTC 

GTAGTGGCACTATCTAGGCCCGTCCGACCTCAGGATCTCAACCTTTGTT^ 

3 5 TAAGAGCCACCCTCCTGTCATCGCCACCACCGTTGTGTCC 

CCCGCTCGCTGAATTGTGGATGCAGCAGTGCCAGCTGCTGC^ 

ATTG C CG CTGG CACC AC CAC CACTG CCATCGG AACCTCT AC CAC CTG C C CTG CT AACCAG ATG GTCAAC AAT AATGAG AATACAGG 
CTCTCTAAGTCCATCAAGTGGGGTGGGGAGCCCTGTGTCAGG^ CATCAAAATAATCTACCCCAATG 
ACTTGGCAAAGAAGATGACCAAATGCAGCAAGAGTCACCTGCCGAGTCAGGGCCCTGTCATCATTGACTC 

4 0 TACAACAAGAGTCACATCCAAGGAGCTGTCCACATTAACTGTGCCGATAAGATC^^ 

TGTCCTAGACTTGATTTCCTGTAGGGAAGGCAAGGACTCTTTCA^ 

CCAATGAACCAAGCCGAGTGATGCCCTCCCAGCCACTTCACATAGTCCTCGAGTCCCTGAAGA 

TTGAAAGGTGG ACTT AGT AGTTTT AAG CAG AAC C ATGAAAAC CTCTGTG ACAACTC CCTC CAG CTCCAAG AG TG C CGGG AGG TGGG 
GGGCGGCGCATCCGCGGCCTCGAGCTTGCTACCTCAGCCCATCCCCACCACCCCTGACATC 
45 TGCCCTTCCTGTTCCTTGGCAATGAGCAGGATGCTCAC^ACCTGGACACCATGCAGCGGCrGAACATCGGCTA 
ACCACTCATCTTCCCCTCTACCACTATGAGAAAGGCCTGTTCAACTACAA 

GCGG CAGTACTTTGAAGAGG CTTTTG AGTTCATTGAGGAAG CTC^CCAGTGTGGGAAGGGGCTTCrCATCCACrGCCAGGCTGGGG 
TGTCCCGCT CCG CCACCATCGTCATCGCTTACTTGATGAAGC AC ACT 
AAACGACCAATTATCTCCCCAAACCTTAACTTCATGGGGCAGTTGCTAGAGTTCGAGGA 

5 0 AATCCTTACACa^GCTGATGGGCGTCGAGACGGTTGTGTGACAATC 

AATGAGGAAGGAGGATGGATTCTGGTTTTTTTTCITTCTTTT^ 
GTTAAACACTTTATTTTTAACAAGTGTAAGAAGACTAT^ 
GAGGTTAAAGAAGTAATTTTTTTTAAGCCCAACCIAT^ 
TTGTAAAAGTTTATGAG 


HUMAN SEQUENCE - CODING 

ATGCCTCCGTCTCCTTTAGACGACAGGGTAGTAGTGGCACrATCTAGGCCCGTCCGACCTC^ 
TAGTTACCTTGGCTCTGCCAACCCAGGCAGTAACAGCCACCCTC 

TGACGTATATGCCCTCATCCAGCGGCTCTGCCCGCTCGCTGAATTGTGGATGCAGCAGTGCCAGCTGCTGCACTGTGGC^ 

6 0 GAOVAGGACAATCAGGCCCAAACCCAAGCCATTGCC^ 

CCAGATGGTCAACAATAATGAGAATACAGGCTCTCTAAGTCCATCAAGTGGGGTGGGCAGCCCT 

TAGCCAGCATCAAAATAATCTACCCCAATGACTTGGCAAAGAAGATGACCAAATGC^ 

GTCATCATTGACTGCAGGCCCTTCATGGAGTACAACAAGAGTCACATC 

CCGGCGGAGACTGCAGCAGGGCAAGATCACTGTCCTAGACTT^TTTCCTGTAGGGAAGGCAA 

6 5 CCAAAGAAATT ATAG TTT AT GATG AGAAT ACCAATG AAC CAAGC CG AGTGATGCC CTCC CAGCCACTTC ACAT AGT CCT CG AG TC C 

CTGAAGAGAGAAGG CAAAGAAC CT CTGGTGTTG AAAGGTGG ACTT AGT AGTTTT AAG CAG AACCATG AAAACCT CT C 
CCTCCAGCTCCAAGAGTGCCGGGAGGTGGGGGGCGGCGCATCCGCGGCCTCGAGCTTGCTACCT 
ACATCGAGAACGCTGAGCTCACCCCCATCTTGCCCTTCCTGTTCCTTGGCAATGAGCAG^^ 
CGGCTGAACATCGGCTACGTCATCAACGTCACCACTCATCTTCCCCTCTACCACTATGAGAAAGGCCTGT^ 

7 0 GCCAGCCACTGACAGCAACAAGCAGAACCTGCGGCAGTACTTTGAAGAGGCTTTTGAG^ 

AGGGGCTTCTCATCCACTC<CAGGCTGGGGTGTCCCGCTCCGCCACCATCGTCATCGCTTA 

ATGACTGATGCTTATAAATTTGTCAAAGGCAAACXIACCAAT^ 

GGAAGACCTAAAGAACGGTGTGACACCGAGAATCCTTACACCAAAGCT 
Table 45

MOUSE NOMENCLATURE
ICSGNM Fli1
Celera mCG14021


HUMAN NOMENCLATURE 
HGNC FLI1
Celera hCG39646 

MOUSE SEQUENCE - GENOMIC

CTGTGTGTGTGCAGCCTGAGATTAGAGGGGACAAACTG^ 

TCATTCTGGAGCTGAGGAACCCCAGGCAAGTCCTGGGTCTCTTTAA^ 

ATTTTAAAGGGTTATGGTTACAAAATTGTCTTGGGAACATC!AAACAATAGTT 

AGTAACCTAGACTGGGTCCTCAAACTCTGGATCTGTCTAGCCTTAAGCATCCCCAA 
1 5 CTGGCTATTTAACCTGTCTAGTTTAAAAGTCTGAGTTGGGGGA^ 

AATC^GAAGTCGAAGGATCTCTGTGAGTTTAAGGCCAGACTGCT 

TCTTGAAAAACCAAACC^CCAACCCACOUVCCAACCAACC^CCM 

AACAAACAAACAAACAAACAAAAAATGGTCCITGTCTGTTTGGTG CTTGG CTTTATGGCACCGGAGATGAACATAGGGTTTTACTA 
ACCTAGGTGAGGACTGTGTTTCTGAGCTACAACAGCTGTTG 
2 0 AATATAGTGGTTAGAGGAGTTTATCCCAACATrcATTCCCTTrCACTC 

TCCATCTGGTCCATCCTGGTAAAAACCACAGTCAAGGGATTTTCAATCACCCTT^ 
CACACACACACACACACACACACACACACACACACACACA 

TGTTGGGGTTAAAGGTGTGCATC^CCACTACCTGGCTCACACTrTTAAGTTAGAA 
CTATAGCCTAGGCTTGAACGAACTACAGTGGCCCCCATCCACTCTACT 

2 5 TGGTGGCroGTGGCTGATGACCGTAGGTCCTCTCTGTGTGTGCATGTGGTCC^^ 

CTCATAACATGATGCTGATTTGTAAGGACrAATATGAAGTGGGTGGA^ 

CAAGAG AACTTCTATGACCCTGAATTTGTTGACG CAACCATGAAATTCCATCGGGCTTCAGGGGTATAATGX3TTAACTTG CATTGT 

AATTTGGATGATTTATAGTCACCATGGGCGTGTGTCTCGG 

TGCTTTAACTTGAATATGAATAGCACCATCTCACGGGCCGGGATC^ 

3 0 TGGCAACGTGACCATGAATGCAGTGTGACCAGCCACCTCATTTCTACTGCCACCATACC 

TTAAACTACAAATCAACACAAACACTTCCCTCTTTAATTTGCCT 
GAAAAGGGAAGAGAAATAGAATCTGGCTCTTAATGGAGGAGATAAGACTGTACATATC 
AAGTTTAGTAACTGGTGACTGACAGGTGTTTCACATGATCCTCCTTATCCT 
GGAATCAGAAGGCTCGCCCTAGAGCACAGCGGGGTCCCTCTCCGAAGTTCATCGCGCCCT 

3 5 TCTCCCTGTGCAGAATGGTTCCCAAAGCAATGAAGGC 

TTGACAAGAGAAGGAAGTTCATGTCGTTATGAAAAGAATGCAG 

GCGGATATGCCAGAGATTAGCTTTGCAGAGCACTAGTCTGGGAGCCGCTCTGTGGTCCCCCTGAACATGA 

TGACATTCCTTTTCCCCATCACCTCCAGCAACATTCATCAGCTACTGAA^ 

AGAGGGATGGGTCTTTGGGAAATTACTGCCTCTTTGCCCCACATCTTGTGCCCTTCT 

4 0 CTGAGAATGAGCAGGAAATCCAGGACTTCTCTTCTGTCTrGTT^ 

TGGAATATTCCTGAATAGAGTGTAGAGATAGATCGCCAGGCACTCTGGATGTCTGTCTTTGCCTCT 
GTTTTAGCTAAGCTGTTTAAGAGATGGACTTGGTGGCTCTTCCTTCA 
AGCACCTACTAACCTAACTAAGGACAACAAAAGGTAGAAGATATATTTGTGAATT^ 
TATGAACACACACACACTCATATGTGACAAATTCAAAATAAGTG^ 

4 5 CACTTG CCAAATCCG T ATT ATGT AAACAGAG T AT ATGCAAG CAGGCACT ACCACT A CTGT ATC CATT ACTTTGG CCATGG ATG CCT 

CCTACGTTGGTTGTGGAGCCACrGAAAGAGCCTTGTCTCTGCCACCTAGAAG 

GCCTACGGGGAA(IAGCATCC!ATCACTTGAAATGAAGATTGGA 

TTGTTTCAGAATCTAACAGTTATCACCAACTGCTA^ 

AGAATACCAGTCnTATTTCCAATGGCTCCTGGCCATGTAAGACTCACTGAGCATCCGGCAAGTA 

5 0 TTCTTC C CTG CTTCAG ACAAAGCCAAGG CAAAG AGTTTGCCAAGGG CTT CTG G T CAG CATTGGG AGAGGT AAGG CATG AAAATGGA 

AGGCCCAAGCC^GTCAGAAGGAGGGGTGGAAAGGAGGGAGATAGGAAGATAGGGAGGGAGGAGGGAGAGG 
NNNNNNNNNNNNNNNAGAGACAGAGAGAAAGAGAGAGAGAGAACTATT 

TG GG AAAAG AGGGGAAAAGAT AAAAT AG T AGACTTT AAG ATGATG C AG TAG ACTT ACAGATG ACCATGTCCTTTG CTCAGGAG AGT 
AG CGATTGGGTG AG AGGAGG ACAAGTCAT CC CAGG AGCT ACTT'GATTCCCTCTTCCAATGTTTCTCTACAAGGACT 

5 5 TCATAAGTATTGTGGGGAGTCCTGAAAAATGGCCACCCATGGTCAGAGAGCTACA 

GCAGGTCTACTGAGCTTTCTTCGAACACCTGCTACCTAATCC^ 

CTGGG CCCTCX3 ACAGCACTCAG AGGCCACCCTGGTAATGAGGCAAAGGATGCCTGCT CCTAATCCTACATA 

ATAGTCAAAGGAGTTCTCATTCACATGTCTCTTTGTCTCCGTCTCCCCCCAGCCCCCGACTCATTC^ 

TACCTACTAGGTX3CCAGATGCTATTCTAGGCAAAGGACTTGCATGGAATGATGCCAT 

6 0 GGAGATTAAC^GCTCATGACTCCATGGGCAGTGATGGTTCATGATTGGGAGT 

CTGCAGCTGTGTGTGTGTGTCGGGGGGGGGGGGCTGTTGTC^ 

TCAGAGCGTGCTTCTCCATGCAGGGACACCGTGCCCAGTACTCAGGGATTGTGGTTAAAGTAA 

ATAGGGATTCTCCTAGAATTCCCAACAGAGTCTCTAAGGCCrTCACGTATTCCTCATTCTCA 

GGAGCTAGCTGCCCATCTGAATTAGGGATTCCAGAACAGCAGGACTC 

6 5 CTTTAGGAGAATTATCTAACATGCTTATGGCGG CTTGG AACTTTCCAAATGCCCA 

CTGTGGCTGAGAAAGATGGTCGAGTGCTGCTATTGTAAGCCTGCGGCTACTGCCAG 

CTG ATGTCT AAGGG CAG CAAGGGAAGG CTGT CC CAGAT ATG ACAATG ATGGTGATGGTGG TGGTGGTGGTGGTGGTGG TGATGGGG 
AAG AGGAGG AGGAACAGCAGAAGCAG CAG CAG C CAAT CAAAGTGG C CTCTTTTCTTGAAC ATTTAGGG AAAAGGCACAGGG AAG AA 
AGAA£CATTGCTCAGGTTCCATCCGGTCCGTCCCTGTCT 
70 TT AAAACTAAAGATCTAATTATTTACTCTGTCC CAAG CAGTTTACCCAG ATG ATGTCATTTTATC 

TTCATTTACGCCCCACCAGGTTAGGGCTTGTTGGCATCCCCAAGCTACAG^ 
CACTGGGAACTAAAGGAACAGAGGATGAAGTTTGGGTTGGCACTTAGAGCCT 
ACCCACAAAAATCACTTTCAAAGGGCCAATTGTCTAACTGGCAAAACA 
AGAGGGCATGGATGGAAACCCCACATCATCTTCATTTTCTGTGTGCATGAACA^ 

7 5 GCCTAAAAAAAAAAGAGGTGAAAACACCTCTCATATCACTTGGATGGATCAA 
GTAAGTGCTTTGCTAAATTGTCTACTGTTTCTATTTA
CTTATCTGTACTAGGAAAGOVAGCTAGTTTTCATC^ 

G ATGGGGGGAAGTGTTGC CCT AGGC CTT AGCAGTCCACTGCTTTGT CT CAGAGCT CAG C C CG CT AACT C AAG CAGTC CCTCTGAAA 

GGACTCTAAGGAAAGGGGTCGCCACAGAGTACCACTGTGCTAGAATATTTG^ 
5 GGTCTGCAGCTCAGGGGTGAGGCACCAAGGCATCCTGACTA 

TGGGCTCCCTTGTAGTGATCCCTACTGGGGTGCTAAGAAA^ 

AGCATGGAACCAACAGGTCTTTCCAAGAATAGCCTGTGTTTTCTGAATGAA^ 

TCATTAGAGACCAAGAACAGTGGTGAAGGTGAATCCCCCATTGGTGTTCCTITTATATT^ 

TCTCAATATCTCGCCACTAGAGCTGCTTTGACAGCAGCATAATGCTC^ 
1 0 AGTTT AAG CATT AGG AATTTT AAGGTTCAAGTAT AGAG CTTTGAGGAGCACACATTAAGT AGCAACATCAG CT C AT AAAGCTACAA 

GAGAG AGG CAGTGCT CCCTGGCACACT CACTCTCAC CTGTCTGC CGCAG CTrGCACCTTGGT ACTGGCTT CTGCATCTCATTGTCCA 

TGGTCTTCTCAGTCCACAGAAGTGACACAAAGGCCTGCCACCACTGCCCAGCACCCATCCCCTCCTCTC 

GTTTCCATCCTGGATGTC!TGTTATCTCCAAAGCAGTTTTGAAAGGGTCCTCA 

GGTCTCAGGCTTGAATGACAAAAGGGGTCCTATTTCCTGGCAGGTGTGGCAAGAATGTCCTCCCC 
1 5 AAAATTAAATCAGCTTCTTTGGCACCTCAGCCCCC 

TCAGATTCAGTCCAAAAGCCAAACAGGAAACAAACC^^ 

TCCACACTACTATGTGGGCAGAAAGCCCTTTGCTATCAATGATTGTAAATACAATAAATCC 
GATGCATGCATGCTGTATGGAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 
TTTTAAAAGATCCAGAAATTCAGGACATTTTTCAGGAAGGTT^ 
2 0 CAGAGGTTGTAGTGGCAACTCGCAAATGGTTTGACAGATAGGGCAAGGCTGGGTAATTGAAAACC 
TCCATGGTAGAAATACAGCAAGTTCTTGGGGGAA 

TGAGATTAGGCACAGGAGATTGGGGAAGGGTAAGAGCAGTGTGAGGAGAGATCTATCAGTGTACGTGCACCT 

TCAGAGGTGTGGGGGATTTGGGGATGAATCGGGATACTCTCTTCCTGTGATGCT 

TTGTTTTAAATAAAGTGTTTATCCTAAAATGGGTTCrAA^ 

2 5 GTTTCTTCTGCCTGCAAATCTCACTTCCTTAACAAGCCTCCAAGTCTTAGCTCTCCCATTTGT^ 

TGCGTGCTGTGCATGCTGCACCATCTGTATGTCCTCCTTGGGCTCAGCATTCCTGCCTTTTGCT 
GGCAGTGATGACAAGGGCCTTCCCTTCCCCCACAGCATCCCCATGTCCATCTCTGCCT 
TGATTTTGCTGGAACTCTCAAGACTGACTCATGACTAACTAGAGGCATAGT 
CTCTTTCAACAACAGATCTGGGGCTTGTTCTGCATTTTT^ 

3 0 TAATGTCAAAGATTTATGCTCAGCCTTCTGAACAGCCATAACrTCACAGTTTTCAAC 

AAGTTTAAGTTTGTTTGAGACACAAAGCCAGAGCAGAACCTTAACCGAAATGAAGAGTGGA 

CAGAGCCACAGCCTTCCCCTGACCCTGCCCAGGGGCAGCAGGTGAAGCC 

CGTCCAGACATAGCGAGGGTATTTGCTTTAGAGAAGGCATTCCAGGCAG^ 

3 5 AACAACATAAATTTATTTGCTCACAGTTCTGAAAACTGGAAACCCAAGGTCAGA 

CTGCTCTGG CTTACAGAT AATCTGT CTCCCTAOlTGCCTTACATGGTCAGTCCTTGGCACACAG CCACC CTT AGGGTCTTTGTGTG 
TCCAAATCTCTTATTATAATGAGTCAGTTGAGTTTAGAGCCCATCTTCACATTCT 

TATTGCCAAGTGGAATC!ACATTCT AAGTTACAGGTGGTT AG<^ATTTG CGG AATTCAG CAT AG AA 

CAAGAACCCAACAGAACCCTTGTTTGAACACIAAATGGTCATCACT 

4 0 TAGCTATTTCCCATTGATTAGTCCATCTTTCCTTCACA.GTCGGGTTACATGG 

GGCTGCCATTTCTTCTGAAACATGGTCAGCTTGCTTTTGTGTACT 
GCTGCTAAATGTGATTTGGAATAGTGGGGGGGGGGGGGGATC^ 

GACTATGTTTCTCAACCCATTACCAAAAGCCTCCCCTGAGTCAGGAGATGGTACAGTATG^ 

AG ACCTT ACACTAACTGCCTTTTCTATATATCAG AG CTTCCTC CTG AATT ATTTGGCTGAAT CATCAACACT AAGATCTGCAGGAC 

4 5 ATACTCATTCCCGCTAGCCAGTCCTCAGCACACCACTGTTCACTCrTTA 

GATTCCCTGATTCCCTTACATAGCTTAGGTCTAGACAAGCrCTCCGGTGGCT 

TTCATCTCTCCTAGCAATGTGGCTATGGCACACAGTTAGTAAAATGTGCTGCT 

TCACCTCTTGCAACTCTGTGTTTAGCCACCCTGTCTCCATTCTCCTTTGTGGGGCTTCT 

5 0 TTCTAGTGTTCTTCCATTCCAATGTTCAGACTCCTTTCTCCCCACGCA 

GAACAGTTACAGCTTCATGAGAGGACTGCCCTGTGCCACTGTGATATCTGAGAGTTGGGGTGTCGCTGGCT 

ACTGAATGTCCTATCTCTTCTCCCCTCrTGCAGGAGGCTCTGTCTGTGGTGAGTGACGATCAGT 

AGCGGCIAGCCCATCTCCCCAAGGC^IGATATGACTGCTTCGGG 

CGCAGCAGGAGTGGATCAACCAGCCAGTGAGAGTCAATGTCAAGCGGGAGTATGACCACATGAATGGATCC^ 

5 5 AGTTGGCCAGTCCTCATGAAGCAGGGAAGCCTGGGCGCAATTCCTCCTAGGACCACCCTC 

ATGGGTCAGGCCCC AAGG CCCTACTGCAGT ACTCAGTGCGTTTGCTGTCAAG CA 
TAGCAGTAAGATGCTCrTCCTGTGCTCTTGAGGATGTTTTAAGGTCCTCT 

TCTTCAAGTAAAAGTGGAAAAGTGGTGAGAAAGACAGACTGTGTGCCAGGAAGCCAGCCTCACACATGGGAAGTGTC 

TGGGGC G GGGTCATT AGG AAAGG CCAGGT ACAGCCHTGTTT ATGGAACTGGGA^ CACACGAAC CA 

6 0 ACCCATTGTCAGGCAGCCCCAGCGACAGGCACTACCTCAACATTTTGGCTAAATTATCT 

TGTCTAAGCCAGAGTCAGAACACCAAACATGTTTATCAAGTGTGT 

TCAATGAGTTAGTGACCACAGACACACAAGAGGGAAAACTTTAGCCTAGTTTATTACCTGATA^ 
TCC!ACTTAGCCTGAAGCAAACAGACATGAAAGTAAAAGCAAGTGAGTAAATACAA^ 
AGTTTATTCCACCAACAGAGGCTGACCCTCAGGACTGCCAGGACACT 
6 5 CAATCATTGTCCTGCAGGGATTGTGGACAGGAGTGCAAAAAGATAAGAAGTTACTGAAATAGCCCAGATA^ 
GGCTAGGGTGACAGTAATGATGACAGAAGCCCAGGGGAGGTGCATAAAGA 
ATAAACATTGAGAGGTGGGGACTGGATGCTGCTGAGTCTGGACCTTTGAAAGGGGA 
GGGTGATGCCTTGAGGAGGGGACGTCCTTTAGCACCCTTTGTGAGCAAGTG^ 
GGAAGCCAAAGCCCGTAGGATTGGGACTGGAATTGAGGCTTAGGCATCCTCTCTGGG 

GCCCTCTAGAACCCTCAGCCCTGCCACGCTTCCATCCCATGAACTAAAACCAGGAAG 
AGGGTTAGGGGAGCTCyVGAGAACCTCAGCCTTGAGCCCrcCATC 
GCAGAGTGTAGTTTATCAAACACTTGTATCTAGTCTTGTATCACTGCTCCCT 
CCGTATAGAAAGGAAGTCGAGAGAGGCTAAGCTGGCGCCAGCCCAGCCTTT^ 
75 TGCCIGTGCGTCATGGTTCTGTACCACGATCCCATTCTGCCAGCACCCTCTGGTGATG 






GCCTGCACCATCACAAGGTGTTCCTGATCCCTACTGAAAAATCTAGAT^ 
ATCTAGTTATCATTGCCTGCCAAGCCCACAGTGTCTCTC^ 

TGAGACCTATTCTCTCAACAGGCAACAGTGAAAGTCATCCATCCTTTCACAACA^ 

CATGTCAGAATCTAAAGTCTTAGCTAAGTCCCAGATTCT^^ 
5 TCCTGGAAACCATAGCACACCACTACCTTGGTTTATCTTCCCCAAGTCCCI 1 C I TT CTAAAACATTTCTTCrCCATTTATTCATAC 

AACTCTCTCGGGTTTTTGCCCATATACCACCTCTTCAGAATCCCTGACACT 

CCCTC^TGGGTTATTCAAGTTGTCTTCTCCCTCACTAGAATATG^ 

AGGAAAAAGCTAAATGGGGGCACAGCAGAGCAGGCCAAAGTCAAGTTGCT^ 

TCCCTTACCTGAAACATTTGACATGTGTTTCAGK3CTCTTT 
1 0 TATGATAATGGCTGTTTTGTATGCACGTGTGCTCAGTC 

TCTCTTGTGC^TCAGGTATGGAATTTTCCTTCTGTGGCCTCATG CTAGTACC CAAACGATATTTCAGATGGAGAATCCG AGGAACA 

GATGTCCATATGGGTGATCCCCAAGGTCACAATGCCCAG^^ 

TGGGTTCCCTTCTAACATATCTGTATCACTGAGCTACCTC^CC 

TTTAAGAGCTAAGAGAATCCTTCAAGGAAAAAGACTCTGGATCCATTTCC^ 
1 5 GGAAGAAGCCCGTGTAAGTATTCCCATAAGCTGGCTGAATATTTACCTGGCTTTGGGTC 

AAGATCTAGTGGAGTCTAGAGCCACCCAGGTAGGCATAGCCATGGTTITCACCTAATC^ 

GC CAGG ATCAAAG ACTTTCAGAAAG ATG CTTTT ATTTGGGATG AAACCAT AC CT AGCATTAACTT AG CATTATTT AACT ATGTG AG 
AGAAAT AACATGT AAAAACACCAGGCATGGTGG CATGCTCTGGTG ATCCCAG CTG CTGAGGAGG CACACAGCTGGAT CT C CT AGG C 
TCACTGG ACAGACAG C CTG CT AGG CAAGCTG CAGAT CTT ATG CCACAAAAAGGT AAATGGTTT CTG AGAAACAAAACTCAAAGTTG 
2 0 TCCTTTGGTCTCCAGAGGTATGTACACCAGCATAGAACTGCATGCATATATTTATA 
ACAGGCAGACAGACAGACAGACAGACAGATTTAGAGAGACAGA^ 

AGAGGCAGAAGCAGAGGGCTAAAGAGGCAGAGAGAGAGAGAGAGAAGAOUUVTGAAATGAA 

TACTCATCCTGAACAGTGCAACTTTCTTATTTTCTO 

TATATACAATGTATTGGGATTXMAGATATATGCAGTG 

2 5 CCCTTCTTTTACCAGCTCATCCCCCAGGCAGGAAACAACACrrCCCCT 

AGTGTCTCATGAGCACTTCCTCTATCAGTGAAGGAAGTTGACGGGTTCAGTCTrCCACC^ 
GTTGGAATGGCCATGTTITATCTGTGGGTCAGCATTTTATGACACCCCT 

CTGCAGGGTTCTAT AGGCTTAGGGTGTATATAATATAGATTTCCCAGTTAGGGCT CTTATCGGTCACTTAGCCTCAG CATC 

TGATGAGGTATGAGTCTCGACACTATCnrrCACTCTCTCAA^ 

3 0 ATAAATATAAATATTCTGGAGACTTAGCAGCATGTCCATTTA^ 

AGCCACAGGCTTTTTACCAGGTTTATGGTACCAGATGTGTATCTCCT 

CCACCATATGGT CAAGCCAC CGTTGTACCAGTGGATTTATAATG CCTGTCAAGTTGTATTGTGCAGCAAGCATGAGCACTGGTACA 

CTTCCTCCrCTAGGAGCCTGCATAGAGCCrCTGGCACCACAAATGCCAGC 

TTCTTTGTGTCTTGTGACCAAAATGTGTGTCATCTTCATCAATAGGGTCATA 

3 5 GCATTTGATAATTTTCAGAAAGTCCAGTTATCATAAC 

ACAGACACCATGCCTGACACCATGCCTGACGACCrCAGTTCTACCCCCAC^ 
CTAGTTGTCACCTC^TTTCAATGCCCATGCCATC^CATGTACCCA 

TAAAATGTAGAGGAGGGTGTGTCAATGCTCCAAAGAAGAAAACAGAGCAGGGCAGACAACrTGGAA^ 
GTCTGTGGCACCCAGCACTCTGATGCAGTCGCCACTAACTAGCCCCATGTTGCT 

4 0 GATCTGCTATAATGTAATATGCACATTGGACCTAAAAGTCTTAGTATGAT 

CTTATCAAAATATATTrAATATAATAATCAATAACATAATATATTGATATGTTATGTTAATGA^ 

TCTTAGATTTATTTATTTACTTATTTATTTATTGTGCGCCGGA 

GATGCCTATAGAGGCCAGAAAAAGGCATCAGATAGATCCCCTGGAATGGAAGT^ 

4 5 GCAAGAAO^GAAGTGCTTTTAACTGCTGGGTAATAATATCTCTGGCTTCT 

GGTGAAAGTGACCCTTGTGTCTCAAATATGTAACTCACAATCTACT^^ 
AGTGATTCACAGCACATCCCTACAAAGTTCTTTCAGAAGCATGGCTCT 

G TC CG TGT CTCTGGCAGG AAGGAAGTACTCCTTG TCTCTG AAG CTG CAAT ACG CT CTT C CCTTTT ACC CT AG CAG CAAG C C CTCT A 
GACCAGGGTCAAAGGGTCATTCAGAGGGGCTGAGATCTGAGGCAA^ 

5 0 TGTCTCCCTTCCACTCCTGTTGTGGTTCrAATCTG 

T AAACTTCAG AAACTTG ATC CAAG CTGTGGTTTG CAAAT AACTCTCAG CTG C ATC CTGT CTCGAC CATC<jGAATTGGAGAAAAC AG 
TG CATG AGG G AAACACTAACAGTTTGTG AGG G AG AAAACAG AACTAG TTTT AAT ACCTGG CATT CAGG AG CTAAACTCAGCATC AC 
TGCCCTGAGACAGGCrC^ACTGGCCAGOnTATCAGGAGG 
TCCCAGGTACCTTCCTAAATTTCTAATTCTAAACCTTGCATGGGGAGTAAGACA 

5 5 GGTTTTTGTATGGGTTTTGTTTTUTTTTAT 

TTCTCACAGAAACCCTAAAGGCGGCCTAGATTCCTCAGGTGTGCAGTGTGCATGCTACATGAATGACA 
CCGTGATGCTCAGTGGTCCCTTGCTGGCTGCGTGGGCATCGGGTGGTCCnTGGCT 
TGATCAAAACGGGAAGGTAACTTTGGGAGAAGTTTGAGATCCCAGTTAACCCCT 
TGGAGTCTGAAGACACTTCTATGGACCCCTAGCAGTCTTAAAGCAGTGCTCAATAA^ 
60 GGTTACGCTOTAC!TGCTGAGGCCTGCTGCCTGTAGCCrCCCGGGGGCCTGAGGTGGAGATGCTCT 
CCTGCTCTATTGTGGCAGCCTTAGGCCCTCCTGTCACCTGCCCCAGTACAGTCAGCTG 
TAAGTGCGGGTCTCAATACAAATTAGCTGCAGCCGTATGGGCAAAATC 
CCCTCCCTCTTGTTCTCCTTAAGCCCTCTTCACAGGCACCAGCT 

AGGCAGTGGTCACCAAGCCCATGGCTGGGCTGGCTCXSCrGAGGCAAAGGATAGAGACAAA 

6 5 GAGACATGGCTTTCTCCTITATCACCCTGAGCTAGACTAG 

TCCTAAGGAAGTCCCTTTGAAGACCrTGATGGAAACTGTTTG 
CTCAGGCAGGTAATCAACCACCAGGGGTCATACAGCTGA^ 

TACTGGTCAGCCAGGCACTGCCAGCGATGGAGCCCAGGGGGCCCCAGCTCCTCTGCACCTG 
TTGTTG CCATTCATCC CCACCAAGCACGCTTGGTTTTATGTCTTCATCTT^ 

7 0 TGGAAGCAACCTTTCCACCCATGGCCACCGTACACAGCTCCAAGTCAGGGAGA 

TCTGCCTGGAAAGAGGCCAATTTTAAATGCAGAAATTCCAAGGCCAGACACCT 
AGTATCCGTAAATGGGAGACTTTGAAGTAGGCACCTCCTCTTTT^ 

GC^AAAGTGTGTGTGCTGTCCACTCTCCCCACCCCCACCAGCCAATGAAAATTCCAGTATCT 
AAGCTTCTCTC^TATTTGGATGGTTTCTCATCTGACAAGAATCT 
7 5 GAGTGTGTGTGTGTGTGAGTGTGTGTGTGAGTGTGTGTGTGTGAGTG 
TGTGCTGGAGGGTGTGTATTTCTCTAACTATATGTGT

ACATTATATACATGTAAGTGGTAGAGTATATATCCAT^ 

ATGTGAATGTGTGTGTGGAACATGTGAACATATTTGTATTGTGTGTGTAGACCT 

ATGTACTGTGTGAAGTAGTGTGTGTGTGTGTTTGTGTGTGCATGCACACATGTGCATATCTGTGA^ 

TACATACATATATGTGTACATGTTATCACTGTGTATATGATGCCTGTGGTAAAGTGTGTAGAGCATAGTGA^ 

ATGTGCATATGTACTAAAGTTAGAAGCTGAATGTTTTCATTATGTATGTOT 

ATGCGTGCGCACACAGTTTCCTGGGGGACAGCAGTATGCTCTGGCCAGCTCAGCTTAG^ 

GCITCATCTCOlGCCAAGTCCTTCAACAGCAGAAGCCAAAGGG CTG AGCGCTGGTGAAAAGCCC CGATGTGCTACTGCCTCTCCTC 

CGACTGCCATGGGGACATTAAACCTTCAACAATCTGGGACTCTGACGGCTGGGTCTTGG 

ATAGTAAAACCrTAAATTCTAACATGTGTAAGTTATACATCTTATTTTCA 

AAAAATTACCGTCTC^CTTCTAATATCT^ 

GATTAAGAAACAAATTCATTTAATTTTCTTTATTGAATGTGTTCAGGCC 

TTCTCC1TCTTCCTGGGCTGGTGACTGTCTCCTTGCTGTGGGATGGGT 

CAATCTGTCCCTGCCCACTGCAGAATGACGGAGAGCAGACTGACCGAGGGCAGCAGC 

TGCAGAGCTGGCAGACTGTAATCAAAACCTCGTCTCATCCCCTTG^^ 

TCCCAGGCTGTTAGGATGATGCTTGCTGTCTGGCACTACCATAGTGCAGTATCCA 

CTTAGTCTTAGTCC^GGATGGGTAATGTGCCTTCTTCCTCTTC 

CCTCTTGCACTTCCATAGTGCTTCCTATGTGTGTTTACTACTCA^ 

ATGCATTCCAGGGGTTACTCTTCTTTGCAGCACCCAGCTCCTCTGTAT^ 

GTCCTGCATCCATCTITACTAGCCATGTGAAACCAGTAAGCACACCTTTCCTTTC 

TCCACACATCTGAAAGGGTTATCAGGAAACTCAGATAAAGCAAACAGCTTATGAAGATC 

ATAGCTGGGCTAGCAAGTACTGATACCATCTCAAGAAACAGAAGAGACAGGGTAATA^ 

ACTCAGTCCTGTAACCAAGGTAGACATCAGCGTCTATAGGTTTGTGGAGAAGGCCTATCTATTACAAG^ 

AGTGTTGACCACTGGCAATCAGGTTGAAACTGGACACAGCAGACAG 

ACACACAGGGTATTTATGACAGGATCAGGATAGGTTCCACGAGGAAGGTGGCACTAAGAGTTCATTGT 

AGGGATGTCTAAGTGACTGGCCAATTCTGTTGAGGCTGGAGTGACATGTCCCCATAATGATGGCTTCT 

ACTCCAGGGAGTCTCCGGTGGACTGCAGTGTCAGCAAATGTAACAAGCTGGTGGGC 

AGCTACATGGATGAGAAGAACGGCCCCCCTCCTCCCAACATGACCACCAACGAACGGAGAGTCAT^ 

AGTCAGCTGCCTGGCCAACGCCCACACTTCCATTATTCAGGGGCTGTAATGGAAGGTTGGCCCATCT 

G CAC C CT AGTCTTT AGTGG CAG ACT CAAACCAGGGAAAG TTG CACAT CAT ACGT AG CT AC C CT C CAAGG C AGGATTT CAAATG TCA 

CCCGAGCCCTGGCATGCTGATTCC^GAAGCCCCACTGTCAA^ 

GATGAATCTTGGGC^GAGGCAAGGCTGCTCACCCCGCCATC 

GCATGTACTTCCTGCCTTCTGCTAGATTCCACCTTCCCAAAATTACAGCCAACT 

AACAAACAGAACGCCTCCACGTGTGTGCCACAAGTTATGTCATTATGTAGAAATAGCCT^ 

TAOKSAGTGGGGGGCCXKXK^GATCrrATTGTTAAATCCTCTAGCCTATT 

CTGCCT AGGTATGGAGGGAATTCAAAAACTG CCACTTTAGAGTGGGTGCCAGG CAGAATGTCAGG AGTTTGAAT AAA 

GTGAATTACAGGGCGGGAGCTAAAACTAACCTTAACCAAAATAACTTCACCAGAGAG^ 

ATGCAGTTGTGTCTrAGAACGATAGTTTGTCTTGTTGATCTGAA 

TTGTTTCC ATGGG AAGTTGT ATCTG CTATCCTAGTGTGACAGTCACATITCTG AG CTCTGGG CT AAATTCCACA 
CAGAAGGTATCAGAAATGAAGGCAGAAGAGGAAGGGCTCTTGCAGCTCACAGGACTAAAATCTG 

CAT AAGGGGCTG ATCGCAAGGAAGG CGTCACCTGGTTT ATTTGTTTGTTTG CTTGTTTGGGCTGTTTG AGAAGGGGCTTCACTGTG 

TAGCACATGCCGTCCTAGAATTCACTGTGTAAGCCAGGCAGCACCTTGCGGCAATCCT 

TAACTATAGGTGTGCACCACCCACTCCTGGCCAGAAGGTCTTTAT^ 

AGATCATCTAATTGGTTTGTCACCACCCTCAGGAGAGCIAGCT 

ATTCTTTTTAAGCAGACGTGGTGAGAGACTCTCTCCACCCOICTAGGCCAGGAGAG 

CTAAGAAGGAACTCTTAGGGGATGAACCATTAGAGGAATGGAGATTACCTCAGG 

XCATAAGGCAGACTGTTCGGTTGTCTCTCTACCCTTGAGGTAGTTAAAA 

GAACCGACTTAATCATAGCAGTAAAAAACAAAACTGGCATGTTTATATCATGCT 

GCTTTAGATCCX3CTTGGTTATTCTCCTACTGGTTAGATCTGTATGCATTACTCCT 

TGGACGC^CGGGAAAGTCTCACITGGGCAAGTCCACAAATCAAGGCAATAAAACATTAG 

GATTTTAAGACAAGTATAAATGAGGGTTCTAGTTAAT(^CCATTTCCACATCGAGATGGGAA 

AAATACATTTGGAAGAGTTTAGCATGATCCAGAGTGGAAGATTTATAATTCCTTTTACT 

AGAGCCCAAAGCGAAGAGCGGAAATAATAAAATAAATAAAATATAGGGCATAAAATTGAATGTAC^ . 

CCTACAAGTAAGGAACTAGGGGAAGCTATGGATGAAATGATACAC 

TCAGATTTCCATTCATCTATAACCCTTGCTTGATGGT^ 

CACAAACCCCAACATGAGCAAGAGGGCTAGCTCGGGGAGATCTTCCGGGTCCT 

GCGGGATCCTCACAAGACAAAGCTGGGGCGGGGGGGTGTTCAAGGGGCCTGCTTCT 

T AAAAACCCTAGCAGTAGTCACGAAGG CAG AACTTCCACGGTTCCTTAGC C ATCT ACAAAATGGGT ATAACAAG ACCCATTTCATA 

GATTCTCATGAGAGAAAAAAATGGGTCAGATAGAGAGCTGGCTTGTGGTAGGATCT 

CITGGAGTCTTAGTATGACATACAGAAGCTGTGACATATGACAGATTTTTTTCTGT 

TATAACCCCTAGAGATCACAGCACCTCACTCCCn'CCCCACCACCATTGCTGGC 

TTGCAGCCTCTAGAATGAGAGGAATATGAGTTGGTTTAAGAGGAGTACAC^ 

TCAAGCAG AAAAGCAGAAG CAGGAGC AACCAG ACACCCTGTAATATAAAAT ACCATCC CTTCTCCAGGAAGCCTTCCAGGACT 

CAAATTCIACCCTAGACCCTTCAACCCAAGGTTCATTGAACTA 

TTATGCAGATGGGTGTGTCTGTGTGTGTCTCTGTGTGTGTGTGTGTGTG 

GAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTCTCAAATACTCAACCCTCCT 

GTATCAGCATGCCTATGGAAGTCTGTCATTTATAGCATCTCCCACCTTAGACTGAGCAGG 

AAACAGAGGGGGCTGGGAGAAMCAGGTTTGGAGCAC^GGCCTCAGT 

TGT CTCT AG AG C CATTTCT CATCTTT AAGT ATTT AAAT AT CCT AAAAATG TTT AGGGGTGGTCTGG T AGAAAT CTT AACACAG CTG 

TCCCCCCCCCCAAAAGGAGGAAACCCAGATTGGATTTATGTCAAACAGTAACGC 

CAGGAAAAGCAGGCTrGTGTATATTTCCCTTCCGGAGACTCCGGA 

TGCAAATGGGTCCCCTTCCCAGAAACGTTAGTTACTGGGGATTAT^ 

TGACCTCCAGAAACTTTAAAC^CCATcrTGT^GGGTATGTGTTGAGGC^ 

GTGTCAGAAGCCATGGAACCAGACGGAGAGTTCTGAGG 
GGAAGAGAGGGGCATAGGAAGGAAAGATGGGTGGGGGTGGGGGGGTTAAA^
AAAAAGGAGAAGArcGAAGAAAGAGGAGAGAGCAAAAAAAGAGGAGAAGAAA^ 
AGAAGCAAAGAGGAGAAGATAGAGAAGAGAGGGTGGAGGGGAGCCTAGGCTGCTGGGCAGGCATC 
TCCACTGCAGACCCC^CACTGTGGACACAGGAGCACGTTCGACAGTGGCTGGA 
5 TGACACTTCCTTCTTCCAGAACATGGATGGCAAGGAATT GTGTAAAATG AACAAGG AGG ACTT C CT C CG AG CC ACCTC CG OCT ACA 

ACACAGAAGTGCTGTTGTCGCACCTCAGTTACCTCAGGGAAAGTAACT 

GGTCCCCAAGAGCCAGCCCAAGGGAAGGATACCCTAATACCAGTCATGGAAGACCTATGCCAAGGAA 
ACCCCATCAAGGTCACATGTGTTCCTACAAAGCCTC 

1 0 CTTCTGCTAACOlAAATTGTACAGAGATAGAATCrCTGGTTGG^ 

AACATAGATATGAGATCTAAGATTGTGGGAGCTGACAGACTACTCCTCCTATTCT 

GGGGCTG CTTCTGCCCACTG CCACG AAACAACAAG AGCATCT AG TG GC CC AGGTCTGTT AGG AGGCACTG AAAACTTTCCCJGACA 

GAAACCCAGGGAAAGCTTAGTAAAGAAAGATCTACCCAATAGGAGGCTCCCTAGTACCAC 

TTTGATAGATATGAAAGGGAATACCTATGGACCTGCTAGTGACTCA 
1 5 TGGAAGTGAGCTCAGTGGTCTGCAAAGATGAACTGAAGCCACAGGAAG^ 

GCCAGCCACGTCAATGATTAGCGGGGCAGAACAAGATGCACTGTCCTCTA 

TGAGAAGATTTGATTGTTrTAAGGCACCTTTGAATT 

CTGGAATGGTTCACTTTGTATAGGATCTCCATTGTGCTTT^ 

TAGAGAGCAGGGCAACACTCTACTCACATGGGCrAGAAAATGA 
2 0 GCCTCTGGGCTGGCTATCTTGCCACCTCTCTAAGGC^^ 

GTCTGAGTrGCTGCACCATGAGGTCAGTCCAA 

ATGCTrAGCATTTAATAATGATTTGTATTTCTCAGCCTTTA^ 

CTGCCATGATCTGCACAAGCAGGAGAGGCAGTAGGCACAGAGCATC^ 

CTGACTTCTAGGCTCCTCCCGCATCATCTATG 

2 5 CTACACAAAGGGTCCAATAATCTTGGCTTTCACAGTAAATGCITTGTC 

AAGCCCATTAAGACCATGTGACTTTCAGGCCTGGATTTCTGACCAACCCGTAAA 
CAAAG AATCTGAG CCACAG AT ACTG ACATG AGG CATAT ATCTGCTG CTCAG AAGG T 

CT ATCAAGCCTTGGCAAG CCTGCTAGT AG ACAAGGGCAGAT AACAAGG CATGATGAAGG TATG AG CAG CT AT AG AGGGAGCAGGGA 
ATAGAGTTTGGTGTTTGGAGGCTAGTGAGAAAAACCATGTAGTCT 

3 0 ACAACACACGAAGTTTCCAGAAGGGGGATT^ 

GCCTTTGAAAGGGCAGTAAGCATGTGGTGATAACAGTTGGGAGTTCTACACAG^ 

ACAT AAGTG TTGCATGTACTTTTATGCTCTACCAGCCTTTGCTCCAGGAAGTG ACCATGCTA CCCCCTCCCAGCTTCOCATG 
AATCTGCAG AT AAGAGACACAGACACTGGG CTGGAG AGATGGCTCAATG ATT AAG AG CAACTACTGCTCTTCCAGAGGTCCTGAGT 

3 5 TGTACTCATATGCATGGACGGAAGG AAGGAAGGAAGGAAGGAAGGAAGGAAAGGAAAGG AAAGGAAAGGAAAGGAAAGGAAAGGAA 

AGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGA 
GGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAGAAAGAAAGA 
AAAGAAAGAAAGAAGTAAGTCTTAAAAAACAAAAGACACAGACACACAGTTGTTCAATTTCA^ 
GGGGATTACTATCTCCTGCCrTGGAAAATGCACACTTCTCTATACT^ 

4 0 AGTTCCATCTAGCCCCTGCCAAACATCCCTGACCAAC^ 

GCTGCTCCATTCTTCTCCCTCCAAGGCTTGGCAAAAACTCCCC 
TCCTACCTGTACCTTCTCTCrrAGCGATTGGCCCATGGCTTTCr^ 
TCAGACCTACCCTTACACATAAC^TGTGCTTATGACACAAA^ 
TTTAAATTn , CTTGAAATGTAATACAGATTGTAATAACCTTCT 

4 5 CIT T OTTTTCAAATTATGG Tn 'l T m T rAGGA 

AGAGGAGGCCATTTGTCAGACATTGTCTGCTCTGTGATGGCTG 

TTTTTGTTTTTCCAGACTGAATTACATGAGCTATTTCATG 

ACAAGTGTAGATTCTTAGCATCATTGTGTTGGGCTTTTGT^ 

AGGGAAAAGAG CTGAGACTCACAACAT AG ATGT AGGG C AC AG CGT AG AG T AAAGG GAGAT CAAT AC AG AAAAACTGGGG ACT AGCC 

5 0 AAAG CATGGATTG ATGGAAGTCAAAATTGGCTCAG CAAGATGG^ G CT AAGACTGGCAATCAT AG 

TTCAATCCTTGGGACCCATATGATGGAAAGACAGAACCAACTCCCATTAGATGGCCTCT^ 
AAGTACACACACACACACACACACACACAATGTAAAATTTGATTGC^ 

TGTTTTG TTTTG TTTTTGTTTTTTGTTTTTTGTTrTITTG TTTT AG ATTCCTTG TAG GG CG CTG CATTCTTG C ATCAAATTTACTT 
TC^CAAGAGACAGTTGGCTATTCTTGTTGTCTGGGA^ 

5 5 TGGCCAATGATAAGGAAGAAATAACAATGTCCTCATGGATGAGCCCTGCCTAGAGGGTCT 

TGGCTGTTCCAGATGGGGTACCTGTGTAGCATTAGGAAAGAGATGAA 

TCAGCTGAAATATCTGAGACACAAGCAGACACCAGCCCAGGTGTCATTCACTTCCTGTGGACAGA^ 

GAACAAAACTTTGATTCCCATTTCTTGACCGAAGCAACTC 

AAAGCACCTGTCCTTACCTTTAGTCCTCGGACCACTGAAGCTTTCAGTCTGAGCT 

6 0 TCATGCAGGATGCTTGAATATGCCTXSAATTAAGATGCTG^ 

CAATCTCTTCTCCAGCCTCCAACTTGAGGGCAAAACCAATGTCCCCATATGG 
TTGCAAAACTCCTCTGCAGCCTCGAGGAGTGAAGCCAATCAAACC^ 
AGACTCTGGGTGTGATTGGCAGCTCn'GCCAGGATCTCCCAGATATGCCC^ 
AGTGAGATC^GTCCCTTCTCAGGCCAAGGCGAATGAGGCAACGCTATCTGTCTCCAACT 
6 5 GGAGTTTGGGGATGTCTGGGAGCCCAAAAGCTCCTGGGATGAGTGGCCCAAAGCAGA^ 
CGT TTTCAAAGGAAGTGAGTGGAATGTCCATCTGGGCCTG 

C CAC^GG ATG AAAT ACATC AAGTATCAG<^ACACCA,CAG AAAAGG ACTCCAG CTTG A TTCCCATTTTT AATTTAAACGTAATGGAA 
AACACATTTTACACTCTGTGCTATCCCCATCTGCAACCTCAGATGGCCATC 

GTTAAACTTTCCATCTGGATCAGGCCAAGAAAAGAAGGCAGG CTT CCAAT AAGATAAAGTGCCCAGTTGCCTGAAACTCAGTAGTC 
70 AGCTGGCCTGTTGTTTAGCGTGTAGAGCCTTAATTCCCTCCCCAGTGTGC^ 

GAGCTAGGAATTGACTGGCTGGCTGGGGTTGGGTTGGGGTAGATGCTGCTGGGTTCTATC^ 

GGCTCTGGCACCAGGTCCCCTGTCCACACTGCTCCAGGTGTCCAGTT^ 

TAAGGACTTCCCATCTGCAAGTCCCATCCTATGTTCACTTTACTAACTCCT^ 

TAGGGATGTCACTCAGTTGGTAGAGCACTTGGCTAGC^TGCAGG^ 
75 TGGTGCCTGCcT\»'TCATCTGCAGCACTTGGGTGGTGG 
GCTAAGCCAACCTGGTATACATGAGAATCTGACTTAA

AG AAG AW1AAAAAAGAGAG C AGAGGLAAAGAAAAAG AG AAA CTCTTAATCTTGAAGG ACCGGGTC 

ACTACCAACATTTTAGAGCAAAATGTGTCTCAAATCAACAGACACATATA 
CACACACACACATAACAGAGTGAACAACACTTTAAATTAGCT 
5 CCAAGACACCTGCCCTCACCTTCCAGCACCT ITTCTTTCCCTTC 1 ICCTTCCCTCCCCTCCCTCCCITCIU'CCCTTCTGCCTTTAC 

TACCCCCCTTACTATTTGTTTGTTTGTTTATAATTTCK^ 
TCTTGCCTTAGTCTCTGGAGTGCTGGAATAAACGTGCATCCGTTCIT^ 
CTGC^CTGAACTGAGGACAGGTCHTATAGAGCAGCCCATCAG 
TAAGCCTTGACTTCCCTAAAGGTTGATCTCAGCTGTGGGAGGAGACTAGCATA 
1 0 TAAGAGAGATAGTGTCTGCX5AAGAAGGGGGGATGGTTTAGTCAAGA 

CCAGAACCCACGTGGAAAAAGCCAGTTGTGACAGCATACACTTGTCTCAGT^ 

CCTGAAGCTTACTGC^CAGCCAGCCTTACCTAT^ 

TAGATAGATCTTT1IAACAAGGGCACATACTCCITTCGTC 

TGTGTGTGTGTGTGTCCTATATAGCCATTGATTGTGGCCCTTGCTTrCAGGAAATTCCT 
1 5 AGGTCAGATCCTCAGAGCAGGAGGAAGAAACTCAAAACCTCCAAT^ 

AG TT AACGTAATTT AAAAAAAAAAAAAAAG ACACAAGGGG CAC C CTTGG CAT C CTG AGAACCTGGTGAAAGAAAACCCAGTG AAAA 

GAAAATGTGTCTCCCTGGGGAGAAACCAGCCCCACCCCCATGTAGGTTGCCCCTCCCCTGTGAA^ 

GGCAAAATCTCTCCCCTAGGTTACCATGAAATATGTATTTCATGCAGGCTTTCA 

GGGAGTTTCCTGTGGCTTCACGTGTGCCCAGAGAGGAACT 
2 0 TCTCTGTCTAACTTTGTACTAATGACTGGAGTGGT^ 

TGCCTGAGGGAGAAAATGCCTGGGACTGAGGC^CAGACATTCTTGCTCTTACCC 

TGATTCCATTTCTGAAGCTCTGTCCAGTCACGGAGAATTATCCCTC^ 

CAG CAAGGCCAGTGCCCGCAAG CTCTG TG TAAAGGGTTTCCATTCTG TG TAACTG AAAGCAGTC ATG AT AAGTG CT AGCCTCTG CT 
GAATCTTTAAAACAAGCTAGGATCTCTGCAGCACACCTTCTGAGCT^ 

2 5 ' CCCAGTATAGACGAGGGACATGTGACCAGGGGAGCTATACCCAAAT 

AATTTGTCACTAATATTCTTGGAAGATGGAATGTTGCCTTCGGGTGGCTGTXn'CC 

TATAGCAGTCCCTAAATGGTACCCACCACCTCCTGGTTTTCACCTAGGAACTGCTCAGACCT 

AGGTCTTATGTCCCATAAGAGGTGACITTGGCTGCCCTGCTTAGACCC^^ 

3 0 AGTCAGTTCCCTTGGCCACATGTCGCAAGGAGAAGAGTTGTGCTTA 

GGCCTCCTTTGTAAAAGCAGATCCCAGCCCTCATTTTGAAGGCTTAA 
AAAATGCCTCCAATTTATAAAATAAACTCTTGGGATC^CTTTAAACAAA^ 

TTTTACATGCAATCACAACAGAAAAACAATCGTCTATCCCAAGTAGCTCCACTGGACCTGGTGCA 
CCGTTTAGAAGACCACACACCAGCGGGTTGAGATCTGCTGTTCTAGACACTGAGAAAATTGTCT^ 

3 5 GGTGGGGTCCCACAATCCACGGCCCACGCAAGGGTCAGCAGCCrrCCTACTCT 

TCTAACATGCTGGGAACCTTGGCTTTAACACACAGGCCTTTTGGAGAACGTTCTAGATTCAG 
TTTTACTTTCATTTCAGGTTCACTGCTGGCCTATAACACA^ 

AGTCTGTCCTCTCATGCGCCCAGGTTCCCTCCACTCTCACCACCGCACAGGTCCATACT^ 
TGGACACCAATTCCTTCCTCGGGAAACTATTTTTCTAGGACACAAGA 

4 0 AACAATAACAAAAAAAGCAAACTTTTTTTTTCTTTCTAAAGACCATGTT 

GCTTTGAAAGATCTCATAAAAACATCACAAGAATATAAATTAAAGTC^ 
ATGAATC^GCTGACTTTCO^CTGACACTTTGGGGG 
TTGTTTGTTTGTTTCCTAAAAGAAAATTTAGCATTTTCAGrrCGT 
ACCTGTTCTIX3GTTTGGGTTGGTTTAGTCAAACAG 

4 5 AAATGGTTTTAGTGAGGCGCCACAAGGATGCTTCTCAGAGTCTTCATTGGTAGGGACTGAC 

GCACAACATCTGTGAGGCCCGGGGTTTGATCCAGCAACACACACACACACACATACACACATA^ 

ACACACATACACACATATACTCACAAACACGCATATACACATACACACAAACACT 

ACATACACTCACATATACACACATATGATC^CACACACATACACACACATATATTACACCCAAACACA 

CAC ACAAATT CACAT ACACACACATGCAT ACTCACAT AT ACACAT ACACTT AC^ CACACACACACACAAACT AT CAC C CAAC 
AGTCTGATGATCTCCT 

CCCTGGTTTTGTCACATCrCTGTCCTCAGACTCACGTCAGGTTCAGGGTTCT^ 
CAGCTTCTGAGGATGGCTGACAATGGGGCCATGGCTTGCAGGTTTAG 

AGGAAGGATGGGAAGGAAAAGAAAGAAGAAAGATGGACGTGAAGGGAAGAGCAGGAAGGGAAGGC^ 

5 5 GGATTTACTGTACAGCGTGCTGGTACCATCAGAATTAACCA^ 

CGTACAGGTTTAAATACTGGTTTAGGTTGAGAAACTC^TCTGCAGACAATGCAGTCACTAACCA 

TTCAGTGCAGCCCTGGCCCTGGGCrCTACCrGCCAGGCTGAGC^ 

CCTGCrcCCTCATTCAGGCTGCTCGGAGCCGCCAGCT 

GCAAACGCAGCACCGGCTGCCCCCATTCIAGGTGGTGATGGGCATTTATCrcCCAGCCT 

6 0 OVGTGCTTTGCAGGTGCATCTrTTGTTCCCAGTAAAGCT 

GGAGCCAGCCTGCCACAAAATAAAGTACATTAGCAATACAAAGCTAACAAACTCCTCCCCCC^ 
CCrCTCTTCrTCCCTAATAGCTATCTTTATTTTTT^ 
TCrCTCTTTGAATAAGAGGCACACACGGCACTCAATCAGAGCTGGGTGT 
GCAG^TGCTAGCTCGCATCAGGCGCAAATC^IAGAAGCTGAGGA 

6 5 AGTGCTGCTTTAGTTCAGAATGGCATGATCGTCCTTTCTAGCCTGCCT 

TTGTATGTAGAACCTAAGACAACATTTCAAAACT 
AACCATTGAACATTTTCTACCITATTTGTCAAGTTTAATGTA 
AGCCTCGGCTGTCACTTAAACAGGATTTTGGAGTGTGCCTGGTGCGG^ 
TCACTCCGCTGTTTTTGTTTCCTTTCTGTGC^^ 

7 0 ATC^TAAC^AAAGTTCCCCTTTCACTTAAGCCACTGTCATAAGAGAGAA^ 

AAGGTTATAGAGGGTGAGGGGCCAGTGGGGACCTCATGCAGCTC 
AGCAGTCAATTCAATATTTTCAAAACAGCTAATAGGATTGTGTG 

TGTTTGCATGCGCACACATATCTGTGTACATGCATGTGGAGACT • 
»CCTGGA^GCCATCCATTCAGCTATGCTGGCrGTCCAGTGAGCT 
7 5 A GAGCCACACTGTGGCAGGC^TATGGCrTCTAGAGGCTCCC 
CCAGCTGAGCTGTCTCCCCGGCCCCAGTGATTACTTTTGTC

CAGATAAGCTTAGTAACATCACTCTGAGTTCTCAAAGTACTTACAGTCACTATATATT^ 
ATGGCCCTTATTGCCAAGGTTCCTCCTCAAAAACACAGTCTTTTCCCTCT 
TTGAATTTCTGCAACATTTTTCTCTGCCTCTGTAGTTTTTC^ 
5 GTATACCCAGCCTCTACTAGACCrTAGCCTTCCGTCTTAAGGGCACAGAGAGCCCC^ 

GGACAAGTCCTCTGTAACTTCCTTCTCTGGAG CAACCTGGAGCCCTGGAGCTCACTGG ATGGAAATGGGGGCTGG CTGCCAGAG C 
TGCAGGTAGAGAGAAGTGCTTTGCGTTGACTGCTGACAGAGAGAGAGTACTC 

AATGTG CAGAGTGTGG CTAAGTT AT ACACTTGAAAGTGGTT AAAACGGT AAATTTT AT CATAAATAACA 

acaaatactgaggctaaggcccagacataatgtgttaattctaaatct^ 
1 0 taagttctagcrggacaagccctgcttcgt 

ttctgaccttctctgtccccaggtcatcctagacaaagacctg^ 

GTTGATGATGTCATTATTACTGTTATTGACAACTAGGTATGAGT^ 

ATTTGGCTCTCTAAAAACAGATCATGTGTTGCTAACTTGGGTATATGGGGGAGCACTGAGG 

CATAAGCCTTCTAGAGCCAAGCTTGACCCTACTCCCTTCTTCCCTGTCCTG 
1 5 GGTAGATCCAAGGTCCCAGCITTCTCTTTGGTTTTCTGGGCAGCTCCT 

TAGGCAAGAGTAGGCACCTAACTTCAGAGCTCCCCTACATGCCTAAGGCCCTC^ 

TACCTCCAGCTCTCCAACAGGCAGCGGATATAGAGCTCCTTGTTCACTAACTTC^ 

TCTGAGCTTCTAGGCCCCCACCTAGAGCCTTTGAGACTTGATGCCGAAGGTGAAAAACT 

TAGATGGCCTGCAGAAGCTCCATTACTGGCACTTACCTTGACTGATGGATAAAAA^ 
2 0 GAGAGGGGCTAAATCGCCCCATGACACATATGCTGGATGCCCAGCTGCCCATTTATGGATCATGCTT 

AGAArrCCAAGGACAAGGGCCAAGCATGGAATAGCAACCCTTCTCTAGTATCTACTT 

ATCCACACACCATCACCATGAGATCGCAGCTGCTACCTTCTTGTCCCCGTCCTGCAGTTAAGGATAATA 

TG TTCCTT AGATG TCACC CATATTTGGG GCAAAG CAAGACT ACC CGTG CT CG CTTCGGCAGC ACAT AT ACT AAAATTGGAATG AT A 
CAGAG AAG ATT AG CATGGC C CCTG TG CAAGG ATGACACG CAAATTTGTGAAG CG TT CCATATT AAAAAAAAAAAAAAGACTCT AC C 

2 5 CAACATAGGTTTGATTTGCTCCCCATAAAATGTGGGAGGGGTGC^ 

ATGTTTTCCACTCACCTAAGGTAAGCACCCAGCTCCTCTTTCAATGTTATACT 
TGCCTTTTCTGTCTCAGTATATTATTTGTATGGCT^ 

TTTTGGAGAGACTTCAGTCCATTGTGGTGGAGAAAGCATGATAGAGGGAGTGTC 

GTXSGGCGATCCCATGGTGGTGGACCAAGATTCAATGGTTAA^ CCTAGTG ATCCACTTCCT C 

3 0 CAAACAGTCTCTATCTCCTAAAGGTTCAATGAAATTGTACCATCCAGT 

TGTTTCCGATTAAGTCAGAACACTCCATTTC^TCATGTATAAAGTGAGGCT 

ACATGCCAAAAATGCTTGATTVAATTATATTCATAACAGTTCATTATTATGATTTATG^ 

GTTGGGGAAATATATTTGAGGGTTACCAGAGGCTAACAATGTGAAGATATTGTACTCTAATGGATG^ 

CTCCACCAT AGCTGAGAAGCCAAGCTG CTCAAAACCTTCCTAGGG AGAGAC CCCTAT AGG CGATGTCAGGAAGTGCTGTCAAATCA 

3 5 CATTCATAAATGCAAAGTCACTTGTCACATTCAAGATCTCCAAGGGGAAGGA^ 

TCTTTTATACAAAACAAAAAAACAAAACAACAAAA^ 

G CCAAAG C CACC CT AAC CTG AAAGAT ACT ACAG AAGACTGAAAAT ACG TCAGAC C ACAAG CATTT AG T AT ATGAGAG ATC C AAG CT 
GTATAGGACTTTTTAGTATTATATTAGTTTACCTTTTGTGTGGTA 
CACATGCAGAGTTGGTTTCTTCTACCATGTAGGTTCTGTGATTTGAACT 
40 CTGAG CCATCTCCCTGGCCCAGTGTGTT CATGCAAAGGATGTAGGTGGTCAGTG CCTTAGT AAAGAATGGACCTGAATTCTGTGAG 

TCGTTCTGCCTTCTCCCAGTTGCTCATTGATTCTCTGACITATGGAGGAm 
TATGATAGCCAAGGGGCGCCAATCACACCTGCAGATGAGCAGTAGAGTCTCAGAAAGTCG 
TCCATCGGAGTTCAGAATGACATCATCCGAGAGGTTATGGTGTAATGTG 

GG ATGTGG CT AT AG AAATGT AGGAGGGG CAACTCTGATTCTGGGGTGTGTGTAGG CACTG CCATATGCCTCCACGTGTGTTCTGGG 

4 5 TGGTATTTTGAAAAAGGTGAGTCCCTATACACGTGAACAAAGTGTCAGCGGGTGGCATGTGT 

TGGAGACAATGTGCTTAGGCACTTCATGGTGAGACTGGAATCTCAGCT 

TACTAAGGCCCAGTGGTAGTGGGGTGGAGCTAAAGTGAGATTGGCCATGGCTTTTA^ 

GGCTTCACTGGACTTTTTCCCTTTTATGTATCTTGGAA 

ACAGTTAAACGAGCAGAATGTCTGCATTCCACACCTGTGTACAGTGAACGAGGGTG 

5 0 GTGAGTCTATGGGGAGGGGAGAGTAGGTGAACATAGAAACAAGAATAGAAATGATGTGGTGTGTC 

ATGTCTCTGTCTGTATCTATGTGTGTGTGCTTGTGTGTGGGTGAGGGTGTGTAATTGTGTGTT^ 

AGGCAAGATTTGTGATTATAGATAAACAGTAGCTTTCTCTCTCTGGTTTGTATTTG 

GAGTTAATGTCTTGTCTGTTTCCTATCTGACCCTCTTTAAG 

5 5 TCGAGAAAAAGTGGCAGAGAAAGAG AGACAG AGAGG CAGAGAGG CAG AGAGAGACACACACAGAGAG AGAGAG AG ATTGAG AAACT 

GGAAACAAAGAATCGTTGTCTCTGAGCCCTGGCAGGCAGGAAACCCCAATCTCAGGC^ 
TCAGACTCCAAGTTTCTGAATTCAATTTGACAGCTTGTCTGGG^ 
GAACTGTTATTCCTGTCCCAGGGTGTGCTGTGTGCTCTGTTATTGCACACGTGTC 
ATTACTGGAAACAGTTTTCATGTTCCCTAAGACAAGTGTAAATTTGTTCCAGGCT 

6 0 AACCTGGTGAGAAGT AGGT AG ATGGG CGAGGGGGG ATGAGGGGAGGGG AG CATTTGCCCTGGTGTTGTGCATGTACACACT ATTCT 

TACTAGGATTTCTTTGAAGTACACACACACACACACAC^ 

CTGGAAGC^GGACCACACTGGGTTCTTTGGATCTTTGACATGGTGTTGTGW 
TGAAATCCAGGGGCAACGTGTAAGTTTATt^CJVCAGCACCTTCTGGACAA 

6 5 CTGAATGGAACTGTACCTACTATGCCT 

CCTTCAGAGGCCTTGGCCCTATAGCrTTGCCTGGTTCTAGTCTCCTCT 
CTCCAATTTAGTAGTGTTTAGTATTGGACAGATTCC^GAGGAC^^ 
CCATTGG ATGGG CAG ATGAAAGTCACCCTTAACNNNNNNNNNNN^ 

7 0 CCCCTTCCCCTrCCCCTTCTCCTCCTCCTTCTCTTTCT^ CI 1TCTT TCTTTCTTTCTTrCT TTCTTTCT 

TTCTTTCTTTCCTTCCTTCCTTCCTTCCITCOT 

TAAATACTTAGTCATGGAAGATTTTTCTATGCCATAAAACCTTATTAGTTTAATGCT 
CCCAAATCACCATGTTGGTTTGAAGCAGGGCAGAATCTGTTCT 
TTCCTCTTGTATTTCGACCCAACTATGTCTTAATTACTACrc 
7 5 G T AAAG GTl"r CTTT AAGCT CACAG TTT AAGATTCATCATG GTG AGAACTr L' ci\H^CrGT AGG AG TTTG AGC CAATT'oT 1 CACATTG C 
ATCTCCAGTTAGGAGTTAGCAGAGAAGAGCGACTC

GGAATGGTGCTGCCCACTTCrrACCTCAAGTAACCTAATCAATTGTTCCTC^ 

CAAGTTGACATTAATAACATCACAGCATGCCAAAAGACCCATTTCTTTCA 

CTGCATCGATTCCTGCAATGGTTCAAAATTTCTCAATGTTACCGGCAT1TCT 

5 TAGATATATTTTTATAATTTGC^TGATG^ 

TTCAGTAA&AGGCAT ATT ACACTGAGT AAAAC CATTT CCATGTG TTG CAATGAACTTAATCTGAAAAAACGAG AGTGAGACACAGA 

AGCrCCCrCCATGAAGTTAACTTTTCCTGCACTGGGGCAC^ 

CTTCCATTGCTTCTGCAGTGTTTTCCCACCTCAACCTTTACA 
1 0 GrrTACATCAGCATCCTTTCTCTCACCATAGGOT 

GTTCTCftTTTAC^TCAACATGAClTTCTlGTGC 

TAAATGAGGCTGTAGGTACCCATGTATGTGTTCTGCGCTGCTAATC^ 

AAGGAATGTGATTCCTGTAAACACTTGAAGGTAGAGGGCCAAATGTACAA 

GGGAACTTTCCAACCnXIACTCCATTCCTC 
1 5 GACCATATCAC^CATTTCTTGGTAGCAATTCACACTTGCTAAGTAAGCTTGGC 

TATCAGTCCTGCCGCCTCTACCAAGCTGCTTCTTGCGTTAAAGATCT 

TTCATTTTAGTTCTTAAACATCITCACCATGTGCTCTGATTTGTGATGG 

GTTTCAAGTTAACTGGCAAGAAAAGGGGATTGTCTACAAGTGCTGTT 

ATACGTGGG CTTTG C CCAATGT ACTGATTT AG CT ATTCCAGG AG GTTTGTTT AG CT AT ACTGG AAAGTAT AGCTGG CCT CTC CCAT 

2 0 CCAGGGCTTCCACATCGGCAGTTTTGACCAACCATTGCTAGAGAAAAGAGC^^ 

AAATGACACAGTATAGCAACGAATTACATTGTGTTAGGTATTTTA 

TCCCTTACTTACACAGCAGTGTGTATGCATGGAAGTGATACCCCAGTCGTGAGATAGCGTAAGTGGT 
TTTGGGAGGATGATCATGCGTTCTATTTAGCACTGTACCATGAGAATTTGAGCAT^ 
AACCATTTACCCTCACATGCAAAGGAAAGACTATGCTTACAAAAAAAGCTTAT^ 
25 GTCTGTCTGTCTGTCTGTCTGTCTCTATCTACCTACCTACCTACCTACCTACCTACCATCCCCGTAAAG 
TTTTTTTTCCTCTCTATTTGAAGAAGGAAGAAGAGAGAGAAGCACAGA 
CTTCTAGTTAAAAGGATGGGCTGTAAGCAGCACCATTTATTATTTTAATA 
TGGAGAAGCCTCAACTTAATATCACAAAGAAATCATTTCTCCTGTAGAATCC 
AGTCTGAGGCTGGCATGGGGGTGTGAGGGCTTTCCCCTCTTTTGGGAATCCGCAGGTCTCACAAGT 

3 0 CGCCTCTTTGGTACAGCTTGAATTCATGTCAGGCAGAAGC 

AATCCCAAATCAATCCTACCTGCTCAGTGTGTTTACCTTACTCCGACGAACTGTGTCTCTTCC^ 
TTTCTGAACCTAATTCCAGATTATCTTGTC^TCTTTAATTGTGA 

T AAATAAC CTTGACRGTTG ATGGATTTGCTGGGAAAG AAAATCTAATCTTTTTC AATTG CAGTCTTGCTCT 
ACTAAAGGAAGAATCAGAAGAAATTATCTTAGCACAAGAAAAAATTTATTrCTAGCCAGGCA 

3 5 AGCATTTGGGAGGCAGAGGCAGGTGGATTTTTGAGTTCGAGGC 

CAGAGGAAACCTGTCTCAAAACAAAAAACAAAAAACAAAAAAACAAAAAACAAAAANNNN^^ 

AAACAAAAAACAAAAAAACAACAGGAAAAAATT AAGTT CTT ATCTCCTATG AAT AAAAAAACCTGGAAAAG AATAGTTTGGGGAAA 

TGTCTAATGGGTAAAATACTGCACAAAGGTGAGGTTAAATTCAGGTCTGCGTTACCCACATAATTAAAAAAAA 

TGGCATGCATCTGTAATTCCACCCrGAGAATAGAGACAACTTGAACCTAGGGGCTCACTGGTC^ 

4 0 AAGCTTCAGGTCCAGTG AATGAC CTTGTCTC AAAAT AACAACACT AATCCT ACT AAATACCAATT^ATG AT ATAG AATG C AGTAAAT 

AGGAGGAAGATACTCAAATACCACCTCTGGCATCCACATGTGCATACCAA^ 
TACACACACACACACACACACACACGCACACACGCATGCACGCACATGCATAC^ 

TGCTGACTGGAAAAGATTAACTACAATTCCATAGGAGACTTTGGCGTGTGTGAAGGGTTTGTGTATACATGA 
CAAAGACCCTTGTGAGGATTAAATATACTGTTTGCrCTTC 
4 5 TACTTTTCAGAGAGGGGAAGTCACTCACCAAGGTGCACACAGCACATAACTGATGATCTG 

CGGAGATCCCGAATTCTTrCCTTTGCAGTGAATGGCTTTACTGTT^ CTTCCCC CAAAT AGCTTTAAAG CT 

TTAAATGAGACCCCAAAACAGTCCAGTTCTTGATAAGATTCAGTT^ 

ACTCTTTGAGCCTGA^GCGTTGGCTGGCCCAGTTACTCAC 

TG AT AGTCTG AGT C AT ATTTTC AATTT CT AAT AAAATTCTT ACATGGC CACAT ACGCTG TCT CC CAG CTTTG AACCTAGGGT CTCA 
50 TGCTCATTTGCATGTCTGGTTTAGATCATGTCTGGGATCCCTTGAAGTCTGAAAC

AGATTCTTACCAAAACTAAGGCCATTTTCTCAGATACTCGGATGTAAGATTCCACGTTTGACATTG 

GAACAAAGTGATGCAGTCTCCCTTAGTCTGACCTCACCTGCCCACTTCTCCTATGATGACT 

CTTTAGCATTTTAGCTGTCTGTCAAAAGGCCAGTCrCGGGAACTGCTC 

CATGGATGGGGGAAATCAGACTGACTCTGAATAACACGAAACCAGCAAACAATCAAAAGATGT^ 
55 TCCTCCAATATCAACCCACTCATACTACATGTGAAATGACAAAATGGAACTAGCT 

AATCCCAGTACITTTCAGAGGAAAACCATGGAGTTCCCAAGAGCTCCAATTTAATCATC 

TCTACTGCATTGTTTACTATTTGTTCTGAATTAAGTTCCACA^ 
CGTGGGTCTGGCAACAGAGCAGGCATTCAACACAACTTCATTGAATGAGTG^ 

6 0 GCTAGTACTGGGGTGGGGCAAGAGCAAATTTGTTGAGGGGGCAGAAGTGACTAAGGTCCGGTCCCTCTGCT 

TAGCCAGAGAGGTAACACACTTGACACTGGATGGTTGAGCAGAC 

GAAGGTGTGCAAAGGTCAAATCGTGAGGCAGATGCAGGATGCAATCGCCCATGGCAAAGGAGGACGTCG 
CTCTGTGTGCAGAGCCAGAGGGCAATCATAAACAACTCTTTATTGCCTAAA 

TTTTAAACCTGACATCCATCTGACCAAAGAAGTATGCGATTGAGTCGTGTCTACTGTAACCCAGTGTGGAGGGG 

6 5 TAC CCAATT CAG CT AGT CAATCT AAATT CCTCTT C CACGT ATGG CTC AAAG C CC C ACTTTG AGGC CTT ATC AAAAT AAAT AGTTT A 

TTTAGACATCTGTGCCAITTGGACAGCC^GCCCACGGGGCAGACAAAATGAAAGAAGAG 
AAGGTGGCAAGGCTGTGCCCAGAGAATGTGCCTCITACTrT 
TTTCCTCATGAATCTTCTTTTCCCTCTGCACACRGAATATCTTAAACACTA 
AAGCTGAGGGGCTGCTTGAATTCCAGGACAAGATCCTTGAGAAGA^ 

7 0 CCIAAGCTAAAGCAAAGGATTCCCTTTTTAACTTCCTCTCC 

AAAATCCCACTGGAGGGAAGGCTGGGTGAGCTTrAGGGAATGGATCAA^ 
AAAGAGGAACCTGAGCACTCCTCAGTCATGGAAGAAGCAAAAGCTGACC^ 
TTAAAACCAGGGTTCAGGATGACCCACCACTTTCCTTGGAATGTATTCCTGGAATAACTGT^ 
CTTTGGGGAl"ICTTGGGCCTCTCTTCCrrT 
7 5 GATGGAACAGG CATCTATATATCATGTC CAAAGTTCCCATCT ATGG AAAJVCACAATTGATTCCCACAG CAGGTCTGTTAAGATCTG 
TTATGATGCTGTGTAGGGAAGCAGAGTTGJVAGTTCAAAAGACAAA^
GGATTAGATTCCAGGGCCATGGATGGCCACTGAGACCACCATTAAGCCCT 
ACTCTGTTTAGACTGCCCATTTTCITCCTAAGTTTCriT^ 
TTAGCAAATCAGAGATGATTTCAGATGAGAGACTGTTCCAGAGTTGTGAACTC 
5 G GTGTTCAGCCCAAAGGAGAGTCCAAAGCAGAGAATTCCAGTCAGGTTGGTGGT
CTACTAGGAGAACATGACCTCGAATAGAGACTCCA^ 
ACAGATGACACCTTACTTATTGGCATCACTGAGGTTGGCAAGCAGAC 

CTGGCCAG CAAAAAACAGAAATTCAGCGAGAAGTGAATC'l 1 ITT T CCACTAAGCAT AGAG AGCAGGGGGCCCAGTGTGGTGAGGGA 

CTGAAAGAGAGGCTGCTGTGTGCTGCTGATGGCTGTGTGTGGGTGATGTCATGA 
10 TGCGTCTTGGACAGTTAACTGGAGGCATCAGGAGAAAAC 

ACTGGTGCGTCTATGCCATGGTCACAGCCAGCAAAAGCCTTCCCTTTCCAACGGG^ 

GCTGTCTTGGAATCTAAGAGACCATTTAGTGTCGC^TraVGCCACACTG 

GATGGGAAGCAGAATTATCCACAATGGCTGGCAAAGCCCAGAATCCCCGGCATGAGGCCCAG^ 

ACCTCAGGGCTGCCTGAAACAAGAAACCTTAGGCAAAGCCCACTGGTGGCT 
1 5 TAGGACTCCTTAAAGCCAGGGGGTGTCAGCCTTCCCTGAGGCAGAAAGAG^ 

AAGCCATCTGCCACCACAGAGAAGGTCTAAGAGCATCACCTCGGGTGCrGGCC 

CCACCTCAGA Cn ' 1 " ! L I X CTCTAAAGCCAGATGTAGCTGTCAGGATGCGGGCTCCATGTGCCACATGTGGGGCGTCTGTATGTGTG 
CACTTGGCTGCTTGGAGCCGCCACAGCCCGCGGGAGAACGCTGCACTATCT 
AGCCAGGAGTGTCTGTAGGAATTTGTACAGCAGGCrrTGCCCCCTTCTCT 
2 0 TGCCTGGTACTGTAGCTGACCTCACTGCCAGGCTTTTATTTGGGGGACA 
AACACT0CCCTGT6AAT6ATGGAACCAGTTCTGATAAATGCCGCCT 
TC!ACAAACTAAGATGCCGATGACCACCCAAGATGACACAACATTACG^ 

AGATAAATGAGTGGTGCCCAGCTCCTGACCTCCCrGAAGGAACATACTCTCAGCTTCCACTTC 
CCGGTGTTGACATAGTCTGTAGTCCTTATCACCACAGGCCTGAACTAGCTAGAGACCTAGTTCAAGAAGGTAAGCG 

2 5 TTTTCTGCTAGCTGACAACAAAAGGGATTGTGTGCATTTAACAAGGAGAGACACTC 

GAGAGTGAGAAGGATGGGGAGACGTAGAGGGGATGCGTATTTAACAAATGGG 

AGTGAGAACAGCACAGATCATCTCAACGCTAACGATATCTTTCTCCTAAAACATGGGGCACA 

TTGACCCTTTCACTTCAG<rrCCAAAGGTCTCCAATGCTTTGAGCATCC^ 

3 0 TTGCCACCTCCCTCTCAAAACACrGGTTTTTCATTTCAGTCTTG CACGAAGGAGACAT ACCCTGCACCACAACTCOTCTGTGGTG C 

ATTCACCAGCTTGTGCAGTAAATGGGCAGGCCCTCACACTGCTTTGCAGATT 

GGGGCTGATGGGAGAGAAGAC1GTATTTTCCATGTTTGATTCCACTCACTGGGTCA 

TGGCCATTCATTGCTCAGTGAACTTCATAGTGACAGTGTTGCAAGGCTGGTTTATGGA 

CTTCACTTATGGTAGGACCGAGTAGT ACTGAGTTGGAATCC CCAGG A CGAACTCAGCATC 

3 5 CC CCAC CTTCG AGGTCTGCTG AATCTTAG ACTG AAAAT AG AAT CTT AGAAAAGATGGG CTG CTG TGG CCT AC CTCAAG CTTCAACT 

TCAAAACATTATTCCTCACCTAGAATTACCAAAGAATATGGTG^ 

C CAGGAGGAGGGCTTTCCGAAG CAAGTCT AGGTTGAAATTTTT AAGGCAGATTTT ACAAGTGCT 
CATCTCCTGGAACATAGTGGGTCCTTCTCCTTTTTAGTTACTCACCT 

TGAAAGGAGGGAACCTCAGTTGAGAAGGATGTAAGTCTTTATCTTAATTAGTGATCGATGGAGGATGCTGTAACG 
40 TGGTGTGATCCCTACATTGATGGTCCTGGGTTCTATAAGGAAGCAGGCTGAGCAAGCCATGGA 

GGGAGTATAAGTGAAGCAAACCCTTTCCTCCCCAAGTTGTTTCA 

CrTTTCGAGGTCATCCACTTTCTCTTCTGTCCTTACAAAATGACCT 

ATGTCTGGAAATGTCTTTGCTCTATGTCTCTCAGAGTCA 

4 5 CCTAAAGGTGATAGCTTAGTTCCTACAAGGTCCCAAAGGGCCACCCAACCCTTC 

TCCTGTATGCCAGCCATGCCCACAGTACCACTAGCCTTTTCATGGCTCTCCCCCA 
TCCCTAAGGCCTCTAGTGCTTCriTGCCTTCCCCTCCTTTGTCTCCAATAGTTATTGACATCTCAACC^ 

T AAT ACCT GAT AGTCTTTCAG ACTG TGTCATGTCTAAATG C AAGG ACTTTG TTTT AT ATGACTG ACTT CT CT AGTGCATG CACACA 
GCACCGTTCAGJATGTTGATTCTAAATGATTGACAGGAGACCGAACTAAT 

5 0 GGCTTATCCTAAAGCTGAATGCACCCTGTGGGACTCCACAGATCCCCTCAGAAAGTGTCCAT 

TTATAACAAGGCTAGTTGACTTAAGCTGTTTTGAGGAAACAAAGAAAAAAA^ 

GCTGTGTCTCCTTGTCCTGTTCCCTGCCTCCCTCTCCAACCCCACCCCCATGCTCTGTTTTGCTTTT^ 

TATAATTTTAAAAACTGAGCTGACTTTTCCAGACCCATGAAGAATGTGCGTGTGGAGATGG 

CTTTTGGGTTGTCITAGCCAGTAATAGTGAAGCCAGGCAATCAGAT^ 

5 5 CTCCCCAGCCCTTCCTTATTCTAATTTGTTTGCGAGGCTCAGGACAGGCC 

TGTTTGATGAGGTTTGTGTTCCATTTATTCAGATGGTTTTTATGGTTCCTT^ 
TGTATCCTTATCCCTTCTGGTAGGCCTGTGCCATGTCTCAATGGCTT^ 
ACTTGAGCAGTTTCTGAACATGGTTGTGGAAGAGAAAAGCTGAGTCAAA 
AAGATCTCTGTCGGTTCATTCTCTGATCAGCCCAAGTCCTCACCACAATTCT 

6 0 GCCCCTTTCTGCTAC^GACATGTCTGAGAGCCTTGGTCAGCCTA 

CTATGAGGTGCGAGGCTCGAGTTGGCCrrTACAGGAATGTCATGGGCCTG^ 
CATCACCTCTTAGATTTTCATCCTTGCCCAACATCCTTGCATCAGTGAAAAGAGTTT 
TCTTTTTAGAAAATCTTGTAAAGTGGCTCAOVAATATTCrC^ 
ATTTTGCGTCTACAAAGACACGGGTCAGGGGAGACCTGTGGGCrTCCCAC 
6 5 CCAATCCTCCAACITGAAGTTCAGGATTCTTCCTACTGCTTCAGCT 

AGGGAAGACGATACTTCAGGAATGTAGTGGTGCTTAACAAGTTATTGCT 

TCTT C AT AGAACGAAC CTT C CTGCAAAAC ATACAGAG CTG ATGTGTCCT CTTTTGTTCTTTCTCAT AG AC CCTTCTTATGACTCTG 
TGAGGAGAGGAGCATGGAACAATAATATGAACTCTGGCCTCAACAAAAGTAAGTAAAT 






AGATGTCAGTGTGAATTTGCAGTGCCTGCTCAGATTTT^ 

GTACTTGGTTTTAGAACTTATAAGGTTTGAGX3TTTTCTCGGGACA^ 

ATTTCAGTCAGGACGTTGAATCTCTGGGACATATATTCTCCA.CATGACAAGGCTC^ 

GAGATTAACATGAAGGATAACCTAAGGGATGTTGTGCATCACAGACATTAAT^ 

&TCCACCCAGTATCTTCCATTAAGGGGCAGTGAAAGAGAGCGATTCACT 

NNNNNNNNNNNNNNNNT ATG AGTG TG CAG TGTTTGTTCACTCTGCATG CA6GACATGACG CT ATGGGACTCACAAAGT AT AAGCCA 
GGCAAGGACCAGTTATTTAGGCAGGAGAGAGAAATCATGGACACACATAT^

CCTGCTACCGTTAGAGCTACTCAGACAGGCCATGTCTACGGAATGTCTAGGGAACAGGAGA 

CAGGGACGACCATGGTGGTATGTCTGAC^GTGAGGAACCrAGGTTT 

GG AT AGTGAATGAT AAT ATGG AAG T AG ATT C CAGATCTTCAGTCCT AAGG AGTG CTG AAGAGTATGTGAGTCAGTTTGAAGTATG A 
GAGGTCCAGTTACACCGAGCCACCCATCACACAAAAAGCAGCAT 
TTGGAGAGGCTCTCACGAAACACAGTGTGACAGGCACCAGGTGGTCTGTGCT 
GCCATGTCTATTTGTGGGGGACACGTGTGTACATTT^ 

GAGACACTGG CTGTTCTGTTTT AT C ACT CTGTCCCTGGTCC CTTT'G AG ACACTCTCTT ACT ATTT CTCG 

GCTAGCCTGACTGTCCATCAAGCCrCGGTAATCTTCTGATCTGTGCCAA 

CAGACATGTCAAGTTTTTATACGGGGATGCTTGGGATCTGGGCCCATGTCCT 

GAAGCTCAAGGTCCTACATTTCAAAAGAATGGTCTAACCCCATGTCAGC^^ 

ACTTCATTGCTAACTCTTCTGCCCCGAGACCAGTGCCAGCGTAGCGGGTTTGA 

CATGGGAGAGTGTGGTCCCGGCAGCTCATGTGCCCTCCTTCTGGGAGAT^ 

CTCCACAGGTCCTCT CCTTGGAGG ATCACAGACCATGGGCAAGAACACTGAGCAG CGGCCCCAGCCAGGTATGTGATTCAAAT ATG 
TAGCCTCTCACTTGTGTGTTATAC^GTTGTGGAGTCAGTCAATGGTGG 
TTTTAGGCCTGGGGATTTAACCAAACACAGGCCAATAATATTCAGG 
' TXTITGTCATTATTCCATCAACAATACAGTACCGCT 

ATGGATATGACTAGATTGTATGCAAATAACCCATTTTATGTAAAGGACTTGA 
CAAGTCCTGGAACCA^^ 

TTCCGAAGGGTGAAAGGATTTGCTGAAAGACAAGCAGAGCTGAACT^ 
TACATTTTACGAAACCATACAGAACTCCCTTCACCTAGTA^ 

ACCCIT AAT AAAGAAATGACTT AG AG ACTGCAGT CT AT AATCCCT CGGT AG CATGG CCT CTT ATG ATGGC^TGC CCTTGTTTCTG A 

CCCTGACCCCGCCCCTGCTCCTCCTTGCTTGCTTCAACAA^ 

AGCCCTCTGT^CAGCTACCCCTCTGGCCACTTGTCTTCTGGGGAGAGC 

TTGCACTCAGCrCTGCTGTCrTAATTAAGAATCTGAATTAAGATCTGA^ 

CATTTCCAGCCAAAATATAGAAAGAGAAATTGTAGTAAAAAGTGA 

CAAGTTCOACAACAACAACAAAATTCAATATCTACTCAAACAA 

GCTTCAAGAACTGTTAAATCAATTCTTTTTAATTCATTAATAATTACT 

TGACTTCATAGCTAAAATACAAATTCTGCTTAGTGTTGAC^ 

CCAGCAG CCCy^CCCACTCTTCAT AGCTGAG AAAACAAATACCC AGAGGTTGTC CAAACCATG ATTT AGTTC ATG AC AT AG CTGGG 

ACTAGAACCCAGGCTACCTAATGGTTTTTATAGTCCATCGGAAAACT 

AATGGCTAGTAATTGCTGAGGAACCTGCTTTAAGCATGTTTAATAAGCA^^ 

AGATGCTTTCCAAACGTGTGGGCGATTTTGTTTGGGGGGTAGTGACAGCATAATCTAAAAG 

GTGCTATTACTCGTCTACGTTGACTGGGCCATAACTTTGATCAAGGCCT 

GAACOTTTGAAGTCTCCTTCATCTTGATGCTGCTCTCAGATC 

TTATCAGATCCTGGGGCCAACCAGCAGCCGCCTAGCAAACCCTGGTGAGT^ 

CGTTATATCTGTAAGGTTACACAGAAACCTTACTCCTCGGTACG^TACAGGCACCTGTC 

AATATTCAACTGTGCACAAACAAGAGGCAGGGCCTGTTCATGAGTAGC^ 

CCTCrTCGACAGAGGTTTATGTCGATGGCAAATGACTC^ 

GGGCTCATGGATAAATGTCCCCAAGTGTTGTGGTTTTGACITGAGTGCTAT^ 



CAAACCATGGAGATGGGGAGCTCAAGGCAACAAAAGCCAGGCCCACGTGCCTGCTGCACCT 
CCTCCCTAACCTTTAGCTAAACCATTCCTO^ 

GGAACTGAACACTAAGAGCCGTGCACTTTTTCTGGAAAACAGCCCAGCTTAGTGAACCCCAATTCT 

CTACCCCACAGCCACCCCCCAAGAGGTTCTTGGATACTTGACACTCTTTAAGGCCA^ 

AAATGGAGATTAGATAGTGACTCAGGTCTAGGTAGAAAGGTCTAAGGGGAGGGTCAAAAATCT 

GCTTCCCTGAG ATCCT AAGAGCTTGGGACAAAGCCAG CATTCAC CACAACCATCCTTTGGGACAATCTGTCTrAAGATCTAACCTT. 
GCTTACTACGCTCATTTTGGCTGCGGCAGTAGAGTAACAGTATATACrrGTACCCTTTTTT^ 
AGTACACTGTATCTGTCTTCAGACACACCAGAAGAGGGCATCAGATCCTATTATAGATGGTTC 
AATTAAACTGAGGACITCrGGAAGAGCAGTCAGTGCTCrTAACCGCT 

AG CTGGAT AAG CCGGT CTTTG ATG ATGAGCAAGCTTTTG AAG TG ATGG GTGTGTG AATTG C CC CT CTGTG ATTGT CAAC C ACT ATG 

TATGTCTGTCAAAATATCACCACATGCCCTGTAAACGTGTACAATTATCATGTGCCT 

GGTTGTCTCCACTTTCCTTTTTTGTTTTGTTA^ 

ATAGGGTTTCTGTGTAGCAGAGCCCTGGCTGTCTTGAACTCCTAGACCAGGCTAAC 

AAGGACATGAGTTGTCATGT/TT AGGAATCAG AACTGAGACT A(TT<3 CTG CATGCAGGG AGTGGGCAGATCCA 

GCTGTGGCAGTTTCTCCTGGAACTACTGTCCGACAGCGCCAACGCC^GCTGTATC^CCTGGGAGGGGA 

TGACGGACCCTGATGAGGTGGCCAGGCGCTGGGGAGAGCGGAAGAGCAAGCC 

CGATACTACT ATGACAAAAACATT ATGACCAAAGTGCATGGCAAAAGGT ATGC CT ACAAGTTTG^ CCCAGGC 

CCTGCAGCCACATCCAACAGAGACATCCATGTAC)AAGTATCCCrCTGATATCTCCTAC^ 

AGGTGAACTTTGTCCCGTCTCACCCATCCTCCATGCCT 

CCCACTGCTGC^ATCTATCCAAACCCCAGTGTCCCCCGCCATCCrAACACCCAOSTGCCTTCACACTT 

CTAA<^CCAGTTGGCC1TCTGGCTGAAGTTCCAGC!TCTCACITTACT 

GATAAGAAAACTGGATGTTCITTCTrTTGGATAGAACCTTTC 

TTGTTTCCTCTACCTCOVAAAAAAAAGATCATTCCATGAGCCAGTCC^ 

AATATTTAGGTTACTGGAACGGTTTATACCATC^ 

C^GTTTCTTATCAGCACACGGGTCTCATCATTGTAGGATTCCCTACGATCAT 

AGACTT AGT AAAAGT CAAGG CAGGATG TTTAT AAT CTT AT CTT AGG AGG ACT CAATTCAGTGGATGG CAACTGGAACACTGG CT CT 

GAGGCCAGTGAAGTTTTTTCCCCAACTGGAATTTAAAAGATGTGT^ 

CCTCACAATGGGCAGTATGTGTTTGGGTCACTCTTCTCCTCAGAAATAGTCAGAATATC 

ACACTCCTGACGGGCAGAGGATTAAATAACATTTTTTTGGAGGGTT^ i T i' GTTTG TTTT TGTT 

TTTGGGGGTTTTTrTTGTTTGTTrTTrGTTr^ 

RCAGTGAGTCCCAGAACTTTGAAAAGTCATGGGGATTTCTAAACTCAGATTC 
AGGTO^CAATC^GAAGGCAACTAACTGTATAAATTATGCaGAGTT 




WO 03/008583 



PCT/US01/51291 



AAAAATTAAAGAATAAGTAAACGAGTTGACCTC 

TAATTTGTACATCTTTGTCAATCTGTACATTTGGGCTATTTGTACGTTTTTC 
TTGAAAACGAGGAGTTAAAAGTCACTGAGTTTTTAGGAAGAAAA 
CTGTTTAGACCTACAACGCTTrAGAAATGTTTGTA 
5 ACACG AGG CACT CT C CIXXTIXTT AT AT ATGCAAT AT ATTTTT AGATGTG CAAAT AT AT AT AT AA'f 1'ITTCAGGT AATCGTG ACTTTT 
TAAACGATATTGTTAAGGTGACAACTCTTAGTCCACTGAAGACT 
■ri"rrrCrTCTTCTCrrCAGAACTGGTGACATTGTTl M rCTTATT^ 

G CTGG TG TGTGTGTG TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT ATGT AAGAACACA 

AGAAGGGAGTTTTTATCAAATAAATCAGTTCTCTC^ 
1 0 GGACTCCAGACACACAAGCTTCCAAAAAGCTCTCCCGCTTCCACCTCCCCT 

TCCACGTATGGCTTCTTTCATGAGTTCCAGGGATCAGAGTCAGTCATCAGGCTTA 

TCATTGGCCCCGAGGAAAAAAATAACTTGCCTACTTATTCTTGTACCATCGATAATCTCCT 

AAGACACAGTATTCTGTGAGTTTCAGATGCTTGTTAACAATGACGATTT^ 

TTGTAAGTGAGAGACAGACCTAGGTAAANNNNNNNNNNNNNNNNNNW^ 
1 5 CAGCAGCAACCGTCTCCAGGACCTGGCCAGGTAGACTTGACTCTGTTC 

G AG TCT AGCCTG ACTT CAAC C CCACACT AGGGTCTT ACG T AGACT CT CTGG CTCT ATTTTTCAGCCCAAT AG AGAAG CTTG CTTCC 

AACGGCITCAGCGTTAAGCATCATGCTCCTCTCCTTTACAACAAAGATGACT 

CTCATAGAACTAGTAGCTCTAGAAACCCCTTAAAGAAGTTTGCAGCCTCTGG 

CTACGTGAAGAAGATGAGAGGAGTTCTGAACACTGGTCTCCAACTTCTA 
2 0 TATCCTTTATGTTTATGTATATGCTC^TTTGTATCTTGCATTGACAAAAT 

G ACTGAAAAT ATT CAGAAAGTGTTTTAAATT ATTGGGT AAAGAGAGCT ATTGAGGTT AA CAATTG CTGCCCTT ACAGAGAACC 

CAGGTTGGGTTCCCAGAATTTACATGGGGTGACTCACAACTGCCT^ 

CCTCAGATATCTCTACACTCACAGCATATGGTTAC^^ 

ATATACTGACTCTTTATTC^CATTTTAATOIAATCATGAGTC 

2 5 TTATGTAGATGTGATTTAAAAGATAGGAGAAGAATATGTAAACAGTGTGCCATTTTAAATAACT 

GGTGTCTTTAGGGTAGTCCTGGAACCCATGCTTATCCAGAGGAAAAT^ 

AGAAGGTG AT ATG AAAATGTTT AAGCTG ACATTCTGTTTTCTT CC CIT ATTT AT ATTG CTTCATCC CTCGTT C CAACATT CAAAAC 
GGAAACAGTGAACATTACTGAGACGAACACATATATTTAGAT^ 

GGCT CATGCAT AGAAAG ACATGAGGCAAGT CAG TTATGTT CAT AAGT AGTCAACTTTCAAGTCTCAT AGT CAACT 

3 0 ATGTCAGTTTGCAAATCTACAAAGTCCTATTAAGGGGGA 

TCACACCATTAGTTGACCTTTAAAAACGTGTGTGTGTC 

CCACCTGGGTGCTGGTAACTGGAGTTGCCAGATGAGGATGCCTACATCCTAACCCT 

TCTCAG CCACTGGTTAGCCCTCT AG ACAGTTTTGTTTCCTACACAAAG CTGCAATG CTTrCTCACAATCCrGTACCCAGAGTGTCA 

3 5 TGGTGGT^CATTCCATGATCCAAAAGCTGGTCTACCCTGACAGGAGCAGAGCAATTTC 

TGACATTTAGTGCCCAGATTGAATGGTCTCAGCTGACAAACAGGCATTTAT^ 
ACTTGTGCCATGAAGGCCATTACATCTGATAAACTGTGGCAGCAAAGCAGAGAGGGCACAGCA^ 
AACCGATCCTCTCCATGTCCCCACCTCAGCTCCGGACCTGTCCACAGCACT^ 
TCATGGCAGACTGCTGTAGGCTCGCTGGCTTATAAACAACAGAAACTCAGTGTTC^ 

4 0 AGCACCAGCAGATTTGTCATTGATGGGGGACTCACTGTTTATTCCTGAT^ 

CCCAGGCAGCAAGCAGCITTCTGACATAGTGTGTGTGTGTGTGTGTGTGTGTGT 

GAGCTCCTCTTTATAGGTCCACCTCCAAATAGCCACACATTGGGGATTAAGCTTTA^ 

TATTCAATCTACAGCAGAAATTTTAACTTCCTGGTGTCAAAATATTGTTGCTAC 

TTTG AAAATTTTTT AG AAGT C AAAAAACA AGTCATTTAAGAACAATT AAAGTGACCAAT AAT AC CTG CATG CAGTTGGGG AATG 

4 5 AGAATAACACATGAGTTCTCTCCATGGCAAATGAATGAGATGTGGTTC 

AAAGT ATCCAGAAT AC CAAGG AAG AAAACAATATCCATTAAT C CCAAG CAAG ACACAG CCAG AG CAAGAT ATT ATTTGTGGAT ATT 
TTAAAAACCACTGCAAATCAAAGATTATTTTTAAAAATTGCATCTATATTCAATGTATATTAA 
TTTCACTATTATATATAATAATATTTATAAATATATAAC^CATTATATATACAATATTCTCTTATTCCATAA^ 
CAATTATTTCCACTGTTCTACTGCATTAAGCAATAGAAGTAATTT^ 

5 0 CTATGCCATTGTATATAAGGGACTTGAACAACCAGGGGTTTGGGTATCT 

GGTACAAATAATTAAAGTrTl'ITTT AAG ATTT ACTT CTCTTACTTT 

GTGCCACGTTGAGTTCCCTGTACTTAAGGAGGTCAGAAGAGAAGGGTTAGATCCCTGTTCCT 
TGTGTGTGTATTGAGAACTGAACCCTGACCCTGTG CAAGAT 
GAGAAAGTTTGGACTCTGAGGACTTGCTGTCACACCGCACTTTCCCTTGTCT^ 

5 5 TTTGAAAAACIAGCTGGGAAAGCACACTTTGGAAGAATGTGCCTGGGGA 

CCTGTACCTTGCTGTGGTCTGACTCTTCACAGGGGTTCCTT^ 

CAAAGAGGTGGAGTGTCTGCGGAGACCAGGAGCGAGGCTGTAGACTAGACAGGGGCAGGGGTGGAGTTCTGGTGA^ 

TCACTTTTATCCAAGAAGACTTTGTGATTTCTGAAAACATGCACCGAGCATTT 

TGCACGCCTTTAGCCATAGACCCTCTAGAGGATTAGAAGAGGTCAGCTGATTAGAATACT 

6 0 GGCCACTTCTGAGAGGAAGTGGTTCAGCCAGGCAAACCCCCAGGGCTGCA^ 

AGAAATGCCGGGCTCATGTCTGGGGTCTGTCTCCCTCTCTCACTCCTACT 
GGCAAArrCCAAAGCACATAGTC^TTTACACGTAGCITAAAAGCTCTAG 
TACAATIGCTAGGTTAACCTTCTGGGCTTTCACCCAGTTAACTCGTGT^ 
TAAATGAGTGTGAATATAATCCCAGGGACCCTGCGTTCAAATTTrGAC^ 

6 5 T AGGGT AT CAT CT CCTG C CAGTG AT CCTGTCT AT ACA CACACAT CAAAG AAACG ACCTGG AAGT C CATTG CAAAATG CTG CACAG G 

ATGTGTGTGTATAGACAGTGTCACCTGCTAGATTTCTCAGGC^ACAATATTAAAACT 

ACTT'AATGAAGATTTT T ITTTTATTTGTGGTC^ 

TGTTTTTGTTTTTCAAATGAGTGAAAAGTAGGAGTAGATTGAGTTT^ 

TTG C CG TGTGGGTG CTTG TCCTTCACTGG CTCTGT ATGGT CACCT CAAGTCAGT AGAG CATG CAAGAAG CTTT AGGG CTG CAAAT A 
70 AAAGAGAATGGCCTAACGTGAGGCTGAAAGGGAACTTTGTCAGAAGTATTGGCTAGAT^ 

G CAG CTGCAATT C CCTGCCTTG AT AAG CTTGTG CC CAACATTCT CACAG ACCTACATTTCCT CCGAGG CTTCACGGG AGGTGGGCA 
T AC CGCTXTTCTTG TTTG CCACTGTGGTG ACACTGGTTCG CAGCAACTGGTCACTTG T AAATGTGG AGTGAAG CAGAAGAAAG CGTC 
CACAGTCAGCAAGACTGAGTTGTGAATCCAGCTGTCCCTCCCTGAGTTATGAAT^ 

AAG CT AGGACrGTTGT AGTATCATCCAGC^ ATATCAAAGAAG TAACGT AJ^G ATC AGGT AATGAAG TG CTTGCT ATCT GTTTT AAAA 

7 5 AAAAATACGGAACTTCAAAATATTATATGGATGTCTTAGTCGATTGATACTTCCTTTT^ 
ACAGAGTTAAATTAGGAGCTGAGAAGATXX3CTCAATGGA
CTCACACTTAGGAAGCCTAGCTTGATATCTGCATC 

TGCTCCAAGCTCAGTGTAGAGATCTGGCCTCAAAAAAAAAAAAAAAAAAAAAAACT 

GACAT CAACAGGGAGAGAGAG AGAGAAAG AGAGAGAGAG AGAG AGAGAGAGAGAGAGAGA CTACTT 

TATTGTACATAACTATTTTTAGATTTCAATCACTTGGGACAGGAGAGAGA 

CCTGAGTTGAAATCTTAGTTCCATACACATCT 

GCCTTGCCAGTTGTGAGTCTAGTTCCAAGTTCAATAAAAGACCTTGT 
XGAAACCATTCTCTAGCCCTGAGGATGTGTGCACTAGCCTGCTTCTGCACATACAAGG 
TACACAGTCACACACACAAAGACAAAAATTATAAAGCCTTGAGATCATCA 
GTCTTITCTTGGGCTGTTTGAGAGGAGACTCCTGACAAATGCrC 

GGTCTT CTCCTAAGGAGT C CAC CACAAGTCAG AG AACCAGAAAATTAACCTG CAT AGGTT ACTT CC CTGACACTCTCTGGCAC CAA 

TCA CriTI ' LTlT CTGCATGGATATITT 

GCAAGACTTCACTAAATTX5CCCAGCCTCTCCTITAAACTCC 

TAGTAATGCAAATTAGCAGCTTTGTGCCACCAAGCCAGCCCATTATCATGTC 

GGAATGCCCAGACTTCCCCTGACTTTCAGAATCTTGGGAAT^ 

TTAGGTGCTCGGGTGTGTGTTCATGGTTAGGGCTGCCTC 

AGCCTATGTGGAGGTACATGCCCTCAATGTATCTCGATATGAGGGGGTTCACT^ 
CAGGTTCTTCTAAAAATTTACATTTTTCCTTTT 

TATTAAATTCTAGTGCATGTGTGTGTGTGTGTAAGTATAATAAGTGTTCCT 
TTATTGAGAGACTTGAATTTGTTCTTCTCATTAGTT^ 
AGCTTACGCATGCTAACTrCCTTACTTGTTCCTrCAAAAACACAT^ 
TAGTGAGAAATGATTTTTCAAAGCCAATATCTGTGTGCT^ 

TAGCACTAAGCATTATATGCATACATAATAATATGCATATACTATGTGTTTATATAC^T^ 

TAAGCTAATATGTACTCACTCCATATTCATCT 

AGCGCAGGATTCCTCTACTTCTTCCTTTCCATATTGTAATC 

AGGCC CTG C CAAAAAC CAGTTTTT C CrCTCTGGACTGTC CG CACC CACACGTGACAGGCTCCTTCC CTTG CTT AGG CCAACC C CAT 

GGCCCTTCCCTTCACACAGTGTATGCCCTCTTCAGTTGTTCAAACA 

CTCCAGCCTGTGGCTGTACTCTGACTCCCATACACAACACACT 

GGCCAACCCACTGCCACTGCACCTACAGCTTCAGG 

CTG ACT CTG TCT ACACGT ACACAT ATG CACC CATGCATGTGGGT ACTTT CTGAG CT AGAAAATATT AG AAATGGGG AG AACAAT AA 
TTATAAATTGGCACCCTTCTAATGATCTTATGTTCCATTATGATATTTTTTTTCTAAATGAC 

MOUSE SEQUENCE - mRNA 

AAAGTGAAGTCACTTCCCAAAATTAGCTGAAAAAAAGTTTCATCCGGTTAACT 
GCACAGGGGAGCXSAGGGCAGGGCGCTCGCAGGGGGCACTCAGAGAGGGC^ 

GGGG CT ACGAGGTCAGGCTGTAACCGGGTCAATGTGTGG AAT ATTGGGGGGCT CGGCTGCAGACTTGGCCAAATGG ACGGGACT AT 

TAAGGAGGCTCTGTCTGTGGTGAGTtAACGATCAGTCCCTTTTTCATTCAG 

TGACTCCTTCGGC^AGTCCTGACTACGGGC^ 

AGAGTCAATGTCAAGCGGGAGTATGACCACATGAATGGATCCAGGGAGTCTCC 

GGTGGGCGGAGGCGAAGCCAACCCCATGAACTATAATAGCTACATGGATGAGAAGAACGGCCCCCCTCOT 
ACGAACGGAGAGTCATTGTGCCTGCAGACCCCACACTGTGGACACAGGAGCACG 

AGCCACCTCCGCCTACAACACAGAAGTGCTGT 

CCCAT ACAGACCAGTC CT CACGACTGAATG TCAAGGAAG AC C CTT CTT ATGACT CTGTCAGG AG AGG AG CATGGAACAAT AAT ATG 

AACTCTGGCCTCAACAAAAGTCCTCTCCTTGGAGG 

TCAGATCCTGGGGCCAACCAGCAGCCGCCTAGCAAACCCTGGGAGTG 

CC1*ACAGC<5 CCAACGCCAGCTGTATCACCTGGGAGGGGAC CAACGGGG AGTTCAAAATGACGGACCCTGATGAGGTGG CCAGGCGC 
TGGGGAGAGCGGAAGAGCAAGCCCAACATGAATTATGACAAGCTC^ 

CAAAGTGCATGGCAAAAGGTATGCCn'ACAAGTTTG ACTTCCATGGCATTG CCACATCCAACAG AGACATCCA 

TGTACAAGTATCCGTCTGATATCTCCTACATGCCTTCCTACCATGCCCATCAACAG 

TCCATGCCTGTCACCrrCCTCCAGCTTCTTTGGAGCAGCATCACAATACTGGACCT 

TGTCCCCCGCCATCCTAACACCCACGTGCCTTCACACT 

TCCAGCTCTCACTTTACTGGATACTCTGGACTCTAAAAGGCA 

GATAGAACC 

MOUSE SEQUENCE - CODING 

ATGGACXK5GACTATTAAGGAGGCTCTGTCTGTGGTGAGTGACGATCAGTCCCT 

CCCCAAGGCAG ATATGACTGCTTCGGGGAGTCCTGACTACGGGCAG CCCCACAAAATCAACC CCACCGCAGCAGG AGTGGA 

TCAACCAGCCAGTGAGAGTCAATGTCAAGCGGGAGTATGACCACA 

AAATGTAACAAGCTGGTGXjGCGGAGGCGAAGCCAACCCCATGAACTATAATAGCTACATGGATG^ 

CAAGATGACCACCAACXaAACGGAGAGTCATTGTGCCTGC^AGACCCCACACTGTGG^ 

GGGCTATAAAGGAATACGGATTGATGGAGATTGACACTTCCTTCTTCCAGAAC^ 

GAGGACTTCCTCCGAGCCACCTCCGCCTACAACACAGAAGTGCTGTTGTra 

CTATAACZACAACCTCCCATACAGACCAGTCCTCACGACTGAATGTCAAG^ 

GG AACAATAAT ATG AACTCTGG C CTCAACAAAAGTC CT CTCCTTG GAGGATCAC AG AC CATGGG CAAGAACACTGAGCAG CGGC CC 

CAGCCAGATCCTTATCAGATCCTGGGGCCAACCAGCAGCCGCCTAGCAAACCCTGGGA^ 

CCTGGAACTACTGTCOSACAGCGCCAACGCCAGCTGTATCACCTGGGAG 

AGGTGGCCAGGCGCTGG<5GAGAGCGGAAGAGCAAGCCCAAC^ 

AAAAACATTATGACCAAAGTGCATGGCAAAAGGTATGCCT 

AACAGAGACATCGATGTACAAGTATCCCTCTGATATCTCCTACATGCCTTCCT 

(XTCTC^CCCATCCTCCATGCCTGTCACCTCCTCCAGCTTC^ 

TATCCAAJVCCCCAGTGTCCCCCGCCATCCTAACACCCACGTGCCT^ 

HUMAN SEQUENCE - GENOMIC
TAAAGTACTTTCGTCTGCAAATAATTGCAATTGCATCTTCTAGGA
GAAGTGGAATTGTAAGTGAAAGGGGAAAGGCACTTCGGAGGGGGAATGATGTTC^ 
AGGCAOK3TGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGA 
TGACCAACATGGTGAAACCCTCTCTCTACTAATTAGATGGGCGTGGTG 
5 CAGGAG AATT ACTTGAC CCTGGGAGGCAG AGGTTGCGGT AAG CTGAG AT CGCGC CTCTG CACT CCAG C CTGGG CGACAAG AGTGAA 

ACTCTGTCTCAAAAAAAAAAAGAAAGAAAGAAAAGAAAA 
CGGCCTAAAATTCX3AGACGGCAAAGTGGAAACGGTTTAGGGGAGGAGTAGAA 
GAGGTGAGGGGCCAGAGGO^GTAGCAAACCTCTTTAAGCCTGAA 
AAGGTTAAATGAGGTGATGATAGTAAAAGTTCCTTGGAAACTTC^ 
1 0 AAAAATGTGGTCCTTGTTTGGTTTTTGTCTGTTGTTT A T T T T G ' 1 IT 1"! G TCG TTTGGAGAT AATT CCACT ATGTGTCTAT CAG AGG 

ACTTAGACTTTAATTCrcAGATCGATTCCCCCCT 

ACAGTCATGGTGCTGATTCCTGCCAGCCAGTGTCAATAACCACAGTGAAAGGATACT 
TGGGTTTTCGGTTGCTGAGTAAAATGCTCGGAGACGTTCCGGI^ACTATGG 
AACCAATTATTAjGGCTCATGAGTTTTGTTATCCTTCACTGAAGG 
1 5 GTGGCCTCGGTTCCTCTCCTGCGGGCCTCTCCATACGGTTCCTCAGCTTG^ 

TCCACGGACTAGAGGCCCAAGAGAGAACAAGAAGCAGGTGGAAGTCATAATACCTT^ 

CATTTCTTGAGGCAAT CATAAAGTTCCATCTGAGTTCAAGGGGGAAG AAAAT AAGCT CTG C CTTTTG ATGGG AAG AGAG AGG CAAA 
GATAAGAAATATTCTGATGGTCATTTTGAGAAAATGCGACTGTCC 

2 0 ATGAGGAGCAAGAAATGCGCACATGTAGAGAGGAGAGAGAAGCAAAAGCCrTA 

TTCAGATGAGACTTGGAGTGACCCCAACCCAGGCTGATCACCCCCACTGCTGCAGA^ 
GAGGGCTTTGCACCTAAGTATCTTGTTCGCATTTCATATTCT 

TGTAATAAAAAGCCAAAACTAGCGTGCTCCCTGTGGTGACCCTGGGTAATGGATATGCCAGAGA 
TGGGAGCGGOSGAGTGGGCCCACTGAGCAGGGCAGCCACAGGGAACTTGGTCTG^ 

2 5 CCAGCCAAGCTGGTCAGCCCCAGACAACTGTCCTTAGGAGATTCAGGGCAG 

CATTGGTGCTTCTCAGGCCCCATGTCTTCGGTCATT 
GGGTCTCCCACTCATGTCCACTTCTGGCATTTTTCCTCCTCTA 
ACACAGGATAATGGCAGAAATTTCTGTTATGTCTTCCCGAGTGAAATGGG 
CCTCTGGAGGCTTTTCCATCTCGGCCTTAGCTAGACTGTTTGAAGATGCA^ 

3 0 TGCGGCCATGCTCTGATCACATCCCATCACTCAACTCATGTGCACATG 

TGGAAGACACATTTGTAAGTATAACAAGAAAAGATGCATATAGGCACACATATGTG^ 

TT CGTGAATGCACAGATGT ACCCACG CCATTCTT CTG AT ATTCACTC AGT ATG TGGTGTAC T AT AAAT AGG TG CTAG 

CCACCTACCATTTTCATTATCTCACAACAGATGCCTCCTCTCCTCATTCACAGGGCTA 

TGGCACCTAGAGGCCTGGACTCAGAAACCAGAGACAGGCAGAGGACTG 

3 5 AGTGGACACGCAGATTAGTGAAATCGGCTCTGTACTCTAGGAGCCCATGGGATTTG 

TT ATTAG T AGTTT AGG CAACT AATCTTTT AAACAAT CAACATTCGG AG AAGT AGGT AT CG CT CTGG CTTT ATCTGAACT C C CTCTT 
GACCCATCTCGAACCCAATGGTTCTGACGCCATTTCTCAGCATTC^ 

ACACTGTCTTACTCTGTTGCCCAGGATGGATGGTATGATCATGGCTCACTGCAGCCTCCAC 
CCTCAGTTTTCCAAGTAGCTAGGACCACAGACACAC 

4 0 CGGAGTCTCGCTCTGTCGCCCAGGCTGAAGTGCAGTGGTGTGATCTCGGCTCACTGCAAGCTCT 

TCCCACCTCAGCCTCCCAAGTAGCTGGGACTACAGGCACCCGCCACCACGCCTGGTTAATTTT^ 

GGTTTCACGATGTTAGCCAGGATGGTCTTGATCTCTTGACCTCGTGATCCACC 

CGTGGTTTTTTTATTTTGGTTTTGTTTTTTTTT^ 

GCTCAAGCATTCCACCCGGTTC AG CCTCCAAAAGTG CTGGG ATT ATGGG CATGAGCCAT^ 

4 5 TCTTTTTTTTCTTTGGAGACAGAGGCTCACTCTATTGCCCAAGCCAGAGTGCTGTC 

TCCrcGGTTGAATCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGG^ 
TTTTTAGTAGAAACAGGGTTTCATTATGTTGGCCAGGCTGTTCTCAACC^ 
AAATGCTGGGATTACAGGGATGAGCCACCATGCCTGGCCACAT^ 
TAGGTACTTTACTCATATTATCTCATTTTATCTTTATCCAG 

5 0 GAAATGAAGTTCTTGGAAATTGTACT AGATG CGTAATGAG CAGCAGATGAAG CTTGG ATAG AAACTCAAATCTCTCTGACTCAAGA 

TTTATGCTCTAAGTTACAATAAACACCGGAGTTACACTTAAAGAGCTTATGGTCTAATTG 
ACCTGTGAAGAGTAGTGGTTATGTGCTTAGGCTGTAGACCTGAATT^ 

G TTATTTCACTTTTCTG AATTT CGATTTT CTTCAT CAT AG AATTGC CAGG AGG ATT AAATT AGG 
TTACTCATGTAAACAGTGTTTTGTATTTAATAAGTGCTGTGATTATTACCACTTTTATCTGT 

5 5 TATTATATCATGTACTAAAGGCACCTTGCCCAAACTGGAGGAGAAAAAGTATCCCACCGGTTCAAGT 

ACCCCAAGCAGCTTGCCTTGCCAGAGGTCTC 
AAGGGACCCTATGCCACCCTCTGGATCTAGGACTCAGT^X3CT 

TGCTTCTCAG AAG CCAATGACT AAGCCAGCCCAGGGG1TCCTTTTACAGA C AAGAAGG AACC CCAG AG CTG A 

6 0 GCCCCACATAGGGCTATOAGCCCACGCCTCCTC^ 

CCCACAAGTCTTTT AAAT AATAACTAACTTCTCTTCCCTTGAAT AAAAT AATCAGC^ 
GGAAGGAATGTAGTCCCTCTGAGGATGGCTGGTGAATTTGCATGCTGGTGTTCTTCCTACACC 
CTTATTGATGATTTATCCGTGGAAAGTTCITGAACGCTAG 
TAAGTTTAAACTTTAAAAAGTTTTAAGAGTTTAAGTAAAGA^ 
6 5 CAAGCTGCAAATAAGTGAAATTTCTGCCAGCTCAGCAGCTCCT 




CACAAGAATGCTCTCCTCCCCAACTCTGCTCTTTCACrTGGGTCT 
TTCCACITAGGGGCCCTC^CCTCCCITCCCTTGCCCATATTGGCTT^ 



7 0 CTTCGTCTTCTCTCrCITCACAGCCCCCATGATCTATCT 

TGA1TGCAAATAGAAGGGGGATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCACAT 
TCTCACGAGCTGTTAACAGCTCAAAGGAGTCATAGGTCATCATCAGAA^ 
CTTACCTTACACAATGGCAAAGATAATTGTGCATCAGGGTGAAAGAGTAA^ 
AAGGCACAGAAGCATACCTGAGTAGGGCTT^TCTAGTGGAGCAGT 

7 5 TTGTGCAATGTAAGTGTGGCAGGTGGGACAAAGATGACCTTTC 



GGGAGCTCTCTGAGGTTTTTGAGC AGATTGTGG CAGG ATGACAGCAGTATTCACAACTTGCAAGGG AG CTGG ATCCAG C CAGGG AG 

(X5CACAAGGTGTACTGAATGAGGGAAGAGCATGTCAGTGGGTCGTCAGGGAGGGAAGAGTCT 

AGAAGTGTGAGGCA.TTCCCGGTGAACCGGG^ACXSCTCTCCTCCT 

TCTTTGCTGTGAGTCTTGATTTTGCCGTCTGC^TGT 

T AGG ATAATGATGGGAGG AACTTTTTGC CCATCTATGGAGCATCA.CTGTG CC C C CTG CCTC C CGGG CTT CTTTG CCTC TGATA 

TTCTGTCATCTTCTCCCCCAACCTAGAAGAGGGAAGGGAAAGAAA 

AGAACGTAAGCCATTCTCTACAGCAGAAGATGCTACCTGATTCCTGAAATGTCATCT 
1 0 CTT (^CTCTT ATCACCAAAAGACACTT AGGGGTCAT AGGTGGT AATG TGT CCTCTG CTTC CAGTG CGG AGCT CAAGGAATTTCATG 

GAGTCAGGGACAGCTGTCTGAAAGGTTGCAATGTCACCCTTTTCAG CTCAGAACCATAATT CCTTG AG CCCAGATAACAGTTCAAA 

ATTTAGGATATGAAGCCAAAGCTGAACTCTAGCTX3AAATCAAGAGTGGATATAA 

GCTTTACTCTGAGCATGACTCAAGGGCATCTGGTAGAA 

TTGGGGGATGTAAGTGTTTTAATATTGTGTGGGCAACTCTGCT 
1 5 GGTGCAAAAGCAGGACTCCTAGAGTATATTCCAGTATTTTTTCCCTGATCCCA^ 

GAATTATGGCTGTCTGTCCGCTGGGGTGGAGAATGGCCCTGGGAGACAAGAGA 

T ATG AAG AGCATTCATTCTG AAGAAG ACATTCAATT CAGC^AATGTGAATT ACAGAAGGAAGGT AGACAAAT C CCAG CTTT AACAAG 
TGCTGTTCACCTATCACTACATTTTCAAGTAAAGCTGGTGTATCCATTTGCT 
CTTAAGCAAAAGAAATTTATTTGCTCAGAGTTCTGGAGACTC 
2 0 GCCTCTCTCCTTGGCTTGCCAATGACTGCTGTGTCCrCACATGGCAGCCCCTCT 
TCCAAATTCCCTCTTCTTATAAGAACACCAGTCATATTGGATTAGAACCCAC 
GTCCTATCTCCAAATACAATCACATTCTGAGTACTGCGGG 

AACAGGTGGCCTAAGAGAATCTTTGTTTGAATGCTAATAATCATGATACCTAACATGTTCT 

ACTTCTAAGT ACTTTGCATGT ATTAT CACAACT ATT AT CACAATT ATAG ATTATAT AATT ATTG ATT AT AT AGA 

2 5 T AT AG ATG ACT ACCAT CTTCTCTG AAGACTT ACTTTGG TGT AATT ATG TG ACTGT AGTG AACACT ATGCCCTGGTC CCTGACCTTG 

GATGCCTGTGATTCGATGAGACATGGAATAATGGGAGACTCTTACCCTCATGTTT^ 
GTTTTTTAGTTGTCGTTGTTGTTGTTGTTTCGAGACTGTCGCCCAGGCTGAA 

ACTTC CGGGG CT CAAG TGATCCTCC CAC CTCAGCCT CC CAAGT AGCTGGG C ACATG C CACC AT ATCTAGCT AATTTTTTCT TT CTT 
TCTTTCTT T CT1 I'CI 1 ICTTTCTTTCTTTCTTTCTfTCTTTCTTTCTTTCT^ 

3 0 CCACATTGCCCAGGCTAGTTTAGAACTCCTGGACTCAATAGATTCrCCC^ 

GCC ACCATGC CTGG CC AAGG CTT CTTTT CTT AAT C CATG AT CTCCCAAAG T CACACCCAAGT CAG TGATGGG AT AAT ATGAGTGGG 
AGGACCCAACAACCTC lTlT GGAGGGTCrTGCCCTGACTGKSCAC^ CACGTGTCTG AGGTTCCTCrTTCGTCCTTTGG CTGGCTGA 
GTCATC^^CATTG AGATCT AC!AGGATGCACCAG CACCTCCTAGAGGGC CCTCAG CAAACCACTGAGCATTTCTT ACAAGCTACACC 
C CCACAC CAGATCTTCTCATTT AAT C CACCT AT CG ACT CTGTG CT AAAACT ATTTTT AT CC CTGTT ACAAATAAG GAAACTG AAAT 

3 5 TGGCCTGGCCAQ^CCATGGGCACCACCCTTCCCTTGTGTGAC 

TTGCCTGGAAAATAAATGCTGGATCTCATCTCCCCTTCATTGCTGATC 

TGGAAAAATCTAGCATTCACACACCACCACAATACT ATATTC CATGCCAAGTCACCACTATTCTTGG CCTCCCT ATCCCCAC CCTC 
CTITTTGGGGGTCCCAGACT ACAGAACCAATAAACA CCCACT CC ACTGCCCAAAGT CCCTC 

CTCACCCAGGTAACCTGCAAACACCCATCTGGTTTGCATAAGGGCAAACCTGCC 

4 0 ACAAGCTCTCCTGGGGCAGCCCTGGGCCACCTTGCCAGCTCAAAGCAAGAGTGTC^ 

TCT^TCTCTTCTCTGGCCCTGCAGGAGGCTCrGTCGGTGGTGAGCGACGACCAGT 
CCATCTCCCOUlGGCCGACATGACrGCCrCGGGGAGTCCTGACTACGGGCAGCCC 

AGTGGATCAATCAGC CAGTGAGGGTCAACGTCAAG CGGGAGTATGACCAC ATG AATGGATCCAGGTAAG CTCACCAGGCCTGTGCA 
GGATTGGGGGAAGGCACAAAGTCGCCAGATCTCCAGCIAAGT 

4 5 TGGG CCAGGAAGCCTGTGTCAGTCCCTAG AGAG CTAAGG CAG AG AGGG CCACAG ACTGATGATGGGGTGGAGTAGGGCT AGG AAGC 

CCCCCTGGCTCTCTGGGGCCCACGAAGCCCTCAGAGGCCAGGTAGGAGTCAGGCC 
TTATTTCTTGTTAATCCCAGGCCCAGGTCCAGTCCTGTTGGGGACAGCTTA 
CTTAACCCCTCTCGGCAGGTCTCAGCAGAGCAAAGGTGACCAAACCA 
ACCTTCCTGTGGGGGCTC!ACTGTGTCTGGGAGGGGG 

5 0 GGCTTTGCTTGTAGAAATGGAGACAGGAGGGTCCAGGG AGAG AAAATC^ CCT CA 

GTTTTGCGTCCTGCCTCTGCCCCTTGGGCAAGTCATCGAAACCCCTCAGAGCACCAGCT 

ATTC!ACTT CCTTATGGTCCrTACTCTAGCGTTACTCTG ATG CCTCAGGGAAAGG ACCTGGGATGGTGCCTGAGACG CGTTAGGAAA 

GTGCTCTATAGTGACAGTGATTGTTATGAGC^WVGACTTGAAAGACAGAGAACATCAGTCT 

CTGGGCCAAAGGCACAAAGAGATGAAAATAAAAAGCAACAGAGGGCCCTC 

5 5 GC CAGAGGCTG ACTTTG AG CACCG ATGTG ACATTT AACCCAG AGG CTGACTTTGAG CACAG ACGTGACATTTAACCTCATCAGAGC 

AGTGACTTAAGACTAGCGAGCTGACAGTCCTTGCGCACAGACGGTAGGATtKGAGAGTCAGAAAG 
ATATCCCAGATAGGAAATAATGAAGTTCTGAACTAAGTGATGATAGTGATTTTGGAAAGT^ 
AGCAAAGAGATGATTAACAGGGCTTAGGTGAGTAAAGTGTGAATGAAGGGGGTGCA 
CTGCACATTTTAAAGGCTGGTGCTGTGAGCAGCCrGCAGAAAGGTGGC^ 

6 0 GAGATTCCTTGTGAGCAATGGGTAGAACATCCAGATAAAAATGTCTACTAC>G CTTTG 

CTGGAATTTACGTTTGGGCATCATCTCTAAGGAGTCTGAACTGT^ 

AGAGAG AAGGGAATG AGCATtKSTTGGGAGGGAG AACTCCTC CAG AGCAG CTC^TGGCCTCAGACCTG CCTGTGGGAACTCCCAGGC 
CTACCACATTTCAACCCT ACTAACCAAAACCAGTG ACTGGGA CCAGAT CCCAGGATTT CAGG AAGGCTCAGA 

GGTTCTGAGCCTGCAGCCCCTCAGATGAATCTCrCrCCAGATGTA^ 
65 XGTGTTGAACGTTTTCCCATATTATCTTACCACTTGTTCCTCACCACTATACTAAGAAGTAGTTACT 

GGAAGGCAAGCAGGGCTTTCAGGGGACAGTGCTTTGT CTAAGG ATTTAAAGGAGGCAAAGTTGGAGCCCGAACCCAGG CGTTTGGT 

TCCAGCCCt^CGTGCCATCTGTTAGGCACCTTTCTGCT^ 

ATCCCCACCTGGTGAGGAGGCCTATTCTACGATGCCTCCTAAATGGGGCC^ 

AGATCCC^CTGGTGGAGATTCC'IT'rrri'riTr'nT'ri'l'Tr'l riTGAGATGGAGTCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGG 

7 0 XACAGTCTTGGCTCACTACAACCTCTGCCrrCCGAGGTT^ 

CATGGCACCAGACCTGGCAAGTTTTTGTATTTTTAGTAGA 
TCATGATCCGCCTGCCTCGGCCTCCCAAAGTGCTGGGATT 
TGTTGTAACCTGAATCTGCCTACTCCnTACCTCCACTGCCCCTGCCTGGTGCAAG 
GCATCCTGCTCTCCCCACTTCCTCCTGCCTTCCTGAAGTCTATTCTCCGGACAGCAAGCAGC 
AATCCTCTTGCTCAAAACTCTTCAATGGC^rAT^ACAAAATGCAAAG
ArrrCCCCCTCCCTCCCACTTCTTCCCTCCGCGCCAGOT

rrrAGAATCTCTTCCCCTGAGCTCCrrTCATTCTGGA^ 

TTTCTCTACCTGAACCTCACCTTTTCAGAGGCCTCCCCTGGCCACCTO 

AGTCTCTGTCCTAGTTCTTATCACCTCCCTCACTGA 

AAA^AACTCAGAACACATGGCTGAGCAGACTCAAAAGAGAAGGTGCTTGTTTAAAGTTAG 

G^CCTTCAATGCTGAGCCTGAGTTGATGCATGGGCTTCTGAGTTCCATCTTCATGCT 

CT^TG^G^AGGJJVGTATGCTGCTCACCACTCCCT 

nGAGGGTCCCCCAAG^ 

GGCCTCTGTGAATA^CCAGTGAACAGACAGGATATTTACCTGGCTTTGG 

SSSCT^C^GCAGClTAGGCACAGCrATCGCCAGCACCCCA 

^^AT^CA^A^AAATAAATGCCAGGCTCTTTAAATCTCTCATCCT 

tgataatgacagttmtcacttg^acatgatt^ 



* ACTGG^^TTAGAAGACTTACTATGGAAAAAGT 



CACATAC 



TCAATGCAGCAGCCACTAGCCACAGGTTGCTACTGAGCACT^ 
ACTGGATTTAGAAGAC 
TXC JVC TT AACAT ATT A 

£^£S"caOTACTCAGGAGGCTGAGGC^^ 

SctctacttctmtgctSctgctgggccatgctggtctggact 

GCATGGGCTTCTGAGCTATGTCTTCATGCTCACIXSGltf^ 
TGAAGCTACCATC^CACTTGCTTTTCACCCCAGTAGCAAA 

^gcStctcagagacagagtggcttgcccatgttc 

^GCTGGG^OT^ 

TCTGGA^GC^CTGGACACCTAGGGACTCACATTCCACACAGAGAAGGATCAGCTCCTG 
CGAOTCCCCTTTCTAAACTCTTGTGTTCTC 

TTCTTTTCTTGGAATTTGCTTTTAAGTTATTC 

rrra^CCCGCGAGGCCTCACTTTGGCC^ 

taScccaa^«£^^ 

TTTCAAA^CTC^CTTCATCCAAACACCCAAGATCTTACACCTTGTTTTAAATGGCCTAACACATTC 
CCTACCCTCGG^AGCTCTCCTGTGGCCTCTGTCGCCCTGAGATC 

CCCTGCTCTA^TCTGGCAGCCCCAGGCTCTCTTGTCACCTGTCCCAGTGCTATCAGC^^ 

gtSSc^tacaaattagcagcagtgcaggcaa^ 
^m^agtccS^ccgcacgcagggcttgcgctggctggacwaggcacggtgc™ 

CTGGCAGG^AGCGG^CG^GGTATGGCTTTCCTTCTTTCTCTCCCTC 

^Sca^c^g^cgagc^^^ 
a^agaggag^a^gaagcggaactgggagaacttctatgggtcagtgaatga 

A^CCAGCAAGAG^CCCTCCTC 

^AG^G^SAAATCmCTTTCAGCTTTTCACATTTATTTTCCTC^ 

TC^CTACreTACATGGC^TAACCATAAAGGCTCACTCCATTTCAG 

AGTTT^AACTCAGAAATCCCCAGGGGCCCGGCTGATC 

C^GCTCACTTGGTTGCAMCCAACTrAATCCTTCAAACTTCT 



GGCTGCCAGACCATGTGCGTGAGTGTGTGTGTATGTO^^ 

re 

ATGTG TGCACCrrGTTCTCTGTTGAAG^^ 



CTGCATGTGTTGAATGTGTGTGCAGATGAAGTGTACGTGCATGTGTGTGCATCnTCTC^ 
TGAAGTATGTGTGTTCCATOTTGGAATGTGTACAGTC 



ACATGTGCATGCATGTGTGTGAGTGTGTGCCTGCAGTTAGCCCCAGGGCGGGTGAGGGGACCCAGTGCACTCGGGCCAGAGCTCAG 

attttcagcccatctSccctctgtaagacatcaggaaaaaattaccgtcto 

GCTATCATAAGTGATC^CAATATTTTTTTCCTTTCAGATTAAGAA^ 
T^COTCCCTGCCTGGGTCTCAGGAAGCCCTGGGCTC 

rrcCT^GGAACG^CCGGAGAGGTTATGCGGCCTAGAGTTGGAAGGCTGTAA 


^^CCOTGAAAAGGAGATAGTAATCCTTGCTGTGCCTATCCnKSGGGCTTATTGGGAAATTCAAA 
ATT ^ OT ATATATGTGCAATAGAAATAACTCACTG
GAAA^ATGACTATCCTCTCAAGAACTCAGCM 
rrnAG^CGGGCACCAAGTC 
GTAAG^CACTCAGACAGGGAGGGCTC 
~^ B HCTa™TAATTATAGGATTC^ 



r^aAGCTGCAGTCAGACAGGGAGGGCTG(^ATGCCACACAAAGAAG 
T^mA«3TACrrGTAATTATAGGATTGGGCTG<^ATTTGA^ 

TC AT CATGACACCAAGTGTGGC CGGG CAATGGCTC CATG CT CATCAT CGG AGC CCTGGGCTCATGATTCAGAGG CTGAGG CAAG CT 
rrCCTGTCAGTGCTGACCGCCTCTGGGCTrTGTCTOT 
AGCTGGTGGGCGGAGGCGAGTCCAACCCCATGAACTACAACAGCrATAT 
ACCAACGAGAG^AGAGTCATCGTCCCCGCAGGTAATTCGAGA^ 
CAGGGAGCATCTAAACCTTTATCTGATA^ 

CTCGCGCCTGTAATCCCAGCACTTTCAGAGGCCAAGACGGGTGAATCACT^ 
AAACCTCATCTCTACTAAAAAAAAAAAAAATACAAAAATTAGCCAGGTGTGG 
CTGAAGCAGGAGACTCGCTTTAACCGGGAAGGCGGAGGTTGCAG^ 
ATGAGACTCTCTCTCAAAAAAAAAAAAAAAAAAA^ 

ATGTCACCTGATGTGTGGGTGCTAGAAGCCCCACCCTTAAACAGACCACATC^ 
CGGGCTGCTCAGAGAGCAGAGGCAAGGCCGCCTCCACCCAAAA 
AGGAGGTCAGCTGCAGAG^CCAGTCCTTCCCTCCAGCCTCGTACTTCCCGGCATCT 
AGCTCTTTCAGTTTTAAAATAGAAAGAACTATATTTA^ 

AAATAGCCTCnTTTTCCT AGGTGTTGOTGCn'GTTGGT CCCAGAAAAT AAAATCAT CGTT CAGTCCTTCAGTCCATTGGCTGGACAC 

TCCACCCCAGCTAGCACCAGCAAGCTCAGCTGCCTGCACAAGAAGGGTC 

AAGGGGCAAGGATGTCCAAATCTTTAAGCAAAGTGATTACAGC^ 

GCAAAACTCCTTCCATTTGTAAAAGC^TAAATGTAGTTGCCT 

ATCTGCTTGTAGGTCTAAAGGCAGAAATGTTAAGGGCTGCCTCCTC 

TCACCATATCCC^ATTACAGCTGCCTCCCTGTGCTCTGGGCCAAAT^ 

GGGG CTTGT AGAAATTT CT CTGT AC CCITT AGG ACTT AGAAATG ACT CCT AAAGCAATTCT CTG 

AC CTAACAAAGG AAGAGGAAGAAT CACTT CCATTTATTGAGTG CTTTCT AGGTGCCAGG CAC CACGTAAATTGCTTT AT CTCCATG 

GCATTACTTAAAACACCTGTCACAGGAGA 

TOAAACTCTATTAGAACTGACAGGTGGGAATCTCTTTGTA^ 

TTCAAACTCCCCTCCTTC^CT 

TTTGAACCAACCITTGGCACCATCTCATTTTGGTCGTCATTT^ 

TGTCGACCCTCCAAACTrrCTTTGATCAGTGTTGAAATC 

CTATAGCGCTGGTGGTGAAAATAAAACAAATTCATG^ 

AGCTCCGGCTAGATCTTTTAGGTCCTCACACATTTTGGCTTCCTCCA 

ACTCCACAAATCAAGGCAATAAAATATTAGTGGAAT 

TAGTTAATCACTAGCTCCATGTTCAGATGGGAAGAAACTAGGACT 

CAGAGTTGAAGATTTATAATTCATTGTGCTGTAGAGGGAGAGTAATAATATTGTTGCAGAGCCCA^ 
AATAAATAAAATATATG<^TAAAATTGAATTTGAGGCCTTGGATACT 
ATGCTACATAATCCAACTCTCCCGAGGATATGTTTCTCTAGAATTCTG 
GCCTAATTGCTTCTGGAGAGTAAAAGAGCAAGAAATGTATCTG^ 

G AG CTCCGCTTrCTGGCTTCAGA\GCAGTC^ CTTTGGGGGAA 
CCCAC CTCTTCAGCTT AGTGAAGTCCTAT AATGGTTT CTATTGG AACCCCAGCCCAGATACTTACCAGCGATGGGGCTTTGGGCAA 
AGGACTTAACCITTTAAGACTCCATTT^ 

GAGCAGGTGGCACATAGCACCCACCATCTGAGCCGTCACTGCTCACA 
TGACCTCTGAACCCAAACTCTCCTAACTCCTCCCCTTTGCCAGCTCC^ 
GGCAGGG CCTGGG CACTTT ACAAA^CAG ACT ACTCGGGCCCT 

t7(-aaggacagttggtck;atgcttgtgtaaaagtgctggga^ 

TGCCC^CATCCACTCCTAGTCATTCAATCCTGGTTCCCTGCTTTGCCAGG 
TCCTTGAGGGGTAGCACAGATATO 

TACAATAAAATGCCGAGTCTCTCCTCCCCCAGGAAGTCTTCCT^ 
CGAACCACCGCCACACAGTCCATTGCITAGTAGTGGTGTGCTTGCCT 

A^CTGTGATGAGTGATAAAAGAGTTCTGCCTACTGTGGAACCT 

ACACA^CACAC^CACACACACACACACACACACAC^^ 

AGAGATGGCCTCAGTGGTTACCTTACCT 

AAATATCTAAAATATGTTTTGGGGATGCCCTGGTAGAAATTTCTAC^ 
TtTTCAAGACTAACACCGCTGGGTTTTT 

AGGGATCCGTGTTCITTCATGAAATCATTAATATTCTTAAAATACAGAT^ 

GAAATTGTGTTT ATTTTCATTTTG ATGCCACC^ CAACTGCTCTCCAAAACTTTAAAG AGTGCTCTGTC 

^TACGTAGTGAGGCAGCTAAGCACCTCCCTGGAGTGCTAAGTGGTTCA^ 

CTAGGGGAACCATGTGGAAGGGAGCTCTCCCAGAGAGAAAGAGAAGGG^ 

Sc^C^AGCCT^ 

GAG^CAATGGCTGGAGTGGGCCATAAAGGAGTACAG 

TGTGTAAAATGAACAAGGAGGACTTCCTCCGCGCCACCACCCTCTACAACACGGAA^ 
rAAAGTAAGTGCCGCCGAAGTACCCAGGGCTGGGAGGAAGCnTGGCA 

CGTGTTGGGCAG ATGCCGCCGG AG CAGCATCGTGGGG CCTGTAGTGTTG CCAAGGTCACGTGGGCTCCT ACAGACCCTGTGAGGGG 

rAGACCCAATCCTGAAGTCTCAAAGGAAGTAAGACGAGAAAT^ 

TGGTT ACACTCTTCCCATGAGACCGCCTTTCCAGAAACT AT^ 

TTTCCTAAGTGATTATTTAAAATTCCTCCAGGA 

aGCTTCCTCTTAATCTTGGGATACTCTGGCACIACCCTA^ 

nCAGCCAAAGGTGCAGGTGTGTTAGGATCCATTGGAGGCTT^ 

»^rrra a n a A ARAAAGCAGGCAGTTGTCCAAG AG ATTTATCT AGTATGAGGT^ 
CAGATCCTTAGACAGCAACTCTGGGACCTGCTTTTCCTGG

TGTGGACCTTTAACTCCCCATCTTTGGAATCCACTGTCCCTTCAGCGACAGTGTT 
GGAGAAATGAAAAATCAAAAAACTGGCTTTGGCAGTTTCCA 

ACT AGTGCGAGAAAGT AGATGATGCAGT ATTTGTT AG CT CCGGAGCAATT ACCT GATGT CTGAG T CTTG CTGCAACATGAGGTCAG 

CCCAAGAAACAATTTCAAAAAAGGGGACAATTTTCCA^ 

CAGCTTTTAACTGTTTGTGGAATTTTAA^ 

TGGAGGTTAGCATGAGAAGGAGAGACCAGCAAAGCAACCTGATTCC^ 
GCCAGGAGTAGAGAGGACAACAGTAGCATAGACAGCATGCTCTGAACTGTTAGTTA 

GATGCCCTTGTTGGGAGAATGTTCAAGG CATTCTCTCCATGTGGGG AAAG CCCATTCATGCCATATGACCCCGGGGCCCTAACCAG 

TCCATAAACAGCCCAAGGTCCCCTAGTACTTCCTGCCACCG^ 

GTGGTCACrGGTCGCGAGTAGAGAGACCTrTCTATTG 

GGGGGGTATAGCATGGCAAGAAAAAGCAAATTGCTTC'rri C T TT T T AAGAATATTT ATAGACAGAAAGTGCCCCCAAATCAATAAA 
AATATTAGGTTTCCCAAAAGGGACTTATTAGTGGAGGATATTTCCAGAGTG^ 



TTGAAATG ATTATTGGG AAACACCATGAG AAGTGCTT AGGAT ACAATG TTGGGTGGAAAAGCTCATT AT AAAGTGG TCTGTG CACT 

GATAGAAAAACCATGTCTGCCTGCGAAAGCAAAAAATTAAAATAATCGTGTTAAATTAAAA 

TATTGAAATATCCTTTTGAGGAACCTAGACTTAAAATGATCCACTGTCAGGGA 

GTGTCTACCTTGGGGTTCCAGCCTTGGGGACCCTCTAG 

GCGATCGTGTCTGCTCTGTTTCTGCTCTGGATTGCrGGGAACT 

ACACTTTATTGTTTTTCCAGACTGACTTCCAGTG^ 

GAGTCAAGGOVAGGGGAGAGGGTGCTGATTCTTTGTGCm 

TCGACTTTAGCAAGAGATGGATGGAAAATAGCTGAGATCCITCACT 

AGCAGAATCGGGGACCTGAGACTGTGGCTGGTTGGAAGTCAGTAGAGTCCAAGTTGGTGGCACTTCT 

CTTTAT AGAAAGTCTAGAAAAAGGAAAACTT CATAGGAT AAGAAGCTTTGG CATCATUVTTTACATrTCACAAAAGGTGCTTGACTA 
TTT ATT CTGGTGGTTTGGGAGGGGAG CT AGACTTTT ACATTTT AAATG CAAATGT ATTTT CCAGGGT AG AG AAG AAAT AG CAGTT C 
CCTGTGCCCCTACAGACCCTGTCTGATGGCCAGGCATGCTAGGGAGGTGCTTGTTCT^ 
GGAGCTGTGTCACACGCCACACAGGCGCCCACCCCTCCCCCCCACCACCACAGGAAATGCAAGGAGGACT 
CCACGGTTCTTGGGGGTAATTCCCATGAGTCCTGGGAAACTCCA.GGGTTTACACCT 

GGGTGG ATCACCTG AGGC CAGAAGTTGGAG AC CAGCCTGGCCAACGTGGTG AAAC CC CG TCT CT ACT AAACATACAAAAATG AG CC 

AGGTGTGGTGGTGGGCGCCTGTAATCCTAGCTACTTGGGAGGCTGAGGCAGGAGA 

TC^GCTGAGATCGCACCACCGCACTCTGGOT 

CAACTATCTACAGT ACCCATTTTGAAGCTG AACCCACTG AGGCCC^ CCAG CCGATTG 

AGAGCACCTGGCATCrCAGACTAGGGCCTC!TGGAGTCCCAGTCCCTGGCCT 

AAGCCAGCCTGCGCC!ACAGAGGCAAGGCAGCCTCA<rTCCCGG 

GTTGT CATTTGT ACC AAG AG AT AAATG CTTT C CAAAAACTTT CCTGTGG C CT C CTGGAAGG AGG C CG ATT AAG CCAAACCT AT CAC 
CACGCTAACTTTGTCCTTCTCTGGGCCTGACTGTCGGC 

GCAGCAGCTGCTGCGGGAACGAGATTTGTCCCTTCTCAGGCCAAGGCGGGGAA 

AGTTCAGGCTCAGGGCCGCTGGGGGGAGTTTGCGCAAGGCTGGGAGCCCCGAGG 

GAGCCTTTTTTAAGTTTTACAAACAAGTTCGTTTTGTATGTCAATCTGGGT 

TTCAAACAGTGGCTGCTCAAAAGAATGAAACTGCGCXSACCGGGAGACCT^CGG 

AAACTTTAATGGAAAACACATTTTGCATTTTCTGGTCTTTCCACCT 

ATTAAGCTGGTGCAATCITATCACTTCCAGGCCTGGGTTTGACTGGTGAAGGAGAACA 

TCCCTGGACCCAGTAGTCAGCCAGGATTGTTGTTTAAAGATCCC^GAGCCTT 

GTTGATATAGCAGTGCAGCCCAGAGAAAAGAGAGACCTGGATTCAATAGGCTGG 

r^GCGAGGCAGGAGACAAAGGCTTTGGCACTGGGACGCTTGCGCCCCCT 

GGGAATGTGAAGGAGCAGAAACCGTGATTGTTGGGGTTCTA 

AGATCCAGGTACTGTCTCTAGCAGCACATGGATTCAAACATGGAATGGAAAATGATCT^ 
CATTCTATTCTCAGCAC^AGCTACTTTATGCTTTTTAGAGCAA^ 

ATATACAAATATATACATACAAACACATATACAGAAACTTACAAATAACCAAGTAGGCAAATCTTATAOT 

TACAACACAGAGAGAAGTGCAAAAATCATGAAGTGCGCTACGTGAGACATTATCGTC 

TCCCGGGCCAAGGAGTAGAATTTCAGCCCCCTGTACCCTCTCTCCACCCCACAACCGCrrCCT 

CTTCCCAGCCCCATAGGTGAGTTTTGCCTATTTCTACGATTTAGCCTGGAGGCAGTGGGATCTGCCAGATTG 

GAATAGACCACGATGGGGCTCCTGCCTTCTGCACTCTCTGGCTGTGTC 

CTTTCATCTGTAAAAAGGAATAAGGGAATGCTGGCTCTGTATGGCTTATGCAA 

GCACACCGTGCATGTTCAGGGAGCACTGCGTCCCCACCACCACCTTCATCATGGCCTGTTTACCCTGAT 

GTGGGGGAGCTCAGCAGCACAGGCCTGTGGCAGGGTCAGGCCAGAAAGAGCCT 

AGGGAGGCCAGACTCCAGGAGGACCTGCCTGCATCAGAACAA 

CGGAGAAGAGGAGGGAATCTAGGTCCCTTGGGAGAGAGGCCAGATGCCCCCAACCAGC^ 
CTAGGAGTGAAACTTGAACCAAGAAC^CTCTTTAGG^ 
TGGAGAGGTC^TGACCCTTTCTCTCAGCTTTCTGGGAGTTTCCT 
GGGAAATTTAATAAATGTGGTTTATCTAAGGTTCGTGCTTTCGGCTACATCTGCACTAA 

CCATCATTTTTCTTTATTTGCTCCAGCAGTGAGGTTT 

CTACTTCTGGGATTTTTTTGTATTCCCTCCGAAC^m 

GCAGCAGAACCCATTCATAACTTTAATCAGAGGACAGGACGAATGT^ 

GTCACATCTGTGGGAGAAAGGATGTAAAAATAGTAACGATC^TAGCTAGCATTTATGG 

GGACATTGCATGCATTATTTGATTTAATCTTCATAATGCCATTGAGTGGTC 

A^AGATGAGGGAAATCAGTGGAGGG^OUVAGCTAGTC^ 

CATACAGTCGTGCTTTGCTTCATGATGGGGATAGTTCATCTTCAT^ 

agc™ctacacactgaggctgtatgctacagcctgtg^ 

^ACAACCGTAGCACAATGGCAAGTATTTt3TGCATCT 
Tr^TC^TCTTTCCCCAAAACATCATT^ 
TGAACATGGAATAACCAAATGTAGCTGGCTCTTGGATGT^ 
CACTTTGGCATACAGCGGX3 CTCAATACXACCCCCGTC^ 

GGC^CAkTCACCTGAGACC^CCTGA 
GCATGCCTGTAATCCCAGCTACTGGGGA^

GCGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCCGTTTCAAAAAAAA 

CTTGGAT AGTGCACACATTTT CT CTGAGG CACACAACCAAACGC CCACAGCTGGG TG AGGCGG CAACTC CATTTCGG CATCCCTTC 
CATGACTGGAGACTAAAAATAGGGGTGTGTCTGACAGCCCATTCCCACCCCTGGCACC 
GGGGAAAAgTOTTCTT ^

CCTCGTTTCAAGGGCTCTTTAAGATTTCCTTAGGAGAATCTCAAAGAGAATG^ 
GAATTCACTTTAAACAAAGGAAAAAGTCTAGTTTTAGGAAAGAAGGGAGAATG^ 
AGTGCn^GTACAGAAGAGGTATGAATTGAAGTTTAATTTGAAAAGA 
GTGACTCCTCAGGTTCAGTGCGTGCTCTTAGACATAGGTGAAGGGAAGCAAAAGTTCCTA 
CCAACTCCCATTGCCCCCTCCACCAGTAACATACCTGTGTACCTTACCCTCACCTGTGAAAGGACCC^
AGATACCACAGCAGGAAGGAGTGATACTGGCTTCTCCTCCAGGCCATGGCAAAGTACAA 
GCCCATGGACAAAGCAAAACAAAAACTAAGAAGACTGTGCAACAGAGATTGTAGTTAGCCTATGA^ 
CTTTTGATAGAAGAAATCTGCCAACCTCTGCT 

CACATGGTGAGTTAATAACAACTGGAATAGACCCGGGTTTACTGATTCTCAATG 
CAGCGCITCAGAACTATGAATATGGCAGCTTAGGGGGCGTGGTGTTTGTTT

CTTCACTCCATAAAGCTATTACAGGCCTGTATGATTGTCTCCAACCAAATGGATTCCATTC^ 

TCTGTCCCTCTTCCCCTCrrCCCCATCTCTTTCCCTTTCTACTCCCTCCTCATGTCAT 

TGGTTATAACCTGTTTATGTTTTGCCTCTCAGGTTCACTGCTGGCCTATAATA 

GTGTCAAAGAAGGTAAGTTTGTTCTTTTGTGCACTTAAAATTTTCTTCTGTACC^ 
TTGCAGAAAACCAGCTTGCTTGTTCCACTCAACTTTTCCTAGCATACAAAGACAAGCCAA^

TG CAAAAAT AC AGT AAAG AAAGTTTT ACAGT ATCTTGT AAT AAT ATCACAAAG ATACAA.CTTGAAGTT CAGGAAA 

ACATGTTACTTTTAGAAAAAAATGAATGCTGCTTACTTTCAAAGTTG^ 

GAGTT CAAGG ATT CCAG CATGT CT ACCTTT AAAGG Tr rr i 'TT C CT AAGAAATTTT AG CAGC CGG TCATG CTGGCT CACACCTG T AA 
TCCCAGCGCTTTGGGGGGCCGAAGTGGGTGGATCACT^GGGXSTCAGGAGT^ 

TCTCAGCTAAAAAATACAAAAATTAGCCAGGTGTGGTGGCAGGTACCT

GCTTGGACCCAGGAGGCAGAGGCTGCAATGAGCCGAGT^ 
AAAAAAATTTTAGCGATGTCAATTCATTTTTAAAGTGGTG^ 
ACITCCTaKBCAAATGATTTTTGCCAGTGAAGATTTTAG 
GAGTGGTTCAGATACTGTTAGATTTCAGGC!ATTGAAACAAGTGAATAATTC^ 
aaataaTTATTAGGAACCAGAGAGAGACTGGAAAAAATATTTGCG

ttctaagggaaggacc^tttggcttaaaatccgcatggggtgtgaaggaaacagagagcc 
aaggttacaactttgatc^cacagcactttc^ 

ttatacaaiiataatgaatccgcttttaaacatttgtgtttttcat^

tctcccactotttttcctttccaaatcagat^ 

tcagatttcactttttgacctattttaatatgctgttgcaaatgtct 

ttgaaggtcagggcaccttt-ggcttcctctg^ 

TCTGTCAGCAAGAGGCCAGCCCTGACTCTGGAGATCTrGGCCTCTAAGGAGATG^ 

CCZACrGTCCTTTGATATGTTCTACATACTGAGTTTTATTTCA

GGGAAAAATGACTAGAGGGAGAGAGAGAGAGGGAAGGAAGGGAAGGGAAGGAAGATAGGGAGGGAGAAA^ 
GGAAGGGAGGGAWK3AGGAAGGGAGGGATCGGAGGAACCCTAACCTCT 
CTTCAGGCTACAACCTGCTACCAACATCAGAATTAGCCAGTtSAGGCTCTCAA^ 
TCAGATATAGGGTTTAGTTTAGAAACTGGTTTAGCCAAGTTGAAAT 

cactgctSctcattcagg^ 

TCCAGGCAAACACACGCTGCCAGGACCTCATTCAGGTGGTX3CC 
AAGCCTGTGCTTTGCAGGTGCATCTTTTGTCCCCAGTAAAGCTCT 

GGTGGTCTGAATCTGTCTGAACGACCCATCAAGCCACAAAACAAAGTACATTAGCAAGACAAAGCT 
CTCC CTGCCTCCrCCCCACCTCTCACTGGCTCTCCTGCTCTGCT

TGGCTCCATCTTCTTTGAATAAGACACAt^CAGCCCCAAACAAACCC^ 

GTGTCTCTGTGTGTGAGTG CAC!ACTGGTt3CAGTCCACAAAGGGCAGGTG CTGGC ATCAAG C ACAAAGCAAGGAAGTTGAGG AGATG 

CTGTCTATTAATGTCTTCAAAGCACTTATCAAATTAGAATGGCT^ 

TGAAAATGGCTGCAATCTCTAGGGGTTATTTGTGCCCAGGAAGTAAGAGTAGATTTTAA 

CCTTTAAATTAAAT ATTCT ATGGACATGGATTGGATCCAGCTAGAG CAGTT ATGCTCT ACCTTTAGTTT ATCAAGG CTAAGGCACT

TAAAAAAATAAACTAGGATAAAAGCTATTGTTATCCATTTACAATAGCAGC 
CAGACACCrCCACATGCTTTGCrTTGGAGCCrAGCACTGGGTCTTTGAAGTAGATAT^ 
TGCK3GTTCAGCACTATTAAGTGACTGACTGAAGGGTACACAGCTACT 
GACTAGAACCCCCTCACTTTCTGACAAATACAGCCTITCTCTTCCAGCT 
TTGGGCAAAGATGTGAATGTTTTGGATTACTTGATTT

TTCTCTCTTTCCAAAGATGAACCAATAAACAACTTATGCTTC 
CTTTCAAAATACTGACTCTAGGACAAACATTQCATGATTGCACr^ 
GGTGGTTACCAGGAGCGGCAAGGAGTGGAAATAGGGTGTTAGTGTTTAATGGGTAAGA 
TGGAGCTGGATGGTGGTAATCGTTGCATAGCATTGTGAATGTACTTAATGCCACTA 

AGTTTTATGTTACATATTTTACTAAAATTTTGTAAATGCAAAAATTAAAATAAA

GCATTAATTCAGAACCTCCAAGGGTGACTCCTGGGCATTGGAAATGTTTTAGAGCT 

G C CAAG CT AG AGAAG C CTG CTTT ATAGAACAG AGACAAACT AG AGAGAAATG AGGCCCATGTT AGAG CT 

AGACACACTGGGCGGTTGTCTGGACCCAGATTGTCCTAGAAAAGGAATAGCAGGCCCT 

GTGAAGAGACTGTAOSATCTTGATTAGTTTTTTGTCATTGTCATTGATGACTGATTT^ 

AACTGTACGTTCGTACATCTCAGAAATTTCTTTCTGACGTGTCCTC^

ATTTTATAGGAAGGATACTGGGGTTGC!AGAAGTTAAGTGGCCTTTTCAAGTGTG 
CACACCTTGTTCTCTCCCGTCCTCCAGGCrcCAGATGTGCGTGTCT 
GGCAAACCCTGCCGGAGGCCTTCCCAGCACACTCAGCrGGCCCAGC^ 
TGCATGCCTAAGCCCTCATTTCCTTAGTTCCTCh7GCTGA« 
xtccttgtttaTTAACTCCTGGAACTCATCAACCGGCCAAGGG^
GGACTCCCATTGAACACGGAAACTGAAAACCACAGCCCCACTGCAAGCGGAATGGCCAT

AAAAAGTCTGTCGGCTTGCATGCTAAGTATGGAAAAGAGAGGGGAGAAACTACCT^ 

GGTGCTGAAGCCCATGTTAACGAGAGGCrGGCTTCTCTCCAGGAGAGTCCrrA^ 

TTTTGGCAGCCTGGTAGGGTTTGCCAAGCACGTTCCTATCCATTATCCCGTTTGTC 

f^CTGGTAGTATTATTATCCCCCTTCTATAGATAATGAAAACAAGGCACAG 

ATGAAGCAGAACAAGACTCAAGTCCGGX3CCTTTGGTCTGTTCTG 

TTGATTCAAGCATCTGACAACATCn^AAGGATTCTrTAATACT 

ATAGCATACTGATTAAAAACTTGGGCTCTC^CTCAGACTGTG^ 

GTATAAATGAGGATAATTGTAGTGCCCCTTCAGGGATAATGTAAAACTCATAAATCAGC^ 

CAATAAAAGTTATAATAAGGATAGTAAATTGTTATTCTAATTTATTATAGTAC^ 

GCTAATTACAATTTTTTTTTTTTTTTTT^ 

CTGCAAGCTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCT 
AGCTAATTTTTTTTTGTATTTTTTAGTAGAGACGG 

CCCGCCTTGGCCTCCTAATTACAGTATCCCGCTAATTACAGTATTTTTGJ^ 
GAAGGATACTATACATAGATAGCCAAGCTGCTTTAAAAOTACT 



ATAAAAAATTAGTCAG<3TGTGGTGGCATGTGCCTGTAGTCCCAGCTACCCGGGAG^ 
GGCGGAGGTTGCAGTGAGCGAAGATCAGACCACTGCACTCCAGCCT 
AAAAOU^CTATTTAGGGAGACAC^CCTAGGAAATATGAAGGAAATATAAAACT 
CACAAGAAATTTTCTGAGTGTAATGAAGAGATGGAAAATCATTAATGCT 

GAGAG ATCTG AGG CCAGAG AAGGTGTGGACC CAG CCATGGT C ACACAGGG AGTGAAGACAAAT AT ATACAATTATTC CT ATGT AGG 

CAGATGTTATAGATACATGATACCTTAGTCAGGAATTGACCTAAATTCTGTG^ 

TATO&AGAATCTTCTCTGCATAGAATAC 

TAGTCAAACAGAGAGACAGACAGCTAAGTAAGTAAGGAACTACTCTGCm 
TCAATTTTCCCCTGCATTTAGCTCCTGTGTGGATGTCCm 

GCAGC^TGGCAGGGCTGGAAATAGATGTTGAAGCAAAGGGAAGGAGGAAGGCACC^^ 

TGTGAGCTTTCAAGTTGGGTTITGGAGACAGGAGCCCAGGTGAGCCCTT 

TCAGGAAGACACCCTGGGGTAGAACCTGTGCACCATGGCT^C 

ACAGGAAGCTCAGCGAAGGAGCGGAGAGCCAGGGACTTGTGTCAAAATCAm 

TGAAATAAGACTGACCATGAATTCATGATTGTTAAAGC!AGGGTC 

jvTATCITAGAAATTCTCCGTAACTCAA 

ATCTTTGTCCCTCGCGATATGCACTGCCTGTGTCAGGCTGATGCGCTCC^ 

TTGTGTGTGTGTCTGTGCAGGAGGGCGTGAGCTGGTGCTCAGTGTGGAGGTGGGCTGTGT 

GGG«3CTAGGTCTGCAACATCTGTACCCAAAGGAAAAGGCCTGTAATACT 

ACATCTCTCATGCAGCAGGTTGTGTGACCCCACAGGCCCCATCTACGTCAGCT 

TCACAGGAGAGCCTCTGAAGAACATTGTCATTATATGTrCTTCCTTTATC^ 

GATGATTGCTGGATGTGTCTGTGTGTGCATGCATTCGTGTGTGTGTGTtn-GTGTGTGTGTGTGTC 

TGGTCCAAAGGCAAGATTTATCATTATGGGATAGACAGTAGCT7GCTCTCTCTA 

TTCCTGAGTTAATGTT CTGTCTGCTITCT ATCTGACCCTCTTCGGAGAGTTGCA'i^ T T 1 1 T 1 1 CACGAGAGAGAGATTA 

AGAGAGAGAGAGACAGAGAGAGAGAGAGAGAGAACGAG 

TGGAAACAAAGACTCGTTGTCTCTAAGCTGGAGCAGGCAGG 

GTCAGACTCCAGCTCTCTGAATTCAATTTCACAGCTTGTCTGGGA 

AGGACTGTTATTTCTTTTCGTGGGTGTGCTGTCGGA^ 

CAAATTATTGGAAACAGCCCGTTTCCTAAGACAAGTGTAAATTTGTTCCAGGATA^ 

CCTTATTAGAGGTGGGCCTATTCCATTAGAGGAGGAATTTTGTGTATGTAGGTTTTACTTCT^ 

CAAACAACAATATCTCCCAATGATCATTCTCCCCAGTTTCTGTGTTAA 

rr^CTGTGGAGGGAACTCAGATGTCCTCAAAGCTACCAGAGGT^ 

AAAGGAGGCTGTCTTCTGCCTGGGAACAC^CCATCCTGGGGTTTm 

GAGGGACTGTGAGGTGATCAC CTG CCCTTCAATGTGTTCCT ATTGG AAGACCATCCCGATTCAGCTCTGTGGG ACOT 

AGTTCCAGAACTGTCTGGATCCCCCTCMAGCGTAGAGGTGTCATAGTm 

CCAAAGACCTGTCCTTCAAACATGACTTGGTTCCTGTCTACTCTATCTTCAACCA 

TTGCCGGGTATACCTTTCATCCGTCTCAGTCCCCACCCCCAGTCCATATTCTAGGAAACCACACT 

GATCAGACAGCATCACCTCATTACGGTCTTGTGCACCCTTCCCCAGGATGCTTTGCT 

CAATAGATTCTAGCCCAGCAGTCTTCTGTCTTGTGAGACGGGAAGCGTCTTGAAGGTAGGGACrr 

ATAGGCAAAGT ACCTT AAACTGTTC CC CAGCAGTCTGGTGT AGAACT AAC C CTTTGAG AT GTTT CT ACAAGGTGG ATG TTTTGG CT 

CTATCCACTCCAGCTGCTGGTGTGGCATTGCCTTTCCTCCCTACCCTGCCCT 

TTGGAAGCTGTGATCAGTCCCATCCCTGAGCAGGGAAGCAAAACTGATAAAGAGCAGATT^ 

CCCCTTCTTGGCCCAACCA^^ 

AGGACACX3COUVGCCGAGGAAAGGTGACGCTGC7VGGGAGAATGTACCrTCT 

AG ACAGCCACAATAAGAGCTAACATTTGTTAAG CAC CT ACT ATGTG CCAAGAAATTTCCTATTCATTTCACATTTAT CCTTCATAA 
AACCTTGTTAAGTT-GGCACTATGATTATTCCCATTTTACCT 

AACAGAAACAGAAGAGAGCCTGTTCCTGTAGTCCCCTCCCCIACCTAGGCAATGGCAAGGGCTCCJ^ 

TCTACCACCCCCATCT^TGCATTTCCCCCAAGTAGCTGATGTTGTCTAGTTGCTAAGATA 

CTAAAAAATCAGGAGCCTGACCCAGTGCAATCAATCTTATTTGC^ 

ACT AT AAAGAGGTTGTTTGC CAGGTGGG ATGGGG CGG TGGT AATGG AG CTG CAAAGT AT AT AAAAGAAACATG CTTTGTC CACG CT 



TATGAGTCAAATTCTTTCTTTCTATCCATCCCTGGA 
TTCAGACATCTTCTAGTGAAAGACCAACX3TATAGGAAGCATCATT^ 
ATTTATATAATTATACACATATCCTTTTGTGTTTAAT 
TTATATTGAGTAACACTCTATTGATGAGAATTTCAATAAACTGAAT^ 
CCAGC^CTTTGGAAACCCIAAGGTGAGAGGATCACTTGA^ 

TACAAAAAATAAACAAAAGTAGCCAGGTGTAGTGGTGTGTTCCTATAGTCCCAGCT 
GA^GCCCGGGAGATCGAGGCTGCCATGAGCCGGGATTGC 
A^AAGCAAAATAAGAGTGGGGGTGGATAAATTCAAT^ 
TTTCWGGGTCTATCAGTTTCATGGGTAATCCTCTGCCTCTC 
rrr^rCTTGGTAGATGAAGGAATGAGGA^
^CC^CCTCCCCTGCCCCAGTATTGAGAGCCAG^OT 

TACAGAGTTO^ 

^r^^TGGGGCAGGGAACTGCTGCCAGGCTGTGOTACTGTTG^ 
CTATT^AGGC^TG^C^CTCTACCAAGTTTCCCCTrGACACTACACGCTCATCCCACT 

C^^C^AATAT^ 
rlllrTTGm^CCAAATTCTCATCAACCAAArrCTTT 
CCAGACMAGATCTATGAGC^AGGGCATCTTCTA^ 
TTTCTAATCCCTTTTTOTAGATTGAATAACTCTCTCTCnTCAGAAGCC 

ATCCTTCTTCGAGWIAAGAAAGTC 

^ag^a^gSgIatatagaaagca atti 1 1 1 111 1 1 1 1 1 1 1 1 1 1 TGAGACAGAGTCTTGCCCTCTCGCCCAGGCTGGAGTTC 

tg£ctg1ctgc1^^^ 
^tc^^a^caaaLatccctc^ 

ACTCTCTCKGTCCCTGCCAMTATCTA 

aSSg^aSca^ 

TTCCCTGTTTGGAOTG^ATTCACGAGAAATTATGOAATTT^ 

^aI^cag^gccgaga^ 
aScIaIa^aIct^ 

cl^^GA^TAGGTCAA^ 

^r^^ATCCAAGACCACAGACTTGACAATCACCTGATTCAAAATATGTGAAAACT^ 

T^CCTT^ACTATTCCTCTCGCAAACATCAAAGAGATGCTGTCAAGTTrCTTAG 

TCTCATG^ATAAAAAACTAATTAAAAGGAATTGGCTTTCCAGAGGGACTTTGAG 

catccctccactgcctaCctccc^ctctggtttc 

ggcaagaatgaatttt^agagaaac^ 

gta^aaag^^acactcaaaggctaa^ 

tcaaS^cacagaagctg^ 

TGAA^TTG^AACCTATGAGGGTCGGAGCAGGGCAGAGGGCGTTGCGTA 
CGTCTACATGACCCAATATGGAA^^ 



g?agSa^tc^Sgcttc^ 

TCCCGGGTTAACTGACTCGAAGTC^^ATCT 
CTTCrrcCAGGTTGGTGGCTCGGAGAGGCTGCTCCTCCCCAAAGACTGG

TTTAjGAGGATTTCCTAGTCCAGCCCTCTACCAGTACACGTCATAACCTGTC^ 

AGTITOTCCCTGTCGGAGTCTGG<^TGAATCrrCCTCCT 

xctgccattgatttgagcatttctcctgtgcatcctcactctttcragaaagg 

TCCTGGGAGCATGAGCAGCCTTGCCCATTGCTCGX5CTGAAAGGT 
GTGCTGCC^CCTTCTTAGCAGCAACCAGAAATTCAATGAGAAGTTTCAGG^ 

AAGGTTTTATG ACACAGAGGATGAAGAGAG AGG CTTTTGTGGG CTGATGTCGCTGTG TTG CTGG CCT CCGGTCAGGTCACAAGGTG 

GAGCG CTGCATG C CGGCAGGG ACGTTTGGGCTTT CAGACAG CAGAGG CAT C AGAG CAG AAG CAAAT ATCTTCCCAAAGATGTTTGG 

C CAT ATTTCT AGGACCAGGTT AAAAGCTT CACACAGAAGCT AAGTTAACATTGGT ATGTCT ATG AGAGGGTCATGT CCAG^^ 

CTTTCGCCTTCCAACAGAGACTTTGrcGAAAAAAATCTCCATGCT 

GCCATTTAGTGTCAAATTCAGCCCCACTAAGCATCACATAATGGTGATACTGTTAA 

ACAATGGCTGGCAATGTCTTCATTCTCCAGACTGAGGCTCCGGCCTC^ 

CACSAAACTCCAGGCAAAGCTCTTTGGGTCACTCTTCrGA 

CAAGGTGTCAGCCTTCACGACAGTGAAAAGTTAGTTTGTCAGAACCAGGX5 

GAAACAGACCAAGATCATCAACTGGTACACTGGTGCCGACGGC 

TCAGTTCTCAGGAGGCAGACTCCAGGCACCACATTCGAGAGGGCCTCCA 

GCCTGAGCCCCACTCCATGCGCTGACGGAGAGTCC^GCAGGAGCTCT^ 

TGGGGATGTGAGCCACTGCGTCAGGTCCCrrGGTTCAGGCTTCTGCA 

AGACJATGACCACACTCCTAAATTCTTCTTGAGTGACrrm 

aaa<^cataaaatgcacaccactt:caccgagcagcattcaggccttc 

CATAAATAGGTGGATTGTGGTCCACTCAGAGGGGACACAGGAGTGATGTAGTCTGTACCCCT 

G CCTGGAGGCTT AATTTCGTG AAGAAGTCAGGGT AGG AG CACTGGGTTCT CTT cagct agtg atatcatgaacaat aagaagaatt 

GTGTCTATGTAGCAAGTAGGAGAAAATATTGGCCAAGTTTATTCATTCAACAAATAOT 

TGTTCTAAGTGCTTGGATGTATCAGAGAACACAACAGAGATCCCTACTC^ 

C CAT AAACGACAAAT ACAAATT AATT ACTGATT AAATT ATG TGAT 

GCGGGGAGGAGGGAATTAATGTGGATTTGGAAGGCTGCCCAACATGTT^ 

ATTCTTGATTGCTAACATCCACTGAGGCCACCTCTGTGCTGGGCACT 

GGCTGGCCTGGTTCTTCATACCCCTAGGGCAGAATGCAAGACTTTAGTTC 

AGAAATTGCTCTGAAAATGCTGGGTTTAACTTAAATCTTACATTGACCAGACGTCT 

TGTTAAACCCCAGAGCCAATGTGCTCAGGCTGTCCTTTGTGCAGCCAGGTACCA^ 

TAAAAAAGAAATC3AGA(X3GAGGCCTGTGGTCTTTGCTACTCT 

TTAGGGCATGGCCTGTTGCTATTTATGACTCTATTACTTCACAGTGAGAGCAC 

TCTGTCACITCCTTTTTCACCTATGTGAGGACTGTAGCATTGATTGGGACTTCCAA^ 

TGGGCCTGTGGGCACATTGXICCCAT AAAACTCTGCCTCCTTCCTAAGG CCCTGGGGCACCACAG AAAGATGGGC CTGCCAAGC CAC 
TGCTCTGATAAGGCATGCCTTTAACCAGGCTCCATCACTCAGAGCTAC^ 
GGGAATTCCATCATGC!ATTCAGATTTTTGCATGCCTGCTCTGTCCCTGGAA 
AGTCAACGATAACTATAGCCATGGCCATAAm 

TGAAGCOTCAGTAGACTAATGATTGAGCAGAAAAAAATGTTTAGAAAACTTGA 

CAAGTCAGAAGTTCAGATGACATGTACGATTCCTGGAATTTAGTGATTTTT^ 

CTGCrCCCACrAGAATTATCTCCTTCCCCCAGXSCCCT 

CTAGCATGGCTGCAAGCGCAAAACCTCCCTTAGTTCCCACCTGC^ 

CCTTGAACATGCCTCGGCCCTCTCTCCTCTCCCTCCACATTGCrCTACTTC^ 

TCACCTTGACATCTCACCACCCACCTCTGCCTTGTTCCTACCTGATC 

ACTCACCTCTCTGTCAACCTCCAGGAACCCTTTATGTGTCCTCCATGCCTTAGGCACA 

ACCCTCTCATACTCACATGCAAGGACGAGGAACTTGTTTCCTATTACTTC^^ 

GAAGTTGGTCCCAAATCTATTACCAGAGGCCTAAGCAGCAGCCTCCTCCTGCTCAAAGA^ 

ACTTCCTCTCCAAGACATCTCAGAAAATGCCCATCAGAGCCTTCCACCTCAT^ 

CTTGTTTGTTTTT AAGCAAGAAG AAT C C CTTT AG AGGAGGAATT AGG AAAG AAAAAAAAG T CAAACAGAAACAG AAGGAGTGGAGG 

TTTGTCTCTCTCTTTTGTCCCCTGCCTTGCCCTTCCTCCCTATCCGA 

AAAGACTGTAATTTTAAAACTGAACGGGCTTTTCCAGAGCCTTGAGGAAT^ 

TCCCTGTCTTTTGGATCITCTTAGCCAGAAACAATGAGGCCAGGCAA 

TTTGAGAGGCTCAGAAGGGGCTGAAGGGCATGGTTGTGTTTCACCCCGTC^ 

TGOTCAATGTCTCAATGG^ 

TATGGAAGCATGTGGAATCTTTCAAGAAAGGGGTTACGATCACAAAAGTTCGATTTGAGG^ 

AAGCCCTTAGTCATGCCCAGGTGGGGAGCAGGTCTCCAACCTCTCATTTCCTGATCCAG^ 

CTTCCCTTTCTCTGTGGTCATCAGCCCAGCCCTCTCCACTATCAATAGAAAAGCCCGAGAGCCCTCAGAG 

CCACCGCTGTGATGTCATCCGACATCATCCATGTGCTGAAATCAGGGACTGGAGTCGG 

TGAGACACTCTTCTTATGGGCCTTTCACCCAATCGAGACCTTGCCT 

GGACATTTAAGAAATGTCTAACCTTGTGTCAGTTTATATTTC 

TCATTTCCACAACCCCATAAGGTAGATAAAACATAAATTTTTATTTCCGTATTAAAATTG 
CCCAGGTTTCACAACAAAGAGTGTCAGGAAGAGAACTTGAACCAATCCTCT^ 
TCCCAGATAAGACTACAATCTTCTGAAAGTCAGAGTGTTAATATTCCAACCCCAGGGG 
CITTAAAAGCCATTGCTGAATTGAATTTTAm 

CTAATGTACCCCTATTTGTTATTGTTCATTAGACCCTTCTTATGACTCAGTCAGAAG 
GCCTCAACAAAAGTAAGTAAATGTTTTATAGTTCTTT^ 

CACAGG AG AG CAG CAAGCATTTGTTTCCCTC CTTTTTGAGT AGACATCAAAAT^ 
GTCCTTGAAAGATGATATGGG ATTTT CGTTTAGATTTTTG AG TTTTGTTTTATTGCGCT CGGTTTC 
GTTTTCTTGAGACAAGGTTCTGTACCAATGAATGGTCACTGTCACT 
TATAATTTTGCTCCCGTAACTGTTGAGGCTAAGTCGTACCAAT^ 
ATCTCACCACTTGGTGGAGAAATTGATGTCTCAGAACCCTGCCATCTCCT 

GGGGGGTCTGGGAC1ATCAG GTTCT CCACCAAGTGGTG AG ATG ATG ACATGTCGA^ CAAT AT C CC CCAGT AAGAGAAT AA 

GCWGCAAGAATGACTC^TGAACTCCTGTACCAGAAATC^ 

m n anr^AATTCTTTCAAGAAAGAGGTTGGGGTTACAAATT AATGCCCAAA 
AGAAATC AAGCACACACAGAACTCCT TTCTTCTGGTTGG T ATCAAGGAAGG CTTTGTGG AC CTG GT AG CACTTG AGTT AGCTCTG G
AAGGAGAGGCAGGATTTGGACCTGTGGCGATATGGGCTGGGGAAGACATC 

TTCCTGGAGCTTAGGATGTGTGATGCAGAAGAAGTCTGGAAAGGTAAGAAACAGAATTGTAAATGA 

GTTCrrTTAGGCAATGAAGAGTCACTAAGGTTTTGAACACT 
CAGTCAGACTAAGCCAGAAACCATGCAAGAAGCAATACAGATCTGTGATGCAGAAGTC

GAT ACCAAGGAGGAAACT CACAAG ACAGGGACAC CAGGAAG CAGCTGAT CTTTGCT CAG ATGT CACTTTCTC^T AAGG ACTTCTC 

T AACTTTCCGATTTGAAAATGAAATCCAC AG CCT CAGT ACCATCAGTG ATG CT ATACTGTT AG TTT" ACTTTTTAAAAATTTGAAAA 

TGAAAAGGTGACATTTAAGCAAAGACAAAATTTT^ 

ACCAGGAACTGCITGTTCATTCCTGAATCTCCAGCCTGGAGA^ 
AAGATGAATAAATGAGTCAGTGGAGAGTGGGCGAAGGAATGAGTGAGAAAGCACATCT

CTGGGTCCTACTCACTGCATrrCTTTCCCTCTTGCCACAGGTCCTCCCCT 

ACGGCCCCAGCCAGGTACCTGCCCAGGATATGTAATCTCTCCTTTGCTTCCTGCA^ 

AGGGTTTAAACAATATACCACATATGTTTTCTCTrATTTCTCT 

TAACAGGTACTGACATCAAGTAGAGACTTGTTAAATATCAAAGGTGAAAGAACT^ 
TCAATAAAGAAACTGAGGCCCGGGGAGGTGAAGTGACTTGCTAAAAACCATACAGCT

CAGATCCCCCAAATTCCTGTCCAAGCCTT^ 

AACAATTAAGGCAGAGGAACTTTGAAAGCCATGGACCTGGAGACTCCGTATTG 
CCCTC^CCTCTACCTTACTTGTGCTTCTAGTAAA^ 

TGCCTCCTGGAATTTCCACTCAAGCTTTTCCTTCTGGAGAAGCCTTCCTCCTG 
GCTGCTGTGCCCAAGCCCCAGCTCCTCTCTCTATCTTCITATGTGATGCAGCTGAG^
GGTAGGCAAACATTGTTCCCCATCTCCACAGAAGGCAGAGC^ 

TTG CTG ACTGTGGATCAT AAG T CAGATGT AGAG AGTTTT AAAGACTGTC CAAT AACAACAAAATT ATGCATT AGT C AT CAC CTGC C 

AAAAATAATTATCATATTAATGATGTGGATGAGAAAAGATGGTCTC^ 

GTAGAAACCCAAAAATCATAGTGATATTTTTAAAACTGC 

TGCCAAGTCGAGACCCAGCTCTCTACCATGATTTCCTCAACTAAGA^

G ACTTTTT CATGG CTAAATAAACACAT ACGTTTTGAGG AAAG TGAGATTTT AAG AAT AGTCATTTT AT AG CTAAAATGT ACACT CT 
GGTlTGTGCTGACATTTCTGGCTTGTGTGAra 

CCCACTCATTTGTCAGATGAGAAAACTGATTCCCAGAAATTGTCCAACAAGT^^ 
GGCTGCTTGGTTTCTAC^CATGTGTTTTTCATCCTAC^ 

AATG ^ TTTCTTTGTTTTGC

TTCTTTGTTCTTTCrGTAATTATTGAGATGAAGACCACAAATGT 

TCTGAAATCAGTCTTTCCTGTGATCTTGAAATAAAAGTATAAACCCTTATGGTTTTCTT 

AAAAACACTGAAGCAAATTTCCTTTTTTATTTCCTT AG^ CCCGACCAGCAGT CGCCT AGCCAACCCTG 

QTGAGTTTACCTTGG CCTGCAAGCCTTTTTTGCCAGAATG CTAAGTCCAGA

CACCTGAGTGGGGAGTAACATGCACCTCTTTCCTTGAAATGTTC^ 

TT ATGAGGTGTCAAGGTTGGTTAGCTGAGGATACTT CT AGGAATAAACACATC AGCACTGCACATGTG CACTT AGGTGTAG ATGAG 
TAATAAGTGGCACAGGTAGGTAGGCGTTTCAATCAGAGAAAATGGGCAAGAAAGAGAAGAAGACAACT 
TCTCCCACATGCCAGG CACAAG TTCATAAATATTCCAAAGGGTTTTATT^ 

AT AAAAGAG ATGTGTCCTGGAAAAGTTG CAG AT AT AGT CAATTTG AC CTTGT ACTTT AAAG ATTT AG T AT CAAAAAT AACTCT

GAGGGTCCTTCTAAACCAAGTCCATCC^GATTGGGAGCTAGCAC^CCCTTGTCG 
ACTCTGCCTACCCATT'CCTTAGTGCCTC^TCATTACTGC 



AAGCTCATGGCGCAGCTTC!TCCCAATG ACTCCCACAGTCATG CCT^ CTG ACAGGGAATT CCCTGGGGG 

ACATG ACCACT AATT AG AG ATCAG T C AGTG ATT CAGG CCITTCT AGATGAAGAG ATTCAAAAG CCTTG CCAAAG GG ATTGAGGGGG

GATATGGCTCACCCAAGATCACACAGCTAGTTCTGGGCAGAGTTTC 

AAGCATCTTAGGATCTAACCTCGCTTATTATACCCATCTCCAAGGCAAGCAGA 

TCAAGTCGATCCCAATGTCGAAGGAAACAAAAGGTTTCTTTAAAAGGATGA 

TCTCrGGGCTGAGGTGTTCTGTTCrCTCCCGTTTGCCTCACGG CGTGCAGGAAGCGGG CAGATCCAGCTGTGG CAATTCCTCCTGG 

AGCTGCTCTCCG ACAG CGCCAACGCCAGCTGTAT CACCTGGGAGGGGACCAACGGGGAGTTCAAAATGACGG ACCCCG ATGAGGTG

GCCAGGCGCTGGGGCGAGCGGAAAAGCAAGCCCAACATGAATTACGACAAGCTGM 

CATTATGACCAAAGTGCACGGCAA7SAGAT ATGCTT ACAAATTTG ACTT CCACGGCATTGCCCAGGCTCTGCAG CCACATCCG ACCG 
AGTCGTCCATGTACAAGTACCCTTCTGACATCTCCTACATGCCTTCCTACCATGCCC^ 

CAT CCATC CT C CATG CCTGTCACTTC CTCCAGCTTCTTTGG AG CCG CATCACAAT ACTGGAC CTCCC C CACGGGGGG AAT CT ACCC 

caaccccaacgTcccccgccatcctaacacccacgtgccttcacacttac^cagctact

CTAGCTGAAGCCCATCCTGCACACTTACTGGATGCTTTGGACT 
TTCTTTCTTGTTGGATAGAACCTTTGTATTTGTTCTTTAAAAACATTT^ 

AAAGAGATGAATAATTCCATC^CCAGTATGCCAGTTTGAATTCTCAGTCTCCTAGCATCTTGTGAGTT 
GAATGGTTAAGTCATC^TTCTGAGAAAGAAGCTGTACGTTTTCT^ 

GTTCIAGTATGACACAGAATCATGGACITAACCCGTCATGTTCTGGTTTC

TAATTTTAGGAGGACCAAATTCAGTGGATGGCAACTGGAACATTGATTC 
GGAAAGAAGGTTTGTGTGTTTAAGACGCCAAGGGCATTGCAGAATCCCTCTCAGTC 
TAGAAATAGTCAAGATATGAACTAAGAAATTTTAATGCAAATACATACATTCCT 
TTTTTTTTAAATGATGACAGTGGTCCCAGAACrTGGAAAAGTTC 

TTGTCAGACCATCAGACCAGGGCCAACCAATCAGAAGGCAACTTACTC

ATTAAAAATAAATAATTAAAAATTAAC^TAAATAAACGAGTTGACCTCGGTCACAAAAGCAC^ 
TTATTTTTTTTAATGTAATTTGTACATCTTTTTT 

CAT AAT ATG CCTATAGCTGAAAAGGAAACAGGGCTGTTTAAGTCACTC^^ 
AGGCATAC^CAAGCAGGGAAAAGATAATCCATTTAGATCTTTAATGCT 

CTGGGAAAAACAACG AAACTTT CC CTTGTGGAGAGG A^ CTG CTCT AT AT AAGCAACAT ATTTTT AG ACATT AAAATAT

ATATAATTTTGCAC^TAATTGTTGACTTTTTTAACTATATTAAGTGTTAA^ 
ATTTGACTAAATAAATGGTTCCTTCTCTCAGTGCTGAGGACAGTTTTC^ 

^^orTaTTTArrTCTTGCACTATaU^ 
GTTTTTCATTGTTGTTGATGATGTTTGTAGT 
G CAGCT AAGG CCATC^iAT AAACCTGT ATGT AAGGACTGGAG CAAAG CC^G CTGGT CTAT C CAG ACTTGT CTGTGAGATTT AACTCT
GCAGCCTCCCCTGGGCACTTCAGACCCAGA^

GACCAGGACTITATGGCTC^TGCAGATTTTTAAGGTCAT^TI , TCTT CCCAAGGAAG AAACTTG CCTCCAGTTCCTTCACTGTTAGG 
TAGCTTATTTTCATTTTCTCTATTTTACAATGA^ 

TGGT AAC C CTAGAACCTTCTTAGGGT AACACT AAGT ACCTTCT AG ACAAC ATG TCT AC CT AAATG AAATGGGATGTGTTTCGGAAC 
ATTTGTCTCCAG rnTTl 1 1 1 AATCTTGCACCCTG CCATTTAAAAAGATCTGTAAAGCACATATTCrCAACATATGCACATTGATT

TATAAATCATATATACAAACTGTTACATTATTCTTCATATTAGAAAACAAATA 

TAAATTGAAACTGAAATTCTACTCTTTTCCTCCTATTTTCT 

GGTCTTTAGTGAAAAAAGTGTATGCATGTACTGGATTTCTAGTGACTTCCA^ 

CAAGCATAGAAAGAGGAATGATGAAGGCCAGGTATGTTTAAGAACAAACT 
CAATTTCCC CAT CAATTTTG AAAT CTGCAAAGTATTCATT AGGG AAAATGG AGGCATATATTT CTGAG TG CTGT AAGAAGATGCTC

AGCCCATTAGTTCATTCTTTTAAAGGTTTTCT 

AGCTTTAGCC^AATCTCGCCCCCTGAATATGTTCCACAGTGATTTC 

GGGCACCATTTTCTCCTCATAAGC^TTTGTACAGAGGGGGCACT 

CTGGCTCCAGGGCATGAAGCGGACTAGTCCCAGAGGACAAACAGAATC^ 
GGATCATTGTGTGCCTGTGCCATGAGGTTTCTGCCATCTAACAAACAGTG

GTTCTCATGGAAGCTGTTCCTCCCCACAGTTTGGGACACTGAC 

CTATAACAAACTACCATAGACTGGGTGGCTCATAAAAAGAGG^ 

GGTG CCAG CAG ATTCAGTGTCTGG TG AGGG C CCTTGC CTTGGTTT AT AG AGGG AACCTTCTCACTGTGTG CTG AT ATGGTGT AAGG 
AGCAAGGGAATGCrCTGGGGTCTCTTTCATAAGGGAATTA^ 
T ACCTT CACATTGGGGATT AGGTTT CAGCAT ATGAATTC CAGTCT ATAG CAG AGGT CATCTTTTT

TCAGTGTCGAAAAATGTTGTTGAGTCCCACAGAAAGG 
TTTATTTTATTTCTGGAGATAGGGTATCTCTATG 
CTCCCAAAGTGTTGGGATTACTGGCGTGAGCCACCACACTGGGCCT^ 
ACAGCCACTAGAAAACGTAATGACACTTAAAAAGATTGTTATTT^ 

ACACAATGGGAAATGGATTAAGAGGTTCTCAGATAAGGTTTGCTAGTT^

AGGAAAGCAGCATCTGTCCACCCCAAGCCAAGTTACTGCCA 

CAGT ACTT CATGTGAGAGGTGGTTT CTGTGG ACTTGG ATT CTT AC CGT ATTTC CT CT AAT AT CTGTTCATT ACAAGCAC CATG T AA 
ACCTGTTGCCTTAAAAACTCTTCAAAAGGTGTCCTTTC 

CCAGCCCCACTCCCACACCTTGCTCAGAAGTTCCTCCACCATCAACAGCACCACT 

GAGGTAAGAGGATGCTTTGTATCAGTGGGGGCAC^GATGGGAGAGGTAATAAGAGCCGGGATGGGGGCCAG

CTGGTGAGCATACAGAAGCTrTGCTGTTArrCAGGAAGGAATGO^ 

GCTACTGAAACTGACACTACACGCCTTCAGCCAGGGAGTCCCGCTAAGGGATTATGAAAGGC 

CCAGGCT CC CC CGGGGGATGGACACTT G TG AG AGG AAG TGGG TT AGCCAGT CAAACCTCCAGGGTTT CAGC CAG TTGAG AAAGG AT 
AGGGAGGGAAGTTTTAAAAGGAATGTGGGCCCACAAGCCTGGGATCTGTCT 

GTTTCAGTCTGTCCGAGCTGGTAAATCCCTAAGCACATAGTCAATTTACACT

TT CCAGTGAGACCACTCACTCACCT ACAACTGCTAGATTAACCTTCTG CATTTT AACTGAGTTAACTCAT ATGTTTTCTTAGTGGG 
AGGAAAGAGGCAATTTGAGTAGACAAATGAAAATGAATATAATGCCAGAAAC^ 
GTAACTTGTCTTCATCGATTAACTAGGACCTGAGTTATACCTCCTGCATGAGTTTCT 
CCATTGTAGACTGCTGTACAGGAGGTTTATGGATAATCATCGTTGTCT 

GTCCTTGGTAGCTGGTGGGCACCCTGTTTGGGGAGTTCAGTGCAGGGTCAT^

GGTG CIAAAGT AG AAATAAGTTGGGTTTTCCAGAGAGC CTTTGCATTTTCCCCATCCACGGCCCTCCCATTGCCCCACTTGGG CTTT 
TGCCCTTCCTTGGCTCTATGTGTGCACCTAAAGCCAGGGGAGCGTGTGGCCAAGCTGCAGGATG 

T AAAGGAATGCTAAAAATGTGCTTTGACAG CAATGTTGGATGGGATTTTTCAATCGGCT AAGAAAT AG AGGTTGGGAG CTGGGATT 
T^ATGCCCGGCAAGCrCGGAGCTTGATTTTTCCATCTA^ 

GTAAGATAGTGTACATAAAAATGCTCTGTATGTTGAAAAGAACTAGGAA^

CTGTCTTCATCGAGAGGCAACTCCTCTTTACCTTCAAAA 
ACACTATTCTATTCTATTCTATTCTGTATTGTGAATATACATTGTGATT^ 

TT C CAG AAAAGTTAG AACAGT ACGACGAACAATTTTT AG TG AATCATTT AAG A^ CAACCATG CTTCATCAACCCCAAA 

TGCTTAAGGGT ATATTTCATTT AAACAACAAC CTTTT CCT ACAT AGTCCAGT AT AAT CAAGAG AAT C GAAT AACT AATGCAAGCGG 

ATTACTCCTATGTAATCCCGAGGCACCATGCACTGTTGGCTGATTCTCACCACTGGAGTC^

AAACACTGATTGTGTATAGTTGTCATCTACCATTAGTCTCCCTCGGTCTIT 
TATTCTGTAGAATGTTCCCCAATTTGGGATTGCTGGTATCTCTTCGCGAT^ 

AGAAGTCATGCTGCCITCCTCTCCTGTATTCCATTGCX^TGGTACATCATTTCAACGTATCCCGTTATTG 

TTTAAGGTGGTGCCTCCCAG CCTTCTCCTTTG AAAAGTT ATTC CTTT AC CCTTTGT AATT AATAAGT ATTTTGCGG AGGTTT ACTT 

TCAAAATATGTAAATATCCCATTCAACATCAAATTTCAAAGTATTCATTCA

TTATTGAGTGGATATGATCTGTTCTTTAAAGATTACATTGTCCCAATGGAAC^ATATTCACGCTGGCT 
TTCCCATCAGTCTTTAAGTGCTTCCTTACTTTCTCGCACAA 
CCTCATGGATTCCTAGTTCTTTTTAGTGGAAAATGCTATTTAGAACCCAATACCT 
GTCCCTGCTCCCAGGCTCCCTCAGTGTACAGAGCCAAGCAATAGACGTAGATAC^ 
TGACTATATGC^CTCACGTACATACATACATATACAGTC^

GTGATTTTACGGTTGTGAATATCATAGAGTGTACTTCACACACCT^^ 

CCTATTGCTCCGAGGCTACAAACATGCACGGCATGTTACTCTACTGAATACTGTAGGTGACTGTAATACAAT 

TATCTAAACATGGAAAAGGTACAGTAAAAATAGGATAAAAGTTAAAAAATGGTACACCTGTATAGGGC^ 

TTGCAGGACTGCAAATTAATCTAGGTGAGTCAGTGAGTGAGTGGTGAGTGAATGTGAAGG 

6 5 AGACTCTGTCAACATTGTACACTCAGGCTACGCTAAATCTACAAAACCTTTTCCTT^ 

GT AATT CTTT AT AAAGTTCTAAATTTTTTAACTCTTTTACTCT^ 
ATAAAAATATCTTCATATCCnTATTCTATACACTGTTTTTATTT 
TGAAGACATGAATTCATATTTAGCCTATAACCTCCCTGAGGCTGTTT^ 

CGTATATCTAAAATCTATCATTCTAAAATAATGATAAAAATCTATCATTTTAAAGTAATGATAC^ 

7 0 ACTATCATTATOUVGAATTACGTACTCrACACCTC 

CACAGACATGAAGATTGCATTGCACTTCCACAGXKSCATGAGC^ 
CTTCTC^CACCACCATTGC^TATGGGGTCTTTCACGGACCTAA^ 
ATCTATATTTATTTCTGCTATCTATATGACTATCCATATACATCT 
TGA<3TTCACTCAGAAGCATtCATTTCTAA7C 
7 5 AAAGAATGGTTTCCATTAGACATCCTATAAGCCCTTCCTTGTCAATCC^ 
^rrc^CA^CTCTGTGT^^

^rSce^CCAGCTGTAGCCCTGACAGCC^^ 

ft^OTCCCATCAGGACGCGCCACACGGA^ 

rrorT^^CCCATGGCTCCTGGAGTCAAGACTTCAGTGGGAGGGGGAGGAAG 
TGCT^ACACATTTACAACAGCACATATCTCATTGGGTGTGTGTATGTG 

!?SSt^cctgcttccatattctct 

^Stttoacatccagaatctggcaagcagct 
^taataa^cgaggcacagacaagtggagaaaatgaaatgttaam 

™£^ttgttgtgttctgtcam^ 
Sg^aatIIgggaa^^ 

ctSS™^S™ac^ccaatctttggtgttaattggtttcc 
StSttma^gcc^gagccgcttoctttcagaaatgtttaatcaa^ 

TTAAAMGTGGGGAATGCAAACTCTACCCATAGGGACTTAGTCTGTAGGGAAGAAATATGAACTTGCCACTA^ 
AAATTA^^TTATAGCTAGATCTGCCTTCATAArmTATTrCTGATGCTTCT 

S^StScct^^gaatccaagacacaaaactcaaatc^tccaa™ 

^^GGGCTGCCATTTGTGCTCAAAGCCAACACTCCAAGGTTTTGTGTAGATGAGAATGAAGAGGGAGAAG 
ATTTC^TGCCGGGGGCGGTGGCTCATCCCTGGAATTC 

A^CCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA^ 
^A^A^CMGAGATTCAGGAGGCAAAGGCAGGAGGATCGCTTGAA^ 

™tatgccagcctg«3cgacagagtggacttggtctcaaaaaaaaataaa^ 

AAGAGCTAT^CACCAAAGACACAGGCACCCCCTATGACGTCCTGCTCATTTAGGAAACCCATATTC 

CGT^SaSSt^^ 
aSttI^^C^^ 

TGCTrcCTGGAGGGAGCCTATTCTACTGCCAGCATTCTGGCTAATCAGAATGGATACAGAT^ 

gcctgctggttttcctgggcgag 

SccIaSgcacI^^ 
cSaa^gaaggosctgcgaggtcaggct^^ 

AT^ACG^ACTATTAAGGAGGCTCTGTCGGTGGTGAGCGACGACCAGTCCCTCTTTGACTCAGCGTACGGAGCG^ 

tca^ccagtca^tc^cgtc^^ 

AAA^GCW3CAMCTGGTGGGCGGAGGCGAGTCCAACCCCATGAACTAC 

c1££a?SaccaccajSm 

GGGCCA^AAAOTA^TACAGCTTGATGGAGATCGACACATCCTT^ 

S^ta^^ttctggcctcaacaaaagtcctccccttggaggggcacaaacgatcagtaa^ 

^^atccgtat^atcct^^ 
^gSgctctc^cWgccaacgccagctctatcacct^ 

Sc1™gacc^tgcack 

rar^^l^GTCCATGTACAAGTACCCTTCTGACATCTCCTACATGCCTTCCTACCATGCCCACCAGCAGAAGGTGA 

CTCCC^T^T^^^TCTCACTTCCrCCAGCrTCTrTGGAGCCGCATCA^ 

TACCCCA^C^CAAOT^CCCCGCCATCCTAACACCCACGTGCCTT 

SScSSgc^tggcaactggaacattgattgtaaggccagtgaagttttcaccc^ 



nTr&arn T ATG AACTAAGAAATTTTAATG CAAATACAT ACATTCCTG AAAGACGGGGAATTAAATT ACT AAT 

SSStgctc^^a^aaaagttgtagggatttctaaact 

rS^Sc^CC^CAGAAGGCAACTTACrGTATAAATTA^ 

I^aattaaaaattaaga^^ 
™agctgIS£S^ 
^St^cc^tgtggag^^ 
aataaatggttccttctctcaaaaaaaaaaaaa 



"^cg^a^a^aagg^gct^ 

Sc^GCT^TCGG^G^C^TCCAACCCCATGA^^ 
r a anATGACCACCAACG AGAGGAGAGTCATCGTCCCCGCAG AC

^^TAAAGGAGTACACKTITGATGGA^ 

<£^CTTCCTCCGCGC^ 

OTATAATACAACCTCCCACACCGACCAATCCTCACGATTGAGTGTCAAA 

GGGGCAATAACATGAATTCTGGCCTCAACAAAAGTCCTCCCCTTGGAG 

^^C^TCCGTATCAGATCCTGGGCCCGACCAG 

^GGAGCTGCTCTCCGACAGCGCCAACGCCAGC^^ 

aS^CCAGGCGCTGGGGCGAGCGGAAAAGCAAGCCCAA^ 

AAAAACA^ATCACCAAAGTGCACGGCA^^ 

GA^AGTCGTCCATGTACAAGTACCCTTCTGACATCTCCTA^ 

ctccccItccatcctccatgcctgtcacttcct^ 

TACCCCAACCCCAACGTCCCCCGCCATCCTAACACCCACGTGCCTTCACAOT 






WO 03/008583 



PCT/US01/51291 



Table 46 

MOUSE NOMENCLATURE 
ICSGNM Tk2 
Celera mCG21547 


HUMAN NOMENCLATURE 
HGNC TK2 
Celera hCG1640726

MOUSE SEQUENCE - GENOMIC

AGAATCCAAGAGGACATGACAAGTGCAGTGGAGGAAGAG<KXjGCAG cttcctgttt agccaaagaagggatggagagt cggggggc 
GCACAAGGCAGACCAGGAGAGACGAGGTTCTGGGATGTGACTTGGGAGAG 

GTTTTTTGTTTTTG'rrT TTTT CT AG AACT AACTTTGCAATCTCTTG CTGAAGGG C CAC CT AGACCT CAC CCGATGATTTCCATTTG 
TCTGTTTTGACAGCTGTGGTAGCCACTGGAACAGTTGAT^ 
1 5 CTGCTCITG^ACTCTTTTCCrCCCGTTGAGlTCT 
TGTCATCCTTGGTTGTrATCTCTrAGAAGTGTATC 

GGAAGAACTCGGAGCAGCAGAGGAAGAAGAAACTATAATCAGAATATATTGTAT 
TTTTGTTTTTTGAGACAGGGTTTCTCTGTGTAGCCCTGG 
ATCTGCCTGCCTCTGCCTCCCGAGTGCTGGAATrAAAGACGTGCACCACCACGC 
2 0 AAAGAAGTGGGGGGGGGCCAAAAGTTAAAAACATGCTCTAATTTAT^ 

TCCTGAATCCCTGAGAGCATCCCCCAAGCTTGGATTGATTAGCAGTGAGCAGAGAT 

TGACTAAGGTGTCT^CCTAAGGCAGGAAAGGTAGrTAAGGTTTAAGGTTG 

TCTCATTATCTTGTCAAGTCCACACGTGTATAAGACTAGGAAACAGAAAATAACGGTGACTT^ 

2 5 GTTCAGGGGCTTGTCCCACGATGAAAAAGATCATAGATGTCACAGTTACCACCT 

ATACAATGGGTTTAGTTCCTATGGGGAGGTTGGGTGCATGTGTGGTACT 
ACCATGAAGTTACACATATCCCTAGCCCCCAAATGAGGTCTAAAACCAATAAGATTTCTGTCA 
GGCTAACGTGGGTACTGAGTCCTGCTTGTGGTTTAGAGGACT^ 
GGTCTGTCCCCTCCAATTGTCTCCTCTCCCTCTCGGTCT 

3 0 GTTGTCl^CTCTAAAACACTCCTCCCAAAGCCCrTTCCrrTTCAC 

GTTTTCTGCTTCTTGCTGCTTCTCAAAAACATCTGCATC 
GCTCAGTAATCTGCAATCTCCTATTGGACTTCAGTTGAATGTT^ 

AGGAGGAGATGGATAGAAACATTGTCAGAAGCAAAAATAAACCTGTGTGCATTTTCATGTAAG 
TTAGAATTAAGGGACAGCTGAAGTCTGGAAACAACCTAAATGACACCCAGAAGTGAATGGATAA^ 

3 5 ATGCAGCTCAATGATTGGTGTGAGAGGTCACTAACTTAGCACATGTGATGGCCTTTGTTAAT^ 

ATCCATACACATACCACATACATACATGTATACACCCACACACAAAAACATACATATACGCATC 
CAAGTCC^TACATGCATGAAAGGCAGGGGAATAGGTCTATCCATAATGGAATGTTAGGJU^CAAA 
TATACTATTGACATATCACATCGATGACTCTCATAATTAGATGGATGAAAGATGACAAGA 
AAATTCTAGGAAGTAAAAGGTCATCTGCAGTTACAGAAAAAGCTTATTTATACAA 

4 0 ACAGAGAG<^CATGAGTGGACCCCTGGGGCTAGAGTAGGCAGGAAGAAGGGATTATCGTGGCCATGAGGAA 

GG AG C CAAT CATG TG C CACTTAACAAAT AGATCCACT CTGGGAAAT ACX5GTGAGTG ATTTTGT CGTG CAAGTT AT CAT AAAAAT AC 
AGATACZTAAGATGGCGCTGATATCACCAGGTGACATCGTCTGATGGG^ 
TTTGCCAGGCATGACTGTATTGATTACCCTGATTATGGTGATAATTT 
ACACTGGAAATGCAAGTGTCTTTCCCATGAACT^ 
45 AAATTCAAGTCTACATTTTCTCTATTATATAGTCCCCCTTTCACCATC 

AAAAATTCGCCTCTCAATTTCTCCATGAAAATTCCAGTGGTCAC1TCCCAGG 
CAGCTGATCACAGCCTCAAGTTTCTTCACTTAGCACCCAGGAC^CCCTGACTTA 
CTCTTTGCATGTTCnrCCCTGCCCTG CCTTTACTGCTCACAG ACTGC 

TTCTTCATGACTCCAC CCACCACTTGACTTTCAATACGACCATCTCTCACCCAAAT CCAGCTCTATCCAGTTCAGTTTCGCTACTC 

5 0 AraSCTCTCACAGOTATCTCTAATCTACTACTGCATTCAAAGCT 

GATACCAGTGTGTCGCACCTACACTTTTTCATATGTGTAGGCCCCTAAATTCCC^ 

C CTGGTCAAAATG CAAAAATGTTGACTG CTGTGTGTG ACAG CTGACCTTTGATGGTTTAATGTGC^CAAACTT'rTGCTTCATG TAT 

GAAATGACrTAAAAACACTGTCTACTACCCCGCAGGCTATC 

GGGTTCTATCTTCGAGATATCTCATTACACACTTGTAGATACTCCAAATATCCT 

ATGAGCTCTAACCCTGGTACATTCCCGCCTCATTTACT^ 

TCCGTCTCACTTAACrCTGACCGATACCATCC^CAAAGTCTAACCCCAAGACTACAAT 

CTCCTCCTCACGTTCCGTTCrCTGCCTTTTATTTCTCTTATCCCCTAGC^ 

ACTCTGCH'GCCCrATCTCCCATCCCAATATGTAAATGTCTTGTTCACT 

6 0 TACACGAAGCACCAAACTTCTTATTGTTCAGAGCAGAGATrCTTAAGGl 

AAATTTTCTTCTGGTAT ATCTCTCGCTTCTGTATCCT CTTT CATGTTTGTGTCC 

TGGACTAAATGGAAGCATTCTATGAGGCAAGAACTTCn'GTTCTGTGCCTGGCATAGAGG 
AGGCGGAGGCAGATCTCTGAGTTTGAAGCCAGCTTGATCTACAGAGTGAGT^ 
CTCAAACAAACAAACAAACAAACCAQUUU^CAAG^ 

6 5 TCCT/GAGTTCAATTCCCATCAGCCACA 

AGCTACAGTATACTCACATTAAATAAATAAATAAATAAATCTT^ 
AAAAAGAAAGAAAAAAGAGAAATTTTGTGGTTGGTGACATGGCTC^GTCAGTAAAG 
AAAACAACAACAAACAAACAAACAAAGAAACAAAAACAAACAAA^ 
TATCCTTCTAATCCTACCATTGTtKSAGTTGGAGACTGTTTG 

7 0 CrGTCTCAAAAGCCAATGTGGGTGGCTCCTGAAGAGTCACATATTTTGG 

GTCTTGGTAACTAAATGGAAGAATTCTGTGAGTTAGGAATCCACTTTTGT 
CTGGAACACATTAAATAGTC^CATCGTGTACAGCTITG 
TCCTGGAGCTGGAGTTAAACAGTTGGCCAGGAGCAACTCAACATG 
CTCTCGCACrTGAGC CATTACCCAGTCTCTCAC 
7 5 TATATTGCTCTGGCAGAGGGCTTGG A Tl"l X >TTrTCCAGCATCCCATC 
TGATGCC CTCTTTTGACCTTCAAAGG CAC CAAACACTCGTGTGG TG CAT AT AAAT ACAGG CAGGCAAAACACTCTCACAT AAACGG
AATAGATCTAAAATAAATAAATAAAGCAAGCAAGCATCTGGGTGGTTGGGTGGTAGA 
GACCAGCCAAAGACATGCTATGCGAACCTGTTAAAACCCAAATGGGGCT 
TTCCAGAGGTCCCGAGTTCAATTTTrAGCAACCAGTTGGTGTGGCT 
5 TGTCTGAAGACAGCTACAGTG TACT CACATAAAAT AAAT AAAT CT^ 

ACAGAACAAAAAACAAGTTATATTAAACTTAAACTCCTCCCCCCTCCCCAACT^ 
AAACTCCAGGTGGCCAGTCACCrTCTTGATTAAATAAGGCAAA 
ATATGAATCATAATTTGGAGCTACCTCTCCGAATCACGGATTTCIX^ 
ACTTGAAGTTTGAATCTCCTTTACAACATCCTCTTTAGTGCCC^ 
1 0 GTCATTTAAGTTTGTCGCCCCTTCAATATAATAAACTC 
TCAGCTCTCCAAATTTTTTGAACGGCTTAAC1T1T 
GCTTTTAGGTATGGTTGAGGAGAAAGTTGTATTGTA 

TG AG CCATGAGAGGAG AGGG CAGGGGAGAGAGG AAG AGGAAG AGAGGGGGACGGGG AG AG^XIAGA 
GAGAGTGTGTGTTGGGGGAAGGGGGTGGGGTACCAAGAGAGTCTGCAGAGATGGCC^ 
1 5 GGG AAATGAATCCCAG CCTCTGGG CTGGAGAGG TTT AAGG T AGGGG CGGGGTGAGAAGTT CTGGG AGGAG CT ACAGGTACAG AGTG 
AACCCAGTCCCTGGCCGTTTGTACTTCTCTAACGAGC 

AGAACAGACTGCAATGCCCCTGTTAGGGCTTGGAGCCCTAATACGGGCAGTTT 
TAAGTTCGGTAACATCCAATGTCCGACTTACAGTAAAAGACC^ 

AGTGGTGAGACTATACAGCGCCTGGTCTGGTAGAGGTTATCACCAAGTTCGTAAAACAAGTC 
20 GCGTGTCCCTACATCATCTGAAGGAGGAAGCCACTCTACCCTATAGTTCTT 
TTGCGTCTGGAAAGGCGTGGCTGGAAACCAGGTGGCCGAGCAAACGCTC 
CACTCCTGACTACCGACCGCCCTGCATTCCCGCCCTGGTCCTCACCAACTO 
AGGGCCGACGGCTTACGACCGGCCGTGGAGTCTCCAT CTTGCCCACAGACC CAACAGT 
AATGGCTCTTTCCAGGACTCX3CGCCACCAGCTCAACTGCC<5GGGCT 

2 5 TGCGCACTAGCGCGGGCCGCCCACTGGCTCCGCCTC 

AGCCAGGTCCCCACCACTGCTCCTCTGC CCTGG AGCTTTCCTGTGG AGAAGCACT CAAGAATG C 

CAGCCCAGGCTCTTCGCCATTTGAGCATCTCAAATGCCACCTCCTTA 

X TTTTTTTTTTTTT T 1 1 rTTTTTm AGATCCCATCCAGATTACAAGTATAATATCCCCCAGGAAAACGTTGATTCACTGGTAGTA 
AGTTAGACTTAGGCAAGAGAAGTTCCGATGTTCTGTTAGTCTGGTTGACCAC^ 

ATGTGCACATGTAATAAAACATCCAATGGTATCT^ 

AAGCAAGGATCTGTCTGCTTCTCTCCTCGAAGGGTCCCCATGGTCTGGGCCA 
TATGGCCrcGCCCItnTAGTCTGTCCTTGCTCTCAGAACG 

TGTCCCACAGTTTATAAAGTGCTCCGGTGTAGCCAGGACAAACCATCTTGGAGC^ 

3 5 CACTT CCATT ATG TG AATT AAOT CTCATGAlTi'Cl'iTTGGAGAACTG AAT AAC CACC CGTC CT AGTTTG CTT CTCTGTG AT AAAAC 

AAACACTGACCAAAAGCAATTTGGGGGAAGAAAGGGTTATTTGGCTTACAG^ 
GGAGGCAGAAGAAGCAGAAGCAGAAGAACTAAGAGTC 
GCTCACCTGCCCAGGGACAGCATTGCCCATAGCAAGGAGGCAAA^ 
CTTCATCAATTAGCAACAGGJATATGCTCCACAGACAT^ 

4 0 TCCCGAGGTGTTTCTAGTTTGTGTTGCCTTGGTAAGAATTAACTAGCCCACAACCCCAGTCCAT^ 

CATCTAAGATCTGTTCATTTGGCniAATCTGAT 
CACACACACACACACACACAACTGTCXSACTTTGAATT^ 

CCTGTTAGGGTGATGGCTTTTrTGTTACAGATCrAAAAGAGTAAAACAAAACCCAGA 
ACTGCCAGCATCTTGCCACCTGGAAGAAGTGGAAATCITCr^ 

4 5 TGGGCTTATTGGTCTTGAATGGGTGAATTATAGTACCCAAGC 

CCTTTTTTGTTCATTCaGGCGAGATATTCGTTATGCTTTCCCGAGGAACA 

CATTTGGGAACAATGGCTTTCACGGCCCTCTCACCrGTCTCCCT 

CCTTTGAAGCCCAAGGACAAGCTTGGAAAATGTT^ 

CATCCTGAGTAGGCGGTGCGAGGTGTGGATGGAGCCGGCAACCAAGGAAGTAATCGGATATACTAAGCAGTC 

5 0 GCCATCCATTCTCTAAATCTACTTGGGTCAGGGCCATTCTGCGGT^ 

GCCTAGAGGGAGAAATACTAGCGGATCCCTTAAGGGGCGGGGCCGGACA^ 

GTAGGACTTGCACATCAGCCAGGGACTATGCTGCTGCGGTCXi CT CCCGGTC CCTGCGTAG CGTGGGCCCGGG 

GAGCTCCGGGAGCCCCGGGAGCCTCGATTCGGGCGCCGGGCCGCTGTGGGCACCTCG^ 

TTTGTG CCACCTCGTAGGGGCCAC ACTGAACCCCGCC CCCTCACCTGCTAGTCCCCAGGAAGGGT CTGGGGACTGTGGG ACCCTAA 

5 5 TGAACTGTCAGGATCGGCACrTAGTGGGGCTCAAACGCGGCTTCrrCGGCCC^ 

TTAGGCTTGGGAGGTGGCTAGTTGGGGCACGAGCTGTATTTAATCAGCTCr^ 
GTTCAGGCTTGCGACTTTCrCCrCTTGCTGGCAGTGCGGATCrrT^ 
CTCGGGCTGCTGCTCCCAGGTGGGTTGGCAGTGTTGCCAGTCTTGGCCTAGACAGC 
TGTAGGTTATCCCTGGGTGCGCTGTCAAGATCCTAAGCACACAATCCAGGTGA 

6 0 TAAGCAAGCACACATGGGCCTGGGCCCAGTACAGCGACACATACGGACCCAACT 

GTAAG CCAGGGAACACGCATACG CAATGACTGTGTGGTCATTTATCAGGTCAGCACAGAG CAGAGTG AATAATTGAAACATCG ATT 
TCCTTTCCTTTTTTAGATAAAGATAGAGAAAATGATAAGGAGAAAAAAGCAGTGGTAAGTTT 

AATTTCGGTTTGGTTTTGTrr i TT TlXITlTri '' lT ' ri ' T ' l li^GGCGGAGAACAAAAACAATG^TGCGCACAGTGAAAAACCAAACC 
CAAGACGATATCCAGGGTCACAAATCTCTCTTCCTTCTTGTGGCCCGGTTCCTCTCC^ 

6 5 GTGTGTACGG<5CGGGAGTGGGGCCTCGAGCTGCATTCAGGTAGTAAGCACTCATGTTCAGGTGT^^ 

ACTTCCTTCTGCTGTCTAGTGGAGACCAGAGAATCCA03AGGGACCT 

CTGGGGGTTGGGG CTGTGAGGTCAAAGGAAGTGCAAGTTAG ACAGGTTTAAATGTTGCAG CAGT ATTTTGGTTACCACCAGCCCAA 
GGGGTTCAGTTTCnTCCTTTGAGATCATCTCACCCCAGGCTGGCCT 

7 0 GGGATTTCAGTrTGTGTGTCACTCCCAAGTGGTCAGGGCCCGTTTCTGTA^ 

AAGGCAG CTGG AG CACCTT AG TG CCT CTGG AGG GTGGAAGGTGTG CC CTC CTGT AC CCCTTGGTGGGG TGG AGGT AGTCAGG A 

GGAAGGCGTGTCCCTGGGAAGCAGGAGTGACATCACAGGAAGAGGAGGCT^ 

GAGCTCTTGTTGATGTTACAGAAGATTCAGGTTTTTGTTCCT 

^ATCTCACCCCGCCTCTGCTOCT^ 
7 5 TATAAAATACATCTAAAAAGTTAGAGGAGGAAGGTGGTGGTGGCGGCACAGGCCT^ 
AGTTGAAGGCCAGCCTGGTCTACAGAGGGATTT^

TAAATAAAATGAATGAAGGAGAGAAAGAAAATATGGATAGCTAGTGACATAG^ 
AAGTGTTGGACCTGTTCTGGATATGGTCTGATGCACCCCTGGGTGTACAGTAGCAGTTTGTAG 
CTCAGGCTCAGGGTAGCCATTTAAGTACATTTTTGTGTGGGA 
5 ATTGCGCACGGTAACCTGACAGACGGGACAGTAAAAAGGACATCAGTGAA 
GGGGACTCAGGACTCTGTGCAGCCTTGGGAAGCCTTGATGCC 
AGAGAGCCGTAGGCTGGCTTTATAGTCTGTCCTTAGATAGTGG^^ 
AGAGTCACCAAAGGACTTCACCATAAACTGTGTAATTTTTACT^ 

TGTCTTACCACGTAGCCCTGTCTGAATTGGAAATCACAGATCCTCCTGCCTCCGCCTCCCTAG 
10 TGTAACCTTTGAGGAGTAGAAATTTATTTGTTTCGTGTTTCTGGG 

G CTG CAT ATG TGTTT CAT C C CATGGAAGAGAAGAG AAGACAAGACCT^ ACCCACTCCACAG AT CATG CCTTTTTTT ATCAGG AACC 

CATCCACTGATAACTGCCCTACCTCCACAATACTACT 

CAGGTACACCTGCATGCCAGAAGAGAGCATCAGATCCT 

GGACCTCTAGAAGAGCAGCCAGTGCTCTTAACCACTGJ^ 
1 5 AATCTCTCTGCAAGGTCTAATCTCTCAACACTGTTGTACCGGGGACT 

CGTAGCACTGCCACGCACTGTGGTCCGACrTGCCTTTCAGGGTG 

GAGGTACAGAGGAGTAGCGAGGTCTCCCTCrrAGGAGTAGGAGAAATGCTACTTGTTTC 

GATTGGCAGTTCTGTAGGGTTACGGTGATCATTACTATTGTATTGCrCCTCCCCTG^ 

TG AACT CACAGAGAC CAC CTAT ACAATGAGAATGTTT AACTTCCTTTTTTTGAT AT ATT 

2 0 AGTTGGGTATGGTGGTACAGGCCTGTAATACCAGCATTGAGGAGACTGAGGTGAG^ 

CAGCAAGACACTCACATACATCCCCCCCCCC^tfUVTAAA^ 

AGAAG AGGACAT CGG AT ACAATT ACAGATG G CAG ATGGTTGTG AG CCAT CATGTGGTTG CTGGG AATTG AACT CAT GACCT CTGGA 
AG AGCAG CCACTGCTCTTAACCACTGAG CTTATCTCTCC^G CCCCCCTCTGTCTCTAAAT AACT AAAAAGTAAAAATG AGAG TGGGT 
GGATTTCCGTCGTAACGCGTGCTTAGCAAGCATCAGGT^ 
25 AGTGGCrGGAGAGTGGACTCTGCITCTGGGGGAGAGTGGGCT 

CCTACAGCAGCACAGCCTCCTATCTCTGTAATCTCTTACT 
CCAGATTTTCCATATGATGTGCTGGGCTCATAGTAGGCCTTCA^ 
AAT(^CCAOTATTGGTGGTAACITCTAGCGAGGGTGTC<?AG 
GACCCAGAGGATGTGCAAGAAGGTTCTGAAGGAGCAATCrrGTCTTAGATTGA^ 

3 0 CCAAAATATCGATTGAGGGCTTCTCTGTACAAGTCAGCA^ 

TGATAAAAGTCAGGTTAAGGAAATGGGAC!ACTTTTATACTAACAC^ 

TGACTGGGCAGACAAGTTGTGTGCAGTTTAGGTGACGTAGGTGGCAGGGA 

GGGCAGATAGCTTGATTGGTATCTATGAGGCTCTGGGTTTGACCCCTAGCT^ 

3 5 GGGTTTTAGCITACCTCTCAAAGTCCTGGCCCGAACCAAGAACCTGGCT 

TGCCTAATGTACATTTGTCTTCTGCCAGGTTTGTATTGAGGGCAATATTGC^ 
ATACAACAGACGTCGAGGTAAGGCTCTTACAGTGGGTTTGTAGGAACCCGAAACA 
GAGCCCTGTCCAGC^GAAGGCTGATGCGAGCTGCCAGTGAGCTGCACGTCrGATCCGOTGC 
GGGGGAGCTGTAAGACAGCCCTGATGTCAATTAACATGTCAACATGAAGTCAGTC 

4 0 TGTTTGATACTCAATCCCrcGGCCTGGAGTTCAGTTAGTGTT^ 

ATTTCTCAACTCCGGATTTTCTTTGACTATACCGTGCTGCCA^ 

TCGGCCTGTTTTCAAGTGAGGACACAGCCCCCTTAAAGAAGA 

GCTTGTGGCCCCTGTATGCAGTGGTACATGAGCTCCCACCATTCCT 

AACAGT ATGG ATCATCCTGGACAACTGCTTG CTTCITGGGGAGCTGGGCTTTTGG AATGTGCCAGG CAGTG ATAGTCTGTGTAGTT 

4 5 ATGTGACCCCTGAGCTTCCAGGCCTAGGTCACCITTCCTGGTAGACAGTACTTGAC^ 

TTGAGTCTGTCTTGTGTGACrCCATTGGGCAAGGATACl^ 
TGGTGCTGTGACCTCTCATGGCAGTTAATATTGGCTGTCAGTC 
TTGAGTGGTGACCCTCTGAACAGAACTTGAAAAGTGCTTATATCOT 
CAGTAGCTAAAATGATCCAAGGCACCGTGTTAAGTGTTTATGGCATATCT 

5 0 AGGTAAGCACTGTTATCTCTGCTGAGGCGAGGAAGCAAAC^ATAACTG^ 

TGGCAGCTTGCCAGAATTAACGACTGCAGCAGCCTCCCCCACrCAGCXSTGGTTTa^ 
TTAAGTAGAAACTTTGGGATGTATTTAAGTTCrGCATTGTTGTACGATG 

CTGCTGTCTAGCAT AGGAAAGAAAAGCATGG CAAGTGTT AGGCTT^TTGTACCTGAAGCTTCAGG CATCCGCTGGGGATCTTGCAA 
CCCCAGGAACAAGGAGGCAGCACTGTGCCTGGGGGCTGATGAAGTGTTACCT 

5 5 CGATATGTAAATTGGCAGCCTTACTAATATATAATATAACAGGTCATATAATT 

AAAGGATTTTCCCCCCCTCITCCTGCTGTGGGACACCAAACT 

CAGGCTCCAGGGTTTCAGCCCCATACTTAACATTACGCCAGTGA 

CTTGCATCCCTGGGACCCAAGCTGGAGACGACCTGTCATGAC^ 

G AATTTCAT CTTGT ATGTCAGTTTCT ATGTT CATGCAATTGGATCTAC CTCTGAGG AGAGT C CTCAC CTAT CAAG TAGTGTGTGT C 

6 0 GGGGCCAGTCCCAGCAAAGGCrrcTGGAGCTTCAGGGATGCTGTCC^TGTTTGT 

G AGC CT ATT CCC CTCT AG ACTG AGC CAGTTTGCAGT CAGT CACAGGAAGGACTT C C CCAC CT AG AAGG AG CTGTGG CCGTT CAAGG 
GTTCATCCCACCCTTCGTCTGTAGTGCrTGCCrrcCCrGTATTCT 
TAATGGAGCCTGTGCTCAAGTGGAGAAATGTCCATGGCCATAACCCTCTGGTGAG^ 
CTGTGAjGTGCCTAGGAGTGCTGGTrCTGAGGCCTGTGCCCAGCACA^ 

6 5 GGGAATGGC^AGCCAGGGATTTCTGAGGCCTGGGTGGGTTTGCAC^ 

GTGGCTTAGCTTTCCTTGGCACTTGTTCCAGGTC^^ 
GAATGGATCCCTGTGGTTTACCCTGGAGCTACCCATGTTCCrTGCC^ 
CCTTCTCCCTCCTAC^CTCrGAAAGCTTTGTGCTGCCTGCT 
AAGGACTGTGGTCCTGGGCTCCCGTGCTATGGTATGGGGTGCTTAGTATTC 

7 0 TGCCTCXSCAGTACCTCGGCAATACCACGTCTAGA 

CTTTAGGACCCACrTGCTCACTCACTCAGGATTTATT^ 
AATTTAATAAAGCCTCTGGTCTCCTGGCCCTAACATCCACTGT^ 

GG AG CTGTGAGG AGAG CT ACAG C AGG TGAAAGGATG CT AGTT AGAT AAG AGAGAAG CTCACAGTTTT AAAAT AGGAATCGCAC^ 
GCTGTGACAGGAATCTCACTTGTXSTGCATGTGGGAAGAAGATGGGA 
7 5 CCTTCATAlWWNNtfN^ 
NNHNNNTAAGACAAGGGTAGT^

ACAAGTGTTATCCTTACCATCTCTTCATCATCATCGTCATC^^ 
ACACAAGACTGCTGCTTGTGTGACAGTTACTGGCT 

GCTTCTGACTTTGCTGAGA rr ii Ll I lGACCCATAACTAGGAAAGCCTTrAGTTCTCAGTGCTGAGCACCGAGTGCGGAAGTGCAG 
AAAGGAAGCTGGCGTCCTGTCACATGCCTGTGATCT 

A^ACACACTOTGCAT^ 

AGTGCTTTTATCTCTAGGTGACAGAAGATTGAGGGCrrGGCCrrCTAT^ 
nrATTTGTCAGAGACCTACATGTGAGGGAGGGGGCACrc 

G TGGGGGC CTG CTTGG CATGCTTGCAG AGCACTAAGGAAGTTTGTGTG CCTGG AAG CTGATTGGC CAGAGAAAGAAGGGT AAGAGA 

CAAGTGGATGT^AGGTGGGTTTCTAGGGCCTGTGGTTCJIGATGGGGGTTTAG 

GAGATGTTAGCAAGCTCATTTTGGCTGCTGGATAGAGAAC 

TTTCCTCAGGCAGCACATCCTGGTCTGGCTGTTAGGGGGCTCT 

ATTCCTACAGCTGTCCTGAGTACATCTCTC^CCTCAGCCCTGCCCCr 

GGGGCCTGCCTGTACTTGCCACTTGGACACAAAGTGCTTTGCTITCCTCC 

GG C CTCACACTG CAGACGT ACGTGCAGCTCACCATG CTGG ACCAG CACACG CX3C CCT CAGGT ATGT CACAGG TCT CACTTTCAACC 
TTATAAACATCCTTCAGATTCAGGTGTTGTCAGAT 

GACTTCAGTAAGGTCTCCTGTCCTCACAGGTGGCTGTGCTGGTGCCATCCTGAGCT 

GGAT AACACACACAATG CAAC CTT CTGAG ACTGT ACTT ATGG TCTGTGAAGTGT CTCCTTGAG T CTCAGGC ATCT AAACT ACT ACC 

CTTGTCTTACTTGATATCAAGCCTTGAGTTCTCTCTGACTTCTAGAAAG 

ATGCACACAAGTGAGATTCCTGTCATAGAGCTGAGTGATTCCAGCTTTTGACCATCC^ 

CATTGTCACCTGGGGGCTTGTTAGAATTGTACATTTCTCATCTCCTCCCCAGATCTACTC 

AGGAGCCTGCTTTATCCAGCTGTGCAGGACTCCTGGATATGTGTTCTGCGTGTCT 

GGCTTTCAGATTCTCGAAAGAATCTGATTCACCTTGGGTCACAACCAGGTGT^ 

CCTTCCATTAGC^GGAGCAGGTTGTCTTAGGAGGTCAGGTGGCAAGGGTGACCATGGTTC 

AGGTAGACCCTTTCCCTGTAGGAGACAGGATACCCCTATAGCCAGTTACGTTAGCACGATCA 

TGTGGCTCCCTGCAGCTGTAGGTGTCATTCTTCTGTCGACATCAGTCTGTATCTTTACAA^ 

GAAGAATATGAATTTGTCCTTTCCAATGGTTTCCTAAGTGACAGAGAGCAGAA 

CTGCTTTGCCTAAGTGGGCATTTTAAATTATATCCGATGATCTGTGAATATTATGGGATGTGGGCT^ 

AGG CAGAGCTGG CTGG ACT CAG CAGG AG AGAGATGGCTGAAG AGGGAG C CGGAAACAGATGGGAAAGAAAG AT AGGAG CAG CACGT 

GCAGAAGGGACAGAAGGAAAACAGATGGGTGCX3CAGAGCCAGGAGGCCGGGAGGTAAGCAGCG 

CTCTGGGTCACACTGTCTACTAGGTGTCGGATGGGCAGGTCCCTCCT 

CCATGGAGGGTGTGAGCAACTTGCAGAAAGGTGATGTGCAGCCT 

GCCTCCCTAACCCTACTCTTTTGGGGCTCTGOT 

GGCAGCTTTGCCAACGTGTTCC^TATCCTAGTGAGTCTCCCTTCTCCTGTGAGAGCAACTGT 
CCACATGGCTGCCnTACTCTCCTCTGTCCrCTAAACTTCAGG^ 
GGCAGATTTGGCTTCTGTTGACTGTTGTCCTCCCTCAGAGTCTCT 
AGTTCTT<^GGGCTTCCTGTACCTTCCTCCTCGCCTGAG 

GGAAAGGTCAATTTACAGCGCAAGATACATTTTTGTAGAAAACCTGTATAGAAGGTACTC 

ACTGATTITGTGTATGTGTGTTTGTGTATATATATTAATATGTATATGTGTGTTTAGGTGTAG^ 

AGTGCATAGGTGTGGGTATCTAGGTGTGGGTGTGTAGGTGTAGGTATGTAGGTGTGGGTGTC 

GGTGTGGGTGCGTAGATGTGGGTGTGTAGGTGTG AGTG CAT AGGTGTGG ATATGTAGGCATGTGGAGG CCAAAGGACAGC CTTGGG 
TGCTGGrrCTCAGGAAGTATCTATATTACTTTAATTArr r riTCl 1 rAGGTTTGTTTAGTTTTATGGGTATGAGTGTGCTTACATA 
TATGTATGTGTTCCAGGTGTGATCATGGAGGACAGAAGAAGCT^ 

GTGGGTGCTAGGACCTG AATCCAGGTCCTTTGAGATAGTAGCAAATCCCTGAGCCATCTCTCCAGCCCT i GAG 

ACAGGGTTGCTCACTTCTTAGTTATCGCTTCTAGGCTAGGCTGACTGGCCAGCATACCT 

GTGCTAAAATTGCAAGCAGATAGCCrCATGCCCGTGCTTTTACATGGGATCT 

TACTTTACCATCAGAGCTGTCTCCCTGGCCCT AAACT ATTGCTGGTGGAGTT AG 

ACAGTATTGATGAGCTTCGGTTGCTTTATAATGTTGTGTAACAACCACCAACCC^CT 

TCAGTTTGGGAGGTCAGCCAAGTGGCTCCITTTTACCrT^ 

TGACCCAGGTGGTCCAGGCAACTCATTTCTCTTCCTCTCACCTGTCATCCCCr 

TCATGTGTGTGGCATAGGGCAAAAGTCCCAGTGGACACACAGCGGCATG 

GGTCCCATGCCTAGGGATGGCAGGCGAGAGGGCCAAGGGGGTCCATGGCACAATGC^ 

CTTAGAGCAGTTAGTCTCCCCCGTGCTTATGCTGTGTATAGCTTCTGTCCTCAG 

ACCCGGGCTTAGGGCTCCAGTGTGGGCAGGGCCT^ 

GCAGACTGAGGTCTGAAGTTTCAGCCTTTTCTGAGCTGTTTCC^ 

CCGATTCTTACCACCTCTTCCTGCTCTGAGCCTGCAGTGC^ 

AGCTTCCCAGACCCCTCCCTCCCATCACAGCTGTCCTTATGGTTCTG 

TGTCTGTGTGTCTGTG CCTGGTGCCCATGGAGGTCAGAATAAGG CATTGG ATCCTCTCAGACTAAATCT ATGTACACAGCTAACTC 

CATTTCCTAGGTGGAAGCAGCAAACAGTTCAAGACAAAGGATAATTATCTACATCTTGCT 

AGTGAGAGGCTCCGATTTCTCTTATTCAGGTCATCCCTCAjGGCACCACTGA^ 

CCTCTATTACCAT ACTG AG CAAGTCCCCTTGTACTG AGGCCXi CCCIAGGGCTCTCTGTGTTT ATCAG^ 

GTG ATGTAAACGAGGTCCACCIAGAGGGCAGCCACTGCGT AACTACAGGGGTTCCTG CATCCCT ATGCTTTAGC CCTTTTCCTGCCG 

GGGAGGCCTGGCCTTCTGTCCTGGGAACAAGACTGAGCTCTGAGAGTTGTCCTTGG^ 

TGTGTTGTACTCAGCCTGTGGGTTCCATTTCACACGTGGGACCTGAGGTTTT^ 

TCCCAGGTTTTTGTCCAGGTCTAGAGTCGAGCAGCTGTCAGCCAAA 

TTCAGAGGACCAATAGGAGGAGAAGGGAGTTTGGGGTTCTTATACGGTCACTCTC 

GTAG^GGTGCTGAAAAAAC^GAGGCTGATAGATCCAGGACT 
GGTTCTGTAATGCAGAATTAGCTTTAGTGGTATCCTTAGCCTGTATTTGGACTAAAA

CTGGGCACTGGT ATGCTTCCCCACTTCCTGTG CTGCCCATGGCTTG CTAATGTGCAGTGGTGGAAAGCCCTTTCATAG ACGGGTCT 

AGTCATGGCTGACCTGGCAGTGCCAGGTTCATCTTTTCCCCAGGGATCCCT 

GCTCTGCAATCTGTAATTGAAAGGTTGGGTTTCTGGAGAAGCCCACAGATATGAAT^ 

TCTCCGTGTTCTTGCCTTTGGGATCrrGAACAAGTAACTC 

GCTCAGTTACATCJVCTAGCAGTCCGAGAGCTTTGTGTGGATT 

ACTGAGCTGTGTGCACTGGGCTGCAATGCAGAGCTAAAGGT 

AGTAAGCTCTGTGGCTGAGGGATTGAGCAAATGTTTTACAAGACTC 

TGGGTGCAGCTGGGAACATGGCGTTGACTCACCTGGCACATGGGAACAATG 

ACCAGGAGGCCAGGCCACAGAAACACGGGGGATCCAGGTTAGCCCCTCAGGGAGGACAACAAA 

GAGGGCAAGGCTCTGCTGGAACACAGGGCTAGAAGAGGAAACAGTC^ 

GTGTGTTATTGTGTAGCCTAGGCTAATCTGGAACTAAGAATTm 

CAGCCTCCTAAGTGCTAGGATTACAAGCAGGTGCCACITCAACCAGCCT^ 

AACTAAATAGAAACATGGTACAAAGTATGGTGAACAGTGGGACTGTC^ 

TTGGTCAGGGTTACTATTGCTTAATGAACACCATGACCCATAGCAGCTTGGGGAAGAAAGTA 

GCTCAGCATCAAAAGCAGTTAGAACAGGCAG 

GGTGCTGCTTACTGGCTTGCTCCCAGTGGCTTGCTCAGCCTGCTCG 

GTGTGGAGCCCAGATAAGGACACAGTCCCTGCTCTATCACATTTATACCTTCTTCT 

TTGCTTTCGGTTAGACTTGCTGGTCTAGGAGCTCTATCTC 

CTTTTT , m TT CCCT ATGGGG ACTTGAACTCAGG CACT AATGCTTG CAG AGT AGG CACT CTT ATTT ACTGAG CCTT CTTC C CAGCA 
TTGCTTTTTT AGT CTT AAT ATTT AAACTGTG T AATAGTGCCG CCTCACTCTTGTT CT G AGCACTG AT ACTTTGCT CAAGTTT AAAA 
AGTAAAGAGCCAAGGAGGAAGAGTCTACAGTCTGCAGCTCAGAGAGAAGCCCT 
CATATGTTTTTTGTGTGGTTGAGGCATATGCATACAATTTAGCCT 
TGTCTGCGTATGACTTTCCTACATCATCCTTCAGCTACT 

CTGAACAGTG GTGCATGGCG AG CAT CGATGAGCATTTTCACAG AGAATGAGTTCT AG AAGTGAAAAT^ CTGATCCACTCCC 

AACTTGAATGATGTCTCCACGTGTAGAGGGCAGACTAAGATGCTTTGTCTCCTTGG 

CTCCTGGGGCCAGCTCCCATCTCCCAGTGGTGGAAGGTCAGACAGACAGATGCT 

TGTGGACCACACTATGCAACATGCTCTTGGTGCreCCCTTCC^ 

ATGGGTGAATAAAAATCTTTGACCTCTGATTGCAGTGGGAAGATGCCCGAGGTTG 

GATCGTCAGGAACATTCATGTCTCTGTTGATCTGATAGGTAAGACAGTCTCTCT 

GCTGGGGTATACTTGGGAGGTTCTACCITGGATCGACAG 



ACTTCTGGTGGCCCCCATTTGTTCTGTTrTGATGGCTTTTCAGTA 
TGGCTTGCCAGACCCTGTCTAAGAATGGAATTCCITTAAGCT^ 

CCTGAGGGTCGGGAATCCTGTTGGTCCCACAGACTGAGCAGGCTTGGTGAGGCTCCTGC^ 
TTGTCTGCTCCCACACCAGGAGCACAGCACTCAGTCGCTTG 

TACCAGAGATTAAAGATGAGGTGCCGGGAAGAGGAGAAAGTCATTCCGATGGTAAG 

AGCCCTCTCCTTTCCTCCACCATGACACTATCACGCITATCTGGAGCAAA^ 

CTATGGCTITGGAGCCCTTGTGGGTCCCAGGGCCACTTTACnTCCTATGT 

ACAC^CACACACACACACACACAC^CACACACACACACACACACACACGTACA^ 

ATATACTCATTTGTTCTGTACAGTAGCTTCTCTTC^ 

TTGCATAGTAGCrrCTCTTCAGCTCTGA^ 

GAACAGCATATTGGCAGAGTTTGTTCCCIATTTATCTTTA 

ATGTGTCCTGGGGCCTAGCACCTAGCAGTTGCTTAGTTAATACTGGACTGAGTtS 

GATAGATAGGGGCAGGCTCTTTTGGGACATCATGGTGCTCTGAACCTAGGATGAGACCT 

TTAATCCCAGTACTCTGGAGGCAGAGGCAGGCAGATTrCTGAGTT 

AGAGAAACCCTATCTCGAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAGATGAGACCTGAAW 
AGCCAGGTTTTGTTTTGGACTAACAAGAGACATCCCAGCCATGAAGGCCTA 
GACAGGAGTGATAGAGGATGGAGCATTGTGGGCTGTGCTGTAATGAGGCTTGGCATIT 
TCCATTATCCAAGGGGTTGCCAAGGTCTTTAGGACAAGCrrTGGC 

TGmCcXrGGCAGGAATACCT 

CCTGTTCTGGTAAGTCACTCTCCCTCCCAGAGAATGTCAC 

TTAGTAACATGTCGTACTCATCATCACC^CAAAGAAGTCCTCAGTTACTCTGGACCTG 

GGTGTCCATTCTTCAGTGGGTCACACGGAGCCCATCGTCACCCATCGTCATAGTTAGGTGACTGGGCATCT 

GTGGTTGTCCTCAGTTGTCACACAGTGGCACCTCAGGAGCTGCTGAGGTC 

GCCAATGTAGCAGTGCTGGTGTGTCCTACCTATCCATATGTCCTACTGTGTGTCCTGTGCCCCAGTGTGCTCCT 

CAGGCAGGCAGGTGTACTTG AGG CCGTCTCACATCCAGCCGGTG ATCATGGGCTTTTGTTTATGCC CTTGGGCTGT AGGCCACAGA 

GACATCCTGGTGCTAAGAGACAGGTGATAAGGGACATGTCACTGTGGGAACTTTCAGAGTT^ 

AAACCTGAGAGGGCTGGGAAAGTGGCAGAGTGGCAGAGAACTTGTCTAGACATTGGAGTCC 

CAACTCCTGTCTCAGTGAAAAGAAAAGGCATTGTTGGTGGGATGTG 

GTGTGGTTCTGACCCTCTTGGCCTCrCAGCCTCOT 

TTATTTCCAGGTGATTGAGGCTGACCACAACTTGGAGAAAATGTTAGAAC^ 

AGAACTGGAAG CATGG ACCCTAGGACTGAGTGATCTACAAGACAG CGCCAGG AACAGCCCAGC CAGG^ CTAGGA 
GCATTTGGAAG AATCT ACTCTTAGG AGGGTCTGTTAT CTAATCAG ATCTATTTTCTCAATGGTCrTTTCACTG CAGT ACAAATGGC 
TTTTGTCTGTCAGATGCTTGCAGCATACATCCCTCCTGGTCCCCCT 
CTGACTGTATCCCTGGGGTTGTTCTGGATCCTCTGCATC 

TTGAAGCCCAGAGAGGTGTGGGGACTCGTCAAGCCAGAGCATTGTGGAGGGAGCT 

TCCTCCTAAGCTGrrCCCCACCGCCCTGCTGCTCTGAGCACCAGCCCCCGCCT 

CATCTTTCAGTACCGGGCCAGTTCAAGGAGACCAGACATCACTAGGTGT^ 

CCTGGTATGAGGGTACTCTGACCTCTATCCATTCCTTTGTCACCTGGACCT CCTCTG C 

ITGGTGCC^GAGTCCTTGAGGATGAGTCCTCCCCAGACTCTACCAGTTCCCTG 

CCTGGACACTTTACCCTTTGCTGAGCGAGGCCAAGGGCAGCC^ 

AGGAG CAGTGTC C CtrrTGCTC CAGGGAG ACTGACAG TGG CCTT AACT C CTTTGAAGTGTTTTTC CATCTGAGCAGATGTCATGG AC 
CACGAGCATGAAGAAGTCATCTCTATTGGATCAGAAAATACCTTAGCTTTG 
ACTTGAATCAGTCATGACCAAGAGAGGCCGGCCATTCCT

AAG C CITG CTGCTTCTTGACATTCTAG AATGGATGGAGTG CATCTGG AAAAGACTGGACACAG ATG AT ACAGGGG AATGATGACCG T 
ATACCCTGGAGAAGCACAGCACAGGCTGGGGAGGGGGCTGCAGGGGAGCCCT 
GGGCCAAGAGTCCAGCTCITGGTCCTTTCTGTrrGGACCAATTCTT^ 
5 GTCCATCAGCACCTGCAGTCCAGCAATGGATAAACCTGAGAAAGGCAGGAACTT^ 

AATTTGCAAACGAACAGT CT AGT ACTTGC CCTTTGCTCT ACCAGAAC^ CTGTTTCT CTTTCTCT C 

TAGTAGCCAATAAAATTCTACAGCTAAACCATGATCTAAATTCATTTTCA 

TATTCTCTCTCTCTCCCTCCCTCCTTCTCTCCCCCTCTCTCTTTCTCTCCCTCTTCCCT 

CTT^G AGTCAGGTT AT ATTT^ AT AGACTGGC CTGG CCITGAA CAGAAG T ATTTTTGCCT CAAAl i ill G AGTT ACAGG CATG 
1 0 TGACACTATGTCTGGCTTGTGTGTTCACTTTTGAGTT^ 

ATAAAAGTATGCCTGGTGACCATAGAGGCCAGAAGAGGGCATTGGATCTTTTGGGACT 
ATGCTAGGACTTGAACCTAGGGTCCCTAGAAGAACACCCAGCGTGCTTAACTCCrcA^ 

CAAACAC CACATTTTAT AGTAT AG TG CGCATGG CT CAAAAAAGAG CCTGCTGT CCTTG CACTTCAACACTG CAGACAGGAGATCAT 

CTCCAGT AATGGCT AAGGTTTGCAG AACCTT C CAGCTCAAAAG ATCACACT C AAAACAG C CAGAAG CAT CTTTC C CAAGGACTG CT 
15 ATCAGG CAGACCT GGT AGGGG CAGGG GTCATGGAG CAAGT ACCAGCACTAACAT CT CATGGC CAG AAAAGGG CAGGT AACACATGG 

TGGGCCACAAGGCCTAGTTCTTAG ACACAGGGCATCACTG CCTTCAGCTCTCTC CGAGTCGCTGGTGGAGCGTGGCAGGG ACTGGC 

ATCCCTGAAGAGCTAGAGTGTTGTCACTIXIATCCTAACCAAGACC^ 

TTGGAGGGTCCCTTCTCAGTGTTTGGAAAACGACCTGTGGCTAAATAAATC 

CTGGCCTCCACGATTCCTCAGGCTAAGTAGTAGGCTACTGAGGTGTGGTATTCCT 
2 0 GGACTCGCTTCCTAAACGAAAACATTCTTTTTATTTTGAATC^ 

AT ACACACATATGT ACCTGTG CACAGTGTG CACGG AGTCCAGG AG AGGG C C CTG AATG C CCTGGAGCTG T AGTTTCATGTTGTGAG 

ATGCTGGACGACGGTTCCGGGGATCAGATATGGGCTCACTGTGTATTT^ 

CTAGGGATAGTTCAGTAACTTGGGGCTCrTCT^ 

ATATGTACATACTGTGCACACACCCCTCCGTAACCAGGTrACCTAGCAATATGGGCTCACT 

2 5 C CTTG AAACAATGGGG TGAGTTT CTCTTCCTCT AACAAC^ CAAAT AT ATTCCT AGTG TGTCTGTGT AT A 

AAGAATAGAAAGGCTGGTGTGGCCTTC^GCTGGTTTTGGGTGTACATGC^ 
TTTACCATCTTATATGCCTATGCCTAGCAGGGTGTGTGTATGATGAATGTCAGATGCATT 
GATGGTTAGATGTCCAAAGCTATCAATCAAAACAGAACCACCTCTTA 
GGGCATCCTCACCCACTCGTCTTGACAGTGTGTCCCTCrCC^ 

3 0 AAGGGAGCACATTCAGAGACCTTTTAGATCrrGTAGGGTCAGGCTGGAATC 

ATACAGACATAAAATGCCTTTAGTACACACCTTCAATCCCA 

GAAAGTGATGTCTAATTGAGTGGTAGAGTGACAAATGAGAGAAAGArrTGACAGAATA 

GAAGGAGAAGGGGGAGGCGGGGGACAGGTCTACCAGGACAATATTACTGAGACAGATTG 

CGAGAAGGAACCAGAAGATTAGAAAO^CTGCCAAAGTTCCGAGGTCAAGCCGA^ 

3 5 TTGAATCAGTCAGCTTGGAGAGAAGTTTTGAGCAACTGAGTTGAACCAGC^ 

TATT'CAGCAGTAAGCCTCXiAAGATGCCAATTATATCTGG 
TTTTTTTTTTTTAGCCATTAAGTTAATTTATTGAAACAGTTG 

CATATTAATGGAAGCAGTCCGCTCGACyATCTCAGAAGGACAGTGTAGATCACTGAGTAGCTTGTGGATCCTCC^ 
CACGTTTGGAACCTTCACCACAAGGGGTTTNNNNNNNNNKN^ CTGAGGCATGGCTT CCTG 

4 0 ACCGACTTGTTCTTCAGCAGGAGGGAACAGCGGTGAGTGGGGA 

TCAGAAAGCTGAGCCCTTATTTTCAATTCTTTGGG 
ACCAGTAAGAAAGGCAGCAGGTTCACAGGTTCTGAGATTACGATACAAG 
CCCAAGTCTAGCTGATACTCAAAGTCATACCTGTCCAATATTGTGAAACTT^ 
GGGAGATGCAGGACTTTAGGTCCTTGGCGATTCGGGCTGGTTTTGTGTGTCAACTTGA 

4 5 . AGCCTCAGATAAGGAAATGCCTCCATGAGATCCAACTGTAAGGCAGTTTCTCAAT^ 

TGGTGCCATCCTTGGCCTGGTAATCCTGGGCTCTATGAGAAAGCAAGCTGAGC^ 
CTCCCATGGCCTCTGCATCAGCTCCTGCCTCCAG^SATCCTGCCCTGT 
GCAAGCGAAATAAACCCTCTGCTCTCTACXSTTGCTTTCTGG 
GACCACGAAGTCTGtCCXSTGGGAGCCAGAGTCTCCACTCrCCCTGATTTTTG^ 

5 0 ACCCAGTATGGAGCTGAGCCTGGTTITGCTCTGATTCTCTCG 

CACTAATCACCGTGTTTGTCTTCCTATCTL'TITC'ITTAATCCCTTGGCTGT 
GCCTCCTTTTCCACTCTCCCAGACCTCGAAAACTGGCAGACTCA.G 

CTTAGTAAGTGAACC ATTTCCCCAGATCTAGACACCTT AGTCC CCAGC ACCCATTGGGCCTAAGAGCCACCCIACTTCTAGTTCT C A 
AAGGGAAGCCATGATTGGGAAGG AGAGTGAAAACGCTG CGTATGGTTG CTG AAGTCCCCTCTCCXXCCTCIACATGTTTGAACCTGT 

5 5 C CTG CTTCTC CACGG CAG CTT CCTGGCCTG ATGC CTGC CTCT ACIATTTGGC CGGCTGAAG CCrGGTTATACTCTCTGGGCCCTCAA 

G AG AGTGATG CG CT CAGAGAG AAC CTGCT CT CACTG ATG C C CAAGCACTT AG CTG CAGGG AAG CAACGGGGATTAGTGCGGAGGTT 
GGACAACGGCTATTCAAGAACTTCTTTTGTCTCAGCAGGCCGGAGGACCT 

ATGAACTTGGGCTCATAAAGGAGAGAGATCCAGTTAAGGATGCGGCCTC^GCACAGGGTCCCAGGCCTGAGACT 
GTCATCAAGCTCATGTGTAACACCAGGGAAAGGGCTGTTTTCTGT 

6 0 CTGCCGCCAACCACAGCCACTTAGAGGCTCCAGCAGAGTCCTGTCCAGTCCAGTCA 

GGAAGAGAGAGCATCTCITCATTCTGGGTCAGGAACC^ 

GGAGACIAGTGTGGTGGTCAGAGTCCAGAGGGACAGTATATCAATCCATGCCCTTGGACT 

GACTTATCACTTGGACAATTTACTC^CATGACCCAGTAAGTATTCTTT 

TGGGGCTCAGAGCCTGTCIAGAGGGTCAAACTCTCCATTTTA^ 

6 5 CCTGATGAGGGTGACAGTTTGGTGGCTGAGACTATAACAGGGCTGCC^ 

TCTGCTACAGCCCAGTGGTCAAAGGTATACCTAGCTCCCAAGCTCAGACCCA^ 

AGAG AGTTAGCATGGGCTKKXjAAGTACCGCGAAGCT AGGCAG AAO CGG 

TGGCCTTTATATCCAGTCCTTGGACAACCATGTTAGTGTCAGA 

TGTGCAAACGAGAGCCTTGCATGGAGAGGGTACTTGTGTGGGCACCAAC^TTGTCT 

7 0 ATGTAATGGTGCCGTGTGCATGCACACGGTTTGTATATATGAACACAGGCTTAACCCAGTGAT^ 

ATGGACTGCTGTTAGCCTCGCTAGTGACTTGTTCTTTCCGTCTGTGC^ 
GGTTGCTAGCCAAAGCCTGGGACTTTCTGTGCCCATCCAT AATG ATT CAGCTACG^ 
CCCTGTGTGAAG6CCTCTTCTGATCTAAAGAGCCTG 
GAGAGCCACCTTATGAGCGACACGGAGCTGTCACCTCCGAATGCCTGA 
7 5 GTTGCCACTGGGAGAGTTTGGTGGCCTCCATCCT 
C C CTCACCGG CGCTTTTG AT AGT ATGGCCATGTACCAGG AGTTGG TCCTG CTTCHT CCT^ CTT AC CAT C CG AATGATGG AC CATC
ACATGCCATCrrCCCAGGGATTCCAGGCACCCAGCTGTTCrC 
GAATGGGCAGAGCCCTCAGCAGGATGCCTACAGCCCAAG^ 
' CAGGAGTTTAAATCGTTAGAATGAGCGTCAGTGTAGGT^ 
5 GGTAGAGCCCCACAGTGCAACATAGTAGCAAACAGAACAGAGG^ 

TATCAGAAAATGAGACCCAGAGACAAAATCTCCTCTTGAAAATCAGCGGT 
TAGCTGTGAGTGTCCCACAGCAGCAGTGGGCTTAAAAATAAAAGGGAT^ 
CAGTGGCAGCAGAGAGCCAGCAGGGGGC^GTGTGTGCriTGAGTT 
AACCTGTGGGTCCCTGTCTCATAATACCCCCGACACCCCCATC^ 
10 TTTTTGGGATTCTCTGTGCTAGCGGAAGGAAAACATGTCC 

T CTGT CTGAAGGGCAATG AGAAAGATG AGTG TTT CTG CAGAGGG AAGCATGG AG CATATTTCT ATTTTC CTCAGAAGGAT AAG CCT 

ATGGAGTACTGAAGGCCAAAAGAT^TACCTTGTGTATGTGGAGTCAAGTTC^ 

CAGGTCTCTVGAGGAGTC^CAGCAGGCTCCATATGAAGGGGGCT 

CCAAGTAGGAGGATCTCAGTTCAGATTCCCAGTACCACATAAAGTCCAGGTGCACAAG 
1 5 CTTGGGTGGCCAGCCAGCCGGTCTAAACTAATAAGGAAGCAGAGACAGGAG^ 

T AAAACCCAAACACAAGCAG AGAAACAAAAGAGGAAAT CCCATGAGACTGTG CAGACAC C C C CTCCCT ACAAGAGCG CCAC CTGG C 
GCAAACTTGAGAAACAGCCAGCTGGGCGTCTCAGCX5TT CAGAGGG 
CTGAATTTGCAGCCACCAGGAACCCGCCGGGGCCCCTATACACCCCTCGAA 
GCATTTATTTTGAGTACAGAAAAGATACATTGCTGCTTATTCA^ 

2 0 ACGTGGTGAATAGGCX3GGGATGTTTTTCTGCCCCCACCACTTTCCAGAGCT 

TCCCCGGGGTGTGTTTTAAAGGCTGGACAGTACCACAGACAGACATCTTTGCCTA^ 

GG CTTGTAAACAAGG CTCC CTCC CC CAGGCCTGGGACTCT AGGG TG CTG C C CG CAG C CTG CAGCCG CTG CC G CCT AG ATG CT ACAG 
AACTGGTGGTTGGGCCCTTTCC^CTCTGCATCAGAATAAATC 
GAAAGGGATTGATGAATTTTCTTAGCCCTTCATTC^AGGAGTTGAA 
25 GATGACCGTCAGAAGGTAGGAGTCCGGXjAGGCAGGAGGTCTCACrCCCTGCCAGCGGCAGTC 
CTTAGCTTTGGCAAGCCGGGTCTITCAAAGATGGCCTCGTCACCTTATATAGTCTT 
CTAATATCTGAGCCTATCTCCAACTGGCTGCCTCCACACACCTGCTrCCGTGA 

GACATGAAGTr r r iT TT GT ' lTX 'r r T l " ! IGL T lT'l Tr i TT'l 11 1 TTT 1 1 1 TTTTTGTGGGGTACATGTGTGTGTGTTGTCTCTGTGT 
ATATCCCAGAGAGGCAAGGCCAGACTTCATCAGCCCAGTCCCCAGCTGGGAGTTACC^ 

GTCTAGGAAGCATTATGTGTC 

ACACTAAGGACATCGAGACAGGCTAAATGGGCTGTGAGGACCTCC^ 
GGAAGTGGCAGGAOSGTGACATTCTGGCTCTTCAGCCCTGGAGTTGAGAGATAG^ 

3 5 GTCTCAGCTCCCAAT AGGAT AAGCCCTGGGGTAGGAGTTATGG CAGGAAG CTGACCAAGCAGTCTCTGCAGGAGAATCAGATTGTG 

GCCATAGCCTAGAGGAGGCTC^GGAGAGGATAGAAAArTGGGTTTCCTATCTCACCTCTGCCCAGCAC 
CACCAGGCACCAGGCTGGGGAGGAAGAGATACCATGTAGGGAACACCTCATCCAAAG 
GTGGGACATGTGCATATGCAGGTCTCTACATATGTGTGGGTTGTGTCA 
TCATATATATGCATGTGCATGGGTTGTGTGTGCACCTCTATGCCTGGAGGAGAG^ 

4 0 AGCTTGGCCACTCACACAATCCTTTCAGGGCTGGGCATCGGAGCCTGGGTTGGGA^ 

TTCTGGTCCTGGCAATGGCAGCAGGTCAGATGGAGAGCCTGTCCCACACAC^ 
TGTGGAGTCTGCTCTGACCCTGGGTCCTCTGCTCTTGCTGACG^ 

AGCCTGGGACAGGAGTCAGTCATGGAGTAGGGCC^TGGTGCATCCGTGGGGACATACAGCTGAGT 
CTCATAGCTGTGAGAGAAGAGTTTTCACTCAGGGGTTGTAGATTCT 
4 5 CAAGGAGCTGCCCTCATTGTAGAATACAGCCAACTTCATATATGCTGGCT 

GACACCCTTAGCTGGGTCAGACTTGACCCTCACATCCCATGTGAACCAAAGCAAGTCCT 

TTAGATTCCCTtKSAGCAAGAACAGCCCTGTCTAGGGAGGAGTCCCAGCCTGGGCACTGTGAATGGGCTCGGTA 

CATCTCCACTGCTGTTTTCTAGATTTAGGAATGAGTCTCTAGCCCTGAAACCAT 
MOUSE SEQUENCE - mRNA

ACTATGCTGCTGCGGTCGCTGCGTAGTTGGGCTGCCCGGTCCCC^ 

CGATTCX^G^GCCC^CCGCTGTGGGCACCTCGCCGCGCCTGGCCTCCGGATAAAGATAGAGAAAATGATAAGGAGAAAAAAGCAG 
TGGTTTGTATTGAGGGGAATATTGCAAGTGGGAAGAC^ 

GAGCCTGTGCTCAAGTGGAGAAATGTCCATGGCCATAACCCTCTGAGCCTCATGTACCA^ 
GCAGACGTACGTCCAGCTCACCATGCTGGACCAGCACACGCGCCCTCAGATGTCACCT 

GCGCAAGATACATTTTTGTAGAAAACCTGTATAGAGGTGGGAAGATG CCCGAGGTTGACTACGCG ATT CTGTCTGAGTGGTTTGAC 
TGGATCGTCAGGAAGATTGATGTCTCTGTTGATCTGATAGTT^ 

GAGGTGCCGGGAAGAGGAGAAAGTCATTCCGATGGAATACCTCCATGCTATTCACCGCCTCTACGAGGA 

GCCTCTTCCCAGCTGCAGCCCCTGTTCTGGTGATTGAGGCTGACCACAACTTC^AGAAAATGTTA^ 

GCCCGGATATTAACTCCAGAGAACTGGAAGCATGGACCCTAGGACTGAGTGATCTACAAGACAGCGCCAC^AA 

GCCAGGTTAGCTCCTAGGAGCATTTGGAAGAATCTACTCTTAGGAGGGTCT 

TTCACTGCAGTACAAATGGCTTTTGTCTGTCAGATGCTTC 

GAAGCCCTGTGTTAAAATCACTGACTGTATCCCTGGGGTTGTTCTGGATCCTCTGCATCAGGTCACCTC^ 
TCTTCACCCCAGGCAAGGATTTGAAGCCCAGAGAGGTGTGGGGACTCGTCAAGCCAGACA 
GTACCTGGTTTGGGCTGCCTCCTCCTAAGCTGTTCCCCACCGCCCTGCTGCTCTGAGCACCAGC 
GACAGTTTGTGAATTCCATCATCTTTCACTACCGGGCCAG 

TGGCATGAGCCCAG^GTGTCCTGGTATGAGGGTACTGTGACCTCTATCCATTCCTT 

AAAAAAGGGGGTTATGGGACCTGGACACTTTACCCTTrecrc 

TCTGAGCAGATGTCATGACCCACGAGCATGAAGA^ 

GCACTGAAGGTGCAGCTTGTTACTTGAATCAGTCATGACCAAGAGAGGCCGGCCATTCCT 
GTTTGCCAACAAAGACAATTTAAGCCTTGCTGCrcra 
ATACAGGOGAATGATGACCGTATACCCTGGAGL^ 
GCTGCCTGGGAGCTAGAAAGAGGGCCAAGAGTCCAGCTCTTGGTCC^ 
AGACTAGCATCCAGGGAGATTCTCCATCAGCAC

TACTCATGGGGGCATCTTTAGAATTTGCAAACGAACAGTCTAGTACT^ 



MOUSE SEQUENCE - CODING

ATGCrrcCTGCGGTCGCTGCOTAGTTGGGCTGCCCGGTCCCCGCGTAGCGT^ 

TTCGGGCGCCGGGCCGCTGTGGGC^CCTCGCCGCGCCTGGCCTCCGGATAA 

TTTGTATTGAGGGCAATATTGC^AGTGGGAAGACGACATGCCTGG^ 

CCTGTGCTCAAGTGGAGAAATGTCCATGGCCATAACCCTCTG 
GACGTACGTGCAGCTCAC(^TGCTGGACCAGCACACGCGCCCTCAGATGTCACCTC

CAAGATACATTTTTGTAGAAAACCTGTATAGAGGT^ 

ATCGTCAGGAAC!ATTGATGTCTCTGTTGATCTGATAGTT^ 

GTGCCGGGAAGAGGAGAAAGTCATTCCGATGGAATACCTCCATGCTATTCACCG 

TCTTCCCAGCTGC^GCCCCTGTTCrGGTGATTGAGGCTGACCACAA 
CGGATATTAACTCCAGAGAACTGGAAGCATGGACCCTAG

HUMAN SEQUENCE - GENOMIC 

CAAACCTCACT CTTG C CCCAG CT CTTGTTCACAACTCAAT ACCCAACCATT CAG AT ACAC CTG CAGGTC CTGGAGGGTCATGCT CT 
TTCTCCACCTAG ACCCTCACATATG CTGCTGCTTTTCTAGCAGJUVAAGCACTTTCCCTTC CCTGTATCAGTCATCTGCTAACT ACG 
TTTCATCCTTCAGCCCTCAGTTTGGGCACTTTTTCTTCACAGCOT

TGTTTTGTCTTAACCAAAGAACCAATTACAGAATCTA^ 

AGATCAGAAAGGATTCTGGGTTTCGTAATGGAGCAAAA^ 

AAAGTTTGATCCTTAGAGATTTGGGGACTATTCCTTTCACTGTGCT 

CTGGCTCCCAGTTTATTACTGGGACATGTAAGAAACTCCAAGGAATTATGTTTGGJ^

ATTAAACTCTAGAGAAGCAGAGTGTGAAAAGCAACTATACAAGACCT 
TCTTTAGAAAAATTTAGTTTAACCCTTTCITCCCACTAAATGAGGGC 
CTCCAACGAAGTCCAGCTCTGAAGTGTACATCAAGTAGTTTTGAGAGGAGAG^ 
TTTAGGACTTTTAAAAAGCTTATGATTTGCCTGAAATGCTTGCCACTG^ 

TG CTT ATGGTTT AAG AAT AG CTCTAAAGTCITTl'CTTG TT AAT AGG AATT ATCAGCTTGG ATATTAG CACTGTGGCATC CAAG AGG

ATGAG ACCACTG CAG C CAAGGAAGACATGG CAGCAATCAGTCCAGC CAAAG AGGAG AAGGG CAG CAGGGGGATTCAGGAGGGAGAC 

CTACAGAGACAGGATGCTGGGAATGAGACTTGCAAGAGGACAGAGGAGAAGTCACTCAGTT^ 

GGCTGAGGGGCTGCATACATCTTAACGAATAACnTCAA^ 

AGTTATTCCTAATTATATGAGTATGGCAGGTTACAGT 

CAGATATTTTCAAGTGAGATAACATTGTACTTTAGCCTCCCCAATTAG

AG AT ACTTGTGT ATCTGTCTC CAT CT CT AT CCCAATTTTGAGG C CAT ATT AAC ATG CAT CCAAAT AGGT AATAAAGATT AAGAATG 
CACTACAATTTTGTTAACATGGTCCTAAATTTAAATTTCTTTAGAAGATCTATCCAAGTGA^ 
CTACTGATAGATGGTGCAGAGAACTTATGCTCTGTGGCTGCCCTGCATCCTTGAGACCCTCT 
GCAATGGGTAGAGCTCCTCCTTCCTAAGCCTAACAGAGATTACCTTAAAGGTTTAAGAGAAGCAA 

TCTCACTATGGTAGGAAAAAGTAAAATTTAAGGATGAAACTGAACACTI^

CAAGT CTG AG T ACAT ATAAAAGTATG AAAAAT AAGAT AACGGTG ACTT CAAAT C CAG TG AT AACAAT AT ATGGTTCAGGGG CTTGT 
GCGATGATAAAAAAGGTCATAGATGTCACAGTTAGTGCCTAGAAGAAAAAAACCCACGATTTA 
TTAAACTGTAAAAATGACAATTTCTAAAATAATAGGCTGGGCACAGTGGGTCATGCCTATAATCCCA 
TAGGAGGATCGCTTGAAGCTAGGAGTTCAAGACCAGCCTGGGCAACATAAGGAGACCTCATCrC 

TACTGGGGTTTCTGCCATATACACACACATATAAAA

CTGGGTATTCCTCTCTGTTTTAAGTCCAATTAATTATTCCTCTTGTGCTTCT 

TTTTTCCACCAAGTACCTCCGTTCCCCCnTGTACTTTCAATCTG 

GTTGTGTTCTTTTAAAATACACACACACATACTCCTTCCTTTTATCCT 

TT ATG CITCTTCAAAAAGTCTGT ATACTGGCTGT CT AT AATTT CTT AC CAT C CATTT AT CACT C CTTAATT CTT AACAATCTGGAG 

TCTACrGTCACJ^TTTAAACTTAGAATGCTGCCATGAAAAATACTGAGGAGCAAAGT^

GCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCAGGCAGATCGCCTGAGG 
GGTGAAACCTTGTCTCT ACT AAAAAT ACAAAAATTATCCAG<5TGTGGT AG CAGGTGCCTGT 
GCAGGAGAATCGCTTGAACCCAGGATGCGGGGGTTGCAGTGAGCTGAGATGCCACT 

CTG T CTCAAAAACAAAAACAAAAAAAATTGTC^GAGAAAAACAATT CT G ATGTTT AAT AGC TTT ATT ACCAAAGT AC CATTTTii A 

CACACCCCACCT AAACAATGCAATCAGAAT CT ATG CTATTTT AAGTGG AAAAAAAAACAT AAAAAACTCTG AAATTT CTT CATTT C

CAAATGAAGATAGAGAACATC^CAATGTTAAGTGAAAAAAAT^ 
AAAGCTCTTTCCAAAGCTCTGTGTTTTTGGTAAATGGTT^ 
GGTAACATATACATACTCATTTTAAGTCCATATAGGTTACTTATTCATAAT^ 
CTTTTAAACITCTAAACTGATCAGTTTGATTACAAATAACTAGT^ 

AG CJUVCACATAATCTCAAACTACTCCAATTACTCTATTGGACCAAGGTCTTCAG CCCACAGATT ATGAAATGAAGGGCCACTTTTT

CAGTCCCTATCCTCTGTAACCTTTTTCATCATGGTGATTTGCACTGGTGATTAC 
CAAACTGTGTTCTCCTGTGATTTATATTTACAGTTCTTACATCTTTCCTAAGCT 

TCTCACT AAT AAT ATTT CAGACATTAC CACT CTATGG AT ATTGCTTTTG CT AACAT CAG TG ATTT CAGAAAT ACT AAATTTAATG A 
ATG CTTTTTTCTTTTTTT CCAGTACTTTG^^ 

TTATTGCCATTGGCTGTCCrrCCCCTACTAGTGlTTTCTTTTGGCTCCCGTGTCAGTTCT

CAGOTCCATTCTCCACATTTCACACCAAGTAG^ 

AAACTGAAATCCCGCCCCTTCCTCCTGGATTCACAACCTCAACTGGTG^ 
AGATC^CTGTTCCTACCCATTCCCCAACTGATCAATAATCGGGTCCTGCTAACTTAATGC 

TTCATCCTT ACAACT ACTGCCTTTCATCX3 TCTTTT C CATGAACT ATCATG ATGAAATTT CT AACATG CAAATTTGTTT ACG CATTT

CTTCATCTATTTAAAATTCTTCAGTGGCTCTCCAC<3GTCTT^ 

CTCCTGTCT ACTT CTTG AGAAAAGACTTG CTTCACAATTTATCC^ CAAACTCCTGATC CTTTTTTCACATTTG CTTT AT 

CAATAGGCTGTCTCATCTCAGTGGATGGTGACTCCATCCTTCCAGTTGCTTAG^^ 
TTTTTCCCCACATCTGGCTCATCACCVL n ,TCTT 
TGCITTGCTACCACCAGCCACCATCATCATTrCTTGCCCAGATTACT
ATCCCCTTCTGTCCTTCCCTACCAATCTACTGTTAACATAG CAG CT AGT ATA1T IT rUTT ItSTTTTTGAGACGGAGGCTCACTTTG

TCACCCAGGCTGGAGTGCAGTGGGGTGATTTTGGCTTACCTCAACCT 

CCGAGTAGCTGAGATTACAGGCATACGCCACCATGCCTGGCTAArrTTTT 

GTCAGGCTGCTGGAACTCCTGACOTCAAATGATCTC 

TGGCCTAGTATACTTTTAAAATGTAAGTTAGATCATATACTTTACTTAA 



CTCTTCTTCCAGGTATTCTCATAGCrAACTCCCTCATTTCACTCAAATCTTTGATC^ 

ATTCATTACGCAACTTTTCTCTGCAACTCCrrCCCCGCCTTAC^ 

CC^CCTTCTAGAATACTAAAATTTATTGTGTTTGCTTCCCC^ 
TTGTTrTTTTTCACTAGTATTTTTCAAGAGCCTGGAATAGTGCCT

GAGGTAACACTCAACrrCTTATATTTCTCTGCATCAATTG 

AACCCTAGATTGCTCTAGGAAGAGCTTTTTTCTGACACCTCTCCTT 

ACATATCTTAACTATCTGCTCATGTGTCTGTCTCATAATTAAACTGAAGAACTCCATGA^ 

TTGCTGTTCCAGCTCCTAGCCCAGAGTTTGGACCATATTAAGTATTCAAATGGT^ 
CATTCTCACAGGACCATTCTCACATGGTTCCCCCTTCAGTGTTCA

GTCACTTTTGCCCTGAGTATTTACrCCCAATCAAAAGTTATT^ 

TCTCAACTT ACATGAT ATGAAAG CT ACTT CTCACAAT CACAGACriTTGGTGTG AGAAGGATCTT AT ATG CACi T tGTTCAACC CA 
CTCACCTTAAGTTTGAGTCTCCTTTATAACATCCT^C 
CCCAAGTGGCCATTTAGTTTTGGGAGCTCTCCCAATATAAGATTCCT 
CTATCTCCATTTTGCCTCTCCCATGTCAACTCTTTAAAAC^
TCCTCCAATGATGAGTTTCrAGAATAGATTCCIT 

GATCAACTTCCTCCAAACCTGTTCAGAAGTGACTGCAGTGCTCCCGGTAGG 
TCCTGGTTCTGGGCTGTATCAGCCTTGAACTCAGTCACATCAAACGTCT 

TGAGG CTT1 1 r i' n ' HTl 'C T l l 'CAAATCTTGTGGGCTTCTGAGAGGTATAACTGCAGTTGTCTCXSCCT 

TTTGTGGGTCCAAATGTAGACCATTTGGGTAGATCTGATTTTT

CAGGACCTCrCTGTCGCCTAGGCTTGTGTGCAG 

CCGTCCCGGCCTCCCAAAGTGCTGGGACCACAGGTGTGCGCCACGGCGCCCCGCCCTCT 

AAACCCTCCTAAGGGGAGGACAGTGCCGTATCTCTCTCCTTCITCCCCAAGG^ 

GAGGTTGAAAAGCAAACAGGCGTTCAGCTTTGCACTTCACACCC^ 

CGCGGCCCGGCCTCACCAGCCGCAGCATCTTCACGTGGCCTTTCACACTGAA

TTATCCATCGCGTCTGCAGACCCAGCAGCAGCACTTTCCCTCAACT 

GCAACCGCCACCGCGGCGGAGCACCGCCCTCCCCTACTTCTCGCCCAGCTCGGCTTCCCGCT 
CGCGTTCGCTCTCGCG CATGCG CCCCAACGGGATCCCGCGGCCGTCCGCT CTGGCTTX3TGCTGT 
TTAGCGGACCTAGAGCTO^CACATATTAGCGGACCTAGAGGTGGGCACGCAGGCCAG 

TGGCGTTAATGACGGCAGCACTCCAGCTCTCCTGAGGAGTCnTCCTGTCGGGAAATCCT

GGGGGCTGTAGAAATGGCGGCCCCATCTCCCAACAACTTGGGCATTGTG^ 

CCGTCTAGAGCAGCCACCATAACTTCTGAGCGTTTATTGCTAGCTGATATATATCAGAAAAATACAAArrCC^ 

TGGTCTGCTTCTCTCCCrGCAGGGCCCAGGTTCTGGOlC^TAGTTGGTGCAGAAAGTG 

AGGGCATCACACTCGGGACTTGTTCTGCATATTTTTACTTTTGCCTCCC^ 

TTCAGATACTTAATGTTTTTTCTCAAGTGCTGCCATGAAGCCAGATCTCCAC

ATATGTCG CTCCTTTTATGTGATTr ACATTCTATCTTGGATAATTTG^ CGTAGTTCATTCAGATCrGTTTGGATC CTGC 

CCATCTCAGCTTCAGTCCATTTCATTCTTTTAAATCTGATCGACAG 

GCCTTAATCCTGAAGTTTATTTACGGAGAGCACACTTGCTAGGTGTGTGGCAGATA 

T AGAGGCAAATGACTTCCTTCTCCCTGCCTAGTGGTGACTGCCAG CATCACG CCCTCCCGGGAG AGGTGAG AAACCCCTCCACGCA 

AGCACTGGAACCTTCACAGTCAAGAGTGGCAACAGCTCCGGTTACTGGACT^

ATCTGGGCTGAATTTTTGCTGAGTATGATGGAAT^ 

TGCCTCCTCTGGAGGGATCTAGTACGTTTAGAGTCTAGACGCTGGAACTG 

AGAGAAATGGGCTGGAATTCTAGCTTCACCCCTTAATGAATGCTTCT 

CTCTATCGCCCAGGCTGGATTG CAGTGG CCACGATCTCAGCTCACTG CAACCTC CG CCTCCCAG ACTCAAGCG ATTCTCGTGCCTG 

AGCCTCCTGAGTAGCTGGGATTACAGGCGTGCGCTACCACGCCCGGCTAATTTTTG

TTGGT CAGGCTGGTCTTG AACT CATG ACCT CAAGTGATCTACCT^ TG CTGGG ATT ACAGGC C CGAG CC AC 

CGCGCCCAGCCGCTTCTGATCATTAAAAAAAAATTTTTTTTTTGC 

CAGTGCAGTGATCTCGGCTCACTGCAATCTCTGCCTCCCAGGTTCAAGCG^ 

CGGGTGCCCCC CACCACACCCAGCTAATTTTrGCATTTTTAGTAG CG ATGGGGTTTCG CCATGTTGG CCAAGGCTGGTCTCG AACT 

TCTGGCCTCAGGTGATCTGCCTTCCTTGGCCTCCCAAAGTGCTGGGATTACAGGC^

GTTTTAAAAAGACTAGTCAAGTCCAGTAGTCAGAAGGGGGGAAAGAGTAG^ 

AACAAG TT AC CT AATTCTCTGAGG ACAAG CTCGG AG AATGGG AG AG ACAG TT AT CTATTT GCAGGGTTG TTGGG AGG AAT AAGTGA 

CATCATGAGTGTGTGCCAGGTGTCTGATTACAGAAGGTGTTCAATTAATCTG 

TTTGCCATCCATCCTCCGAGTGTTGCCAAGTTATGGGTGCGTTCTGCCIAGCGTCC^ 

CGAACCTCTCCCTTCGAGTATTTCTCCTCTTGCTGAGATGAAATGCGACCGGGTCT

GCGCCCAACGGCTGGACTAGCAGTCGTCCGCGCCGACTCGCACAAGAAG^jAA 
TGCTGTGGCCGCTGCGGGGCTGGGCCGCCCGGGCGCTGCGCTGCTT^ 

CrcCGGAGGGTGCAGCGCCGGGCCTGGCCTCCCGGTAACGCGCGTCTTGGTCCCGCCTCCCAGGAGCCCCT 
CCCGGCCCCTCGGCTTC03GAACCCGCCCGAGCCCGAAGCGCCTCTTCCGAGGCGCGGGAT^ 
GGGGCGGCCATCGGCCGTTACCCGGGCGCGAGACCGCAGCGAGCGCTGGGGGCGCTCGGCT^

GGGGGCGATGGGGACTGGGAGCTGCCTTAAGCaSATTCTGCTCACACTCTGGT 
GCTCTCCTGCCCTCCCTGGGAGTGGGGACCCACACAGAAACGGGTCACTGTT^ 



ACCACGTCCGCCGCCCACTCTTCCCCGTGCACCGCGAGTGGAGGCTGTTCTGCCTTTCCAC^

TCTAAATAAATAACGTGGAAAATGCAAAGTATAATGTACTAAGTGAAAGAACT 

CAGCCCAGG<5TGGCACATCGGCGAGACGGTGGTGCGCGCCTCCAATCTCGGAGGGC 

GAGTGAGCATAAACATTAGTGTTTTTGTGGTCATGTAGCAAATTAGGGCAGAGGA^ 

XTCCTTTCTl'TTGTAGATAAAGAACAGGAAAAAG 
GGTTTTTTGAAGAGGAGAG AGAAAAAAGCAGAGTAAAAAG CAAAAAT ACATTCCATAGTAAAAAGCAAAaaT ACAAGAGG ATCT CC
AGGGAGAAGGAAGTCTGTCACCCTCCTAGAAGGCCACCTCTGTTACCAATTTCTTGTATCT
GCACACAAGTATTTGTATCCTTCAGGTAGATTTGGGCTGTTAAAGCTTAAGGGAT 
IX5TTGTGGGGAAGGTCAGTTTCATCTTGGGTTGGGCAGCTCAGATGTGAGGG 
TTGAGACTGGTGGGTGGGCAGGGGATCAGAGAGAAACCATTAGGGTCCACAGATC 
CTGG CT CTGGGCTGGGGCTCTG ACT AGTGTGAGG CTGAAGGTCAAAG AAGGGT CAGGTTGGTGT CCATGGGG AT ATTGGGTG CTTT

AGCTGG CTGGGGT AGGGAGTG AT ATTTTGGT CACAAAC CAAGGCTCAATTC CTTT AAAAGAACIX3 AAAGTCCCTAG CTCCAGTGGCT 
GGGAAGGGGGTGGCTTGGCAAGTGGTGGCAGGAGTACCGTGGCTGGGATCT^ 
ACTCAGGGTGGTGGAGAGCTCTTGGGAAGTAGTGAAACC^CATAG 
TTAGGAGAAGGATATGCCAGCGGGTGCCATAGAAGGGGAGGGAAATGCAAAGT^ 
TTTATACATTTAAGAGACGGCGTCTTGCTGTGTTGCCCAGGCT^

CCTGAGTAGCTGTGATGACAAGTGTGAGCCACTGTGCCTGGCTATTTATTAGT^ 

T CTTTGTGCTTTTTGGGTG TGGTTTGATG TGCCCCCAGACACACAATAG CT AGTTCAT AG AAAGTGGT ATGGAG CCTG CATTCAGG 

CTCAGGACTGTTTGTAAACATTCAAGTCCTGTGAGCTCCTGTG 

TGACATTTGGGAGTCTGCGGTCTGTCTGATGGGGGAG 
GGAGAATGGTGGAACTAGAACAGCAGTGAGTATGGTGGCTGTGAGTAGGGGGGCTGATGACTGAGA^

TG CCTG CTGGGAG ATGGAG CAGAGTTTGAGG CAGGAGTGTTGG AACAGG T ACAAGTCACAGAAGAATGAG AACCTGCAGGC CTGGG 

CTTGTGGCCIAGTCCCTAGAGAGGTGGCTGGGGAGGGCTGGGGGATA^ 

ACTGAAGGGCCTCACCCCCACCCAGCACGATCCGGGTTGCCTCT 

GTGAGACTGGAAGGGGAGAGACCTTAACTAAACTAGAGATGGCACAGAGCCCG 
GCTCGTTGGTTGACTGGTXjTTGGTCGGTGAGGAGAGATCT

CACAGTTT ACAATATT AGAT AATTGAGT AT CACIAACAAAGTTTTTT AAT AAAATAACTGGACAG AACT^ 

TACCTAATGGCTCATCCCTATAATCCCAGCACTT^ 

GGGCAACATGGCAAAATGCTGTTTCTACAAAAAATACAAAAAAT^ 

GGAGGCTGAGGTGGGAGGGTTGCTTGAGCCTGGGAGGTCAAGGCTGCAGTGA^ 

ACAGAACGAGACCCTGTCTCAAAAAAAAAAAAAAAATCAAATAGTAGGGAG

ATGTAAAGCCATTAAAAACCATGTTTTTGAAAAATATTTAGTGACCTGGGAAGGC^ 
AACAAAACCAACAAAAGTATATGAAGTATGATCCCAAATGAAA 

AACATTAATGCCGAAATTCTCAGTCTAGTTAAGAGATAGATGTGGGGTTAACAAGG^^ 
GGGTAAAAGTGTCTTCCC!TTAGTTATTTGAATGTTAAATGAAATTCAGCT 

TACCAO^AAATTCACCCATTTAAATGTGCAATTCAATGATTTTTAGTA

TACAATCrTAATTTTATTTTATTTTATTTTTATTTTTTA 
CACAATCACAACTTACTGCAGCCTIGACrTCCTGGTCr 
CATACCACCATTCCTGGCTAATTTrTGTATTTGTTGTAGAGA 
AC!ATTTCCATTACTCCAAAAAGAAACCTTGTGCCCATTT^ 

TTAGXSGATTTGTGTTTTCTGGACGTTCCATATAAAATG^

ATTTTrrm^rrCl'l 'l TTTGAGATGGAGTTrCTCTCTGTT 
CCGCCGCCAGGX3TTCAAGCCATTTTCCTX3CCTCAGCCTCCCAAGTAGCTG 
TTGTATTTTTAGTAGAGTCGGGGTTTCACCACGTTA 
TCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCTCCTGGTTGCITA 

ATTAGTATGTCATTTTTTCCTATAGCTGAATAGTATTCAT^

CGTrTGGGCAAGTTTCTTTTGACTGCAGATTTCTCAGG^ 

ACCCAAGCCAGTTTGCCTCTGGAACCCTGGGTTCTAATTAAACATCCAAATGA 

TTTCTTCrTTTAGGGATTGTGTAATAAGGTTTGATCAGAGAA^ 

CAGACTGGAGTGCAGTGGTGTGATtrrCGGCTCACCACAACCT

TCCCCAGTAGCTGGGATTACAGGCATCCGCCACCACGCCTGGCTAATTTT^ 

CCAGGCTGGTCTTGATCTCCTGACCTCAAATGATCCCCTGCCTCGGCCTCCCAAA^ 

CTGGCCACTGAGAATGTATTTCTTAGCACAAGAATC 

T AG ATGG CT CATCACAGTTT CAG TT ATAAT CATGGAAAATTGGACT CAAGGG CT CAT AG CAGAGGAAAATGG AGATAAATT G TAG C 

GTAATCAGAGAGTGTTGGGCCAGGCCX^CCATGTGCTCCCACAG

CCACACACTCATGAGCTCTTTGGCCGTTCCAACGAGGCCCTGAGTACA 

AAACGTGCTCAAAGATGAGGACTGTATAGAAAATAATATTTGGCATGAAATATG CTCATATGT CATGGGGTTTATCTCTTAGTATT 

GGGATTATGGGTGATAGTTTCTCCTTTTTGATTTTCTGTATTTTCI^ 

ACTTTAAAAAGAAATGTAAAGACrrCCCTTTGCTTGCTTCCTGTTCCCCCITCCATT^ 

TTGCCTGCTGTAAACCGGAGGGCIAGATTGGCTGAGCTCATTCAGGGCCTG^

AGGAACAAGTTTCTGGGAGAAGGGCTTATCTGAACAACCTGTTAAAATGCTTC^ 
TGTCTAGGTTGAGCTCTTTTTATATATCTAGGAAGAGGAAAAGTTTTCT^ 
TAATGGGGTCTTTTCATGCCCTTTTGTTCATTCCCTTGGGTTCA 
CAGGATGGGAGAGTGGCCTTGTGTCTCCnGGATACCCGTT^ 

GTTGCTGGGGAGACTTTAGTGGGGTCGGGTGAGACGGCCCACTGTGACCTG^

AGTCGTGAGAAGGGGCCTGGATGTGAATTCTAATTCTGATTCTTCCCTAGCCAAGTCrGGCACTTGGGTAAG 

CAGGTTTCTCTGGGTGCTTCCZAGCCrGAGTG(ZAAAGGAACAGGCCT 

AGGATGTCTAATGTGTTCTCAGGGGACAGTTTGTCTTATGTATTGGAGCTGCCCGTTCC 

ACTGGGCCTATGTCATCAGCAGTG CCTGGCTGAG CAG AGCTTGGTCAGAATC ACTGGGCT ACAGGTTCAAGGGCTCCAGCCTAGAG 

TCTGATTTAGGGTCTC^ATGCTACCTGGTGCCACCTCATGTTO

GACAGGGAGAAGCTCTGCTCG&GCCTCGTGGTTCCCATCCTTGC 
GCTTGGCCGGGTGCAGTGGCTCACGCCTGTAATCCCAGCACTT^ 
GACCAGCCTGGCCTACATGGTGAAACCATGTCTCTACTAAAAATACAAAAAATTAG 
AGCTACTCGGGAGGCTGAGGCAGGAGAATGGCTTGAACCCAGGAGGTGGAGGTO 

GCCTGGGCAACAAGAGTGAAACTCTGTCTCAAAGAAAAAAAAAGTTACTGA

TGGTCTGGGCTGTGGCATCCCTGTAGAGATTTTTAA 
GAATACCTTCTCTCCCTAAGATGAGTTACTTCrAGGATGCT 
CTCCCACTAGTGCCATGACCCCCTTTCCCTCCCACGTGCAGTCATTGTGAGCAGAG 
CTTCTACAAGCCTCATTTTTGTTTTCTG1XSAAAAATGTTAG 
GTGTTACCTGTCTGAGGGCTTGGCCTGTGACAAGCT
CCMTCAGTGACCAAGTGCTTTTTTTTTCrc

ATTGATTAAGCTCTCGTTAAGGAGGCAGGACACTTAGGCCTAAAAT 
(X3CTGGGCAGACAAGCTGTGTATGGTTTAG 

GAGGATCGGCTTATCTGTTATTACTAACMTGGGACCTTGCCAGTTTCCCGAGG 
CTGAACCAATTATGCTGGTTACATGAGCCCTTTCCGTA

ATGCATTCGTCTTTTGCCAGATCTGTGTCGAGGGCAATATTGCAAGTGGGAAGACG^ 
GACGTCGAGGTACAGCCTCTATGCTAGGTTTGAAGGAAGCGGAGAAAGCTTGATGGGA 
GTCCAATAGGACTATAATTTGAGCCACAAGTGCA^^ 
GGCCGGTGCAGTGGCTCAAGCCTGTAATCCTAGCACTTTGGGAGGCGGA 
GCCTGGGCCACATGGCGAGACCCCATCTCTACAAAAATATGC

TGAGGTGGGAGGATTGCTTAAGCCCAGGAGGTGGAGGTTGCAGTGAGCT^ 
GTGAAACCCTGTCTCAAAAAAAGAAAAAAACCCACAAA 

TGT CAACATTT AAT CAAT AT AAAAATCATCAATGAGCT ATTTTGCATTCTAT CTTGTT AT ACT AAGT CTTTGGAAT ATGAT ACTGG 

TTG AGCATC C CTT ATCTGAAATG CTTG AGACCAGAAGTGTTTT AGATTTCAG AATTTTC CAT AAATG T 
AATGAGATATCTTGAGGATGGGACCCAAGCCAAAACACGAAA

CAATATTTTAAATAATTTTGTGAATGAAACAAGTTTGTC 

TCCAGAAAGTTXTGGAGTTTGGAGCATTTAGGATTTTGGAT^ 

GTAACrTATAGCATATCTTAATTTGGACTACTCATATTTCAAGGTCT^ 

TGCAAGTGTTGACTGCCTCAAAGTCTGGTTAGGAAAAGAGTCTACATGGGTAAG 
GTGTGACATCTGAGATAGCAATGTCTGAGCAGTGGGTATTACTATGCATAGGCAATTA^

GTCACTAAAACTAACCTCAGAGATGTTTAAGAATCATCTCTACACTAAGAC^ 

ATAATCTTCTCTATTGATGGCTCTCi riTC 

TTCCTTTCCTGTTTTTACTTCTGCTAGAAATATATGC^ 

TGAAGGGGGAGGATCACCTGAGCCCAGTAGTTTAAGAGTAGC 

TTAGCTGGATGTGGTGGCACATGCCTGTAGTCCCAGCTGCT

CTGTGGTGAGCCAGGACTGTGCCGTTGAACTCCAACCTGGGTGACAGAGTGAGGCCCTGTCT 
AGAAAGAAAGAAAAGAAATATATATAGTTTGATAAAAATAATAATAAACATTATATAGCCCTTGTCCTGCTG 
TGTGCAGGAAAGAGTTAACATAGCAGACCTAAGACTGCTGCCCTTTAGAAAT 
GGGAACTTAGATCTTGGGAGGGCCCCCACCATTCCTAACrGATAACAGTGGCTCT 

TTATGCTGACACCTGCTTTCTTCCTGGGAGTCCGGAATTT^

AAACCCTGAACTCCCAGGTTCAGGCCAGCTTGCCTGCTAGACAACATTTCACATGTAT^ 
GTGTGTACTGTGOVGCTCCTCTATGAGAGC^CCCTTGGAAGCTTGTC

TTGCTGATCC TTT T T rC T T T TTCH CCTT IT rr r T T l TTlTTTTTT m 'CTGAGATGGAGTCTTGCTCTGTCACC 
GCAGTGGCATGATCTCGGCrCACTGCAACCTCCGCCTCCTGGGTTCAA 

ACAAGCACCCACCACCATGCCCAGCTAAT TriTl ^GTAT^

CTCCTGACCTCAAGTGATCCACCCACCTCAGCCTCCCAAAGTGCrGGGATTACAGGC^ 
GATCTTGTTTTGTGTCCTTTCGGTGTAATCCATCTTAGCTGTGAGTATGACTGCGTGTTGTGTCCT 
AAACCTGGGGAGGAGGGGGGGCTTGGGGACTCTTGACCCCTGTCCTGAGTGATATCA 
GTCTCACTCAAGACCTTGGACCCGATTAACAACGATAGTTAACATCCAGTGAGTATA 

TTCATCATAAATACTAAATGATCTCACCCTCACAAGAATAGTCCCTCTGAAGTAGG

AACCATGGCCTAATGAGGTTAAGTGGCTTGTCCAGGGTCGCTAGGCCAGTGTATGA 
GATCCCAGCCCAGGCTGTCTGGTTCCAGGGGCCACACCCTTGTTTGATTCTGTGGT 
ACTCTCCCTTCAGATCTGAGCTCCGTGTCCCCTGACCTTCCCAGCGAGGTTATGCTTCC 
ACCTCTCTTTTATACCACTTCTTGGGGTTTGAAAACATGT^ 

TAAGGTTCTGTGAAGGCAGAGGCTGGGCTGGTTTTGCTATTGTTC

ACTCAATAAATGTTTtSTTGGGTGAGAGTGCTATGGATGGAGC^ 

TCTGGTGGG CCTGTG TGGTG CTGTGT ATG CTACTT AGGGG CTAGG AAG AAGACCT AGAAG AAGGAG AGGGGGTTTGAG AGG GG CAG 
GGG AAGGG CAT ATTTG AACTGGATTTTGAAGGGT AT AC AGG AGTTTT C CATGTG ATGAAAG TTGAT GTG AAGTGGAAT GGGGTGGG 
AATGCTGGTCAGAAGAAACAACGTAAGCAAAGACACAGAGTATCTTGCTTGAGGGTGGAT 

GGGGTGATTACrr iT Cr iT l'ClTT r rA^

GCAACCTTTGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCrCCCGAG 

CTAATTTTTGTATTTTTAGTAGAGATGGGGT^ 

CTTGGCCTCCCAAAGTGCTGGGATTACJVGGTGTGAGCCACTC 

TG CCATTGGTCTTTTG TCACCT ATGG CCC CTTCTC ACATGTACT CTGAAAG TTC C CAGATGG AT CCTGGCACTCTGAAG ATCTTGG 

CCCCAAGAACACCATTTAAATCrTTTCTAGCACACACCrc

TGTCTCTAGACCACAGATGGAGGCTCTAGAAATGCCCAGCAGAGATCCTGATAT 
TGGCAGTCATCCTAAAAGGTACAAACAGGACCAGiAGGGAAAGAATGCCT 

GG ACT ATCAG CAG CCAGTGCACCCAGTGCAGTGATCAG AGGCAGGAAACGGAGG CAGGTGG CCTGC CAAGGCCTAGCC CTGTCCAG 
TGTCCCTCCTGGCTTCTCCCACCCACCTGCCAGCTTGATGTGGTAGGCAGGCAGG 

CAGAGCCAGCTCTGACCTTGGGGAGGCTACCAG^GCAATAGAGAGGTAGXjATGTACCT

GAGTTATTCTCAGGACATTTCTGGAAGTAATGTGAGGAAGCCGTGTCTAGATAGAGTTGTGACCT 
TGGCCTACCTTGGGCACCCAAATTTGGATAGTCTTTTTCTCTCTTTTTCCCTTCTCT 
TTTACAGGGTTCAGCACAGAGAAACAGTAGCTCTGATGTTTTCTT 
TGTCCGTGGCCACAATCCTCTGGTGAGTGGGTTGTTGGTTGCATGA^ 

TCTGCCTGAGCCCTGCCTGCCCAGGCATTCTCTGCGCTCTTAACTGAGTTGGAGAAGACCAGGGAGG

CATGTTGGTGTTCTCAGGTCAAGGATCTCTGAGGCCTGAGTCGCT 
ATTCCCAGTGGCCGACCTTAGCCACTAGGGGTCTCCATTTCTTGTTGCT 
GAACCCGGGACTTCTCAGGCCTAACCCCCCCAGGACCTATCTCTAAGTAACCAGGATCCTTG^ 
AACTIXSGCCCTGTCTGGTAACAGGGGCCACCTC^ 

GGGCTGTAGTGTGCTGTGTCCCTTCCTTCACCTTCCCCAATTCGAGGTT

GTCCTGCAAGAGAGAGCTTTAGAAGAGGATGTTGGTTCTGAGCTC 

T AGGAAATTG CACTT AT AT AAG TTG CCTGAGGTCACACAG CT ACT AAATGGACTTGGAAG CCTGTG CTG C CCTGTG ATTTTTTTTT 
TTTITTTTTGAGATGGAGTCTCAGTCTGTCCCTAGGCT 
TTCAJ^GCGATTCTCCTGCCTCACTCTCCTGACTAGCTGGG^ 
TAGAGACAGGGTTTTACCATGTTGGCCAGGATGGTCTCCATCTCCT
GGATTACGGGCGTGAGCCZACTGTGCCCGGCCATGCCCTCTGATTTTTAGCAACACAATTG
CTGAGTTCTACTAGGAGCAGGGACTGTATCTACTTTGTTCATTGTTATTTGCTA 
GGCTTGTGCATTCATCCATTCATTCAACCAATATTCAGTGCATGTCAGCCATGTG^ 
TAAATTCAAAGCCCTTGCTCTCCTGCCCCTAATATTCCAGTGTGGGGAGGCAGAAG 
GGC CT AGGAAG AAAACT AAAG CAGGTG AAAGGATGTTTCTT AAAT AAGG CAAAACCTCTCAG TCTTTCAATAAAGAATC CTAAAGA

GACATGAGAAGAATTTCrTCTCTGTG CATT CGGGAGAAGATGGAATACCAGT CATCTT CT1"1XKIATAT AACAAAGAAACCTCTGT A 
ACATT CAT ATTCTCT AG AAG CCAAGGAGGAAC CCAGG CTT AGTT ATT AAGTAAGACAGCTAGAG TTG CAGGTGTT AGGG ATCAT AA 
GTGACGTTTCACGGGACAGAATCACAGCCTCAGAAGTCT^ 

AAAAT ACAAT AGAAG TAG CCATTTTT AT ATAAGTTTC CACT CT AGAC CAAAAGAAAGATCGAT ACTTTTGTGAAAAATCTCAATT^ 
ATCACTrTATCTGAATTGGTTTTCTGATGCAGAAAGGTGAC^

TATGAGATTTTCTCTCAACTGaUSTCAGAGGAGTCrc 

AAATGAAAAGCAAAAATGAGGAAAAGGCCGGGTGTGGTGGCTCA 

CACTTAAAGCCAGGAGTTTCAGACAACCCTGGTCAACATAATG^ 

TAAAAATGAGGAAAGGAGCAGAACAGGTTTCTTAGCAAGTATGCTGGAGCA 
TTGAGTAGCTGGGTTGGCTTCTGTTTTACATGAGATAAC

CTGCAG ATATCTGGATGAAGAGTCTTTAGGCAGAAGGCG CTGCAGAGGCAAAGGCCCTGAGATGGGACCATGCTTGG CATGTTTGA 

GGAATGCTAAGTAAGCTCATGTGGTTGGAAGCCAACCAGGCaVGAGGGAGATOSGTAGGAG^ 

ATAAGGGTTTGGATTTCATTCTGCATGCAATGGGAAGCCACTGGAAGGTTCT 

ACAGACTCACTCTTGTGGCTGGATAGAGAACAGACTACAGAGGGCTCAAAGACC^ 
G TGATTTT ATTTT ATTTTTTGAATTT C (ZAAAGAAAAGT AATTT ATTTTTTGCAGTT AGGGCAGCTGAG AAGTC CAGG GTTGAAGGG

ATGCTAGGTC^GGTCTCTCTCCCTTTTCTCATAAAGCCAC 

CCATTCAAGAGCGCTCreTACTCATGATCC^^ 

GAGTGCAGTXXSCGCGATCTCGGCTCACTGOUIGCTCCGCCT 

GACTACAGGCGCCCACCACCACACCTGGCTAATTTTTTGTATTTTITAGT

CCATCTCCTGACCTCXSTCATCCGCGCACCTCGGCCrCCCAGAGTGCTGGGATTACAGACGT^ 

TCATTTCTTAAAGGCACCACCTGCTACATACTCCCACATGGGGGATTAAG 

CAAAGTATTATTTGTTAAAGATX3ATCTTAAAGM 

AT AGGGTCITGCT AT ATCA.C CCAAG CTGG AGTGCAGTGG CACAATTAT AG CTCACTGT AACCTTT AACTC CTGGG CT 

CTCCTGCCTTGGGCTCCCAAAGTGCTGGGATTATAGGCATGAGCC^CTGCACCTGGCTCT

ATTATTCTCTTATAGCTTriTT rrrr T rri ' TT TTTT ir .TGAGATGGAGTCTCGCTCT 

AGCTACTTGGGACrcCTCAGGCAGGAGAATCGCra 
GTCTGGGTGACAGAGCAAGACTCTGTTTAAAAAAAAAAAAAAGATTTT^ 

CCCTCATACATAGTTGGTGGGAATGTAAATTAGTACAGCTACTATAGAGAACAGTTATGAAGTCAGGCAG

TTGTTTTTTTTGCTCAGGATTGCTTTGGCTATTTGGGGTC1" 
TGCAAAGACTGTCATAGGTATTTTGATAGAGATTACATTGAATCT 
TTCTTCTGATCC^TGAGTGTTGAATATCTCTTrrTTTGTG 
TATAGATCTTTCACTTCTTTGGTTAAATAGATTCCTAGGTATTTT^ 

ACAGGGT CTCACTCTGTCATCCAGGCTGGAGT ACAGTGGTACG ATCT CTCCTGGGTTCAAGCGATT

CTCATGTCTCAGCCTCT AGAGTAGCTAGGACCACAGGTG CGTGCCACCACACCCAG CTAATTTTTTGTATTTTTAATAGAGACAGG 
ATTTCOTCATGTTGGCCAGGCTGGTCTTGAACTCCT^ 
GCATGAGCCACTGTGCCTGGCTGGTATTTTATATTCTTTGTAGGATCG 
TCACTGTTGGTGTATATACATGCTACTGATTTTTATATGTTGATTTGTATCCTC 

AGTTGTTTGGTGGAGTCTTTAAGG TriTlT ^

GGTTTGTT ACCTGGGT AT ATTTCATGATGCTG AGGTTTAGGGT ATGATGGAT CC CATC ACCTA^ 
GGTACTTTTTCAACCTTTGCCCCCTCCCTTCCTCCCCA 
ACCCATTGTACCCACTTATAAATATCCACTTAAAGTGAGAACATGC^ 
AATGGCCTGAGTCTTTAAGCITTTCTAAGTGTAAGATCATGTTGTCCGTGA^ 

ATGCCCTTCATTTCTTTGTCTGGCCTAATTCCTCCG<3TCCCTC^

CCTATAATTGTTTCAGTGTCTGCCTTTGTCCACCCTGCTGTCAGAGCCCTGTGAT^ 
CCTGTCTAGGTTTCTTGAGCTGTCCTTCAGTGCCTTGTGAGACCAGG^ 

TTTC CCTTCTG CAGGGCCTGATGTAC CACX3ATGCCTCTCGCTGGGGTCTTACGCTACAG ACITATGTGCAGCTCAC^ 
AGGCATACTCGTCCTCAGCTAGCTTTCAGATGCTCAGTTTCAGCCT^ 

GGAAGGGAAACTTTACAAGGAAGCCAGGCTAGGCCTTCAGTGCTGTGCAGGCCTCTCT

GGTTCTCTTCTGACCTGCCTCTAGTGCCCCCACCTGCAAACCT 
CTTTATGACCTGACCACTGCTC-GGCTTAATGCCGTTTAACTC^ 
ACCTGGAAACITGTTAGAAATGCACATTTTCAGCT 

GCGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCT 

GGCGTGGTGGTATGTGTGTGCCTATAATCCCAGCCACTTGGGAGGCTGAGGCAGGAGAATGGCTTGAACC

CAGTGAGCTGAGATCATACCACTGCACTCCAGCCIGGGTGACAGAGTGAGACTCT 
AAATTTTCTAGTCACATCCCAGACCTXjCTGAATTAGAAATTCTAGGGATC 
TGACTCGAATGCATGCTTTGTTTXjTTTTCATATTTGAGAAC 
GAGACAAAGAATCTGATTGGTTGAGCTTTGGTCAGGTGTCGAGCCCT 

ACATGGCTCCTTAGGCCTATTCCATCAGTCAGGATGCATCT

CAGCACCTATGTTTCAAAGCTGTAAGCACAACCTATGATGCCAGCTTTAAGGCT 
AGTTAACACK^CA^GCTGCTGGGCTGGACAGTTCGGTCCCAGTCTC 
GTCTrTTAAAATTGAAAGATGTGAGTGGGCCGTTTTTGGTGAATAA 
TCCACAAATCGAAAGATCTGAGGTTACTGCATT^ 

TGTATCTAATGATATTTGAATAGGAAGGGATATAAATGCACTCCTCTATTTAAGGAAATCTACCT

CAGACCAGG AAAG ATG AG CAG AAGGAGACC!AGAGGGATGAATGGGGAG CT AGGAAG TAAGGAAAG AG AGGATGG AAG AATGGAAAG 
GAAGATGGATGGG CAGAGGATGGATGGACAGAGGGACAG C^GGGAGAGGGAGGAAGTGCCTGGGTTATGGCTTTGAATCTGCCTTG 

CCTCrT^CTTCCCAGAGG^CGCTGa^^ 
GCTGTCTTCCTCCCTCTTGGG CATCGTTCTTACCAGTTTCCTTGGGCAG AGCCCCCAGAACGTCACC CCCAACCCGACCCTCTTGG
GGCTGTACCTCATGCATGTGGTCCCTCrGCTTAA^
GACAGCTTTGGATGGTTTCTCCAGGGCTTGACCGTGTTCTAG 

CAGCCTCGGCATTG CTGGGCCATG CATGCAG CTTTGGGGTGTGGGGCAGGTCAGGGTGG AAGGGGG ACACAG CCTGAG CCTGGTGG 
CAGATGTGGCTTCTCCTCCCGGCAGCGGAGTGAGGAGGTCCTCCTGCTTC^ 
GTCCCCAGGCAGGGGGAGTAACAGACATTTCTCJ^
TCCTTTAGGTGTCATCTGTACGGTTGATGGAG^^ 

TGT AGTTTGAATAGTTG ATAATTT ATTAAACCACTAAI' T'l'lT CAAAATG AAATAGAAACTTGTTTATTCGATTGACAGATATGG AG 

TACCCGCTCTTGTATCAGTTATCCATTGCCATAGCAA 

TTCATTGCCCAGTCAGCCAGGGTGGTCACCAGGCAGCTCT^ 
TAGGATAGCCTAAGCTU^GGGTG ACTG<^GGTGGCTCATCT CTrr T CCTCnTGCCTGTCACTCTCCTTCTGGGACCAGCAAACCAG C

CCAGGTGTCCACTTCTCACGGTGACAAAGGGTAGAAGCAACA 

GCTAGCATTCCATTGGCCAAAGCAGATCACATGOTCACATGAGCCCAGG^ 

GGTACAAGGCTAAAGGGAAGGGTGGAGAGTTGGGCGAGTCAGACGCAC^ 

TCCCTCATGAGAGCAGACrCTGTGATGTATACCTAGTTAGGGGCTTCTGGTG^ 
TCCATCCCrGTGCTCAGAGACTGCCCATTTCTAGGGGTCTTGGAAATGGGTTGCA^

TACTGCTGTCCTTCCCTCGCAGAGCTGTTCX3GATC 

TGCCACCCTGTCTCCCAGCCCTCCCCGACCACGCrGTCTAGATAGGGCAAGCTCCTTC 
TTTTTGTTTTTTGAGGCAGGGTCTGGCTCTATCA 

CAGG CTCAAG CCAT CGTCCTG CCT CAGCCT CCCG AG TAG CTGGGACT ACAGGCACACAC CAC CACAC CTGGCT ATTT^ 
TTTTAGTAGAGGCAGGATTTCGTCATATTGCCCAGGTTGGTCTTGAACT
AGTGCTCGGATTATAGGTGTGAGCCA.CTGTGCCTGGTCCCCTTGT^ 
TTAAGGCAATCATATGAAATTGTTTAACCAATTATGACCT 

GTTTATTGCCACCCACCCCCCCGGACCATAAGCTCCTAAGGGCAGGCACCAAGTTTGTTGT 
GAGCAGGCTCCrrCGTTGATACCTGTTAGCTATTGTTGGATGAGGGCACCTGTGG 

CAGGCTCTGGTCCCCCACTGTTTTGCACCTTTCATTTG

CATGCCTTTGAGCCTCTGTC^CCCCCCAAAATGAGAG CCTCTGC CCCAAATCCACCCAGAGCATCAGAGCAATGCTGTGGG CTCAG 
GGGCTCGCAGCCAAGGTGAGTCCTGGCCGTGGCTTTG CCCCCHtSGCCTCAG AC1X1TCT1X3TGGGCTG CAGCTTGCCCTCTGAGCA 
GGTGGCATTCCTGACCTTACGGCCCCAlTri'CACCTGACAACCTGGGTTTCCA 
GGTCCTCAGCTCCTGCTCCTTAGCACCTCAGCCAGAG^ 

ATGTTTGTGTACTG CTCTGATTAG7UVCTGTAACCCGAGGCGGAGGTTGCAATGAGC CGAGATTGCGCCACTGCACTCCAGCCTGGG

CAACAGAGCGAGACTCCGTCTCAAAAAAAGAACTGTAGCCTGAGGAACAGAT 
CAGAGTGGGGCTGTGTCTGTCTTTTAATCACATTCAGCT 

TTGAGATGGGCTTGTGCTCrGTCACCCAGACTGTAGTGCAGTGCCATGATCATGGCTCACTGCGGC 
AGATCCTCCTACCTCAGCCrCCTGAATAGCTGGACCTACAGCACATGTCACCACTTCTG^ 

GACAGGGTCTCCCTGTGTTGCCCAGGCTGGTCTCAAACTCCTGGGCTCAAGCGA

TTACAGGCATGAGCCTGATTAGAGCTCTGCTCGGCCTCGTGG 

AGGGGAGAGGATAGAGTGTGGCATTTGGTGGGCCTGGGTTCT^ 

TCTAAATTTCTCTGAGCCCATCTCTCATCTCTGATTCCTCTTGTCCAT 

TT AGGACCCTTTTGTGGGTCCTTG CCC^CCTTGGTCTTTATTAAATTTGTG AGTTC CCCAATCCT ACC CCCAAACCTGTAGGCCAC 

CCAGCGCTAGGTAGATTTCCCCAGGAGCCATTTGTCTTTGTTTGGCCGGTCACA

CACGCCCATGGTCCTGCATCACCACATTTCTGATGCGTCAGTTCT 
CATGGGCCTTGCCTGGAGCTGTTCTGTGGGTAGTTTAGGAAGAG 
ACTTTATATTTATGTCTCTCCCTGGCCCTCCCAGAATGCATGGCCT 
AACTTACAGTGTCCTTTCTGTTCTTGGGAT 

GTTGGGGCCTTGAGATCCATCACTGGGATCrCTTGAAGCCAAAGGCTCT

GGGAGCACTGTGGCTGCCTGGAGGGCCAGGCTATCTACCATGGTCTC 

ATCGGATTCCTCTGAAGGCAGAGGAAAGCTCCCCACAGCACTGCTTCCTCATCCT 

CCAACAACCCAGAAGTTTCTTGGAGTCTCCTATTTTA^ 

GTTTGTATCTAAAAATAATAATTTACATAACTTATCATTGAGGACAATGGATCTTC 

TTGGAGTTCCCACrCCCTCAGCCACATAC^

CTTCCCAGGGCAAGTGATTCACTTGTGAACTGCACCTGCTTATGTGTTCCCTGGAAACG 

CAGGGCAGGGACCCATGGACAGGAGCTGCATTTACTGTCTGG 

AAGCCAGTGCTCCTTCCTGATCGGGCAGGAACTAACCACAGGA 

ATGAGGAAGTGGCAGGAGGCATCTAGATTTCTCTCATCAAGGCCAAAGGCTTC^GGGCCCACT 

TGAGGTACTAGCCCAAGGGGGAGACTCTGCCTCTTTCCAGACAGG

TT ACAGGCAGATTCCACTTCAGT AGCACTGAGGTGGGGCCTG^ CAGGTG CCCAGGTGGGACCAT ACTCCT 

ACTGGT CGGGGATCACACTTTGAGG AATACGGG CCTCAGAAGAAGGTGX3CAGGGCCGGGTGT AGTGGCTTATACTTGT AATCCCAG 
CACTTTGGGAGGCAGAAGGGGAAGGATCACTTGAAGCCAGGAGTTC^ 
AAAAAATTTAAAAAATTAGCTGGGCCTGGTGGTGCACATCTGTAGTCC 

CCCAGGAGGTCAGGGCTGCAGTGAGCTATGAGCATACCACTGCACT

AAAATAAAAAAAGCTGGTAGCCAGGTGGTCCACAGGGCGCCCTCCATCC 
AAGGTGGGCTTCCTTGGGGCTTCACTGGACCTTTGGGG^ 
TGAAAAGTTCTGCGCAGTGCAGGATTTCCTGTTTCCTGCACCT 
GTTGCKK3GGGCTGTGTTGTGTTATCTTTGTTTTTAAGGTTTCT 

GTAAAAAGTATTTTTAGCCAGAGCCACAGAAGGGATAAAGGGAAGGAAGACT

CCTCGATGGAGCGGTCATCCACAACGAAGTATTACTGAGAAAGTAGATG 
TTTGAGACAGAATCTTGCTCTGTTGCTCAGGCTGGATCACTCAGGCGATCT 
AATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCGCCC^ 
AGGGTTTTACCATGTTGGCOVGGCCGGTCTCGAACTC 

TAGGCGTGAGCCACCATGCCTGGCCAAAGTAGATGTTTTTAGAGGTTTCGGCTCCT^

CCTATGCAAAATGTAGCATAGAACTGACTTTGGCTGGGTCCTTTGCCCATACCT 
AGTTCAGCCAGGTACTGGGAACCACACGGTGGCATGTGGGGCGTCAGCTCCTGCCTGGG 
GCTCTAGGAAAAGGCTTCCCAAAATGCCATGATGGGGAGCCCTTC^ 
GGGTTGATCTTTTTCTCCCTCTCAGAAGCTCTTGCAACCT 
TCAGGTCCCCAGAAAAGCCCACATATTGGAATCCTGCATCTCCTCGAAGT
TGAACAAATGAAGAGCTCGCCAGGGAAAAGACCCACCT

TGCAGTTATGCCACCAGGGATTCGAGAGGGCTGTCTTAATTGCT 

ACTTCTTTTTCT 1 1 1 ' lT ' lT GAGACGGAGTTTTGCrGTTATTGCCCAGGCTGGA^ 

CCACCTCCCGGGTTCAAGCGATTCTCCrcCCTCAGCCTCCCGAGTAGCTGGGATTAGTCA 
TATTTTTAGTATAGATGGGGTTTCTCCATGGTGGTCAGGCTAGTCTCGAACTCCCGACCT

CAAAGTGCTGGGATTACAGGCGTTAGCCATTGCGTCTGGCCAAAGACAGGAACCT 

GCTGAAGGTGGTGAGCAAGATTCATGTAGCTAAAGAGGCCACACAGGTGACT 

AAGATTGCCTGGTCGTCCTTTTCTAAGAACAGAAGTGAGGGC^TGGA 

TGG CTTGTGGTTGGGGACACIXSAGTAAGACCTGGTGTT^ 
AAGTTCTTCCCAGCTCCTGTCTAGACTGAAACCTCCTt5GTGTG

GAGAGACACTGGACTTGATCCGAGTCAGCTGACAGACTGGGGTGGGCGGGT^ 

AGAGCCAGGCTTGGACCCCTGCTCAGACAGAGTTGGCCTTCACTTCTCTC 

AGCCTCACAAGGTCTTGTGAAAACAAAAGGAAATAACGTATACAAGGGAGCTCTTCCATTC 

CTTGGCCTCAGGGAAGGATGCAAGAAATTGTACTGATGTAACAGCAGGATT 
GTCAGGGCAAGGGTGAGACCCCCATGCCCACTGTTGAGTTGTGTGCTTGGATTTGCCA

GTGCATTCTGTAGAAATAAAGGCAGTGGCrTGTCTrGTCGGTGGTGTGGGGAGCCC 

GCAGAGTTCCTAGGAGGAAATACAGTTAGAGCAGCGCCCACTGTGCCCCATGTCCCT 

CT CACT CAG CTCAGACACTGGTCATT CAG T C CTCCTGG ATC CCC CAG CCTCTTCTGGCACATGG TTCT AGTCATGGAAGGG nTTC 
AATAGGTCTGGCCTACTGACCCTGCCCATGGGAGG CCACCCXrACCAAGGACCCCITCCAAGAGCATTGGG AAGGG AGAGG CTGCCC 
CTCAGTCTGCACCAACCCTGTGGGGGAAGCCTTTTGTT^

CGATTCCGAGGGGAGCTGTGGCAGGGGCAGAGCCACCXSGAGTCTGTCAGTCCCAG^ 
TCTCCATGTGCAGACCCCCGTGTACTGTGACTGTTTTGCAGTGGTGGTTGGTT^ 
GCTGTGGTTCAGTTCCATGATCATATCTGTGGGCACCCTCCCAGCTCTGGGATTGGACAGAGTCT 
TGAGGTACTCCGGTGGCTAGTCCAGCCTTGGACACTTGGGAGACTGTCCCATGGGCAGCTCA^ 

GGCCCCACCCCTTTGGTGCCACCTCTTCTGTGCCCTCAAGCCATGAGTTTGCTG

ATTGCACATCCTCATTTTCTTAGAGTCAATTTTGTTCT 

CTTG AACAT CAAATATT CGG AG CTGAACT AACTG AAACCC AACTCAGGGT AG CCTG AACAATG AGGTGT CATCAACTC CCAACAG A 

TGGCTTATTTGGTGCCTCTGGCCTGGTTTCTCTACATTTTGCTACTGTGCATCCT 

ACAATGTAGTCGTGGCAGCATTGGGCTTCTCACTCGCTGGTTGGAGTAAGGGCAGGTC 

CGGAAGTATCTTTTCTGTAAGTCCCTGCAAACCTCCCTCACGCCTGCCTAATCCAAAT

CTGATTGACrTAGCTTGGCTTCCTGAACGCATTGCTGGAACTAGGG^ 
TTGGAAGCTTGGGGTGGGGTGAGGTTTCCCTGAAGCCCTTGGGTCTGTGGATGGACCCTGA^ 



GGTAAGCGGAGAGGAGGCAGGGAGGCAGCCTGTGGACACAAAGTTGTCACCTGAACTTGTAGTGCCT 
TCCCCTCTGCAATGCTGTAGGATTTCCTTCCCACTCCTCCAGCCT^ 
AGGGCCCCAAGACAGGGCTACTTGTGCCCTTGGCCTCCCAGAGTGCCTGCCTCACTCTTCCA
TCCCCTCGTGGGACTGGAGTCAGCCCCTCCCTGCCCCCTCCCCACTGAGAGCCTCCCATTGCACATGG 
CCCATGGCCAGTCrTTGCCGAACCCAGCTCTGAGCGGGACCCACATGGAGCCACTAGTGGAATGGAGGA 
TCTGCTAGGCCTGTG AGGGTTGTCTGCCTGCTGATGGG CTTGTGCCCTC CAGCTGTGTCCATTCCTGCTCTCTCTTTCCTGTTCAT

TTCCTCCCAGGGCACAGGAAACCCACAGGAATACCAGGAATAGATGCCAGATTTCTCCTTC^ 
T in 'C l TGCC rrriTrr CTTTCTTTTAAAGTTAACATTTAA 

ACTTTGGGAGGCCGAGGCGGGTGGATCATTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAAC^ 
AAAATACAAAAATTAGCCAGGCATGGTGGCGCACACCTGTAGTCCCAGCTACTCAGGAGGCT 

AGGAGGCGGAGATTGCTGTGAGCTGAGATCACGCCIACTGCACT CCAG CCTGGGCGACAGAGTGAAACTCTGTCTTAAAAAAAT AAA

AAAT AGACC AT CCTGG CT AACACGGTGAAACC CTG TCTCT ACT AAAAAT ACAAAAAATT AG CC CGGCGTGGTGG CGGGTG C CTG T A 
GTCCCAGCTACTCGGGAGGCTGAGGCAGG AGAATGGCGTGAACCTGGXXS^ CA 
CTC CAG CCTGGGCG ACAGAGCG ACACT ACGTCT CAAAAAAT AAAAAAAAAAAAT AAAT AAAAACT AAAAT ATGTAG T AATGCT C CC 
CAACTCCCACTGCTGATTTGAGAAGTAATGTCTTGTTAAAATTTGGAAAATA 

GTAGTCCTGCAGCTTGAGAAAAGCACTTTTTACATTTTGATATTT^

GAAATGTGGTTGGGATATACACAGTTTTAGATCCTGTTGCTTTCACTGAAC^ 
ATTTTACAATATCATCCTTCAGTTACTGCGTGAATTTTCTGTTATATAGATTTACCAAG 
GCCGCX^TGAGCATCATGATTATTTTCACAAAATAAAl'lTTTTTGTTTTGTTTC 
AGTGCAGTGGCATGATCATGACCCIACTACAGCCTTAAACCCCTGGGCTCAAGTC 

ACCATAGG CATGCGCCACCACC CCT AGTT AATTTTT TTGT AGTTTl^TAGAGATGGGGGTCTTTCCATGTrGCCCAGGCAG ATCT

TGAACTTCTGGCCTCAAGTGATCCTCCCACCTCACCCTCCCAAAGTGC^ 
ATAAATTTCTAGAAGTGAAATTGGTTTGCTGATTCATTGCCTAACCTGAGCACT^ 

CCTTTCTCTGCTTAGAGTTCCAATCCCAAGTGAATTCTCCCATCACTCCGTGCCATGGCCCTTGCAGCAT^ 
CCC^GGCAGCAGGAGGTCGGGAATGCTTAGGGTCCrrGTOVCTGCATC 

TGGTGACTCCTTTCATTG AGAG CCGTCTCAAGATCCTTTTGGGGGAATTGCATAG CCCAGAAGTTTTCTCTTAGGTGAATAAGC CC

TTACAACTCTCATTGCAGTGGGAAGATGCCAGAAGTGGACTATGTAGTTCTGTCGGAATGG 

ACGTGT CTG TTG ATTTG AT AGGTG AG ATGCAC CCCTCC CACCTC C CT CTCTTGTCTT CTCTCCTGGGTCC CTTG AGG ATGTGCTTGG 

CTGGGCCCCAACCCTTGGGGGCAGTGGGTGAGGCACTTCCCCT^ 

CAGAC^GAATTCGGGACTrCTGGTCTCrCATGGAGAAGAACCCrGGC 

CAGACCCAAATCCACTGTGATCTGATGGGTTTTTCTAGTTTGCAAA

CTC^TGTGAGGCCTGTTTTGAGCTCTGTCTATGAACAGAATCCCATTCAG 
AGAGAATGAAAGTCXJAGGCGGAGC^TCAGTGGTCAAGCCAAGAGCTC^CCCTCCCTG 
CCCGCCTTTGCCTGTGTGCCTGCTTTGCTTCCAAGCCATGAGCAG^ 
CGGACCAATCCTGAGACTTGTTACCAGAC<aTTAAAGAAGAGATG 

TGACTTGGTTAAAGACAAACTGGGAAACCACCTCTGTCCAACCCGGC^^

GATCTCCTTGGCTTCCTGTGGTC^ 

TCCCCATCTTGGCAGCACC^GGGCTTGTCTGTTCTGCCrCCTGAAGATTC 
C^TTCCAC^AC^CAGGACCCAGCCCTCTTGTTCACTGC^ 
ATCCTTAGTGGTGTTTGCCCACTC-GAGAGATGCCATCCCCTTCTGA^
AGTATACXXTCCGGGCTGTGCATCTCGCCCTGGrc

TG AT ATTTC CAATACTCAT CACCACTCGTG CAATTT CC CAG C CCAGAACCTTT CAGTG CCTTG CTG CTGG CT AGAATG CAGT CCC C 
ACTTTGTAATACGGCTTATGACCCGGCCCCAACTGCCTCTCTC 

AGGAACTAGTTGTGGCCACACACAAACAACCACATACACAGTGGAGTTATGCAAATGTGCCTTT 
TAAAGTGTTACTCAGATGTCACCTCCTCCAGGA^ 
T T TC T CTGATCATGGGCCTTAGGACAGCATACTCACACTA^ 
AAGATTGGGTCTTATTCCCTCOTTGTCCTCTGCATGTC 



CTGGACTGGGGGCCCTGGGAAGCCTATCCAACTCCTGAAATCCTC CATTCCT 
GCCACGAG4XKXCAGGCGTCACTCTGGGCAGCTCCTTTGAAA

GGCCAGGCTGGACCATGCTCTGATGCXSC^CAGCAGCTCTGTC 

GATCCAAGCCATTGTCTTAAGGGTTTGACCAGGTC 

AAAGGCTGCTCCAGAGTTGACACGCGAGCCTCTGGGA 

GCACGAGTATCTGCTCTGCCTGGCATGCAGGCCCCATGGCTC^ 
CTTTCTGCAGTTGGTGGTCTTTATAGTCTGGAGGGTCTATAGGTTACGAGTT

CAGTCCTCTTGGTGGCAAGACAG AGCGGAGGGTTTAGAGGCTGCCCAGGG CTGGGCCC CTGCCTGAAGG AACTGAGGCCCTGCTGC 

TGCAAAAGTGGCAGTAAATGGGAGAAGCCAGGTGTGGGTCCTGCCm 

TTCTCGGAGAGGGGGATTTGGCAGGGTCATAGGACAATAGTGGA 

TTCCTAGGCAG AGGGCCCTGCCG CCTTCCGCAGTGTTTGTGTCC CTGGGTACTTG AG ATTAGGGAAT 
2 0 TATGCTACCTTCAAGCATCTGTTTAACAAAGCACATCTTGCATCGCCCT^ 

CAGAGAGCACGGGGTTCGGGGTAAGGTTATAGACTAACAGCATCCCAAGGCAGAAGA^ 

CTCCTATGTCTA L"1TC ' T ' 1 ' T CTACACAGACACAGTAACAATCTGAT C 1"! l ' (Jl " ri ' l 'CCCCACATTTCCCCCTTTTCTATTAGACAAAA 



TTCCCAGACTGAGCGGCCGGGCAGAGGGGTCCTCACATCCTAGACGATGGGCGGCCAGGCAGAGACACTCCT 

2 5 GGTGGCAGCCGGGCAGAGGCTGCAATCTCGGCACTTTGGGAGGCCAAGGCAGGCGGCTGGG^ 

TCACGCCACTGCACTCCAGCCTGGGCAACATTGAGCACTGAGTCAACCAGA 
G CTGGCGGATCACTCG CGGTTAGGAGCTGGAGACCAGCCCG^ 
GTCAGGCATGGCGGTGCGCGCCTGCAATCCCAGGCACTTGGCAGGCTGAG 

GTTGGCGGCAGTACAGTCCAGCCTCGGCTGGGCATCAGAGGGAAACCX3TGCAAAGGGGAGAGGGA 

3 0 CTGCCTTTTGCCTTGGCTTTGATTCAGGCGCCAGGGGAAGGGCCAGTGTTTC 

GCTGGTGACACTTGTGCTGTTTTGCTCAGCCTGGCCACCGAGTC 

TTGCAGGGTTCCTTTGTCTGAGTCACTGCCCATGGTCATT 

GCTCTGTCCACTCCCGACAGACTGCrTGTGTTTCC 

CAAAGGCAGCCTTTTCCCCATGGCAGCCCCTGTTCTGGTAAGTAAGACTCC 

3 5 GACCCGGGGAATGTCAG CCACCTTG CAGACAGGGGAGGG CACAGTCCTTT AGAGTTTG ACAACTT AG AAGAAGG AAGGTGGGA 

CTGGTGCTCTCTCTCCCAGTGCCTGTTTCTGGGCT 

CCATCACACCTGGCAGTTGTCACTGATCCAGCATCTTCAGCGAAGACTCTAGACCCGGGT 
GTTCCTCTGATGTGACCTTCCTCCTTCCGTGGATGACACAGAGCCTCATTTCA 

TGTCATTGTCTGGTCTGGGCTGTCACACAGTGATGCCCTTAGGAATGCTGAGGTGTGTGGAACTGTCT 

4 0 TCTGCGTTCCTGTGTTTACTTTTTACAGTAATC 

C CGTCAGCATTTCCTTACTTCC CCATCTCCTTCTTTTCCAGTTTGCT 
CAGATGCTCCCAAGAAATGCGGAGTCTGGGGTCCCTCCGAGT 

TTGGCTGTGAGGCAGGGTTGTGCGCAAGCTTTGGTGCATGGCTATAGGAGCATTGTTTGAT 
AGCTTTTGTGCGTGGCCATAGGAATATCrGTTTGGG 
4 5 TGAGATATTACTGTAGTA ArmU ' l^ lT a ' G T TT GTTTTTGAGACAGAGCCTCACTCTGTTGC 



CACCATCCCTCGATAATTTTTTATATTTTTCGTAGAGA 

GTGATCTGCCTCCCTCGGCCTCCCAAAATTCTCGGATCACAGGCGTGAGCCACrGC^ 
AGTACAGCTTATTTTTCTCCCTCATTATGGAAAAAAA 
5 0 AGGGTCTCACTGTGTTGCCCAGGCTGGT CTCAAACTCCTGGGCTCGTGTAATCTTCAGGTCTCAGC CTCCCAAAGTG CTTGGATAA 

TAGACATGAGCCACCATACCTAGCTCTTGTAGTACATTCTTAAGTGAA 
CGATTTGGCTCTAACCTTCCTTTTTATCTGCCTGCAACCTT 

TCCAGCTGTGAAAGGAGGATGCTGCTGAGGTCAGTGCAGCCTTGTGACCCATGTGTCTAATTTCCAGGTGATTGAG^ 
ACATGGAGAGGATGTTAGAACTCTTTGAACAAAATCGGGATCGAATATTAACT 

5 5 AAAGGTCTATGGCTCATGTCreAAAAATGCCTGCTGCTGCCAAGTTAGCTATTG 

GCTTTGTGTCTGGCCAGCTTGATTTTCCTAATGGTCTCAT^ 
AATGACAAACGGGACCAATGGGTTTGCCAAGCCCTTTGCTGTTCGCAGCCTCA 

TGCTGAGTCGCTCTCATGAAGCCCTTAGGGAGAGCACCTGTTGTGTGCCTGACACCACGCTGGAGCTGTG 
CTTCATTAGGAGGCCGAGGTAGGAGTCTTATATCCCAGGTGAGGAATTTGAAGCT 

6 0 ACAGCCTGTGTGCAGTGGAGCTGGCACCATTCAGACTTTCAGCCGACTCAGCA^ 

AGCTGTTCCCCACCGCCCTGCCTCTTCCGGTTGGAGGCTCTCATGTCTCTTTGGGGAGAGCTGGCAGTGTGCGGAGCT 
TTTCCCAATATTGAGCAGTTCCCAAGGACAGTCAGCATTTCTAGACTTCC^ 
CAGTGGTTTCCCTGCCCGAGGTCGCTGCAGCCCCATCTACCACATCTTCATGTGGACAT 
GTGCTCAGTCTCCTTGGTGATTAAGGTCCTGCTTGAACTGCTGCCAACTCCATGTCAG 

6 5 TGCCCAGAGCCAAGCTGGGGCAAGGGGGAGCCAGCCCTGGCTTCCAAGGCTCCCG 

CTCTTGGAATTACTGAAAGTCTGTGGCCCAAGAGAGAGACACAAGTGG 
TGCCTCGAGCCATAGAGCCTGAAATCATCTTTGTTGGCTCAGAAAATACCTT 
CTGCCCCTTGGATCAAGCAGAAAACAAGAGAAAGAAAGAACGTTCCCTTTGGG 
GCTCTGGAAGCATTGCTGGTCCTGAAGCTTTGGAGGTGGGCAGAGAGAGCTTCAAGAA^ 

7 0 AGGCTCAACTCTGGAGAAACAGGCCACAGCCTCrrCAGAGCAGCrc 

GTGAGTGTGGGCACCTGTGCATGCAATGCTCCGACTCTGCAGAGGTGCCAAGTGCCC 



GTGTCTAGCAGCACCTGTAGCCCACGTGCTGCTAATTAGC^ 
TGTGGGGTCTGTCTTTGAGAATTTGTAAATGAACAGTCCAATA 
7 5 GATTTCTCTCT CTTTTTTTTTTTT T TT TT TT GCCT 
GCTCTGTGATACCCAATAACTCCTTCACCTAAGCCCTGTTG
GCTGCTTTTGCCAGTAGATTAACGTGTTTTTATTTTTCAAACT 
TGTCCTCCCTGCCTTCCACCTTGGAGAGGAGGCTGGACGCATCAGCAGTGGCCAGG^ 
CACACCTGAAACGGCTGAAGCCAGCTTGCACAAGGGCTGCTC 
5 CAGTCATAGACACCATGGACCAGGGCAGGAGAAGGGCAGATGGCACATGGGCACAACA^ 
CATGGCTGGGAGGGGCATGGCAGGGGCrTGGCATCCCTGTAGAGCCAGAGGGGCCA 
CTCACCTCATCCTTGCTGTGAGACTGGAGTTCCATGGGGACAT 

CTGACTTTTTTGTTG CACAT AT AC ATTTT CTTTCAATT AG CATTT ATTTCAGCTTTT ATTT AAG CTTTTTGACAGT ACATGT AAAA 
TATATGATTATAACCATTTAAAAATACCTTATGTACCTGGTTTTT^ 
1 0 GAAGTGTGTAGTGGGGTGGTCCAGGGCTTTGTGGTGGCTTAGTGGCCATCGGGGTCCCAGGC^ 

CA^ATCCCACTCCTCCCCATCCACTCTTTAATCCAGATGCT 

CTTACGAGGCTTTGCTAGAGGCCCTGGGAGGTGGGGGGAAAG^ 

CATCAGCGCCGTGGGGTCCTTTGGGCTTCACCTCCCTCTCCCCTCTGCCCCC^ 
1 5 CCCATCCCGTACCGTAATGTCCCTTGCAAGGCCTAACTCTGTGAGGAATGAAACCACCT 

OTAATTTACAAACGAGTGAATGTGGACCTCTGGAAACATTACCCAGCTTC 

TTCCXSTAGGAGCCCTTTACCTAGATGAGAAGTGTCCCCCCGCCTGGGC^ 

CATTAGACCCGAGTGACGGTGGTGC CATTT AAACCTCAGAGCAGGTAAAAGGTGGTCC^ 

GGAATCTGAATCACTTCrTTTTTCCTAGAGCCCTX^ 
2 0 ATCTCCTTGGCCCTCTCGACCCTGAAGAGTTAGTTATTAAAATA 

TGGAATAGCAATAATACATCCCTGGCTGCCCTGGGCTTGGCCAGGATTACTCACT 

TTTTCCTAATAATCCCATCCCCTCAGCTTTCCTAAGGCTAGAGTGAATTTCGTGT^ 

CAAAAGCTATCATTAAACAGAAGCTAAGAGAAAGCCTATGTCGTGGAATCC^ 

GCTGTCTTGAATTTCTTTCTGTGTC' , rrTl'CI''rrT'l 1' TTT C T T T AAG ACTG TTGTTT ACCAGACTGGG CT CTGTGG AACACAGGTGT 

2 5 CCTGGGAGATGGTTAATCATTACAAAATATTGGT^ 

T CACATGGTGGAGT ACTGTG CAGCT AT ACAG AAG AATG AGG AAG AT CTCT CTGAATGGTT ATG AAGT AATGTCT 

T ATGTGAAAAATGCAATGCCCAAACG AGTGTCT ATAGTATG CTAC CCTTT ATATTAGAAAG AAG AGAGGCCAGGCGTGGTAG CTCA 

CGCCTCTAATCCCAGCACTTTGGGAGGTGGGTGGATGACCTGAGGT 

CTCTACCAAAAATACAAAAAAGTTAGCCAGGCGTGGTGGTGGGCACCTGTAATCCCAA 

3 0 ACTTGAACCCAGGAGTTGGAGGCTGCAGTGAGCCGAGACTGCGCCATTGCACT 

GAAAAATAAAAAGAAAACCGATAGAAGAAAATACACAGATGCAAAGGAAATATAGGA 

GGTAAAGGCTAGAAAGAAGGGAAGGATGAGACTGGGCTACAGGGATGAGAAGGGAATGACACTTCT^ 

ATTTATTTATTTTTTT G AGACAGGGTCTCGCCCTGTCACCCAGGCTGGA 

CTCCCAGG CTCAAGTGATTCTCGTGCCTCAGCCT CCTGAGTAGCTGGGATTACAGGTGTCTGCC^ 

3 5 G ACAGGGTTTCCC CAGTTTGGCCTGGCTGCTCTCAAGTG AT ATGCCC AC CTTGGTCTCCCAAAGT ACTGGGATTAGTG AACCGCTG 

CACCCAGCCTGAAATGTCTTTTTCTATAACTGACTGTTACATACT 

AATAAATAAGAGCAACCAGGATGTAGGGAAAACAGAAAAGAAAATTACAGATGAAC 

ATACTGAAGGGGGCAGGAAAGAACAGAACTAACAAATAACTGTGGAAAAC^GTATCTTTACT 

AATTTCTG T AC AT AAAAGTTGT AG T AT AGTT ATTAAAAATGTTT CTCGAC CATG CGTGG TGG CTCACAC CTG T AATC C CAG CACTT 

4 0 TGGG AGGC CAGG ATGGGCGG AT CACCTGAGGTCAGG AGTT AG AGACCAGCCTGGCCAACATGGTGAAACCCTGT 

T ACAAAAATTAGCCGGGCXjTGGTGGTGCACGCCTGTAATCC CAGCTATGTCGGAAGGCTGA CAGA 
GGTTGCAATGGGCAAAAAAAAATGTTTCTCACAGGAGTATAGGTTAGTGATTCTGA 
TAAATAGATTGTAGACTATGAGGGCTGGATTTCTTACrGTTAGTGAAAGAGG 
GTGCTGGGTTAGAATAGGAGGTGTCAGTCTAAACTCATGATTTTAAA 

4 5 TTTGGGAGGCCAAGGCAGGTGAATAGCTTGAGCCCAGGAGTTAGAGATCA^ 

AATACAAAAATTAGCCAGGTGTGATGGTGTGCGCCTGTAGTCCCAGCn'ACTTGGGACGCT 

GGGTTCAAGGCTGAAGAGAGCTGTGATTGTACCACTGCACTCCAGCCTGGGTAA 

AATAAAATTATATATAGATGGATACAGAAATATAGATGTGTGTGTGTGTGTGTGTGTC 

ATGTTTCCTAGCTCTGTCCTGAGTGTGTCTAGAGGCAATGGCACCCAGTAACAGTGAGCAC^CCCGGTACCCAGTTCIT 

5 0 AAACATCATTCTCCTGCAAAAGGAACCAGGTCTCCTAG 

CTGGAATATCTTGTGGTGCTAGAAAGAAAGTGCAGGGATGTGGGGGGATGATGGAGCCATGTC^ 
AAGTTCCTAACGGCCAAAGCTGGAACAGTTTGAGGAACAAAATAAATTATGTTAGTATTGGATTCT 

T ATC CACG AAT C CATG CTG AT AT AAACAGTTG AAG AAAT AAGT AAACAGGG AAGAAAGGG AG TG CTT CC^ CCAG 
TTAATACACACAAAAGGCAAGAGGGAAACAGAGACTTGCCATTACATAA^ 

5 5 CTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGATCACAAGGTCAGGAGATCGAAACC^ 

G AAACC CTGTCTCT ACT AAAAAT ACAAAAAATTAG CTGGG (^TGGTGG CGGGCGCCT CT ACTCTGG AGGCTGAGGC 
AGGAAAATGGCGTG AACACAGGAGG CGGAGCTTGCAGTCAGC CGAG ATCGTGC CACTGCACTCCAGCCTAGGTGACAGAGGG AG AC 
TCCGTCTAGGGAAAAAAAAAAAAAAAAAAAAAATAAATATATATATATATATATATATATAAAATCACA^ 
GATACTAAAATTAGTGGACAGAAGTTTGAGGAAAAACAAGATTTGCATAGTTTCTATTTCCCCT 

6 0 GGAAAGAAATTAACTAATTTGACAGTGGAGATGGAAACCCCATAGA 

CATCATGAAGCCTTTACAAAGATGCACTGAGGACAGCATCITTT^ 
CCCAC^CTGGAATGCAGTGGCGCAATCACAGCTCACTGCAGCCTCGACCTCCTGGGCT 
GCTGGGACTACAGGTGTGTGCCACCAGGCTGGCTAATTTTTTTTTTTr 
CCAGGCTGGAGTGCAGTGGCGGGATCTCGGCTCACTGCAAGCTCCGCCTCCCGGGT^ 

6 5 GTAGCTGGGACTACAGGCGCCCGCCACTACGCCCGGCTAATTTTTTTC 

GATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTG 
CATTTTTTGTATTTTTTGTAGAATAGGGCTTTTGTCATGTT 

TTGG CCTCTGAAAGTGCTGGG ATT ACAGTGTGAGC CAT CATG C CTGGCAG CATCATTTGTTTTTTTGG TG TGG TAT C CTTG C CAGA 
AATGCATACTTGACTCCAGTATTAAGAAAGCACCAGACAAAC 

7 0 CTTTAAAAGCATCAAGATCXSTGTTAGACAAGGAAAAAACAAGAACTATTACAG^ 

ATGTGGGATCCTGGATGGGCTCCTGGAATAGAGAAAGGATTTTAATAGAACACTAGGAAACT 

ATAACATTGTATCAATATTAATTTCCTGTT CTTGATCATTATATTAT^ CAATGTGGGATCC 
TGTATATGCTCCTGGAATAGAGAAAGGATATTAATGGAACACTGGGGAAACTTAAAT 
ATCAATATTAATTTCCTGTTCTTGATCATTATACTATGGTTACGTAAGATGTCAACT^ 
7 5 GAGTTCTTTGTACTATTTTGCAACTTTTCTCT 
ACACTGTAAATATTACAATCCCAAATGTTTAATATATGCX^

GGGT ATCGCTG AATGGTGGAAATTATGGTAAG<^ ri"lX^ CTTTCG 1 1 1 1 CAT ATTCATCTGT ACIT I' L.~r AAGTITTCT ACT CATTTGA 

TGGTTTAATATGGTGGATCCTTGGAAATAACGGTGAATGAGACAAGATGAA^ 

GAAATCGAAAATGAATACATGAATTAAAATOriTGTCCAGAA 
5 AGGTGACATTTAAGGGGAGATGTGAATAACAAGGAAGAGAACATCGCAGGGTAGG^ 

GCAGAGGCTOGGAATAGGGCCAGGCTCAGCATCTTCCAGAACC^ 

AGGGACCTGAGATGAGTTCAGAGGGAAAGTGAGGGGCTGAGCCT 

ACATGAGAATCATGATCAAATCTAGGTGGTTTTTT 

ATATATAGAGACGGGGTCTrTCTGTGTTGCCCAGGCTGGTCrCAA^ 
1 0 GTGCTGGAATTACAGGCATGAGCCACCACGCACAGCCCrAATG 

GCAGAAGGCTGGC^CCTCCAGGGACGTGAGTGAGAGAGGATGGTGA^ 

CAGACTCTGTTGTGAAAGAGAGGAGCTAGGAATTGGATGAAGG^ 

AGAGAGAGATATATGTTGGTCAAGGGTGATGTCTGCATTGGATTGG 

AATGCGGGAGCGGCAGGCTTAGTGGAGAGAGCAAAAGTTCrc 
1 5 AGAATGAATAAAAAAAAAATAATTAGCTCGCCTTTCCT 

AGGTCAGGTGCCCTGGGGGATATGCCTTGGAGTGGCTGTCATGX3 

CTGACCTGCTCTG C CTCCAG AT CAG CCTG IT CT ITTTTTTT I CCATTTATTTATTTTAAGAGAC^GGTCTC^^CTGTCACCaVG 
GCTGGAGTGCAGTGGTGTAATCACGGCTCACTGCAGCCTTGAGCTCCTGGGCT 

CTGGGACT ACATGG CTGG CTAATTTT AATTTTTTTTTTAAATT AT TGCACAACGTG CAGG TTAG 

2 0 TTACaTATGTATACATATGCCATCTTGGTGTCCT 

CCCCCGTCCCCCCACCCCACAACAGGCCCCAGTGTGTGATGTTCCCGTTCCTGTGTCC^ 

AAAGGACATGAACTCGTCATTTTTTATGGCTGCA^ 
TTGGACATTTGGCTTGGTTCCAAGTCTTTGCTATTGTGAATA^ 

2 5 GATTTATAATCCTTTGGGTATATATCCAGTAATGGGATTGCTGGGTCAAATGGTATTT 

CACTGACT AC CACAGCGGTTG AACT1G TTT ACAGT CCCAC CAACAGTGT AAAAGTGTTC CT ATTTCTCC ACATC CTCT CCAGCACC 

TCTTGTTTCCTGACTTTTTAATGATC^CCATTCT 

GCCAGTGATGATGAGCArrrri'l'lCRTGTGTCrTTTGGCT 

CTTTTTGATGGGGTTGr iTG ri T ' lTn ' T CTTGTAAAT^ 

3 0 TAGATTGCAAAAATTTTCTCCCATTCTGTAGGTTG 

AATGAGATCCCGTTTGTCAATTTTGGCTTTGGTTGCCATTGCTTT^ 
GAATGGTATTGCCTAGGTTTTCTTCTAGTGTTTTTATGGTTTTAGGTCTA^ 
GTATAAGGTGTAAGGAAGGGATCCAGTTTCAGCTTTCTACATATG^CTA^ 
CTTTCCCCATTTCCTGTTTTTATCAGGTTTGTCAAAGATCAGATGGT^ 

3 5 TCCGTTGGTCTATATCTCTGTTTrGGTACCAGTACC^TGCrG 

HUMAN SEQUENCE - mRNA 

GCCAAGTTATGGGTGCGTTCTGCCAGCGTCCTAGCAGTGATAAAGAACAGGAAAAAGAGAAAAAATC7U3TGAT 
AATATTGCIAGGTGGGAAGACGACATGCCTGGAATTCITCT 

4 0 GAG AAATGTCCGTGGCCACAAT CCTCTGGGCCTG ATGTAC CACX3ATGCCTCTCGCTGGGGTCTTACGCT ACAG ACTT ATGTGCAG C 

TCACCATGCTGGACAGGCATACTCCTC CTCAGG CGCAAGATACATTTTT 

GTAGAAAACCTGTATAGAAGTGGGAAGATGCCAGAAGTGGACTATGTAGTTCTGTCGGAATGGTT^ 

C4GA05TGTCTG1TGATTTGATAGTTTACCTTCG^ 

AGAAC4GTCATTCCGCTC^AATACCTGGAAGCAATTCACCATCTC 

4 5 GCCCCTGTTCTC^TGATTGAGGCTGACCACCACATGGAGAGGATGTTAGAAC 

AGAGAATCGGAAGCATTGCCCATAGGAGGCAAAAGGTCTATGGCTCATGTCT 

GAGCAATCTGGAAAAACTTGCTCCCAGGAGGX3CTTTGTGTCTGGC 

CTTTGTCATGCCTCTCTGGCCCTCGrGGGTAAA 

ATTCCCCCGGTGCCTCTCCCATGGCTTTGTGCTGCTGAGTCGCTCTCATGAAGCCCT^ 

5 0 ACCACGCTGGAGCTGTGTACCAATCX3TCTCAGCCTTC!ATTAGGAGGCCGAGGTAG 

CTC^AAACJGTTGAGGGGCTCCCCAGAGGTCACACAGCCTGTG 
ACrTTCCCTTGCCCTGGGCTGCCTCCTCCTGAGAGCTGTTCCCCACCGCC 

GGGAGAG CTGG CtAGTGTG CGGAGCTGATAACATTTTC CCAAT ATTG AG CAGTTCCCAAGGACAGTCAGCATTTCTAGACTTCCACA 
AAATTATGCTGCATTTGGCTGGAGCCCGGTGTTCAGTGGTTTC^ 

5 5 C^ACATTGAGATTCACATGCTGGCTCCTGAAGGGT G CTCAGT CT C CTTGGTGATT AAGG TC CTG CTTG AACTG CTG C CAACT C CAT 

GTCAGGGAAGTCGCTTTTGGTGCCTGGCTGGT 
CXSTACTGTCTGTGTCCTTGTATAAGGAGCTTTGCTCTTGGAAT^ 
AAGTCTTTTTGAAGTGTTATTTCATCCAGGGAAATGCCTCGAGCCATAGAGCCT 
AGCTTCACTCIAGCTGGACTGCATTGAAGGCGAGGCTGCCCCTTC 
60 GGATAGTCTGGAAAGTTGGGATTTGCAAATAAAGGCTCTGGAAG CATT AAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGGTGCGTTCTGCCAGCGTCCTAGCIAGTGATAAAGAACAGG^ 
AGGTGGGAAGACGA(^TGCCTGGAATTCTTCTCCAACGCGACAGACGT03AC4GTC 

6 5 TCCGTGGCCACAATCCTCTGGGCCTGATGTACCACGATGCCTCTCGCTGGGGTCTTACGCT 

CTGGACAGGCATACTCGTCCTCAGGTGTCATCTGTACGGTTGATGGAGA^ 
CCTGTATAGAAGTGGGAAGATGCCAGAAGTGGACTATGTAGTTCTGTCGGAATC 
CTCTTGATTTGATAGTTTACCTTCG^IACCAATCCTGAGACTTGTTACCAGAGG 
ATTCCGCTGGAATACCTGGAAGCAATTCACCATCTCCATGAGGAGTGGCT 

7 0 TCTGGTGATTGAGGCTGACCACCACATGGAGAGGATC 

GGAAGCATTG CCCAT AG 
Table 47

MOUSE NOMENCLATURE 
ICSGNM Nupr1
Celera mCG22384 

HUMAN NOMENCLATURE 
HGNC N/A 
Celera hCG1745228 

MOUSE SEQUENCE - GENOMIC

TTGCAGATCTAGACATTGACATCTGGTCTTCTCAAAGCTAAGGGCAGGC^ 

TTC LT1 r CT TCTT C T TCrTCTGTTAAATGGGGTTTGATlTCTTTGTCTGATAAATGAGGCTGGGA^ 

AGCAGTTCAGAGCTCTTACTCTTTCAGAGGACCCAGGTTTGATTCCAGCA 

GTGCACATX^TACAGACAAAATTCAAGCAAAATAGTCACACACACAC^ 
1 5 AGGGAATTGTGATTGCTGGTCTCTTGGGCTACAGCCAGGAGCTG 

CACT A TrXT'rrr AGTT ACCAGGAAAAATGT ATGAT AG CT AT ATTTTGCTGAAAT ATGT AT AACG CAT AGG CACTTGTTCTT AAATT 

TCATGCAAATAGTATTACCTGGTTGGTTGGTTGATTTGAGAAA^ 

GTAGCTGTTGTGTCTGTCATTGTCCCAGCCAAGCAACT 

ATCCTAAGCTGGGCITTCTTGACTCTGTACACCTCCrCATGAA 
2 0 ATAAAACCCCAGTCCCTGTTGTTTGCAAGTCTGCCTT^ 

C^VGAGGAAGGGGAGCCCATCCTTGTGGGATGGGTAGGAGAAGGA^ 

TGGAAGGAAGCCCAAGAAATGCATTGTTTCCCTAGGGTGAGCTTTGGCA 

GAAGT AGACTTTGTCTTTATOTCTCC^GGGAAGGAATTTTAACCAA^ CTGGGATAAGCTTGAACTCAGGTGTGAACTAC CAC 
ACCCAGTTTATGAAGTTCTGGAGATAGAACCAAGAGCTTCAGGTATGC^ 

2 5 CTCGCCTCTTTACTTGCTGTGAGAAACTCTGCACGTAGAA 

CCTAAGCACGTGTTAACAGCAGCIATCCAGTCTCAGAACCAAGAGA 

ACCAGGCACAAGCTGTAATTACTGGGCTGTGCTAGGTATTCCTGGCAACCACC^ 

AGC7lATCTCCCTCCCCCCAAACAAAAC!AAAATCCCAGAATGGACTACCT 

CCAATGAG TCA.CTTTTG AATTCCT ACTG AT CTGAGTGGAGTTT AAAATX"!" I'TC CACAAAT CC J i"l CT CAGGTTT CT ATTCTTTCA 

3 0 CCCAGCACCTACTCCTCTCTCCCTTCCCCTAGTGTITACATTTCCri'l'Crr 

CITACTCCATCATCACAGAGTCTCCTAGGATCTTGGAGGTCCT 
CACAC^TCCTGAGGCACCTTGTCAGAGTTTGTGTGTTTTGGG 
AAATTGCCTCCCCTAGTGAATAACAATACTATCTTCAAGGCGCTGTGCT 
GACGGAGTCTTAGCTC^GTGTAACGGCCTGTAATCCTAGCACAGAGGC^ 

3 5 ATAGTAACCCGAGTCTTAAAAGAAAAGAAAGCIATGCTGAGTATGGTG 

TAGAAACAGTCAGATCTCTGTGAGTTTGAGGCCAGCCTGATCTATATA 

TGTTTCATATACATACCCACAAGCAAAGCCAGGCTAGGTGGTGACTGCCTTAAATGCCCCAGCACT 

TGAGATCATGTCAAAATAAAAAGGGAGAGAAGGATGAGGAATGGGAGCTTCCCTTGC 

CT CATGGTAAATAAAACTCAGGTTTGGTTTGGCTTT^ 

4 0 TAGACCAGGCTGTCCTCAAACTCAGAAATCTGCCTGCCTCTGCCT 

ACAACTAAGCTTAAGAGAGGTG^ACTTCCTTGTCAGGCAGCTGTACTCA 
GCTCTTTGTTTCTVTGGACTCCTCACrrCCAGGAAG 
TGTTATACATTGTTATTAAGTCTCCATTCCrTACAAGAATAAGTTTCT 
TCTACCCTGAACCGGGCCCACTCTTTTGTCTTGATAAACCCCTGAG 

4 5 TCTTCCCAGGTACCTGATGGCTCCCCTTTGACATAAT 

CAGCTGTTTCCTATTACACTTATCTACCCCAAGGCAGCCTTGAGCCCT 
TCTTrACCGCTGTCTTrTCrTTTCCTTTTCTTTTC 

TCTCTGC CT CTC^TTAT AGGCATGAGCCACCACTGCCCAGCATGG ACAGCTGT^ 
GTTCCCAAAAGCCIAGTGAGGGGCrGTTCTCKlAAACCCCAACTTAGGGTAA 

5 0 ATAAAGCTCTCTTAATCAGACAGTTGTGGAAATGGTTTTl^ 

AGCCAGGGCTAOUUW3TGAGACTTTGTCTCAAAGCAAAAGAAAGAGAAAGA 
TGTGTGTTCTAGGCCAGCCTCAAACTTGTGATCCCCTTCTTCAGCCTTCCAAACAGC 
AACTAAGAGTTCTGCCCCTTGGAAGTTAGAGGGAGGGAGTATATATGCATGTGCATATGCATGTG 
GGCAGAGGGTTGGGGAAGAG^CACACAAGTATTTCACAGJVGCT 

5 5 TTCTGCCACGTGGGTCTTGGTAACTGACCTCAGATCCTCAGGGCTGACAG^ 

CAAGAATTATCCTTTTAAAGCTCCCAGAGACCCCACT 

TTGATGACACACAGCCAACAAAGGATGGAACTAAAACCTTGGCTCCAGGCA^ 

TAATCAATAATAGTTTGATTCTAATAAAGACAAATTTCTAAAAAGTG 

CCTGGCTGTCCIX3GAACTGGTTTTGTAGAGCAGGCTGGGCTTGAAC 

6 0 ACTGG CGT AT ATCA.TG CCTGG T AAG AT ACCTTGTT AATGCT AAGTGGATG CTG TGCAG AT AGT AAACT AGAG CAAGGTGG CG CAGG 

ACTACAGCTGCTGGTGTGTGTGGAGGTTTTGTGTTTGGCCTGGGTTTCT 

CAGG CTGGT CTTAAACTTGAG ATCCTCCCGCTTCAACCTCCTGAGTGCTTGGG CTACAGGCATGTGTTACCATGGTTGGCATATAT 
GTGGTACTTTGAACAGG^GTGTCTACTGCTACATTGAGCATCTATCTCCCT 

CTGCAGG CT CTGT ACAAGGCXTTCK3TCCAGGGAAGACT CAAATG C CTGGGAAGG AAGGG AGTG GAGAGGG AG AGGCCTGGCAGGTGA 

6 5 CTCAGTGCAAACCCHTGTCAGTGTTTGCCCAGCTGGGGTCTGTGTG 

AGCCTCTTTGAGGAATAAGGAAATAACATGATGTCACATGTTGGCrrCTGGGCTAAG 
GTCCTGGAATTACACAGGAACAAACTGAAACTTGGAAGTTGACAGCATG^ 
AGGAAGGAAAATAAAGGTAGGAGAGTTGTGTGTCCATGTGTCCCACATCCT^IACT 
TAGTAAGCTTTTTTAC^GTCTGTTTTCCrATAACCTGATGTCAA 

7 0 AGGACAGGTAACAAGG ACATCATTTTCGGTAC CTTTGTCTTGAT ACGTT AGCCCTGGGTT TT GTTTC I ll'GCTTATTTTAACTTTT 

TTTTTTTAAATATTGATTTGAGGTCAGGGTCT CCCTGGGT AACCCAGG CTGGCCTTGTTAATGTIT AGTATCACATTAAAGAACTTC 
ACAAATCAGACATAATCCAGCAAAGCAATTTCT^ 
TCACTTAATTGTTATTCTAACGCAAGTATTGACTGAGTCA.CCTCT 
ATTTATAATCACACACAAAGGAATTGAAAGAACAGAGGGCTXTTG 
7 5 AGTTTCTAGTTGGGCATTTTACATTGTGGGGAGATTTATGGCAC^ 
ATTCTCGGAGCCTTGGCTTTGAGATCTTGT
TCCCTTGTGTTTATTTCCACCTTGGGATTAGACCCAGAGCCT 
TTTCTGCCTTGGGACAGAGTCTCACCAGGTTTCCC^ 
GTTTCCTCAGTAACTGAGATCTCTCATTCTCAACACCAGAGTCAGGCT 
5 TCCAATTAGCCCAGTACTGCTCTGATTCAAGCAACTTGCTGGTTT^ 
GTCTGTCTGTCTGTCTGCCTACTTACCATATGTTTGTC 
CACATGTGTGTGTGTGTGTGTGTNNNNNNNNNN^ 

TGTGTGTGCCTGAGTGTGTACACCTGTGTGTGTGCATGCCTCTGTGTGTGTGCCTCTGTA 
TCTAAGCTCCACCCCACAGTTACCrrGGCAACAGCCAGGTAGGCCTGACT 

CTCTACTCCTGTTCTCCTCTACTTCTCTCTCTCT 

AAT AAACT CT ATTCT ATACT AT ACCAT CCTGTGG CTGGTTCCT CAGGGG GAAG TG C CTT AG CATGAG C C CACGGAGG CACCACTT C 
CCCCATACCTCACCACACCTCCATAGAACATATCTCCCTTTTCTTTATTTTTTC 

G CAGGAGGATGCTTT ATTC C CTG AAACAGGGTCTG T CACTG AACTTGG AGCT AGGCAGCAG CT AAT AAG CAGTC CTT CTGT CTG CT 
1 5 T CACT AGTGTCACAGTG CTGGGGGCAGAGGTGTGCACATGG CCGTGC CTGGCTTTT AACCTGi^^ 

TTCrTGTGOVGCAGTGCTCTCACCCAO^ 

AAAACAGATGCTTGGTTTTGTTTATGTTCAGGGGAGAAAGCC^ 

TGCAGCAGGCCTGGGGT^CATCCXaAGCCTTGTGCTGACA 

CltKSCCAGGGTTGAGJlGGGTTTGGTTTTTCTGGJ^ 
2 0 TGGAGACAGCAATGCAGATAGAATTGGTGTGGGCACCCTTTCTGGGC^ 

AGGAAATTCCCCTTTCCTCCGAGCTCMCAGCT.C^ 

AAGGAAACAGTGTCTTCCAGACACAACAGGACTGATTGATAGACAT 

GTTCAAGCCACATGGGGTCACIAGCA.CTGAGAGGGGAAAGTG 

GAAAACTTAGTTTCCTCCAATAGAGTCTC^CrcGCTGCATTAACC 

2 5 ACAACAGTA C r i ' rTA TCGGC m 'ClTTTCTCACGGTGTTTTCATC^ 

AGTTTCrGATTTTGTGTTTTTATGGATTTGTGTGTGTGTGTGTTTTGTTAGGACT 
CTATTAACATATCAGGGCAGATGCTGTCAGCCTGCCTCrCT 
CCACATGCrACCCTAAACTCTCCCTTCCCACAGTTTCTCTGAT^ 
CCTCTTGGTeTCTGGGTTCTCAGG rriTrrr G lTn ' rr TC 

3 0 TCGACCCTCCTCTCTTCTCfrTCCCCCTGTCT 

CTTTTTAATTTTTTCATTCATATTTCTGACG 

GGGGCATGGGGAGGGTCTGGGAGGAGTTGGAGAAAAGAAACCITGATCAGAATACAT^ 

AAAG T AG AAAATTTCATTT CACATG CITTTT ATTTGAACTT AAAATT CATACT AATT CTT ATCAAACAACC CAAGGG AAG CT AGT A 

3 5 TGGACTGAAAAAGCITACTATTAGTTCACACTTGATTCATACTGGCT 

GTCAAAGGCAGGTTGATCTCTGAGTTGAACACTAGACTCAAATCT 
TGAGACTGCCTAAAAAAC7UVGGCAGGGGGTATCAAAGCACA 

TAGAAGTCACACGTATTTATTGTGTGTGTACATGGATAGGAATGTGAGGGTCAGAGGACAATTTATGGGAC^ 
ACTGTATGC^JTCCTGGAAACCGATAAACTCAGGTTGTCAGGCT^ 

4 0 AATATTTTATTTTAAATTCCTriTCTCCACTCT'rrrTI'lTTl^'lGAGG^ 

GGGAAGTTGTGACTAATCCTTACACrGTTCCCGCTTGTGAAGGTTTTCGA 

. AGATAGAGTTATCCTCTGTTAAAGGGGGAACCTGAAGCAATCAGAGAGGGAAAGTGACGTGTCCA^ 
TGGATCTGAGCTGACTTGTGATCCCAGCATTTAGGATGAGGAGGCCAGAGA^ 
AGGATTCTGTC^GAAAGAAGGGTAGAGGGAGGTGGATCAGGAATCAGAGTC 

4 5 CTATCCTCCATCTTGCATGAC-GAAGAAGTCCCTGGGGACTCTC 

GG AACTGCTGAC CAAGCAAGAAGTGGGTCAGTGTCCCTCCTCCCrCCCTTTTCAGTATGT ATG CTGCCACTACCTCCTG AATCTG A 
ACCAGACTACAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

CCAGGCCCAGGAGGGGCTTGTGGCAGAGGCTGAAATGATAGGAAGGACTGGCCTATCTGAOTCCCTCTG 
CCCAAAGACTGTGCCCAGATCCTGTCnX5CTTCCTCCGCCCCTGAGGCGAGAGCTTTCCACGTG^ 

5 0 GAGTCCTGTGAGGGAGGAGGGAGGGGAGGGTGCTCCTCTCTGCTCAGGACT 

rTGCATCAGACAGGGCTTGGAGGGCGGGGGCCAACCCCGTGGTTTAGGGAGGAGACAGra 
AC<3GCAGACATTCAGACAAGGAGACACAGGCAAGACTTTGGAGAGAGCAGACT 
CCCTTCCCAGCAACCTCTAAACCTAGAGGATGAAGATGGAATCCTGGATGAATATGATCAGTAC^ 
GTAAGGTTTTCGTGGTAGCTCCTTGGTCAGCCCATCTGCTTCTCACTGGGTACCCCGGTTTACC^ 

5 5 ACTCCCAAGTAACCCCTTCTAGTCTCATCTACTGAGTTCTGTCTCCCTTAGCCCCAAGAACT 

TACTTGAGTAAACAGCAAGGATAGAGTGCTTCAGACGTGAAGACGTGACCGCCCCACTCTGGCC^ 

cacacacctcctctacctcctgctccaccagtcgggggaggtccxsaaag^ 

CCCTAGC CCTGG CG GG CATGAG AGGAAG CTGCTGAC CAAGTT C CAG AACT CTG AAAGGAAAAAAG C CTGGCG CTGAGACGG AG CTG 

gaggtgagtcgggagagccgggcctaagtgtgcagagcgggaagcggggjuvccaagagct 

6 0 agaggcacttgcag^caagcaggacaacctgtgcttgatctc 

aattgcaagctgtgttcccgagggagccctgcagatagatgtccaggcagggaagtgataagaa^ 
ttccatcagcttggatcagatttcac^gcatctaacgccac^c^ 

aggtttct aggcaacagt cagtccatcagttcagttccaggagcacaataaag cttgaggtcatcaggcagatgtgacatctttcc 
taagagtgggtaatggtctagggaggggagagtctc^gattatcattctggagaacttgagtgaggtcg 

6 5 tgggggaggtctgactccacacagagccaaaagctcaccgggttgttaacaaaatccactctcagat^ 

atttgactttatcrcctccattgtggagacacacggcatgattaa 
tgtacctccaatgtccaccgactcccatatgctcaaagt 

ATATCAATAAATtlAAAAAAATAATGTTGGCCGGACAGTGGTGGCGCACGCCT^ 
ATTTCTGAGTTCAAGGACAGCCTGGTCTAAGGAGTGAGTTCCAGGACAGCCAG 

7 0 AATACAAAAAACAAAAACCAAAACCAAAACAAAACAAAACAATAG^ 

GACCCGTATTTGACTGAGCTCCCTGGTAACCAGAGGTGGGGGGAGTTCCCATGGAGCCTITCCAATC 

GGAAAGGGAAACT G AGG CAGAT AGAGTCAAG TACTTCTCT AAGG T AAT AG CAAAT ATGAGTAG GAAAGG TGGAAGT CAAACT CTGA 
TCTCTGTCCCTTTCGTTGCCTGCITTCCCT 
ACAC CCAGCVITGGAT AC a /K^ a -CATC A 
7 5 TGCTTGATTCTTTCCCACCTAAGTGAGGTTCTGGAGAGGG 
AGAAGCAGAGGGTAGACGGGCAGAGCTAGGAAGGCAGCCCCCAAAAGAACTGTGGGGGAAATAAATCGGATC
CAACAATAAATCGTACAGCAGTGACAG C Tl M l t i GCATGTGTTGTGTGT C X ' riXS TTGTGTTTGGGTGTGTTCTGTCTGGCTGTGTCCA 
TTGTGTGTCrCTCTAGGAGGCTGATCTGGAATGGGAGCGTCCTTAT^ 

CCTTCTTCT AATGT ATG CAG ATGCTTTCIAGGAG CCTGTTTCT ATGG TTTCTTC CAGGG AAGl'l'GClTTGTTT CTTl'G C AGTG CTGT 
5 CCCAGAGACAACCTGTGGCCAGTCCCCTACATATTCTTGAGCACCA 

TTCTATTAATACCATTTGCAAAATGGAAAAAAAAAACAGC^TCCATGATTTA 

GCTCAAATT CAGT CATGAAAT ACCAGG CTGG CTTC CAGGATGGGGTGCTTGGTGAG CTGGTGAGAAAGCTCT C CCTAAGCATGGTG 
GGCCGATGGGTCTCAGGACCAGAATTCCAGGCAAATGTGAAGCAAATGAGAAGATGGT^ 
TGAGTTTGGAATGTTGCAATTTTACTCAGACGAAACAATG 
1 0 CAGGTGGCAAGTCCATTTTCTTGAGGCTGGTCTCAAACTC 

CACCACTCCCACTTAGCCTATTCGTTCTTAAAAAGAATTATTTATTC^ 

CACCAGAAGAGGGCATCAGATCCCATTACAGATGATTGTGAGCCACCATGTAGTTGCTGG 

CAGTCAATGCTCTTAACTXjCTGAGCCATCTCT 

ACCTGCTCACAGTTTAGTCACATGCTTGAAGAGAGAGGCCGGGGAATGCC^ 
1 5 GGTGAGGGCAGAGTTTTATTAGAGGAAGAAGATGGAGCATACT 

AAAGTGGGAGGAAAGAGCCCTTGCTCTTGCTCTGGGCTGAGCCTTTGTCCCTTCT 

TTGCACTTAGATGGAACCXSGAAGCTAGGGTACTTACGGAATCAGTGGCCTCAC^ 

CATATCCAGCCACCTCCTGTGCAGCTGGGAACATTAGGACAATTTCTGGTCT 

CATTCGGAAGACAGAG G CAGG CAG AG TTT ATGAGTTTGAAG CCAG CCT AG TTT ATGT AGTTCCAGG C CAG C CAGGACT CAT AG CGA 
2 0 CACCCTAT ACTAGAGAAGGGGGTTGGGGGCGG AGGGAGGGAGTG A CCAACCAG 

GACTCAAAGTGACACCCAATAAATCAAGCAACCAAGCAAGCTTT^ 

GT CTT CAGCATTCCTCTGCTGGGCC CTCTCCC CTCCTT CTG TTG ATCT CTC CT CTGGAATG CCACACCAG CT AACAGG CT CACGGA 

2 5 TATTTCTTTTGGCAGGTGGTTTTGTGGAAATTGTCCTGGTGGCAATGGAG 

GCCTTGTCTGTGCAGGAATTCTGTAGACAAGTCAGGCCITGAACTCATAGAG^ 
TTAAAAGTGTGAGCCACCCTACCTAGCCCAAATCATTCATTGCTTTGACATCTGTTAGTT 
GAAGCACAGACTTTGTAGAGAGGGACACATGGTACAGCTTTTAGCTTTAGAA 
CAGCAGTATGCATCTGGGTGTCCAGCTTCAGAAATGACAGCG 

3 0 TTGTTTGTAGCGTTTGTTTTCTGCGGGGCATCAAACACAG^ 

CTGCAATAC-GATTTTACAGTTATGTCTCCATTTAATCTTTATAGTAACCCAA 
CAGAAATCTAAGGCCCAGAGATTTGCAAGTTCATACTAGACCACATATAGTAA 
CGCAGACTCAATCAATACCACATGAGACTCTGAGCTCAGGTTTTCCAACCTCTGAC 
CACGGGGTTCTGGGCAATGAGCAACCCTAGATTCTAGATCTTCTAAGGGTGGGTTCGAAAACAAC 

3 5 ATTAAGGTCAAGAACACCCGTCCTCAGTGTCAATGGCCAGTCCTAGCTGCTGGACCCTTGAGCACCTGCTTCT 

CGCCACCACTTACTCTTGACCTCTGATTGACAGCCCTCTGCCT 
AATTCTTAGAACCTTGGAGATCTCAGATCAGGAAGGACCCTGAGCAGAATGTCCCT 
TAATGTCTGATTCTCTCAGTTTCGGGGATTCCTTAGTAGAACTAGAGAAATCACT 
AACTTCTCAGATTTCCGTAGCTGAAAGGATTCCCAGCCTCTTGCAGAGAATTGAGAAACT 
40 TCCTTTCTCAAAAATTCAAGGCTTGACTGGCCATATATATGTCTGCCrc 
TCATATAAATTTCAGGATGACTCAAAGTTAATTTTCAACTO 
TGCTATGCTGGGTGCCATGGGGATTCCCCAACGTCATCTCTCACACAGCGTT 
TATTCTATCTACTTGTCATTGAAAAACCACCCCrTTACCrGTTTTACTC 
CTAGGAAGCCATGTGCTGCTCATAAGCACAAGTTCACTCTGGCCA 

4 5 GAGGCAGGGGGATTGCAATCTCAAAACACAAGACAAACAAAACT 

CCAGGATGGCTCATGACTTTAATCCCAAGCACTCAGGAGGCAGAGGCAGGCTGATTT^ 

TAGAGAGTT CCAGG C CAG TC AGG ACT ATA7AG TGAGAT CCTGTCTAAAAACAAACAAACAAACAAATACAAAAACCAAG^ 
AGTTTATGTGCCTTTCTACAATGACCAAGAGTCACAATTC 

AGTATACTTGGTGTGACCATGGATGGATGTCCTCCAACTTACAATAGTTCAACTTGT^ 

5 0 ATAGCCCTAGCTCGCATCCTCCTGCCCCTGGCTCCAGAATGCTGGAATTAATGACACGTC^ 

AAAAGTCATTCACACTCAACAGAAAGTATACTCTTGTTTCTGT^ 
TCCCTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCTCGAACTTCGAAATC 

T AAAGGCATG C AACACCACTG C C CGG CG TG AAG ACAGTCT CTTGGT AC C CAGG CTTGTG AACTTGAT ATTTTGTTG AG CATG AC CTT 
GAACTTCTTTAAGTATCATTGTTGAGGGGAGGCCTAGATGTGTGTGTGTGACATCAGAGGACAACTTTGTGGGA 

5 5 TTATACTTTTACAAGTGTTCTAGAATTAACTAGGGTTAA 

AAACTTTGGGCTTCTGAACTTTTTGCCTCCATCATCTGAGTGCT 

TCTTAATCCTAG CACGTGGG AGGCAGAGGCAGGTGGGTCT CCGTG AGTTCAAGGCCAGCCTGAACTACAGAGTGAGTTTG AGGACA 

ACTGGGGCTATATAGAAAGATTCTGTCTCTCCCACCCACCCCAAAAAGGCACACACCACTGCT 

TCAAACCCAGGGCTTCATG C&CACGAAG CAAGC^CTCTATCAACTC AGCCAT A 

6 0 TTTTTTGGTTTTTCGAGACAGGGTTTCTCTGTGTAGCCCTGG 

GAAATCTGCCTGCCTCTGCCTCCCGAGTGCTGGGATTAAAGGCGTGTGCC^ 
CACTTATAATATATTGTGTGTATGGTAGATGTGTGACAATCAGGACAA 
CTGGGGGATTGAACTTGGGCTTGGCGGCAAGCACCTTTACTTGCTGAGCCAT^ 
GATTTATTTATTTATTATTATAAATAGGTATACTGTAGCTCTCITCA 

6 5 TTGTGATTGCTGGGATTTGAACTCAC^GACCTCCGGAAGAGCAGTCAGTGCT 

TAAACTTTGCCTGTGTGGCAGTGCACACGAGTGCAGGTGCCGAGTTGCTCCCCACTGGAGCTGG 

GCCTGATGTGGGTGCTC^AGTCTTCTGCAACCCAAGCTGGA 

CTCTCAAC CACTGAGCACAGTTCCCTCAGTCCCT CAG CCAG CTTGTGCAGCCT 

GCTGCCCCCCCCCACATCAATCAGCAATCAAGAAAAC^ 

7 0 TGATGTTCGCTCTTCCCAGGTTTGTGTCAAGTTGAAAATAGAACTAACCA 

GAATTCATGGTTCCCCTGCTTCAAGCTTCCCATGCTAAGGTTGCAGGTGTGTGATGACACCCCCAACTTT^ 
ATTTTCATCTTACAATGGACTCATTGAGGCATGAACCCATCATAAATTGG 

TCC CATTG CCTC C CACTG TTCT CT ATATTGACAGACAGCTT CT ACC C CAAAT CCATG C CAACAGTG T ACACTT CT CTGTGG ACACC 
TAC a -CK^AGGTCGTGTGATTTAACCTGA 
7 5 AT AG AAGC CAAG T C AAAG CCT AGGCCTTCCTT ACTT CTG AC C CACTGAG AT AGGTCAGW ACTG CCACAAGAGGG CACAAACC CAA. 
ACTGG CTCTGAG CCCTG TGCAAGGAAAG CA.TGATGTGTGGTTGC CTTCACCCTGG C ACTCT CCAAAAG CTTTG CCAT AAGACATTT
G ATT ATG CCGGG TGGTG TTGG TG CATGC CTTT AATC CCAG CACTTGGGAGG CAG AGG CAGG CAGATTT CTGAGTTCGAGGCCAGTC 
TGGTCCACAGAGTGAGTTCCAGGATGGCCAGGGCrACACAGAAAAAC^ 
CGCT/TCTGATGTACC(^^GCTAAAATTAGACGTAGCGATACAAATGA 
5 CCCATAAAATATTCTTGGTAAACCCCCC^AATCAACTCAGATTC 
GTGTGTGTGTGTGTGTGTCTGTGTG03CATGCATGTC 
GAACCGAGGGTCTTGACACATGCTATAGTTATTCCACTGAGCT 
TTAATTTATTTATTATATGTAAGTACACTGTAGCTGTC^ 
AGCC^CCATGTGGTTGCTGGGATTTGAACTTCGGACCTTCGGAAGAGC^ 
1 0 CCTCTTCTTTCCTCTTAATTGACCTAAAAATCACTX3 

GCCTTGACATCGCTAACTTGTGGCCTTTGAGGACAGTTC 
CGTGAGCCTGTAGATGGACCATAGACATGAGATAAGAAGCT 

CACTTGGAACTAAATT CTACTCTG CTTTG CAGATAGACAAG CTGCTTATGGG AGAAATGTTT AGTCCAAGACTATAAG AGAACTTG 
GGACCGGGCAGTGGTGGTGCATACCTTTAATCCCAGCACTTGGCCT 
1 5 TTCTCTGCAGTGCAGCAGCTAAAGACTGTACAGTCTTTC 

GAAGAGGATGACATCAAGCCAGGTATGGTGGCCIGTGAGCTCTGCCTC 

GTCTTGGATATATAGTAAAGTCCAAGGCCAGGCTGAGTTGTCTC 

CATGCCTGCAATCTCAACTTTTAGGAGGCAGTGGC^ 

ATCAAGAAATCAAAGACAGAAACTGAGCTGTAGCTCTAGTGGCAGAGTGTTTACC^ 
2 0 TGTATCCCAAAAAACAGAGATGTAGACGAGAGAGTTGGGAGTGCAGGGTCCT 

TTAGGATGGTTGAGGTTGTT3CAGAGCAAGGAAAAACT 
AAACCAAATCTGTTGTCTTCACTAAGCGAGCAGCTTT 
CCTTCAGGAAGCCTTGTCACACAGCATGTCCTCATTTT 

2 5 CTGTATTCCCAGACTrCCCAGAAGCCTCTTCTTGAGCTCCATCCTTTGCTGAAAGGG 

TGAGGAGTGGTTTAGGCAAAGGTGAGCACTCAGAATCAGCCAGAGC^ 

CTTCC CAGG CT CTCCAT AGGGTCT CTACAT CTCTTTTT CTG CTTCT ATGATCAG TT CTACCTCAGAGATT AGCTCATTTG C(J rri G 
AAGATTAACTCACCTITCTGGTCCCTTCCACTCTCTGGAGGTTG 

TCCATGATTTGCCGAAGAATGACACCCCAGACTCAAATAGTATGTAAACACAAAGAGTC 

3 0 GGGACCTCCCATTACCAAGATAGAGAAACAACCAA 

GGGCTTTGTCTTACTCCGTCTCTAAG^TCATTCTGTCCCTGG 
CAGGCCAGGTGATGGGGCAGTTTCTGACTAX3CTGCCTGAAGTITG 
GTCTTGGAATCAGCCTAGCATTCTCACCAAGTCAGGATGGGTTCCACTGGTGGC 
ATAAATGCAGCTATGACTGAAAAGGCTAAGAAAAGAACACTAGCCTCTGTT 

3 5 GCTCTGTGCCCGGACCTTGGGAAAGAAGGC!ATGATTGCTTAGATTGA^ 

CAGCATTGGGAGGCAGAGGCAGGCGAATTTCTGAGTTTGAGGTCAG 
CAGAGAAACCCTGTCTTGAAAAACCAAAAAAAAAAAAAAAATCTG 
CTGAAGGTTGCTGGGAAGGCAGTCACrAGTTGGCATTCTTrCAGCAAG 
AACTACAGACAGATACTCACAGGCAGAC^GTTGATGTTGGA^ 

4 0 TTTACTACC^CCTTTCTCCTCCCCTGTTCCACCCTAATCC^^ 

GCTGATTAAAGGCATCAAGTCCCTTGGGGCAAATCCAATCTTCATCCT 

GCAACTGCAGGAATCGGCAGAGGCAGAGCTGAAGCAGTGGGAGTAGCAGCAGCC^ 

TATACCCTCTCAAGGGTTACCAGAATTCCAAATGTAAACTATCTGCTGCTG 

MOUSE SEQUENCE - mRNA

CACAGG CAAGACTTTGG AGAG AG CAG ACT AGG CAT AATGGCC AC CTT G C CACCAAC AG C CAA CCCTTCCCAG CAACCTCT AAACCT 
AG AGG ATGAAGATGGAAT CCTGG ATG AAT ATG ATCAGT ACAGC CTGG C C CATCC CTGTG TCGT CGGGGG AGGTCGG AAAGGTCGGA 
CCAAGAGAGAAGCTGCTCCCAATACCAACCGCCCTAGCCCTGGCGGGCATGAGAGGAAGCTGCTGACCA^ 

AGGAAAAAAG CCTGGCGCTG AG ACGG AG CTGG AGATAAGG CCAGAC CACAGACACCACACC CAG CAATGG AT ACAGGACAT CACAC 

5 0 CCAGCAATGGATACAGGACATCACACCCAGCAATGGATACAGGAC 

TGATTCTTTCCCACCTAAGTGAGGTTCTGGAGAGGGAGCCCTTCCACCCTTCCAGACCC^ 

GCAGAGGGTAGACGGGCAGAGCTAGGAAGACIAGCCCCCAAAAGAACTGTGGG 

AATAAATCGTACAGCAGTGACAGCAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING

ATGGCCACCTTGCCACCAACAGCCAACCCTTCC CAG CAACCTCT AAA 

GCCCTGGCGGG CATGAGAGGAAG CIGCTGACCAAGTT CCAGAACTCTG AAAGG AAAAAAGC CTGGCGCTGA 

HUMAN SEQUENCE - GENOMIC

ATGCCGCTGCACTCCAGCCTGGGGGACAGAGTGAGACTCCGTCTCAAAAA 
AAAATGGGAGCACAGATGCACCGTTGCGGAGGGCGACAATGATGTCTGGGCTCG 
GGGGGCACGGCCCCTTG<»CCCGAGGTGGGAGGGTAGGGGACGGTGTCTGTGGCCCAGG 
TTCAGGGCACTCCGTCTACACCCGTTACACTCCGCAGCAAAGCCGTCCCAGGTCT 

6 5 GAAGACGCIACTTGACCTGCAAGAGACTGTTTGAATACATCTTCCCTAAGAGG 

GCTGAGCTCCAAGGTCCCCCTGACTGAGCCCTGGGATCCTCTTGACCTTCAGCAGAACT 
GCGGCTGGATGAGCGTGCCTCCATGCAGGGGCGCTCAGGCAGAAAGT 
AAAATTACAGCACGATGTTGCCCTCCCTTGTTTTTTTGACATACAC 
TGGGCCAGACAGCCCAGTGGAGGAGACC^CCAGGACCAGCCCTGGTGAACGTGAA^ 

7 0 ATCAGTGACCCCAGCTGGAAACAGATTTGATATTTGAGAAT^ 

TTATGGGTTTTTTTGTAAGTATGATCTTCACTAATCAAACT 
TACCTCAGGGGTTITTGTGTGGTTTTTTTGTTTT 
TAGCACAATCTTGGCTCGCTGCAACCTCTGCCTCCCCC^ 
GTACACACCACCACACACACCTAATTTT^ 
7 5 TTGCTCTGTCGCCCAGGCTGCAGTGCAATGGCACGAT 
CTCAGCCTCTTGAGTAGCTGGGATTACAGGTGCCACCATG

TTGCCCAGGCTGGTCTGGJUVCTCCTGACCTCAAGTGATCTGCCTG 

CGCTCCCGGTCTTAATTTTTGTATTTITTTG 

CAATGTGCCCTCCTCAGCCTCCCAAATTGCTGGGATTACAGCGT^ 
5 AGGTAAGGAGGACTCCCTAGGAGCAGTTCGCAGTGGCAGGGAGGCATATTTTATTACT 
AAAAGGCAGAATGATTTGGATCAGTTTAATTATTAAGTTCTTTA 
GTCCCAAAATGCTCCCTTAAAATTATTTCCAGATATTTAAGAGCrrT^ 
GAAAATCTATACTATAAAGTAATCGTTTCTTTAAAGATACATTITAAGCAAGAGGGTTr 

AAGCACAAGCCATGTAT ATTCCGl T ' lTrCT 'l' TCiTIC 'l' rTCTT TC TT T GTTTTTGTAAAAAAAATTTGTTTTGAGCCAGGGTCTCA 
1 0 CTCTGTTGCTCAGGCTGGAGTGCAGGGGCGCAATCACCGCT 

AGCCACCTGAGTAGCTGGGACCACAGGC^CGAGCCACCACACCTGGCTAACrrTTO 

GTTGCrCAGGCTGGTCTCAAACTCCTGGGGTCAAGCCATCCGCCT 

AGCCATTTCCTTTTTTTCTATGCAAGAGTATTGATATATGTACTGAATCATC 

CTTGCTGGGATCACAGGCAAGAGCCACTGCGCCTGGCCCAAGTTTAClTTTrri' I'l'Vl'GGAAATGGAGTTTTGCTTrrGTl'GCCCA 
1 5 GGCTGGAGTGTGATGGCGCCATCTCGGCTCACCGCAACCTCCGCCTCCCGGGTTCA 
ACTGGGACrACAGGTGCCCACTACCACGCTGGCTAATTTTTTGTATT^ 
TCTCGAACTCCTGACTTCAGGTGATCCACCCGCCTCGGCCTCTCAAAGTGCTGGGAT^ 
ATGTTTACTTTTATAGTCCCAGCACCAAGCAAGT 




CAATTTTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGCCATGTGCC^^ 
GGGGTTTCACCATGTTGGTCAAGCTGATCTCAAA 

CAGGCATAAGCCACCGCGTCCAGCCTTAT^CTAGGTTrTTCTACTACTGTGTTTTA 



2 5 AATGTCACACCTCTATTATGGTGAAAACTGTGTATTACTCATAGTTATCTATACATTTA^ 

TGATG TTCTGCAAC CTGTGTTTTT CATT AGGAATGT ATT ATGGACAT CTTTCTCAT CAG AGAT AG AG ATACATTCT CT CT CTAAAG 

AGTATTGAATGGCTACAGTGTTTCACAGAAGGAAACTGATCAGTCAC 

TTGC 1 TTT 1"I TAGAGACAGGGTGTTGCTCTCTCGCCCAGGCTGGA 

ATCCT C CTG AGTAG CTGGGACT ACAAGT AC ACACCAC CATG CCAG T Cl , l'I"rriTU"l"rrri"l' , ri , TTT AAGATATGGGG TCTCG CT AT 

3 0 GTTGCCCAGGCTGGGTTTCCTCTTTTTCACTACTACAAACAA^ 

TTCTGTAGGAGAGACTCCTCAAAGTGGGACTGCAATATCAGGG 
CATTAAAGACAGCATCTGGCTGGGCXSCAGTGGCTCATGCCTGTAATCCCAAC^ 

GTT AGGAGTTCGAAACCAG C CTG AC CAACATGGTGAAAC CCCGT CT CT ACTAAAAATACAAAAATT AG CTGGG CAT CATGG CAT CT 
GCCTGTAATCCCAGCTACTAGGGAGACTGAGGCAGGAGAATCGCTTGAACCCGGGAAGCGG 

3 5 CACTGCACTCCAGCCTGGGCAACAAGAGCAAAACTCC 

TGATATCACACAGCCAAGAAAACTTGCAAATGCAGCTGCCAGATTTGACATT 
ATATTTTGTTTAACCTCATTTAC^TTTCTTGTGT^ 

AGCCTGGGAAGCTTCTGTTCCTTTGAGCTAGTGACTTCGTACATCCCTCAGGCT 
TAGGCTGGAGTG(IAGTGGTGCGATCATGGCTCACTGTAGCCTCAACCTCC^ 

4 0 TACTTGGAGCTACAGGCATGCACCACCACGCCTGGCTAACTTTTGGATT 

CTGGTCTTGAGTTCCTGGGCTCAAGCAGTCCTCCTGCCTCGGCCT 
CCCACAAACAGCCATTCTTGCTCAAACTTTGCCTCCAC^ 

AGGTAAGAGGACCACTTTAGCCAAGGAGTTCAAGACCAGTCTCGGCAACATAG^ 
TGGGCATGATGCTGCGTGCCTAAAGTCCCAACTACTTGGGAGGCTGAGGTGGGAG^ 

4 5 CTGAGOVAGGCTGCTCCACTGTACTCAGCCTGGACGACAGAGTGAGAACCTGTCT 

CTCATG C CTGT AATC CT AG CACTTTGGAAGCCG AGG CGGGTGG ATGACTTGAGG T CAGGAGTTTG AAATCAG TCTGG C CAACATG A 
TGAAACCCTCTCTCTACTAAAAATACAAAAATTAGCTGGGTGTAGTGGTG 

AGGAGAATCG CTTGAACCCGGG AC^CGGAGGTTGCAGTG AGCCGAGATCGTG CCACTGCACTCCAGCCTGGGTG ACAGAGCAAGAC 
TCTGTCTCAAAAAAAAGAGAAAAAAAGAAAAAAGAAAAGAAAATTAAAATTAAATTAATT 

5 0 AAACAAGCTTCACCTCCTCCAGGAAGTGTGCCTTCATCTCCCT 

CTGCGTTTACTCCAAGCTGATGGCGCTGGGTTGTCATAGGTGTGCCTAGCAACGTCACGAAGGTGT 

T AAACTT C CT CG TGG CAG ATTTT ACATG CAC CTGGAGG CTTGAGGATG GGAAAATGAAAC CAT CC CTGT AAACCT CATG CAATTAA 
TCAGAG AACAG AGAGGAG AAATGGAAACAAGCCAGGCTTGCAG CACACCCAGCG CTG ATCATGAGGTCAGCTGCCCCCTGATCTG C 
TTCCTCTTAGTTGTCTGGTGCCTGCTGTTCTAGAATCACGTAGACCTGGTTAG 

5 5 AGCAACTTGAACGTTATGAAACATTAAATGTTACTTTTGAGAAATTCCTTCAGGTCCTGCGTG 

AGCCTGAAGGACCCCACCAGGAGCTAACTCCCCAAAAATGTCTTTT^ 
ACCCCAATTTTCTAGCCTCTCACCCTCCATAATCCCCTTAAAAGCCCCATCCCAGAATTC 

CAGCAGGCIATACAAATCTGCTGTTCCTGTAACAAAAAG 

6 0 TGCTAGAGGGGTTCCAGCAGGCTC^AGTACAATGGCGTGATCTCGGCTCACTGCA 

OSCCTCAGCCTCCCGAGTATCTTGAATTACAGGTGCATGCCTCC^ 

ACCATGTTGGCCAAGCTGGTCTTGAACTCCTGACCTGGTGATCCACCCGCCTCGGCCTCCCAAAGTGCT 

CTACCGCG CCTGGCC CrX " l"l i TTTGTTTTG T ' lTlV lT lTTr GAGATGGAGTCT CAGTGG CGC 

GAT CTGGG CTCACTGCAAGCTCCGCCTTCCC-GGTTCATGCC ATT CTCCA 

6 5 GCCACCACACCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGG 

GGTGACCC^CCCAACCTCGTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCATGCCCTTCCTCC 
TCTTTCTTCCAGCACCTCTAAGATCAAGGCCAGGGAGACCTCCTTC 
CCTGTGAGGCTACAATTCGCCTCTCAGTTCCTGCATTGCAGAGC^^ 
ATGTATTCAACTAGACATTTACTGAGCTCAGACTGAGTGCCAGGCATTGAGGATACA^ 

7 0 ATAATGATAATTAAAATAGCAAGCATTTATTGCACACTGATTATCTTCTAGGCACTGCGCT 

CGGGTCTCACTCTGTCGTCCZAGGCrGGAGTCCAGTGGCGCGATCTTAGCTCA 
ACTGCCTCAGCCTGCTGAGTAGCTGGGATTACAGGCACACACCA 
TCACCATTTTAGCTAGGCTGGTCTCGAACTCCTGATCTCGTGATCCACCCAC 
AGCTACCGTGCCCGGCCCCAACCTTCTTTTCTTTTTCTTTTTCGAG 
7 5 TGACCTO^TGATCCrrCCACCTCAGCCTT-rCAAAACA 
T CCCACTG T AT AGACGAGTAAACAAACATGCAGACT AAG CTTGTCT AAGAG CACACAG CT AACT AGGGGG CAG AACCAAG CTTT AG

ACCTGAGAAGCCCTGTTCCAGAGCCTACTCCTTTAACCATTAACTAAACT 

GTAAAGCAAACAAATAAAAAAGATACTG<3CCAG<5TGGCTCArcCCTGTAATC^ 

CTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACCTGGTGAAAC^ 
5 CACAAGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTG^ 

CATACCACTGCACTCCAGCCTGGTGACAGAGTGAGACT 

TCCTCTATTCCCCAAGCTGGCCTTGAACTCCCAGCTTC^^ 

GAGCTACCGCACTAGGCCAATCTTTGCITAATTTAAGATTATTT^ 

CCrGTTrTCATTCAAGTATCCrrcTCTTCACTTTCATTTCCT 
1 0 GCTAAATTCAATTCCGGCTCAGGCAAGGGAGAAACACAAGC^ 

CTCATTCTGTTGCTGAGGCTGGAGTGCAGTGGCT 

CTCAGCCTCCraAGTAGCTGGGATTACAGGCACTTGCCAC<^ 

ATGTTGGCCAGGCrGGTCTTGAACTCCTGACCTCAGGTGATCCACCCACTTTC 

AGCCACCACGCTCGGCCTCATTGTGTCTTCTGCCT 
1 5 AGACAGGGTCTCCCTCTCTCACCGGGCACTGTAJGCATGTGTCTTTC^ 

AAGATACAATTATTATCATCCTTATTTTACA^ 

AGTGGTGGGTCAGCGGATTGAACTGAGGGCAACTGGCTCCAGACCCT 

GAGATTACATTACTGGGGACCCTGTTTCAGGATATTAAGGGGTTTTCCAA 

GCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCAGAT 
2 0 GGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGGCATGGTGGCAGATGTCTAT 

CCAGGAGAATTGCTTCAACCTGGGAGGAGGAGGTTGCAG^ 

AGACTCCATTTCAAAAAAAAAAAAGAAGACAAGCAAGATGTTCAGTC 

GTGCTGATATCCCTTCCCCCTCCTTCCTCACGTCCTC 

CAGGCCCAGGTOAAGCCCTCGATCAGGTGAGGCCATGATCCCAACGAGTGTTTTAGC^ 

2 5 CGAGAAGCGGGACCTCCTCTATGGAAAATATCCAGCTGGGTGAGCCT 

CTCTTCCCGTCTCATGACTGTGTTTACTGGGCTGGATTTTGGGAAGG 
AGACTGTCK3TCrGAG<3AGGAGACACAGCCTTATAAGCrcAGG^ 
TTAGGAGAAGAAGAGAGGCAGGGAAGACAAGCCAGGCACGAT^ 
GCCCGGAGGACGAGGACTCCAGCCTGGATGAATCTGACCT 

3 0 TCCACGGTCCTTCCTCCTCTCCCGAAATCAGGACCCACCCCTCnTGTTTCCTCT 

TTCTTTTTTTTTGAGAGAGTCACTCTGTCACCCAGGCTGGAGTGCA 
CGAACTCCTGGGCTCAAGTGATCCrCCTGCTTCAGCCTCCTGAGTGG 
GGGTCTCACTATCTTTCCCAGGCTGATCCCAAATTCCTGGGCr 
AGGAGTGAGCCACCCTGCCCAGCCCACTCACCCTTTTCTAGCCAACCTGTTC 

3 5 ATCCCAGGAGTGCTATGTTACTCAGCCTGGGACCTCACA(^ 

CCTGCTCTTCCCATCCCCAC!AACCCTGAACCCCTGCCTCT 

CTCCTGCCTTGGTTACAGGAGGTGGAGGCCGGAAAGGTCGCACCAAGAGAGAAG^ 

GGaCACGAGAGGAAACTGGTGACCAAGCTGCAGAATTOVGAGAG^ 

GGAGTCGGGGGGCCCAGAGAGCTCAAGGTGGTGCTTCTGCCATGAAGGACAGGCCGG^ 

4 0 TCAGGCAACTGTTGTCTTGATGCAGAATAAAACGAGACATATGTTTGATTGTC 

AGAACCTTTCCAATTGAAGGGAAATTTGACTTACAGAGAC^ 

CAAGTGAGTGGGAGAGGCAGAATTAGAACCCAGATCTCTGTCCCTTTTACCACCTGCTTTTCCTC^ 
CATGGACACTACACCCAGCAATAGAGACGGGACTGCGGAGGAAGGAGGA 

ACCC CT AGGACTT ATT CC CGCTG ACTGAGTCT CTG AGGGG CT AC CAGG AAAGCG CCT CCAAC CCT AGCAAAAGTG CAAG ATGGGG A 

4 5 GTGAGAGGCTGGGAATGGAGGGGCAGAGCCAG<1AAGATCCCCCAGAAAAGAAAGCT 

CAGCAACAATAAATAGACACGCACGGCAGCCACAGCTTGGGTC 

AGTGTTCCTGCTCTTGCAGGGTCTACGCTGTGAATGCAGCTT^ 
TCrGTGGGCTCCTACGTAGTGGAAAGGAATTTCTTCTGCT'rT^ 

5 0 TTTGTTGATTtSTGTCCTGTTCATAGACGGG AACG CT AC^ATG AGTG CCATCT AAAAGTCAGAGAAAACTGAGATTTAAAATATT A 

AAAGCO^GCCGGGGGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGG^ 
AAGACCAGCCTGACCAACATGGTGAAACCCCATCTCCACTAAAAATACCAAAAAT^ 
CCAGCTGCTCAGGAGGCTGAGGCAGTAGAATCGCITGAACCCAGGAGGTAG 
CAGCCTGGGTGACAGAGGGAGACTCTGTCTC^AAACAAAO^ 

5 5 TCACAGCAGTTTAGGAGGCCGAAGTGGGAGGATTACTTGAGCCTAGGAGTTTC 

TCTACAATTTTTTTTTTGAGACAGAGTCTCACTCTGTTGCT 

CCTCCCGGGTTCAAGCAATTCTCGTGCCTACGCCCCCTGAGTAGCTGGGATTACAGCT 

TTTTTTTGTTTTTTTGATACAGAGTCTCACTCTGTTGCTCAGGCTGGA 

TCCTGGGTTCAAGCrATTCTCCTGCTT^GCCTCCTG^ 

6 0 TTTTTAGTAGAAACAGAGTTTCACCATGTTGGCCAGGATC^TCTCAATCTCTTGACCTCA^ 

GTGCTGGGATTACAGGCGTCAGCCACCAC^CCTGGCCAATTTTTGTAT^ 
GGTCTCAAACTCCTGGCCTCAAATGATCTGCCCACCTCAGCCrTC 
GGAGTTCGAGACCAGCCTAGGCAACTYTACCAAC^CC 
TAGTCCTAGTTGCTCAGAGGCTGAAGTTGGAGGATCCCTTG 

6 5 GCAGCCTGAGCAATAGAGTGAAACCATGTATTGAAAAAGAAAGAAAGAAAGAAA 

AG GAAGAAAGAAAAGAAAGAG AG AAAGG AG AG AAAAAG AAG AAAGAG AG AG AAAG AAAG AAG AAAGAAAGAAAGAAAGAAAG AAAG 
AAAGAAAGAAAGAAAGAAAGGAAGGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAG 
CCGGCACAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCC^ 
GCAGAAGTTGTCrGTGAGCCAAGATCACGCCATTGCACrCCAGCCTC^ 
70 GAGAGAAAGAAAGAGAGAAGGAGGGAAAGAAAGAGGAAG<3AAAGGAGGGAGGGAAGG^ 

TTAAGGGATTTGCCCTAGCTCACTCAGGGCTCATATTCAG^GATCTTATTGTCTTCAGATATGA 
GCTGTGGGACAGGGTTCTGGGAAGTGAGATGGGGGAGGGAGATTCTTATTC^ 
ATGGC^CAAACAGGAAGTGTGGTTGGAGCTGAGATTATTTC 
njG^ATAaAAAGTTACTCCTCTCTTAC^ 

7 5 TTTTACTCACTGTTCTGCCACACCCAAAACATGGCTT 






WO 03/008583 PCT/US01/51291



ATCTCAGCCTGTGGAAAGAGTGTGGAAGAGACAGAG 
ACATCTGCTCCGAATGTGGCTACATGGCCACACCCAACT^ 

AGCCAAAACTTGGG CAACAGCAGGGTnT ATGATG AAAGGG AAC CAGAG AGAG TGAATACACG AAGAAG CT AGCAAGAAACATTG C 
CCACATGCAGACTCAACAGATGTCGGGGGAGAAAACTCT 
5 CACATAAGCCCCACACTGCAGAACTTGGAGTCAGTGKjTTCATGGG^ACCCAAGGA 

AGCCACAGAATGAATCCATGACCCGGAAGGACATTTAAGG^ 
TGTTTGAATACCACTATGGAGGGGGAACTTGCT^ 

TCTGTCGCCCAGGCTG<JAGTGCAGTGGTGTGATCTCX5ACCCACTGCAACCT 
1 0 CCTCCCTACTAGCTGGGACrAOtfXSTGTGGGCCA 

GCCCAGGCTGGTCTCAAACTCCCGACCTCAGGTGAT 

CACCCX3GCAAAGGTTGCCTAATTCrATCTCAGGGCAAGTCTG^ 

GCTTCTGTTTrrCTGGTCCTTTCTGTCT 

TTCTCITCCTCCCrcATGGCrrGCGTGCAG 
1 5 CCCAGGAGTTCTAGCCCAGCCTAGCAACACAGCAAGACCCTCTCTCT 

CTGTGGGCCCACCTACTCAGGAGGCrGAGGTGAGAGGATCT 

CAGCCTGGGCGTCAGAGTGAGACCTGTCTCAAAAA 

CTCCACCCCAGGCCGACCCTCCTCTGGGATCACACl^ 

TGCCTCCAACTCCTCTTTCCATACAAGTTACATAT^ 
2 0 qaggCCTGTGCTGCTGAGAGGGACAAATAGAATCCT 

AGCAAATGGCAGGAGCAACACCAGGCAGTCGGCCCTCAGCCC^ 

TCTGCTGTGTTTGGAGCTCCTATTCTGTGAGAAGTGT^ 

CTATTAATGATCCCCTTTTTTTCAGACCG<3AAAACTGAGGTTCGGAGACA 

GTAGGACTTGGAATTATGAACTCAGGTCTATCTGAATTTTAAAGCCTGGATTTTTCC 

2 5 GTATTGACAATGAAGAATCCTAAACTCTCGTTAATT^ 

AGACTGGCCTCTCTTCTGTCAAAAGAGAAAATGAAGGCTG^ 
CCTTTGACACCAGCCAGGAGTCCTCACACACTGCCC 
CAGCCTCrGCCTGAGAGGCCTTTAAGTGGCTGTTCTG^ 
CCTACCCAGACTCTTCCTCTTCTGTCTCCAT^ 

3 0 GGGGGGATCCCCCTGAGAACTAGAGAAATCACTGACCTCTGTCCTACCCCAGGGGCTTACAGAA 

CCACCTTCCCCTTTCTCAAAGGCCAGCTCACAG&TAC^ 

AGACAGGGTTTCAGTCTATCACCCAGGCTGGGGTGGAGTGGTGCAATCACTGCAG 
TCGTAGAGATGAAGTCTCGCTGCTATGATGCCAAGGCTGGTTTCAAACTCCTGTCCTCAAGTG^ 

3 5 AAGTGGTGGGATTACAGGCATGAGCCACCGTGCCCAGCCCCAAAATTGATTTTT^ 

CAGTAATAGTTCCAGCTACCACTATTGATAATCTACCTTAG^ 
TCTGCTGTCTATACATTCAAACTATACTATATATATTAT^ 

GCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCACCTGAGA 
GGGGAAACCCCATCTCTACTAAAAATACAAAAATCAGCCA^ 

4 0 GCAGGAGAATTGCTTAAACTCGGCAGGCAGAGGTTGCAGT 

ACTCTGTCTCAAAAAAAGAAAAAAAAACTACCTCCTTATCCT 
CCAACAATGGAAAACCATGGTCATATGTCATAATTAGAAAT^ 
TTTG<K3AGGCTGAG<;AGGGTGGATTATCTGAGGTCAGGAGTTTCACACCA^ 
AACACAAAAAAATTTAGCTGGACATGGTGGCAGGCAGCTGTAATC^ 

4 5 TGGGAGGCAGAGGTTGCAGTGAGCGGAGATCGTGCTACTGCA^ 

AAATAAATAAAATAAATTAACTGTAAAGGCTGAGCATGGTC^ 
ATTACTTGAGGTCAGGAGTTTGAGACCAGCATGGTG 

CCCCTOTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTTCCTGAACC^ 
CCATTGCACTCCAGCCTGGGTGACAAGAGCGGAACTCCATCT 

5 0 GTAAAAAGAAATACCGAAGTTAGCCAGTCTGGTGGTGTGTGCCT 

GAGCCCAGGAGTTTGAGGCTGCAGTGAGCTGTGATTGCACCACTGCACTCTAGCCT 
AAAAAAAAAAAAGGAACACTTAACTAAAACTTTTTTTAA 
TTGGGAGGCTGAGGCAGGCGGATCACCTGAGTTCGGGAGTTTC 
ATAOVAAATTAGCTGGGCXjTGGTGGTGOITGCCTGTAATCCCAGCTACTTGGGAGG 

5 5 GGCAGAGGTTGCAGTGAGCTGACTCCAGCCTGGGCGACTAGA 

GCAGCTCACACCTGTAATCCCAGCACTTTGGGAGGTCGAGGTGGGTG^ 
CATGGTGAATTCCOTCTCTACTAAAAATACAAGAATTAGGTGGGCATTGTG^ 
GAAGGCTGAGGCAGGAGAATCGCITGAACCTGGGAGGCGG 
CAGAGCAAGACTCCGTCTCy^AAAAAAAAGAAGAAGAAGGGTAGGGCG 

6 0 GAGGTGGGTAGATCACCTGAGGTOVGGAGTTCGAGACCAGCCTGGCCAACATGGTTA^ 

AAACAAAAATTAGTTGAGTTGTGGTGGCACGAGC 

AGACAGAGGTTGCAGTGAGCCGAGATCGCACCACTGCACTCCAGTTGGGTGACAGAGCTGAGACT 
AGAAGAAGAAGAAGAAAAAAAATTATGGGAGATCATTGTTTTGGGTC 

CAAAATGTAGTTACTCATGCTAAATGTCACATAATCAAACTGAAACTTTAAGGAAGGAGATA 

6 5 TGAAAACAGGAGATTCCAGTCAACCTGAGTO^TGTAATATTGGTCCCCT^ 

CCTCATGTTAACCAATCAG CTTT TTTCCTATTGT TCTGT r T CCTTGGCCAGTC 

TGTACCAATT CATG AATTG CAAAT AAAAG C CAATTAGATCTAT AT CCAAAATT GTTGTCATTTTGTCTTTTG ACACGGTGAAAGG T 

GAATAGGAGAAACTCCACCTTGGTTTGTATATAAAAATATTATCCAGGATGGTGGCT 

CO^GGCGGGCAGATCACCTGAGGTCAGGAGTTCAAGACCAGCTTC^ 

7 0 ATTAGCCAGGCTTGGTGGCTCGTGCCTATAATCACAGCTACTAGGGAGGCTG^ 

GTTGCACTGAGCCGAGAACACTCCACTGCACTCCACTGAGCCACTGCACTC 
ATCTAGCCACAGCCATTATTATCACACTGGATATTTACCAAAGCCTGCACTAG 
TTGGAGGCTGAGCCAAGAGCAAACTAAGTGTGCTGATACCCATGCCCCT 
GGGGAGCAAGATTAGAGCACCCAGTCCCTGAGTGCCCTGCTATCAGCGGC 
7 5 AGCATCCAGGGCTCTGTCAACTATTAGAGAAACATTTTATTCTTTT^ a * * * CTGAi. a ACAAAAC 
AAGTTATTGCTGTGGAATCXJCAAGATAATCCCCATCCTCTGGTCT
AACACTTACTTTTATTTTATGTTTTATTCTAGAG^ 

CACTGAAGCTTTGACCTCCAGGGTCTCAAGTGATCCTCCCACCTCAGCCTCCTGAG 
CCCTAATTTTTCTATATACACACACACA^ 
5 TAGTGTGTGTGTGTCTGTGTGTGTGTGTGTGTGTAGAGATGGG 

CCTGGGCTCAAGTGATCCTCCTGCCCTGGCCTTCCAAAGTGCTGGGAT^ 

TT AAATT AT AAAATT AATGTTTCTTTGTO^CACTTCA^ CATGGAATTCTCAT AT CAGCTTTGT AT CCTGTTC 
TCITTTCTGGAAACTAGTTCGGAAAGTT^CA T^^ I TT T T AGATGGAGTCTTGCTCTGTTGCCCAGGATGGAGTGCAGT 

GGCGTGATCTCAGCTCACTGCAACCCCTGTCTCCCAGX3TTCAAGCAATTCT 
1 0 TGTGTGCCRCCZATACCCAGCTAAATTTTGTATTTTT'AGTAGAGACGGGGT^ 

CCTCAGGTGATCTGCCTGCCTCAGCCTCCCAAAGTGGGATTACAGGCATGAGCCACCGCGCCT 
ACTGTTTGGGGAAAT AT CATTT ATT C C CTCXIACAAAT ACTTT CTGAGT ACCT ACCAT ATG 

HUMAN SEQUENCE - mRNA
1 5 CAGCCTTATAAGCTGAGGGAGTCSGAGAGGCCCGGGGCCAGGAAA^ 
AGACAAGCCAGGCACGATGGCCACCTTCCC^CCAGCAACCAGCGCC 
TGGATGAATCTGACCTCTATAGCCTGGCCCATTCCTACCTCX^AGGTGGAGGCCGGAAA 

AACAC CAAC CG CCCCAGC CCTCGCGGG CACG AG AGGAAACTGGTG AC CAAG CTG C AGAATT CAGAG AGGAAGAAGCG AGGGG CACG 
GCGCTGAGACAGAGCTGGAJSATGAGGCCAGACCATGGACACTACACCC^ 
2 0 GACAGGATCCAGGCCGGCTTGCCACACCCCCCACCCCTA 

CCT CC AAC C CT AGCAAAAGTG CAAGATGGGGAGTG AG AGG CTGGGAATGG AGGGG CAGAGC CAGGAAG ATC CCC CAG AAAAG AAAG 

CTACAGAAGAAACTGGGGCTCCTCCAGGGTGGCAGCAACAATAAATAGACA 

TTGTTAAAAAAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING

ATGGCCACCTTCCCACCAGC^CCAGCGCCCCCCAGCAGCCCCCAG^ 

CT ATAGCCTGG CCC^TTCCT ACCTCGGAGGTGG AGGCCGGAAAGGTCGCACCAAGAGAG A CTGCCAACACCAACCGCCCCA 
GCCCTGGCG^GCACGAGAGGAAACTGGTGACCAAGCTGCAGAATTCAGAG^ 
Table 48

MOUSE NOMENCLATURE 
ICSGNM Zfhx1b
Celera mCG8151 


HUMAN NOMENCLATURE 
HGNC ZFHX1B 
Celera hCG19701 

MOUSE SEQUENCE - GENOMIC

CATAGCTTTGGCCAAACAGCATAGCTTTGGTCCTGAGGGT 

CAACCCAAAGTAGAAACAGAGCGCCTACTCAGAGTACTAGCAGTTTTAAAA 

CCACCCTTTTATGTACCrrrAAAAACATTTAGCCACTTTATTC 

GTCCTTTTGAATTTAGAAAACGTTGGCAATGTGGGAAAGAAAGGCATT 
1 5 TCTGAATAAAACIAATTTTATGTGATTTTGCAGGTGGTTTGAGCTAT 

TCATATAATACTGACCTAATTATGCATAGAAAAAGATAAGCCATTTACITG^ 

AAAGGCTCCTATTTTCTGAAAGAAATTGTTCGCTTTTATTGCAGAAAA 

TATT C CAAAGG TTTTGTGTGT ATT CGGTTAGTT ACAGT AATT AT AC C CAATAACTGACAG C CAAAAT CACGTAGGAGAAAG CAAAG 
GCTGTTTTCTC^CACGGGCCCTGCTGCAGAACATCTGCT 
2 0 GAAATT AT CAAATAAATTGAATTTTCT AATTTT AAC CTTTCTT^ CACCAGGCCG TC 

TCACCTGGGCAATGAGCAAAGGCTClTTTCrCCCCCTTTGCTCC^ 
CTTTTTTGCCTT'CTGTGCrTGGTAGGGTCrCCGTC 

ATACAGGGGTGAGGAAAGGTGCTACCTCAGAAC^CITGAGCAAACCTTAAGTTGTTTTT 

TTC TTCTTT CTT T CTTTCTTTCTI rCTT T CI rTCTTTCTTTCTT TC l ICCTTTCTrrCl'TTClTlVlTTTCCTCCCAGCCTGTTGT 

2 5 TTAGGCCTGAGCAAATAAATGGGAGTTTAATTCAATTAGAGCCTGGCTTTAGTCAG 

TCCCCTCACATTGTTTGGTTTACACATCAGTGlACTCCCTTC 
GTCTGAGATGTCACTGTCCCTTTCACCGTTGTCrGGTrCTGTCAATT^ 

GAAGGG T ACTGCT ATGTT AG CCGAGGTGGCTT'CACAGTAAATT ATTT^ T CT CACAGACCTGAAC CTAAT CCTCTGGG TAGG 

GAGATGAAGTTTGGAAGTCATCCTTG^GTATTAGACGTAA 

3 0 GAGCTCAACTCACTGGCATGATCTTGTTTTAGGTG ' rr T CTTTGGG^ 

GATTAAGTTTGTAATGACCCATGGTTTTAAGCCATGGTGATC^^ 
CAC^IACTCCTTTCTGTTGAGCCACACCTCTCCAAAGACAGATGACTCTGTA 
ATTTATTAAGGTGATCCACACCCCTGGCTGCCTTACAGGGGATTATTTTCTTCT 
GGGTTAGACACACCXSCTGCAGCCTCACAGAAOVTTACTTAGACCCAAGAATTAT^ 

3 5 ACTCACCAAATAGGTCTGTCCX3ACTGACACATACTTGAGAACAGTTTG 

CAGAAAATCTTTTGCTGGGAAAACATTTCTTTAATATGTTTTTAC^ 

GTGTGTGTGTGTGTGTGTGTGTGATTAATTTGGTGGGATGAAATGGCTAAGTGTATATCAGTATCT 
ATCACAGATGAGGTTCACGGGTXKJGGCAAATCTGTAACAATTGGAGAAAAATATTAAT 

TACAAAACATTC CAAAG T ATC CTGAAAGTT ACTGAC CTT ACAC CTTTT ATGG ACTTTGAAAG TTGTGT AACCTTTTCAGGGG AAGG 

4 0 TGACAGACACAATATCTAATTACACIAAGGACATACTTCCCATCTTGTGATACTTGCTTTAT^ 

TGACTATAAAACTT AAAG CTAGACAAGGTCTGCC CT CATTT CTG ATAAAAG AAAAAC CC CAAAACAATTCTGATGG C CCAGAGCAT 

ATTATTTTCTACTTACAGGTCCTAAATAAATTACAGTTCTTTT 

TCTTATGACTTTGCTCGGGCCGGTTrACATATAGCAACT 

TTTTGCT CTTTGG ACATG CTG TTGGT AG ACAT AAATT CATTTGGT ATT AAT AAGAGTAAT AAGGCT CTG AAGAACTG ATG ACG AAT 

4 5 AGATACACAATGTATATAAAATGGATTTGTAGAGCAATTGTTTTGAGGTT^ 

CAGGTACTTTXSGGACACATTGTTTA^ 

GCTGAAAAACATTCTGAAGGATGGAAATAAAAACTACAAAGACTTGCTGGATAAA 

CAATGACTACTTTCCTTTGGCAG<^TTGAGCAAGGTGATCACAATC^ 

TTTAAAAAAGAAAAGAAAAGAAAAGAAACAGATGTCAAAGCTTC 

5 0 TAAAGTGTCTTGTATTAAAAGATAACTGACTTCACTATTACACATTCrTTGACTAATATCT 

CTTGCTTTTGTT CACATCTGCCATCACCTTTTAATGTTTCCTATG TTAAG CAAGAAG TTT AGAGGGG AAATCAG AT AC CT AAGATG 
CCACTACCATGTGCAAGTTTAATCAGAACCTCTATAACrrTTATC^AAATTTTCT^ 
AACAGTAGTGATTCCAGTTGGATGAGAAGCTGCCAGTCTCCCTAAGATGCT 
CGCCCACACrAATTGTGGCAAGGCAGAAATTCCTCTCTTGTTTCATTTAATATGC^ 

5 5 ATTTTATCTCTACATTCAAAACTGCTTTTGTT^ 

GAATGTTCCTGGGATAAAAATTAAAGTGACCAAGCTAGTTTATAA^ 
TGTGTTGTACTTAATAGTGCATTCCATTTGGATGTTTTCAGTTGGCATAGTCTCT 
GTGATAGATAAATCTCCTCCTGTATTAAAATTTAGAAGTTTAAGTTCATC 
CTCTGTTTGAATCTGTCTTTGCCTTGTAATGCTCACTAGACCAGATTATAAAATGAGATTGG^ 

6 0 TGGCCCACCCCAAAGCAGAGAGAGTTAACAAATTAACAATGATGAAGGAAAGATGTTCGTGAT^ 

AGAGT CACTGGG AAGG AAAAATCC CTGAAAAT AAAGG AGCCAC AT C CTGT CCTG AGT AG CACAT AGT CAGAAAG CAAGATGCTT AG 
GGGTXKSGATAGGATAACACCTCACCAGAGTTCAGATGGAAAGTCTAAGGAAGTCCTACC^ 

G G AAG CAAG CCAGGG AC CT CTGCTT ACAG CT C CAC C CAAAT T AAATG AC CT C CAAGCTCT AAAACAAT AAGGAAAGCAAATGACTC 
AAAATTCCAAAGTCCTTCTTTAAACTCAAGGCTGTATCATCAGTGC-G^ACA 

6 5 CAGTAGCAAACAAC^GAGCACTCTTGTCTGACTCTGCCTTGAAGCTACCTTT^ 

TAGTGGTTGGTAGATGGCTGCACCTCCTTATAGCTTCTCCACATTGCAAGCAAAG^ 
ACAGCAGTC<5TTCTAACAGTTGCATCACCCTGCATTT'CCATATTAGTGT 
CTAGACTCCCGTGACAATCTCGGAAGCAGGGAACCCATTATCATTATTT^ 
AAGGCC^CACAGCTGGTTAC^TGTTGGTGCTGGGACAAGAAGGCAA 

7 0 ATTGTTTCCAAACTACTGCACK3AAACAAGCK3TGTATTCTTAGATTCT 

AAC^X^TCCATGAAAGTAACAATCATCAAAACAATTTAAAATTAAGCATCTCCATCTTA 
AGGCATTTCCATTCTTTTACTTGTGTAGTTTAAATTT 
GCTATTATTTTTGTGAAGATTTGCTTTCTTGCT 
TCTGCCTGT^TTACCTTCTTTGTTACAAGTCACTAGCTGACCT^ 
75 AACTTGACTGTAGACTTCGTAAAGTGATCAGTGC^CCTCCTTGTATTTTCCTAGG^ 




TGATGATGGCATTGAAGATCCTATTGGTACACTT^ 

TGTCTATAATGTGCATATGTAATTTTCCTTTACATCAATAAAACTTCTGCTCCTC 
TTCTTTCGAATGTTAATACATTAGCTACTGATACCTTATTTGTACCGTAACTGACA 
GAAGGGGTTAGTTAGGTAACATTGTATAGGGCATCTTCAAAACAATAGTCCTGCTC 
5 TTT AAATGCT AT AATAGGCCATTTTTG CAAATGTTGAAATTTT CCAGCAGGT AT AGACATT ATAAAAATGTCAC^ CT ACTG 
TTCTTTTACATGTCACAGGAATACAAAACTCTGAAAATAAAAGT^ 
TATGACTATAAAATTAATTGTCACGTT^TACAATTTAGAGTACTG 
GGCAAGGTGAAAATAATCCTTATTATAGATCAACTATGCAACCTTT^ 
TCCGAGACTGTTTGCTATAAAGCAAATTTAAGCATCATTTTATAC^ 
1 0 TGAAACTTTAATATGATTCTACATAAAATAGTAAGCCATGCCA 

ATCAATTATCATGTAAACATGGTATCACATGGCAAACCTCCCTATAATACTGTACAGTACCT 

CCAATG CGACAGTTTCACAGAATTCT AATTAATTCCTCTGAA^ CTG TTTTTGG ACATT 

TATTACTGCTCTTTAGTTGGCTTTTTGCTGTGCCTCAGAGA 

TT C CTTCTTGTT CATTG AATTG CAGAGAGGAATT CTTTGGACT C CCACTTCTTTGTG TCTTTTGAAG T AATTGTTTTGTGCT AG CA 
1 5 TGTAGCATCAAAACATCTAGAAAATTGTTTTGTTTAT 

AAAGTCTATGGAAGGATCTACATGCTTAAGTTTTTAAAATAGTTTGCA^ 

GCCCAGTGTCCTTCATTCAAAACCTGGACTAGTTATTTCA 

AGTTCAAGATCAACCTTGTCTACATAGTAAACTCCAGGCCAGCCAAGGCTA 

GTT AG AAAAT AT ATTGTT AGTT ATT AT AT AAAATTAAAGCT CTT ATTT AT A^ CA 
2 0 TATACTAGCCTTTCTATAGGAAAAAAAGTTTCGTTTTATTTTGTTTTGT^ 
TTGATGAAAATTGTTCCCAAACITTCTCTGAGAAAAAAAAATCTAACT^ 
TGGAGAGCCTGGGTGCAATCAGTGACTTTAATTGTAATC^^ 
TTCTAAAGTAAGAACTAAGTTTGCTATGAATTACTrTACCTGGATTAACCAAAA 
CATXSACTATCCTACTTTCCCCAGCCCTGGAAAAGATGGCGTACTGC^ 

2 5 AAAGTATTTACTAGTATTTGTGAGTGGAAACCCTAAAGGTTG 

ACCCTATATGGTGAGACATGACTTCATTGCTTGTTTAGATTGTTCT 
TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 
CGTCACATGGGATATTGTAGCCCATGCAGAAAAGGACCTTAAA^ 
CCAGCCTTAGATAATTTTATTCTCTCCTTACTGGACCTTrCTTTAT^ 
30 NNNNNNNNNNNNNNNNNNNNNNNHN^^ 

CTCGTTGCTAAGAAATGGTTAAATCTCAGTCACCAGGGGGAAAAAAGAGCGGC^ 

ACACAG CT AGAGCAGAGACGTTGG AATATCTCrCTTTTTGTAGATCTGAGAGCATrc CCCAGCCTGAACTTGCCTAAGCCCTTGTG 

GTCACACCTCTTGAAAAGGTTTTGTTTCCCCCGTCAATTGGTAT^ 

GTTTTC^TTTTAATTTGCACCTGTTGTGCAG 

3 5 GATATGCACGGTGTAAT CrriU ' l ^TCCCCAGCCrrCIACIATGACAAAAAAAAATCTCT 

ACCTGTCAAAGCAAACAGAT ATGACTGACTAAAATTTGTAACTAGACATGTTT CCAGAATTCG CCAAAAGCATTATGAACCTGACA 
CACAATCCITTTCATTTTTTTTCCCTCCTTTT^ 
TGATAACTTAGCGCAGGATTGCTTTATTAGGCTCAGAGAAATCAAAATGAACCTGAAAAT^
CCCCCCTCCCCGTCTCTCTCTCCCTCrCTCTTTCTCTCTCCCTCTTGTTCCCAT^ 

4 0 TGTGTGTGTGTGTGTGTTTCTCXIAGAGTTGGGTTAACIAGTT 

TGAGCTAATGCCCCGTGTTTATTACTCTAATCrrTTCTTGTCrcGTG^ 
CCTGTGAGGGCTGACTAATTCAGAAACAGCCACCITCAATTGAGTTCACAGA 
ACTGAGCGTTGTAGGGACTGCAGGCrGCAIIAAGCAGCCCGCCCGCTC^ 
TCACTTTGAAATCTTCTGTAATTATGACACTATC 

4 5 TTTCTGCTTAAAAGATGAGGACATAAATGACCTTAGCTCIATGACGGTTGAAATAGAAGA 

TTTGTGTATCTGTCrGTGTATTGGGCAAATAGATGGAGAGAAAGAGAGAGAGA 

AG AG AGAACTG TTTCT AGGAACAAAACT ACAG TT AAATT ACTGAAGTCAT CTTGT ATTTATAGT ACAAC CTT AT AT AG T AAACACG 

TCATGTCACTGAGCTAATAAAACTGTCAACrGTAGATCTA^ 

CTTATTATGTGTGCAATATAAGCAGAATTTAAGCATGTGTACCTCTGT^ 

5 0 AAAATAAAAAGAGTACACAGCTATAATAGAAATAAGAATACAAATG 

TG AGTAATT CAGTTACGAAAAT ATT AATT ATTTT CTATTCACCTGAATATGTGGGGG CACTGGTATTTTCAAAGCAGCATTTTGAA 
TGTGCCTGTGCATCTCTTCCCACAGTGGTGAACTATGACAACXSTAGTGGACGCAGGCTCGGAGACAGATC 

CATTGCTG AAG ATG AT AG CCTTGCAAAC CCTCTGG ACCAGG ACACCAG CCC^ CAT GC C CAACCATGAGT CCT CC CCACACA 

TG AG CCAAGGG CTG CT AC CAAG AG AGG AAGAAGAGG AGG AG CTAAGGGAGAGTGTTGTGG AG CACAG CTGG CACAGTGG CGAGATT 

5 5 CTGCAAGCCTCTCTAGCCGGTCCAGGTAAGTGTCTGTCTGTCTGCCACTGTGTGGCCCCTTCOT 

GCTTCCCAGCAGAACACCTGCAGCCACATCATTCAAATAGTTCITTCCATCGCCCT^ 
TGATGAAGTGATAGCJUITGTGCTCACAT 

TATCAACTCTAGTGTAACAAACACAAAACATGAAGTACTTGTACATTTTCAATG 

6 0 GCGCGTGCGGGCATGCACGTGTGTTTTGTGTGGGAATTGTGT^ 

TGACACCTTCTATTGGAATTCCTTTTAATTCTATTGTA^ 

GTTTTACCTGGTCAGGTAGCITTCTTACTATTTAACAGTTAAGTACT 

AGTTAAAAAACAAAACAAAACAAAACAAAACAAAACAAAACA^ 

AACAAACAAAAAG ACCGGTCCAGTGC CAACTGTG ATAGAAGTAAAACAGATCAGTITCAATCTGAAACTTCCGATGT CACTG CACA 

6 5 AATGGAAATGCCTGAGCCTGAAATTCCCCTGGAGTGGAAGACAGGGCATGCTAACGAAAATCCAAG^ 

GTGGCCTGTGAGAAGTTTTTTAAAAGTCCCCTCTAATTCAGACTTTGT 
CCGTTATGTAGCCCACCTTGGTTTTGCXSTTGACAATATAAATAACTACrc 

AGAAGCAGAGCAATTGTTTGCTGTGTGCCAAGGATAATAA^ 

7 0 GTGTGTGTGCrCGCGAGTGTGTATGTATGTGOSTGCGTGTGCGTGTGCGTGTGCGTC 

TTGACTGTGGTGTATTTGTTTCTCATTTCCCTAATACACCGTGCCAAGTTTGGTAT^ 
AGGTTTCTCGAATCTCAGCAATCCCACACATAAACATTTATCGTTACATCCATTAAGACT^ 
GGAGAACTCAGCCAAAGATGCTTGAAAGATAAATACGTTTAATTAGGGGCAGA 
AAGTCATTAACAATCTTAAATGAAATTTTTATTTTTATGATAGCCATTTAC^ 
7 5 GTGACAGAAACAGAGTGTGGCGAGAACTCA 
TAGATATGGAGGGCTATAAGAACAAATGTTCGGATGCTTTAAAT^

TTT CACC CTT AAAGGAAT AATT AAT AACAAT AT AAT AAATG T ACAG CCT CAAACTT CTTTTT AAAAAGTTT AATT AAAAAAAGAAA 

GAAAGAGAGAGAGAAAGAAAGAGAGAGAGAAAGAAAGAGAGAGAGAAAGAGAGAGGGGGAAAGAAAGGAAGAAAGAAAGAAAGATA 

GAAAGAAAGAAAGAAAGAAAGAAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGAAAGAAAGAAAGAA^ 
5 AAGAAAGAAAGAAAGAAAGAAAGAAGTCfcATAGTCGAGAAACAG^

AGTTTT T TTGTGTGTTGGTCATGG ATCATAATCTCAAGCATCTCAATGT ACAATGATCCA 

CACCTGGAGTTAATGGAGACAGTGATGTAAATGCTCTTACCTTTATAGTAAATAAACAAG 

AAATGTAAATCCTAATCTCTTTAGGACAAGAAAGTCACCCAGCTATTACCTG 

TCTTTTTTATTATTGTTCTAAAAGCGCTCTGATGTGGTC 
1 0 TTT AT C CTG ACTTGT AG AT AACACA'l"l"rrr C TT CTGAT AAAAT ATTTTT C"iT ATTT ATTGTT CAAAACACACACTAT AG ACAAT AA 

CTCTAATTCTGGTGTGTTTTATTTAGGAGTGTTTTCT 

TGAGATAAGCTCCTCTCCTTTGCTCTTTCTCCCCTCGCT 

AGAGTTCTTACTGTGCCTTTCCGGTCCTAGCACAATCAAAGTTTTGTGT^ 

GGTGGACACAGTGTGCTGGAAACCATGACATCATTATCATGGACAC^^ 
1 5 TATTCAAGGATTCTTCATCCTACAGAAGGAAGTG'l 1' CTl' 1 ATTTTT ACAAATGCAC CCTTT AGC CCCAAGGGTGACTGGAAAG TGC 

AAACTTTTACAAGTGTTTGAAAAATAAATATACTATGTGGTTTTAGACACCGTAATGCTG 

AGCATGAGGAAGGCGGCCAGTGAGGCTTGCCAGGAATTGAGGGGAGGCATTCCTAAAGTGCT 

ATTCACATGGACCCTAACACCTG AGAAGG CTCAGTGACTCATCTAACTAGAATG AG CTTAATATTCCAAAACAAAAATGGCCGGGC 
TCAGCAAGTCAGCTGGGGTAAAGACATTGCAGGTAAAAGTGGCACGAACATCTCTAAAC^ 
2 0 TATAGCTAGACAATGCACAAAGGGCCTACTATTCGGCTGTGCAAATGACATTTATGAATTTGG^ 

CTCTT ACCTTTTT AT AAGAAAATGACTATGAGG AAG GATGTCCAG CAGAAAT AG C CAAT ACT ACCACCATT ATTT AAAGAAGTAAA 
ATATAAATAAATAAAAATAGCTGTTAACCGTTTTCCCTTCGGTGACTCAGTGTTCTCT 
TCTTGGTTACCGGAAGTCATGTGAGAGAGAGGTGTGAGG&CGAGAGGCCCAGAGGCT 
TACCTATTTTGTTTCTTTTGGGGACCAATAAAAATACCAAAAAGAGCTTCCC 

2 5 TCCTGTGTATTGAACTTCCAAATGTTTCATGCAAAATATCCCTGTGAAAA 

TTAATTTGAAGTGGGTGCACAGTCTAAATAAATATTATTAAAATTATGT 
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTC 
AATGTGCTCTTTCAGCTTTGATTCTATATACAGATAGATTCAATTTTGCT 
AGCAGTGTATTATGTGATAAACAAAAAAATGATGAGGATACATCTC 

3 0 AGGATT ACAGT ACAGAAATCCTCG CCTCGGATCATGGTCTGCTCTCATT AGT ATTGTACTGT ACAG CTACAGTTTCTGAGAGTCCA 

TGTTATGTTTGTTTATTTGAAATATTTTTAACT 

AAAATTCCCATTTTTCTCCCTTATTTGAGAGTATTACGATCT 

TACATATTTTCATCTrAATTAAAACATTGAGCTACTGTTTTC 

CATAAATGCTCAAAACAAAAAGCTATTTATAGAAATGGGGAGGTATGAGAAATAGACTTTAGA 

3 5 TTATTITTAATACATCTGAGATGTTAGTTTGTAAAATCTAAGTATTCCAAGTT 

TTT CTTTTTCG AG ACAGGGTTTCT CTGTG T AG CC CTGG CTG T C CT AG AACTT ACTCT AT AGACCAGGCTGGC CTCGAACT CAGAAA 
TCTGCCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGCGTACACTACCACT 
CCAGCCTATACTAACTCCATCATAGCTATACCAAGCIAGAAAGAACATGA^ 
CAGAGACAGCIAGCTACTGTGTCTCTTCATAAGTTCAAATACTTCGTCA 

4 0 CCCTGTTTGTCACTAGAACTGCCACCAGAAGAAAAGCCCTTCGTTTGCTTCAAATAAGC^ 

CGCTTAAAGTAAATAAGTTCTTGTGCAGCCTCCACTGTTGAATTCA 
GAAGCCTCACTGTTTCCCTTCAAAGACrrCCTCCTTGTTCCrATGTTTGG 

TGCT AAAG ATAATTTTCCC CCACAGAAGAAATGAAGGAAGATT ATGATGCAATGGGGCCAG AAGCCACG ATC CAG ACCACAATTAA 
CAATGGTACAGGTAAGTTACGTTCACATAGCAAGATCATGTTTGTTCT 

4 5 ACATCCATTTGCACCTCCATGATTAACAAGAAGTTAGACTra^CAGGTCTA 

TTTTATTAAGCAACTTTGTCTTCCATTCTTATACCCTA^ 
TACAGGATGATTTGTATGGCTTTCTAGTTTTAGCTTTTAC^ 
ACTACTTGATATCTGTCATTGAGAATTCAAGTGATATGATGAAGTACAAATTGA^ 
ATTCCTTAAAAAATTTTATCACTTAACTTCTCTCAGATTCrCTCTTCA 

5 0 ATTTCTCAGACAGACTTTTAAACAAGAATTAAATAAAACTGATGAATAACC^ 

TGCTGTGGCTGGTTAAGGGTGGCATTCCATGTTCGCCCCCAAACAGCIATCTCT^ 
TGTGTATCCTTTTCTGTATGGCACTACCITGATGTT^ 

CCCCAATACAAAGTCATTCCCTAAAATAAGACAAGAACAAAAATCACATAACTGTCT 
CACTTTAAAATGGCTTCAGAAGTATATTTAATTAAGATGACATTCACATTT 

ACTCCTGACTTACTCATTTGAGTAATGTCAAATTGAGTACAA 

AGAGCACTGATAATTCTAATTTCTAGATGCAGCTTCAACAATACCTAGATTA 

TCAGTTTCTATATCTGTAAAATGCATATGTTGAGCTAGCTGACTGCTCAGAATGCTTCT 

ACTGTTTCTAT AAAAATAGAAACAAAAG CG AAATGATTTTGTTT AAAATATAACAATGTAC CCT AAATT AAGGACTCTC CAAAACC 

5 0 TCCCAGTTTGTGAGCTTCAAGATGGCCTCAGGTTCCCTTGAAGAAAAATGTCTC^ 

ATTTCCTTCCTGATCACTGACAGTGTGATAAAGTGACAATTTAGCATCCATTGTATAT 
AGTACTCAGGTCCAGGAGGTTTGCAAATCAATTTTGAACACCCT^ 

TTCTT AG ACTT CAT ATG CATG C AGTCAATAAAGAAAT CAAT AC7VTAGTCATTGAGTGTTTT CAAT AT AGTCC CTGTTGTGTGCATG 
TGACGGAGATGATGGCCAGGAAGACTAAGTCATAATTATTATTTAATTTGGGAGAGATC 

6 5 AGTAAGTGAAAATATGTGATAAATCAAATGGGAGGAGTCATAATTATTTGAAT^ 

TCTGTCTAGGTTTTATGAAGGAAGTTGAATTTGAACTAAGTACATTTTCAG 

AGAGAAAACATAGTATCTAAGTAGAAGATAATGGGCAAAAGATATAAATTAAAGGGAATTTATGCGAA^ 

GAGAACTGTTAGAGAATATAAAGAAAGCATAAGAAGAAAGAAGTTGGGTCATGTTGTACCACCATGATGTCT 

TATAGCCGTGTAGTCATAAAGCCATGAAGTGATTTTTAAAAAAAGTTTGTCATAGGCGGAAGTGG 

7 0 T AGG CTGGT CTTT AACTCAT AGAGATG CACTTG CCTT ATCTT CACCAGGACTGGGAT CAAAGGTGTG CGGT CAGGC CTGG CCAACC 

CAGCATTTTTACACAC<3TTCTGGGGATCAAACTGAGGTCCCTGCTTC 
ATTGACGAGACAATTTTGAAAACCTTTGATGTCCTAGAACATCCCACAT 

TTTGAGGCTCAAGTAATGAAAGCAAGTTTGGAAGG CTCCTAG AAACT ACCCGCATCTACCCAGTGAGTCT CCTCACCTCATCTTTA 
ACTAGCATCCTT AAATT AATTATAGAAGAGGAAGGTTTAATATTATTCATATATAATAATGATAAAGCT 
7 5 TATTTCATAACTGATTACCATCAGGATTCA1TT1 
ATTCATCTTAAGCAGACAGAATTTAAACCGATGCTTGAGC^
AAACATAAAATTTTATAGGGCAATGGGGAGGTCAAGTCATAATAACAGA^ 
AGCTCTGCAGGCCAGTCATCATTTATTGAGACCCTACTAAGTGT^^ 
GTAAAAGAAAAGGCACAGGGATGAGAAATATGTCCCAAATATTTAACACAACA 
5 ATAATTTTTACTGCTAGTAATGTACGTATGTGTTACATTGTTTC^ 

ATACTGCAGAGTTTCTTGACAGAGAACCGGGATCATATGTTAGAGCTAGAGCCAT 

CTTCAGGAG AACAG CTT CATGAGCATGCCTGTGTGCAAAGAG AATAATGGAAAGG CTGGACACACAAACGAGAGG AGGCTGAGGAG 
GCTCITAAAAGAGCACGATGGCAACAGTGGTAGCAGTTGTTTTTGTCT 

GAATT AACCAAATTT AAGGTGT ACCGGGGAACTCTCTCAT AT AG CTT AT ACT CTTGAAGTGGACAGGAAAAAAAAAAG AG CTTCAG 
10 TTACTAGCAAATAAGTAGGAGTATGGATGAAATCTCAAGTAGAAATGGAGAGTAATT 
AAGAAAGAATATTCTGTTTTGAAAACATAGCCTGAGATTAAG 
AGACACATAGATAGGACAGGAATACCTCTTAGTGTGCTAAGGTAAGAGTGGAGT 
AGGCAAGTGCCCAGAGCAGACGGGGGATCIGTCTG 

AGGATGGGGGATGAGAGCACXJCATGTTCCTGGTTCTGGTTCCGACAGATTG^ 
1 5 GGGACAATTAGGCACCAACTACITACrACCTGTTTGTTC 

TCTGGCTCTCCrGGAACTCACTGTCTAGAACGGACTGACCr^ 

TTAAAGGTGTACACTAACTTGCCTGGAA^ACTACTACCTTr^ 

CATTTTACTTGATCTATATAGATAGTGCACT^ 

AGGCATGGCAGAACACTTCTCTAACrCTAGCATTTGCG 
2 0 CAAGGTTCTG T CTGAAAGAAACAAACAAAATGCTTCAGAAACTTTAT TTTGTTTTT 

CTCCATAGTCCCCTCAACATGCGCAGTACTTTATGTAGTGCTGGTGGTTGATAAG 

ATGATGAATGAGCTTGTAAAAOVGGGAACAGTGATTTGGATCAGCAGTGTGA^ 

GTGTTTCAAG AAAG ATG CAAGGAGGC AGGGCTCTGAGGGAGAAG CCT ATG AGGGATGGGGG ATT AC CG AGGG CTG ACACAATTT CA 
TTTAATCCGAAGAAGTGGAGCTOTTGTAGGAGGATGTCIATTAAGCATGTGGG 

2 5 AAA03AAGCGTGAATGAAGTTCCTTTTTGGTCTCTTACT 

TAAAATAATCGTTCATAATAATGTATCTTTGCAAGATGCTTCCAAGAGCTA 

TAAGAAATACTGAAATAGGGAAAATACAGTCCTTATATATTAAATAGAAAGGTTAAGCATCT 

TTCTTTTACCCCATTATCCTGGTTTGGTTTCTGTTGCT^ 

GGCTTACAGGT CAT AGTC CATCACTG AGAGAAGCCAAGG CAAC CAAGG C AGG AG CTTGAAG CAT AAAC CAACAGAGG AATGTTGTT 
30 TGCCAGCTTGGTTTCAGGCTCATATTCAACTTGCTTTCT^

GGCTGGGCCATCCrTAACAGGAGTTAGTGATTTAGTAAATGCCCCACAAACATGCCCG 
TTCAACTGAGATTTCTTCTTCCCATGTGTGTCATGTTGAGAACCAACCAA 
TAACCATAGAAACAGAAGAAAAAGGAAAGAGGGGCAGGGCAAGGAAAGAATGAAAAGATATAA 
TCAGCAAATAAGTAAATTTTAATTTACAAAATAAATCTACAC^CATAAAGA 

3 5 ATTITAATTGGCTTTTTTAATACTAACTTTATAGTGTAATAAT^ 

TCTAGGTCCTTTTCAAATGTTATTAAATCAAGCATGATAAACAGCAA 
GATGTAGTTGAAAGAGCAATGGGCCAGGAGTCCAGAAACGTGATGTCTGGCTCCAGGTCT^ 
GAATCGTTAGCCTCACTACATCTTGGGGGCAT CAGCTTCTC AAATGAG AAAATTAGGCT AG AAATTTTGGGCT^ 
CCTTTCTCCTCATGAAATTGCCnTGGGAAACACAATTTAA 

4 0 GCCCTGCTCTCTTCTGCTCTCCCTACGCAGTGAACCTCGCCTTGTAATTGGTCCCTAAGCT 

GATCATTTTAAGAACTACAAACAAGGTACTCTACAAGTTTCATATCCCACCCCT 
TGTCATTTCACAGAACGCAAACAGCAGAGTGTAACTGTAGGATGAATACT 
TAACAGTTCAGTCCATAAAATAACAAGACC!AGTGTTCTCATAGCAAAT 
CATCrGAGCCAACAACACAGTAGTTTGACTCCTCTGACTGGG 

4 5 CCTCTTAGTTCTCCTACAATGGATCrrrAATTAAAGGTTTGAAGTTTTC^ 

TTGAACATCAGGTATCTCCCAGCATAAGATGAGAGACTGTGACGTATTAGCAAACATAC^ 
AGTTCCO^AAAGGAGGACACACACrACTCTACTATCTTGCTACnT 
AAGGCTGGGGAGATTTTGTCATTCAGCCATACAATTCATACAATTCAGCCATACA 
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 

5 0 AATGACAAAAAAAAAAAAAGTTGAGAGGAGAAGGAGCGCTGGACAGGA 

CAC CTGG AAGAG TG CCTTCACTTC CTGG CTC ATT AGTG CTGGGGTTTT ATT AG CTTCATCTG CAATTAGG AG CAGCCATCT CT ACT 
GCAC AGOUVGAGTT CCCACAAGAAATAACGACCACT CCTAGCCTCTTCTTG CATGCCTCCTTCACTCATTTCAAC AAATATTTAAT 
AAGCACTGTTCTCCTTGCGATGCCGCCTGTGGTTTCCAC^CATCTCC^AAATA 
GAGAAATCTGTGTTATTTTGACTTTTTAATTGAAA 

5 5 ATTATGGCAATTAATTCAAAGTATGGATTGTGGTTACTTTAAGGATACCCAAGGAGGTACATC 

AATTACGAACGAAACCCAAAATAACTGGTCCAAATCX3GG 

GATGTGTTCTAGACTTCCCAGTAAGTACTGGAGACACATGGCCCACACCTCTGACAGT^ 
GTGCTATGAACCATTCAAGGGTTCACATGTGCAGATGGTAGC 

GGAGGCAGCCCGATGTGAAGTGATCCCGTGAAATTACTCAGAACCAAGCTGTGTGCAGTATGAGACA 

6 0 T AGAGATTCTTAAGTCAGACAAGATTTT CTCTATGG CCTGTT AAAACACAGACTC 

TTTCAAAT AAGTTCC CAAGT AT ATTG ACAC CACCGAT ATG CAAAC CAAAC CTT GAAAACCAATGACT ATCCAAGG CAAACACTGGC 
TGTTTTATGG CTTTCT ATGAAAAGAAAGTATG AACTGCTGAG CAAAGTCACTCTCCTAAAGACAGCAATGCT AATTCCAGATW3TT 
AGATTACATGGTTAAGGGAAAGATGAACACATCTTGTTTAGCTAGGGAAT^ 
ACAGCAAAGTCATTCAAAGGTTACTGTAAGCCrCTGGGTCTTGATGTTCA 

6 5 TGGCATCCTGGACTGGGCTACITCCTGCTGTGGAGAGAAATATTTG^ 

AGCATGAATCTCTCACTC^TATTTGAACATGCCCCCTCAGAAAAGGCCATATAAGAGCAGGGGTGGAGAGA 
ATTCTCTG CAAGAGtflAG AAACTTCT CTGTTGCTCrTCACGACC 
GAGAAGCTGCTGTGTGTCTGTGCCTTGAGAAAAAGATCATCCCCTTCCCCATTTT^ 
TAGCCAACTTTATACAGAGTAGTGAAAATTCATATACCTTTC 

7 0 TTGCTCTGTCTTCTTTTCATCAACTAGTATTAAAGGTC 

GCTTCTCTTCCCTTTGCCTTGACATAGCAATGTGAATAAAAGCTAGAri^ 
CCAAACTGGCTTAGTAATAATATTAAAATATATCrrCTTCAGTTTTTA^ 

ACAGAGAAGCTGCTTTCTTGTTCT AAATACAGAAAAT AAATGG CAAAAC^AAACAAGAAAAAAAG AAAAAAG AATCAG CTT CGGGG 
TCACATTATAAACTCGACX3GCCTGTAAAGTATAAGTAAAAATCA 
7 5 TATGGGAACTGACTGATTCCTGAGCCAGTCAGTGGTCCTTTTCTC 
TTTAGACTAACATTCGTTATCTACCTTAAAAAAAA^

TTAATAAGCACGACTAGTAAGACTGACTAACATATTTAAAAGAAAAGAGAGCC^ 

ACCAATATTAAAACACATATAGCCATGAGGAAACATTTTTAATTAACAT^ G ATGG AG AAAAGTTT AACT AAC CTCAG 
GGAAGGTTGAG AT ATTTG CACGACGTT AGGG TAAATTTT AGGAGT AAATGCAAAATCC C CGG AG AAG AT ATGGG AAG AAATTACTT 

GGCAGCCAGGGAGTGAAGAGGATTATAGAAGGAACAAAAATC 

TTCTTAGCAAGCTCTCCACCTCAGTCCTGTCTCCACTACACTGTACTATA 

ACAATCAAAATTTCATCCTTGCCAACAGAGGTAACCAAAATG 

GTTTTCAGAGACTGG AACATGG CGCAGTTGAGAGTAG AAAAGG AAGTTT CTGGGG CGTCATAT AT AAGGAGAGGTG AG AG TACAGT 
1 0 CACTAGAAGATGTAGCCCAGCCTCAGAACIACACAGCTGGATGTG 
AACCCTCCACTCCTAAATGGAAAATATCAGTCCCTTCACTCTC 
CCGGATGCTCATTTTGGAACTAAGTCCTAAAATCATAGGGAAGGGATCCA 
TTAATGATTAGGCTTGTTTCAAGTGCAGGTATGCAACTGGGAAAATGGGAGGG^ 

TGT AATG CTTGG T AGTGTGG ACATGAAG AG CT AG AT CT AGGCAAAGG AAAATTCT AAAAAAAAAAAAAAAAAAAAACAAAGAGAAA 
1 5 CTTACATTGCTTTTCATGGTGGATATGATGGCTCIATTTA^ 

AG CAG AT CCGCG CCCACATG AATGATGGAAAG CC CT AATG AAG CGAATAGACTTAAATG ATT AAATGACATTTC CT CATG AAGC CC 
CITTTGGATACTCACTGAAGCCATGTCTAACTTGT 

GAACTGACAATTGAACAAAACTCTAAATCCCT CGCTTAAC CATCAGATAACCTTGTriTGlTTTGATGGAAACATAGTGTTAAAGG 
CACATTATTTTCC!ACATCCACCTCCTCAATGTTTCTTCCAA 
2 0 GTGACCACAGATGATCCTTAJACAGAACAT^ 

GG CGAAGTCTGATCTAGTAATCAAATTCCCTTGCCAAAGAC^ 
CAGAATTTTTCTCCCTTTTAAGCCCAAAGACTATTAATGCAAATCCAATTC 
CATATCCAAATGAAAATTCATGCAAAGATAGCCAGATAATTCAAATAGTT'G 
ACATCCATCCACTTTTACrGGGTTAC^CTAAGGTCATCAAGTGGGACAATATTCT 

2 5 CT AATAGGTATGTGTTCAGAAGGGAGCTGTAACATTACTTTCAGTTGTC^ 

GT CATATTGTTTT AAAACATTCACTTT AG CTGGGTC^ CTGGAGCCAG AGTT AGGCGG 

ACCTCTGAGTTCAAGGCCAGCCCAGTCrATAGAGTGAGTTCCAGGCCAGCCAGGACTACACAGAGA 

AAACCAAAACTAAATAAAACCATTTATTTTAC!ATTTTC 

TXKjTAAGACTATTTTAATGAGCCCCAAAACGGGCAAA 

3 0 AGCAATGG AGCAAGTTC AGTCATGTTATC ATTATCCTTTGAAAG CTCCTATGGCCTTGTGAACCGAGCAG CTG AGTCATGATCCTA 

ACTG ATGG CT AGT AAAT CCAACTAAGAACAGAT AT AAGCG AAAGTT CAAGGT AAA^ CAT CTAG T ACCTT ACATTCCAAA 



GTTACAACATGCAGAATTGTTACCACCTAGTTTGGTGGTCCATGAAATATAAAGAAAATTCT 
AATGTTTGTACATGATTTCAACTCTTTAGAATTCTAGTTTTCT 
3 5 ACAGATACAGAAGACACITCAGCATGCCACTCGGGCAAGCTC 
TCAATTCCAGCAACTTTTTTCCGTCTCTACC^ 

TCCCAATCCCACrGTACCCrCAGCCAACTCTGAGCAGAACCTACCCTTCCT 
TCATTAAAAAGTACAAGTGTTTTCATTCTGTAAAAGTGTCATTTT^ 



0 TCTCCTTCTCCTTCTCCITCTCCTTCTCCTrCT 

CTCCTTCTCCTCCTTCTCCTCCTTCTTTTTCITCCTCCTCTTCCTC 

AAATGAAAGACCTT AGGAGGG AAATTT CCCCT CCATGTTCC C ACTTCTGCG ATCTGTTCC C CAACATCCT ATAT ATTTTTTT CTTT 

CACTCTGCACTCCAAGTTAGATACCAAGATGTCCCTGITCCTAGTCAC 

AATTCCAAGTTTTCCTCATTAAAGTTTTCCTAATCTTTAGAGCTCA 

4 5 ACCACCACCATAATACCCAGTCAAATCGGAAGCAATCAGTTT 

' i TL T ir C 'l' I 'TCrTTC T T'ICTTC C rrrC'iTT CT T'l'C'i 1 TCTT T C T C 1 C TCT CTCTTTCT (JTCr'l'TCTTTCTTTCTCTCTCTCTTTCT 

TTCTT TC TTTCTTTCTTTCTT T CTTCCCCTA^ 

CTT T CTTT CTT^CTTTCTTTCTTTCTTT CT Tr CTTTCT^ 

CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGGTGGTTTCCTGAAGGA 

5 0 GCTGTTT AGGCTTTCAATCATC^CATGTTAAACAAGTG CACAAG TTT 

ATAATAGTGTGAACCAAAGGAAAAATGGGGAAGAAAAAGAAAATTC^ 

GG AT ATTTCTT AT AGG AAACCCTGCATG CTT CTTG TAG C AAAGAAAAT ATGG T AAAG ATCT AGTGG CATCAGAGGGCACTG CTTTC 

AAGCCACAGTAGCGTGAGTCGGCCAGGCACTGTGCAAGGCCCTTTCCCCAGCACC(^TAAAGAGGAACTAACAGTTA 

TCCTACATTATGTATAATCTTTTATTTGGGATATGTAAGCCATAACAAATTTCAGAGATG 

5 5 TCTTGTGACACAGATTGTTGGAATTGGTCCACGACAATTATTTCA 

TCATGCTGCTCTTTTATCAGTGGGGCTCCATTACTTTATAT^ 

ATTCTGAGCAAGGATCAACATTGCAGTGAGAGAAAGAGAGAGAATGTGAGTTGTTAGCCAAGTCAC^ 

GTGTCTCTCCCTCCATT'CACATCTGTCTCAGACTCTGCAGACTTTACTCTCT 

AGATGCATCGGTAGAGAGGGAGGTGAAGTTTAGTCrGGCTTCTCAAGAAAATTAAATT^ 

6 0 TATTCAAGGGCITGTGGAAGGGCTGGAGAGGCCTACACTATCT 

T AAATGCACTTTT CTT AAGACAAAAAAGAAGTTT AAT AT AGAC CATT AAAGTATCTG CAATG TT ATGTTAGG CCAT G TTT AGG AAA 

TTTTTTTTAATTCGATTTTAACATTTTTTAAC^ 

TAAAAACCAAATTTGCCTACTTAAAAAAAGTAAGCTCTTTTACCT 

CrCirCTGACTAATGAGCTACITTAAGTCTTACTAGTA^ 

6 5 AG G AAGAAG AC C CTGAAAAG ACCTTTCT CTCTTG CTT ATGATTT CAGTT AAG AATG CAAATTG CACAT CAGACTTTG AGGAAT ACT 

TTGCCAAAAGAAAACTGGAGGAAAJGAGATGGCCACGC 

CCAGAAGCCCCTGAGGAGCTGTCTCGCCTTGGCACGCCAGAGGCC^CGGGCAAGAAGAAAATGG 

GCCGCGATACTGACCTCTGCAGCTGTTAGAATTACAAATGCACT^ 

GAGGCATGGAATTTCATGAGCAAAGATTAGAACTGTAATTTTAGGGTATTGTGAAATTT^ 

7 0 G T ACATCCAAT ACAGAAT AAGAACAATT ACAG CCCATTT CT AAG AG AT AAAAAT AT CT AAATGT ATTC CAAT CTTTCAG AG AT ATT 

TTACTTTTGAGGAAAAAAAAAAGTTTAAGATCACCGACCCTCACCTT^ 
CTGTGAATTCCTCAGTTACATGTGAGTCTAGGGTTCAGGGGGACTGTC^^ 
AAATTCCCCCTCCCCACAAATAAGCTCAAGTAATTAATTAATTTTCGAGCGCAGT^ 
GTTCAAGGGGTCTGGTTGTGAGGAGGCCGGGGCGCAATGATGGCGCTGTGC^ 
75 CGGGCAGCCAGCCCATTAGCTGCCATTGATTTACra 




CTGGAACTCCAGATGCTTTTGCCCAACTGCT^ 
ATCAAGTACCGCCACGAGAAGAATGAAGAGAA^ 
GCATATC4GTGACXJCACAAGCCAGGGACAGATCAGGTGGGGGTTGGTC 

AT ATTGCTTTGATTGG CAAT(^TT ATT AATTTTTCATGGCATTTTG AAG ATG CAG AT CTTTT AATGGT AAT AG CTTT AATTGAGGT 
5 CT AAATGATTTTTTGATGGTCTCTT CT AATATG ATTT AAT AGTC^ CGAATGAGTGTGTTAATTTCT AATTTAGA 

AATTTTATTTGCACTTTAACTTGACTTTAGTTTC^ 

TATTAAACATCTCAGCAATTTTAATTCCATTriTCACCrAGTTT^ n 1 1 ATAT AGGACCT AATACAT AG CAA 

AAGAAGTGCATATTTCTCTCATTTGTCACGAAACACTTTTAAAT^ 

TTTT AAGGAG AT AGAAAGG CCATAT AAG TTAT ATG CATAT ACACAT AAT AAATTGGAAAAGTGATAAATGATAAAAAACATTGGTT 
1 0 ATAATTAAAATGACTCCATCTCAATATACATGCTTGTAGTTAAATGCCAAAATTGTTCTGCCT 
ATAAGAACTCTGGCTTGGAAAGGTAGGAATAGGGTCTTAAATCCTTGAAATC 
GTTTCAGCAAAACTGCTCTTTCTATCCTCCTGGGAGAATTGACTGTG 

GTGTTT CAT AAGT C^CAAT AACTGGTCCTACAG AT ACAG AATTGC CC CT AAATGAACTTTGAT ATT AT ATATTTTTGAG ATGGTT A 
AATATGACATTCTCTATAAAACTCCGATTTGTCCT 
1 5 G C AAGAATGG CAATACTGTGTTACAAATTTTTACTATTTAAGG CACTAT ATCACATT AT CACACCATTTGTTTCTGATTGGACTTG 

TGAGTTGAATGTTTTCCAAAGATCAGCTATAAGTAATAOT 

TG CACAAAAAAACTCT AGGAAAAAAAAAAAGC CT AC^T ATTTATCT AGT AT AACTGC^ CACATG CAGAAG TG TGATTG 

TGTTGGCGCATTATGAATGGTAGCTTTGGTGTTGAAAAGTCCCCTCATTTGCT 

GTATCATATTAACAGTGGCACAACrAAAGTTTATAGTAGTCCTCTGAGGGCrc 



20 




TGACTGTTGGGTTGTTGTTTITTTTTTTCCCC 
TTGGATGGTATCTGTGCTGCCTCrAACTTGTCTTTTCAGCAAAL"!'! 



2 5 ATGAAATCTCATTGAAGTCTTTCCTACTGACTTGTGATAAAGCTGTT 

ATGAATCAAATCTGATAGGCAGCACCTAAGAAAAACACAGTT^ 
ACCAAATATTTATAGTCAAAGGGTTT TriTrr TT'l T ' 
ATTrCAGTTTGATTTAGTTTACACTTTTACCTGGAAA 
CAATTXTATAATTTTCAGGAAAGTATTTTTITCC^ 

3 0 CTAAACCAAAAGGAGCCATGAGCATCTCTACAATCATACTTTCT 

G GST AATTGTGAAG CAAGGG TCCG ACG TG CACGGG AAG CACITTCIT AAAAG AG AAAAATTC C CATTTT ATTACCAGCATT ACCAC 
TITTATTGCAACTCTGAAAGTCTGACCAGATAAATAAGCATGA^ 
ATTCCTAAGCAAGCTCCTTTGTGCCTCCTGAACAAAGATGACGGCCATTTT^ 
CACX3TAACCGCAAGTTCAAGTGCACGGAGTGTGGCAAGGCCT^ 

3 5 GGTGAGTCACAGGGGGGTCGCTGCATTGCATTCTTTCCTACTAGAGAAGAT^ 

GAGTGTTGTGGGCTTACCCCTTTATTTCATA^ 
CAATTCACAAGGGCTTTGATTTGTGTTTTCTGCTAAATCrAGA^ 
CTGCAATTTCAGAAAGATATTCATTTTTTTTTTG 
TGCTAACCATTTCTTGACT CCTGATAGCT ATACATCAT AAA^ 

4 0 GCTCTGCCAAGCATTTTATTACCAAATTGGATGCTATTCAAAGAACT 

ATT AAAG CCATGG CT ACACTTCCCTTTAAAT ACTGTG ACATCTTT ATTGTG CT ATTGACCAC^ CT GTTT AA 

ATAGGAGGCTGCTGATGGCCTGACAAC^TTTGGGATTCTAAC^ 

GAATTATGTCCACTGGGGGTAGATAAAATCCAGAGATAGTTATGGTCATATAACTCTGArc 
CTACATTCGACACCTAGATTAATTTCATTTGGTTTCACAGATTC^ 
4 5 TCTAATGCTAATTTTAATTATCCTTTCAGGTGAAAAACCT^ 



TGAGCAGACAGGCTTACTTAAGATTAAAACAGAACCA.CTAGACTTCAATG 
GCAGCAGTCCCTTTATGAACGGTGGGCTTGGAGCCACC^^ 
5 0 GGTGTAGGGATGGAAGCCCCTTTACTTGGATTTCCCACTATC 

CiAATACGGTTTCTAGGCAAAAGATGGACTGCAAGACGGAAGACJVTT^ 

AGCCAGAAGAACAAGGGGTAACTTCTCCCAATATTCCCCCTGTCGGTCTTC 

ATTGACTATACCTTAGAGAAAGTCAATGAAGCCAAAGCITGCCT 

AAAGAAAGAGAAGTTGCGTACTTTGATAGATTTGGTCACTGATGATAAAATGATTGAGAACCACAGCATATC 

5 5 GCCAGTTCTGTAAAGAAAGCTTC CCGGG CCCTATTCCCCTGCATCAGCATGAACGATACCTGTGTAAGAGAATGAAGAGATCAAGG 

CAGNNNNNNNNNNNNNNNNNNNNATGAAAACATAGTCC 

CTGTACTTTCCXJAGAAAGGACTGACAAGCCCCA^ 

ATGGAGCCCAACTCnPGATGAACTGCTGAAAATCTC 

AAAAGTCTACCAGTATTCGAATTCCAGGTCACCATCACTGGAAAGGACCTCCAAGCC^ 

6 0 AAGACTCITTGTTACCCAGGTCTCCTGTAAAACCTATGGACT 

AGTTGTGATCCTCCTCTCAGGCTAACAAAATCTTCCCATTrCACC 
TACTCCTTCTCCTTTAAATCTTTCCTCCACATCTTCT 

AGCTGCAGGCTGAGCCGTTGGACCTGTCATTACCAAAACAAATGAGAGAACCCAA^ 
GCTACTAGCATAAACITAGACCACAACAGTGTTTCTTCATC^ 

6 5 AGAGTTTTCAAATTCTAATAACCreGACAATAAAAGCjyvCAA CATGAACCCATTTAGTG CCAAGCCTTTATACA 

CCCCTCTTCCACCACAC4AGCXSCATTTCCCCCTGCCACTT^ 

GG ACTGG ATCAGATG AGCTT CCT AC CG CAT ATGGCCT AT ACCT ACC CAACGGGAGCAG CT ACCTTTGCTGAT ATGCAG CAAAGGAG 
GAAATACCAGAGGAAACAAGGATTTCAGGTAATGAGATGAGT 

TCCCAGGCTGCCCAGTCCTAAATAAATTGATAAATGACCTAGAAGATACAAAAGGTGGGGGAAAAACACTCAGT^ 

7 0 ACCAGGAATATGAGAGCTTTGAAGCAAGCTAGCTAAGCCAGTGG 

TAGTTTAAGGTTAGGCAGCATAGCAGGATCCGGTCTCCAAAACAAACA 

TAGTAGTTGGTCAGCTTACCGACATGCTTGAAACTGAAG 

TACATATTTAAATTTTGATATCTATTTTGTTTTATAAAGAAGAGTTCATTTTC 

T AGG ACTT CTTGCCT AATTTTCAAATCC TCA^^ A GG TGG T AGAATCTGAAAG CAGGCATTGTTTTGGCATT ATTTTG T 

7 5 ATAATCCCTGTTTGCJUUUVTAGG AGCCAAAGTGGCAGTG CACGGCTGTAACAGCTGTCCAGTATTCCAATTTACAACACATCTGC C 
CTCTTTGTAAATGCATTTAAGCCTATTTAATCTT CTCT AGT ATTAATT AGTG CAAAT AT AGTGTAAGAAAACTAAACTGTGGGCT A
GCAAATGGAAATGTATGTTACCATGCAAGTTTGCACTCC^ 
ATCCACATTCATTATTGCTCAATGCTGAAGGCrcTTTTAACA 
AAATTAGAAAATTAAATATAAAAATATAGCITAAjGG Ci 'TTC 
5 TCTAC CT AAATAT AAAATTCAATT AAAATG AAG CTTCTT ACAGCT ACCAAT CATTTTTGAAGT AAAGTACACTGGAAAAT ATGGT AA 

TCCTGATATTTAATGTTCACTCTTGGGTTTTCTCTGGTATTAAAAA 
AGAGAAGGTTTATGTTTTGTTGAAAAGTGAAACTTCCATC 
GGCATCTAGCATTTTTGAGGATCTCTAGTTGTCAGCCTATGCTTAATT 
TATGTGATTATATCAOU3GGAGACTTGCTGGATGGA 
1 0 T CTGT CTCGAAAGAAGAT AAAGAAG ACAGAAAGTGG CATGT ATGCATG TGACTT ATGTGACAAGAC ATT C CAGAAAAG CAG TTCCC 

TTCTGCGACATAAATACGAACACACAGGTATGAGCGACTTTGACTAGCTAGCT 

TTTAATGAGTG CACTTGTGCGACAT AAGATGT AG CAT ATGCTGACACCTT CAACAAG AAAGAAGT AATACTTTTGCAT AACT CAGT 
AGGACrTTTGTGTTGCCTGTTTATTGGATC^ 

AATTTGCCTATCATGTCAAAGGAAAATTTAAAAGCCCAGGGTAATGAAATGGCATGACAAGAGCT 
1 5 ATTCTTAAGCTGTGGTGAA.TATATATATGTGTAAAAATTATTATGATAAATCTG 
CTAAGTAAGC^TGCTTTGGGGTGACrri'C'l'l'lt^lTTTCATAA^ 

AGAAAGGCAAGAGACCAAATAAGCCAGTGAATCCTATAACTGAAAGACTAAAGTAATATATCCA^ 
ACCAACCCTGACAAGACAACAGCCCCAGGGCTCCCTGGGACTACAGGTGGTATG 
ACTTTCCACAACTCCGGATCAGATCTGCAGCTCTGTAGTAGGGTTTCCATCA^ 
2 0 ACAAAGAAACCTAGTCTTACCCTGTCCCTAATCCCTGTCCAAGTCTTCT 

AAAGTATAGAGTCATATTCAAAATATTGTAAATGGGAGCAGGATGGACAAAGTGTAGGGAA^ 

AGTT AG CATTG ATGCATGTGGTGTGGAAGAGAAGAGGAGGTGTT AGATCTT AACTCT CTGCAGAT AGGGT ACATCTT AAAG AT ACC 

rTGGGAAACTCGGGTACTCCAAATCTTCCCAGATCCTrCACTAGCCATGGGTTGGAGG 

CGCTGGGCGAAAGTGGCCCATGCTGGACCGCTAGTGCTGAGAGCACTTTC 

2 5 TCCAAC CAGTT CAG CTCCTTTC C CTCCAACCAT AAT ATCTGTGG TTATT ATTTTAAAAGGCCAATT AATG TT AAAACACCC CAAAT 

TGGTTTGCAAATCTACACAATTTTTATACTTAAGCrATATATTTGAAACA 
CTTTAATACTGAGTCTTATTTAAGTATTTACAGAGTAACAAATATTTTCATGCCT 
ATATTTTGTATTTCATrATTCAGCTACAATAGGCACATC!ACT 
TGCAAACTCTATCATTTACTACATGGTGAGTGTAAAACTTTT^ 

3 0 TTTATCTCCTATAAATAAGCTAGTCCATAAACTTCATAGATGTCCAGCTAGTTGAGAGAC 

ATATATATTTATTTTTTTTAATAAAAGCACGTTATTTTAAAAGTAAA^ 

AGAGTTTC CAG TCTGTT AAC'JT T'l'TC^TCT CITG ACCA^CATCT CRTG ACATTAATATTGT CATTGT ATT AGG TT CCATCCAAG C 
AAAAGTCTTCATGAAATGTGTCAGTATGATGTATATGTAAACTTAGCTTGCTTTCT^ 

3 5 TATTAATATCTAATAATATTTTAAATCAGTTAATATGAATATTCTAAGA 

GATCTGTCATTTC-GTTATTATATGTTGATTATAAAT 

AAGAAGAAAATTAGTGTCTTATTTATTCCTGGGAGTTTGTGCTCITTTGCCGTAATCT 
AAAGTCATACTGCTTAATTCTCTTACCAATGGCTTATTGCTTGCTT 

TAGTACAGAGAAAGAAATACTTTGTAAATGTTAACTTCAATTGGAATAATTGTCATAAATAATGACTGAAATA 

4 0 TCTTC^TAGGCACTGTCAAGGACTCATTTTTACTCCACTAAAATTTCAACCTGATAT 

GTGGTAGGTGCTTTCAAACCACCTTATTTTGAAAAGAAATAAAGAT^ 
ATTTCTGGTCAATGTTGTTCCTAAGTTATATGGCAAAATTAAAATTA 
GAAAGTCCTCAAGATTACTAGACTATTTAGTATAAGAGTTGTG 

ACAGTACTTTATCCTCCAAAGTTGTCTGTGATATATAATGTAGCCCTATTACAAGTACA 

4 5 AAGCTCGCTGCCTGTCCITATAGGACCCACCTCCCATGCATTCTAAAAGATCT 

CCAGATGCTGCTCTCATTCTACAATTTTGGAAAAATAAAACCTGTATTCATATA 
ACCTTTGATCCTGAGCTGATAAC^CAGCAGCATAAAAAAGATATCACGGGATAGTGTTGTTC^ 
ACAAGAAAATTTTCATGTCTAATTTTTCTCCATTCAC^TTTACrTCTT 
GAATATAGTAGATGGTTGAACCTGTATTAATGGACAAGAGTTGAGAGATTTAACCCTAA 

5 0 CTAAACACTTTATAAATCCTGTCTCCATTATTCTATCTATGGCCT 

AAGACCCTGAACCTAGAAGCAGTTGACATACCCAAGACTGTACGTACTCCAGCCTGTCCT 
AGCTCATCACCCTTATTGAGCACCATTCTGCCCCCATGGAACAATTGCrAT 
TTAAATGTGCACTTCCAAACAAGGAAGACCAACACTATACTGTAGCAGCTTGA 
ACTCCTAAGTTCTTTTCATATACTCTTGTGACTCCTATTGTTACCTACCCAAATGAAAA 

5 5 CC^CTCTACCAGAATATTCTGTTGTCCAGTTTCCCTCATAGA 

AATTAAACITGTTATCTTCAGCCTATTAGCTAACrTGAAGTAGGTTAATATAATTCT 

CAATTT ATGTTTGAATT CTT ATCTTACAGAT AAAAAGCAAAAC^ C C CCT AT CA 

CTCACCACTAATTTATATCGGATGTTCCAGTAAAATGACAACCATC^^ 

T AGTGG CAATG CCCTG G AGG CATT CAAT AACATT ATT AAAG AT ATTTGATT ACT AATTGGGATTG TAT AT CACCCT AT AACTG AAA 
€ 0 ACCTGAAGTGTCCAGAATGGGTTCCTAATCTTATAACATGACACTTAGCAAATGAAA^ 
TTTAATTTAATTTGCTGAAACAGGAGTTTGTAGGGGAAGTAG^ 
CGCCACACATTGATTTTTAACTCTGGCGCATCAGGCTGATCTA 

ATGACAG CTC CAAAAGGTGTTGAAGGCATTCCAGTG CT ATTTGTGG CCTGG AATCAACCAG ATGTGTTT AGG CTTGG CAGTGGTTT 
CAAGTTCCAAACACTGGTTAGAAACTGCCTGGAATGGGAAAGGAAGCGCTAGGTTCT 

6 5 AGTGGCTX3GCATCCCATGGAAAACTCTAGCGGGGAGGGGGGGGGGGGGAAACAC 

ATCTTTCT AG TTTG CATTTCAG ATGGTTT CTTCCCT AAGTTTT ACTCTT AGAATC CAATG ACTGATCAGACACT CTT AG CATCTTC 
TTGCCTCCAGTCTGTT CTTTCIAT AACGTAGCAGTTTCCTCCTGTTTGAGACTAGG CCCACTTTCAAAG CAGCTAG CTCATGATGGA 
CCATACAAAACCAGTG ATGATACCATTTCTATTG CTCTCACTCAAGTT CTTGCCTTAAAACAAAACAAAACAAAACAAAACAAGAC 
AAAACAAAACAAAACAAAACAC CG T AAGG ATGAG AAG AATTG T CTGGCGG T AAAATGTTGT AGG C CACT AAAAATGTCTTGTTT CC 

7 0 GAGGACGTTTTTCCGTTTGAGACAGTGACAAAGCCTCCAT 

TAGTAATCGGTAGAAAACCAGATTATGAGATGAAAATGGACCTTCCTAACCGACTTG 
A(ZATAAGCAGACATGCTCATC^CTG^ 

GTCGTCTTGTTCTTTCTCCCAGGAAAGAGACCACACCAGTGTC^GATTTGTA^ 
GCACTCGAGGCTGCACTCCCCCGAGAA^CCCTATCAGTGTGACAAATGTGGCAA^ 
7 5 ACATGAATCACAGGTACTCCTACTGCAAGCGGGAGGCGGAGGAGC^ 
GAACCCACCGAGCTGCTGATGAACCGGGCTTACCTGCAGAGCATCAC^
GCCC^GGGATGGCGAGAGCGAGAAGGAGCACGAGAAGGAGGGCC^GGAGGGTT^ 

AGGAAGAGGAGGAAGAGGAAGAAAGTGAAAATAAAAGTATGGATACGGATCCCGAAACGATACGGGATGAGGAAGA^ 
CACTCGATGGACGACAGTTGAGAGGATGGGAAAATGGAAACCAAATCAGACCArc 
5 AACT ACTG CATTTTAAG CTT CCT ATTTTTTTTTTC CAGT AGTATTGTT ACCTG CTTG AAAACACTG CTGTGTT AAGCTGTTCATGC 
ACGTGCCTGACGCITCCAGGAAGCTGTAGAGAGGGACAGAAGGGGCAG 
TAAAAAAAAAAAACTGCATTATGCAAAAATTTTGTACAGTG 
AATAGCATTTATGCTTTAAGCAGAACTAGAAAATTCTA 

T AATTT AT ATT AAT ACATTTTAAAAGGTG CC CG CACT GAGTATTTT ATT ATTATTATT ATTGTTATTCCTGTTT A 
AGCCTAAGTAACCACACACTITTAGGCCT 

TTTAAGATACCGAAGTCTTAAAGTGACCTGGATGTGAAGGAAAAAGTAAGATC 
AAAGCCTTATGCAAACCTTTTAATCTGTGTCTCTGCAAGTGCCA 

CCAGCTTCAGTGTT AAGCT CAC CX3TG TTCTTTG AAGCACC CGTG TCAG TATT AGACGACT AGG C AGCAGTTC CTT AGTTT ACATAT 
15 GTTTGTGCAATTATTTTCTGTACTTTTTTGTTCAT^ 
TTTTGCATTCATTTAATTTTAGGTCAA 

AGTGTACAGTTATTATAGTTCTTCAATATATAGATATATTTTAGTAAAAAAGGAA^ 

AAAG AG AAGAG CATTTATTG TGTTTTGGAACATT AATTGTGAGATGGG ATTTT^ TTTT 
TCCAATTACTGGAGATTCCAAATTTGGAAACTTTTGATACAATCTTCT 
2 0 CATCTTGTTTTTTAGCTAACCTCAAGAGGGACTGTT 

AAAAGTATTATTGAATTTTCAATTTGTAATTTCTTTTGAAAATGACCATGCT 
AATGAGTTCTGTACTTTTAGAGAGCTTTCCTTCAATGACCATTAACATGTAAC^ 
TTCAATATAATTTTAGTTTAGCAATAATTTTGACTGGTACCAA 
AGCTTTTCTCAACTGATCCTGATTCCAGATTGTGTATTTTTTAT 

2 5 CATTTCTACTTTTGCATTACAAGTATTTAGAGATTATGTGTTAAAAACTCACTTTTC 

AAAAGTCT AATTTCAAAAAG ACAG CT ATT ATTCAATGTTATTT ATAAT ATG TAACTTTTTT TCT AAGAG TTGGGATAATTTATCTC 
ACTTTTTGAAATG CAG ACTG TAG CTT AT C CTTTATCT AAG ATT ATAAAATGGGGGGTGGGTGGG AAAACAAAG CT AAAGGCACATG 
CTAACAAAAATAACCTTCATTTTCCAAG 

GTTTTTAC!ACAATGATTTTTTTTATCAGAAACCAGATTCTAATTCCTA 

3 0 ATT AC CTT ATGAAATTT ACAG CTCTTG AAT AT ACGT AACTAT AAT AT AG T AG CTGTCCACATATTTTTTCT ACTTT AGAAT CAAAA 

AACAAAAGCATC^TTTTGCTATTGAATTTGCTAAAACTTTAAGTGTGACACCTCAAGTTG^ 
AGCCATTTTCCTAAATTTCATAAACATTGTTCCTTG 

TGCCCCAAGAGAGAGAAAAGATGTAATGACTATACAGAGGCAATTGAGCAGATACTTGTAGA 
AAGCCTGGTGAGC^CTGGCATTAGGAGAGATGGCATAAAGCATTCTGCTCCTGTCA 

3 5 GTTTACAGCATTCTTTCCCTATTTCTCCTTCTTTCA^ 

TAGG CGAT AT CTTTATT AATT CAAC CTG AAAAT ATCAAGC CATT AAATTTTGT CT 
AGCAATGTAGGTCTCTGCTGCCCATGGGGCATATCTGTGTCCCTACCCACAAGAGACAGGACC^ 
ACTCTTTCTCCCCAGAAAGAAGAAAGCATGCATGAGAAGGAGCAAAGGGG^ 
TCCACCATTGTTTTTACAAATTACTTTTCACAGGAAGGCT 

4 0 GACAATCACTACAATGTCCTATATCTTAAGGTTTCTTTCCTTTTTAAAA^ 

TGCATAATCTTCCAACTCCAGGGAAGACTAAAACATTC^CTTATCCTAC^ 
AAATCATGACACATAATCTCCITTTCTTATCGAATACTCTTCCCTTGT^ 
GGCTGGGAAAAGGTGATGAAATCCTGGAACTGAACACCCCGCCCATTTTCT 
TCTTrCTTTTTTGCACATGACACTGTAGGCTTAGTTCT^ 

4 5 TTCATCTCCCTTAGCATGAACTTCAGCTTGCAGTGGAGTGGGAGGGCCT 

TATTAATAAATTTATTAGATACTTTTCACAATCAGATAACTT^ 

ATCAAAATTTAGTCATAAATTTGAAGATATTCCCCAATAATAAGTTTATATATATACATATATATAT 
TAGCCATGTTTTTTTGCTCC^GGATGTGTGGCCATGCCAGGCT 

GC CCCCGCACATTGT AAT AAACACAG CTG CATTT ATTTGAGT ATGTGTTC C CG TG TACATGT AAAAACATCCAAACAAACACACTC 

5 0 AGCAGATTTATTTCATGTGCAATGGGGCAATTATTCAAATA 

CACTCAC^GAACTAATACTAAAAGAGGGGGAAACACCAAAGAATTTACGTGGGGGAAAATATATGTG 
TGTTATAAGGTAGCCAAAGCTATGGCTCCCTTCTTAACTGAAACACAATGAT^ 
TAGTTATTTCACTCACTTGATCATTACATCAGAAGGACTGCAGTGTT^ 
ATAAGGATGAGACATCTCTATATTTCAAAAGCAGAATAGTTCTTTCTACATACTT^ 

5 5 GCTCACATACGTAACTATAAATTTAAAATGTTATACATATAGAAAAACATTCCTACATTTTA 

TCCTGTGATC^TTTTATATATTAATAAATACTACAATATATATGTGAACACATTACAAAT^ 

AG CCTCTCTTCAG T AT ACTG^CAT AACCT AATAT ACT AAAATC^GAAGGGG CTTTT AGT CACTC CGTGAAACAAAGA 
TGAAGAAACTACATC^CTTGTGCCCATCACGAAAAAGATTCATACTGAAGGCTTAGC^ 

CTCTGCACAGTGAA CTTTTT 1 1 TTT 1 1 1 lAGAACTTGAGACATTTGTGATGTTGGCTGTTTAAATCTTTGTTACCTTC^ 

6 0 TTGAAATTGTACATATTTAGTAAATCATGCAAAC^ 

CCATGTTTAAAAAAAAAAGACCTCCTTTCCCAAAATAAAAAATGTCAATACTAAATTTAAAGAAGTATAAAGGAA 
TTTAGAGCAAATATTTAAATAAACATGGAGAGAATTGGCAACATGTTCTTTTGG 
AAGTTCCAGAGGG CCT CTGTTTCACATTCAAAG ATAAT AT ATT AAT CTCTG AATT AAAG A 
CCTCAGTCTTCATTTGTGTGTGAAGTGTAGCATTCATCAGAGACATGACATTGAGACGACATGATGCT 

6 5 AACACCTCTATTAAAACAGTCACCTACCACCTATAATCCAGTGTGACATATTAAAATACTCC^ 

CACCATAATTCCCATTTGGCCTTGAGAAGAATATGACAAAGATTCTAAT 
GAAGATTAAAGTGGTGACAAATGATCTTTTAAGTATTTCTAATTGGTCATCCTCATTCATA^ 
TCCTAAGGCCCCTGACTTGTGAGTGCGCCCCGTCAGCTTTCTTCATGCTACAGAATGTGGC 
ACCCCAATGTGTTAAAAATAGACTTCTTCTCCC^CAAGAAGTT^ 

7 0 GAGG AAGCATTC AGAGAAAGATTGTT ACCTTGTG AT CTGACCATTT C CAG AACAGTCAAAAGGCTCTCTCC CT CCNNNNNNNNNNN 

NNNNNNNNNCTCTCTCAAGTGTGTGTGTATACGTTTGTATGAGTGTGTTTGAGTC 
AATGTATATATGTATAAATGTGCATGAGTGTTTGA 

TGGGCATGTATGTATGTGTGTGTCAGTGAGCATGTGTGTATGTGTGTGAGTTAGTTTGAGAGTA 
TTTGTATGAGAGAGTGAGTATGTGGGTATGAGAGTGTGTGTAGCTTTACTAAG^ 
7 5 TTAAAAACCAATATTCAAGAGATGGAGTCTAAACAGAGGGC^ 
TT AAAATGTTGATACTGGAAAAATACAAAGTTCTTAAGAGAGCrrCATAAAAACT

GTCAACAATTAAATAGAGAAAATATTAAATTAG CTT 1 1 1 l CCCCATCACTTAGAAAACTCAGACTTCTAAAATAGGGCCT l " r<J ' l "rr 
AAACAAT AATCATTTTTTG C CCATTCTT AAAATAGTTGT ATTAT AGGG CAAAT AATT ACAGT AT ACT CCTGTC CAA'rnTGTTT AG 
AGAAAGAGAAATATAGGCTTTAAATTACITGAAAATGCTGATGAATA 
5 T AAACTCAAAGGAAG CAGGG CACTGCATTT AGTGAC CACAG CAACCTGG AT CTTGGTTTTGACATGT CTTCTGAACAAT ATT ACTT . 

CCAACAGAAACATTTCCCAAAACTACAGGGCAGCCCAAGATCC^ 
ACCACTTCCTGCCGTCTGAAACACTCTGTGGAGGTCTACCATCT 
GGTTAAAGGCAGGTCGAAAGCAAGAGGGAAAAGCAGAAGTGCAAGCCACCCrrc 

GCATGCCCAAGAGT AATCAAC CTCAGTTCTAAG CACTCACCTTC CTTTTCCTCAACATCAAGAACGTTG ACAAT AGCCAAGACAAT 
1 0 CCCCTAAAGGGGAGCTTTTGAATCACTTTAGAATAAACGGTGGTGGAATAGTC^ 
CAAGAACTACTGCTACAGGTGTACCACCTTAATTTTGAAAATGATAATTA 

TTTCCACTGG CGTTTTCATT AT AAAATCT CATGTTGTAAATTC CTT CATGACTGGGACAAAGTT ATGGCATACT ATCT ACATT AAA 

ATTG ATTT AAAAAAGTGTT CCTATGAAT AAAAGCT^ CAATT AATCTAT C 

ACCCACACGTTATAGATGAAGAAACCGAGGTCTGAA 
1 5 AGCAGCTAACTGCATAAAAAGTCTAGTCCTCTGAAAGAGAGTAACATAGCCT 

GriV l TTl'CAAAGGGCCCGAAGAAGCJUSCAAAAGAGATG^ 

CATGCAAGAGCGAAATGAAAACAATCTATGTTGGAAGCCTGTGTTACT 

CACGTAGAGTCTCCACTGACAGGCAGAGATGTTCTCAATTTTGAGAAATA 

GAAGAGACTGAGGTCTGCAAGTGCTTGAAAACACACAAG 
2 0 TGCCAAAACTCTAGCCTTCTGAAACAGTGTTCTGTACTTT 

GCTC CAAGG AGCAG AAG CAGTTGGAG ATT AACAAACCAGT AG CAGCAGAACTG ATGG AAGAATACAAAAGAAAG T CAGAG TAGAAC 

TCCTGGACACTTCCTAACCCAAGCCCTAACCTTCAGATGCTA^ 

GGACCCAGAACCAGAGGTTGGGTCTCTCATTTTTACATAGATGATGTGATTG 

AAAAAGCCATCCACTTAATGGTCTAGGGATACAGTGGTATAGAAAGCAACCACACACCCCTT^ 

2 5 AGGGAGAACAATTACAAGGTCCCAGCGGATAGGAC 

G<1GGGCTAGGATTTGGGGTGTCAGGAAGAACCTCATGGGGAGTTGAGATGAAGGATAAGA 
ACAAAGGTGTAGGAACAACAAGTATAGACACGAGGCACAAGCGAGGCTGGCTTTGTGGACT 
TAAGAGGGGCAGAGATAAAGCCAGAAGACCACAGGCTTTTGATGAACTTCTTAGGT^ 
GAGAAAATTGAATTTAGTAATCTCTTGAGTTTTGTACAGCGGTAGTCTC 
30 GAATTTTGCAGCCCTGTGTTITGGAATGGGGGGAACCTCTTTTTCTCAG^ 
CCGTCACACTGGAAAGCAGAAATTGTTTt^CCO^AAGAAAGTTT 
TGGTGGAAGCCCTTTCCCCAGTAGGCGTTTT^ 

CATAAGCAAGGGCCTCAGCCATTTCCCCrrCCTCTGAACTTCTTTTA 
CCAATAAGAAGTAAGGAACGTGCAGCAGGTGGTACCATCTGTGCCCGACTAATTCCAG 

3 5 AGTTGTGGGGAAGGGAAGAGAATCTGACAGCGGCAGAGTCAGGGGAGGGGGTGGAGGAGGGCAGCGCT 

ATCCGCAGGTGGGAGGAGGCAGGTTATTGATGCCACTGAACTATTATC 
TACAGCGCGTTCTTGCTAAGCTGGCAACTACATCCCACACCTGAGAGACAGACGTGCA 
GATGGGAGGGACAGCTTGGCAAGCGCAGCCCACCCCAAAGTGAAAAAGTAAACAAGAGGGC^G 
ATTTTCCTAGCAACCGAATGCCCTCTGTCTCCATGTTCATTCTTTGAAA^ 

4 0 CTTTCA CCCCGGGATGGCATTTTC CGAAG CAAACT CCTTGCC AT CG ATGG ATT CCTTACG AAG AACG CATCTCCCTTGG CAT ACTT 

AAAACCTGGGGTGAATAAATGCACTAGTTCAAGGATAAATGAAGTCGTCTGCTl'r 

AGAGCAT ACACAC CGTTGT AGTGT CCCAGGAAAAT ACAT CTTCTGAAAAG ATT CTCTT AAC C CACGG CAAAAATTCCACCAGTG CG 
TTTAGGTTAAGGCACCATCCAAGGATCGTACTAGTATTTGTTAAAGGCATCTAA 

GCrTAGCTTACCTTTGCTACAGCTGGGGGTTGAGCTCTAAAAGATGGGTGCATTCGTCCAAGAGGGCGTT 

4 5 AGCAGCAAGAAGCATTCATACATGAAAAGCAGTCAAAGTGGGAGAAATGACT 

AGATAGAAGTTCACCTTCCCrCAGAGTGTCTGCTGAAATACTGTGGATACCT 

AGATGTAGTCAAGATAAGCCGAGGCTACTGTAGTGCACTAATGCAATATGAC^ 

CAGA 

50 MOUSE SEQUENCE - mRNA 

CGCCGAGTCCATGCGAACTGCCATCTGATCCGCTCTTATCAATGAAGCAGC^ 
AAACAAGCCAATCCCAGGAGGAAAAACGTGGTGAACTATGACAAC^ 

TCACATTGCTGAAGATGATAGCCTTG CAAACCCTCTGGACCAGGACACCAGCCCAG CT AG CATG CCCAACCATGAGTCCTCCCCAC 
ACATGAGCCAAGGGCTGCTACCAAC^GAGGAAGAAGAGGAGGAGCTAAC^GAGAGTGT^ 

5 5 ATTCTGCAAGCCTCTGTAGCCC^TCCAGAAGAAATGAAGGAAGATTATGATGCAATGGGGCCAGAA 

TAACAATGGTACAGTTAAGAATGCAAATTGCACATCAGACTTT^ 

ACGCAGTGAGCATCGAAGAGTACCTTCAGCGAAGCGACACC^CCATTATTTACCCAGAAGCCCCT 

ACGCCAGAGXSCCAACXjGGCAAGAAGAAAATGACCTGCCACCTGGAACTCCAGATC 

CGACCGCGGCTACAAGCGCTTGACATCGCTGAAGGAGCACTVTCAAGTACCGCC^ 

6 0 TTTGTAGCTACACGTTCGCCTACCGCACCCAGCTCGAGAGGCATATGGTGACGCACAA 

ACCCAAGGAGCAGGTAACCGCAAGTTCAAGTGCACGGAGTGTGGCAAGGCCT^ 
AATTCACAGTGGTGAAAAACCTTACGAATGCCCAAACTGCAAG 

GCAAGAAATGT ATTGGTTT AATAT CAGT AAATGG CCGAATG AG AAACAAT AT CAAG ACGGGTTCTTCC C CT AATT CTGTTT CTT CT 
TCTCCTACTAACTCAGCCATTACTCAGTTAAGGAACAAGTTGGAA 

6 5 TAAGATTAAAACAGAACCACTAGACTTCAATGACTATAA^ 

ACGGTGGGCTTGGAGCCACCAGCCCTTTAGGTGTACACCCATCTC 
CCTTTACTTXK3ATTTCCCACTATGAATAGTAACTTGAGTC 
AAAGATGGACTGCAAGACGGAAGACATTTCAAAGTTGAAAGGTTATCACATGAA 
TAACrTCTCCCAATATTCCCCCTGTCGXSTCTTCCAGTAGTGAGTCATAACGGTC 

7 0 AAAGTCAATGAAGCCAAAGCTTGCCTCCAGAGCTTGACCACCC^CT 

TACTTTG^TAGATTTGGTCACTGATGATAAAATGATTGAGAACCACAGCATATCCACT 
GCTTCCCGGGCCCTATTCCCCTGCATCAGCATGJVACGATACCT^ 
GAAAACATAGTCCCCAACAAAGCTGGAGTTTTTGTTGATAATA 
AAGCCCC^TCAACCCATACAAGGACCACAT^ 
7 5 TGAAAATCTCCATTGCTCTGGGCCTTCCTCAGGAATTTGTG 
aggtcaccatcactggaaaggacctccaagccgttagctcccaaca^

tgtaaaacctatggactccatcacatcgccatctatagcagaactccacaacagtgttac^ 

ouvaatcttcccatttcaccaatattaaagcagttgataaa 

5 gtcattaccaaaacaaatgagag aac c caaaggt attat ag ccacaaagaacaaaacaaaagctact agcataaactt ag ac caca 
acagtgtttcttcatcgtc!tgagaattcagatgagcct 
gacaataaaagcaacaaccctgtgttcggcatgaacccatt^^ 
tcccccixk:cactttcatgccaccagtccagaccagcatccccgggctacg^ 

CGCATATGGCCTATACCTACCCAACGGGAGCAGCTACCTTTGCTGATATGCAG 
1 0 CAGGGAGACTTGCTGGATGGAGCACAAGACTACATGTCAGGCCT 

GATAAAGAAGACAGAAAGTGGCATGTATGCATGTGACTTATGTGAC^ 

ACGAACACACAGGAAAGAGACCACACCAGTGTCAGATTTGTAAGAAAGrc 

CTGCACTCGGGCGAGAAGCCCTATCAGTGTGACAAATGT 

CAGGTACTCCTACTGCAAGCGGGAGGCGGAGGAGCGGGAAGCAGCCGAGCGCGAG^ 
1 5 AGCTGCTGATGAACCGGGCTTACCTGCAGAGCATCACCCCTCAGGGGTACT 

GGCGAGAGCGAGAAGGAGCACGAGAAGGAGGGCGAGGAGGGTTATGGGAAGCTC 

GGAAGAGGAAGAAAGTGAAAATAAAAGTATGGATACGGATCCCGAAACGATACGGGATGAGGAAGAGACT 
ACGACAGTTCAGAGGATGGGAAAATGGAAACCAAATCAGACCACGAGGA^ 
TTTTAAGCTTCCTATTTTTTTTTTCCAGTAGTATTGTT 
20 CGCTTCCAGGAAGCTGTAGAGAGGGACAAAAAGGGGCACTTCAG 

MOUSE SEQUENCE - CODING

ATGAAGCAGCCGATCATGGCGGATGGCCCCCGGTGCAAGAGGCGCAAACAAGCCAATCC 

CAACGT AGTGG ACG CAGG CT CGG AGACAGATGAAGAGG ACIAAG CTTCACATTG CTG AAG ATGAT AG C CTTGCAAACCCTCTGGACC 

2 5 AGGACACCAGCCCAGCTAGC^TGCCCAACCATGAGTCCTCCCCACACATGAGCCAAC^ 

GAGCTAAGGGAGAGTGTTGTGGAGCACAGCTGGCACAGTGGCGAGATTCTGC^ 

AG ATT ATG ATG CAATGGGG C CAGAAG CCACG ATCCAG AC CAC AATT AAC AATGGT ACAGTT AAG AATG CAA^ 

TTGAGGAATACTTTGCCAAAAGAAAACTGGAGGAAAGAGATGGCCAC^ 

GCCATTATTTACCCAGAAGCCCCTGAGGAGCTGTCTCGCCITC^CACGCCAGAGGCCAAC^ 

3 0 TGGAACTCCAG ATG CTTTTGCCCAACTGCTGACCTC^CCCTACTGCGACCGCGG CT ACAAG CGCTTGACATCGCTGAAGGAGCACA 

TCAAGTACCGCCACGAGAAGAATGAAGAGAACTTTTCCTGCCCTCTTT 
CATATGGTGACGCACAAGCCAGGGACAGATCAGCACCAAATGCTAACCCAAGGAGCAG^ 
TGGCAAGGCCTTCAAGTACAAGCACCACCTGAAAGAACACCTGAGAAT^ 
AGAAACGCTTCTCTC!ATTCTGGGTCCTACAGTTCACATATCAGCAGCA^ 

GGAAAATGGAAAACC^CTTAGCATGTCTGAGCAGACAGGCTTACTTAAG^ 
TTCTTATGGCAACACATGGGTTTAGTGGC^GCAGTCCCT^ 

TCTGCTCAGAGTCCAATGCAGCACTTAGGTGTAGGGATGGAAGCCCCTTTACTTGGAT^ 
GGTACAAAAGGTTCTACAGATTGTGGACAATACGGTTTCrAGGCAAAACAT 

4 0 GTTATCACATGAAGGATCCATGTTCTCAGCCAGAAGAACAAGGGG 

AGTCATAACGGTGCCACTAAAAGTATTATTGACTATACCTTAGAGAAAGTCAA 
CGACTCAAGGAGACAGATCAGTAACATAAAGAAAGAGAAGTTGCGTACTTTG 

ACC ACAGCATATCCACTCCATTTTCATGCCAGTTCTGT AAAGAAAG CTTCCCGGG C CCT ATTCCCCTGCATCAGCATG AACG ATAC 
CTGTGTAAGATGAATGAAGAGATCAAGGCAGTCCTGCAACCTCATGAAAAC^ 

4 5 TAAAGCCCTCCTCTTGTCATCTGTACTTTCCGAGAAAGGA 

AAGCATACTATGCrATGAACATGGAGCCCAACTCTGATGAACTGCTGAAAATCTCCATTGCT 

AAGG AATGG TTTGAG CAAAGAAAAGT CTACCAGT ATTCGAATT C CAGG T CACCAT CACTGG AAAGGAC CTCCAAG C CG TT AG CTC C 

CAACAGTAACCCCACCACAAAAGACTCTTTGTTACCGAGGTCT 

AACTCCACAACAGTGTTACGAGTTGTGATCCTCCTCTCAGGCTAACAAAATC^ 

5 0 CTGGACCACTCGAGGAGTAATACTCCrTCTCCTTTAAATCTTTCCT 

AAATAGCTTCTCTTCCGAGX1AGCTGCAGGCTGAGCCGTTG 

CCACAAAGAACAAAACAAAAGCTACTAGCATAAACTTAGACCACAACAGTGTTTCT 

AATTTGACTTTTATCAAGAAAGAGTTTTCAAATTCTAAT 

TAGTGCCAAGCCTTTATACACCCCTCTTCCACCACAGAGCGCATTTCCCCCT 

5 5 CCGGGCTACGACCATACCCAGGACTGX3ATCAGATGAGCTTCCTACCGCATATGGCCTATACCTA 

GCTGATATGC^GCAAAGGAGGAAATACCAGAGGAAACAAGGATTTCAGGGA 

GTGACAAGACATTCGAGAAAAGCAGTTCCCTTCTGCGACATAAATACG 
AAGAAAGCGTTCAAACACAAAGACCACCTTATCGAGCACTCGAG^ 

CAGCCGrAGCGCGAGGCGCGAGAGAAAGGGCACTTGGGACCCACCGAGCTGCTGATGAACCGGG 
CAGGGGTACTCTGACTCGGAGGAGAGGGAGAGCATGCCGAGGGATGGCGAGAGCGAGAAGGAGCACGAG 
TTATGGGAAGCTGCGGAGAAGGGACGGCGACGAGGAGGAAGAGGAGGAAGAGGAAGAAA 
CCGAAACGATACGGGATGAGGAAGAGACTGGGGATCACTCGATGGACGACAGTT^ 

6 5 CACG AGG AAGACAAT ATGGAAG ATGG CATGG GAT AA 

HUMAN SEQUENCE - GENOMIC 

CTGCTGACACAACCCTACTCCCAAATTTGCAGGCAGGAGAGTGAAGCAATGTCT 
CTTTTTTAAAGTAGCTGTTTCAGCATTCAACTGGGGGCAAGAGGGGTTG 

7 0 TGCTATTCCATCCCTCTATGTCTCCCCACTCCTTCACACCTCTGAAGTTATGGCAAGACT^ 

GACCCTCTTATATTTACCTTGAACAAATAGGATCTTTGA 
GCAAGTCATATCGGGAGAATTCGCTGAAGAAAGGTTTTTT^ 
AGTGTGCATATGTCCTCATCTCAGCTGTTGACATCATATTTATAATCATGA^ 
GACCATATTACTTTTCCAACTTTATCAGTAA^ 
7 5 TGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGAT 
ATGGTGAAACTCCGTCrCTACrAAAAATACAAAAAATTAGC^

GAGG CAGGAGAATCACTTG AACCTGGG AGG CGG AGCTGGCAGTG AGCCAAGACTGTGCCACTGCACTC CAGC CTGGGTG ACAG AG C 
GAG ACT CCGTCTCAAAAAAAAT AAAT AAAAAAT AAAAAT AAAAT AAAAT AAACAACACCTAA^ 
TTTCCATAATTTAACTTCATGTATTTTT 
5 • A T rr r G TATACAQTCATAT lTTCrri ACCACAACCAGCTTGAATCrAA^ TT i 1 H rTTTTTTT 

TTTGAGACAGGGTCTCGCTCTGTTGCCCGGGCTGGAGTGCATGGAGTGCM 
TGGGTTCaAGCAATTCTCTTCCTCAGCCTCCCGAGT 

TT m 'AGTAGAGATGGGGTTTCACCATGCTGGCCAGGATGGTCTTGAACTCT 

GTGCTGGGATTACAGGCGTGAGCCACTGCACCCAGCTCCTT^ 
1 0 TCAATACACAATTAGTATTTACCAAGC^GGGAGCCACATCCAGTTTATCTAAT^ 

TAACATTGAGGATCTAGTGTCAAATATACrTAATGTCCAGTTrCCAGTCAACAAGTGCT 

ATTCCACGACATCTCTGGACCATGATGTCTCCTTCATCACTAACTTGTCCTC 

CAAGAACCTCCCATGGGAGCTTTTCCTAAGTCGGGTGCCTGGGAGAAGAG 

AGGCACCAAAGCCAGAGAAACTTTGGCTCAAGGACTTGAGCCAATATTGTAACT^ 
1 5 GCAAAATAAGCCATACGTGGAAAAACAAGGGCCAATCACAGCATCC^ 

GCACAAGTGGGTC^AAACACATCTGGGAAAGGTAAAAAGATGTTGGATCT 

GGCAGATTCAAAAGCACACCAAGAACCAGATGCTTCAATTC 

AAAATGGACC CAGTTtS T AACCC CTTTCAG AT CTGCT ATCAGAATT AAATGAT ACG T CATAG TGC CAAGCCT AGAA 

AAGCAGGGGTTTTTTTTCCTTAGCTATTATTACTTTATAT^ 
2 0 TACATTTAAGTACCTGAAAGAACCCTGAAAAAAAAAATTGACCCTTAGA^ 

ATAGTTTGAGCCTTTTAACAGAATAAAGAGTATTAAAGAAGACCTTGAAGCA 
GCAGGTGCAAACCATTT'CTAGTCCCATTGTCAGAGGGAGAAAAAAGGTACAGGTCACACCCAGT 
TCTGGTTTGGTAGTTTTGACCTAAAAGGAGCATGGAATGGATGCTTTC^ 
ATATAAGTGTTTTGAGAAGTCTTAAAGAAAACAAGAACAACCAAAAACT^ 

2 5 TGGCATTTGCATTGATTTTTCAGTTTTACATCAAAA 

ACTTAATACACTGATTTTCTTCACCACTCTAGGTAC^ 

CAGTTCATTATGAATGGTCTTAAAGCAAACAGTGCTCTGACAGCCAAATTCCT 
ACAAGTCTTACTCAAGACCGCTTCTGAGCTTTTGCACATTTAACCCTTACACTAAAAG 

CTTCAAGTTCAT ACATTTGG ACT AAATT CCTTT^ CATATTGAATACACT AA 

3 0 ATTAACTGCTGAACTAACAGAAAGGAAATACTGTATTTATTITCACTTCTG 

TAAACCAGCCATAAATCCTAAAATGCTTTTCATTTTCTTrCITCACGTAATA 
TAAATAGTTGTTATTTTTAATAATCAAAATTAATTTGTACCATCTTTAG 
TAAAGCTACAATATAGTGAAGTTTTATGCATTACTTCATTTAATTCTCAATAAA^ 
ATTGGAGATTAAAAATAAAACAAGCTTATAGGAGACTGCTTATAATCAC^ 

3 5 TATTAATACTrTGGACTTTAACTGGCACAGGAGAATAAA 

CC CAT ATGTT AGTAAGTAAAAGAT AAAAAATGATCCT AAATACCAGGCCTG AGTTATTTGTCAAT^ CTGCAATTTTCCACTTTTA 

GACTTOAATTAATAGACIATAACTAAAAATTTAAATGACAAAATTCCAATAAGA 

TTTGGGGAGGAGAGTGACITGATTTTACATAGAGTATTGTTTTT 

T(ZATAAATGGTTTAC^AATTGTTAAGAAGTTTATGGATATCTAGAAAGAATATTACAGCT 

4 0 AAGAGAAATTTTGCATTAAAATGTGACTTTGT'lT C TTCATCCTGAGAGAATGAA IH ' rX ' l 'AAAAAACACATTTAAAATCAAGCATTA 

GATATGCAAATATGTGTTCAGATCAGAGATACTAAAATTATTCAAATGATTAAT^ 
AAGCAGAAAAGGCATCTTTCTAATACATCTTTATTTTrcnT 

GTTTTTAAAATTCAACAAAGTAAACAAATTGTAAACAAGTGAAAACTCTGAATTT^ 
GGGAAGAGCAAAAAATAATAATAAAGAAAACRAAACAAAAACAGTTATGTTCATGACATATGCCGCCT 

4 5 AGGTAACXJGACAGTAACCTAGCCTAGTCAAGAAAATAACCCAACAACCG^ 

TCATCAGAAGAAAAGCGCAGTTTAAGTTTCATAAAATTACCTATTGT 

T ATGG CTGT AAACAAACTTTTT CACATTG C CTT AATTCT AAAG AATG C C CCT ATGCCAAG CATTATGCATTTTTTC CAGG AAGACT 
ATATCTATTATATCTTGTCTTTTAAAATTATCAGTCTX1AGCTGATGTG 

5 0 TTGATTTATTTTATTTTTATTATTATTATTTTACTTTTCA 

TAGTCAGGGTGTCCCTGCCCTCCTGCTCGTAGATTAATTACACATTA 

ATAAAAGAGGCACAAAAAGAATCACAGCGAGGGAGAAGAGTGGATGTTAAGAAAGAAAACAAGA 

ATAGAAAGGAGAACGTAGGAAATGCAACTGGGAAAGAATTTCAAGAAA 

GAGGGAGGAAAAGCAGGAGGGAGATTTTTGGTTTTCAATTCCCTGGTGAGGCTGCT 

CGTCCCCCCCCCC<3CTCCCCGCCCCCCAAGCCTTAGTGAGATTGACAGTCGCGTGG^ 
TGGTAATCAGCCCCCGGCATCACCTCCCGGCCGCGGCTGCGCCCGCTCGGAGC^ 
CGGCGGGCGGGGAGCAGGCGAGCGAGCCAG^TTCAGCTGACCGGGCGGCGGGCGGGGGT 
GGGGAGGGCGGTCTGGTGGGAGGGGGCOMCACaSCCCCCCCCCGCGAasaSCrCCT 

6 0 GATCAAATG CAGGG ACCG AAG CTGG CGGCGGGGGTTGGG G CGGGGTG TGTGGGGGGGGGAG TGG CAG AG CGCG CG G AAAATTGGGG 

ACAACAACCAGACCTGTTTAGCATCAATAAATTAAGATGCTCATTTG 

GCAAAAGCCCTGAAGCCGAGATTCAGCCTTTACTTCAAAAAAATAAAATAAAAT 

GGTGACX5TGCGAGATTCCACAGCCTGCGGGTCTCGC!AGCAGGGATC 

GTTTG AAAGGGCTTCT CAGAGAGATG CT C CTGATTTTTTTTTTTTT AAT AG ACTTTC C CCT CCT ATC CT AGG CTCAACTCCG AATG 

6 5 GATTGGATTGCGAGTCTGCACGTGAGAAAACCGTTTGGCTTGGCTTGGACCCCTGCCGCCCCCCACCT 

CGAGGGGTCCCCTTTATCACCCTTTGCTTGCAACTCCAAAA 

TCTAGATGAAAGATCAGTTTCATTrCAAAACGAGAATAGGTTCCTTTT^ 

CTTTAAAAAAAATCATTAGCTGTGGCCAAATTCGAATTCCTTCTACAGCCT 

GCATCTGGGCCGCCAAGCTGAGC GGGGG G AGGGGGG AG CGG CTGTGAAAGTTGAGGAAATTGACAGACTGAG AGGTCAATGGGGGG 

7 0 ACAGGCACCAGGTCCTGGCTCGAGGGAATTGGAAAACAA 

GTAGTTTGTTTTG CC CATGGAAACCCCAAAGAATT ATTTTT CC CCCTTCAATGTTGATTG CAAT ATT ATTTCACATTTCCGT GTAT 

CTTATTTTGCTGTGTGAGGAGCCGAAAGCCAGCAGCCCCCAGTCCTCT 
TGATC<;CACGATCCCCATTGTCCCCGAACTGGCCCTCGG 
7 5 CCCATCACCTCTGCCACGTTGGGAACTCTCTCGGCCAAATGCCTG 
AGCCGCAGCGCTCACCCCTCGCGGAGACCCTTCCCGTGCCACCATCC^
CACCCCAAATCAAATACTGGCTAGTCGGAACTCCCAGTGATAAATGTTGGGGAC^ 
AGGGTAATCAGATGATTAGGTGTTTTTAATGGAAATACT 
TTTAATATACATAATAAAGACAAGCGTTTTGTTA^ 
5 CTGTAAG CCTCCAATGGCT ACCTGGGTGCCTGAAGAGG AGTCTTG CCTGT AATAACrCTATGCAAAGGGCTCCTTCTTCCTTAGGA- ... 

GTGGTGGAAGGGTGG CTGTAAAGCAAACGTCT AGAG AAAG CTGGATT CCAATCTTTTAAATTAAAAAAGAGGAAAAAAAGG AAG AG 
GAAGAAAAATGGACATGAACAGTGGTGCTAAATCTCAGAAGAGGAACTTGATAAGTCA 
ATGAAAGAAATGTAAAATATAAAATTAAGAATGTGCCTGACCCATGTTGAAAATCTT^ 

GAAG ATCCTG AATG CCTTT C1TCT AT AGACACTTT ATCTGTGATGGGGTGGGGGG AGAG AG TTAATTT AT CCAG CAG CT ATTTT CA 
1 0 TT AT AAT AAAT ACTT AAATT AGGAAAGAGGTGAGT AAC CAACTGTG AAAAT CT ACTTT CTTGCATCCTAGAACAAGATCT AG CTGT 

G^TCGTGCTAGGTAGCTATTAACCACCCCCACCCCACTCCCCCCCCCCGCA 

GGAGTGTGGATGTGGCAACCCATGTGGTGGCTTGTCCCTCCTTTGAGTG 

AGGCACACATTCAAAGCACAGTTCAAAGCAGTTTATCITAATA 

TGGAAACAGGAGGGTGTAACATATTTAACAATATTAAAGCTTAAAAATTGGTGTGAGG 
1 5 CCTTTACCTTTCCGAAGAATCCTAAATGCTGTGAGGCTAGAAGTTTATT^ 

AAAAAAAAAAAAGTCTCACAGTTATTTAAGAAATTGAGCCTGAACCATCCGAGGATAGGA 

TGTGCTCAGCATCCTCAAATTTGGCTGTCTTTATTTACTTCT 

ACAAAACAGCAGAGCATTGGTTAAGAACACTTAAACTTCAAAAA^ 

AGT ACTTG CT AAATTG CTGACTTCCAGATGTGG AAAAT AT CTTT CTTG TTT AGG ACCT ATGT AACCTG ATTGG ATTACG ACAGAAG 
2 0 CGTCACGTTGGAAGC1TTTGAGTCATTATTTAATTAACCA 

TAACCAATCCCTTCAGAGCAAAGGTGGAGT CTTTTCTT TTT AAT CTTACTTTAATTGT AATATCAAAAAATATATACTCCAAAACT 

TAGACCCCATGCCGTTTAATATCATGCTCTCCCTTCCCTGCTTAAGTTTCTCT 

ACACTCTCCCTTACTTTGTAGTGAGGTCTCCCCGAGGTGTAGAGAGAT^ 

TT AG AGAC CCTTTATTAAAATGCCAAACT A C 'I' l ' l ' l T AATATTGGGAT CCAGTCCAGAAATTCATCATGCACACACCCT AAT ACACA 

2 5 TGCCCTAAGATGCAGCTCCCATGCAGCATTTTTTTT^ 

AGAACTTTGTTCCAGAAGCTGTACTGAGATACCrACACAATTTGATGTGCA 
AGAGACTATCTGGATTGAGGACCCGGGATCTTTCCCTTTAACTTTCGCCCCI^ 

TTTTCCGCCCTCT ACTCTGT AGGATTTAGTG ATGAGG AT AATG ATGCG^ CAT CTGA 

TTAACATAGTAAGGAATGTTTATTTTCAGTACATAAAATTGT^ 

3 0 GGAGCTTAGTGAGACATGAGAATATAGAGAGGTAGCTAATGTGTGGGAAGCTGAGCTAGT^ 

TCCATCTCATGCTGTAGGTGTAGGGAAAGTTGTTCAGGTGCTGCT 
AGTTGATGATGAGAGAGCATCTACTTTCAGATACTGAGTGTCrrT^ 
AGOGAGGGAGAGGAGGAAGGGAGGGA£X3TGGAATTTCATTTCT^ 
AGATCCTTTCCCAGAGAGAAACrTGGCGATCACGTTTTC^CATGATGCT 
35 ATAGGTGGCGCGTGTTTCAGGGTCTCTCGTCTCTCTCCTACAGAAAAGAAAAAGAAAAAAATC 

GACACTCTTGGCGAGGTTTTTGTACAGTTTGCTCC^ 

TTCATGCTTTTTCTTCTC^CCIATTTCK3GCCAAAACTAC^ 

TTTATCCCITTTTGGTGTGCTCGTCCTCC^TCCTCCTTTTCT 

4 0 GGTGTCCGTCGGTAGGAATTTCZAGGTGG Gri ' T AGCTCCrTTATGGCA 

TCACACrCTCTCTCCTTGCTTTTCTCTCCTTTTTCCCT 

TCCTTTGCTATCCAGCCC7VGGAAAAGTTTTATTGAGAATTATATGGCTGGAATAAAAACT 

TTTrcCATCTAGTTCCAATTCATTATATTCAGCAATTAGGACTTGGCA 

ACCTACAAATTCGGGGGCAAGTCTTrCTCTTAGACZATGGCAGTTCATC 

4 5 GTCrcTTTTTGTCTGCrTAGCGAGTTTCATTTCTTTATTTC^ 

ATTATAAATAAAGAAGTGAAAGTCCTCATGGAACTTGAGTCGTTTACCAAAGTC 
GCTGTCAGGATATCAGGGTTTTAAATTAGGATAACTTTTAAAATT^ 
AAGACACIATACACIACATATATCCCATTCCATACTTTGGCACT^ 
GAGCCCGGCTAACATGAATGTGTTTATCTCCCACCTACTGATATATTTTACATC 

5 0 ACTCCAGACTTAGGAGTAACAGGCAAAGATAAAGTTTACAGCACAAATGCCC^ 

AACTAGCAACTTACrTTATTAAACATGTATTrATATGCAGTCGTGTGTCT 

GACACTTGTGT G AC CATTTCTTT ATTTCAGG ATTTCC C CT AAC AATTT CAAATCAAT AGG ACAAGGAAAG AAATGAT AAAATATG T 
CAAACTTAAAAGTCTCCTACTATGTATGCCrAGGTAAATTTAAGGTGCTTT 

C CCTT CAC C CCTCTCAGCIAAATGTGTGGAAACTGATACTCACTTr C CAGG TTTTCTG CAAAG CTT CT AG CACAGGAGACAAT AGGT 

5 5 GTGGGGCGTGGGGACTGAGTGTGCCCGGCAGAGAAAGGGTTAATGGCCCT 

CCCCTGTTTAAACTAATCCTCGAGGCACAGGATCTTCCCAACCT^ 
GAGGTCCCCCTTCTCCAAGTTTTGACAGTTTCAACTGAAAAAATGCAGCGCCTCTCTCG 
AGAAAGGATACGATCATTTAGTAGAGCTTAAAAGTATTGCCGTAGTGGTTGGACTT^ 
TTCTACTGGAACTATTTAAGGGAACCAGTGTGGGGGAGCATGTGTGTGAGTGAGGTGGGGG 

6 0 AGTGGCGAGGGGGGGTGGGGTAAAGGACAGTGTCCAAAGAGGCTTATAATTATATTTATCATTC 

ATTTGGGGGCGAGGGTGGGGGGAGGAAGAGACAGTGCCTCGATAAACTCCAAAAGGGGAGGGGAGGGC^CGCT 

TCCTATAATTACTATTATCAATTTGCCCGAGTGTGCTTCATCCAGGGGCT^ 

TTGGGGGGTGGGGGGATGAGAAAAGATGAGAACGAAAAGAAAACCTCGCCTCAC^ 

TCGAAGG AAAG^GTC CGCCT AGCCCTGAGTCAAGCGGTCTTACTGT ACCGCCGTGTG CATTCCCTC ATACGGTCAGG AGTTATGAC 

6 5 TCATTTTGAAGATGTAATTCTTCTCTCTCTGATCCCCTCGCGGGTGCAACACA^ 

CAAG^3CTGGGGGTGGGGGGCGGGAAGAGCGGCGGCCGAAGTTTCGCTTTGGGGTT 

TGGCGTGGGCCCTC^GCGGCCCGCGCGCGTTTCAATGGGCGCGCGATGCCCACATTGTCGCTGTGTTTGGTTG 
GCGTGCTGCCGAAGGAGGGCGCCGAGTCCATGCGAACTGCCATCTGATCCGCT 

CCCCCGGTG CAAG AG GCG CAAACAAG CCAATCCCAGGAGG AAAAACGGT AAG AAGCAGC C CGAACCIAAACTTTTCCGGGCCACT AC 

7 0 G(TGGCTCGAGACCCAGGGAGAAGGAGGGGGAAAGGGAAAAGGAACCGAGGTCGCGGCGGCG^ 

CCTCCCTCCGCGGCCGGCGCCGGAGCCCGGCGCGCGCGGCGGCTGGACCGGGGGGTGT^ 

TCCCTCCGCCTCCGCCTCCCGCTCCGCTCGCGCC^^ 

GGGGGCCCAGGCG CTCC^TCCCGCAGGATCCCGGGG CGGGGGCGGGGGTGGCAGAGAGACTGCGAAGGCAGGAGAGGG 






GCCCGGCCCATCGCGCCCAGACCCCGGGGTGGGAGGGAGTCGGCCGGCCCCGCGCGGGGTCCGGTGGGGTCGATGATGGCTCGGGG 
ACGGATGACTCCTCTCCAGCAGGCTCTGGCCCCCAGGCOGGGGACTGAAGGAGT 
TCTACATGGTCTGGAGGAGCCCGTTATCCAGGCTGAGTOIGGAAGGAGGAAGGAGA 
GATCAGAGGTGGGATATTAAAGGGGGGGGGTGAGAGTTTGGTAGGTCCGTTTTGGAGA 
5 AGAGAAACAGAGAGAGAGGAAGAGAGTGAGAGAGAGACCCCCTAGGTATT 

GATTGGCAAGAGCGCGGAAGGCGCCTGGGGGAGGGGAGAGGGCTGGGGACTAGC^ 

GAGGATAACTGAGTTTAAAAAGAAAGAAACCAACCCCCGAAAACAAACAAG^ 

TCTCGCTCCCCTCGCTCTCAGCCCGATCACCCGGCGCGGGGCrcCGTAGACGCTCCC 

1 0 TTGTGCGCGCCCCCGGCCTGCCCCAGAGCTGCGCGGCGCGGGGAGGGGGCGGCTG^ 

AACACAAAG ACGAGAG AATTTTGATT AGAGACAAATGATG CTTTTCT CTCT CTCT CT CT CTCTCTCT CT AGG riTTGTG TGTGTG T 

ACGCGTGTGTGTGCGTGTGTGTGTCTTTTGCTTGTTGTGGTGGAGGTX3 

GAAACTGTT CGTTACGTT ATCCTG CCTTTCCC CTTTGTGCTGTCTTTG ATCTATTTG CT^ 

CAATAATATATTCCGAGCTGCTTCCCCAAGATTCAGTTCAGGCT 
15 CTCTCTTAATTTAATTTTTTTTTCACTCTGGTGTCTGGCATTATT^ 

TGAGTAATTTAGTAGACACTAAAATTACACCTAATGTCCCCCT 

TATTGATTTTCTAAAATCAATGTCTGTTGTGGGG C ' rr TTT' l CCCCCTCTTACAAGTATCTAAAGCACGTTGCTGATCAACTGGATA 
ACAAAAGCTTTGTGTCAAGGAGGCTTTTTTCCCTCTTCTC 
AAAACAAGGCTCCCAGAAGTGTTCTTTTGCAGGGAAGACTATTTAAAGTGT^ 
2 0 GGGATTTTGCCACCGTTGACCAGCCTGCCCCTAGCAGTGAGGTCAGGCCAGAGCC 
CCTGGGAACCCCTAGGAGACCTGGCAGAGAGGGAGGCCCGTTGGGCGCTTTC^ 
ATGGACACTGCAAGTATTTCCCCTCCACCCAGCCACTAACTTGGGAGAAGCCT 
TTTAAGGCATTTAGCCCAGATCCTGCACACrTAATATCAAGCCTGTCCATTG 
TT3TTTTCCAAATAAGCTGTGGGAAAACATGCTGTATACT 

2 5 TT CTGGTTTTGAG AAGCAGGT AGAAGAGT ACT AC CACC CCAC CCCU' TrTf CA'riTTCT TT CTCATCAAACACAT ATCCAATGC CTT 

TCCCTTTGGGAATTGCTAGGATTTATCTGAACTGAGTGGATTTT 
TGTGTCTTTGCTGATAACACTAAATATGATGATGC^ 

GT AT AAG T AGAAG CTGGAAGGG AAAAAAAGATGGTCATCCAGGAACCT AGAATGG GACTTT AT^ 
GTATATATAGACTAAGGCGACAAGGAGATAAAACGTXSTAAAGCAGTGTGTXjTG 

3 0 ACAATGCTATGTACATTTAATAAAAAGO^ 

AAATACAACCTTCCAGATTCTCAGGCTAGCAAGATGCCTGGCAGAGGCAGTGCCAGG 

CTCCTTTCCCCCACATGGAAAGGATGAAATCTCTTCCCAGAAAATAAGAT 

GAGGCAAAAAGCGAGTTGCCCGCAGACAAGAATGTGTGTCGGTTTCAA^ 

TTTGT AACACTTTCACTG ACGCTGG AG AGG AGGGGG^ C GAG C C C CCCTTTTT CTTT CCC CTGATT AT ACCCCACT ATCT C CA 

3 5 CACAGCCTTGGAGTCAGAAATGAGCACTCGGAGCGGGAGATGCCCTG CTGCTGCTTGCGA.CCGGTGCGGCCCGTTTGTAACTTG CA 

AAGTTTGTTGCTTTTGCCCCTGATTCGGGCAGCG 
GGTCTGCCTCCCCATCCCGCCACCTTCCAGC^TCAGCCTCTGAAA 
GGAAACTGCTTGTTTCAGAGAAAGAAATGAAGTTGTCTTTTAAAGAAAAACT 
AAAGGAAAAGTTAAATTTGATTTTTCTCCAGAGTTTCCACTAAAGGGTTGGG 

4 0 TGCCTTTGGTCTCTGAACTTCAGTGGCAAAGAACAGGGATCAAGTTGAATC 

AGAAGAGCAGAAATGGTTATCCCTCTTTAAAATAAGCCCTCACTCTTTACCACT^ 
GTTAGAGGCAATGAGGGACGGAGAAGTTGCTCCCGTTTCAGAGATGCTT 

CAGG&3CTGCCTCCTAGCTCTCCTCCCTGAG 
CTTCGGATCACTGTCAGTCCAGACTrGGTTATCTTTGCA^ 

4 5 (nTCTAGGGAGCCTGTTTTGCnXSACTTTCAAAGCA 

CCATCTCCATCCr iT ri T CTCTGTGGAGATCTTCTGCAAGTTTTGTCACGCT 
CTGATCTATTTCTTTTATTTTGGTCTGGAA 

GCTGGTGTCCATAGAGTGACAGTCCACCAGGACTAAGGGTGGGCTGAGGATTTTAA^ 
AAATAAATTCACGTCTTCCACCATTTGTTTTCAAATAGGGTA 

GGGAAAAATGTATGTCTACCCTGAAGGTAGGAAAGGAGGGCGTTGCTAGCCT 
ATTCTTT T CCTTGCAACAGTTTTTCTCAT 

GTCCTTTTCCTCTCAGCACCGTGGCTTCCCAAGGTAGACACTATTTTG 
TCACAGACAATTGATTGTCTGCCCTAATGTGTTTCATTTACATGTTTAT 

5 5 TTCCCGCATTCCCACAAGGGGGCAATTGTCTGAATGGCCAAGTCAGACACCTTT^ 

AGAGATAAAGGAGGAAAATCTGTGATGCAGAAACACTAGTTGAAAATATACAGAATTAAATGT 

TAAG C CCAAAT ATG CTTTTT AGCCAAGATGTG AAGGTTG AAAAAAAT AATT CAGAG CAGAGGGAAGGATG ATTT AAACCAAT AAAT 

ATAGCCCTATTCCCCCTCTTTACTTCTTTTCTGTCTTTAGCA 

AAACATCCAGGCACCrCCTCATTATTTGTATGTTTGCTGTGAT^ 

6 0 AGCATCAGCTTACCACTTGGAAAATCTATTGAATGACTATTTGGGCT 

TTCITCATCAAGCAGTTTTTAAAAGGAAAACGATAATAATCAGTAGGCT 
ACTTTTATCTTGTGTGACAGTCATGTCAGAGTGAAATCTCTCAGGAAAAGTGTAACT 
TTTTAAGGTGTTCTGTAGTGTTTTTATGCrrGTGTAT 
AGCC^TCCATGCGTGTATCTTITGGCTTTTCTCTTCTTTCCGTTCTCT 

6 5 CCTGTTT AAAAGAAAATCACAGTT CAGATGGTGCCTTGCTGGTTCATTTAGGAAACAACAGTTATTAGAAGTTGGTCT 

GTG TGGT CCATGAAGACAGAAAAT AC CAACT CTTGTGT CAT CTGGTGGGAGGCAAATGGGGGAAAAACTTTT AT AATTTTGT AGT A 

AATTAGAGAGGGGAAAAAAACTGCCAGTCTGTGAAAAGATTTTTC 

TAGCrcAACCCCTGACACGGCGGTCTTTTGAATG 

CATATCTGATCCGCTGTGTCTCTCCCAGTAAACATCCCCTGAGGGGAG 

7 0 CAGCCCTTTCACCGCCGGCAGGCTCCATTTGCATTCTGCCACAG 

GAGAGAGAGAGAGAGAGGCACAGX3GTAGTAGCTGAGTAAAATAACTTAGGTG 
AGCCTTATATAGCTGATGCTCCACCAGGCCCTGTCCCTCCTGACAGGCATAT 
GC^ACCCGCTCAACACTTGAGGTAAAAAGTGTCAGGAGGGGATCXSAATTCT 
TGGACTrCCCGCCCTTAAGTGCTGCCTACGATTTGGCrrr 
AAATCCTCAGGAAAACTGGAAAAGACTCT^
AAGCCAAGAAGAAACTCTCAAATGAGCC^AAAAAC^GAAAAA 

AAAAAGCTTTG AGGG CTGT AAAGTGC CACGTTTTTCACAC CTT CTC CATACTTTTTTTTCAG ATTTTTTT AG T ATGTTT AACATTT 
GTGAGATGACAGCCTTTTTGTATCTCAAACrGGATGCTTCAGGATT 
5 AGAAAGTTTGTGCCTAAAGGCCTCCTTCCTGGGAACTGTTTAAACCCTCGCAGGCAG 
ACCATTTTCAACATGCAAAAGTAA.CATGCAGTTAATGGTATTGGAATGAGACCA 

TCAG CG ACATT AT CTTGCTCCT AGACTGTCTrGTCTrTTTCTTCCCCTTT CTGAAACCACTTTTGAGATACCAAT AATTTT CTCC C 
AACAATTAGATTAAGGATACAATGCGCTCACACACATGCACCACACACAC^ 
GAAGTGCAATCTCTTCCAACAGCAGAGTGAATGAT^^ 
1 0 G CAAAACTAACGG AT AT ATTAGAAGTTGTGCGAACATGTGACACAGATT CCTGATT^ CTTTTTC CCAGAAGATTT ACAG A 

ATTTACAAAAAGAGATTATCCCTACACCTTCCACCCCT 
TTCAGTAAACACCTTTGGACACTIGCGGGGAAAGTGGGAGTCTGT^ 

GCTTT ACCAGGCAG CTG TGAT ATT AAC CT ACCCAGCAATG CGGGAACACTCTTTAACT CTTTGAGTG CCTGGC CC CTTG CGCT CAG 
CAGATAAAATCCTGGCTCTCCAAAGACACTCAGAATTCTGGCTCTAGCA 
1 5 GTTCAAGCTTTTAAAACAATGTGGTTTAATTAG^ 

CAAGGGAAAAAAAAAAGATAGAAAAAAGGAATCAAGGCAGCACATTAAAAATTCCAGGAAATATC^^ 

TCTAGCTTTTAAAATATTAGACTACGGTAGCTAGAGGGAATGAAGCAGCTGGGGCCT 

TTGTGATGAAGAAAGGGGCTTCAACTTGAAAAGCTATGAGA^ 

AGGGGGAAGT ACTGC CT ATCTGTCCAAACCTGGGAGGT CATG TGAAC CT CAGAGTCAGGGC CTCAGGCTGG C CGTT ACCAACTTGA 
2 0 GACTTTCCAAAGCAAGCAGAGAGGCATTTATGAGGAA^ 

TGCATTTGGGCTTAAATAAGATTGCCTTATTTAGTTTCGAAAAAGA 
AAGTAAAGGGTATGAGACATGACGGAATTAAAGGTGTTTGCTATGTTGTTACACCG 
GCCTCCTGGTGATTTGACAGCGCAGATCTACTTCCCTTCTAGGTGAGACAGCAT^ 
TTCGAGTTGCACCCrcCCTCCCAGATCCGTGAATCCTATT^ 

2 5 AGGAGGCTGTGGGCTCGGGGGCCCGGGCAGCG^CAGCATCACAGTAGCTGGC^ 

AATCATCCAATAATTAAGTTTTATGTTATTTGTATGAACGACCAAGCATC 
TACTTTTTGAACACTTTCAGACC^TGGTCAGGT 

GCAAAACCTTAGCAGTAGCAGGAGTACATAAATATAAATCAAAGACCAAAATTTA^ 

ATGAATT ATT CTCC GATTGC CAACCCAGAAGTT AAACTTTGTCTCCC CT CAG C AACAG T AACTG CGCAT ATTTiTi" CAGGCTTCTG 

3 0 AC AGGAAAGTT AGAGG AT CT G AAAAGAGAAAGGAG AAGGAAG ATG AT CACTG TGTCAT C AGATATGGT ACATTTG TG ACCCAAG AA 

ACT ATGAGAG CTGATAAAG C CATGT ACGTTT ACAGTGG CTG CTG CAG CCTTC^TT AG AAATTAAT ATGGGTTTCT AAAAAC CAGAC 
TTTGGAAGAGCAGGCAGCACAGACAC^TTAAGAGTGTAAGGGGAGCAAACGTTT 
TTTTC^ATTGAGGAATGTAATCATCATAATGATTCATGATTATTTACAAATTTAGT^ 
ATTTGAAGTAGTGGTTAGAACTAGAAGACGATACGGAGCTGTTAGTGTGCATATC 

3 5 ACGCTGCTATAAAATTAAGACTCCAAATAACTTTTCCTATAATCTGTGGGATTAAATC 

T AAGTTG T AAATTTTT CITGTTTG CTGTGT ATGATTT CATCTTA 

CTACAGTTGAATCTTTTGAAGTTAAAATTTTAAATCCT 

CAGTTGGCCACCTTTATATATCACAAATATATGAAGGTATACTTT^ 

ATCTCAGGTGAGAGAGGTAGATGACCACATTCCTGGGAATAGGTAACTGTCATTAATGATTTCCTAAAATGTAT^ 

4 0 AAACTGTAATGGATTTCTCTTTAATAGTTACGTTGCCCATAGA 

TTATAATAACTCAAGGAAAGGTACTCCTGACAAGATGATAGCCA.CCCCCTCCTCC 
GATTAAAATATTGTGCTATATAAAATCTGAGGTTTTTTTCTTTCTCTCAAATGGAT 
TGATCTATAAATGTAATATATTGCTTTCAATGTTTCTTAGTTrAAGAAAGT^ 
AGGAAATTGCAATTGCCTATTCTTGGAAGTACCTAAGTTTCTATTTTA^ 

4 5 GAATTCTCAGTTTTAAGAGGTGAGGTTTATATATAAGGAACTCAGTAAGCTG 

ATATCTCTGAGCAAAATGGTTGTCCTCTAAAGATTTCCCATCIACCTTTGC 
TGTGCACACATTCCCACATGCACACAAACACAAATGCACCCTAGGATCCAAACTCTG^ 
TCATCT AG CTTGTGAATGAAATACACTTAAACAAAAATAGT AGAGG CGAA 
TAGGGGGGAAATGTGTTCCCTATGCCACATGATTCTTACGAAACTTACACCTATTAATCT 

5 0 AATT ACTAG CTTTCTG CrTTTGTACTGTTTTCT C CCATTTTCCTT CAG T ATT AC CAATCTT AACACAG ATT AT C CT AAACTG ATTG 

CAAATTATTATGGTTTTTAAAACAGCATTCIXK3AGTCCTGAATA 

AAAGGGTGGGGGAGATTTAAAACCATACCGAACACTTAGAAATGTGTGGAGTTTTATACTTC 
GCCCIGTGAAACCAAGACCAOVAATGATACGTGTCAGTAGGTTCTTTGATA 
TGCTATGTTGAACAAACACCrrCCrCTGGGACAGAGTGATAAATGCAG^ 
55 CCACACCCCCCACTCCCAGCCCACCTCCTACCTCCTACAATCCAGCCACCTCCCACTCCACTTTGCCCCTTT^ 
TTTATATAGAGGTGCCCAGGATTrrTTAAGGTGTCAGTGGTGA 

AACTG CAGGCAAAGTCCrrTTrCTGGCAGGCTTTATCTAGTCriTTTAAATGGAACTC 
GAACTTTTTCCACCTTTGCTTAATCACTTGAAAACC^ 

AG CT ATT ACATTGTG CAACA CAGACAAAATTTTG CCTT AAAAAATCCAAACACTATTACCTCTT ACTTTT CACTTT CT AG CATTCT 

6 0 TTTAGGGCTAACATTTCIAAATGTATAGTCAACTC^ 

CCTGTTTACTGGTGCCTTTGTGTGTGCGTTTTTTCCCCCTGACAT^ 
TAGATAAACCATTATATTTTTTCCCTGCTCGTATTTTAAACT 
GGATGTTTGCTGTTGCTAGTTACAGCAAGACTTATCATAAG^ 
TCATTT ACT CTACCTTTT CTGT AGTCTCACAAATTAGGCTGCT 

6 5 CTTTCGCATCTGCAATTAGGACTAATTCTGACAGTTACAATTTCTG^ 

TACACGTGAAAAGCCCTTGTCTCrrTTAACCTTGTTGTTTTW 
ACATAACTGTTCATCACTTTCAGGGTTTTCCCTC 

CCAG AAACT CACAGGGT AGAGACKM AAAAG AGACAG CCAAAGT ACAGG CAGCAAACTG ACAG CAG CAG AG TGAAAGACAGAGGGG 
TGGGGGGAGGTTGAGAGCGAGAGCTTTTCAACTTTAGTGTGCGGGAAATCT 

7 0 CCTGAAGGAGCTTGAAAGGTTGAGATTCAGAACACCTTCAG 

CTCrCACTTAATCTGATTGGATGCCTAAAGAGGAGTC 
TTATTAAAGCTTGAAAGCCAAATCTCATGATCAGTGTTATAGGAGGA 
AACTTGTAGGAAAGTGAACAATCCCTCTCTTTACCAGGTTTGITTT 
ACTAGAGCTGAGGwACTGTCC^TT 
7 5 GAGATACTCCTTTGAATGACATGGGTTTGCAAGTTCTCT 
TTGTTTAAAAGGTCAAAAAACATGGCAAAAC
ATTCTATGATTGCAAAACTTGTATAATTGCITTTACAAAGAAA^ 
CCTTGATGTCTTAAAAAAAAAAATTGTCATCCTTCCCCGTCTCTTCTAACrm 
ATTTAAAATCCTCCTTACTTTCCAACTAGACTTGGGGGTCGAATAAGCCCT^ 
5 GGCATTGGCTGATCTGCTGATGCCCATTCGACAGGCACTGTTGAACTAAAGTG^ 

AAGGG CTGAAGATG CCTTG CTGGCATCATC CATGTGCAT ATTGG CCTTGTCACAGTTTTGCTTGATTT AATGCTCTGTTT ATTCCT 
AGATATTAAAACTTAAATTCTGAACAATATGTTTTCAAATACTACATACC^ 

GGGTCCATTTG CTCAT CCAGTTTCTGTGTAGC CAACATTAAAACACAG CTTTGGAAAT AAAT AGATTGCCAGT AG CCT CTT AG CTG 

TTATCCATTTAAAAATGGGATTCCAGTCTTTAAAAGTCTTATTAGGATAAAAGA 
1 0 TTATTTATTTCCAACACTGGATTGTCACATAAATTCCTAAGGAAGAGTT 

T AAAATGACAAATTATTAAAGAG AT ATT CAGT AAT AAAAGAAAG CCT AT AAG AAT ACCT AT AAGGCT AG<XACATCAC(^CTGAG A 

GAAAAAAAAAAATCAAGGGAGTTTATCTTAAAGTGAGCCCTATT^ 

ATAATTGTTATGAAAATCCCTGTTCCAGAGGO^CAGCTCGCCTATCTAAAGAA^ 

aSTCCCGTTGAATGTGTCrTGAGCTTTATCTACXSCTGCrc'r 
1 5 AGTGTTTGCTAAGAGATTAGTTAGCAAGTTTATAT^ 

GTCAGAGTGAGAAGATACAGGGAGCTAAGACTGACTAAGGCCrrGGGATCAGGTAC^ 

GAAAGGTTGGTGGGATGG CATCTGCCAGCAGGAG CACAG ATGTT ACCACTTTTAGAA^ 

ACTTTATGAAAAAAAAAATCTGCTGCC^lGAATACTCCTTTTT 

TACCTCCATTCATAAAACAGCATCrrrCTTGAGAGCTGTTACCAGTTAGOT 
2 0 GCCTAAAAGTATTGCTGCTGCTAATAACCGGTGAGACTGCACTAAGAATCT 

T CT ACCTG C C CT AAGCGTGCTTT CTT AGCAGTG ACTG AG ATGAATTGT CCATTTCl u I l 'I'l l CAGTC^TAGTi"i u X , TCTCCATGTTCACT 

T ATG CCAAAT AATT CCCACAGC CAGAAAGAGTG AT AC^GGT CC<5GGTGGCT AGG AAAGAATAAG C CTTT ATTG CTAGTTTTTGT AT 

TGTCCAATCCCATAGGTGTt3AACAAGATTTAGACATGTAATCTCCAACCT 

2 5 GATTCrTACrTAATATATAAAGTGAATTT^ 

CAAACAGCAGTTCATCCAAATAGTCCTCCrTCCCCAACCCCCAACCC^ 
CATTCTATCTTTAAATTACTCTAAATGAAGTCATAATTCTAGTTATGTTT^ 

C ACCTTTT AAT GAACTG TTCT ACTTTAGGAT CTT CT AAGGAAT AGAGTGT AAAAAGAAG CAAAG CAGG AGATGAAGAGGTCT CG T A 
AATAGTTTCCTTTGAACIATCCACAATTGCTTACATG 

3 0 ATTCAAGTGAGCACATTTCAGAATGACTCATTGGATGATTGGAGC^GACACAAGCCCT 

GAAAGTTCCTGAGATTG^GAGTGGGTGTGACATTAGGCTAGGGTATTCCT 
ATCTTCTAGCTATCAGTTAGCGTAAGGTTTCATAAACAGGTAATGATGGCTCT^ 
TGAGTGAGGAACAGGCTCTGCTTCATATAAATACGATCTTTGAAT 
AGCCAGGAGCATGAATGTGATCTTATTGTATCATAGTGTAGAGATGAGATAG 

3 5 GGTGTCGGC^CAATCCTTCCATTTCCAAAGCAGCAACTCTTTATTTTTC^ 

GATACAGATCTG7VGTGAGTTGTGCAAGCTTCAGACTTGCCAGCCCCCAGCT 
ATATTAGACrrATCAGATTGAAACCAAGGGTTTCTACrGTGGTGGAATTTC 
AACTGATACTAGTATGCTCAAGGAAAACATACTTCAGTTACAGTTGATTTTCA 
TCTAATTATATTTTATGCTAGGCTCTGCAATACCAAAATATAC^ 

4 0 CAGATGAAGTCACTCATTCCCAAACTGGTTTTCCAAAAACAATTA 

TGTGCTGTGTTACAATCAACATACCAGGGAAACAAAACAATT^ 

TTTCACATCTA<^GGGCGGTTAATTAAAATTGTGATTAAATGCGGTCGCGCT 

AATTTAAAAGTAAAAAAAATGTAATTAGCATTGGAAATTGAGAAATTGCCTC 

TAATTTAT AAATGAATGCAGTGGAGG CTGTGCAAATGCACAATTG CCTTCCTTGCCATT CTTAG CATTCTCAGCATGCAGGTGAAA 

4 5 ATG TTTT CAAAATATTG AGGT AATTTGAAAATCAAT C C ATGCATT CTG TATCTTTTTT CT C CCATTTGT ACCAAATGCTGAACAAA 

TGCTGCCATAGATGGCACAATCCATCAGTTTTCTACTTTCTAAAAGTAATGCTGCA 
CAACACAAATTGCAAGCCTAAAATTGTTACACGGGTCAAAATTGTACACCT 

AATGCTCCCATCAATAAAATAGAGACATGGGGGGAAATCTGTCCATGGTATAGGTGGTACAAACAGAATGACTATTAAC 
ATAAATACTTGAGACTAAATTTTTAAAGTTTAAGAAAGAAACAGAAAATCT^ 

5 0 CGTAACTTTCTTATTCCATCTAGAAACATCTAAAACCAGTTAATAAAGTTATCACTAGA 

TTCAATGTCATCTTTCCTTTCAACAAAGATGCAACTTTCT 

GCAAATGAGTGGATTCTATAGATTAGATACTGGTTAAAAATTATTTCTATTTACGCTGT 

TAAAAT CTTTT AATGG AAGAAATTT AACTTTCATGTG CTT ATT CTGACCCTCTG AACAAT ATGAATTTAAAAT ATTTT AAT AAT G A 
ATTX3AGAATATGGGGTTTGAGTATCTGGCTTGATACCACTGGAGTTAAGAGA 

5 5 TTGGCATTCTGCAAAATGATCTCTCCATATAAAGCTCATCCTAAGAATATGCTTATTTAATTG 

ATTCTCTCATGTAAAATACTCTACGTGTTTTTTGCTGTTTCAGGCA 
GGTAATACC^ACATTTTCAGGAGAAAGGTGAAAAATAAGAAATACTGTATCTCTGGGA 

TTTTT AG AG ACAGGGTCT CACT CTG TGG TTC AGACT AGAGTGCAGTGG CAC CAT CCT AG CTCACAG CAGCTG AGATT CCTGGGCT C 
AAGGGATCTCTGGGAGTATCTTTCTATCCTCTGATTTCAGGCCTAGTCTCGGGC 

6 0 GACACAAATGAAAAGTGCAATTTAGAGGTGACAGCTCAGCTGGTTGTTCCAACT 

GTTTCCCATTTAATATTACAAGTAGAAATAAAATGAGACCTCTA 

TCTCCTTTCACCAGCTCTTGCCTCCCACTAACTGTGATGATGACGATTTCATATTCCAGCA 

GTCCAGACTTCATCAAAGAATGGCTTCCTGGATTTCCTTCAG 

ACTTCAGCAGAGTGGTGGGATGGAGGAGGTATTATGACGTGCTCCAGGCTGCACT 

6 5 AGAGAATCACAGATGTCAATAGAGCGCGAGGACCTTCCTACCCCTTGAAATATTTGCAGTAGAA^ 

TCGTAGTGCCCCATGAGCTTAGCACGGCAATCTGGAGTTACTTC 

ATAATATATAATATTAATATATATTATATAATATATAATATTAATATATATTATATAATATATAATATTAATATATATTATAT 

ATATAATATTAATATATATTATATAATATATATTATTAATATATATTATATAATATATAATATTAATATATATTATATAATATATA 

ATATTAATATATATTATATAATATATAATATTAATATATA^ 

7 0 ATATGTTGAGAATGACAGTGTCTTTTTATACAAGCTCCCTCCACCAAGTATCATCTC^ 

AGAAAAGGCCTGGACTACATGCCTGTCCCAGCACTAGATTGGTATTTTT^ 

T AACTCAG TTCATTCTG CTGTT ATTACAAGCAT CACCTCTTTGAAG CTG TT ATCAG ATTCATG AAAGAAAAATGCGCA'ITTT CT1Y 
TTCTACTGAGGG CTT CT TT CACCTTTATTCCTGAGAGTGATCAGAG CTTTGAATTGTAAAAAAAGAAAAATTTTCTCATTGT ACAA 
T AATTGAAAAGT ATG TCAT CAGCTC^CTTT CT AC^TCTT CCTGTGGT ATGTCAG CAG AAAACT ACACTG TTTGTTCATTTTGCATT A 
7 5 ATTACACACCTACTATGTGCAC^CTCTGGCTGAGGTCTGTGGATCAAAAA^ 
GTCTAGTGGGAAATGGGGGTCCATTTAAGATATGATGAAAATGTGGCAAGTGC
ACAGAGGATGGCAC^CTAGATCTGGGAGAAGGTGGTGAGGGGTCA 
TAACATGGATTTCACGAGGAAGTGAGGGCACAGCACGCTGGGCT 
CAGGCATTGTCAGCCAACTGAGATGTCTGAAACACAGGGCATGAGCAA 
5 GGAGCAGAAGAATGTCCTATCCCATATTTTTGGCTTGTATCCAGTAAATAGGTGATG^ 

AGTGACAT AATT AT ATTTG CGTTTCAGTG ACACGTCTCTGG CAACTGT ACG AAAGATATT ATCATCT AAC^ TTGAGT ACTT A 

ACAATGTGCTGGGACCCGTGCTAAGAATTTTGCCTATGTCATCTGA 

TTTCATACTATTTTTACTGATGLAGGAAATTAAGGTTTAGAAA 

ACTGT AATCATGGAATTTCTr AGAATTACAAG CAACACAGG CAATAATG ATCATTGACT AG AAGGACATTTGG AAAAGAAAACCCAG 
1 0 CGTTTGAAAAAGAGTGAGGCAGTTTGATTCTGTTTCTCT 

AATTTATTTTTTCTTTCATTACTATCTCAGTTTC 

TGCTACATCTTTGAGAGTTTGGAAAGAGACAGTTA^ 

TTGCTGATCTAATTCATGACATTAAGGCCTTCCTCGGCCTTATC 

CCCTTTTCTGATTTAACTCTTCCATTGCCTAAGACTCACCAATTCT 
1 5 CCATGGCAAGTCCTAAAGCTGCTAACATTTTGTAACAACTATGTGACAAATTGTTGTGC^ 

GGCTTGCTTGTTGTGTTTTCTX3TTTGGCTGTG 

CAGCCATTTATCTTTAAAACCACTCCTGCACAATGGCAGAGGTGGGCCAAGCCACAGG^ 
CATCTTTTCTGCATATAGGGAGCACGTTGGACAGCAGACAAGACA1TTTC 
GCAAGTGC^GTGTAACTGGACTGCTGGGTGCCC^GTGAAGTGAGGAAAAT 
2 0 TCATATCTCTGTGCAGAACATGC^CAGTACITTG 

ATGTGAACATGTAAACATGCTTATATGAGAAGACAATTTGCTGTTGCTG^ 

CGTTTCCCTCCTAATGTAGTAGAAGCAGAATCTTTTTGTTTT 

CTCAGAGCCAGAGTGTCAAGTGCTGCAGTGAAAAATACTGAAGAT 

2 5 CACTCTGGGTTTTCATTTCTCAATTGTTTCAAGGTGTC^ 

AATAGCAATAAAAAATGTGATAAACCTGCATAGACTAGTTTAACTAAAATGAAGAA 

ACATAGCAAAACTGTGTGATTTAAATGAAATATTCAGAGTGGTCAACATAAATACACATGTAG 

TGTATACACTGAGGTTACTTCTTGAAGAAATAGGCATTTTT^ 

ACCCCAGAGATTGTCTTTCACTTTGAAAACAATCTCT AATATCAGGAAAATGC^ CAAGTTTG 

3 0 TCCGCCTGGGAACTGCAGCAAATGGGAGAAGTATGCATGGATTTC^CAGGTAACAGGAT^ 

GAGAAAAAAAATGACAGTGCTCTAAAGATGCCTAGGCTTTGGAGATGGAGTTTATC 
ATATGTCAATTTTATTACATCTCAGAACCAGCTGGGCATTAAAGGAACAGTAGCCTTA 
TTGTCrACCl'rrClTAAAGGCCCTAGAATTTAAGGAAATGAACTTAGAGATGTGTACCTT 
AGAGAACAACAATTTAACATTATGTCAGTTATCAACCTAAAAATTTGATTGAAA 

3 5 AATAGATACAGATAATAAGCAGAATAGTGATTTTCACATAGGTTGCACTGTCAAGGATT 

TTTCAACAGTCTAGATTTTACCTCrTCACrGATGATAGGAC^ 

TGCATGTGTGAAATATTCCTGTTTTATTAGAGTTTTTGACAATCTATTTT 

GCTCTGTCX3CCCAGGCTGGAGTGGAGTGGCGCGATCTCGGCTCACT 

AGCCTCCCGAGTAGCTGGGACTACAGGTGCCTGCAACCACGCCCGGCTAATTTTTTGTATTTTTAGTAGA 

4 0 GTTAGCCAGGACGK3TCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCAAAG 

GCGCCTGGCCGTTTTTGACAATCTAAAAAGACCAAGCTATTTTA^ 

AAGAGAGGTGAATGGGATGGGAATTTAAGATACTATTTTAATATC^CAAAGAGGTTACACAACTAAG^ 
GAATTGGTCCACATACTCTGCATGTGCTGTCTGCTAGAGGCT 

CCAGACTCCTGCTCCAATAACATCTCTAGATCTATTCACTTGTAACAAATGGAGCCC^ 

4 5 CACTCGAT ACAATG AAAAG AACACAGGATT CAGACC CAAACAAATTT GTGT C GA.CATT CTG ATT CTGCCATTT ACTCTCCATGC CA 

TGTGCCCCTGGGTGCGTITCTTAACCACTCTGAACTTCGGTTCTTAATGCTTA 
TTGGGAGTATTACACAAAATTCTAAAGACAGTGTCTGGAGCCTAGAAAACCCT 
GACTAATCAGGATGCCAGGAACCCTACGATGGAGGCACTTCAGTTTCTGGGCCTTG 

ACCGTAGCCTGAGATATTCGTTCCT' r rr r Tr r ' l lA CTCTTn3GAGAGTACTrAAAAAAAGAAGAAAAAAGAAATAGTCATAGTGAT 

5 0 TGGCATGCTATTCTACCTCTACATTTCCAGGAAATATACATGGTACTTTTTC^ 

TTCTGAGATGACTTTTGTCACAGCACITAAAATATCTGATTCCTATTrc 

TG CCCATCTCAGCTCAGG CCATG ACATACAAAAATTGTTT CGITAAAGCAAAGGT ATAATGAAT ACAGCGGGTA 

AGTATACATTATCATATGTAGAAAAAAGCTTTGGGTAACTTTATGTCAGATTCC 

CTTTGTGACAGCTATGTTATTITGAATACTTGAGTGGGTTTGCTTTT^ 

5 5 GGCTCTCTGCCTTTCTCAATGACACCTTTGAGACTTGGGAGTCA 

TTTCGACATCTGGGCTCAAATrrGGTTTCTACTTTCA 

CTTCTCATAATAATACTCATGTTATAGGGCAGTTGGGAAGAAATACTGGTAACAGGTAAACT 

TGTTCTGTCCCCTTCCTCCACTCTTCTTTGTCACTTGAGCAT^ 

AGTCAATCAACATTTAATACATACCATGGGTAGATATCTTACCCCTCT 

6 0 AAGGAAATGAGCATGCAGAAGTACCTTCTGTACTTAGAATCACAATCATT^ 

TCCCATCATCTTTTTTAGTGAAGCACTTTGGTGTTTCTGCAATCCTTr 
TCCCCCGATCCTTTGAAGACATCAGAAAAGAAGAAGCACAAGGTCTCTTGACTATACGTT^ 
GTTAATAGAGTTGATTAAAACACTTTGTTTGATTCTGCTTAAGCATACC^ 
GAAGGTAGTCCAAATATAGTACTACATAATGAGGCTCCCTGTTGGTAGTATATA^ 

6 5 GAAGATCATTTTAAATCTTGCACTGAATTCTGAATTTAAAAT^ 

ATAAGTGAATTAATTACTGCCTTAAGTTGGTAAAGCATGTGTGAATCCAAACT 

TTTTTGAACCATTTCCTATATGCTGAGTTCTGAGTACCTATTTAACTACTGAGGACATA 

TTAAATGTTTCAATCTCTGAAAATCACCACCACCACAACAAAAAGTCTGCC^ 

AGCA.GAAAAGATATATTTATAATCTGAAACTTATAAAAACATGACAAACTG i'TTTT CATTAAC 

7 0 ACTATTAjGKjTCG TT TTT TTT TTTT I T 1 I TCTCTTGAGAAGAAAC^AAAGAC^AAGTGGCAAG AATATCTTTCACTTTTG CTCACGT 

TTTGGTGGTGTAATATTGGGACCACTGAAGTAATTGGACAACTTTTCC 

AATGGAGCTGCTGCTAGGCTCAACTGATGGCGCCCACATGGCGCCCACCCCAGCT 

GCC^GAGGCCCTGCCAATATCCGCACATC^CTGGCATGATAGACCATAACAGCITCTT 

GCGTACTCGGGGAGTCCTCAACG CACGGATCACTGACACGG CACTACAT AACGGCTCCTTAGCAGCCATT ATTGGGTATCAAGTCA 
7 5 ATGCCTACTCAGGGAGGCTTTATCGCAGAGGCCTGCACGAGCAGACACATTCCGAATACTAAA^ 
GCCCATGTGTGATAGCTGTTTTCCATTTTGAATTTTAACCAT

AGCCCTTAAAAAATGAGTGCGAAAAGTGGAAGGGTGGAGTGGACATAGGAAGATGATC 
TCATTTTATTGTGTCAAAGTTGCAGAAAGCACTTC^ 

TGGCTOlGATGTAAATCCrAGAAAACGTAGTAGAAATGATGGCAGGATCTCTAACTGTAAAAT 
5 T(^TTATAATAACGTTATAGAACTCrcCACATCAAAAATACCACATTTTA 
TCTAGACTTCTAATAAATAATACGTTGAGTAGGAAAGGT^ 

AATAAT AGTACAG CTCCCATTT ATTGT ATCATT ATT ATGTGGCAGG CATTTGTAT ACATT AT CTCATTT AATTTTCTCAG CAGATG 
TGACAAATCAGGAAACTGTGGCTTAGCAAAGTTAAATAACTTGACT 
CTATATCTGGGGAGAGTCATGCTCTTACCATTGCATTGCCTCTAATGATCA 
1 0 CCCAACCCATTGTCrGCTTACTGATCAGGAAACTGA 

ATACCAGAAAGCAACAAGAGGCAAAAACAGCCITTCTCTCTGACAAC^ 

CGTTTGTGTTTCAGAAGGAATGTC CACTGTTGCCATCAGTCCTG AAAAG ATTTCCTTG CACCAAATCAAAATGGTTCGAGTTTGCT 
GTTGTTGATATT1TTTTTCCTTTTGTGTTCTGCTTACTGA CT AAGCAGGTG TT CCT ATGT AAACGAAC 
GTGCCCTGTTTGTGTT^CGCCATCCTCXjCTGCTGCGCACCCAAATC 
1 5 CGG ATGG CT ACCCT ACTAG AACAT AATT ACAC CGTTCCAACAT C CG CAGCTCACTGCTGCAG CAGAGTTGGAGAACAGCAGAAATT 
AATAATATATAATAGCATCATTCTAGTAATT^ATAAATGAAATAAAGTAGAGAGGAGATTCT 
TCTAAAAGACAAAGTATAGATTAGAATTTTAGTGG 
TGTTTTTGCTTAGGGAAAAGAGCTAATATGTTTTATTAGGCGAGAGG 

AAGG CAACACT AAAT AAAGGGGGTCTG AAAAAATCTCTGTGG ATTG TG TG CCCACTGT ATTT AAAAAGAT ATT ATT ACTTTAAGAC 

2 0 TT CAACATGTGCTGCATTTTTTCITTCATTGTAA 

AAGATGTGGCCCAACATAAACTTTATGTTAAATAGCACTCGTGTAAATTCT 

CAAT AAT AG AGT AGATTT CATCTTGTATGTTTT ATCACAATAAAAAAG AACATTCATG CC 1" riTT AAAAAC ATTATTTTGAGT ACC 
CAGTATGTGCAAAGGAGCCrCTTATTACAAATCATCXriTAGATTAAGG 
AACACTTTCTAAAGAGGCAAATAAACTATATGTTAAAAGTAACCAGTCC^ 
25 AAACTTTAATGTTTTTCTAACTTCATTTACCGTAGGTAAGATGCCTA^ 

CATGAGCCAATGCCCATCACACACACACACACACACACACACACACACACAC^ 
ATATTTATAGCACTTAAAGTGTTATTTAAGCAACTTTTCCTC 
GATTTTCTTCCTTTAATTAAACTrAGAGAACATGAAAATACT 
TTTAAGCAGTATTCATTTAAAGGTCCCCTTGAGTATTACTCTATGTGTTTTGATT^ 

3 0 TTTATTTCATTAACAGGTATTTATTTAAGTGGGTTCCACT 

CTCCATGTTGACTAATTTTAATAAGCCTAAGGTGCCTAGGGTCTTATTGAGTGTTG^ 
TGCAAAATATAAGCAAATGTACTGTAACAACACAGTC 

ATATGAAGCTGGGTGGGCCTCTGAACTACCTGGAAACTGCCACTCTCCTCCAAAGCTCCCCT 
TCCCACCCGTTCATGGAGACAGCAGCTCAGATACCCCCATTGAATTAGAAGGGGAAGGAGATT 

3 5 ATTTAATGGAAAACAGCATGTAAGGCTCACAGTGA 

TATTGCTGAAATGCTTTAGGTAATAAAGAATGCAGGCCCTAGTTCTTTTTCACAG 
TTAATCATTTGGTAGCTTGAATTCTTTTrOUSAGA 

CC AAC CTGATTTTT AAAGT AT AC CAAAATCAGCTTTG C CATTGG T ACTG C ATTT G AAAAC CT AGT AAG CTTT CTAACAAATT CATT 
GCTTTCTATATAATTGTAAAGTTAAATAGATAACTGAATATTTTTAGTGGATT 

4 0 CCTAAATC?AATTTCTTTACCTTTCC CTrTTT CT 

GTG TCAAT CTTTTT AAG CTTTTCAAAAATCT AC C ACAATCACTT ACC^ CAT AAGT AT AT ATCATT CAAAATTG TTC CTG AT ATG 
TGCTGGCTGTTGGGG CAGTTTCTTTTTC CT AC CTGTTGTGTGCCTTTGT AC CTAT CTTTAAT CTCTTCTTGCT AAG TG AT AG CT AC 
CATC^CAACACrGGCTTTCCTTGATAGCATATCTTCCTGCATG 

GGTAAATCAGAAGTTAAAAAAAAAAGCAC^GAGATAAAAACATCAAAAATGTACATCI^ 

4 5 AGTGTGTTCTATTGAAATGAAAATCATTTTCTAGTTTAAACT 

TCAAATTAAGTAGAGTGTTTGAGCTTCCTAACATTTA^ 

AAAATCATCAGAAATATGTAGACCAATACCAGTTCAGAAGAATTGACrrTTATTTTTGTAAT 

TTGTTTAGTTTTGTGCTTGTTAATCnTGATTTTTCCC^ 

AAACAAGTGTAAGTTTACCCAGGCAGCTTCACITACACCCnTCC 

5 0 ATGATAATTACACATTTCAACATGCAAAGAATTGTCTCTGTTTCAGTTC 

TTATTATATTAAAGATCATTTCCATAATTtnTTTCGG 
ACACATGGAAATAAGGAGAGGATAAAATTAATTGCCGAGTTCCACT^ 
ACATTATATTCTTTCTTTAAAAAAAAAAAAACCTTCTAGTACCTTATTTGACATTTT 
TTTTCAGGTCGACCAAAAAAATGTAGAAATACACATTTT^ 

5 5 GAAGGAATCTTTTCAAGAAAAGATGTGTTTGTTTTTGTTTGCAAG 

AGGCACCTCACACTTCACAATCAGACTTCTGGATTCATCrAGAGTATCACTCA 
TAAAATACACTGAAGCCTTAAAAATGCAGAGGTTTTGGCAAATAACACT 

AAAAT CAC CGATT CCTTTTrAC T TrLTTAAGCTAAGAAGAAAAGGAAGAGAGAGGCCAGGCGCAGTGGCT CACACCTGTAAT CCCA 
GCACTTTGGGAGGCCGAGACAGGTGGATTGCCTGAGGTCAGGAGTTGGAGACTAGTCTC^ 

6 0 TAAAAATACAAAAATTAGCCGGGCX3TGGTGGCGAGTGCCTGTAATC 

CCAGAGGCGAAGGTTGCAGTGAGCCGAGATCATGCCGTTGCACCCCAGCTCAGGTGAC^ 

AAAGAAATAAAAGAAAAGGAAGAGAGAAGGCAGCATTAGATGGCTGATGTTATGGCTTCGTG 

AACTCCCCAACTCCirAGTTTTCCAAGTATGTAAATGTTTCTGCTC^ 

TCrAGCCCAAATATA r rr i ' rX 'AAAAGAATCCCTXSCAGA T l M ri^T G CTTCTATAAAAAATATCC 

6 5 GATTTTTGGAATTAAATGATAATACTAATTAGCATTTACCTATACATCCTTT^ 

TATTTAATTITCrAAATCATTTTCCCTAGAGAGCATTCATG 
CACAGACTTTTCCCATTGGGTAGAAATTGAAAOkGAACTGAC^ 
CAAGATGCAGTTTTGTATGTTAATTATTTTTATTGGTTTT^ 
CTCTITACTCTGAAGGGCCAGGTTATAATTGCCATGTATG^CAAGAGTC 

7 0 GGTGAGTTTTTAGACTTATGGAGCTGCATGTT 

TTTTTTTCTTCATAATTTCTGTACAGGCTGGGOUSAGAGT^ 

GGTTCCCACTTCAGTCGCATTCCACTAAGCCTATTTTGGCACCATAGAG^ 

AGGAAGGTTGCCAGGAAGGAATTGAAAGXSCATATGCTCCTGGCCCC^ 

AG TGG C CATGGCAAJVT AOG AAAAGGTTTGGGTGGGAG AAACACAG AAGAAG CCATT C CTG ATT AGG CCATG TGTCTTCCACT AGTT 
7 5 TCCATCAGAGTGjGKjCTGACCGATCTGTGAAGATCTGGGATCATGA 
ATTTGGGCTCCTGGCCCACCCAGCCCTGCCTTCCAATCTCAGCATGGCCTTGGACACA
TAATGATAAAATGCAGGGACTGGACAGAGATTGCAAACTG^ 
TTTGAATTGGTTGCCAATATTTAAAAATTGGGCTATTTCAC^ 
CTUUlTCCAGATTTCCAGCTCCCCrcGAGAACTCAGATGATCTGGTG 
5 CAACTTTGAGTTCCGTAAGAAAAATCAATCAACTTCCCT 

TCCAAGACTCCTTTGJ^GCTCTAGTTTTAGAAtTCCAAATATGATT^ 
TATATTCACGGCAGAATGCTAAGGTATTTGTTTATA 

CC CATTT AGAACAAATGCTG ACTTTGAAATTTG AT AATGGGAGGCAGG AATAGTTCAG CAAAGAT AGC CTTTGAGGT AGTT ACAAG 
AAAGATGGTTTGGCTCCTGACTCACTATTACCTACACAGCGAACTGTGTA 
1 0 CAAATCAAATAATACAAGGCCCTTCAGTTAACATTTTTTCACATCTCCCACT 
AACTTTITCrGTTCATCTAATTGCAAACAAATTATA 
TTTTAAAAGTTATATTTTTGTAAATCATGTTTCCAAATCTATCCACTT^ 

AGTGTTT AAT ATTGT ACAAGGTGGATTG AGAAAAAT AT CTCATTTG AT CTTT ACAACAATTGTAGGAG CT AGACAG AGAAGACATT 
TAAT ACAAATG AGGAGT AG ACCGAG AAAAG TTGGATG ACT^ TG AAAAAGAG C C 

1 5 GGGATTCAAATCTACAGCTTTTGGCTCCATATTTAATGTTT^ 

GAGAGGTTAAATGGGAATTGTCTCATGAATTGTCTCAATGITTTCTA 

GAATG CTCGATTTTT ATT AAGGGAAG CTAACAGATATTT AAAAGCACCCTCCAT ATAT AAAAG CACAAT CCATGGAAGCAT AGTGC 
AAGGAATAATGATATAGACAAATGGAGGAGGGGCAATAAAAAGTCATAAA^ 
2 0 GAGGGAAGAGTCTAACTGTAAAAGTAAAAAGGTAGAAGATAGAACCGTTX3AAAAA 
TAAACAGCAAGAAGACTCCGTAGTAATTTAAGAAGTGAAAACTTTTAGGACAGAGG 
AGTAACAACTAGGAAGTGTGGAAGAAAAACCTGCAGGCCAGAAATCTGAGCAAAGACTTG^ 
ATTGAAAGGGATGTTTTAAGGGATTCAGACATTGAGAGGAAAGTAAATTTTC^ 
TTGGATTATGAGAAAGGAAATTAACATGGTTAGAACGAAAGAATAAAAATTAGTAAGAGA^ 

2 5 G7U3AAATGATATTAAAAGGAAAGAAGGCATGGGTGGAAAGTAGAATGAGATGAGAACCT 

GCGTCAGGGAAGATCATGACTCGGACCGCCGCACACTGCAGTGATGAAGACGGAGAAGCTAI^ 
AG AAACCTT AAG ATGT ATTTGGAGGGAACCCACC CCAG CAAGGTGAGAAA^ 
GGGGAGAAGGGGGAAAACTAGGGGAAAGAGAAAACAGCAAATCAAAGTT 

TTGGAATT ATTCATTT AAG CAT ATGGGT ATTGGCG TC CCTTTTCT AAAACCGGATCCTCT C CCTT ATT^ 

3 0 CTATTATAATAAATTCACATGGAAAAGTACAAGGATGAGCAATTTTTTACTTT 

GTTATCTGTCTGAAGGGTGACGCACTATAGTGGCATGTGAGAATGAGGCCA 

AAATT CAT AATG C AT AT AAAGT AACT CGTG C AC AC ATTT ATGGCAAAACAT ACT CT AGAT ATT AATTT ATATTG CTT AT ACTG AAG 
ATCrcrrGAGTrrrCAAAAAGTTTGATTTACAGAi 1 TTI CACATCAAAAATTTTTTGTAGATTCTATTTTGTAATTAAGATGTGTA 
GACTCCTCAATAATTCTTTACCTTGATGCTGTGTATACTCATTTATCrAGATTAAAA 

3 5 GT ACC CCTTGGGG^rCl AG AATTT AGAAGT CT CC CTTCCCCAACT ATTT CTAlJ I CT AC C 1 1 II CAATTG AGTT AAT AT AATGGCTT 

TAGAAATAGAATTTCCATGTTGCTATAAAAATGGGGTTAAAAGTCATAAC^ 

TTTCAGTCACACCCGGGCCATGTG CTCGGG CCACATACAGCATC CACACTTGAG 

GTCAGGGCACATAGAGGATTAATTTACTGAAGGTGGTAGACAATCCCr 

GCAATTAGTTCTCTTCAATAATAAATTTAATCITTTCCTCCT^ 

4 0 TATAAATGCTGGCCrrTGTCTAAGATCITCAACTCAAAGGTGTAGA 

ACAAATAATGCTATAATTATTAGAAAAAAACAGATTGAAGCAAGATACAAAAATTCTCAAAAGCAGCTATCTGTA 

TCTACTTCTAGAGAGTATGTGAATGACTGTATCTTTCATATCGTGAA^ 

AAACTATCACAGTAGTGACATTGCCTATTTTATATGAGTAGGGAAACTAGCCAAAT^ 

TTTTTTTTTTTTTTAACTTTTAGTATAGA 
45 TTCCTATCAG-GGCAGAAAGAATTTATAAGGGAGCTGCGACTTAT 

CAAGCAAGATTTTAAGAACAATTGTCCTTAAACATTCTTC 

AAGAAAAGTTCTTTCCTAACTTGGGTTGCrrATGCAGAGATGTGAAATTTCA 

CTGGGAACCCTGCTAATGTGCATATTGATGTACCCrrcTAGCTTCACA^ 

AGCTAAGAGAAATTTGAAAGGAGCCAGTTATTTTACCTGGGGAATTTCTCCCAG^ 
50 ATGGGGGCC^TCCTATTTTCTAAAAAGTTTTTATGGGGAGCCATAATATCACT 

TGCTTTCATTTGCATTTATTTAGTCAACTATGCCTAAAATACGGAAAA 

GTTTACTAGACTGGACTGACATGATATCATTTGTGCTTTGCGGGGTCACAACA^ 

GATGATTGTGTCTGGTGCCAGTCAGT CGCAATTAGGACCAGGATCAGGGAGG CCCGGGG CTTGTGCATCGTCTTGGGCAAG CTGCA 
TAGAAATCAGCTCACCTGAAACTAATTGTCACAACAGAGACCT 

5 5 TCGCAAAGATAGGTGACCTGCTGAGCAGGGGGTCTGTTGCTGGGGAAATCTTTC 

AAGTCCACCCCCCGCCCCCCCCCCCCCCAAAAAAAGTAGCATACIACACTCACT 
AATAAAGCAGAAGCCTGCTTTGGAGC^TGCCATAAATTGGGAGTCTTT 
GGCCTTTAAAGTTTAGGTGCTGOKSATTGCAGTGTGGCCTACCTTGAAAAT^ 
GGACCOSCACAGCCTTTTCTCCCTGCATTCCAGCAGGCC^CCrCC^ 

6 0 CTAGTTTCTAATTGAGCTTTA(nX3GTATCTGAGGTAGAA 

GAAACAATGGCAGTTTCTTGAATTGTGGTTTCTCAAATACTCACA^ 
TAATGCAGCCGAAGTGGTAATAGATCATAAAAATTTAGCAAAAGAAAAGTAAAAATC 
CAATCCCTGACAAAAACACGTATAAGTTGGAAATGACGTTTGATTTTATTG 
CTTT AGAAAACCAGATGAAGAAAAGTTATTCAAAGGT AATG AT AAACATTT^ 

6 5 TTCAATTGATTTTTAGCTTTTCAATCTTGTTTTCATAAAAAC^ 

CCCAGTGACATAGTAGATAATCTATATTC^CACAGGA^ 
AGTAACAGAGAGCATAGATATTTTGCATCTAATGATTTCAAA 
AGTTTCATTGCATTGGCGTTGAATACAAATGCTATTTTGTTTGG 
AAGGGCTGGATTGAAATTTGGACTTGTACCTGATAAGCAAGCTCAGGAATTAACT^ 

7 0 GGCTTAGAAGTGCAACTTAATCACAATTTAGATTAAAACACACACG 

GTCTGAAATAATTTCCACATAAGAGACATTTCACAGTGG 

TTTT AGTTTTTGAAT AAAG AACACTT AAAAAATTTAGATGCAAATT ATT^ CATT ATTCATG CTCTAAGTGTCCTTGT AGATTT A 
AACACTTCTGAGGCCACCITGATTCAACTAATTACCTAGGCCCT 
TGCACCCGATGAGCTTTTAAAGTGGCTTTACACTGAACTT 
7 5 TCTCTCAGGTGGGAAACCCTGGCTGGGCATCCATGGGTGACGGGATGAAACCCGAGCGTCT 
TCTCTACCTAAGAATGTTCCTGCCTGATTCTAGTAGTTG^

AAAGTGACAGTGATTCCGCTTAACCCCTTCATCACCAG 

AAGGCTTCCTTGTAAGGTAATAAGTCTCTCTCATTTTACATGGTTCT 

CTATTAGAG CT AGGTGCTACCTCCACCACGGTATTCACACGAAATGGAGC AGGCTTCAAACTCAAAAGT AG CTGAAG AATACTTAC 
5 AATGGAAATGAAAGTGACGATAGAGCATCAATCT 

GGCTGAAGCTT AT AG CTGGCGTTTGTT ACAAAGT ACCCAAG CTTTTCCTTTC CAACTTC 

ATGATGTGCTGTTGGAGTT AT ACTGG CAGAGATT ACAGTAAAGGGGTTAAAGTTGGTTT ATGGTAG AAAAAAAAT^ CAG AAAC 

CAAT AAAAAT AAACACACTCAAG TAG ACT AAACTTGCTGGG CTTTT AGTTTT ACTTTT AAC!AAAGG AAGTTTAGAT^ 

GTTGCAG CTTTT AGAAAATT AGTTGACACTGATACTTG^ TGGT AT AATT AAT ATTTAAACATT CT AAT A 

1 0 ATGCAG CTTGCTTTG CATG CACTTTT CAGCCT CACAACATTAT ATTTT ATCTGGAATTGTG CT AGCATTTT CTTGGGT AGTGCC CA 

TTTTCATAGAGTTAAATTTAAAATAGGGTAGGGCCTCAGCTCTTATTC 

CATCTACCAGACAGTTTATTATTAAGAACJATATTTATTTGAGATAA 

TTTTTAAATGATCAGAATAGATTAAGTTTGAAAGTTGCAT^ 

GTCAAGGCAGTGCGAAATCAGCAAGTGATACATGAAACATGGAACCCCC^ 
1 5 TAAGGAATCTTTATTTTCAGTACATAAAATTC^ 

TGAAACAATAAAAATATAGAGAAGTAAGCTAATTTX3TGAGAAGCCTGAGCT 

AATCTCATGGCTGTAGGTGGTAGGGAAAAGTTGTTTCAGGTGCTGCCCT 

T CCTG ATCTTCTAGTTTGAATG ATGAGAAAG AGCATTCT T CCTTTGAAATTGAATTTG CATGC 

ACATATTTACTGCTCATTCTITTTAAGTGAGTAGGTGTGCT 
2 0 GGCCCACTTCATGATAGGTACTGCCCTGTGGGAATCCTGTCATTTAGTGAAGCATGA^ 
ATTGGGG CAT CTCATTTCGATTAATTCC CTGAAG CCTTTGCT 

ATTAGGCAG GTTAAATTTCG CAAGAAAAAAAAAAGAT AAG TAGT AGAAAAAAT AG C CAGTGT AAAAT ATT AGT ACACAGGAGCAGT 

AAGTAAACTAAGTTCCACACrrGAAAGAACCACCAGATCrCGAGAAG 

TTATGTCAGAAGGAAGTAGGTTITTACAGATCCTTGGACATC^ 

2 5 TGTCTATTTATTTTATTTGGTTTGTTTTTGAGAAAGCGAGTTAAAT 

AAAACCIX3CTACTGAGGTGGTAGAGATAAGTTGAATCCAACAG 
GACAAATTAAAAGCATTTTTCATATGCTAATCTCCACCTGCACA 
TTCAAGAGCTATCAGTAGTACATTACTTCTAGGAATTATGAC^ 
AATCTTAATTCCAGATGGGTGTCCTGACAGCAATGTGCCAGCAATATTGTGGAAAG 

3 0 TACACX5CACATATGTATATGTATATATATGGCTTTTA 

AAGGAATTACTTTTTTAGTTGCGAATAGAGATCAAAGTGAAAACA^ 
GCATAGTGAGTGTGAGAGACAGTCAAGGATGTCAATGAGAACCTAC^ 
AAGTAGATAAC^GTTTCAGGCCTATGTGAGAACGCATAAACTAAAACTTCATGCT 
AGAGTGAGAAAACCATACTTTTATTATGACAAAGCCTCTCCCATACITGTTGAA^ 

3 5 TTCTCTAGGTTTTATATTATCTTACAAAAAGCTTTTGGAGT^ 

ACTT CAG CTTTGAATG ACIAG AAAATTT CC CATGCTTGTTGAGTTTCC^ TAAG AAAGTAATT CTTGT AATAAAT AAAAATG 

TTATATTTCTTTCITCCAATGTTATCAAGTTAATAAACCCTTAC 
AACCGAAATACTCAAAAAGAAATCTTAAACATCAAATAATAACGAGGGTAATC 
TAATGTAAGGTCCTATCTCTTTrTAATAGAGAGTCCTCA 

4 0 TCAGTTATTACGTAGTACTCTTTTAAGACCCGTGAATTAAGTTATAATCAATGATTACAACACT^ 

TTCTCTTCTAGGGTAGCAGAGATCAGTTTCTGACACC^ 
AGGCAGTAGGCAAAATGATTTGCTTTGGCTTTACCTACAGGGAAACT^ 

T CTG ATG CCTTGTGACCAGG TGCT AACATGG AAAAG AGTT AG CATCATTGACATT CTTTG ATT CGG AAG CACAT AAGAGGT AGAT A 
AGGAGACAACACAGGCGACCCACCTCCAGGCCAGGAGAAGAGGACAGTTCCTGGAATGTC^ 
. 4 5 CCTATGTGAGGCCATCCATCKMTCTGAATCAATCAGTCTCATCCCAGGGCTCTGCT 
TCCTOUTrGCTGTCTTCAGGAGGGGCGCTGCTCAG^ 

CCAGGATTAGACACTGGAGAAAACTTTCTTTAAATTCTCTATAAGCTCTGCTCT 

TCATTAGACAAATAATTGACCCCAGAAAGCGAAAGACTGAT^ 

TCCTTGCCTK3AATTTAAACCATGAATTGGTCTGCTGTTATA 

5 0 AGCCTGAAGTATTTTCTTCGGCCACTTCACTCTCCTCCACGCT 

TGCTTCTAGATCATCTTCTTGGCATTCCTCATGTGCACATCCTGTTCT 

GCACACCTGACAGCCTCTTGCITG CT ATTTCAGCT ACT CAATCCCAGCACTTTGGGA 

GGCCAGGGTGGTAGGATGGCTTC^GCCCAAGAATTTAAGACCAGCCTC 

TCAAGGTATGCACCAGTTAAGCCAGCCACGTTTTAGGACCACATAGCA 

5 5 CTTTGGCAAATTCAATACCATGACATAAAACTACT 

CGTGGACCTCCCAACCAGGAACCCCATGGTTATTTTTGTCTCACTGCTGT 
TTGCTACri'TTTCTTTGCCAATGTTTCCTGAATCCACCCCTTTCCTTTCCATTCT 

GACAAATTTITTTTTCTCTTGTCTATGTTTTTGG 

6 0 GCTTCTAATGTTCACTTCATACATGTCACCTCTTCATC 

AGCTCTTCTGGTAC1TCITTTTTTATCGTTCT 

CAAGGGCAGATTCTGACACTTTCTTTCGGTAGTCCATCCGTGCr^ 

AGGTTCAAAACTTCGATTTCCACCACTTGCTACCTATGAC^ 

AAAATGGATTGTTGTGACGAGTAATAAGATCATTTATTTAGAGCACTGCTGGCACACA 
65 GTATTATTATCATATCTCATAGGACTCCCCAATCACACCCCGCTTCAATCACC^T<^T 
AATTTTGAGGAC CACAT AACTGGGTTTTAAAAATCTGGTTCTAACAGTAGT^ 
ATACCTAATATCAAACCAGGTAATGATGAAGACTTTAAATGCITTT^ 
GATATGATGATGATGATGATGATGATGATGATGAAGAAAACATGACCAAATGC^ 
GTGTCCAC^AAACATACAGTATTCCTCCTGAAATTAGTTTTTTTT^ 

7 0 GTGCAGTATGCATGAGATTTATTTTTCAGAAAG 

CAGGAGATAAAAOVAATGAAAACTCTGACTTGTTAATAATTTACGTTTTG^ 
GACCCAACACAGAAAATGCAAATAACATATATCC^CTGTGCACTGAAGAAAATTTGT 
AAAAAACACAAGTGATTTC^GGGAATGTTGACCACATTTACAGG^ 
TATTCGTGTTLAATGATCTCTCCACITACCAGGT^ 
7 5 AATAGGGGATTTCTTTTT AG AAAG C CACACCATT ACTTTT CT CT AACAG CTT ATGCAAAaGTTTTG TGGGGG^GC CJUULTL^CC C*A 
GCCAAAAAAAAAAAAGAAAGAGAGAGAG

GTCATGGAACTTTTTAGCTGTGCTGCCTGGGTAACTGCTCTC^ 
CCCTCTGACACAACC TTTTGTTG CATTGAGGTGTTTG 
ATAACCAGCACTTCTTTTACTTATTGTCAGCTTTTGG 
5 G CTG ATGT CCTGT CGCT AAATATTCTT CTT IT1T CTAAACAGG AAGG ATTTT AAT CACTT ACAT CAGGT AACT ACCG CTGTT ATG C 

AGGCATAGCAAGTGAAACATATTAAGGTCCAGAAATAAAGCATTTG 
AATCAGACACATCCTTATTTTCTTTTTGTCCTAGAACCGAAACA 
TACTAATTTTTCATTGAAGAACAACAGGCTTTGGTAATA 

CAACAAAG AAAG AG AG C CTT AGAATGAAAAGTGT AGGGGT CTTG TT CCAGTCTGCTG ACT AAATG CTGG AATATTTTTTTC CCAAA 
1 0 AGAAGGGACACAACCCAGGCTCTAGGAAAGAAAAGAAGCTTAAGTGG^ 
GACCTACATTCTGCACACAATAATGATGTTTTACG^TGG 

TTGAATTTTACTGCT ACAAAGTAAATGT AACAGCCCT CTGG^ CACTTT ATTGTTTG CT AAG CTGATGTCATC 
TTTCTACAGACTGACTTTAAAACAAAGGC^ 

AAGGTGAGCTGATAAAAAAGTATTAATACTTGGCTTTGATCATCITGTA 
1 5 CTCTATTCAAGGCAAACITCATCCrcGCAATTCCT 

TCATGCGTGTTTTAAATACATTTTTACTCTCTAAGGCAA'rTTCT 

ATCITTCAGAGATAATATATGTATGATTTTAAAAACAAAGTATGGAGA 

CCTGTAAGTGGTGACTGCAAAGTTTGCAATTTAAATAAGCTGTT^ 

CAACTCAGAGGGTTTGGTCTCTGTATCTGATGCACGTGTAATACAGCGTGAA 
2 0 TTCATTTCCACACTTATAACCCTTGAAAATGTATCAAATGAGCAG 

TAAATCAGTTACCTTTGAAGAAAAGTTAAATTAGGGGGAA 

TTAATGATCATTTTGAAGTAATAATGATAATGAGGAAAATATAATTTAGTGAGCTA 
GCATATAAATAACAGCCCAAAGGACTAAAGCAGCCTGATGGTGTTTTT 

CT CTGGTT AG AAGGAAAACACAGGTCAAGGAAATTGGGTTAAGGGAAA^ CT AGGTGT AGAATTAT ATGAAT 

2 5 AGTGACTAATTGGGAGTCTCTAGCACTCAGATCAGTGTC 

AGCTGGATACATCTTTGCXSTGCTCrATGAAGGCAGTGTGTrAGTTC^ 
ATGATTGTGACTAAACCAAGAACTGAACTGCAATGTTTTAAAAATT^ 
TTTAAAAAACCTTACAACTATTTTGAAACATTTTACT 
TCATTTATTGTATTTGACAAATACTAGCTACTTACCATGTC 

3 0 TTAGTGTCAGAAAGAACTGCTTGGCAAGGGTAAGAGTGTGACACATTGTGAGTAT^ 

AGGACATGCTTGAGCTGGCCTTAAAAAATCACCTGAGT 

AACAGCATCCCAGAATATGCTCTGAGTTTTAGCAGCACAGCCCA 

TC!TTTGCCCrTCTGTCCTCACCCCCGTTGTACTGTAATACACX3CAAACATC 
3 5 GCTAACTCCATCGTTGGAATTCGACTGTCC^CCCCTTCAAGAATCAAAGCATCT 

ACTTGTATCOVTAAAACTCACATTTGAGTTGTCTATCATGAGAC^ 

TAAGAAAAACTGGAAAGAGGAACAGGTAATATTTAAATCACCCACT 

TGGTTTTATCAAATTATTAAAGTTTGCTTTTTGTGTCCGACTTAGCAAGAAGT 

GACAGTTCCAACTGTAGGAGGCCTTGTGTCATTACCTCrATAAATGGAATCGAGCTT 
40 CCTTTGATTTCCCACTCTGCTTTCrcrrCCCGGC^ ' 

TCCATTTGCC^GATCCCCCCAACCCAGATCTTCOlAACTACTCCCCrTrGAAGT 

TTCCATCCCCTCrCTCACTGCTCCAGTTTCTTCTCrrCC 
GGCAAGACTGAGT AAG CTGG CTCAGAATTCTCTGAGTGATTCAGCTCTGCCCA^
ATAATAATGAAAGAAATGGGTTTTAGTTGTGAATATAAATCCCACAAACTTG^ 
ATATGGTGGTCAGATGGCTCAGAGAATGCAGAAGTGGATGAATGAAGGA 
GGATTCTTTCCCTGGAATTCATTAATGGATTTTTTAAAGACTCnG 

C AT T lT TTTCCrGGGCAGTGGGTCCATACGTTT AAGCAGATTCTTAAAAGG CTCTATGTCTCCCCACTATCCG CCTCCAAAAAGGT 

GAAGAATCATTGCTATAAGTGAAGACACCTTATTTGGTCrATGTTTTCATGTGGTCTGC'lT 

ACATGATTTGAGCIAGTTCAGAGAGTAGAAACX5TAGGGTTTTAAA 

CTACCAGGTTATACAATTCACATTTTAAAAATTATTTT^ 

GGGTATAATCACACATCTGAACATAAGAATTAGTTTTATGCAGCT 

CAGAGAAAAATAAACCATTCCTTATGTCAGTCTTGTTAGGGAGGGATTAAGAA^ 

GCTGGCAGGAAAATATGTAATTAGAGCAOGTTCCCAGGCCTGCACACCT 

TGTCGGTGCCAGGTTCTAGCACAACCAGTCACAGTGACGCTCTGAGAACATTTCGGACTCATCT 

GAAG CAGGCATTT ACACT ACCACACAATGG AGTG CATTG CTG C AACACTGGTTT CATT ATTTTCTGACG CCT AC CTT CAGCACTG C 

CCTGGTTTACTCCGTTTGAGGTTTTATTGTTACGGTA 

ACAGTACTGACTTGATTATCAGGCTGCAGTTGCACAGCGG 

ATAATGAGAACAATTGTTTGACATTTTTTTATGTAGAAAGATGGGCCATGGTAGTTCT 
TATATTCCCTATATGTTGTTGAGTATTCGTTTTTCATATAAAGAAAGCTTAAAATGTGTTCTCA 

GTATGGGAGG ACTCCATCATGATTCTCATTAGL'TTTr'r'rC'r'l' C T T T 1 1 C T TTTATTGGCTTTTCTCAACAAAAAGAAACTCTGACG 

TTAGAAAAAGTAGTGAAAATGAAGAAACAGTATGCACGATGCCTATTTTTCAGATAAT^ 

AGACTTTTCCCCCCTAAGGGAGATACACrGTATTTTGCTATCTGA<^ 

TGCACCTTGTGAAGATGACAAAAGGCATTTGGTTGCGAAAAGAGAACTGGG 

TCATAAGAGATTTTCAGTCC!GTTGATCCAC1TGACTGGAAA 

AAGCCGCCGCAATCGACGGTGTAGAGGATGAGCGAGAAATTATCTAGTAGTTGCTCTAGGCCTGTCTTAG^ 

ACATCTGGCTTCCTGTAGGTACAACCTCAGGACATAAACTTTGACT 

GTTACCTTTTATTCATATTTCCTTTATTTTAGAAT^ 

TTTAACTGTTACACACTTCCTTGTTAAGCT 



GGATAGCGCTCTTGGGGGGTGTTTATTGCCCTATTTGGCAAAAGGC^ 
GGCTGCCACCACATO^AGATCTCTAGAATTGCAGAACATTC 

AATTCATG CATT CAAAACAG ACCAGTT ACTGTC CAC CATGG AC CAGG CAC CATC CT AG CTTTGGAGG AG ACAAAC ATGAGGG AAGT 
AAAAiCTCAGTGCTCATACTTTACTGA 

T AT AAT AG CACAG AG TTT AG AAT CCAGGAAAGGGTTTTCTG AGAA CTGTTGGGGTCTACAAACGGGGT AAAGA'I"!" ICTT AT ACG AG
CTACTCTTTCCATTGCCAAAGCCAAATTAGCTGGCGGAAAGGTGAGTTGGACAAGTGAGGTG
CAGAACAGTAGTGGCCAGCAATTTCTGTAC^TATTTACT 

CAAAGGTAAACACATTTATCGCCTCATTAAGGCTrGTTTTATGGCCACTTCGAT^ 
CTTTGTCTC^AATGTATGTAAAAGGGATTTCATGGACACTC 
5 TGTATTTGAGAAGATGTCACTGGTGATTCCTACTTGTGTACTGTGTGTAC^ - 

TGGACCAAATGAATGGACAGATGCCAAAAAAACAATTATGCTTATTAATC 
AAAAAAACCATTAGTATTTAAACCCTTTTATAGCTGAGCTAATGATTGCTTC 
CCTGTAAAGCCCAGCTATGTTATCTAATTAGTAGACTTTGATTTAAGTGGCT 
AACCAAATGGTAGATTTAAATATATAAAGATGTTGACTTGATTATTTAATATTTGTAGCAAAG^ 
1 0 TAATTCCACATCAGTGCAACATTAAAAGGGAGAGAGGATGCCAAGAATCCCATTAA 

AAACATTTACGCCAAGGAATACCAACAAGACAATATCAAGCAGCTCrCCCTAAGGATAGA 

TGCACAATATCACACAGTAAAATTAAGGACAATGTGTTCCACCAGATGCT 

TAAGTAAGTGTTTTGTATACATATTAGCTCCATATTACCTCCATGACCTC^ 

CTCTTTCCTGTACC^TGAATCCArrin'TL"rTTTACrrri ,, l'CT'rrrC , l , 'l 1 CI 1 rCTTTTCTCTCT CTCTCGCTTGCTCTTTCTTTCT 
1 5 TTCTCTCTCTGTCTTT CTTTTTTT G TT T TTTGTTTl 1 I GlTrrri'Gl^riTlTGAGACAGGGTCTCACTCTGGCACCCAGGOTGGAG 

TGCAGTGGCACAATCATGGTTCACTGCAGCCTTGACCTCACAGGCT 

CACAGGCATGCACCACCATGCCCAGATAATT'ITTTX"!' 1"! 1 ATTTTTTGT AG AG ACGGGGT CTCACT ATGTCAC C CAAACTGG TCTT 
GAATTCCTGGGCTCAAGTGATTCTCCCACCTCAGCCTCCCAAAGTGC^ 
TTCAATAAAGATGGTGCTATTTGAAACTATGGCATGTCACGTGTATATTATTTTATTTC^ 
2 0 TGC CAAGAAAGCTTC^^T ATTTGCTT AAAT AGCCAATGTGACGGAGAAT AGG T AAC CT CTTTTCTTTTTTTTTCTT ATGATTTG CT 

TTGGCTATTAGTTTTGAAACATACATGTTATATTCCCAATTAGATCTCGTTACATCC^ 
TCTAACAAGAACTTCGGTTGACTGAGTTTATATTATATAGTTTTTCTGTCCTTTGCC 

G QATTCAG C ACCACGG CGT ACG AT AG CCACCATTTGCTTTG CC CTTTT AACT^ CAACTGCTC 
AGAAGGGCTTTTTCACAAATTGTTTACCGTGGCAGTTGCTACCC^ 

2 5 TGTCAGT AT AACAAAT AATGTGT ACAATTTTTT AAAATG CTG AATTT AC CAAAT CCCCAT ATT ATGTTTTCACACTG CT CTTTCAG 

AAACTGTGCTAATACAATACTCCTATAAACTGATCTTGGTCCTGGCAGGCCCT 
TAAAAATTATACATGAGGGCCACirTCCCTTAAATTATCTATTGTATGCCAG 
GTGCTAGAAATGTCTAATTTTCTCATAATATCCATAGCAACTAT^ 
TTCACAGAAATCATGCTAGAGTTGCAGTTTCCATTTTCTT^ 

3 0 TTCATGCCATAGCACTTGAGTTATATCATAATTTCTCATTTG 

CTAAGTACCCGTGAGTGGCACTTCCGTGCTGTGAGTGCTATCAAAGGCAACCTCAGATGGATAA 
TTTGTTTTTTGTTTTGTTTTTGTTT^ 

CATGTTAAAACTGTATAACCAACAGAACCATTTAAAAGGAAGAAAAACATC^ 
CAAATGCTCCTCATTACTCCAGAGCAGTTATATTTGGATTATAATTAAAAC^ 

3 5 C GAG AATG C CTGAAGTG AAGT ATT CCT CCCAAAGTT ATTG CACCTCTTTACT AACAAGTGCTCX5G ATTTT AAT AAGC CTGTCCCCC 

CTCCTTACCCTGTACAGAGGAAAGGATGGCAAAGAGTGGCTGTGGTACAAAGGTTACATAT 

G T AAAAATGGGG AGGTGGG TGG AGG ATTT C CTTT AATTATGAAAT AAGTG TG ATGTT AG T AG TT ATTT CTGT AAC CTGGTGAAAAT 
G CTGCAT ATTTCAG G C CT AATTTTT AGCAT ATGTT ACCACGTG TGT AAT AAAACGGTG AT CTGCG CT CT CCG AATCTTTTGG CTGT 
T AAGG AG CACTCTG ATT AT AACTAGTTG CTGTAGAAACCTGGTCATC CAGATG CTT ATC AAGAATGGATT ATTATTTTTT AAAC CT 

4 0 TGGTGGAGAAGCCACAGACTGCAGCAGTTGGACAAGGTCACAT^ 

GTCCAGGGCCTGGGGGCATTAATATGC!AACGGTTCTGTTTTT CTTTCAG 
GCAATTGTTGCGGTACTGAGTGGGTGTATAGAAGCTAGGGCTGC^GCCTTGrrC 
AATGGGCCTGGCATGTAGACAACTCTGTGGGATGGATGAATGGTGCGGCT 
GCTAGCGTAGGAAGGGTGGTGAAAATTGATATGCACTATTACTAC^CGTATAAAACTTAACAC^ 

4 5 GTTCCTGTTTATTGATTTTTATGAGTTTGTTAGAAC^TGCGCTTAATGCriTTA 

GTAATAAACCTTTCCGTACTTTGCAATCGCTG^ 

ACTTGGTTTTAGGAGTGTGATAAAAAAAAAAAAAGAAAAAGAAAAGAAAATAACT 
GTCC^CTCCCGCTATGAAAATGGCCCCTAATTCTAATTACTCATATTTCCAAGAA 
GTCTCAGAGGTTGAAAGCAAATCAGGTTTCCAATAAGACAGTACATCAAACAAA^ 

5 0 TAGAATTATGGTTGGACATTTTTGGCAGACAGATTGTCTTTTGCT 

ATTAGAAGGAGAGAAGCAATCTGCCCAATTTAATATACTAAAAGAAGTTTAGAGTCTCTC 
GAATTAATTGACTTAAAATAATACAGTTAAGAAGTATAACCAGCAGGATTATGGAGAAAGGTTTCTTr 

TGCTCTCAGTGTTACACTTAAGATGAGATTGAAAiT'ITCTTl 1' 1 lACGCCAGGTGCCCCGTTCTAAGATTCAGAGCAGCAATGTGA 
ATATTCATCCTTGCAGTGGAGCTGAGAGAAGTTGTAGGGCAAATCCTC-GG^ 

5 5 TCCCATGCTTATAAGGCTGGCnXK3GCTTAATTAAAGTCAATGGATGACCAAA 

AAGAAAATGAAAGAACAACCACCOVCCCCATATCAGTCTCTTTCT 

AGCTCTTTACATAGATCAAGAATAATATTTCTTAAATCCCTTAACAACACTGACGTGTCCATG^ 
ACATCTCTAACCTACGTC!ACTTGGGTGGCCCTCACCCACAGCACCACAACA 

TCATAACATAG CAACCTTGTTCAGGATGTGACCTTTGGGTTTCAG CAACCCTCTCTTATGGAGCCCTGGCC^AACCTT AACCTTTCC 

6 0 TGTTGGCTCGGAGGGTAACCTTrCTTCATAri^ 

GGGCCAAAAGTTTCGGAGTGTTAAAATCTACGAGCCTCTTTTCCTCT 

ACGTAGCAAGATCTCTTTTTACCCTTCCTGTATAATTTTGGTTATGCTTAACTCTAAAGGAAAC 
CTTATTTCGTACCrrTAGCTGATTGTCATCTTTACATGAGCATCAGA 

GTGCTTTATGAGTTGTGAAGTGTTTTTCTCTCCCACAAGAAATTACAAC^CAAAGTGCCCTATGC^ 

6 5 GACAGCAGGAAATTTTTTAACCCTCAAAACTTTCTGAT^ 

TCCTTTACAAGTATATCATTACAGACCTGCAAGTGAGAAGGTAAATTTCCCCATG 

CAGCCCCGTGTCCTACAGAAGTGACATTTTGTTCCATTTTCATTTAT 

T ATAT AACC CAAG AAGCCAGGGATT AAAG CZACAAT ATTTCTT AAC ACAG CTT CCT CT AAG AT CT C CAGT AGCT AC CT AATGCTT AA 

7 0 AGCTTGGGCTGACGTGGGTTATGGGGTTATAGTAGCTTTATAGATATTTCTAT^ 

TATGTGTGGCTTAGGGCTATACTGACATGTTATTTTCTATGTGTTCATTTATTA 
ACCTCACTCTGGTCTCTATCATTCAAAAAATATTGCAAGATTCCC^ 
TCTTTATTCITGGTATCTCAAAATTCTG^TCITCCT 
AAATTATTTAAGTTGCAGGAACCAGACCTTTTCCCTCCTGAGTA 
TGAAAGTATGCATAAATTATTTAGTTAAACCTGAGJw
AGAAGAATGGGCAAGATTTTTTTTTCATTCTCTTAGAA

TACAAAATGGTAA riTl ' X 1 rAAAATTCCAAGCTTCATAATATCCAGTATATCTATATCATTCCAAGGTGAATACTGTGATAATCAA 
TCAATACCAGGAAGACTCCTCCAGCTGGCGAGAACTGCAAACCCT 
GTCnTTCCCTAGTGCCAACCTAAAGACTGGGGGGCATTTCCGATTCAATT^ 
5 TAGAAAGTTAGGTAATTTTAAGATGTTATGAAJA 

AAAAATACCAGAAAAGTGTGAAGTTGTGTGGTG<3TGGTGGTGGTTGGGGTGTC 
TAGATAATATTAAAGTAGCAATGAAAACAGCCCAGAAATATAGGC^ 

ATATTACAGAATGTACCACATTATGTATCTGATCTCTAAATTCGCCTTAGTATAAATGCTCCAA 
AGAATGGTATTTTTAATTGAATACCTAACCraVAATCCTGAAAGAAACAT^ 
1 0 TTATGTGAGTTATTTAGTATACAGCTGTCTGTTCATTTGTAGAATCC^ 

AAATATGGGGGGAGATTGGCATGTAAATTTGTTGAGTTTTCATGAAAAAATAAAAATTTAAAA 

TGTTCCATTCACAAATGATGGATGTCACTGATCCCATTAA 

TCTCCCAATTTCAGTGTAGTTGCTCTTGACCTTGCCTC 

CTAAACCAGATGAACAAGATTTCATACTGACACCACCATACTCTGTTAAATAGTTAAAAATC 
1 5 TTAGATTTGTACACCCAAGGTGATGAAGAGAGCACAGATAGAATCTAGAGACAAA 
TCTAACACGAGGTGATGTGGGAAAAAGTCAGTTAAAATTTCT 

CT CC C CCCTTCAAAGG TTTGTT ATGAGGAG CAAGTGGAAAAATGT CT ATT AACGGGCTCTG CTGT CAGT AG ATT AATG CAT AAATC 
CAGGTATTATTATATAATCAAACTGTCCTTTTAAATGTAGAAGAAAAGGGT 
GTTATAGAAAGTCTATAGGGACTAAAAATGCATTGTCTTCTATATGCATTTATCTTGCC^ 
2 0 GCTGCTCTGGCCAGGACAGTGGTGGAGAGCTCTGACTCTTG 

GAGGTCAGGTGACTTCTTTCTAACTACAGCAGGTTCTCAGATCCCTAGGCTGGCC 

TGGCACAGT AAG CCT AAG T CTCAGTGTCAT CTG ATTGTG AAAT CTG CTTG TTTGCAT ACT CAT ACAG T ACCAGAG CTT AAG CCCTG 
GC CACTGT ACGGATG ACTGT ACAAGAG CAG AAT CT AAG AC CAAGGGGTTGTTTTT ACT AAGGAAG AAAAATTT ATT AGAGCT 
CAAGG AAAT CT ATT AGATTATCTCACCAAATACATTAATAGCATCAATTA 

2 5 AGTATATTAAAGAATTTGTGGAAGTCCCCCATGTGAACCAGTATTAACCATGATAA^ 

AAATGAGAATTTATAAACTTGTCTAATTCTCATTGCITX3AAGAAAATTG 
TTAAAGCTTAAAAAGCCCACATACTTATCATCGAATTGTCTG^ 
ACITACATTTTAAATGGTTGTTTTCCTTTCTCTTACACTACT 
ACTTTATC^TTGTGAC^CAGTTTGACAAAAGCAAAGGAAATTCAGAGT 

3 0 GAAAAT AAAATGCTTTAAAAAAAAAAAACAATGG AACAAACHTG GG ATGG ATTT AG T CTT AGTCTT AG AGG CAG C CTT CTT AAATG 

CTG AAT CTCTGAGTTCG CCATCTTTAAGTTCTT AT AAG AGTTGCAAT AT CTG TATT C ATCAT AAATCT AGT CAG AGTT AAAGAAG A 
GTATTTGGTTAAGTACTGCTTGGAAAATATGTTGTCCTTTTTTAAGATTATGTGA 
TAATAACCTACCTTGAGATATACTTTAAGTAATAATAATGGCTTCTTGGGATTAG^ 
TCCTGTACTATTGCTAGGAGAGATGTGGGGAGGGAGAGAAAGAGAAAGAGAGAGAGACGAGAGAA^ 

3 5 TTCACTATTGCTATGCAAATTATAAACCTAACAGAGAGGTGATGTATCATTTTTATTAA^ 

TAGAAGAGTATAAAGTTAATTCCCTCCCGC<3AAGCATTTCTAAAAAATATAAATTG 
TCTTTTGGCATTTTAATTGAAGCTTCCAAACCGTATC 

AACTCTTAATTTTCTTTCAATGAAGTAGCAGATAGTCAGCCTGTAAATTTTACCTACAT^ 
CAAAACAGTGTGCAGGTTGGTGGAAATTGAGGTCCAGGCAGCAAGGCATC 

4 0 ATCAAAAGCGGTTCCTGGTTGTATGATATTCrCTTTTGATAGAG 

GAGCAGAATGT AATTTCT AG CAAT CTGACAG AC CTG CGGATTG AAG CGG CAGGTTTT CCCTTCCG ATTTCAGGG AATTGCTT CTT A 
CTGTGACAAATGATTAGCTGTTATCTCTGACrrGAATTTGATTTCTACAAGCA 
TTTTTTACAGGAAATGAAATCGGCTCCCTAGTCATATGTGGACC^ 
AATTGAAGGCAGATTTAGCATAGAGGGTGTGTGTGGTGATGTGTGTGCAGGGTC 

4 5 CACACCCATGTGTGTACATTTACAGGAACGAGGGATCATTTCATGGATTCTC^ 

CCCTACATTCTCAGCAGGGTTGAAGCTTTTAGAA 
TAAGGACATCCTGTTTCAAAAGAGTTTCAACTGCAATAT^ 

ATTCATCAAAGC^CAGCTGCATTCAAAGCAAGAGATTTAGGTAAAATAAGTTAATCCTT^ 
TTGAATAGGGAGGGGTTCTAGTGATAGCAAGCGTCCTCTGGATCATAAAACAAAAAACGTG^ 

5 0 GTG CTT CTTU' IT CG CTTTTGATGGAATCTT AAT AGTGACTACACAATT AGG ATCAGGTTT AT CCTTCTG TTT AT AGTTTGT AATTG 

CTCrGGAAATCCTGAGTTTCAGTAGTTCAACTTTTAGTTTT^ 
TCATAGTTTCTAATCCTATTACAGOU^TGTAAAATAAACTGTATTTCCCAAA 
CTTTGCAAAGAGGAAACACCATCTTTAAG^TCATACTAGTAATACTCCTTATTT^ 
CCTAGTTTAAAAGAACAGAGATTTGTGGTAATTGACGAAGAGTA 
55 ACAACATTCCTAGGTATGTACTAATATATTCTAACACGGCATTGCGCTGTCATCAAGACTTTTG 
AGCAATACTTTTGCTAAATTCTTTAGTTGTATTCCCTAAATGGCAGATACCTTTGGAC 
AGTTCTTTGAAAATAACAGTTCTGTGCTCCCTAAACTTCTTTTG1T 
CCCTTCCTCCTTGCTTTGTCGCCACAAAAG^GGTGAGGGGGTGCCGGAGTTTGAC 

CAT CAGGT AATCACCT AAGTGCACTCAGACATACT AA CAGGCCCTTTG TG TTTTT CTAAAT CTCATTTT 

6 0 AATTTATTTAAAGTGACAGTGAAGCTAGACTCCAAGCTAATGAGTCTGCATAA 

AAACTTTCTGACTCTGAAATCCTGTCAGTGAAAAATTATTTG^ 

ATATGATTTGATCATCAGAGACCTGCCCTGGGTCCAAGTGG 

TTGCATATATGTTTTAGTGCTTCATCCAGACTTGAGTGCTTCAC^^ 

TGGTCCTTTCTGACCAGCAAGCAGTTTAAGAAAGCTAAGCTGTGTTGTATGCGTATTAAATATAGC^ 

6 5 TAACCTTAGTTGTATTAACAGATTGTTTGTTTTGATGACTGCGCCGTTAGCATGCCAG^ 

AGGACCTATAATATAGAAACTAAATAAGATGATAGACTTACAAGGATATACATTTCCTGTA 
ATTTACTTGAAAGGAATATGTCTTATTACAGAAGGAGAATGCCTTGA 

GGTTGGGACCAGGTTTCCTGTGTAGCCAGCCCCCCTGTCCCCACGTAAATGTTTTGCTTTTCCT 
GGGAAGAGTAAGCCCTTCTCGCTGCCTGGCIGGTGTCAGATTAAAT^ 

7 0 AGACAGGGG^GTGAGAGTTATCAGGTTGGCGACGGAACTGCAC^CTTGTCGGACTCCAAT^ 

TGACACGATCTGCAATCATTACCGCCGGAGCAGAGCTAATTAAGAA^ 
TTGAATAATGTTGTATGGCTGAAATGTTCTAAAACCAGACAGTTTAACAAGTGTT^ 
ATCTATAATATCTGTGCTGCATCACTATTTGTTTCCTCTCC 
CTCrTCTTCTTGGCATTAAGTCAGAGCITCCCCCCT 
GT ACAG AAAAGTT ATCGG AAACTTGTGG CACATT CAAAAG CAT ATTT CCT ACCCGTCT AGGCAT*ri"rrCACAAGAAAAT AT CAGGT
CCAAAGTGGCATTTGAAGTTTCTGAATCTGA

AT ACAGT ACGAGTGAGTGGAGCTCATT AGAAACT AGC CGTGG AC CC CAG AT ATGTCAAAGGGGGG TTTG CTG AGAGGTG AC CACG C 
AGTCTT CAT AACTT C CTCATCTCTC CCACAACTATCAAACAGAT AGG AAGATGTT AT^ TTT AAAA 
AGCTTGACAAAGAAATTAATCAAAATGAGCTTCATGGACCT^ 
5 GCGTACCGATGTGTCCATATGGACAAATAGAAAACCCAAGAGTTAAAAGTTC 
TAGAAGGCTTATTTGAATATAAACTTTCTCGCAATACAGCA 
CTGCAATGGGAGGTAAGAGTATTGCTAGCTTCCCAAATGAACC^ 
ATTTCCAAGTTTTAAAATGTAAATATTCATCTATX3GA^ 

ACCT AAGAAAAAT AAAT ACT AATTTTT CTT AATG AGG CCT AAT AACACCTTTAACTT AC CATTTT AAG AG ACGCT AAGT CAAT ACA 
1 0 ATGTGCAGGTCCCTTCTGGATATTTATTTTCCTAAG 

TTTTCTGCACTGGACCTCTCTTTTCCAGATTTACCTAAGA 

T AAATTT AACAC CAATTT ATT CACTCT AGT ATTTCTAT ATGGG AACT AACAGCTT CT ATAGG AC CTCT ACAAAGTG C CAGAAAT AT 

GATCTCATGTGTAGTAATAAATTTAAATGT GlTJ PrC TTT T A CATATTCTATCCCTCCGT 

ACTCCCCCACTCTCAGTTTCTAAAATCACTTGTCCACTCCG^ 
1 5 AT AAACCAGGCTGGTCACAGTGG CTG ATGCCTGT AAT CCCAG CG CTTTGGGAGGC CAAGG CAGG CGG ATCAC CTGAGGT CAGGAG T 

TTCAGACCIAGCCTGGCCAACACATGGTGAAACCCCATCTCTACT 

ATCCCAGCTACTCGGGAAGCTGAGGCAGGAGAATTGCT^ 

CTCCAGCCTGGGTGACTGAGTGAGACTCC^GCTCAAAAACAAAA 

ATGCAGTTGTAAGCCA^CACl'l'CrrrGCATGATTCTTATTTTA 
2 0 CCAGTGGTATACCTC^AATTAATAAATGGTACTAAGACACTAAGGCGTTACTTTCTTGCATTATT 

AGAGAATAATCCAAAGGTAAAATAGTCAGATAATTTATTTTTATGTAAGTT^ 

ATAATTACATTCAGTATGTACCTTTTGTGCATTGAAATAATTTTA 

AACACTTGATGCCATGCCTTTCTTCCTAGTGTTTATGT^ 

CTATATTAGGTGGGCATCACTCAAACACnTTTCCATAGTAGATC 

2 5 CTATGTTGTATTTATTGTATTAATTCCAGTGGTAATACAA 

GAATTAATACAATAAATACAACATAGCTTACTACrrCAAGTCTAlTACACGCCATCT 
GAAACACCCAAAAGTCACTCATCCTCTTTCTTAAAGCATTAAAAGTTT^ 
CTGTACTAAGCTGGACATCTTAAGGGCTGCAGGTTrTAATATTTA 
ACCCCAGTTTTTGTCCrrTTTCCTGAAGAAAGCTGAATrrc 

3 0 CAAAGCAAGGCCAGCTTTTAAAACAGCCACCAACTCTGAGTCAAGGCATCT 

TTCAAGACCMTCTTCACCTTGCACTGAATCTGACGGGGTCTTACT 
AGCAAGGATTTCGATTTTAAAGCTGTACTTGTTATAGGTGTTC^ 

TACTGG AGTGG AAGGGGGAGCTGTGTTG CTTTTAGTTTGTCATTTAGGATGTTAAAGTTAGTATGGGTITCAGTTTG CATTTTAAA 
AATTAGAATGGTAATGAGCAAGGTGCCAGAAGGAAATAACAAAGCACATTTTCCTGAATAGCT 

3 5 TCrACTCTGACrrCCCAACCCGGCTTCCTCTCCCGCCTGCTCCC^ 

GTTTGCAGGACGAGATGCTGCCGGCCAGGCCAAGGGCAGTTATGCC^ 
GCTGAAAGCTCAGCCAGTTTCrCrTTCTGAGGCTG 

CTCTCTTAAACTGCCTGACAAAAAAAATAAGTACCTAAGTACGACATAAGTGGTACACCCA 

TCTTTTAGAGGCACCAG AT CCTCAACCTCCCCTTACTAGTGG CATTTCTCACCTGCTCATGTGGTGCCCACCTGTGGCTTGCTGGC 

4 0 ATTGCCACTGCCTGCGAGTGGGGAGATTTCCAAGTTCAAAATGAAGGCCn'GATTTTACATG 

G CT CTGTGAAGG AATT AAG GATGGCATG AAGTC CT AG TTT ATGTCACTG CTCCCTTGTTTATG IT LTT L'CCTTGACTATTCCCACG 
CCCCCTCTCCAGATTTTGCATGGCTTTTATAGCCTCTAGTAATTATATCAGCA 
TGCAACXJTAAATGTCAAAAACCATAAAGGCCCTTTTTAGCATGT^ 
GAGATGTTGCATTTTGAGGTGAGACAGATGAAAAGTGGAAAACTCAATTTCACCACT^ 

4 5 ATCATCTTGAACGTTTAGCATTTTCTTCTGTAAA 

GTCTGGG ATG CC C C CTTT CCACAG CAGGTGC CC CCAAACTC CT AAATTATAT ATTT CATG CTT AAATGTCATG ACCTTTCAATG CC 
CTGCATTGCCCTrTATCCAACTTAGAAACC^AGTAAAAATAAAATCTTACCA 
ATGTGAAAGAATGATACGGAACTCACTGTCACCAAAAAACTGTCCCTAGACrrc 
ACTAAGCACTTTCTGGGAGCTGGTCTCTAGGTGCTTTACATGTAACCTTCA 

5 0 TCCTCCACTTAATAAAGGAGGACATC^AGGCAGGTTAGATAACTTAACAAAAGTCA 

ATTGAAGCATCCTGATCCCAGAACCTACAGAAATATAAAATGTCTATCAGGCAGGAG 

TAGGOVAGGTTGTTTCCTCTCITTGCACTTCCTATTTGAAGTTGGGCC 

AAGCTGOVGTCCCTTACTCAGCTACGAGTTAGGGACC^^ 

CCATGGTACAG T 1' ITT ATGGCTCATT AATTTAT CTTT AATTGAAG AT AAGTTGGG AGAATTT AGAGTTGTG AGAAAATGTGTTC^TT 

5 5 TTCTAGTCAAAAATTCTCCTGTAGTTGTGATGAATGGACAAGATGGGAGTAACT 

AAAGAGCATATTTGTAGGCCCTCAACAAAATCTAGTGATGGAAGGAAGAACATATAT^ 
AATACATTTTATGGGTAAGCT7UVGAAGAAC CAG TT ATG CAAATTTT CTCT 
TTTTTCCATTGCTACCACTGCATAAAGCAGAGGAAGCCAGTGCAGTCAGAACCCTCT^ 
AAGTACTTAGCAGCCAGAGAAATCTTGCGTGTAAAACAGAACTGGGATCTG 

6 0 CACTGGTATGTACTAAAGTGCTCCAACTTTTTAAAAACTTTAAAATCAGTTGACT 

ITTl'riTTTTTT TTGAAACAGTGT CT CACTCTGTCAC CCAGGCTGGAGTGCCGTGGCG CGATCT CAG CT CACTG CAACCTC CATCT 

CITGGGTTCAAGCAATTCTCCCACCTCAGCCTCCCCAGTAG 

TTTTTGGTAGAGATG<3GGTTT<^CCATGTTGGCCAGGCTGGTCT 

AAGTGCTGGGATTACAGGTGTGAGCCACCGCGTCCAGCCAATTGTTAGTCTTTGGTACG^ 

6 5 TGGAAGTTTTTGAAAGAAGAAAAAAAAATGAGAGTAGATTGATTAATGACTATTAAAAATAGC 

TAAAGAGCACCAGGC^CTGGATAGATAATTGGTCTAAAGAGCCTGGACCTTGCTGAA 
AAAATGGAGTCCTGAAGATGAAAGGTGTAT LTTTT CCATATTGACTACGTGACT^ 

G CAGAGG GGATG TGGAGAGGGAAGAAGGAGGGAAGG AAGG AAAGAAGG AAGG^ G AGGG AG 
AAAAGAAGGGAGGAAGGAAGGATGTGGAAGGACGGAAAGAGGGGAAAGGATGATTCAATAT 

7 0 TTTGGGGAAGTAGACGATGGTGCATGGAGTCTCCTGAAAGCATCATATGA^ 

CATTTAGGTATGTTTTAAGATAGGTGGGCTAACTCTGAACTAGGGA 
ACITAAGTTCAGGGAAGGAAGGAAGGAAGGGGGAGAGAGAAGAAGGGAGACG^ 
TCTTCTGAAAGGCTGATGCAACAAATGCGACTGAAAAGCCAAAGTATGGATACGTT^ 
TAACTGTCTGAGACTGAAACCATTGACACAG 
TACTGCCTCAAATGACAGTGCATTTGCrGTrCAGGTAAAATT
CGATCATCTTCACCATTTTCCCCCAAAAAATGAGGGGTGCATTT^

TTAGGGGTTGTXyVAATGGAAGCTCTGGAGGGAGGAAAGGCATTC™ 

CTGTGAGACTAGGGTGCirCCTTTAGATAAATGTGTATCTGTAATCAGCTTAACTA 

GTTTT CAAAG AGAGAGAGAGGGAGAGTGAGAGGAAGAG AGAGAG AATG AATG TGG CAT 
5 TTGTTCATGCTGCAATCTACACATCAATTATCCCGCTC 

ACTCACTTTCATGCCTCTGATCAAAAAAAAAAAAAAAAAAAAAGAGGAGGAAAGAAATGATAGAAGCA 

GGAGAAATACATTCTTAGATCAAAATTTCCTTTACATATGAA 

ACAATTAAATATTTGCAAATGTTTTAATTGCAAAATATTACT 

TTCCATACTTAAAAGAAGACAGAGAGGTGAGAATTGCTGGTCTTGCC^^ 
1 0 ATT AAAAGT CAGTGACT ACT AATT ATCAGG CT ACTTTT CTGAGG CT ATTCAAG AATTGGGCAG TG AGG AG C CAGAGAG CT ACCTTT 

GTATCAAAACCTTGGTGTTCCCCCTCACCGACTGAACCTTCATCT 

AATCCCTGTTGGTTTGTGGGCTTCCTTGTCAGGGTGCrC A rrr 

TGTAGCTCATTTGATACCnKSTGGGGATGCTATTCAGG 

ATTATCATTTGAATGAAAGTGCAGATAGGCCTGTTCAGGGGCCTGGGCAT 
1 5 TGAGAGGCCAGGTCCCCCTGACATCTTTAAATCCCCGACCCCCAGCKOT 

AAGCTCCAAGCACAGATGCCATTCAGAATTGTGCTTC^ 

AAAGCTTAACTTGAATAATTTCTTTTGGTACITCT 

CAGATTTTGTGTTCTTAAAGTTATGTGGATACGTAGCTG 

TAGATGTCATCAAAGTCTTATGTTT CT ACGACT ATCAGATCAG ATGAGCTT AGT ATTG TGGTAGATG CAGT CAGAATGTTAG AGTG 
2 0 CAAAGACAATAGAAAGTTTGGATGAAGCTACCAAAAAGAAAAGTTT^ 
TCTTTGTTTTGCTACCTGGAATGGCTCCTCT 

GAAAAATTGCTATGTCATCTGAAGACCTCAAATGATGAGTCCTGAACAGTA 
AAAGCTTGCTTTCGTTCAGGACAAGAATGCACAAC^ 

GTTTTGGTGGGGAGCGAAAAGTGAAAGGAATCTGTGAGAACAAATACrcTGGTGTT^ 

2 5 AGAAAAATATAATGAACACAAGCACATACACATTTTATT^CCC 

TTTGTTACAGTCCATAATTACAACTCAGGTTACGCTITAACT^ 

TTATTGTAGCATTATAATTAATCTTACAGGAAGTATAATAAATTTACAGTATCAGAT^ 
TGCTTTTAATCAGTTCTGGCAAAATTAATTTTGCACT 

TTATGGATTCCTTTTTATTAAAATAGGGCATATGGCCCnTTAGCTAAAAAACT 

3 0 ATTTTGAAATGCCATATATCrTCCCTTTAAGATAATTCTTTCTCTTAAG 

AACTCTT ATTTCAATG ATCT ATTTTTTG AAACTG T AAT ATIT ATTTCC CATTAGC CT ACTTTT C CC CAAGG AT CT AAGT 

TTGAATAAAAACACAGGCCTTGAAAAATATCTACATTGATAAGGGACTGCA 

ATCATTGAAAAATAATGAAATTTTACTCTTTGAAAAGTCC^ 

GCAACTATGTTACAATACTCCAAAATGCTTTCTX^ 

3 5 CTTAAACTGATATCAGTGTTGTAAGTTC^AAATAGTATTGAGAAAAATATACCTA 

TGTAAGGAAAGGTAAATTATTCACCAAAAAAAATCAAGATGTCTT 
TTGAAATTTTGAATAAACTCATCACTTAAATCTTTATTTTACCT 
AATTTATTTTGCAAAAAATATTTGAAGAATAAGGGAAGAGTGATTTCTTATTC^ 
TTGAAAATTGAGTGAGCCTAACAGTACATAGCTTTCTTTTCCT^ 

4 0 ATAGATTCTGGGTAGTTCTGATGATATGAAACTAAATTTTTTAAATAATTTAACT 

TCTTCACTAAAAGAGCATATATGCAACAGTGTGACCTCTT^ 

ACCATAATAAATTGAGAAAGGCAGGTCTTGGTAAAACTTTATAGTCATGTATGAATACC 
CATTCATTTTAATTTATGTCGTTGTTTATCATCAAGGAAG^ 

4 5 AAT AATAACTTT CAGT AAGGCAG GAG TCTTAAACCTCTTAG TAT C^ CAGT ATTATT CTG CTCTG TAAG AT AATGTGTTA 

TCAATACTTCATAAGATAACTTTGGCCAGAGAATGAATAAAATTTATAGTCACTTTCAAACT 

CAT AG ATC ATGCAG CTG TCTG CCAACAGAT AATCTTGATGAGT AAAAAGAAG CT AACAGAG AAAAAGGTGG CTT AGT CTAAAAC AC 

TTAGGAAGGACTTAAAAAATTATTTCCCAAAGAATTAGGGAGTAAA 

CTTTGTTTGAGGAATTTTCCTTGCCACCTCAGAAAGTGAAAATTTACTCC^ 

5 0 TGATTGTTTAACTTGTGTAACCTGTTCCTTGAGGTAAAATGAGGAAGAAGCATATGATTTGGCCT 

AGTCCATGGAAATTACTG AACCAAACTAGAGATCTAAAGCAAAAG CC CTTTTGTG ACACAGATGAACTCAATAGTGCACTGTGACT 
ATCTGACTTTTGCCACGTCCAGG CTTAGGCCGCCAAG CGG CCAGGGT CAG CAAATTCCACT ACCTGAAAATGACAG AAGGCGGCAG 
GG AAGGGG AATTGAGG CATGGGG AG AAGCAGTTGAAG CAT ACT AGGG AGG CATG AGGTT CT CTCT CT CT CTCT CTCT CTCT CTCTC 
TCTCTCTCTCTCTCTCGTGCAGCTAAAC^SAAGAGAAGGAAGGAAAA 

5 5 TAGGCGCTGTGGTCTCCAAGCAGAGCCGTGTCTGGTGGCGGCAAACTAGATTGCTCCAAG 

TGTTGTTGGACTGTTTGTTTACATGCAGTT^ 

ATATGCCGATGTC^TCCAACCCCCAATTATTTGAACATTGTGAAGGTTAAGACATCAATCCAT 
AATCGTGACATTATTTTCTCTAGTTCATTGACCTTGTTGGCAAT^ 

GACTTGACACTAAATTAGTCAACTAAAAATTTAAAGGGTAAAGGGTCCCCCCAGCCACCCACTCCTGCCCCTCCCTGTACT 

6 0 AGAAAATTTGGACAAGGCACACCATGCGTGTGACTCAATTGTTCTGGCCAAAAAATC^ 

GGTGATTACTGGACTTTACAAAAGTTAACCACTAGTTAA^ 

TATATATACTGCTTTTGGGTACTGAGGTTGAACAATTGGTCGGGGATAATTCCATA 
AAATGTATAGAGCAGGCCTAAAATTTCTCCACTGCCTTTGAGTATACCTT^ 
AATAAAATGATCAATTGTAOUAGTATAGAAATATACCAAGCGTCT 
65 TTGACATATGCAAAGCAGCCTATCTCTCATGAGTCTGAGTTCCAGCCTCCCT^ 
CCAAACATGTAAATAAATAGCCATTCAAAGTGCAAGCATTTTT 
CCCTTTTATGTACCTTTAAAAACATTTAGCCACTTTATTCT 

TTTT G AATTT AGAAAATGTTGG CAATGTTGG AAAG AAAGACATTCAGCATTT CAAAGTTCTGG TGGGG AAGGAAAAAAAAG ATT CT 
GAATAAAACAATTTTATGTGATTTTGCAGGTGGTTTGAGCTATCATTTTATATGTACG 

7 0 TATAATACTGACCTAATTATGCATAGAAAAAGATAAGCCATTTACTTGATTCATTTC 

GGCTCCTATTTT CTG AAAG AAATTGTTCGCTTTT ATTG CAG AAAAGT AG C^ 
TCCAAAGGTTTTGTGTGTATTCGGTTAGTTACAGTAATTATACCCAATAACrc 
GTTTT CTCACACAGGCCCTGCTGCAGAACATCTGCTGAGAGATTTC 
ATTATCAAATAAATTGAATXrTCTAATTTT 
CCTGGGCAATGAGCAAAAGATCTTTTCTACCCCTm
TTTTTGGCTTCTGTGCCTCXSTAGGCTCrCCATC

ACAGGGGTGAGGAAAGGTGGTACCTCAGAACAGTTGAGCAAACCTTAAU'riT 1 1 I r TT TTTTTT CI I CCTCCCAGCCTGT 

TGTTT AGG CCTGAGCAAAT AAATGGGAGTTT AATT CAATT AGAGC CTGG CTGT AGT C AG TG CCTCATG ACAGAAATT ATTCAAT AT 
TTTTCCCCTCACATTGTTTGGTTTACACATCACTG^ 
5 TCTGTCTGCCATGTCACTGTCCCTTTCACCGTTCTCTGGTTT^ 

TAGTCAGGGTGTTCCTATGTGAGTAGGGTGGCTGCAGTTTTAAACATACT 
ATCTACTGCTAAGCCTTAAGAGAAATCCTTTGGGTAGGGAAGTGAA 

TG AG ATTT CAAAGAT AAATCTGGTCAGGGCAGT ATG AAGAATTGGACT C CATTG CATGATC CC CTTTCAGG CATTTCTTTGGGGT A 

AGAGGCAAGATTGACTGAGATTAGAATTTTTTGCATGTG^ 
1 0 AGCTACTGATGAAACCTCTACCACAAGCAGATCTCCAGCCCCCCTACTTC^ 

CCCCCTCAATGATTTCACTTCTCCCCATCCACTAAGATCTATTAAGATGAG 

CTCTGGAAGAGCTGTTCTTTCCCATTATTTCAGCTGAT^ 

AATT ATT ATTGACTTCAGTAAACTGCAGAAAACCAAGAAAGTCAC 

TTTTTATGGATAGGCTAGAACTCCAGTCTCCTTTTTCATGGAAGC^ 
1 5 AGCTGTTGCTACTTCAGATGAAGGTATGGGTTCATGCTAAGGT^ 

AAAGTGTATTTCAGTTATATCTGTATCTTAGAAAAATGTACAGTTACAGACAAG^ 

AAATTTTTATAGTTTTTCTATTTCGTGTTTATTTTTTA 

GGGCTTTGAAAGTTGTGTAACCTTTTCAGAGGAAGGTCATAGAC^ 

ATTCCGTTTATTCAAGGTTTTATTTCCAAGGTCTTATTACTATAA 
2 0 AAAAATACCAAAACAATTTTGATGGCTCAGAAGCTTATT^ 

CTCTTGAATATCCTTCAGGC^TGATAGTGAGCATATTTTT^ 

AAGTTGTTGAATGATCGTGGATTCCAATAATTAGAGAGAGTCTTCTAAAATCTTT^ 

TAGTCACTTGATGTATTTATGAAAGTCTGTGGAATTATTGGC 

TTTTGGGGGTCTTCCCTCAGCATGCCATCATGCTAGGCTTGCT 

2 5 TGACTTGTGTTCTGGAATATAGCTAAGTATTATGTCCTTGGATTGTAAGC^ 

ATTTCTAAAGGATAGAAATAACTACAAAAACTCACTAAATGAAAGGAGCGGTCAGGGGCCT 

ACTAGTTTCCTTTGGTAGGAGTTAAGTAAGGTAATCAAAACCATTATGAGTC 

TTGTAAAAAAATAAACAGATGTCAAGTCTTTGGGAAATCATAAATT^ 

GAAAGAAAGTGAATTTACTGTTACAAATTCTTCGACCAATATCTGTTTTTAGGATAGCTTGA 

3 0 TCTTATTTGCCATTATGTTTATATTTTTGATATTAAGGAAGA 

TGTTGGAGCCTATCCAGAACCTCAACAAATGTATGGAAATTTGAACCTTTTTGAA 
TCCCAGGTACCAGGGAACCTGCTAATATTTCCAGAGGGAATGAGTGX3CTGCCATCTTCCCAAGATGOT 
GAGGAGCCTCAGAACTGGCCTCTCTCCACTGCAACCATGGCATTGCAGAAATTCTGCTCT 
TACATAGCTAATCTGAAATGGAAACATTTTTATCCCT 

3 5 ATTT AT AATG CAGGTTT AATG CCTTCGAATGTTCCTGGG AT AAAAAATT AAAG TGG CCAAGCT AG TTTG T AT AAAGGCTG ACTAT A 

GT ATGTTAACAATAATT ATTTATAAC^IAT ATTTTGT ATTGGATAG CACATTCCATCTGAAT ATTTCAAATTGGCATGGTCCCTT CT 
AACTGGCATTAATTCATTCATCTTGCATCCCCAAGTAATAAAGTTATTCCCATTT^ 
AGTGACAGATGTTCAAATTCAGAAGAAAATCTCTGTTTGAGTTTGTCTTTGCCTTGTA^ 
TTAAATAAGAAATTGAACCTTGCATTTAGTCCTATGATCTTTACCCACCCAAGC^ 

4 0 AAAAAAAAAAGTCAATGATAAATCCCCCCGGATGGGCAAGAGTCACT 

GTAGCACATACTTGGAAATTGAAATTTTGTGGGGAAGACAAGGAGGAATAAAG^ 
TCCCGACACTTGCCACATGTAAAGACGTTGTAAGAAAGCAGGCCCACAAGGCCTTTAATCCACTO 
AAGCATCTAAGCATAAGAGATTTAGGAAACCAAATGATTCATAAAATTCCAAAACACTTCTF 
GGTCTCATTCTGTTGCCCAGGCTGGAGTGCAGTGGTCTGATCATAGCTCACT 

4 5 TACCTCAAC CT CCTGAGT G ACTGGGACTATTGG CATG CAACAC CAT ACCTGACTGATTTTTT ATTT CTTG TAG AG ATGGGGT CTTG 

CCCAGGCTGATCACAAACTCCTGACCTCAAGAAATTCTCCCTGCTTGGCCT 
ACCTGGCCAAAAGGACTTATTTTAAATTTGAAGTCATATTACCAGCAGAGACAT^ 
CTAAGCTGAAGGATTAGGCATGGTCTCTCTCATCATGGCCTTCCTACT^ 
CAACACAAAAATTGTGACTGGCACAAGTTCAAATCTTCCTTT 

5 0 AGACACTGAAACAATGGTATTTCAGTGATTATACCAGTTACATCACCTT 

CATTGCACATACAACTTTGTAGGCTCCTATGACAACCTCATAAACAGGGAACCA 

TCCTTGGATGATAGGCCAAGGCCACACAGCTGGTTACACGTTGGTTCTGGGACA^ 

TGCCTCTTTATCTCTATTGTTTCCAAACTATTGCAGGAAACATGGGTGTATT 

TCT AT AACTCAAATT AAGGGG AT CCATG AAAGT AACAAT C AT CAAA ACAATTT AAAATTAAGCAACAAT CTCCATTTTTG TGG CAT 

5 5 GAAAATCCCAAAAAGTCCCAGGGATTTCCATCCCTTACT 

TGTATTATCATGTATG TTTTTGTTT GGCTGTTGTACACCTAAACACATC 

G C CTTTG ACAAGCCATTTGACCT CACTGTG C CTGT ATT AC CTC CTTTGTCAAAG AGTTG ACTT CAAGTG TCCTTCT 
TTTTGTGATTATATTTCTCCAAATAAATTGTAAGCTTCCTGAGGTGAGAATGGCACTT 

TGGGTATTTAATAAATGTTTGATGATGGCATTGAACGATTTATTGGCACGCTTGAGTAGAGTTATAGGAAATCA 

6 0 TTATTAACAATTAGCTGTCTATAAGCTGCATATGTAATTTTCCTTTACAT 

GCCCCTTGACATCAGTTCTTCTGTATGTTAATACATTAGCAACTG 
AGCATATAAGTAGGAAAAGGGGTrAGTAAGGTAAC^ATTGTATAGGGCATCTTCA^ 
TAATTATCAGCAATGTTTAAATGCTATGATAGGCCATTTTTC 
CAAATTTTGCTACTATTCn'ATTACATGTCACCA 

6 5 ATTATTTGTTTTGCATATGACTGTAAAATTAATTGGCATAGTGATATAATTTAGAG 

GATTTAATATA 

HUMAN SEQUENCE - mRNA 

GGCGATCACGTTTTCACATGATGCTCACGCTCAGGGCGCTTCAAT^ 

TCTCGTCT CT CT C CT ACAGAAAAGAAAAAG AAAAAAATGT CATTAGAAGAGGCGT AACACGTC AG TCCGTCC

CCTGGAGTGGCCGAAAGAGATCAGTTCTAACCTGCTCTGCAGGAATAACGGTCCT 
CAGTTTGCTCCGGGAGCTGTTTCTTCGCTTCCACCTTTTTCT 
TCTGX5CCAAAACTACAAACAAGACTTCGCAGATCGAGCCTGCGTGCTGCOT 
lvTrrGCTCTTATCAATGAAGCAGCCGATCATGGCXSGA 
GTGGTCAACTATGACAATGTAGTGGACACAGGTTCTGAAACAG
CAACCCTCTGGACCAGGAGACGAGTCCAGCTAGTGTGCCCAACCATGAGTC

AGGAAGAGGAAGATGAAATAAGGGAGGGTGGAGTGGAACACCCCTGGCACAACAACGAGATTCT 

GAAGAAATGAAGGAAGACTATGACACTATGGGGCCAGAAGCCACGATCC^ 

TTGCACATQVGATTTTGAGGAATACTTTGCCAAA^ 

AGCX5CAGTGACACAGCCATTATTTACCC 

AATGACCTGCCACCTGGAACTCCAGATGCTTTTGCCCAACTGCTGACCT 

ACTGAAGGAGCACATCAAGTACCGCCACGAGAAGAATGAAGAGAACTTTT 

CCCAGCTCGAGCGGCATATGGTGACACACAAGCCAGGGACAGATCAGC^ 

AAATGCACAGAGTGTGGCAAGGCCTTCAAATATAAACACCATCTGAAAG 

GTGCCCAAACTGCAAGAAACGTTTCTCCCATTCTGGTTCCTACAGTTCGCAC^ 

T AAATGGC CGAATGAGAAACAAT ATCAAGACGGGTTCTT CCC CT AATTCTGTTTCTTCTTCT CCT ACT AATT CAG CCATTACC CAG 

TT AAG AAACAAGTTGG AG AATGGAAAAC CACTTAGTATG T CTG AACAGACAGGCTT ACTT AAAATT AAAACAGAACC^ 

CAATGACTATAAAGTTCTTATGGCTACACACGGGTTTAGTGGCACT 

TAGGAGTTCATCCATCTCCTCAGAGTCCAATGCAGCACTTAGGT 

AGTAATTTAAGTGAGGTACAAAAGGTTCTACAGATTGTGGACAATACTG 

TTCAAAGTTGAAAGGTTATCACATGAAGGATCCATGCTCTCAACCTGAG 

GTCTTCCGGTAGTGAGTCATAATGGTGCCACTAAAAGTATTATTGACTATACGTTG^ 

CAGAGCTTGACTACTGACTCAAGGAGACAGATCAGTAATATAAAGAAAGAGAA^ 

CAAAATGATTGAGAACCACAACATATCCACTCCATTTTCATGCCAGTTCT 

AGCATGAACGTTACCTTTGTAAGATGAATGAAGAGATCAAGGCGGTCCT 

GTTTTTGTTGATAATAAAGCCCTCCTCTTGTCATCTGTACTTTCTGAGA 

CATGTCTCTACTCAAAGCATACTATGCTATGAACATGGAGCCCAACTCCGATGA^ 

CTCAGGAATTTGTGAAGGAATGGTTTGAACAACGAAAAGTCTACCAGTACT 

AAGCCGTTAGCTCCCAACAGTAACCCTCCCACAAAAGACTCTTTAT^ 

ACCATCTATAGCAGAACTCCACAACAGTGTTACGAATTGTGATCCTCCT 

AACCAGTTGAAAAATTGGACCACTCCAGGAGTAATACTCCTTCTCCCTTAAA 

AGTTCATACACTCCAAACAGCTTCTCTTCTGAGGAGCTCCAGGC^ 

CAAAAGT ATT AT AG C CACAAAG AACAAAACAAAAGCT AGT AGCAT CAG TTT AG ATCAT AACAGTG TTTCTT CCTCAT CTG AAAACT 

CAGATGAGCCTCTGAACTTGACTTTTATCAAGAAGGAAT^ 

AGCATGAACCCATTTAGTGCCAAACCTTTATACACAGCTCITCCACCTCAA^ 

CCAGACCAGTATTCCTGGGCTACGACCATACCCAGGACTGGATCAGATGAGCT^ 

GAGCAGCTACTTTTGCTGATATGOVGCAAAGGAGAAAGTACCAGCG 

GACTACATGTCAGGCCTAGATGATATGACAGACTCCGACTCCTGTCTGTCTCGCAAAAAGATC^ 
TGCATGTGACTTATGTGACAAGACATTCCAGAAAAGCAGTTCCCTTCTGCGAC^ 
AGTGTCAGATTTGTAAGAAAGCGTTTAAACACAAGCACCACCTTAT 
TGTGATAAATGTGGCAAGCGCTTCTCACACTCGGGCTCGTACTCGCAG 

GG AGGAG CGGGAAG CGGCGG AGCGCGAGGCGCG CGAGAAAGGGC ACTTGGAACC CACCG AGCTGCTG ATGAACCGGGCTTACTTGC 

AGAGCATTACCCCTCAGGGGTACTCTGACTCGGAGGAGAGGGAGAGTATGCCGAGGGATGGCGAGAGCGAGAAGGAG 

GAAGGCGAG<3ATGGCTACGGGAAGCTGGGCAGACAGGATGGCGACGAGGAGT^ 

TATGGATACX5GATCCCGAAACGATACGAGATGAAGAAGAGACTGGAGATCACTCCATG 

AAACCAAATCAGACC^CGAGGAAGACAATATGGAAGATGGCATGTAA 

TAGTATTGTTACCTGCTTGAAAACACrGCTGTGTTAAGCrcTTCATGCACGTGCCT 

GAAGGGGCGGTTCAGCCAAGACAGATGTAGACGGAGTTGGAGCTGGGTATTGT^ 

GTTAAGGCCTAAAAACTGTGTGGTTCAGAGACTAATTCCTGTGTTTAATAGCATT^ 

AATTGCACTCTACTTATGTATCACTACAAACTTTAAAAAACTATGTCT 

CCATACATCAGTATTTTTATTATTATTATTGTTATTCCTTTTTAA 

TCrGTACTTTCCTTTCGCTATTCATCAATTTCCCAl'lT' 

TTTTTrCTGTGAAGGAACTTGAAGTGATGCATGTGTGAATTTAAGATACCGAAGT 

AAG ATGAGAAAT AAAG AAAGCCTTTGTAAGGTGGTTTTAAAAG CCTTATATGCAAACCTTTT AATCTGTGTTTCTG CAAGTGCCAT 

CCTTGTACAGTGTTAAGAGGGTAACATGGGTTACCTTTGCACCAGCT^ 

CAGTATTAGAAGAATAGGCAGCAGTTCCTTAGTTTACATATGTTTGTGCAATT 

AGTATTACACCAAACTGTTTTTGCAACAAAAAAATTTTTTTTGC^ 

TC^TTTTATATTTCCTAATTTTATTTATTTCATACTGTAGTGTACAGTATTATAGT 

AAGGAACATGACGTTGATCATTTGGGCAAATTTTACGTAAAGAGAAGAGCATT^ 

ATTTTTCAATTTTATTATTTTATTTTTGTTTTT^ 

ACACTGTATTTTCGACTGAAAATTCCACTTTCTTCATCTTGTTTTT^ 

ATGACAAAAATCTTTCCTGAATTGTCTTTGTAAAAGTATTATTGA^ 

AT AAAAATG T AGC CAAACT 

HUMAN SEQUENCE - CODING 

ATGAAGCAGCCGATCATGGCGGATGGCCCCCGGTGCIAAGAGGCGCAAACAAGCCAATCCC^ 
CAATGTAGTGGACACAGGTTCTGAAACAGATGAGGAAGACAAGCT^ 

AGGAGACGAGTCCAGCTAGTGTGCCCAACCATGAGTCCTCCCCACACGTGAGCCAAGCTCTGT^ 
GAAATAAGGGAGGGTGGAGTCGAACACCCCTGGCACAACAACGAGATTCTACAAG 
AGACTATGACACTATGGGGCCA.GAAGCCACGATCCAGACCGCAATTAACAATGGTACAGTGA^ 
TTGAGGAATACTTTGCCAAAAGAAAACTGGAGGAACGCGATGGTCIATGCIAGTCAGC^ 
GCCAITATTTACCCAGAAGCCCCTGAGGAGCTGTCTCGCCTTGGCACGCCAGAG^ 
TGGAACTCCAGATGCTTTTGCCCAACTGCTGACCrGCCCCTACTGCGAC 
TCAAGTACCGCCACGAGAAGAATGAAGAGAACTTTTCCnXaCCCTCT 
CATATGGTGACACACAAGCC^GGGACAGATCAGCACCAAATGCTAACC 
TGGCAAGGCCTTCAAATATAAACACCATCTGAAAGAAC^CCTGCGAATTCAC^G 



AGAAACAATATCAAGACGGGTTCTTCCCCrAATTCT G ' rrr C"rT C I TCTCCTACTAATTCAGCCATTACCCAGTTAAGAAACAAGTT 
GGAGAATGGAAAACCACTTAGTATGTCTCAACAGACAGGCTTACTTA 

TTCTTATGG CT ACACACGGG TTT AG IGGCACTAGTCCCTTl' A'lXi aATG G TGGG CT7GG AG CCACCAGC C CTTTAGG ACTTCAT C CA 




iGTTTAATCTCTGTAAATGGCCGAATG
TCTGCTCAGAGTCCAATGCAGCACTTAGGTGTAGGGATGGAAGCC^
GGTACAAAAGGTTCTACAGATTGTGGACAATACTGTTTCCAG^ 
GTTATCACATGAAGGATCCATGCTCTCAACCTGAGGAACAAGG^ 
AGTCATAATGGTGCCACTAAAAGTATTATTGACTATACGT^ 
TGACTCAAGGAGACAGATCAGTAATATAAAGAAAGAGAAGCTACGTACTTTAATAGATTTG

CTTTGTAAGATGAATGAAGAGATCAAGGCGGTCCTGC^ 

TAAAGCCCTCCTCTTGTCATCTGTACTTTCTGAGAAAGGAATGACAAGCCCCAT 

AAGGAATGGTITGAACAACGAAAAGTCTACCAGTACTCAAATTCCAGGTCCCCATCCCT
CAACAGTAACCCTCCCACAAAAGACTCTTTATTACCCAGGTCTCCTGTAAAACCTATGGA 
AACTCCACAACAGTGTTACGAATTGTGATCCT 

TTGGACCACTCCAGGAG T AAT ACTCCTTCTCCCTT AAAT CTTTCCTCCACAT CTT CT AAAAACTC CCACAGT AGTTCATACACTC C 
AAACAGCTTCTCTTCTGAGGAGCTCCAGGCTGAGCCTT^ 
CCACAAAGAACAAAACAAAAGCTAGTAGCATCAGTTTAGATCATAACAGTGTT^
AACTTGACTTTTATCAAGAAGGAATTTTCAAATTCAA^ 
TAGTGCCAAACCTTTATACACAGCTCITCCACCTCAAAGCGCATTTCC 

CTGGGCT ACGACCATACC CAGGACTGGATCAGATGAGCTTCCTACCACATATGGCCTACACCTACCCAACTGGAG CAGCTACTTTT 

GCTGATATGCAGCAAAGGAGAAAGTACCAGCGGAAACAAGGATTTCA 
CCTAGATGATATGACAGACTCCGACTCCTGTCTGTCTCGCAAAAAGATCAAGAAGACAGAGAGTGGCATC

GTGACAAGACATTCCAGAAAAGCAGTTCCCTTCTGCGACAT 

AAGAAAGCGTTTAAACACAAGCACCACCITATCGAGCACTCAA 

CAAGCGCTTCTCACACTCGGGCTCGTACTCGCAGCACATGAATCACA 

CGGCGGAGCGCGAGGCGCGCGAGAAAGGGCACTTGGAACCCACCGAG 
CAGGGGTACTCTGACTCGGAGGAGAGGGAGAGTATGCCGAGGGAT^

CTACGGGAAGCTGGGCAGACAGGATGGCGACX1AGGAGTTCGAGGAGGAAGAGGAAGAAA 

CCGAAACGATACGAGATGAAGAAGAGACTGGAGATCACTCCATG^ 

CACGAGGAAGACAATATGGAAGATGGCATGTAA

Table 49

MOUSE NOMENCLATURE
ICSGNM Vdac1
Celera mCG3636

HUMAN NOMENCLATURE
HGNC VDAC1
Celera hCG16588

MOUSE SEQUENCE - GENOMIC

TTTGATCCTGG AAAGG CATCAGAGG CTGGTGTCTTTGGTAAGAAGGG CTG TGT ACCCTG AACCTGGGATATGGAGGGGT CTG ATGG 

GTTTGGGGGAAGAAGCTGAAGAAGTTATGGCATGTCAGAGCAAGCC 

GAATCCAACCAGCTTGTTGATTTATTATCITTCTA^ 

GGAAGGGCAAAAAGGATCCCATAGTTGGAGATGCCTGAGCC^ATCCAC^ 
TGTGTGTAATTTCCACGGAGGTGTAGCAGTAGCAAGCCGTATCAGAGGC^

ATCCATACCCTCTCTCCTGACACCAATTACAAACCTAGGGATCCCTGAGGCCACCCTCAC^ 

CACTCCCAGTCACAGGTAAAAGCAACTCAGGAAAGGGTCAAGT^^ 

TGCAGATGGAGTCCCAGCTGGTGCTCACTGCTCTCCAAATGTGCCAACTCACACGGAGCACCATCCGCTG 
AAGCGCATATTGGGGTTCCATACTTGCTGCCTGTGAGGTCAACAC^ 
CTCAAAGCTGG T AAC CATGG GAATCTCGTTG ATTT ACACTGT AAGG C CAACT CT CT AAATGAAGAT CCG CT C CTAGAACAGC ACTT

TGTGGAAGCTAAGGGCAAAGGCCAGACCTCCTTGTAGACAAGGTTAAGAAC^ 

GCATTAGGCTGCAGAGG CCATTCTTCAGGACTG AAACCCCCCAGG ACATGGTC CTCAGCTGCACAG ACTGT 
GGACTGGCTGCCTGCTTGCATGCTGTAGTCACCTGTCATAC^ 

GGGACAGGGTGGACAAGCCAAGGCCAGGCCTGACCCCAGGGCAGCATTGATTCCAGAAAG^ 

TTGAGCTCTATGATGGAAGCAAAAAGCCAAGTTCAAAACAATGTGGAAGGAAGAACCTACCTITG

CACCTGGIXX3AGGATGTGGTTCCGTTGGCAGAGTGCrrTAGTGTCTAGCATG 

AGATCAGAAGTTCAAACTCATCCTTGGCTACCTAATAGTGAGTTTG^ 

AAAACAAAAAACAAAACAAAACAAAACAACAACAACAACAACAACAACAACAAAA^ 

GGAAGCAGAT AT AG ATGGG TCTCTTTGGGTTCAAGG CCAG CCTGG TCT^ CT AGGG.CT ACAT AGTGA 

GACTATGTCTAAAAACAACAAAATTCCCCCAAAAAACAAGAATCAAGTAATGCTTGTT

TTGTTTG 'ri"rrrrGTrrit5'rrrrGTTrri*G TTTTT CAAG ACAGGG TTT CTCTG TG T AGC C CTGG CTG T C CTGGAACTCACrTfG T A 
GACCAGGCTGGCCTCGAACTCAGAAATCTGCCTGCCTCTGCCTCCCAAGTGTTGGGAGTG 

GTCTT AAGG AG TTGGGTTGTT AAGT CAAAGG ACAG AGG CAGGT ACAG CTTTG C C C CT AGGG CAAAC CC C ATGCATCTGTT CACTTG 
TGTAGCCTGAGGAGACAATGCTATCTCTTGCAGTAGTATGGTATCTTAGCCAATGATCGATTGATC 

ACACACACACACACACACACACACACACACACACACACACACACACACATATGC

ACTT AAGGTCTCCGGTAGG AAGG CAGCCTCTGACAATTTCA 

GCAGGCAGATAAGTTTGCCACCCAATAGCAGAAACGACCGCACTTACTCCAAACCCTAATC^ 

AGCCTCTGTCTTTATTAAGATTTTTTTTAACTTATT^ 

AGGCTGGCCTCAAACTCACAGAGATCCTCCTGCCTCTGCCrCCTGAGTGCT 

AAGAGATTCTAAAATACAAGTGTGTGTCTGTGTGTCTATGCCCATGAGTG

ACAGGGACATGGGTCACGGGTGGTGGAATGCTGACGAGGGTCTTCTAGAAGAGCAG 

TTAGAGCACCCGACTGCTCTTCCGAAGGTCCAGAGTTCAAATCCCA 

GACCCCCTCTTCTGGAGTGTCTCAAGACAGCTACAATGTACTTACATATAATAAATAAAT 

AT AAAT AAAT AAAAAAG AAAAAAAAAAAAAAAAGAAG AGCAGAAAAGGT ACTT AACT C CT C CTGT C C CGGATTTG 

AGACAGGGTCTCACTCTGTAGCACAGGCTGGCCCACATCCATAGAGGCAGGGAG

ACAGGGAGGCATCTGATGATACAAAGTTTCAGCTACACATGTTACATAAT^ 

CAGT CCCG^GACAG^GATGGCTG CTGTCCTGTCTATTTGATGGAGTTT ACAATTTGTAAAC CTGGGCATGCCGGTGAGGGAGTTTC 
TAGAGTGGGTTGATT AAGGAGGG AAG CCCCGTCCTAATGCTGGGTGGCATCTG CTCAGGGG CTTAGCCTACATAAAAAGG AGAAAG 
TGGG CTCAGCCACCAACATTCATCTCTCCCTTCTTGCTGACTC CATCATGTGG CTCTTGTCATGT 

5 0 CITTCCTGTTATGATGGCCTGCCTTCTCAACTCCCAAGCCAAAATAAC 

TTCAGTAAGAACAGTCACTCATACAATGGTGGATTGTAAGTTTAAGTTAGATGTGAAGA 

TTTGTTTTCTTGGTTTTGAAAGACAAAGTTTCTGTGTGTAGCCCTGGAACTTACTCT 

TCTGCCTGCTTCTGCCTCCTGAGTGCTGGGATTAAAGGA 

TGCTCTGCAG CTCTAGG CTGGTTTTAAACATAAAATCTTGT AGCTTCCCTCATGCACACACACACT 

AGGACTATC^TTCAGTTCCCAGCACTCACGGGGAGG CACACAATG ATCCTTAACTCGTTTCAGGGGATCTCG CG CCCTCTTCTGG C 
CTCCATGGACAATG CATGCACATGGTACACAATCATCTATG CAGGCAAAACACTT ATCAC ATAAAAT AAAAATAACATATAAATCT 
CCTTTTAAAAAGTCAGGAGAGGGGCTGGAGAGATGGCTCAGAGGTTAAGAGCA 
CCCAGCAGCCACATGGTGACTTAAAACCATCTGTAATGGGATCCGATGCCCT 

6 0 CATATACATTAAACAAACAAACAAATAAACAAACAAATAAATCTTTAAAAAA^ 

ATGCTTACTATGGGGGCCTAACTAGTCTTTGGAACCTACAGAGGGAA^ 
CCTAAT ATT AATTAAAGCAATTGTTTTl'CAAAAG ATTT CTT^ 
TGTACCACATACTTGCCTGGTACCCTCAGAGGCCAGAAGAGTGTTAGATTACCTGG 
ATATGGGTGCTGGGTCCTCTGCAAGAGCGGCCAGGGTTCITAATC^CrGA 

6 5 CAAGGGAATGTAGGGATGTGGTTGGCATCTGTACAGCTTGTTGATGGAGATAGAT 

GAAATCTATGTATTAAACATGTATAGTTTTTTTTATATGCCAACCA 

TGT AT AT AC ATGTGTGTGTGT AGGC^TTGTG AATGTG TG CCACATGGGT ACAGAT AC CT ACAG A CGATAGATC 

TCCTCTGGTAGAGCAGGAAGTGCTCTTAACTACTGAGCCATCTTCAGCCCCAAG 

7 0 CAGCATTTXX3GAGGCAGAGGCAGGAGAATTTCTGAG 

ACAG AGAAG C CCTC TCTCAG AAAACCAAAAACT AAT CAAACCAAACCAAAACCAAAACAAAAAAAACCCTG CT AG CCTCGAAT CCT 

AGGACTCAGGAGGTAGAGGCAGGAGGATTTCTATGAGTTGAGGCCAGCTTG 

TATATAGAGAAACCCTGTCTCAAGAAACCAAAGTGGGTGGAGATATACCTTTCT 

CTGGGTTCAAT CC CTT AACAT CCAT AT^TTGGTT C^T AATTTTTA^ CACAAC CATCCAGAG CTCTAAGGGA 

7 5 TCCAACGCCTTCTTCTGACCTGCAGGGGCATCAGGCATGCATGTTGC^ 
GTAAGTCTTTTTAAAAAAATAATCTTT^
CACGAGGAGTTAGTCAACTTCAGCAACATACCAAGACT^ 
CATCGCTCGCCCTTGTTGGGCCTGCTATAACTTGCCAGCAGT 
CATTAATATTATGAATGAAGATTTCCTCA^ 
5 AGGGCCATTTTATTCCCTCAATCCCCAGGACTCACATGGAAAAGGGAAAGAA 
GAAAGTATAACGTAAAAACAAAATTCTAGGCCTTTAGGCCCA^ 
TAATC^TAAAACAGATAGCGACATTCCTCCACCCGCCAGCTTC 
TTGCTGGAACCCGCTATGCAAATGTGCTATTGTTAGGGGCT 
CCTTGTGACTCTTGTGACTTTGTACTTCCTTGGGACTCTTA^ 
1 0 GAGTTGTGGTTT CTT CCTTT AAAT AC C CCCTT ACCCAG CT ACTTGGGGTG CCACGGTC CTCT ACC CCTGCGTGGTGT ATGAC CGTG 

GGCCCGAGAGCGCTCTTGAATAAGAGTCCTCTTGCAGTTTGC^ 
GTCIAGAACGTGGGGGAGTCCTCACGTTGTGGGTCTTTCATGC^ 

TAGTATAAGAAAGAACTTCCCTAAGGTGAAGAAAAAAAATAC1U' , 1 , '1 ,, 1 1 r i 'I'l 'I i 1 i 1 1 rTGAGACAGGGTTTCATGTAACCTAGGC 
TGGGTTTGAGGTCATTATGTAGGCAAGGATACAGTAAACTTCTGATCC71GCCTCTC 
1 5 TTCATTATTAAGAGAAAACTGCATCCCCTCTGGATGGTTTACTTTGTTCTGTGCAAA 

AGAAAATACAATGTAAATAAATTAGGCAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAG 
AAGGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAGGAAAGCAAGCAA^ 

CAG CAGG AGAGG CCT^ATGCAGGTGCACACT AG CTGCACAGCTAATCTT AAC C CT AG CGTCAGTTTT AGTCT ATTTT CACAGCTAG 
AGAACTCC^GGGTCTCAGTGATTCCTCCCCAGAGTCAGGGGAAAGAG 

2 0 GAAGCCTGTGCATGAGACAAAAGTGAAATAGGGTGACTTATCCCAACAGACCT 

CCCCCAACAGGACT ATGGTCAGGACTGGC CCAT CCC CACAAGT CAACAAG ACAT CT CTGG ACT CACATGG AACTG AAGT AG CT CTG 
GGGACCCCAAACTCAGGAGGGGGATCC ATGAGGAAATGAG CTCAAAG CX5TCTTGCAACTGCA(IAGTGTTCTGCTrG^X j r TTCTGGC 
CCAACAGAG ATGCAAG CCTGCCGGAGG AGTTGGCAGACTC^VGG AC^TCCTG CTGTTTCCCTAAAGCTTCTGGAriTCTTAATTTGA 
ATCTGCAAGTAAGAACCCGACTGGCCAGGCTTGGATCAGG 
25 GGCTGGGCAGAGTCCAGTTTAAAAAAAAAAAAAAAAAGGTGACTTGCrTTTCC 
AATGAATAACATTTCrrGGTGCACTTGGATTTCAATCCCAGGGGCT 
CTTGTGTGGCTAATTCTI^GGATTTAAAACCTTCCCrrCTGG 
AAAGTTAAAAACAAACAAACAAACAAACAAACAAAACCCAAAA 
CTCTAGTCAACAGTTGCTGGAGAGGGAACTCTTGAGCTGCTGAT 

3 0 GGGGGCCTTTGGGGTGATCTTTTTAGTTACCTTGAGTTACCTT 

TGGGTGGAATTCITTAATCTGTCATCTGTGCCTTCTCTAC 
GCTGAGTTTTAGGTAAGTGCCTTACCAGTCCTCrrGCATCT 

GGTGGCGCACACCGCCTTTAATCCCAGCACITGGGAGGGAGAGGCAGGCGGATTTCT 
TAGTTCCAGGACAGCG&AGGCTATACAGAGAAACCCrGTCT^ 

3 5 AAAAACCCAAAAAATAAAACAAAACAAAAAAAAAAACCAAGAAAGACTCTC^ 

CCCACXjACACACAGCCCTGGGCATCAGGGCTCTGGGTCAAGTGTTAAG^ 
TTCCG ACTCTGCCTCCCTGT ACGCCCCTGTGACTC AGCAGCTTT CCCTGCCT^ 
GCCGGGTAGATAGAGCTGCAGACTTGGGAACAGGTTCTTTCTAATGCAAAGCA 
ATTCTATTTCGAAGGGCTTTCCCTCATAGTCTTAACTAGATATGGGCT 

4 0 TCTTGGTTCCCTTCCTTTAGCCCACGGAACAGCATTIT 

GGTTGTGAGGAACCACCTGGCGTCTCCTCTACTGGACCCC^ 
TGTGAAGTCTTCACTCACATGTGATTTCACAAACTCTCCAAACCT^ 
GGCCACGTGCTGAGACCAGGCTGGGAAAAAGTGTCCGCCCTCTCTACACTC 
CGTAGCGAGACTCCGCAGTCTCACAAACCCACTTTGTCTTT^ 

4 5 ACTAGGGAAGATTGCACGC^CTTTACAGTGCGGAGCCXSGCTCTCGGGAGAGTTTAATTTC 

CGTGTCGCAGGCTCCGGAAATCAAGGTTGGAGCAGAGGGGACAGTAGCCGCTCTGACCTATA 
TCCCCACTGGGCCTCCGAGTGACTTGC!AGCCTCGGCGACTCCGGCGACACTATCCGCTGAGG 
CGGAAGCrCCCCrCCCCCTCTTCCCAAATGCTTCCAGCAGCCTGGAGGCTGCAAC^GTGAC 
CCTC^OSC^CGTCAGACCCACGCCrCCCTATaSCCCCGTGC^ 
50 GCCGCAGAGGGCGAGAGCGCGGCTCXjCGAGTGGGAAGAAGACCCCGAGA 

AGCTTCGGGGTCCGGCGCGCAGAGCTGGGGCGTGTCCGCGCCGCCGCGATCT^ 
CCAGCCTCCTCGCCGCAACCCCCGCCGTAGCTGCCGCTGCT^ 

GCT ATGCCTGAGTCCCCAGGG CCTCGGGTCGCGCTCCTGGG AAGTCTCCAGGG AGG CGATCGAGGG AGGTTCGGGCGAGGGCGCTG 
GGCCTAAGCGCTTGGCCCCX5GGAAGGGTGAACCAGGGACGATCATTGCGGGCCTGGAGGGGCA 

5 5 AGGGCX3GGCGCAGAGCGGATCTCCCAGATTCTAGCTCGAGCGTTCAGAGGA 

GAGGAGCGGAAGGGCAAGGGCGGCCGCGTGGAGGCCCGGCGGGCCGCGGGTCTAGGTGACCTTCCCGGCGGCGCCCT 

CCTCCGCCGAGAGGACGAACTCCAGGCACGGCCATGCCCTGGATCCCCCAGCCCCT^ 
GGTGGCGCCGGGTCCCTCCCG03CATGCCAGTGTCTTCCCGGGCGGAACGGGCTGCC 

6 0 ACCTGGGCCTTCGACTGAGAGCGAGACCCCATAGGGAGCCATACG 

CGCCAAATGCAACTGTGCATTXTTGGAAACrrGTTTTCAGGGCCT 
GTCACACTCGTAGGAAGATTTAAAACTTCAATCTrCTCATCTC 

AGAATAAAATGATCCGATTAACACGAAGCCTTGGGGCATGCACAAGGACACCTTAGAAACCTTGAC^ 
AGCTTTTGGGGAGGGTTGAACTCATCCTCCACAGACTTTTAT^ 

6 5 ATGAATATACAGGCGTATTATTTTAATGCCATTATTACTAATGTCAGAAGCAAGCATT^ 

AGAATAATTCACGAGACACTGTCAGTTTGCAGATTGACAGCTGTCAGGCTGGAG 
AAGAGTGTGGGCTTAGCTCAGGTGTGTGCTGTCCAAAACTTCTCTTTC 
TGTTT ACT AG CAAGCCAGG CAT AACTG CGGAGGGCCTGC CTG CTTTTCT 
TCTTCAGGGACTTGGGTGGGACC^CTTGGCACACATTAGGTGGT^ 
70 ATTTAATGTGTGCATGCTTGTGCCTATGTGAGTTTATGTACATACATGCAGGTGTGAGATGCCT 
CTGGGTCTTCTGCAAAAGCAGTAAG(^CTTTTTTGTT 

TAGCCCTGGCTGTCAGCAGTAAGCACT CTT TTTT I 11 11 11 1 I' l l ' TTH 1 1 AAGATTTATTTATTATTTTATGTAAGTACACTGTA 

GCTCTCTTCAGACACTCCAGAAGAGGGCCTCAGATCT 

GGACCTTCAG^GAGCAGTCAGTGCTOT 

7 5 TCTCCATT' rr r n" l"l'C 1 1 1 CAT AC CACT CTT AACT ACCCAGGGATG GTAGGTG CACATT GAATC\iI\iGAATCTT AAAA.CAC CCATT 
agatgacctagacagctcactcacttctw3aagctatc
tcacctacagcagtagtcatgagagatgagaaacmctcctttagc^ttctacaact^ 
ggtgtttctgggcttaaagctagagagtgatatgggta 
gacaggtgtgggggtatgacagctcx1agtaaggtgccagtccccacttgtagtagct 
5 tagaatgaggctttggacaggggctgtctggggccatgcggtcctcatgc^ 

tgtcgttgtagctttgcgcttrccccaggttccttcggtacctgaaaggatggcagag^ 
ctccatcccgcagttaagtcacagcatgtgagcactttacctagggctgagc^ 

GTTCTGATCTGACAAGAGCCACAGAGTCrAGCTC^OTGAAATAGCTT^ 
GATTGATTGATTGATTGATTGATTTTGGCTTCTGTTTTTA^ 

TGG TGTTCTG CGGCCCTG CTGTGT AT ACTTGGTG CTTGGG CAC CAGTC CT C CG CT AGT ATCC CCACACTGCACTG CTGCTGTGTTC 

TGAGGAACCCATAGGTGTGAGTGCCTCTGCACAGCAGAAGTTACCAACTGGTTGGG 

TGTAGTGGACGGTGCTCATAGAAATCATTTGCCCTGGGGTGAGATAGGAGGTGTCCT^ 

AAACAATACCTCTCTATGGGTGGGAAGGACTCTGGACCCCGTGGTTCAGGGTGA^ 
1 5 T AATAGT AGAT AT ACAAGAC CAGAG CTTGG TT AGATTC CCT AG CCAT AAAT ATAG CAATGGAAACTGTCT ATTGG ACTGGCTTGTG 

TGGGACTTAGTAGAATGCTAGCCCAGCATGCAGAAAGCC 

CACTCAGAGGTGGAGGCAGGAGGATCAAGTTCAAGGTCATCCTCCGTGAC^ 

TCCTGTTTCAAAGCCAAACCAAACTGGCItKSTGGTGCACACCT 

GAAGGCAGACAGATCTCTGTGAGTTTGAGACCAGCCTGGTCTACAGAGTTTCA« 
2 0 TGAAAAACGAAGAACCAACCAACCAAAGAAACCAAACTITATTATTTATTTGTC^ 

TGTCATCTTAATAGAATTCTACTTGTGTTTGTTTCATGGCrTCTC 

CAGAGAGACTGACTTTGGGGTCTTTTACTTAGAAAGGGAGTAGGATATAGACCC^ 

AAAAAAGCGCTGACTGTGTCAACCCCCTATGCCTTGTGGTGCCTGGGCTATCCTGGAGCCCTGGTGTACCT 
CCTGGAGTGCTGCTGGCTGCCTGCATGGCCTTCTCT 

2 5 CTCCCCAGCCTTATAGAAGGCTGTGCTGCTGCTGGTGGCCCAG CTTCCAGAGGGTTTTAAAATAGAAGCAGAGC CTGG AAAGCCAG 

ACTGTCCCCCTGTG AG ACTCTGTGGAGGGGGAGGGGAAGG CAG CTGCITTGGGAACTCyVGCCTGCTTCCAGAGCAGAGGCCACTGC 

CTTGCCAGATGTGTGTGTGTCTGTGTGTGCCTGTGTTTATGTCTGTCTGTG^ 

CTGAGAGGTTGATTTTTGCCCCTGGTCCC™ 

TTATTTTAATTTTTAGATATATTTATTCATTTTATGTATATGAGTACAGTC 

3 0 ATACATTACAAATGGTTGTGAGCCACCATGTGGTTGCTGGGAGTTGAACTCAGGACCTCT 

CTGAGCCATCCCTCCAGCCCCCAACCTCTTGATTTCTTTATATGTATAAGGGTTTTCC 
TGTAGTACCCACAGAGTCCAGAAGAAAGCGTCCATCAGATCCCCTTTAACTGGAGTTAGAGAATATTT^ 
CAGGGAAGCAAACCTGGGTCCTCTGCAAAAGCAGCCAGTGCTCTTAATGGCTCA^ 
TTTGTGGGTCAATTTTTTGCCCTGGCAAACCATCCAAAACTGTTGGAGAAAGAG 

3 5 CTGGGATGGTGATGGTAACATTTGGTGGCTTTGGCTCCTGACAGTAGCTATGTACTC 

TCATCTGT AGTTCCCTGGATGGTCTG AGAG ATGCCACCGT AGGTGGCTCCTG AT^ CTTGTTTCCCCTTTG CT 

TTTGGAGATGGGGGTCrGAGCrCACTGCGTGTTTCCAGTGTTGAG 

TGTGTTAGCTTGGGGTTCTGGTGGGTGCACAGTGCCGAGTGAGGCTAGTCTTTTTT^ 

4 0 CTTGATAGCTCCCCTGGCAGTTTTTATACACATTGA 

TGTTGGTGGCTGCTCGGGAGAGGAATATGCCCCCATW 

AACCAGTATGAACACTGTCCAGACAA C1T1 TCCTTCCTGTATCTTAAAATATTACT 

GTGTATATATACTATACCATGTGCATGTCTGGTGTCCTCAAAGGGGATTGAACCTCCAGGAACAGGA 

CACCACATAGGTACrGGGAATTGAACCCTGGCCCTCTGGAAAA^ 

4 5 TCTTAAAAAATACTTTATGTGTACGTATGTGTACTACATG 

CTGGAAACCCAACCCTGATCCTGAGAAGAGTGGGCAGTGCCCTTAGCCTCGGAGCCATCT 

G ATGACATGTTGGAAGGTTGCTG ACTT CCAATGTTTTCTG CT CTTC AC AACACT ATTT CAAT AATGAAG T AGAAT AAAAAATGG AA 

ACCACATCTGAATTCCXSCATCTACTTTTCCCAAACCACATTTCrrCCCT 

GATTTCTCCCACTCACTCACAAGTTCCTGAATTGATAAAGGACCCACT 

5 0 TCTTTTCTCTCCTTTTCTTTTCTTTCTTTTTAAA 

CTCCAGTGCTATCCCAAAAGTCCCC^CCCCC^CCCTCACCCCCACTTCCCGGC^ 
CTGCACGACCAATGGGCCTCTCTTTCCACTCATGGCCTTATGTTCACTTTT^ 
TTGAACTCAGAGATCTGCCCGCCTCrGCCTCATGGGAGTTGGGGTTTTAAAG^ 
TTCATTAGGTGGTACATAATGTTCTATGTTCTCACATATAAA 

5 5 GGCTATTCGTCCCCTGTATGACCAACAATAACCTTGAACTTTTGGTCCT 

GCCCCCGTATCCATCTCCCCTC^TTGCTTTTTGATGACTCTTTG^ 

CATTTTTCTAATGATGGGCATATTGGTCACATCTAGGTTTGTGTGACACAGGCAATG 

GGTGCCTAGAACACTTAGTCCGTGGGGTTCCTGGAGTAAGAT^ 

GACTGTATTCAGATTCCGTTTCCTG ACATTTTTCCATT CTTGGGGTCGTCAGTGT ACAGCAGCTCCTGGAATAAAACCCATAG CTG 

6 0 GTTTT AAGGCCTG AAGCACCCAGGATACCCCCTGAG AGGTCCCT CACC CTGGCCTCAGCTCTTTC CTAGGGGGCTTGCTGCTCTGG 

GG ACTCCTGGTCTGGTCTG CTCTGCTCCGCGCTGG CTTGCCAGCTGCCTCTGGGGGGGTGGGGGTGGGGGTAGGGGAAGCG CC CAC 

l w r cr iT CTT^C l ' Tl ' l'T TTTAACTGAGTCGGGCTCTGGCCTGAGTCCrGGTTCATCT 
TTTTTCAT CCTTTTTTTTTCTCTCTCTTAAATTTT ATGTAT ATG AACGTTTTG 

6 5 TGATTCTAATGAAAGAACAGCAAGTGCTCCTCCCTCACCTCTGAGTCATCTCTCTAGCT 

TTTTT ' l ' T 'f X T T TT T TGAGACAAGAGTCTT71TGTCGCAT AGGCTGGCrrC?GAACT 

GGTCTTCTTGCCTCTCTCTCCTGCAGGCTGGGATTATC 

GG CTCTGTGCATGCTAAGCAAGCATTATACGAACGGAGACTCAT^ 

CGGCAGCAAAAAGCACGAAGCACACCCGIGCTTGCGGTGATGCTTTGGGAGGGCTC 

7 0 TGCGAGGTCTGAAGACACAGTTTCAGTTCTCAGAAGCTATATG 

CAGGTGCACCACGGTACACACAGGCTCACTAAATACACGTAATTAAAGATATAAGATGTGT^ 
TGCCCATCATGTGCTGGGCCTTGACTTCTTTATAGAC^TTTGGGTAAACATATGCTTG^ 
GGGTGTATGGTGGGAGGAGGGGTAGAGGGTGGGAGCACGGTGAGTGAGCAGCTGTCACTGCCATGTAGT^ 
GTTGTCCTCCCTAGGCCCCAJVGATGGAAGTTACGTAAGTGGGTTGTGTGAACrG 
7 5 AGTATGGCTGGGTATCCATTCCTTAGGCTGGAGTGGCGTTTGCATAATTTGT^ 
CTGGCAGACAGGGGGGCAGGGGAGTCGGGGAGGAGCAGGTGTCCCCCGAGCCC^^
AGAAAGAAAGGGAGCCTCACTGAGCCTGGCCCCTACCCTGA.GCTCAGAAGTTCCGACT 

CTGATGCAGTAGCTGCTGTTTGCTGTGGGTAGTG G ' rr i' rr i' I T Tt 11 11 l GT'rriTlTl"l^l"A'TCCTCTTTTGACAGGTGGCTTTCC 
ATGCTGGGTTTGAGGATTAGTGTCAACAGACTGCATTTGGGGAACACT 
5 TGGTCTAGATGCCTGCAGGGTAGGGGCrCTGGGGGGCTCTCTTTT 
GATAGGCTCAGGTAGACACAGGAAGATCAGAAGTTCAAGGCCATT^ 

GGACCCTGC CTCAAAAGGAAAAAAAAGAGG CAGGTGTGT AGG CACACTTTCAAGGTT AG CTCAGGGG TT AAGTAAGAGCACTGT AT 
TTTTGTTGTTGTTGTTGTTGTTTGTTTG'l^^ 

CTCCAGGGGATCTGGTGCCT CCTTCTGGTCTTT ACAG GCAC CTGC^ 
1 0 ATAAAATGAAATCTTAAATATTGGTTAGAGTGT 

TGATCCTTCCTCTAGGGGAGAGACAGCTCTGTCTGTAGATGTTTCTT^ 

TGATTTTATGTGTAGAAGTGTTTTTGTGTGTGTGCGCGCGCTGGGTCCCCT 

GTGGGTGCTGGGAATTGACCCCTGGTCTTCTACAAGAGCAACAAGAAGTCTTAGCCA 

CACGGATGGCTCTTAATCCAGCAGAGATGGAGAATGTTCTGGGCTCT 
15 GCCrrrCAGCCCAGGATTGCTTGGGGGTGACTTTTC^ 

AGTCCCCAGCTCTGGAAGAGCGGGTGGGGCAGTATTGGGTATATCTCTC 

GCCTACATTAGGTTCCAGCCTGTTGGTGGGCGAGAGGTAAGAG 

TCTTTAGTTATTTACTGAAGGATTGGTTTATTATGTGGAGGTAAC^ 
TATTGTTTCCCTGGAATCATTTGAAAT 

TACATGTGTCATGTCAGATGTCGCCTATGCAGATCCAGTTTGATAAATTAGGTGTACAGTT^ 

TTTGTCTCCTCACCTCTGAGACRAGGTTTCTTTGCATAACAGACCTGG 

AACTCACAGAGATCCACCTATCTCrGCCTCTTGAGGAGCTAGGATCAAAGGTGTGTC 

2 5 TTAAATGAAGGGAACCAGCTGAGGATGTAGCTCCACTGGCAGAGTGCCTGCCTGG 

TGGAGAAACTAGATGTGGTGATGATGCCTGCAAT^ 

TAGAAAGTTCCAGGCCATCCTGGACTACATGAGATCTITGTGTGTGTGTGTGTC 

TTCTGGTTAGTGAGATTATGGTTCTGGGATGGGACTCCT 

GTCCCAGAGCAAGGTTTAATTrTTTCTACACAGCTTGT 

3 0 ATTCTCTTTCCCTGACCACTGTGATATTTAACTAATATTTGCTCACTA^ 

TCCTAGGGTCCCAGCACCCATGTGTGTGTGTATTCACTCATAGATACGTAAATAAAAGAGAT^ 
AGCCTTTCATATGCTAAGCAGATGTGCTACCCTGCACTCTAGTCTCAGCT 

TATATGAGTACATTATAGCTGTCTTCAGACACACCAGAAGAGGGCATCAGATCCCATTACAGATGGTTGTG 

TTGCTGGGAATTGAACTCAGGACCTCTGGAAGAGCAGTCAGTGCTCTTAACCG CTGAGCCATCTCTG CAGCCCCCAAGTAGTATTT 

3 5 TATGTGCATTTGGGTGTTTTGCCrrGCATGTGTGAGGCTGTTGGATTCCCTGAAACTGCT 

TTGAGTGCTGGGAATTGAACCCAGCTCCTGGANNNNNNNNNNNNW 
CATATATACATGCGGAAACACATAAGGTGAAAAAGAAGGGGCTGGAGAGATGGCTTG 
GGTCCTGGGTTTGATTCCCAGCACCCATGTGGCAGCTGAGGTTGTCTGTTACCC 
TGTGTGCAAAACATCAATGCACATAAAATCAAAAGAAATCATTAAAGAAAAAGCC^ 

4 0 CACTTGGGTGGGCTTGTGGGAGTGACAGGTGGATATTTGTGAGTTCGAGGCCAGCCTGGT 

GGTCTACACAAAGAAACCCTGTCTAAAAACAAAAATCAAATCAAAACAAAACAACAG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTG CT CGTGCACAT ATG CACACGT ACATGGGCACATGCATATA 

GAGGCCAGAGGACAACTreCTGGGGGAACTATATCATTTTCTCAGGGAT^ 

AGAGCCACTGAGCCCTCTTGCTGGTCCCTTCTTTTGTA^ 

4 5 CCCTCCCrCCCTCCCTTCCCTCCCTTCCTTCCT 

CCTGGCTTTCCTGGAAGTCACTCTGTAGACCAGGCTGGCCTCGAACTCAGA 

AAAGGCGTGTGCCACCAGXSCCCGGCTATGATACCCTTCTTTTGTAGCATGTT 

TAGCTTGTCATGCTGTAAAGAAAGTTTTGTGGAGCTGGGCTTGCAGCT 

AAGATACATCG ATTGCCATGXKjAAGCTTCAATT/TAT^ CATTTTGT AATTACTGGCAAG 

5 0 AGTTCTGCGGTTCAGTATAGTGGCTGACTCCTGGGAATATGTCTTGGATGGT^ 

GCTCTGGGTCGGTGCTTGGGCCTGGCGTTGCTACAGTCAAAAGCAATCACTA^ 
GACAACTCAGGGCACACCCACATCTGGCACACCCACATCTGGATGCCTGAGAC^ 

T AT ATCAAAT AAGTTTT ATTGGTGGT AAACTAATTT CCAT AATT AC CT ATGTGTTCATT CTTTCTCGTGCTTTTG TTGTGG C AGAA 
CATGGCCGTGCCTCCCACATACGCCX^TCTTGGCAAGTCCGCCAGGGATGTCT^ 

5 5 GGGAGCAATGGGAAGCCTTTCACCTACTAAGGGAGTTGGGA^ 

TGTCCAGACAAAAGGATGAGCTTCCTGTtKSCTGTGCGCCACCCAGTGG 

TCCCTTGCAAGCTCCAGCGTCTGCCCC^TAGAACACTGTTCTAAATTTACCCTGATGGTC 

GCTGCTGGAACCTGCCCCACAGNNNNNNNNNNNN^ 

GCCCTGGCTTCCTTTAAGTATTTTCAGATAGCTAGGAGGAAAACAGTGGTG 

6 0 TTTTCCTCAGTGCTGGGGTCAAACCTITAGCTrGCTAGCAAACATTCTC 

TAACCTGATTTTGGCrGTTTTTCCrTCTTTAGGCTTTGGCTTA^ 

GTCTCrCCACTTGCACGGGACGTTATATTGTCCCTTACCAGGAGTGGGTACTGTGG CCCGGGTGG CAGAGCCAAAAA CAGTTGCCT 
GCTTGGATTGTGGCTGTTAGTGCTTGTCATCGAAGCX3CT 

C CAGTT ATTT^ATGTTGTTTG AAATCAAAACTTCT CTGTACCCACTGATCT ACCCTGATCCA 

6 5 TCCTTGATGCTTCTTATTTTAAATTTGACCTCTCCTTCCCCCGCCCCCCT^ 

AGGCTGGCTTCAAGTGACAGAAACTCTCCTGCCTTAGTCT 

ACCTAGGTTGTTTTGTTT C TT 'l A TTTAAAGACTTAATTT ATTTCTGTGCATGTGTGCGTG CCTGCAGGAGTTTGTG CACCATATGC 

ATGCAGGAACCTGKKK^GX3CCAAAGGAG^K3CATTAGATTCCCTGGA 

GTGCTAGGACTCGAACCCAGGTCCTCTATAAGAGCAGTAAGTO^ 

7 0 GTTTITTTTGTTTTTTGTITTTTGT CCCTGGCTGTCCTGGAA 

CrCTCTCTGTAGACCAGGCTAGCCrCGAACTCAGAAATCCGCCTGCCTCrc 
ACCGCCTGGCTTTTTTTTTGTTTTTTTTTTACTTATTCATTTATC 
GCTCTAGAACTCACTCTCTAGACTAGGCTGGCCTTGAACAC^ 
ATGGCACCACCTATGTTTGTAAI^tTTCATTATTATTTAGGA 
75 ATTGTTGG 
TCAGACTTTAGGGCCCCCAAGGGAAGAAG^

ACTCCTCIt3TATTTCCAX5TATGT AATTGAACTGTAGAAGT AGTTCTC CCTG 
GACCAAGGGCAGACTGGACTGTAGGCTAAGGTCTCTATCTCTCT 
AACCACCAAAGTGAACGGCAGCCTGGAAACCAAGTACAGATGGACTCAGTATGG^ 
5 ACACCCTGGGCACTGAGATCACTGTGGAAGACCAGGTAAGAGCTCCCTGCC^ 

ACATCCTGGGTCCTGTCTTTGGTTAACCTGTGC CACCTTT CTCTT ATTTG CAG CTTGCTCGTGGACTGAAG CT CACCTTTG ATTCG 
TCATTCTCGCCGAACACTGGGTAAGGTTCTCCTTACCCACGAGGATATTCTGAGA 

AGG ATTTTTGGGCTG ACATCATGCCAAAGGTAG CCTCTT AGCTGGGAGG CTGGAGGGTGGAAGGAATCX5TG TGG TCAGG ACCAGC C 
AGGGGTATGTAGTGAGATTCTACATCAAAGAACAAAAAACCAACTAGGTAGCAACAACAGTAAG^ 
10 ATATTCTGGTCTCTCGTGGTGTTCTATCCAGGAC 

AAAGATTTATTTATTTTATGTATGTGAGTACACTGTAGCTGTGTAGAGAGT^ 
AGGACCCCTG4TTCGATCTAGCTCTGCTTGCTCCAA 

TCTAGCAGTTTTTAGACACACCAGAAGAGGGTGTCAGATCCCCTTACAGATGGC 

TCCAGGACCTACAGAAGAGCAGTCAGTGCTTTTAACCGCTGA^ 
1 5 TATTTCTCTAGCAAATX^TTTTGTGTGTTCC^ 

ACTTCCCTTTTCTTTTTGTCAOVGGGT 

TGCTCCTCCTCCTTCTGTCTCTTAAGTCCTGGGAT 

GGGCTTATGCAGTCCATTTGTGACGTATGCTACGTCT^^ 

CTCAGGCCAGTGTTTACTGCTAAGATCCCACAAAGCCACTGTC 
20 * TTCACATAAGGATGCCAGTCAATGAAAGTGGCCAAGGCTGAAGGGCAGA^ 

GCCAGCCTATTTTGTACATCTGGAGGGCTCTTCTCCCA 

TGGGGCTGCCAGGTCCTCCGGCCAGAAATAGTTTAGGCITGTTGGTTATTGAC^ 
CTCTCCTGAGTGCTCCACAGGGCAAAGAAGGAACTCTTCTTTGAGTTCAG^ 

2 5 TAAATATTGGTAGTTGACATAAATCCTTGCTG 

ATTTAAATGTGGGCTGGACCTGAGACCTGAAGGACTCAGGGTCCTCATCAGTATCCAGAACT 
CTCACGACTATGGTGGCAAAAGGTCTCCTAGCATAGTCACTA^ 
GTGTATGACTATGTATCACATGTGAGGGGGAGGGGGAGGGGGGACCTGAAG^ 
TGGGAGCTGCCAGTGTGAATGCTTGAAACTGAATCTCAGTCITCTGCACTCTTAACT 

3 0 CTCTTTCavGCCTGATCTCCAGCCCCGGGGTCCGCTCTTTCCTTTTTCGCTGGCT 

TCAAACTGCTGTGTGGGTAGCTI^GAGGTACATGCnTCITT^ 
TAGCTGTCTTCAGACACTCCAGAAGAGGGAGTCAGATCTCG 

CTGGACCTTCGGAAGAGCAGTCGAGTGCTCTTACCCACTGAGCCATCTCACCAGCCCCACATC 
ACAGCTGAGAGGCCTAATAGATGCCTGTGTCAGGTTTTGGGTTAGCCGTGGGGAGAGGCTGGAAAAAGAACATC 

3 5 TTTTTGATCTCTTCCCATGGTGTACACAGTGCT^ 

GTGGCTTAGCAGGTGTTTAGGGACAACTGGCCACTGACCTCACTTCTATTTCCTGGCTTCTACAC 

TTTACCGTTCTTGCATGATTGTATCGAGATGAGTTAGCTCGGCTTTAATACACrCTTCCTTTTC 

AATCAAGACAGGGTACAAGAGGGAGCACATCAACCTCGGCTGT^ 

TGGTGCTTGGCTATGAGGGTTGGCTGGCTGGCTACCAGATGAATTTTGAGACCTCGAAGTCCC 

4 0 GTTGGCTATAAGACGGATGAATTCCAGCTTCATACTAATGTGTAAG 

GACAGAATACCACGACCAAGGCAACCCGTGAAAGAAAGTGTTTGGTTAGACCCA.CCGTTT 

CAGGAACNNNNNNNNNNNNNNNNNNNNCCC^ 

GCTTTTTTTTTTTTTTTCTTTTCTTTTTTTATTGCCCACCT 

TGCTTGGAACTGAACCTTGGTTCTCTGCAAGAGCAGCAAGTGCTATCAACTGTTGAG 

4 5 TTT C l T TT G AAATAGTTTTAAAGTT lTrriA TAGGGAACTGTGATACAAGCAGTATGTAGAGCTCT 

CCCTCTGATGTTGAAGTTTTCTATCTGGCATCACATTATAACATACTATATAATATACAGCTATTC^ 
TCAAGCACCTATAGGACTCTGGCCTCTGAGATTATTACAGAGCTC 

TATTGAGTCTGTCAAAATTGACATGAACACCACCCACTCTGTTATCTCCCCATAAATGTGGAGCAGGTAT 
CAGTATTTGGACCCAGTGTGTGCCTCTCTTGTG^ 

5 0 TTTTTAT AT AAAATATTTACTGTGAACCCACGTGCTATGTT ACGTGGGGGTG CCTTGGACATCTGTCT CTTGGGT ACATGAGGAGG 

TCATGGTTGAGTGGTTAATCATGGTCATCTAGGTTTCTGTAAGTGCACCCTGTGATGTTTGC 
GCAATTTTCAGAATGTGTCCCTGTCATTAAGCAATTGTGACTAACAAAGATCCATGC 
AACTGATGTCCTGCTGATATCATCTCTCTGGCCTGGAACCTAGTTCTGGATCCTCTATTGCATCAAGCTCT 
GCCTCTCCCTGGGACACACTCTCCATCTTTCT 

5 5 GTGTCCTTCTGTTTGGATTGGTCTGATATTTCCTC 

TGTGCTGTCTGGAAGGCCACTATTGTGTGTCTTGGCCTGT 
CGCTGTAAAGCTGTCAGTTTCCAGCTTTGTGCTTAGTA^ 

ACTTTGGGTTACATGGAAGCATCCTTTGATGCTTGCCTGTGCAATTATTGGTAGGTG 

6 0 GTGCTGGGATTAAAGGTGTGAGTCACCAGGGGAGCCATCATTCCTTTCT^ACTTATGGGC 

CTrCCTCCCTTAACCAGTTTTAGTTTTTATTTT^ 

ATGTATGTATGTATGTATACCGCTTATGTGCCTGATGCCTGTGAAG^ 

GGCTTTAATACACTCTTCCTTTTCCCTTTAGGTAAAAAAAATGCTAAAATCAAG 

CTGTGACGTGGACTTTGACATCGCTGGGCCCTCGATCCGGGGCGCT 

6 5 TGAATTTTGAGACCTCGAAGTCCCGAGTGACCCAGAGCAACTTCGCAGTTGGCTATAAGACGGATGA^ 

GTGTAAGTGTATGTGGGGGGTTGTTTTAGTTGCTTTCC 
GTTTGGTTAGACCCACCGTTTTAGTGTGTTACAGT 

AGGGCAGGCAGAAAGAGGGAACAGTGGGGGTGGTGGGAGACTTGTGAAACC 
AACAGTTIGACCAACTGAAGACCAAGTATTCAAGTGC^ 

7 0 TAAGTTTGGTTTTCAGTCATTTGGGTCAATGGGCCCCACGGCCTGTGAGCATTTTTGTTT 

CTCATTCTCTCCATTAACTCGGTTAACAGCTTGCT 
TGCGTGGTCTTGGAGGTTGAAGGACATTGATGAC^TGAG 

GC7CAGCAGATG AGAACACTTGCTG AGAGG ATAAACAGTTTG^ CCCAGAAAC 
TCTAGCTCCAGGGGGTCCATTGCTCTCCTCTGGGATTGGAAGGGGACATCCCCCCAGGCCC 
7 5 TTTGAAAAGT AACTTT ACTTGGGTTAATTCG ATGAAGT CT AAGT AAAT AG CTTCTT AA 
ATTCAGACCAATAATGGTATTAGTCTAAACAAATACIT
GTGGGGTCTGTCTGTCTGCTTGTCTGTCTTTCTGTCTGAACT 
GGAGTCTGTTGTCTTCrrCCTCATCCAAACCACAGTGCTTGCCTTCC^ 
TATCAGCTTCCCTCCTACCATCTTTGACACAACCATTAACCAAAGTGAA^ 
5 GTCTAATTTTCTAGTTTCTGCTTTCATTT^ 

GTCTtTTATGTGTGTGTGTGTGTGTGTGTATCTGTGTGTGTC 

GGATG CT CTAGAGCTG AGTT ATGGG CAGTTGTGAGCTCCCTGATGTGGG TG CTGGGAGCTGAATT C AG AGC CTTTGG CAAAGCAGC 
GAGTACTCTTAACAGCTGGACATCTCTTGCTATGC^ 

TGAACAAGAAGTTGGAGACTGCTG TCAATCTCGC CTGGACTGC^ CTT CGGAAT AGCAGCCAAGT AT CAG 

1 0 GTCGACCCTGATGCCTGCTTTTCGGTGGGTACCCGAGAATCACCCATCT 

CAGGGCAAAGACAAGTCTTGTGTTTGAGATACTTGAGACAGATACTT^ 

CTTACTCTGTAGACCTCACAGAGATCCACCTAGTCTGCCTCAGCAGCGCTGGGA 

GG C Tr T T rGTTTT TCS T TTT TTGT TTTTT^^ 

AGAACTCACTCTATAGACTAGAATGGCCTCAAACrCAGAGAT 
1 5 ATTTTGCTTGACACCrrGCAGTTTTrTTTTT 

TCCAGAAGAGGGCATCAGATCTCATTACGGACGGGTGGTTGTGAGCCACCATCTGGT^ 

AGAGCAGTTGGGTGCTCTTACCCCACTGAGCCATCTCACCAGCCCACCTGCATAG^ 

ACGCTG AACGCTTATGGAGGTCGATTGGGTAGCTTCCTTAAAGGCACATAG CAAGTATGTG CTGGGACCTGGCTGTAAGTAGACCG 
CTCTGCTATTGAG CCTCTCTT ACTT AAGTG TTTCACTCTTC CT CT C CTTTC CATTTGGTTTTCAG TCTCT AC ATC CACATT AAG AC 
2 0 ATCAATGGTCAGGATGATGTATTCCAGTGGTAGAGCC 

CAAACAGAAACAAGGCTGGGATGTAGCTCAGTTGGTAGAGTGCT^ 
GGAAACAGCAGGCAGAAGGATCAGAAGCAGGCCATTCTTGCCTATGATGTAG 
AGATCACATCTCTAAAAGAGAOUUlACCAGGTGGCCTCTTGTCr^ 
GTTTrGTTGTGAGCCTATTTGAGCTGGCTTGTGTTATCACTCTGT 

2 5 TGCAGAATGCTTCTtSCTGGTTGGTGTGCCrCACAGAGAGTCTATTTT 

GCATGCATATAAAACTGAAAGACAAAGGCTCTCCTTGTTGGGATGTTGGGAG 
GAGAGCCCTTCTTCTCCTCGCTCTGTCCTTCAAAGTTCACAGGAGC^ 
TTTTACCAGATGGCCZATGAGCAGACAGGAACACAGTTCTGATACCTGTGA 
CCTGTGGGACAGTTTrCATGGGTGAGGAACACTGTCTrCCCTTGGACTTAA^ 

3 0 CTCTAGCCTGATTGGCTTAGGGTACACTCAGACCCTAAAACCAGGTATGTCT^ 

TTTTGCTGATGTCTGCTATTGCTGAGOGACATTATAAACGAGTATATATAAGCT 
AAGTCATTGTAAAAAGAAAAAGAGAGTCTGAGAAATAGAAAGCAGAAGGAACCTGTCTGGGCrAA 
TTTTCTCCTTTCCTTTCCCCAGTATGTGATATGTGATAGCTGCTTCCAGACrc 
TGGAAAGACrAGGGTATAGGGATGtAGTGGGTCTTCT^^ 

3 5 CCR.CTCAAGGCTGACTATGTAATATGAAAATCAAGTTCrTCCAG^AAAACCATCCCGTTTGATCT 

GGAAAAGAAGTTTATTGTTTTTTTTTTTGTGTGTGTGTGTGTGTGTC 

TCAGGCTCTTACAAGATAGGGTTTCCCTCCAAGGCAATACT 

ATTGTAATGGGACTCCAGGAGGCTGGAAGGGAGGGGGATTCGG 

CCGTTGCATGTGTTC ATTCGAGGAT AGCAGCAGGTCAG^ CTTGGCCACTGC 

4 0 TGACTGCACCTGOAACAGATTCTGACCTTGATTCTGACCTTGATC 

CCTCCTCCGTCCTCTGTC^CCCCTCCCrCCC^C^CCTCTGTCTCAACTCCCC^ 

ATGG CAAG AACGT CAATGCGGG TGG C CACAAGC!TTG GCCTAGGACTGG AATTTCAAG CAT AAATGAAT ATTG T ANNNNNNNNNNNN 
NNNNNNNNCTGGCCTCGAACTCAGTGATCATCCT 

TC!ATTCCTTT(nTACITATGGGCTTTTACTGTAGGGAAGGGCAGTCCTA 
45 ATTATCTTTATATTTTGTCTATATATGTGCTTTG 

AAGGTCAGAGGATGAGCCTATAGTCCCTGGAACTGGAGTTAO^ 
TTTCTGCAAGAGCAGCTAGTGGTGCTCTTAACCGCTGAACCATCCATCT 

CATAGAT ATGT AT AT ATGCACACACAT AAATACACAC ACAC ACACATG CAGTT^ CTTCTGTG TG TG TG TG TGACAGTGT ATGT ATG 
GACATGAG AGAACAACTTGGCAGGTTTGG T1TT TT T CCTTTTT CCATGCTAGTCCCAGGG ATTCGCTCAGGTCATTAGGTTTGC CT 

5 0 GCAAGCACCT CT AC CTCCTGAG CT AT CTTG CCTG CC CCT AAT CATTTTTG ACT (^TGTAT AG ACT AATGGACATTT ATT CAGTTGC 

ATCATTTAGACGGTGGCTCAGGCTGTGACAGTGTGGCACTCCTTC^ 

CCTGTGTTTCrrGTTTTGCAGTTGTGTGAAGTTCATCT 

CTTGGCGCCTTATGTTGGGAATGCTGTTTAAAGGCGGTGCTGA^ 

CTGC^ AATT AACACTGGCTTT CTG AAACTCTGGTGAT AC CGGG AAAT CACTG ACCACC CTG AGCCAAATGTAGG CACTACTGG ATG 
55 GGACCGCAGGGCCTTCCTTACGGATTAGCACCCACCCAGGAGTTGCCTTCTACAGATAGAG 
CTGTGTGTCTCTTGAGTGCTCGTCAAATTAGCACCACTCCTTAAGTAGCTAAAA 
GGAAAATCTCCATAGCTGTGCTCTGGGCAGATCGCAGGGGGAATCTGAGTCACT 
GCACCCTTAGCTAGAATCCTATGAGAATTGATCTGGGTGGTACCC^ 
GTCAAGGGCAGCTTTTTGCTGGTTTTGTCTTTTG 

6 0 CTGGCCTTCAACTCCTGATCCTCCTGCCTTCATGGCCAGG 

AGGCTGACCTCCAAGTTATGATCCTCTTACCTCTTTGGGAGTTGGCTCCCT 
CAAGTTGTTAGGCTTC^TGGCAAGTGCCAGTACCTTAAGCCATATTACTGG 
TTGCCAATTCTTGTCITACCGATGCTAACATTTCCCCATCriTT 
GGCCCAATAATTCCTCCTAATCTCTGATAGCTTAAAGAGCCGCTTCAG 

6 5 GCCATTTCCCAAGAAACAG CAAGGCCTCTGGGTTT AGAG CCATGGTCTG AAAGCCGCAGATGAACCCTTAATGTCACTGGATAAGG 

CCAGGAATGGGCAACTATGTGTTGTCTTTTGGAGCT 
CATCTCTTTTGTAAGCCTGTGATGTAGTTCTAATTCTACTTGA 

GG TGG CCTTTAAT CCCAGTACTTC^GAGGC AAACAGATCTCTGAGTT CAAGGCT AG CATGG C CT AT AGAAAG AGTTCCAGG ACAGT 
CAGGG CTGTTAT ACAG AG AAATCCTG TCTTG AAAAG CCAAAAAAC AAAGATTTTTT^ CC 

7 0 TATGTACACAGTGCCTCGGTGTGCTTGTGGGGGTCAAAGG 

TCCTTCCTTTAAAAATACTGCATTTATTTTTATTTTATG 
TGAAAGGGATCTTTTTAGACTGATCCTCACACCCTGTTCCCATCGTGCCCTC 
ACCAAAAGTAGTCTTTAGGGTGTGGTTAGCTATTTCTTTTGTCCCA 
CAGGACTTAATTCTTCCTTGTGTTGTGGTOT^ 
7 5 CAGAGTATGAAATAGCTTCCGGGTTCTCCGGCTCTGAGCTGG^ 
GACTCCTCTrGTAGCCTCACCACTATTTTTTAGCAGTTTAATG^TACATTA

CCCTTCAAATGCTGTAATTAACATCACTTAAAATAAAACTTGAATAAAATACTGAA^ 

ACGAAATGGCCAGGGCATTGGTTTGGATTTC 

CACTTGGGAGGCTGGAGAGGTGACTCGGGGGTTTATAGTGTTTCCTGTO 

ACTTAGGGTTGATGTAGTGGCACATCTCAGTACCTTGGAGATTAGCACAATGA^ 

GAGCCCACCCT^AGTTGAGGCACTGCTGGCATTGATC 

GGAAGCTAGGGCCGCTGGCTGGATCTAAGGAGAACTC 

GGGTCATCGGCAAGAGCATCCAGCACTGTTGAACCACCAC 
1 0 GTCTGGTTGCTTTGCTCAGGTTTGAATGAGAATGTCCCCTTAGGCTCTGGCTT^ 

TAAGGCTTAGGGGGTGTGGCCTTGCCAGAGGAAGTACGTCACTG^ 

TTGTACTCTCCAATTCTTGTTTGTGGTTTGATGCCTAGGCTT^ 

CCCGGCCTTGATGGTGATGGGCTCCTATCCCTCTGGAATTGGAAGCCAAGATG^ 

AGGAAAGTAAACACAACTAGGCTGCACCTTCATTCAGCAAGAGACTTTGGG 
1 5 ATTTGTTGTGAAACTGAGTCCCTTCTTGCCGGCT 

TCTCCCGTCTGCGGTGTATGGAACAGAAAGACTCGCrrGTCrAATCCTTACT 

TACCGGCTCACGATTAGGCGAGGCTCCCCTTTATAAGGATAACTGCCATGAAAAGTAAGCT 

AGXSCTTTCCCTTGTGGGGCATGGGAGGGGGTCGTTTTGAAGTT^ 

AACTTGCCCTGAAGACCAGGTTGGACTCAGAGATTCACCTGCCTCTGCCTCTTA 

2 0 ccTGr ciTmuTimmTriTrmmTrmTO 

AACTCACTCTGTAGACCAGG^TGGCCTCGAACTCAGAAATCCACCTGCCTCrGC 
CCACGCCCGGCAAAAATTTGTAACGTC TnTT TCTr T^^ ^ 
GCTGTCTTCAGACACrCCAGAAGAGGGAGTCAGATCTCATTACAGA 
GGACCTTCGGAGGAGCAGTCGGGTGCTCTTCCCACTGAGCCATCTCACCAGCCCCCCAC 

2 5 TTGATTTATTTTTATTCATAGCTGTCTTCAGACAGACAGATCCGTAGAGGGCATCAGA 

TGTGGTTGCTGGGAATTGAACTCAGGACCTCTGGAAGAGTAGTCAGTG CTCTTAACCACTGAGCCAT CTCTCCAGCCTGCTTTTTT 
TTCTTTTTTTGAAGAAAGGTTTAATTGCTTCTATTTCACAGGTATG 
GGGGCCTCCAGAGACCAGAAGAGGGAGTCAGATCCCCTGTGACTGGAGTTACAGATG 
TGAACCTGGGACCTCTGGAAGAGCAGCCAGTGTCCTTAACCACTGAACCATCTCT 

3 0 GTTGTTTCTCCTTGTGTGTGCATGTGTGTTACAGAG 

ACTCAGGTTGTTAGG CTCGGTGGC AAATGTGTTTACCTGTTGGGCTACCT CG AGATTTTTACTAAATCGTGAGTTCCTCCACGGAA 
TAGGAATTCTACCCG CACACTCTTCAGCTT CTAT AGG AAGTGAAGTGG AGACTrCTGGTTTCTTTGGJU^AGTCAGTTATGTCAGCG 
TTTTGCTGGGGCACCCAGGTGAAAGGATGGTTTG 
ACACATGAAGTTTAGAAGGAACATAAATCTGACCCCACGGACAGTGG^ 

3 5 TTTTGCTGACATCATGGATGCATTGGTTTGCCTTACACTGCATT 

AAAGAAAGGTCCCTGCGGCTTCTTGAGG<riTCCGTGGGA 

TTGCTGATTCCTGTGTGGTATCTGCTGAGTGGACTGGACTGAGAACAACAGAGACTGGAATCTC 
TAGCTTATCCTAATAGCCTTCCTTTTCCACTATCTCTGCTGGTTGGTGGGCTAAGAG 
GAGAAAAATATATGCCTACAAGGAACATTCTTTTTTCTTGGCAGGATT 
40 TCTGGACATGTGTGGGCATACATTTGGAGTTCAGAGGACAG'ITATTGGAACCATTTTCCCCCTA 
GAAACTTAAGTTCAAGGTTGTGGCAAGTATGATCTGCTCTTGTGTGAGA 
GTATAGACX3TGCAGCTAATTCTTGAGTGAGTGCTTAGGCTCTGACCAGAGACT 
GGCTCAGTGGTTAAGAGCAGTGACTGCrCTrCCAAAGGTCCCGAGTTCAAATCCCA 
AATGAAATCTGATGCCCTCTTCTGGAGTATCTGAGGACAGCTACAGTGTACTTACATATAATAAATA 

4 5 AAAAAGAAAGAAAGTGGTTTCCCTGTCCATGTAGTTGAAGGC 

AAGTATGATGGGATGTCATTCACCAGGCACTTATTACATCATGCCTGCA 
AATGCCTTGAGTTC^TAGTATCTCATTACTGAAA^ 

CAGAGGACCTGGGATTGGTTCCCAACATCCACGAGGCTGCTCACAACTGTCTGCAACTCCC^ 
CTAGTCTCTGATAACACTACTCCCATGCCTAAGCCTCGAAAGATGAGACATATACAC^ 

5 0 TCAGCTGATCCATCACrGAGGACAGAAGAACAAAGTAGATTAGGGAGGATGATAGAGGGTGTTC 

CATCTCCAAAGAGGCAGAAGCTGGAATTGAGATTAGTCTTrGTAATGGTTTGTATAGGCT 

GGTGTGGCCCTGTTGGAGTAGGTGTGGCCTTGTTGGAGTAGGTGTGTCACTGTGGGTGTGGGCT 

CTGGAAGCCAGTATTCTGCTAACAGCTTTCAGATGAAGATGTAGA^ 

TTGATGATAATGGACTAAACATCrGAACCTATAAGCCAGCACOVATTAAATGTTGTCCTTATAAAAGTTG 

5 5 ACACACACACACACACACACACACACACACACACACAC^ 

AAAACCCTAACTAAAATAGTCTCrrAAACACTGGTATTTTCTAGACTGGCAG 

CTCTGCTAATCTGGAAGGTTTGGGCACCATTCCATGTACATGGGCTTTATGGCAGACT 

GGTTTGTTTAACAGAATCTGTGTGCTTTTAAGATGTCTTGGGGTCA^ 

GGG AATTGCAG AGACAGCAGTG AAGC TG TTGGGGG CAC AAT AAAT G C CAACTG CT CATTTT ACTG AAAATT AGTG AAATGAAC AT A 

6 0 TATGAATTGAAATCTATAAGAAACATTCACACAACATACTCACAGATGGAAACT 

ACACTTCTC Tl ' Cn^T C lTrn XT rT CrTTCTTTCTT^ 
TTCCTGTGTGTGTGGGCAGGGAAAGKSGGGAGAAAGCATTTCTCTGTGTAC^ 
TGCCCTTGAACTCAGATATCCACCTGCCTCTGCCT^ 
TTAAGGGTGCATCAATAAACAAAGTAAGGTCAAGTTGGC 

6 5 ATTTAACCGGATGATAGTGTTATAAAAGAAAAGCAAGGAAATTGGCTAGAAAGTCAG 

AAATGTTCC^AAGACATACGTCTATCACTITTTACATTTTGTAAAA 
AGGTGAAGGTGAAGGAGTTCAAGGTTATCCACTGGGTAGAAAGGGATTTTGAA 
CACAGTCAGCTATTGGATGGAAC^CAGGGCCCCOVAAGGAAGAGCTAGA 
TAGGTAGAACAACAATATGAACTAACCAGTACCCCCGGAGCTTGTGTCrrCT 

7 0 CGGCCATCATTGGGAAGAGAGGCCCCTTGGTCTTGCAAACTTTATATCCCCCACT 

TGGGTGGGTAGGGGAGTGGGGGGGGGGGGATATGGGGGACTTTTGGGATAGC^ 

AAAAT AAAATT AAAAAAAAAAGGAATTTTGAGG C CAG C CTGGGCCACTTG AGATCTTGTTTCAAAACCAAAAC CAAAAGGGTG CTG 
gn_gjwwiTTATTCAACAGTTT AGAG CAATTGTGCTTGT AGAGGACCAGGGTGTGG TTCCCAGG AG CCACATGGGG TGGCTCAT AATC 
ACAGCAACCTCCAGCTCCAGGGGAGTAGACGCCCTCTTCTG^ 
7 5 ATTGTTTCTTTAATGCTTATTTTATCATTATTATTrTTTGAGAC^ 
TAGACCAGGCTGGACTCGAATTCACTTAGATTTACCTGCTT

GCTAGTTTTAATTCAGAAATGATTGCTTGCATATATGTTTGTGTACCCCATATGCCTAATC 
TAGTCAGATCCCCTGGAACTGGAGTTAGAGGTGGCTGTGAGCTGTTTAATCTG 
GTAGO^TGCTCTCAACAGCTGAACATTCTCTCCAGCT 
5 AGTTGTTTT ACTT ATGAGTTT CTG CAT CAGTT AT ACGTCT AGTGTTTGAGGATGCCT AAAGAGG G CACCGT AT ACCCTGGGACAGA 
TGGGTGTGAGTCCTCACATACAGGGGTGCTTGGAATCG^ 
CTGAAATACATCTCAAAGATGACAACGATAAACCACAGAGCAA 
TT TTTTTn 1 rriTTTriTTAAGC^CCCTTTT 
ACGT AG CTG CTT 


MOUSE SEQUENCE - mRNA 

CCGTAGCTGCCGCTGCTCCCGCCGTCACCGCCTCCGAGAACATGGCC^^ 
GATGTCTTCACCAAGGGCrACGGCrTTGGCTT^ 

AGGCTCTGCCAACACGGAAACCACCAAAGTGAACGGCAGCCTGGAAACCAAG 
1 5 AGAAGTGGAACACAGACAACACCCTGGGCACTGAGATC^ 

TCATTCTCGCCGAACACTGG*3AAAAAAAATGCTAAAATCAAGAC^ 

CITTGACATCGCTGGGCCCTCGATCCGGGGCGCrCTGGTGCTTGGCT 

CCTCGAAGTCCCGAGTGACCCAGAGCAACITCGCAGTTGGCrATAAGACG^ 

ACAGAGTTTGGTGGCTCCATTTACCAGAAGGTGAACAAGA^ 
2 0 CACTCGCTTCGGAATAGCIAGCCAAGTATCAGGTCGACCCTGATGCCT 

TAGGGTACACTCAGACCCTAAAACCAGGTATCAAACTGACGTTGTCAGC 

AAGCTTGGCCTAGGACTGGAATTTCAAGCATAAATGAATATTGTACAATCGT^ 

CAGAATTTAGTGTACCTTTTAATGTTGTATGTTGGGGATGCGAGAGTT 

CGGCITTAAGGTGTTTACCATTTCAGAGGTACAGCAGAAACCCC^ 

2 5 AGGAGCCCCTCTAGAGATGCCAGGCTACAAGTGGAAAGCTGGGAA 

AATTGTGACTTCCCGAGCATCGAACCCTGGTGTCCAGATCCTATCTGCT 
TAGACTTGATCCTCACACCCTGTTCCCATCGTGCCCn^TTCCCATCCrAG 
TAGGGTGTGGTTAGCTATTTCTTTTCTGCCATTTTAGGGTGGAG^ 
CCITGTGTTGTGGGGTGGTTTTCITTCTCCrCCTCCT 

3 0 GCTTCCGGTTCTCCGGCTCTGAGCTGGG<ZGGTGATTGTGGTCACACCCT^ 

CTCACCACTATTTTTTAGCAGTTTAATGGGTACATTAT^ 
AATTAAGATCACITAAAATAAAACTIGAATAAAATACT 

MOUSE SEQUENCE - CODING 

3 5 ATGGCCGTGCCTCCCACATACGCCGATCTTGGGAAGTCCG 

TGATTTGAAAACGAAGTCAGAGAATGGATTGGAATTTACGAGCTC^ 

TGGAAACCAAGTACAGATGGACTGAGTATGGGCTGACGTTTACAGAGAAGTGGAACACAG^ 
GTGGAAGACGAGCTTGCTCGTGGACTGAAGCTCACCTTTGATTCGTCATTCTCGCCGAACACT 

G ACAGGG T ACAAGAGGG AGCACATC AACCT CGG CTGTGACGTGG ACTTTG ACATCGCT GGG CC CTCGATC CGGGGCG CTCTGGTGC 

4 0 TTGGCTATGAGGGTTGGCTGGCTGGCTACCAGATGAATTTTGAG 

TATAAGACXX5ATGLAATTCCAGCTTCATACTAATGTGAA 

GTTGGAGACTGCTGTCAATCTCGCCTGGACTGGAGGAAACAGTAACACrCGCrrTCG^ 
ATGCCTX3CTTTTCGGCCAAAGTGAACAACTCTAGCCTC 

TTGTCAG CCCTGCTCGATGGCAAGAACGTC AATGCGGGTGGCCACAAGCTTGGCCT AGGACTGGAAT^ CAAGCATAA 


HUMAN SEQUENCE - GENOMIC 

TATATTGCATGGTGCTGAGGTTTGGGGTATGATTGAACCT^ 

CTTGCCCCCCTCrCrCCCTCCCTCCCCTCTCTTGTAGTCCACAGTGTCTAT^ 

TAGCTCCCACITATAAGTGAGAACATGCAGTATTTGGTTrTCTATTCCTGCAT^ 

5 0 CCATG TTGCTG CAAAGGACATG ATTTCATT ATTTITTA C CG CCATAT AT ATGT ACCAT ATTTT CT T TATTC 

CATCCACCACTGATGGGCACCTAGGTTGACTCCATGTCTCTGCTATTGTGAATCACGCTC 
TTTTGGTAGAATGATTT Al ' rri 'ClTT T ' 1 ' T ' l GAGATGAAGTCTCACrCT 
GCTGAAACCTCCACCTCCCAGXaTTCAAGCGATTCTCCTGCCTCAGCCTCCTGA^ 
CGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACTATGTTG 

5 5 CCTCAGCCTCCCAAAGTGTTGGGATTACAGGTGTGAGCCACCGCACT 

TATTTTTAGTTCTTTGAGAAATATCCAAACTCCrCTCT 

TCTTTrCTCTGCAGCCTTTCCAACAACT ATr 1"! 1 1 AACITTTTAATAAT ACCCATTCTGACCGGCGTGAGGTGGCACT CATTGTGG 
TTTTGATTTGCATTTATCTGATGATTAGTGATGATGAGCATTTTTTGATGTTTGTTGACC^ 
TTTTGAGACGGAGTCrCACTCTGTCATCCAGGCrGGAGTGTGGTGGC^^ 
60 AAGTGATTCTCCTGCCTCAGCTTCCTGAGTAGCTGGGATTACAAGCG 

ACTCGGGGTTTCACCATGTTGG CCAGGTTGTCTCAAACTCCTGACCTCAGGTGATCC 
TTACAGGCGTGAGCCACCATGCCTGGCTGTATGTCTTCTTTTGA 

TCGGATGGGAGACGGAGTCTCACTCTGTCACCCAG^CTGGAGTGCAGTGGCACGATCrCA 
TTCAAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGT 

6 5 AGAGATGGGGTTTGACCATGTTGGCCAGGATGGTCTTGAACrrCCTGACCTCAGGT 

GGATTACAGGTGTGAGCCACCGTGCCCAGCCCTCTGCCCACTTITTAATGGGGGTCTT 
TCCTTATAGATTCTGGATATTAGAGCTTTGTTAGATGCGTAGTTrc 

GTTGATAGTTTC'TTTTGCTGTGCAG AAG CTCTTAAGTTTAATTAGGT CCCACrTCTCAATITTGTTTGTTTGTTTGTTTGTTrTTG 
AGACACAGTCTTACTCTGTGACCAAGGCTGGAGTGCAGAGGTGCGATCATAGCT 

7 0 TCCTCCCACCTCAGTTTCCTGAGTAGCTGAGACTACAGGTGTGCACCACCAT 

GTTTTTGCGATGTTGTTCAGGCTGATTTTTGTTTTGCTGTAACT 

GATG CCCAGGAGGGTATTTC CT AGG rriTCT T CT AGG ATTTTAATTG TTTG AAGTCTT ACATTT AAGTL"1TTAATCCX3 TCTTGAGT 
TAATTTTTGT AT ATGGTG AT AGG CAGGGGT CCAGTTT CATT CTT CTG CAT ATGG AT AG CCAGTT ATTC CAGCACCATTT ATTAAAT 
AGGG AGTC CTTTC CCCATTG CTT ATTT77G TCC AACTTTG^ TT CTC 

7 5 TATTCTGTTCCATTGGTCTATGTGTCTGTTTTTGTACCAGTACTATGCT 
CAGGTAATCTGATGCCTCCAATTCTGTTATTTTTGCTTAGGATCCTTTGGTTATT
ATAATAGTTTTTTTTCTAATTCTGTGAAAAATGATGTTGTTA^ 
ATGGACATTTTAATGATATTGATTCTTCCAATCCATGAGCATG^ 
AACTGTGTCTTTAC^TGGTGGAAAGGGTGAACAAGCTCT^ 
5 AATAC CGTCAT CTTGGG ACTC AGG ATTT CAACAT ATCAATTTTGGG AGT AAACATT CAT ACT ATAGCCC CATT CT AAT AAATCATT 

TCrAGAATTGTCTTTTAGAGGTTGTATATTAGTTTCCTATTGCTC 
TCAGTGGTTTAAACAACTCAGATTTATCATTTTACAGTTCT 
GGGCATCAGGGCTCCCTCCTTCTGGAGGCCTTGGGAGGAGAATCTGTTTCCT 
CTTGGCTCATGGCTCCTTCTTACATCTTCAGAGCTAGO 

GACCCTCITGTCrrCCrrCTTATAAGAACTGTAGGCCGGGCAT^ 
GGTGGATCATCTGAGGTCAGX3AGTTTGAGACCAGCCTGGCCAACATG^ 
GGCCTGGTGGTGGGCACCnGTAATCCCAGCTACTTGGGAGGCTGATG^ 
GAGCTGAGATCACGCCACTGCACTCC^GCCAGGGTGACAGAGTAA 
1 5 ACATTCTTGGGATTACATTGGGCACACTTGGATGATCT 

TAAATGTAOW\TCTAGAATAATCACACTCATAGAAGCAGAGAGTACAATG^ 

GGTGATGATCAAAGGGT AT AAGGTTTAGTT ACGC CAGAT AAATAAATT CTGGAGATCT ATGCATCTG ACAAAAGG TAT AAT AT CT A 
GAATCTATAAGGAACTTAAAGAGTTCAACIAAGCAAAAA 
CTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGCAGGCA.GACT 
GCAAAATCCCATCTCTACTAAAAACAAAGAAAAAAATGTGGGCAGAGGACATGAAC^
GCCAAGAAACATATTAAAAAATGTTCATCATCACTAATC^ 
TATACTTAGATTTGCCAAGAGGATAGATCTTAAGTGCCCCTACAAA^ 
AGGTGTTGGGTATGTTTATATGCTTTACGGTGGTGATGGTTTCATGGAAATA 
AAATATGTACAGCTTTTTGTATGCAATCCTATCTCAATGAAGTAGTTTITGT^ 

AGTGAGTAGGATGTGGATATCCTTGGGGGGCCATTGTTGTACTTACCATAGTTTGCCT

AATACTGAACATAGATTGCCTTCCATTAACATTGTAACAGAAA 

ACTVAAGTCTCGCTCrrTGTCCCCCAGGCTGG AGTGCAATGGTGTGATCT CAACCTCTG CATCTCGGGTTCAAGCCAT 

TCTCCTGCTTCAGCCTCCCGAGTAGCTGGGATTACAGGCGCCTACCACTATGCCTGGCTAA 

GTrTCATCATGTTGACCAGGCTGGTCTCAAACTCCTGATCTCAGGTGATCTGCCCGCCT 

GTGTGAGCCACCACGCCCGGACCCCCACAGGAAGTTTTTGTAACCA

GGAAACCAGGTTGTGTTCCGTCCTGGGCTGCTGTCCATTTTCCTGCAGCAGGCAT 

GGTT CC CAG CATCAAGGGGGGTGG AAT AGGATT ATG ATTCTG AT CG CAT AT AG C CT AATTG CTTTTCAGAAGG ATTG T ATCACTTT 

ACACATCTCCTGATGAACCTCATATGACTTTCTTAAAGCTGAAAAAAGTT 

AAGCATATTTGGAGGCTGGGCCTGGTGGCTCATGCCTGTAATCCCAGCACTTTG 

GGAGTTTGAGACCAGCCTGGCCAACGTGGCAAAACCACATCTCTACTAAAAATACAAAAAT^

GT AGTCACAGCTACTTGGG AGGCTGAGGCATGAGAATAG CTT AACTCGGG AGGTGGAGGTTG CAGTGACCCGAGATGGTGCCACTG 
AACTCCAGCCTGGGTGACAG^GGGAGACTCTGTCTCAAAAAACAAAACAAAACAAAACAAA 
TAAAAAGAGCCAAAGGAAAAAATCACTCATAGTCTCATAAATAGAAACACA^ 
CAATTCTC^CTGGGTGCAGTGGCTCACACCTGTAATCCCAGCA 

AGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCT

AGCTACTTGGG AGGCTGAGGCAGAATTGCTTGAACCAGG ACCTGGGAGG CGG AGGTTG CAGTGAGCCAAGATCATGCCACCGTACT 
CCAGCCTGGGCTACAGAGAGAGACTCCATCTCAAAAAAAAAAAAA^ 

GCTTCrTGGAATTGTTTTAGATTAGATCTAGGACACCT 

GATGCTTTCTGTCCTGTTTCCCCAAGAGATTACTGCTCTCACT

GTCAGGGCACAGTGTGGGGAGTGTGGCCTGAGATCCITGACCACAAACACGCCAT 
GG AGTTCACTGGAAAATGATTGGAAAGTTAGGTTTTATGACAATG 

ATATCTTTCTCTTTAAAAATTAATTTACATAAATGACTACCATTTCTGCCTTCATGAACTTCATCT 
CAGACAATGCTAAGAAACCACATAGAAATCC^GGGACATCTGGCTCTGAAAAGAGA^ 

GAAAGGGGGGTGGAATAGGTTTATG ATTCTGATCATAAATTGG CTrGCrTTCCACTTTGATAAATATCTACAGTAG AAG CCCT AAGG

CTGTAAGTTAGAGATAAGACAAAAACCAAAAACTCCCrAAAATGAAATAGTCTTACATTGGTAACCC^ 
AAGAAAAATCTCATTTTCCCTGGATGTTTTACATTGTTCTl'L'rrG 
TCCAAACTGGACTAGGCAAAACAGGAGGCCGTTGGCTCAGGCAGCTC 
GACCTCATCTCTGCCTTTCTCTTATGGGCTTACTTCA.TTTTCAGGGAGTCT 

CATTGGTCCATCCCTGCGGGG CGGGGATCGGGGGCGGTGGGAGGCCAGG ATGTGGGGGCGG AGAGGCGGGGG CAGGAGAGGT ATAC

AGGGCTCTAATTGGTCAGACTGGGCCATGTGGCCACTCCCAGGAAGGCTT^ 

CTGACCAAAAGGAGAGGGAAACAGGGTC^TTCGCCCAGAGAAAAACTGAGGTT AGGATTGTCCTGTCCCCACAAACCTCCTTCTCC 
ACCCACCAAGATTTGACTGTCTTCGCCAGGGACTTC 

CCACGCATGGCAAGAACAGGGGGGCACTCAAGGAACAC^GGCCCAGAGCCACCCTCTG 

AAAAGCCCTCGTTAACTTCGCAACACCGACAGGGATGCCAGACCCAGGTCTCCCCAAGG^

CTGCCATACTCATCCCTGGGTTTCTCAGTTCAGGTTTC 

CCCAATCAGCTGTGGCAGATGGAATGGAGGTGGGGCTCAGGGGGTTTCCCCTCTCCCTCAG 
GTGATTATGGCACCTCCAGATCTCAGCTGATTTTGTTTTTTAA^ 

GTCCAGGCTGGAGTGCAGTAGTGGG ATG CTAG CTCACTGCACCCTCGAACT CCTGGCCTCAAGCCATCCTACCTCCTTGGCCTCCC 

AAACAGTTGGGATTACCCACATGTGCCTCTGAGCCTGGCCCAGTTACTGATGAGTGCCT

TCTCCTGCCTTAGCTTCACGAGTAGCTGGGATTATAGGCATGTGCCACCATGCCCGGCT 

GTTTCTCCATGTTGGTAAGGCTGGTCTCGAACTCCCGACCTCAGGTGATCCAGCCGCCTCAGCCTCC 

GAGTGAGCCRCTGCACGGGGCCGAGTGCCTACTCTTTATGTGTCCCATTTTAAAC^ 

GAGGCCAGCCTTCTTGCCTGGGTC^CGCAGCCACTAAGTGAGTGGTTAATCTGAAACT

CAGGGTAGOkGAGGCCATGAAGAACTCITTCCAAGTTGTTC 
TGTCAGTCAGGCTGGAGTAC^CGGCTCAATCTCAGCTGACTCOVGCCT 
TCCTGAGTAGCTGGGACTACAGGCGCGCGCCACCACGCCCGTCTAGTTTTTGTATTTTTAGTA 
CCAGGCXGGTCTCGAAGTCCTGGCCTCAAGTGATCTGCCTGt 
CCCGGCCCAG ATTGGTTCTTT AAAGCGGAGTTGAGAGGTGGCG CTGGAGGGGAGGAAGCCCTTCGACATTTGCTCCCAGCCT CAAA
ACTGGGTTGCACCGGCATCTGTTC(ZTAGCCCTGTGGGGTTCCTC3G AGCC CCGCCTTGT CCCGCCCGG CCTCCCTGAGTTCTTGTCC

TCCCCCTGCCCACCAGTCTG&GTGTGCGGGGTGGGGACCACCAGGCT 

GGGTCATAGAATCCGTCCTACACTCCTCAGGATCTTGAAGGGGGCTCTTCCTTACA^ 

CAGGGCCTAGGTAACCTGTCTGCACCC»CTCCCACTCTGTATCAGCTGCCCT 
TTCCAACTCCAGTCGCCACITTTAATCTXKrCGCCTTTTCCCA

GCTTTAGATAGTGTCGCAGCTGGCTGTGAGGAGTGTCCTGATTCTGTAACCCGGCA 

CCITCCAGGACGTTTGTTGATCAAAGGCGAAGGATGTGCAGAATCT^ 

GGGCCTCGCGCGTAACACAGGGGACACTCAGTGGTGAACGAGACCACTCGCTTAGGCCT 

CACACTAACTGGGTCCCGCAGGACACGCrcCGCCTGGTCGOT 
GCGTGATCTCCTCGCGCCGAGAGACCCAGCAGGGACCTCX3CAAA

AATGCATGGAAATTGGCTGCT AGT AG CTTTGCT ACAGG AGAAAATCGCT AATCT GT AG TG CAAAATCGG CGCACAGGGGTTTTCT C 

TGAGATTGTCTAGACGCCGCGCGGGCGGGTCGGAGGCGCCAC^CACATTG 

CCTCTACTCTGTATCTGCGCCGCCCCAGGCGCCCAGTGTCCGCGCTGGGCCTGGGA 

TTTGATGGGGGGCCGGGCK^CACTTTGTCCACCCAGGCGAGGCTTGGGTCCAGCGC 
GCGCGGAGGCCAAGCAGGAGGCCCAGGAGCCCCXSCGCXSTCTrCCAT^

GGCCTCCTCCCCCTTCCCGAGCGCTGCCACTGGGG 

AC^CGTCAGTCCGGCGCCTCCCCGTGCCCCGACTCACGCAGGTCCTCCCGCG 

TGCCGGGGCAGCGGGGCCCGGAAGGGTAAGAACGOSGCTCGCTAGCGGGATGGGGACCCTGGCCC^ 
GGGGCCTGGGGGCGGAGCGCGCCGGGCCGGGCGCGCAGGAGGGGAGGCGGGTCCGCGCCGCCGCCGTCCGCGC 
GCCCAGCCGCTC^CTCGGCTCCGCTCCCTGGCTCGGCTCCCTGCCTCCGOT
GCCACATCCTCTCAGGTAGGCGCAGCGGAAAAGGACCCC^ 
AGAGGGAGGCGCTGAGGGAGGAGCGGGCGAGGGGCCCCGGCAGCCCGGCrc 
CTGGTGAGCGGGCGGGCAGAGAAGGCCGCACAGGCGGGTGCCG^ 
GCCTGCGGCTGGCGGCGAGAGGCXX3CCCGGGCTTGCGGGC 

AGGTGACCTTCCCCGGCGGCGCCCTCTGGCCCTGGATCCCACCAGCGTGCATCCGGCGGGGACCCGTTC

GACCAGGCCAGKXX^CGGGTGCCGGCCACG CTGCCGCCTCG^ 
CGCGCCCTGGCCGGCCCACGCCCGCAGAAGCTGGCGCCCXKSCACCGC^ 
CGAGAGTCGACTTAGGCGGTAGAGGCCGGAGCTCGGGCCTGGGCTGAAGGTTTGGCCT 
GTGGCCT T CT T GGTCTTAGGAGTGATTACTGGTTCGGTCGTTCATTCATTCG^ 

GAGCCTGACTTTGGATTAGAGGCCCGCAGTCTTAGTGGTTTGTACAAGGTCACACAGCT

ATCTATAGGACGGGAGCAATTGTACCTCTCCTTCGATGAGTTTTGGGGAATAATAA 
CAGTGCCGGTGTGGAGAAGGGCTAAGTAGAAACCCTGACACCCTTGGGGC O ^ 

TTTG CT AGA.CGTGGCGTTGTGG AAG GGGTT CAAT AAATGT AGCT CCTGTTTTT AAACATT CTTTCACT ATT ATTG ACAAGCATTTG 
TATACCAGGTATTGAGTAACTTTCAGCTTAATTTATGAAAAATCT^ 

TTTGCATGTGGAGATCTTGAATGCAGGAGCAGACCCTGGTTTAGAG CCCGTTCCGC CCGGGGTCCCTGTCCCCTTG ACTAGCCTCC

CCTCTCCTGGGGAG AT AGGGGAAAGGGCTC CTCTGTTG CCT AAACTTCTCC^ 
GGCCAGCCTTCATTCTGCCCACCCCAGCTCrrGGACCAACTCAGCTCC^ 

GTCTTCAGTGCCTGGGGTGGTGCTG CCTGG CACACAGTAGGTGCTTGAGAAGGGTCTGTGGGGGTAAATGCACTCTTGTGTCAGAA 
TCCACCCATCTTTACrGAC^GAGTTTTACTATTC 

GGGCATATGAAGGGCTGCAGGCCCAGACCATAGTCCTTTGATCCTGG

AGGTCTGGAGCGCGGTGGCACCATAACGGCTCATGGCAGCTT 

GTAT CTGGGATCCCAGGCACACACCACCACGCCTGG CTAATTTTTGT ATTTTTAGTAGAGTCGGGGTTT CACCATGTTTGCCAGGC 

TGATCTCAAACTCCTGGCCTCAAGCAATCCACCCGCCTCCGCCTCCCAAAGTGCTGGGATTACACGC 

CCTGGACATCCTTTTGAGGGACACAGACATCTCACTTGTGGAAAACAGCCTC^ 

TAACAGTATGTATAGTCTCTTGAAATAAGTGCTTTAAACACATCGTCTTCACT

GAACTGGTGTTTCTGGGCTTGCAGGGAGGGTGACATGGGTGACTGGGATTCTACAGAAGGTC 
TGGTGGGTGGATGGACAGCTTGTGCTGGGGTTGTCCTGAGTAGGGTGCCT^ 
TTAGTTTTATTTTTGCTATTCTAGAGTGGGGCTATAGGCAGGGGTTATTCGTAGCT 
TGCCTGTGGGCCCCTGGCCTTGCTGAACACTAGGCAGGTGTGGACAGTGAACTCCATGCTGGGG^ 

ACCCTTACCAGGAAGATGCTCCTGAAGCACTGAAGTCCACTTCtSCCTTTGATGG^

TGACTGGAAGAGGCGCATGTGTGGTTGAGGACCAGCCGAAGCTTGGAGGCAGCC^ 

GCTTTGGGTTTTCCATGGAGTCCTGTGGTAGCGACTGAGAAA 

ATTTTACCACGTACTCAGGGGGTTTGGGATGAGTGGGACTTTAAGTTCTTTACT 

G CTT AGGCCCATAACACAGGTCTGGTGAGCCAT AGGGT CGGTCTCATACAGGTCAAGTCACTGTCTTGTGCCTCAGTTTCTCTCTC 

TCTTTT T TTTTTTr r iTrr X TJAGATGGAGTCTCACTCTGTTGCC

TCT AC CTCTCGGGTTCAAG C AAGT CTCTGCCT CACCCTCTCG AGT AG CCGG G ATT ACAGG CG C CTG C CAC CATG CCTGG CTAATTT 
TTTTGTATTTTTTAGTAGAGATGGGGTTTCACCATCTTGGCCAGGTTGGT 
CTCCTAAAGTGCGGGGATTGCAGGTGTG AGCCACTG AGCCCGGC CGAGTTTGTCTG CT 
GATTGCTGATTGAGGGCTTGCTCAGCACCTTTCTGGGGGCTCAACGAA 

AGAGAGATAGTGTGTTTCCATTTCACAGGTCAGCAGACTCGAGCACAGAGAGG

GATTCAAGGCCTGGTCTGAATGGTGGTGCTCTCACATTGCAGTTGCACrrCCAAGGGA 
CCTTTTGGAAAGTCAAAATTGGTGTTCAGATGGGAGACATTATTTT^ 
GGCCAACTGGCAGTGGACAATGGAGTCACCTCTATTTCITGCAAGGACAGTGGTGC^ 
TGGGATCTGGTGGGAGGCATTCTTTCAOTACCTGTTCAGCTAAACAATGC 

TCAGGGTGACCTTTTGAACATGCTTAGCAGCCATTATTATAAT

CCCTGCTTACCAAGCGCCAGCCACTGTACCAAGTGTTGTATATTAACGAGCT 

TCTTTCTCCATATTAGAGCTGAAGAAAAAGGAGTAGAGA 

TTAG^^CTCAGGTCTATTTGATTCCCGAGGCTTCCTGCCCAAGCAC 

CTCCAGGGAACTTGAGGTGCCTTAGGTCAGGTAGGAGCCCCAGGTATGCCTGGGATCCAGACCATA^ 

GGCCATCTCCCCTTGATAGGTGGGTGGGCAGGCTGCCCTGAGTCAGCAA

TGCTGGTGGTGGCCCAGCTGCCAGGGTGTTTCAAAATAGAAGCT^ 
AGAATGGAGGGAGAGAACAGCTGCATTGG43TACGCAGACCTC 

TTGAACCAGGAACTGTT C C C CTGGGTTGTTT CTTCCTGGCAAG AAAG AG AAAGTGCTGG CATG CGTG CATGT 

GTGTrGAGCGAGCTGCTGATTGTGTTCGCCCCCATGTTw 
TCCTCAGGGCTAGGAAGAGGTGCTGTTTCTGTGATCACTG£
AAACCACCTAGAATGGTGTTGGTGAAACAGCG
GCAATGATGATGATGATGGTTGTGATTGCrCTGTT<X3TTAJt5C 
AGAGTTTAATGAAGACATCATTTGCAAAGCTGTGGATGGGGTGTCAGGTCTGG 
GCTTCTTGGCAGGAGCTGTGGCCTTGGGTAGAGGAAGGTAGTCATTGCCCATCT 
TACCCCGACTTCATTCirCCCTAAACTCCCGTCTTCTGCAAGTACCTC^

GCAGTCCATAATGATCAGTTCIWGGGCACAGAGCAGGGTGGAAGGGAGGAG 

T AGTGGATT ATGGTT ATT ACTGC CATTTGTTGAACACTT ACTC CATG CC^ AT ACTTCTGCrTCTTCTG C CT 

TGTAACAACTTCCTTAGGCCTAAGGCCTTTCCTCCCAGCTGTTT^ 

TCAACCATTGCAGTAGTTCTTGGCTCTCTAGGGGTGCAAAGCT 
TT T T 'IT ATTT AAAAG AGACAGGGT CT CACTGT ATTGCC CAGGCTGG TCTCG AACTC CTG CATT CAGG CGAT CCTCCTGC CTCAG CC

TCCCAAAGTGCTGG^ATTAGAGGCGTGAGCCACCATGCCTGGCACATGCCTGAATC 

CATATTTCTACCAGGCGGGGGCCTGGTAGGTGC 

CCTtSCCTGTTAGGCACAGGCTGAGAGGTAGGGCAGGAGGAGTTCC^ 

ATTCTTGTGCACCCTGAAGTTAGGGAGCTCTTTCCTGCTCATGATTTCT^ 
GAAATGTGCCCCGACGGACCTTGCCTGATAACTCTTCCATCTCAGTGTAGTT^

GGT AGC CTTCTGGCATTGCT ACTGCAG CAG C CTTGGAACGTG ATTTGGGACACAACACAGCT CAATTTCAGGCACAGTGGTTGG CT 

TTTAAAGTGCTGTTT1TTTGGAGAGGGAGTCTCTCGCTCTGTCA 

CTTTGCCTITTGGGTTCAAGTGATTCTCATGC^ 

TTTGTATTGAAATTCTTTATCCTTAAACAAATTATTAGTAT^ 
GAITTGAAGTGCATTCCTTCCCTAAACCTCCTC

CAC^TGCTTTCCCATCCATTCATGAACATAGAAAGATGTGTAAAAA^TTT^ 

TTTTTCATTTAACTGTATCATAGCTAGCTAGCTTTCTTTCUT 

GGCTGGATTACAATGGCGCGATCTCAGCTCACCACA^^ 

GCTGGGATTACAGGCATGCGCC^CCAraCCTGGCT 

TGGTCTCGAACTCCTGACCTCAGGTGACCCGCCCACCTCGGCCTCCCAAAGTGCT

TATCX3TAGCnTCATAGCTAGCTTTCTATGCCACCACATACATCCACCT^ 
TCGCTCTCTCATCCAGGCTGGAGTGCAGTGGCATGATCTTGGCT 
TCTCZAAGTAGCTGGCA.TTACAGGCGTGCACCATCACACCT 
GGTTTK3CCATGTTGGCCAGGTTGATCTGGAAGTCCTGGCCTGAAGTGATCCGC 

GGCATGAGCCACCCACCTTATTCTTTTTGATTACCATATGGTACT^

TTTTCCITTTGATGGA(nTTTAG<5TTGCTTCTGGGTTTTG 
" AGTAGTGCCTITGGTTGCTACIAGTGGGGCAGGTTCCTAGAAGT 
AGAGTGTACCTGTAGTACTCCCCCITGGCGTGAACAAGAGTGACT 
TTGCCAGACTGCTGGTTAAGTGAGCTTCTGAAAC^CAGATGTGGTAGTC^ 

CCATG CATGGTATTGAGGTCAGAAGCAACC^GTGCTTCTGATGG CACTTGGTCTCCTCAGCCTGAGATCAGCTCTCCTTAGGCCGA

GGTGGGAGCAGGCAGGGAGTGGAGGTC^TTGCTAGTTGAGGGTCTCCCCAACTCCCCCACrcrr 
TGCAGAGGGAGCTCITACTTCCTGGTTGCATTGTAATGAGTTCCrATTGTCAGATTCC^ 
ATAATACTTTGATGCTCATTGCTCATTCACTGATCAAACA 
ATACGTAACTTTTTCCATATTTTGGATAAATTTATGGATGACA 

GGTGTATGGGGTAGGGTGGGGTGGGGGCACGGTGAGCACAGCAGCTG

CCCCAAGACAGAGGTTACGTAAGTATT ATGCAGACTGTG CTGGGTG ATAGATACACTAT CCTTCCCCTCTGTGCTTCCAGTATGGG 
CCGGATTTTCATTCCTCAGCTGGAATG<3GGCTTGCATAATTTGT^ 
CTGTGGAGGGGAACCAGGGACTCCTGTATCCTGGTGTGACrrrGCTGGTCCTTCTCAAAG 
CAGCCTGGCCCTGCCCATCCCTCTGTTTCCTACCAACCAGT 

GAGAGGCCTGTGGTTCTGACTCCCCCAGGACGCCCTCTGACTTCTGTCTGTCCAGGGCrrA^

TGCTTCATGCGCAGCCTCTAGAGGCTCAGCTTGTGAAGGCCCCTGGGGGGTGGAT 
CTGTACAGGCTCCGATTTGGAGCAGATCTGGGTGGAAGGA 
CCTGGCAGGGTGTTAAGTGAAAGCAGGCTTTGTTGCCGACTTTTCTTTCCT 
ATTTCTATCATAAAATTGTCTCTGAAATGGGTTCGAGAGCTGTGTAAAAATGGGCA 

AGCTCAACTTCTCGGAGCGGCAGAGCrGTCTCGGGCCrTCCTTAGTTCrcT

AAAAAGGAGGGGAAGCTACGACTACCTGGC!AGTTGGGGGCCTTTAGT^ 
ACTTCAGGCCCAGGAGTGGACTTGACrrCCATTCAGCCAGTGGT^ 
TTACTTGGGGGCCCAGAACX3GGGGAGATTAT<3CTTCATTGTCGGTGT^ 
GCCAGCATCCAGTATGTTTACAGTGAGGCATTGGATAGGGATATGTTC 

AACTACATGGTTGTGTTGTGGAACCCCACAGCCCCAAAAGAGC^

GTGGGTGGGGTGTTTTTTGACAGGTGGCTGTGCTTGTTGGATT^ 

GGATGAGTGCTCTGCCTTTATCCTGAGCGGGGACCAGTCCAGATGTGAAGGCT 

AGGGGAGAGTCGTGGGGAAGGCTCTGGCTCCTAAGGTCCAGGGAGGTCTTAGGTGAC 

CTTCTGCTTCCCCAGCCCCCACCCCGGCTCAGTTTGTGGCC^

GGTTTGTTAACCCTGTCGGGTGGAGAATGTTCTGGGCCCGTTACTGTTATGTTCTGTGTCTG 

TCAATCCAGAGTTGTTCTCCTAGGAAGGTGTCAGGTAAAGGGTATC 

GGGATGCACCTACAGGGTTAGTTTGGGAGGGAGGAACAAGTGGGGAGAGGGGAGCCCCCA 

TGTAAAATGX3ATCTTCCCCCCTCATCTCTCTACCCTG ATACTAATGTGGG CTG ACCTGTCCATGGTTTCAGCCTGATCATGTGCAA 

GAAATAACA(TAGCAGCGACAAAAATAGGACAGAGTAAACAG

TAAAATTAACAAAGCATCTGGAAAAAGTAAAGTAGGACCTGGATCACAACT 
TTTTACAACATACTCAATGAGTCTGTGGCCTCCTGCAATGCCT 

TATGGAAGCATATCATGTCATATGJVCATGCCAGGAGCATGTCATATATGTGTTTATTTGTC 

TTCTGGGGATGGAGAAGGTGCAGGAGATGAGGCTTTC

CAGATTAGGATATTAATCCTGTCTACACAGCATATACAGCATGCCAAA 

AAGTAGTCCTTTGATGGTTGGTTTTAAAAATAAAGTTACACTGGAGCT 
GGGT^TTTTCCATGTGAAGAGCTGCCAAAGTGGTTTGGTTTTACTC 
CAAAAGTTTrGTGAAAAGTTGCTCCCATCCTGGCATATAGTAGGTACrCCATAAA
TGGAAAATGATGTTCCCATGAAAAGCTCTGATTTCTACT

TTT ATTGGCAACAGTCCT ACAATTCAG TAAAGT CACTGG CT CATGGGAATGTGT CTTTGACAGTTT ATTCATT AAGCAAATATTT C 
CTGACTCCTACrCTGTTCTGGGCCTGGGCTCAGTC 
GACTCATTGAGTGTCTCAAGCTCTGGCCACAAAATGAAGGACAAGACAG 
TGGTAGTCTTAGTGCTAGGTGATTTTTATTGGTTGCTAATTACATCTAA^
CAATGCATTTATTCTGGTGATTCTTTTAT 

ATGTCTTCACCAAGGGCTATGGTGAGTGTTCCAGAGAGGGGGTGCCTTTCAGC^ 

CGATAACTGTTTAGGAGACCTGGGTAGGTCAGCTGTTCTGAACTTTCACGTC 

GTAGCAACTTTGGTTGTGCACCCAGTrGGACAGGCTCCTGCATTGGTC 
GAGCAGCCCTGTGGAAGCTGCTGCAGTGGAGCGCGGCTTCT

GTGTCTTTGCCAGCTGTCAGGTGT C TT C TTATCAGACCCTGCTGGGAGTG 

CTGCTGTTCCATGGCAGCTGAGAGCTCCCCCCACGAGCAGCAGCAGAGAC^ 

TTTTTAAAAAAGCATCCreAAGCAACCATGAGGGAGGCAGAGGTGCTGT^ 

TTTCTTTAGGATTTGGCTTAATAAAGCTTGATTTGAAAACA^ 
GAGTATTTATTTATTTTTATTTTATTTTATTTATTTATTTT^

GTGATCTCAGCTCACCACAACCTCCACTTCTTGGGTTCAAGCG 

CCGCCACCATGCTCGGCTAATTTTGTATTTT^ 

AGGTGATC CGC C CAC CTTG G CCT CC CAAAGTG CTGGAATTAT AGG CATGAGCCACTGCG CCTGGCTT ATTTTT A i i n l AAAAAAG 
ATTTTAGATTCAGGGTGTACATGTGCAGGTTTGTTACATGGATGTATTGTGTAATGGTG 
CCCAAATAGTGACCATTGTATGGAACGTGGGAATTTGTTGTAGATCAACAAATAACT
AGGTACTTTTTTTTTTTTATTCTGGCAGTTTTTGTGGTT^ 

TG C CAG T AGG C C CC CAGTTCAG C AGCGGG T AATAGAGGAGAATGGGT CAGGTCTCAGTTTT AGTTTT CCTTGTGCTTCTT ATTTT C 

AAATTCACCTGCATTCAACCACATAAGTGTTTTTAAAGTAATCT^ 

AAAGAAGTTTTGGGTTTGAATTACCGCrGTGTCAGCTTTAGCT 

AGATGCTTTCCTGTCCAGACTGAAGAGTTACAGGGAGGAAGGA

AGCAGGCTCCTTTACTCCCATCTGTACTTTAAATAT 

CTCTTGCAGAAAGGCAAGTATAGTGTTTAGGAGTGAAGTGAAGCCAAGTGAGTGCTATG 

TTGAAGGAATTT ACAAGCTCAGGCTCAGC CAACACTGAG ACCACCAAAGTG^ CAGTCTGGAAACCAAGT ACAG ATGG ACTGA 
GTACGGCCTGACGTTTACAGAGAAATGGAATACCX^CAATACACTAGGCACCGAGATTACT 

TGAAAAGATGTGGATTTATTAATTCGCATCTGCCAGCCTGCCTITGTGACTAAATCrAGTGGCAAAr

ACGTGGACTGAAGCTGACCTTCGATTCATCCrTCTCACCTAACACTGGGTAAGAGTTTC^ 
TGATGTTGCAGGTGGTGGAAGCCCTTTGTGACAAAATGTTTTGGGGGCTGA 
TTAAGTCCTCTGGTTTTGATTTGGTCCCAGGGGTCGTATCCACATCrATGCACT 
TCAGGACTGGTGTTTACCCAACAAACGATTTCTAAGTGCCACCTGTGCTTGGCCCT 

GCCATGATTGTTACCCGCCAAGGAGCTCACITTCTAATAGAGGATTCAGACC^

TTTTGTTAGTGTTCTCCTAGTTTCTGCCATGGAGGTTGCAGGATTTCTGTTC^ 
TGATGTAAGTGGCTATGACCAAACGGGAGTGGGTGACATTTGGAAGCAGGGCCTAACTGGTCA 
GGCATGGAAGTCCTTCTCCCAGCTTCCTGCTGCTTAGGGACACTGAGACCCr 
ACTTTCCCCAGGCCTTCTGGCCACAGATGTAATAGGCTTGTTAGTTCTTGACC^ 

AATT CCTGGGTGCTTTTCAAAGTG CCTAGAAGGAATT ATTCCCTTTTAGAATACA CACTCATATTGGTGGGT

CTTCCAAACCTGGGGAAGATGTTTTATATAGTGATGTTAGGGTCAAGGGCATGTACTG 
GGCGTGGTGGCTCTVCGCCTGTAATCCCAACACTTTG^ 

G ACCAACGT GG AG AAACC CTGTCT CT ACT AAACIAT AAAAATT AG CTG CG CCTGGTGGG CGCATG CCTGT AAT C CCAG CT ACTTGGG 
AGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGCAGAGGTT 

AAGAGCGAAACTCCATCTCIAAAAGAAAAAAAAAAAAAAGAGTCG

TGAGGTGGGTGGATTGCITGAGGTCAGGAGTTTGAGAC<^CCTGGGCAACATG^ 
TTAGCCAG^CATGATGGTGCGTGCCTGTAGTCCCAGATACTTGGGATGCTGAGGCATG 
TTGCAGTGAG CTGAGATCG CACCACTGCACTCCAGCCTGGGCTACAG AGAGAGACTCTGTCTCCAAAAAAA 
AAGATATACATATAAAACTTAACATCTTTAAGTCrrGTAGTTAAATGGCACTAT 
TTCTTCCTTTTTTCCTrTCCAACAGATATTTATTAG

GTGACTGGGCACACAGGGTTCCTATCCTCTGGAGATGTCGTCTTATAGCTTGG 
GCTGATTTACTCTCATTCCTCTTCGCTGCTGTAGTTTCTCC^ 

G AACTCTGCCATGGTATCAGGGAGCGGTTATCTGGAAAG ATCCT^ CCC ACATG CTTGAA 

TTTGCTGCCTGGGACTTTGCCTGCTGCCCTGGGATAGCAC^CTTTGGAAT^ 

TTGCAGAGGTCTCCAGGCCATGCAGTATAAAAC^

TCAGCCTCC CIAAGTAGCTGGCACTACAGACACGGGC CACCATGCCTGGCT 

GTATTGGCCAGGCTGGTCTTGAACTTCTGACCTCGTGATCCACCTGCCTC^ 

CCTGCCTCAGCCTCCCGAGTAGCCGGGATTACAGGAATGCACCACCACACCCAGCTAATTTTGT^ 

CTCCATGTTGGTCAjGGCTGGTClTGAACTCCTGACCTTGGGTGATCTGCCCACCT

GAGCCACTGCACCTGCCCCCCTTTTTTTTTTAATTTA 

GTTGATATCATATAGTTAATTAATCTGTTGTCACAATGACGTCTTTTTTCT 
TAATTGTTAAGTCirCTCTAATCTGGAATATTTTCTGACTCT 
CTCAGTTTGGATTTGTCTGATGCTTCCTTATGATTAAATTTAGGTTATG
TCTGCACCTGATGTTAGGAAGAATATATCAGTAGGC 
GTAAAGCTACTATTTTTCTCITTGTAATTAGTAAGTAT 
TACITTTGCTCAAAAATAGTAAC!ATC(^TTGGTGATTCTTGCCT 
GTGATCCTGGCTCACTGCAACCTCTGCCTCCTGGGATCCAGCAATCCTCCTTCCT 

GTGCCACCAAG CCCAGCTAATTTTTGT ATTTTTAGTAG

CAAGCX3ATCCACCCGTCTCAGCCTCCCAAATTGTTGGGATTACAGGTGTG 
TCTTTTTTGTTTTTATGAGATGGGGTCTCIACTCTGTCACCT 

ATT7TTGGTAGAGACAGTGTTTCACCATCT 
cCAAAATTCrGGGATTACAGGTGTGAGCCACCACACCTGGCCT
TGGT ATT CTACCGT AAGG AAGAGCT ATTCCTT CTT CCC CATGT ATTT ATTT AATCAATT ATTT ATTT ATATTAG T AT AG ATG CATG
GACATTTACrCTATGGGCTATAATGCACAGTGTCAT^ 
CTGTGTGCTTTTTGACAAACCTCTGTCGTTTTTCAGA^ 
TTGTTTTTCCCCAGTCCTGCTATCTGCCATTTCTCCAAGGJ^ 
GTAGGCCTGGCGOSGGGGCTCATGCCTCC^TCCCAGCACTTTGGA^
CAGCCTGGCAAAACCCTCTCTTTACTAAAAATAOIAAAATTAGCT 

AGCAAGACTCCATCTCAAAAAAAAAACAAAACrAAAATTCrG 

TGG TGGTGCAT CATGG AG ATT AGAAACGAAAG CTGGCTTTCCAAAGTGACTG AGGGTACAG C CTGGGAAAACACCTGG CT CC CAGG 
AGGAGCGGAGTGTAGACC^CATCTGGATGAGGCCACAGACCTCAATAAGGACCAGCA

OVAT AAT ATCTG AAAGAGGAAG ACTCCACAGACATGGTGTGC^ C CACTCC C CAG 

CACTGCCCTTTCTCTTGGGGATTGTG<K3CCCTGGCCAG^ 

GTGCACACCACTGGTCAGAGGCCCTGCCATAGGCATCATAGTTTGTGATC^ 
GGTTAGAAGATGCCrrcATTTTATATTTCCTTTTCCCTTTGTA

CAGCCTGGTGCCAATCACATACTTGGCTGAAACCCAGACTTC1TTCCCCAAAGAGCA 

TTTCTTGAGACATCTTGCTCTGTCGCTCAGGCTGGAGTGCAGTGATC 

GCAATTCTCGTGCCTCAGCCTCCTGAATAGCTTGGGATTAC^ 

ACAGGGTTTTTCCGTGTTGGCCAGGCTGGTCTCCAACTCCTGACCTCA 
TGAGCCACTACGCCTGACCAGTTTTGGATTTTTCTTATTG

GTACAGAAGTAACTTGAACAAAATTGAGGTTCCAAGGCATAAGTTT 

CTACTCCCAACTG CTGAGGG AGCTGAGCAG CCAAAGAAAGAG ACTG ACAAATCCAGTTTCTCAGAAAGAAACATTTAATAGAGACT 
T ACAAAT AGAAGCGATGTCTCAAGCAG C CAAGAGACAG TGGATCCCTGC ATTCATCCT CCAGAAAGT ATTCTTT AT AT AG CAAG CT 
TTTAGAGTAAAGACATGTGCATGTCTTAGACCATCTTGCAGAAAACT 

CTTTACTTTTTAAATTTTTATTTAT^

CTCATTGCAACCTCTGCCTCCCGGGTTCCAGCGATTCTTCTCCCT 
ACCCAGCTAATTTTTGTATTTTTAGTAGAGATGGAATTTTG 
ACCTGCCTTGGCCTCTAAAAGTGTTGGAATTACAGGTGTGAGGCACTGTGC 
GATGCATTCCTTGGTCGTCATGGCTGTATTAGGCTATTCTTGCC^ 

CTGAGGCAGGCAGATTGCTTGAGCCCAGGAGTTGGAGACCAGCCT

C^CACACACACACACACACACACACAC^ 

AT ACCTGG CCAGGCG AGGTGG CTCACACCTGTAATCCC AGCATTTTGGGAGGCTAAGGCAGG CAG ATTGCTTGAGCCCAGGAGTTG 

GAGACCAGCCTAGGCAACATGGTTAAACCCCGTCTACCCACATGCACACACACACTAGCTGGGCCT 

AGTTCTAGCTACCTGGGAGGCTCAGGTGGGAGGATCACCTGAGCTCAG^ 

ACTCCTGCCTGGACAACAGAGTGAGACCTTGTCT

CAATTGGCTCATGGTTCTGCAGTCTGTACAAGCAGAG 

TG GTG ATTG ATG AAGCAGGAG CAGG CACATCACATGG C AAGAG CAGG AG CAAG AG AGAG AG AGTGGTGGGG AGGTG CCACACGCAA 

ACAACCAjGATCTCATAACTCATTATTGGGAGGACAGCACCAATCC^ 

TCC^CACTGGGGACTGCATTTCAACATGAGATTCAGAAGATACAGCATCGAAACTG^ 

GTCACTCTTGCCATGGAACAGGCTATTrTCCTACATCACTT^

TGT CT CATTC AG AATTTGG AATTTTT AAC CTTT AT 7 TTT T T AATG AG AAAT CTGAAACAT ACAAAAAT AT ACAG AAAAAT ATG T 
TTAACCCTCATTCTATTCCTATCACACAGCrrTTGATAGTTACCAACCTC 

ACACT C CT CCACGCACAT ACT ATTTTAAGT AAAT AACAACACTTAATTT C AGCTG TAAATATTTT ATTATGT AT CTTTAAATGGCA 
ATAATCCATTTTAAAGGAGCATAACCATGATATTATTATTATATTGAAAAAAGTT 

AAGATGGAGTTrCACTCTTGTTGCCCAGGCTGGAGTXJCAATGGTGCGATCT

ATTGTCCTGCCTCAGCCTCCTGAGTAGCrGGGATTACAC^CATGGGCCACCACACCCC^CT 
GATTTCTCCGTGTTXXaTCAGGCTGGGCTCGAACTCCCGACCTCAG^ 
GGCATGAGCCACTGCGCCCAGTGGATCATTTCTTAATATTGTCAAATCTCr 
TTTTTTTTTTTTTTGAGACTGAGTTr^ 

TCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGATGAGATTACAGGCATGTGCC^

TTTTTAGTAGAGATGGGGTTTTGCCIATGTTGGCCAGGCTGGC 

AAGTGTTGGGATTATAGGCGTAAGCCACCGCACCTGGCCTTTTGCTGTTTTTAAAATAACAGGTTGGT^ 

GACAGCTATGTGATTTGAGCGTTATTATAATTCTTTTTTTTTTTTTTTTGAGA 

AGTGGCGCGATCTCGGCTCACTGCAACCTCrcCCTCCCGGGTTCAAG^ 

ACACGCCCACCACCGCGCCCAGCT7VATTTTTGTGTTTTTAGTAGA

TGACCTTGTGATCCACCTGCCTCAGCCTCCCAAAGTGCTGGAATTACTGGCATGAG 

ATGCCTAGAATAAACATACTGTTTAATATGTTTCAGCCTAT^ 

GCCCTTGGGAGCCTCTTCAGGTTACCTTGTAAGTCCTTTTG 

CTGAGGATGAAGCATTAATCTGAATGTGAATAAAGCTAGAA

CCTCCTCTGGTCT CTGCTGAGAATGCAGCTTCCTGTGGGAGATGT CGATCTTGTAACTGAATGATAG AGACAATCAG ATAAAAATG 
ATCCATGTTGTATTTGTTGGATGTGATCTTGTTCTCCAGTGTGT^ 

ACTATTCACCTGCTCrrCCrAAGGGATGCTGGGTCTTGAGGAACCGGCAGTAGAGACACTGAACT 
ATCATCAGGTTGGACCTGGGATACrGTCTGGCrAACTAACACTTCCT 

C CCGGTTCT CTG AAAT ATGTACTCATTGGGGT CTCATGAAT ATGGAATTGAAAT CGATTTT ACTG TTG T ACTCACTT AGAGGGAAC

ATAGCCTTTGTTTCCTTACITAACCATGTGAGCCACAG^ 

TTCIATGGCAGATAAATCTTTTCCACAGGTTTGGAAAATATCAAAGATC 

GAATGCCTGTGTAGGCTGCTACGTTAAGCATACAAAGAGGATAGGAGAAGAAAGT^ 

AGTGGCTTAAGGGAGAACTGGAGCCCIXIACTTCACCCCCACCCC^

ACTTGGATTTCAAGGTGAGTTAGCTTCTGATCTTAATA 
CAAGCXSGGAGCAOVTTAACCTGG^ 
AGGGCTGGCTGGCCGGCTACCAGATGAATTTTGAGACTGCAAA^
GATG^ATTCCAGCTTCACACTAATlTrGTAAGTGTACATGGGGGCA 
GATTGCTTTTCACTG TTTrrTTTTTTT^
G CCACATGG CCTG CTG CT ATCTTTTGTTTC C CACCT ACC CATTT CCTGGAGTGTCTT AGATCCTTT CC^CTCTG TG AACAGT AT AT
ATTGCCACAGGTACACAAAGTGGCTGATTAATTACAGCAGGGACATG^ 

TTT AGTTAACAAAAC^G CTCT CTTTCTCTGT CT AATGCCTG^ CTTGATG ACATGT AGACATGGT AGACAA 

ATTTAGGAGCTGATTATCTCAGTTACAAACTATCTTCT 
GAAACAGTGATrATAGGTTAGGAGAGAGCCCCTTGAGAATTGAAGTAGATT^

GAATCTTGTTTTATCCATAAACCTTGTTrrATTTAG 

ATOTTTATTTCTTTCCACrAGJlGGTCA^ 

TTTCCATTAATAAATTTTTTGAGGAATTTTTATCATTAAT^ 

GACAATCTAGTGGTTGGCAGGGGGCXKKXIAGGXSTATGACT^ 
TCTGACTGTCGTCATGGTTACATTAATCTGTGTACGTGTGTTAAAATTCGTAAA

GGCGTCGTGGCTCACGCCTCTAATCCCAGCACTTTGGGAGGCCGAGGCG^ 

TCAAGAAGGTGAAACCCCATCTCTACTAAAAATACAAAAATTA 

GGCTGAAGCAGGAGAATCGCTTGAACCTGGGAGGTGGAGTTTGCAA 

GAGCAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAGCAAAG 
TGAAACAT ACT AAAAAGCT AT AGGAAACATAC CACATGCAT AGAAATG TTTTTGAACGT AGAATTCTGATTGG CATGGGAAGTGTT

TTCCTTGGTTTGAAATTGGGCATTGGCACCJVATTCT^ 

ATCTG AG CTT CAGCATTTT AAATAAG CAACAAGTGGGT ATGGTTT ATTTTTGG AAC CAGCGTGAAGGCAGCTG ACACAACT C ATCT 
GGGTTGCCCTGGTGCTTGCAGGGGCCCAAATGCATAACAGAAATTC^ 
GTATTTATTTATTTATTATTATTATTITITGGTGACAGGGTCTCACT 
GCCTCCACCTCCCGGGATTAAGTGATACTCCCACCTCAGCCTTGCAAGTAGCTGG^
ATTTITGTATTTTTTGTGGAGGCAGGATTTCGCCATGTTGC 
GGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACrGCATCTGCCTT 
TATTGTGTTCCTGGTTTTAGGATATTCrGAAAAGTCTGGCCTTTTTTAA 
ATTACTAAAGTCATTCAAAAATCTTATTTTTGACAGGCTGCTGT^ 

TCTGTGAATCAGGCAGTGCCTGGGTCTCCCTGTACTTTCTAGAA

TTACAATCTGTGGCTCATACGTGTAATCCTAGCTACTTGGGM 

CCTGGGGAACATAGAAATACTCTGTCTCTAAGAAATTTTTTAAAAATTA 

CCCAGGAGTTTGAGGTTACAGTGAGCTATGATTTTGCCACTtS 

AAAAAAAAGAAAATTTAGAATGTATTCTCTGATTTAGGGAGAGTGTCTTGTGAAGACTAG 

CTTTGCTGGATTCCAAAGTACGTGGGCTCTGACTCACAGTTAACCAA

ACTCAGACACTGATTTTATGTTAAAGATGTTTGAGAGGGTCTA^ 

TTTTTAAAAAATT GCAG CATT AAAATri"l'l'Cl"rAATri"lTTT AAT CAAAGGAGT CTG AACC CATGGTTGT AAAAT CAAACAGT ACA 

GATGGGCCTATGAAAAGCAGTGATTCGAGCCTGCCT 

TTTCTGTGGTTCCCTCCATGGAGAAGGTTGCTTCTGACCAGGC^ 

ATGAGAGACAGTCTATGATCTGTGCCGACCCCAGCCCGACAGAGGACAAAGTGTAG

GTGGCAGTGACTGCAGAGTTTGTATAGGCTGTTCAGCTGGGTGGG^ 

GAGGAAGAGGAGAATTTGAATGACTGAAGAAGGGAAAGGGATATTCCCAGCAACTGGAGTC^ 
GGAGCTTGGCCAGCTGGGGTTTCATTCTGAATCTTTTT^ 

CTGCCAGGCTTAGATGTTGCAGCATGGGTGAGGAGTATGCACAGGTGGACGAGATGAATTC 

CTCCTCCAAGAGGCTGGGTCAGTGCCAACTGTATCTCCCCAGCAGTGGCTGCAGCAGGTTGTGCCA

GAGGCAT ATGTG CAGCCCCACCTTTCTTG CCATTTCTGGCTGTGTTCTCTT 
TTACTTTTATTTTTTGAG CCAGAGTCTCACTCTGTCACCCAGA 
TCCTGGGTTCAGACGATTCTCCTGCCTCACCCCCCCAGGTC^ 

TTTTT AGTAGAG ACAGGGTTTCATCATGTTAGCCAGGCTGGTCTCAAACT CTCAAATGATCTGCCCGCCTTGG CCTCCCA 

AAGTGCTGGGATTATAGGCATGAGCCACTGCACCCGGAC^

AGTAGCAGTAAAAATAAATCACrCAGAATCTCAGTGCTCTACTTGCTC 
GCTCTGTTGCCCAAGCTATATCCTGGGCTCAAGCAGTCCrrGCTGCCTCG^ 
ACGTACCCCCTCTCCCCTGGCTTTTTCriTTTTrTC 
CTCGGCTTACTG CAACCTCTGCCTCCCAGGTTCAAGC<*ATTCTC CTC 

ACCAGCCCGGTCAATTTTTGTATTTTTAGTAGAAACAG

ATCTGCCCATCTCGGCCrrCCCAAAGTGCrGGGATTACAGGTGTGAGCCACTGTGCCTGGCTAGGTG 
CTTCCTGTAATTTAGTACCTGTTTTGTGAAATTTGGTGAACTTCTGACTTC 
ATCCCTOTCTTCTGGGAGACTTGGGTGGATCTGTCTGGGCTGCAGCTCACCT 
TGCIAGGAATGACGGGACAGAGTTTGGCGGCTCCATTTACCAGAAAGTGAACAAGA^ 

AGCAGGAAACAGTAACACGCGCTTOK3AATAGCAGCCAAGTATCAGATTGACCCTGACGCCTGC

TCACGCATCTCACAGGCAGGCC CATCTGCTCTTTATGTT^K3 AATCCTTCAGGTTTG CTGGAGTGCTGTG CTTCAGCTT ACTCAAAG 
TGAGAGATGCTGCTGGGCGCCGTGACTCACGCCTGTAATCCCTGCACrTT^ 
GTCTGACACCAGCCrCGCCAACACGGTGAAACCCTGTCTACTAAAAGTACAAAAATTAGC 
CCCAGCTACTCGAGAGGCTGAGGCAGGAGAATTGCTTGAACCGGGGAGGC 

CCAGCCTGGGTGACAGAGTGAGATTCTATCTCAAAAAAAAAAAAAAGACTC

CTCACTTTGAAAAGAGTGCCAAATGAGTAC^ 

TGTAGATTACTGTGTTTAATCCCATTTGAAATAGGTGCTTTGGTTTTTTAAT 

AGACAAGGCCCAGGTTGGATGCAGTGGTGTAATTGCAGCTCACTGCAGCCTCATCCTCCTGTGCT 

CCTCCCAAGTAGCTGGGACTATAGGCACACCAGCATGCCT^ 

TGCCTGGGCTGGTCTCAAACTCCrGGGCTCAAG

ATGCCTGGCCAATCATCTCTGTTTGTTTG T ' rrri 1 11 1 ITGAGACGGAGTCrCCCTTTGTTGCCCAGGCTGGAGTGCAATGGCACG 
ATCTCAGCTCACTGCGCAACCTC CGTCCCCCAGGTTCAAGTGATTCTACTG CCTCACCCTC CCATGT AGCTGGG AACACAGGCACC 
C^CCACCACACCTGGATAATTITTGTGTTTTTAGTATAGACGGGGTT 

AGGTGATCTGCCTGCCTCGGCCTCCCAAAATGCTGGAATTACAGGCATGAGCCATGGCCCCTGG 

ATGAGGAACTAGAAACTTAGGTGAGTAACTTGCT

TACTGTTAAGCCTCTTCCATTAATACCAGTTCCTGT^ 
TGATAATTCCCCCCGCCCCGCCTCACTTTTCATTCTCCTTTTCAGCTCCT 
CTTGCAGAGTCTCTGGCATTGCTCAC^GGCCCTCGAAOT 
ACAGGGCACTGGCGGCTGTATCTCTrTGGGGTGAAATCAGG 
AT GGACAGCCCTTTrTAAGTCTCCTAAACTAGAACCGAATTTGGCTCCCTTA^
TCATTATTTATATGAGCAAAAGGACACTGG

GCACACCCTTTGAACAGAAAGGTATGCACCTAAAATTGATGTATTTTGTAT^ 
AAAAATTTTGATTCTAAAGTCCAAGGTTGGTTGACTAAAAAGCT 
TTCTCTGTTTTGTCATTTCCTTGATTAGGCTAAAGTGAACAACTCC^ 
GTATGTGTTGCTTGTGTGTGAAATCGGlTCrrTATCTTAAAAAAATTGTGGTG

TGTTTATATAATACAGTGGGTATATATAAATGGATATATGTAAAGTATAATTAAATGTTTATAA^ 

ACATTTGGAGAAT ATG AGATGT AGCAAG CAGGAGGAAAAAGGCTGAGGCT CTTTG CT ATTTTCTTC CATCCTTTTT 

TCCAGTGTGGACATAATGCTTGATACTTATTTCTGGCAGATTCT 

AAATAGGAAAAGTGGTTTGCAGGTTTCCTCTGCACAGTTGGCTAGGCCA^ 
TTKIAACCCTGGAAATCTAAGACTACCAGCTACTCCCTGAA^

GTACATGCA.CACTACACATGTGTGTACACTATGAACATGAACATGTAAGACCTTAG^ 

GGGATGATTGCCAAAGAAGCTTACTGCATGCTACCAGTGCTATAGACT 

GGGGTTCCAGAAGTAGATTTGGGTAGGCTCTCTGGGCGGTGGGATAATTATAG 

GTCAGATAGTGCTAAGTAACAAGGATTGAAGGGCCATGTAGTTTGGACCCTGCTGATAAT^ 
TTAATCTTTCTTGTCATTTCTTTCCTCCATTCCTCCTCCTTTCCT

CAGCTCTTCTGGATGGCAAGAACGTCAATGCTGGTGGCCACAAG 

CAATTGTTTAATTTTAAACTATTTTGCAGCATAG 

ATTGCTAAATATGTTAGCCCTCCAGGTTAAAGTTGATTCAGCTTTAAGATGTTACCCOT 
AAAAAAGGTCCTTTCAGTGGTAGACTCGGGGAGAACTTGGTGGCCCCTTTGAGATC 
CAAGTGGAAGCGGATAATATGTAGGCACTTTGTAAATTCATATTGAGTAAATGAA
GGTTCCCTAACCCTAATTGATGAGAGGCTCGCTGCITGATC 
ATGACCTGTTCCCACCCCAGTTCATCATCATCrCTTTTACACCAAAAGGTCr 
TGGGGTGGAGAAGGTGGATGTGATGAAGCCAATAATTCAGGACTTATTCCTTCTTGTGT^ 
AGTATGAAATAGCTTCCAGGAGCTCCAGCTATAAGCTTGGAAGTGTCTGTGTGATTGTAATC^ 

AAATTGGACTTCTGTTGTATTCTCACCACTCAATTTG

ATTAGATCCTCCCCTTCAAATGCTGTAATTAACAACA.CTTAAAAAACTTG 

TTTATTAATAAAATATAAATAAACAGGGCATTTGTTTAGATTTATTTTGTCr^ 

CTCACTCTACAACAGATTACCATATGTTTAGCAGCTTAAAACA 

GCTTGGCGTCCTCTITCCAGGCTTTCAGGGTTGTGGTGGAATTCAGTTAT^

GCTGAGCCACTCTCAGTGCTCAGAGGATGCCAGCATTCC^ 
CCTCATGTGAAATCCCTCTCACACTTCCAGTCrCCTCCAGGTAGAGCCCCAACCCT^
AATGAGGGT AATCT CCCTTCAAGTT AACTG CCAGGT AAAGT AAC CT AATCAC TGGT AATAT CCTTTCATGTTTGTAGGTCCCTCCA 
TATGGCTCATGGGAAGGGGTGATTTTGGAATTCTCCCATAT 

AGGCGGAGTTTCATTCTTGTTGCCCAGGCTGGAATGCAATGGCGCAC^

CAATTCTCCTGCCTCAGCCTCCCCGCCTCCTGGGTTCAAGCAATTCTCCTGCCT 
TGCCACCACACCCGGCTAATTTTGTATTTTTAGTAGAGATGGGGTTTCT 

GGTG ATCTGCTCACCTTGG CCTCCAAAAGTGCTGGGATT ACAGGCGTG AGC CACCACATCCAGC CTATTGTGG AAAC^CA^ 
AATAAACTCTATGCATGATCCAGCTTCAACAAAT ATG AACATTTTTT^ CATG ATTTTTCAG C AGAATTCCTTTTTTTTCTA 

TGTTTGAAAGTGACATTCAAAACAGGAACCATCTATTTTAGC^

GTCATACTGTCITCAGGGACTTGCAGCTATTTCTAAGTGAATGTTC^ 

GGTACATCTAGT CAGATGTCAC^ACAGTATTAAGATATTTT ACTTTT AG AG ACA^ CAC 
TGTCATCATCATAGCTCACTGCAGCCTTGAACTCCTGGGCT 

GCAT AAG CCTGG CC CTAAAAT ATTT AAAAATGTAT AAAGGG TTCTCTTGAGT ATGAGGGAAT ATTTC CCT TTACTT ATTG 

TCTGCTTGTTGCATGCAGCATGTTGGAATATCATTGTGCAAAATGC^

AATCTGGTATGAGGACACAAATGGACAAAGTGGACAATGAGTCAGGGAAGAG<^ 

G TCAACAAG CAG AAACT AAGATTG AG AAG CTCATTT GTTGAACTGAGTGGTTTTACAGAAAACTT AAGT ACCGCTC^ 

AG CT C CTGGGGGACTGGG AGG ACTTTG CACCTTGGCT AAAGTGGAGAGTTTGG AAT AAGG ACACCATTCAGTCT AA CTT C 

TGGGGGATTTCGGGAAATGTCCAGATGAATGACTTCTTAGTTTTCTTAA 

AGATAAAAGCTTAGGGGAAACCAGATTCAGGAGTAGAACACAGGCCCTT^

G CAAT CTTCG CAGGG AAT C AT AGAGACCAGAATGGTG AAG CTGTGGG CACAAT AAATGTCTTG CIT AACGTCACATTG ATTTTCCT 

GAAATTGATCAAATGGAACTTTTATGAATTGAAGCCTGTATCTGTTT 

TATGGCGCCAGGGCCCTAGGTGAACrrCTTGGAAGTTTG 

AAAGAG AAG CTTT ATTTT AGGAAATCTGAAAGTT ATTTTC AATGG C AG C C C CT CACCTCT ACTT C CT AGCTAC CTTCCTTT CTTTTC 

CTCCATAATTTTTTTTTTTTTTTTGAGACGGAGTCT

TCCCGGGTTACAGCGATTCTCCTACCTC^GCCTCCCGAGTAGCT 

AAGTGCTGGGAGCCACTGCACrAGGGCCGGTCCATAATATCI^TATCAT 
CAAGAAAATAAACTCCCAAGGCGGGCGGGGGGGGGGGGGTTCTITT^ 

TTTG CTG AAAAAAAAT AAATGACTG T CGG G AAAG ATTTTT AAAAATT ACTT AT AATCC CATCAC CG AAAT AGGACT ACTT AACATT

TGTGTTTCTTTCGCCnTGAGAGAGGAAGAGCAGTTACT^ 

ATGGCTAAAAACGCTATTTTTGGCTGTATAGTACCCAATCATGC 

ACACCCTACCTTTAATCCGCAGCCAACTGTGTGGCGGTCCCCACrrrCGCCTA 

CTTATTCCGGTTTCCAGAGCACGTCTCCAGGCTTCG CGGCGTCTGACCTCA 

TGGCGTAGAGCCTAGCAACAGCGCAGGCTCCCAGCCGAGTCCGTTATGGCCGCTGCCGTCCCGAAGAG^

AGaSAAACTGCTCCCOXSGTCGGCCATCCAAGCCCT^ 
CTCTATCCAGTGGTAAAGGGGGGCGGCAGTCTGGGCTTAGGAAGGCTAG^ 
GGGAAAGGGCGCTTCCAAGTGTCAGGAGCAGACGGCGGTGTCTGGTTGGGTCAGGGTCT 
TGTCTGAAGGGGTCCAAGGTGGTAGGGTCTCCAAGGGGCCTGAGAAGCGGGGTCCCTGAGAG^ 

GCAGGGTGTGTAG ATGGG CGGGG AGGG AGG AGTGGTGTT AG CG CCT AGAG ATCGTTGAAGAATG GG CTG CCGCAAGG AATGGCGGT

GGTGGGGTCTCCAAAATTTCTTGGATCC^AAAGTTAGGAGACT 

TGC^AGAGGTCAGTGGTAAAGACCGAGGCGGGCAGGGGCAGTAATCCAGTC 

AAOTGAJLAOGC A A A G AGCTGT CTT AGGGTACTGG TG CAT ATTTGAAG ACTGGG AGG TTCTGGTGTTTGATAAAG ATT CAGAGT ACT 
GGGGCGITGGGCCCCTGAAGGCrTGGGAGGCAGACAGGAGT
TAACCAGGATAACCCTGCCTCGTTTTAATGCAT

AATCAAGAGATTGAAGGCTTTTATTGAGAAATTAGG^ 

AACTTGTAAGGATTAAACTGGTATTTTGTTTAGTA^ 

GAATTTTCAGCTGCAAAACAGTGTGACCTGGTATA (>l t!nT T TCTTTC 
CAGTACTGTGGGTACATACITATGTTTCCTGAGCAACATT^

GAGGAAGGACTGCTGTC CATTT C CT AAGGC CCTTTT ATCATT AGTGAAATGTAATG AGGATTTC CCTGGTTGATGCAGATATTCCC 

GTTCCTAAAGT ACTT AAGT AGCTT AATTGT TTTGGC CATTT AAT CATCT AAGAAAAATAAAACAC7UUVCATTT AT ATTTCCTG CTG 

GAATG CTT AAGGGAT AT AT AT CTGG CAACTGT AGTTTCAG AAAGAGCTIT1TT C CCCTAAGAGATGGTTGAAAATTGGG CAAG AAG 

TGAGGCAGAATGTGGACTGTT'AAGTGTCCTTTAGTCTCTGCTTr^ 
ACATTTTAGAGAATCTTTATCTTTTAAATTAA

AAGAAG CT AAATGAAAACGAACAT AC ATGTCTTT AATAGG CGAG CAG CTCCACTG CACT ACAGTGT AGCTTGGG AT AT AAGTATTT 

TTTGT AAGCAT AC CTT AT AATTTCAAAGT AGT CACT CCTGT AAT AAGAAAATT AG TCT AG AATGTGTTGAGTTCCAAATTTCAAGT 

GAGTTTACAAAGGTAGAAGGTTACCACATTAGATTTATTTAAATT^ 

ATGCTCGTCAGTCTGTAAGAA ITITTAJITIT ATGT^ 
GTGACTAATTCrTGGTCATGCCATTTCAAAAATGCCTGGTTCTCTA
AATGTATAATGCATTTCATTTTTAAGGCATTGTTTCT

AAC^AAATGCTGATATAGTTTTATTTTAGGGCAGTTTT^ 

TTTGTTTTTTTTTTGAGTCTGTCGCGAGGC^ 

GATTCTTCTGCCTC^GCCTCCCGAGTATGCTGGGACrACA^ 
GGTTTCACCGTGTTATCCAGGATGGTCTCAATCTCCTGAC

CCTGAGCCACTGTGCCCAGCCTTTTTTGTT^ 

ACTTGGCCCATTGCAACTTCCXXrCTCCTGGGCTCAAGCAGTCCT 

CACCGTGCTCAGCTGATTTTTGTATTTTTGTAGAG^ 

GATCCACTTGCCCTGGCCTCCCAAAGTGCTAGGGTTACAGGCGTGAGCCA 

GCTTCAAGTAGAGAGGTAATATCAATTAGGAAAGGGAAGATTTAGAAGTCT

CGTAAAGTCAGAGTTGTACCTTTGTTCTGAATACTCTTCATATCT 
TGATAGCCTTTGTATTACATGTTTCAGGTCAATTATTTGTTTTTTCTTGTGCT 

TG ACTAAGACATTCTG CTCCT ACTTGGTG CT AGGT ATTGG AGTT AAACAATGAAT AGGGTGAT ACTGGGGGT ATC CTTAG AAG AAT 
GAGTGGCGTTTTGCTGAGAACCATGGCATGGGCTCTTGAGGGAG 

TTGTGGTGAGAGTTGAATCGAG AGCAT ATT C AAT AAAT ATTT ATTG AG TG CCTG CTATCT ATT AAGT AGGT AATGGGGATAGAG CT

GTTAGGG AG ATT AG TAAGAT AATGAGAGTG AGGTAATGGGGG CCAG AT CATGT AGG ATCTTG AAGG CT ATT AT AAAAATTTT CACT 
TTTGTGAGAAATGGATGGCCCTAC^GGGTCTTAAGTAGA 

GTTGAGTATAGACAGTAGAAGTTAGGGTGGAAGCAGGCAGATTAATTAGGAGGCTAT 
TTGGATGAGGCAGGAGCAGTGGAGGGTGAAAAGCCCTCAAATGCTGCATATCT 
AAATGACAACTCCAAGGTTTTTAGTTTTAGTAACTA(2AAAG

TTTTGTTTTTTGAGTGGGGCATAGGAATTTATTTAAACTTATTTTGAGATA^ 
TAGAGATCCTGCATATCCCCTACCCAGTTTTCCTAATAGTAACGTCTTTCAAAAC^^ 
ATTGGTACAGTCAAAATACC-GGGCATTACCATCACCACAA^ 
CTCACCCCTCCTGAACTCCTGGTAACCACAAATTATCCATTTGTATAAGTTG^ 

CAGTATGTAACTTTTGGGATTGACTTTTTTTC^

GTTCCTTTIT AAATTTT AACTTTTA'ri''lT''l' T CGAG AGT AAGTCTTG CACT C CAGG CTGG AG TGCAATGG CACG AT CTCGG CTCACT 

GCAACCTCCGCCTCCCAGGTTCAAGCAGTTCTCCrcTCTCAGCCTCCAG^ 

CTAATTTTTGTATTTTTGGTAGAGATGGGGTT^ 

TGGC CTCCCAAAGTG C C CAGATT ACAGG TGTG AG CCAC CAG T C CTGGTTG TTTGTT C CTTT AAATTGCTGAGT AAT ATTCT ATGGT 

ATAGGTAGACTACACTTTATTTATCTGTTGAAG^

TGTTTGTG CAGCACTCCAGCCTAAGTGACAAAGCGAGAC CCTGTCTCAAAAAG AAAAAAATAAAATGAGCTTTCTATGG ACATCAT 

GTAGTTGAGTCTGCCAGTCTCTGTTTTTTAATTGATGTATTTAG 

TCTTGCCATTrrTATTTTTTGTCTTTAGCCTGGG 

TATCCTGAGGTCTCTAACTGT*i'CTACCCAGTTCTTTCCACCrTT 

TTAGTTGTTTTTAGTGGGAGAAACAGGGAAGAGTGTG

AATGTCAGACATCTAAGAGGAGATGTCAAGTAGGTGGTTGGATTTA^ 
TTTTCTCAGAAAAGTTAGGAAGTGATGTCATCAGTTGAAAATGAGGATAACAG 

TGTGATTGGTGTCCT AAGAGAGT AG AAGGGTA7ATGGACTAGGAAAATATG ATTG CTTT AAAC 

ATACACCTGAATCTTTGGCATTGAATTTAAAATGAAATGAAAACAGTGGC^ 

5 5 GGGTGCAGATGTGJ^TAGTCAGAGAGCTTGATTTA 

CAAGGGGGAAGGGAATTGAAGATGGATGTAAGAGAGTGATTATAAGCAATTATAATAGCTGAC^ 
GAGAAGAGAGTGTATTGGGGAAGTTTTGATGGTGACCGTAAGGGAATGAAAAGGATTAGAG 
ATACCAGGGAATGAACTGGAAGGATAGGAGGTCTTGCCAGCAAGTAGAATGCZATGAAATTGAGAT^ 
AATAATGACTCTGGGAGTGAGTGGCTGAGGTAJUVGAGGAGGGCrAGCT 

6 0 CAGGAGTACTTGTCTTAGATGGAGTGAACTCTAGAGCCAGGAGCTAAAGATCATTAAG 

TGAAAGTAAGTTAGTGCATTATAGCTTAAGATAAGAGAATCAAAGCTAGAGGCATT^ 

AGGGCAATGAGGAGGAGAACTGTGCCTCCTTGAGATTCAGTGGTTCAA 

GAAAGGAAAATGTGTCCTTAAGGGAGAGTCAGGTTTCAGTAATAACAAGAAAGCA 

TGAATTCTATACTGCGTGGTGGAAA TTTL ' l I TATAATTATTCCCACCCTATCAGCTTGTGTCCTTTAAAGGTACTTACCCATGGAA 

6 5 AAATTGTTCTTTTAAAAGTTAAGTTTAGTTAGTTGGCTCCCA 

TTTGGCAGGAAAATTAAATGTATCAGAAAATAATAATTTTTAAATTCT^ 

ATTTTCAGAAATTCCTGTTAGTAGTTTTCTGAGTAATGTACIGAGTTGTAGGTCCT 

TTCAAGG 

HUMAN SEQUENCE - mRNA

ATCCTCTGAGAAGATGGCTGTCCCACCCACGTAT 

G CTT AATAAAG CTT GATTTGAAAACAAAAT CTGAGAATGG ATTG GAATTTACAAG CT CAGGCTCAGC CAACACTG AG ACCACC AAA 
Gl^CGGGCAGTCTGGAAACCAAGTAC.^^ 

7 5 CACCGAGATTACTGTGGAAGATCAGCTTGCACGTGGACTGAAGCTGACCTTCGA 
ATGCTAAAATCAAGACAGGGTACAAGCGG^

GGTGCTCTGGTGCTAGGTTACGAGGGCTCX3CTGGCCGGCTACCM 

CTTTGCAGTTGGCTACAAGACTGATGAATTCCAG 

AAGTGAACAAGAAGTTGGAGACCGCTCTCAATCTTGCCTGGACAGCAGGAA^ 
5 CAGATTGACCCTGACGCCTGCTTCTCGGCTAAAGTGAACAACTCCAGCCTC 
TATTAAACTGACACTGTCAGCTCTTCTGGATGGCAAGAACGTCAATGCTGG 

CAT AAATGAAT ACTGT ACAATTGTTT AATTTT AAACTAT TTTG CAG CAT AG CT ACCTTCAGAATTT AGTGT AT CTTTT AATGTTGT 

ATtSTCTGGGATGCAAGTATTGCTAAATATGTTAGCCCTCCAGGTTA 

AGAAGAAACCTATTTCCAAAAAAGGTCCTTTCAGTGGTAGACT 
1 0 TT ATCT AGAAATGGCTGCAAGTGG AAGCGG ATAAT ATGTAGGCACT^ CAT ATTGAG TAAATGAATG AAATTGTGATTT 

CCTGAGAATCGAAC CTTGGTT C C CT AACCCT AATTGATGAG AGG CTCGCTGCTTGATGGTGTGT^ 

TTTTTAGACAGATCTTCATGACCTGTTCXZCACCCCAGTTCATCATC^ 

TTTCTTTTGTGCCATTTTGGGGTGGAGAAGGTGGATGTGATGAAGCCAA 

TTTGGCCCTTGCACCAGAGTATGAAATAGCTTCCAGGAGCTCCA 
1 5 GACAACACTCTVGAATCTAAATTGGACrrTCrrGTTGTATTCTCAC CACT CAATTTGTTTTTT AG CAGTTT AATGGGTACATTTTAGAG 

TCTT C CATTTTGTTGG AATT AGATC CT C C CCTT CAAATG CTGT AATT AACAACACTT AAAAAACTTGAAT AAAATATTC CT C 

HUMAN SEQUENCE - CODING 

ATGGCTGTGCCACCCAOSTATGCCGATCTTGGCAAATCTGCCAGGGATGTCT^ 
2 0 TGATTTGAAAACAAAATCIX1AGAATGGATTGGAATTTACAAGCTCAGGCTCAG 

TGGAAACCAAGTACAGATGGACTGAGTACGGCCTGACGTTTACAGAGAAA 

GTGGAAGATCAGCTTGCACGTGGACTGAAGCTGACCTTCGATTCATCCTTCTC^ 

GACAGGGTACAAGCGGGAGCACATTAACCTGC-GCTGCGACATGGATTTCGAC^ 

TAGGTTACGAGGGCTGGCTGGCCGGCTACCAGATGAATTTTGAGACTGCAA 
2 5 TACA^ACTGATGAATTCCAGCTTCACACTAATGTGAATGACGGGAC^ 

GTTGGAGACCG CTGTCAATCITGCCTGGAOVGCAGG AAACAGT AACACG CGCTTCGGAATAGCAG CCAAGTATCAGATTG ACCCTG 

ACGCCTGCTTCTCGGCTAAAGTGAACAACTCCAGCCTGATAGGTTTAGGATACACT 

CrcTCAGCTCTTCrGGATGGCAAGAACGTCAATGCTGGTGGCCACAAGCTTGGTCT 
Table 50

MOUSE NOMENCLATURE 
ICSGNM Nfatc1
Celera mCG18180


HUMAN NOMENCLATURE 
HGNC NFATC1 
Celera hCG15098 

MOUSE SEQUENCE - GENOMIC

CAAACTCCAAGGAACXTTAAATTCTAAATCGAGTCTGCCAA 

TGAAAGGCACACTAATCC^GTAAGTGCTGGAAATGACGTCCCAGTGGGGGCTTA^ 
GGGGATGTGCTATTCCAACTGAGACTGCAGGCTGTG^ACTGAAAGAAT^ 
GCAAGAGCATATGCACAGAGGGAGGGAGGGAGGGAGGGAGGCAGAGGGGTGCTGAAACC^ 
1 5 CACCAGGAAAATCCCAGCAAACTCCAAGGAAGGTAAACTTCAAGAGACCAGC^ 
CAGAAGGTGGGGGTATTAACCCTTGCAAGGGAACCCCAGTAACGTTAACATCTGA 

GTATGTCCAAAGGGCACTAAGAAAAAACTTGCAACAGAC CCCTGTGCTCAACAAAGCrrcr'r'I CTAACTCTTAGACG AATTAAAAG 
CTTTC^JCACAG AAACAAAAG CCAAGAAT ACGCTG CT A^ CGGGAATG CTG 

GGACAGTAACCTGCACCTACAGGAAGCAGCACGTCAG<1AGGCAGGCAC^ 
2 0 CATGACTCACTTAAAAAACAATCATATAAAACAATGTCTGTATACTC 

GT AAG ATGTTTG C CAAT AACAACAAAT AG TGT CTT AAGGGACTGCAG AAGG CAAAGG AACAATT CC CACAATGCACCATG AGG TTT 
GTCTGTATCGCTTTCTGTAATGCTGTAGAACATCACACCACTGGGCAATGGGAGAACCATATAGAAAT 
TCAAATGGTCGCAGGGTAAACTAAAACCCAGCATGAAGTCACGTC 
CCAGGGAAGTACCTTCTCACCAAACACATCCTGTAATGCAAAGAAAGC^ 

2 5 AGAGAACACGCCAAGTTCAACTTCATTACCTGAAACTGAAATGTGAACAGATTAACATT 

CGAAGAGTCAAACATTGTACAACTTCATGCTGTCTTCTGGAGATAAGCTGTAGGTGGAGAGGTACAC^ 
ATCATCCATCATGCACAAAGAAACTCCAGGAAAGCCAGAGTGGCT 

AGTCAAAGACTGCTGAATTGGATGCTCAGCTCCCTCCAAGCCTCGTGTATTGAAGACATGGTCCCTAGAGAGGGGTTGGA 
GAGAGAGGGGGAAAACCTTTGGGCAGTGAGACTACCTGGTAGGAACCCCCATTTCrc 

3 0 ACCCCAAACCTCCCTTCCTTTTGCTTCTCTGCTGAGATGAGGTTGTTGTGGGCTGTCCCATGG 

CCAT AGGTCCT AAAG CAATGGGGGC C ACTCAAT AATGG ACTGAACAGG C CAAAAAAATG T AAATT AAAAAAACCTT ATT T CT T CAT 
ATGTTAATCATCTrTGTGTTTTGTTACAGTAGCCGAAAGCTAACTGATACAGATTAAGAGG^ 
AGTGTCTGAGAGACAGAGAGAGAGTTCTAAGTGTAGGTGTGCCTAACAACACAGTTCCCAAAG 
CTTGGTGAAAGCCTTCAGAGT(^GGGCAGGTTGACTCTGTCAGGGGGTGAGCGTGACTC 

3 5 QTGGCCACTTCAGGCAAAAGATGAGGAAATTGAAGAACACTAGGTAT^ 

ATATGCCCTGGGAGAAAGTTATTAATAACTGACTGATTAATTGATTGATTGTGGTAT^ 
TCIAATTTCTCATCATGAGACAAAATACCAGAGACAGCCACCTTAAAAGGATGAAAGCTTATGTTA^ 
TTAGCTCCACTGCCTTTGGACCTATGGCAGAGCAGAACGTTCTGGCAGCAGAGCACACTGTGGGCAAAGCTGGTGAC 
CAGAGATGGAGTGTCTGGTCTCAGTATGCACTCCGAGAGCAGTGCCCATCCCAT^ 

4 0 CAATGCTCCATCAG CCAGAAAACAGGTCGTTTGC CATGGCCTTCAAGACTTGGG'i"! TTTTTTGGGGGGT CAGGGGTGGGGGCACCC 

AAACCTTTAGGAAATGGATGTTGCTCTGGTTAAATCAAATC^ 

AGGAACCCCAATTCTGTGGTCCTCCAAGATACCCCAAGGTCrGAGAGGGAG 

GTCAGGCTTCACTTGACTGTTTTTTTGTTTGTTTGTTGTT 

TTTTAAGCACTTTAAAAGTAACCTGACTGGGCACCAATCAATCAGTAGGCAGCACATCAAGCCAGG 

4 5 CCAATCAACTCAAACCAGCCAGGTGTGGTGCATGCCTGTCACCCTAGAACTGGGAATATTA 

GCCTGGGCTACCTGAGACCCTTTTTAAAAAATGATGCTCCTGAATTCT 

AAGGCCCAGTTTCCCAGCCTTCCCAGTGCCTCACAGTCCTGGGCCTCCTCCCTGTGTTGTGGAGTGA^ 
AAGCCCCAGGTGAGGTAACAGGAGGGAGTTGCAAAGTAACCATTGCTGTGGCCCACACTGCAG 

AG CT ACTGATGTTTT AC ATGGTTT ACGTAAG CAAGCAAG ATGT ACATGGG TCTGGG GCTTGACTGT ACAAAG AAG CCCTC CCT AGC 
50 CTGGTGC ACCGTACCTGC CCACAGTGCAGTGTGTACACAGCCAAAGGAGG CTCCAGGTTGCAGGACTGAGAGTCGTGCCCCT AGCA 

ACACAGAACCCTGCITTTGCACTAAAAGCCCATCTVGATCTTGTTGGGGTT^ 
AACAATTGCCTGGCTCITGGAAACrTTGAGCGCTC 

CTATTGGArTAAGCTTCTGGGGTGCATTTGCCCCAATAATCCACAGAAGCAACA 

AG CT C CTTTT AACTTTGTTAGTC ATCG CC AAACCTT AAAAT AAAAC CCGTGT AACGTCTCTCCTGGT AG CAGCT AT AAAC AAAC CT 

5 5 CCTCTAGAGGAGGCCGAAGGAAGAGGAAAGAGGCGGTAGGGACACGCAGGG 

GCCGCAGGGCCTGGCKXLACAGCTCAGAGCTGGCC^ 

GTGGTCC CGGCAACGG CATTAAACAGAGGGAAACAGGCCCGGGATTC CGTCACCCGCGGGGGATAAGGAGGGCATTGAGACAGACA 
GGAAAGGGCTTTTCTGGATGGGGTGAAAAGTTATTTATTGGAGAGGGAGGGGGG 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNN1*NNNNNNNNNNNNNN1JNNNNNNN^ 
NNNNNl^NNNNNNNNNimNNNNNNNNN^ 

' TCTGGCGACACTAGGACTCACTGAAGAATGAATTTGGGGGAGGGGGGTTGGAGGATGGTT 

CTAGCGGCAACCTCTTTCAAGGAGAGCAGGGAGGCTTCCGC^ 

6 5 ACATCCCAGCATCTTCACTGAGCATTCAGCTACTTAATAGATCATGCT 

CTAGACCACCTGCGTGAGTGG CTCCATCCTCTCAAAG CCAGAGGTAG AGAGGTG CG CGCCATGCCT AGCTCTCCATGTGAGTAGCA 

GGCCAGTAGGTAGTGCGCCTGAGACATGTTGGGCACTGCCTGTGGCTATGCAGCATCCAC 

AACAGGATGXjCTTTTAGTTCTGTGGCGTTGAGGCAACTAGAAAGAGCCGCTAGATGCT 

GTTTTCATATTGTTT AACATCGAG CCCAGTTGGCCAGCTCCATTG CTGGTTGGATCAGATTCrCTAAGTCTTGGGGTGTGAAGTAC 

7 0 TTTGGAGCACCTACTC^CTCACTACACGCCCCGCGCGCCCCACAG 

TGAGTTTATTTTAAAGCATATGAGGTGTAGGCCAATAGATTTATGATTTCTATACAC 
CCGTTCCXSTCCAAAGCCAACAAGAAAATATGGTCGGGTTTTG^ 

GAAAAAGTGTGAAGCGAGGCGGGACAGAATCCTCAGAAGTTCCCCCAGCACCAGCCTC 
CCTACAAAGCACCCtaTGGTCCCTGCACTGGTCGT^ 
7 5 NNNNNNNNNNNNNNNNNCTGGCCCTTCGTGCCCGCTCCCCX3CGCCCC 
GGCCGGTCT^CGCGTCCAGGGAGGGGCGCGGGCGTCCGGCCGGACAGCGTGGCACACACTGTGAAGGCT
CCGGCGCAGCCCAGATTTCTAGCGCGCT^ 

TTGG AG ATACGACATGGGGAAACTTGATT CACGGGGG CGTTGAGATGTGAGGATGGGG CG CAGTGGGGG CGGGAG CGCT AGAAGCG 
AAAACAAACAACCAAACAAAAACACCCTGGGATGAAGTG^ 

CAAGCAAGTCCATGACCAGAAGTATAAGAGTCCTGCATCCTCCCAAGTGAGTCTAG 

TTGGCAAAGTGGCCCCAAGTGGTCCAGACTTCTAAATGCCTCTCAAACTCG 

TAGAAAATGCCITGAGAAAGATCGAAGACGGACTGCCAGGTTTGAGGTTG 

ACCTGTCAGGGTCACAACAATGACATGACCCAGACCrTCTGTCAG 

TAAAACGTTTAGGCTCCTGCAGGTGTCTTTGAAACACCCTTTAAAGAG 

AATGGAC CCCAGGAG CCAT AGGG CT CGT AATT CC rrrri 'GG CACCAAAG AAGTT ACTT AAAATT ACTGAAGGC CAG CTGGACC CCT 

GCTGGGCTGGGGCACTCCTATTTACAAGGGCGGCCATCCTGTAGACAAGCACT 

ATGAGTGGGTCAGCAGGCAGAGGCAGCCGTTCCCAGAGTCAGCCCTGGACACCTG^ 

CACAAATGGATCCAGCTTTTCAGAGCCCAGGTGTCCTGGGTAGAGAGTCT^ 

CTTTCATGTCGCTGACCTCAATTCTAAAAGGTCTAAATGTTGCTAAGTC 

GTGTTTTTGTTGTGGTCTGGGTCGTGAATTTCTTGACATTTA 

TACTTGGAGGCCTTAAAGCTTACGTTCTTTACrTGTAGCGTTACTGT^ 

TCAC CCTCG T AATCCT AGG CG T AAT CTCAGCAC CTG AG AAACT AAGG CAGG AGG ATTGC CATGAGTTTGAAT CCAG C CTGGGCAAC 

AAAGO^GAACCAGGCCTGCCTTGGGCTACAATGTAAGACCCCAAGTCCCAAA^ 

TTCCrrCCTTCCTTTCTTTTCTTTCTTCCI^ 

C Cl TT Cm ClTll 1 1 1 C T T T T CCrTTTCTTCrClTTrCLTrL T lTC'TTT'l 111 1 rCTTT 111 U Ten GTTTTTTTGTT'rrTTTG 

TTTGTTTGTTTTTTTGTACAAAAGGGAAAACTATTGGGCAGA 

GACAGTGTCCCCAATGAAGCAGTGAGCTGGAGGCAATAATGACCGAATGATAC^ 

CTGAGCTAACTAGAAACreTGACCCAGTCTTGAGACAG CCAGACCCJU^CAGGGGAAAAGAACTGTATG CCGCAGGTCAGTATCTG 

CCATTGCTGTGGGCCGTGGGAAAGCACCTTGAGGAGGCCTTC 

GCATTTGGATGGAAGCTCATGCGGCCTGGCCATTTTTGAAGGCCTTGTGCT 

GTAGGTAGTCACAGCGTCAGAGTTGCCTCACACAATGCGCCACAGGCTCTCCGGAGC^ 

TATAGCTTCATGGCACTGTTTCITAATCCAGATTAGAAAATGCTGCAGCCAT 

GCTGGCCCATTAACATCTIGCACAGTTAGGAGGAAATGG^^ 

AGACCAAGCTTCATACTTATATAGAGGACTGGCAAGGGGGCTGGCAGAGGCACA 

CTCCTGTGTTAGAGAGTAGACAAGGAGGGAGGCAGGCCAGCCTCTGAGAGAAAGGCCTTG 

CTGGGGGCATGTGGCCAGGGGAAGCCAGGTGCTTTGTAACTTTAGCATAAAGCT 

GTGTGGAACTGTGGGAAAGGTGGATAAAGAGAGATCGAGGTCTCACGGCAGAAAACAGAACTTCCAT^ 

AAAGAAAAGTAAAACAAAACAACGCTCCAACTCGCATCTCAAAGTGAGAACTATTATTAGJ^ 

TAGTTTATCACTGGAGAAAAGCAGCCATTGAAACAGGGTCC^ 

AAAGCCATCCCCCAGAGTCCAGAGCTGCAGGGCAGGAGGACACTGGGGA 

CTCCTCCTTTCCTTCCTGCCTCCTTCCAGGGTTGGGTTCTGATACTGGTCTCCCCTGCCCT 

ATGTTGGTTTATGTGGTGCCAAGACAC<^GGAAAGTTAGAACAAAAAGCTG^ 

CCCAGCCTTTTCCTTCTGATTGCTGTGTGCATGGTACCCTGTCTGAATGGGTCACACCCAGCACCT 

GAAATGACCCCCTACTGAAGCAGGAGTGTC 

GATTTGCTTTGAGGTAGGTGCCCACAGAGAGAGGTTGGCCACCATGGGCAGACTTGAATGCTTCGGAAG 

TTGACCTGCATAGAGGAGAAGGCGACCCTCAGGAAACTCCACCCTCATTCT 

CTAGAAAATGACCCCACCCCCCAGCGTTGATTGCTTCAAGTTGCAGCGCTCTGAGGTGGGCTCTC 

TTTCCTCGGGCAGCAACCTGGAGGAAGCCTAAGAGAAGCCATGTTGGGAATATACATATA 

TCCCTCTTTTCTTTTCTTTTCTTTTCTTT T CT TTC 

TTGGTCTTTTTCCTGGTGCAGGGAAACAGCIAAGACCC^ 

GGGGACCGGCTGCGGGGCCGGGCGGCGGGTCCAGGACGCAGCTCCAGCCCCGGTCAGCCCTGT 

GGCCCGGCATAGGCTGCTGCGCTCCCCTGGAGGCGGTCAGCTGCGCAGAGGTTGGGGCGCGGACGCCAGGGCT 

TAGGTTCCTGCGGAGCGGGGGGTATCCTAAGGGAGAGCGAAGCCCCACAGGAGGCGGAGGCCCGAGGGAGCTGTGCGGGCT 

GGGGG AGCGGGCATGCGACAGAGGG CCGGGGGTGGACCTCGGACCGCGCGGT CCTGCGGGCACGAGTGGAGCCTGGG CTACGCTTG 

GGCTCGCGCGCGGAGTTTATTTAAAACTAGGAAGCCGGCGGCCGCGAGCCGCTTGT1TATGTAAACCCGGA 

AGGGGCGGGACCGGGGCGGGAGGCCGGGGAGGGGACAGGGGAGGGGACAGAAGGCGGGGGCCGCGTCACGCCG 

GCGCCGGGCGCACCAGTGCCGCGCGACCCCGACATGACGGGGCTGGAGCAG 

CGATCAGAGCGGCGGGGGCGCCGCGGNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 



NNmJNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

ACACCTCAGCTACCAGAACCTGAGAGCTITCGCTATCTTC 

GGTTAACAATCAATGTGCTGTTTCCCCAAACTTGAATCTCCTAAACGGTTGGCAGCCCATTCCT 
AGTCTGGCTCTTGTCTCCACCTGACTTTCTGTTCCAAGTTGCACACAGT 
CCCAGCCGCACTGGGACAGAACAGACATGACACTTGTAGAGCATCTCCGATGA 
TACTATTTTTTTTTT AAGTT AAACATCCTGTTTOCTTTTGAAGGTGAATG 
TGTCCGCCCATAAGACCCCTTTCGGATAGGTTTTCCrrCTTCAGGGCACAGGTGT 



TTGGGGGATTTGGATTTACCCGCTCTTAGTGCATGGGGGCGTTTITGAGTTCT 
GACTCTTGACTTCCCCTTAGGGTGTCCATGTTGGATTTGGATGAGTGTGTACCACrG^ 






GCTTCCTGTGACTCCTTACTTACACTGAATGGACAACT 

CTTTGGT ACCTCATGGTGG CCTGAGGACTG CTTGGGGAGAGG AGAGGGTATCAGGGATGCTG CAG ACT AGGTAGTCATCTCCACAG 

ACTCAGAACCTTCGAGGGAAGGGTGCrcCACTTGTTCAGGTATAC^ 

TCGACTCATAGCTGGACACAGCCGCTCTCACCGTGTCITTCT 

GGT CCATT CATGG AGGG CACAAAAGGAAGT CAG TT'TGGGTG C CAAAGG AG ACACTGG7 AGGGGC TCTGGTG AAG CCTGGGAAG CAG 
GTGGCrCCTTGAGCTTTGGATGAaSTCCTCCCCTTACGCATGTGAG 
GAAGTAGATGAGGTAGAAAGGTGTCTTGTGTCTCACT

ATTGCTGCACATAGGTGAGTGCTCCCTGGAGCTCCGAGTGCrCCCTGAACTCCCT 

CTGCACAGAGCTGTAGGGACATGCCTCTTGGATGGATGTCCCTGTGAACTC 

CTAAGTCCAACACCTTTAGCTTGCAGGGTGTAGGCCACTGGAGAG^ 
5 CCCTTCTCTTGTGTTGACAGAAGGTGCTAGAGAACATTGGACATGTACAATAC^ 

AGACTGACTGACTGACTGACTCrGACTTTTTTCTCT 

CTTATATTACACAGTCATCTGAAGTTACTAAGCTCTCnKr^ 

TAAACATAAATTCATGTAGGCCTGACCGATTCTGAGATCAGTT 

AACCCACATCATGAAAGCGGAX^TGGCTACCTCAGAGCACAGCTTACCTCCT 
1 0 CCCACCACACCACACCACTGTCTCCCAGCTGGGTTTCATCCAT^ 

TCTTCACAACTTTCTTCAGATGAAAGCATGTGAGCAGAAATATGACTGC^ 

TTCCTCCCAGCCAGGGCTCTTAGCAAGATGGCAA 

GACTGAGAACTACGAGAGCAGACACTGTCTACTCGCCCTCC^ 

TTAAAGTCTAGATGGTTACACTTAAAAATCCTGAGCTCAAAGCCT 
15 GCACTAGTTCCGCTrTCTTTACAGAGTANNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CAAACCTGTCTGTGTGAGCAGGATATGCrrAGCCCCGTATCAGGTTTTCAAA 
ACAACAAGGGAAAGGAATGCTITAATATTCACCT^ 
GACCCTTAAGCACGCTGTAGTGAGCTTATTTGCGGACCATTGGGTGACA 
2 0 TAAATCTCCCTTTAGCACATCGTTGCATGACAGTATTGAGAACCTGTTGCTGTCTGTCA 
CAGTCCTCATTAGGGAGCACTATCCAATGTGCTAAGTAAACACAAGAA^ 
TTTCATCCCTAAGTGTGGAGACTTCCAGCCTCCCTC 

GGAGGCCAGGACCTGGCTGGCAGGATCTTTCTGGTTCTATTTCTAGCGCAGCCC 

2 5 GCAGAGGGTCACATATATGTGAGCATCACATACACGTGGTGTGGCAGAGACACAACATTGTCTTC^ 

GCCTTAAGCTGAAG CACCTACAGGGGTATTTCCCAGCGGGTG CCACAGATTGCCCAGTGGGCAGCAGGCTrTCTCCACAAGTCTT A 
AG AAGACAT AGACCAGGGGCTGT CATGGTGTGCAATGC C CTGG T ACAG CTG CAGT AGT ACCAG CTTGTTGAATTTGCTG AGG CTGG 
TGGGAGCTGGCCAGCGCAGAGAGTTCTGGATTTGCTCAGCATCGCCTGCCrGCATCTCATCCA 

GTGATTGAGAGAAATGAGCTCAGAGGAATAGGAGGACAGACGGATGGACGGCGACACCTGCACAGGGTGACCGCGCC^ 

3 0 " TATTACATGGCATAAGAGAATGTGTGCGTGAGAGGTCTTTGCITCCAATTGTG 

CTGG CCCAGTCAGCAAAAGC CAGG CTG CGGGG CACGG ATGTTG CCC CGAAACAG C AG CAT ATG ATGTTCTCT AAT CAGCCT CAATG 

TCTCAGGATTAGTAAGGAATTACTGGGAAGCCTGGCATTCCTGATAGACCCC^ 

TACAGGTGGAGGGCTACCCCACAGGTCCAGGCT^ 

GCACTCATGCTCTGAGGACTCTGAGGAGAAGGCAGC CAGCT CTGTCACAAAG CGAGCGGCTCT AGGCATGTAAG CTTTT 

3 5 GTTTTGGAGTTTGACCAAGGAAATCCATGTGTAAGCTA 

GCCCCAGGCTGGTCTCTGACTTCACCATGTAGCAGGATAGCATCTAATACCTG^ 
GATGGGTGTAGGCGTGCTTATGTGGACATAGTCAGTGTCGGCATTGAACCCAAGCT 
GAGCTCTACCCCAAGCTCCAGTTCTGTTTTCTTAATAAACCGTTTATGGATTTT 
C^TTTTTGGTTAAAGTAAAGGCCACTCTTGTAATXn'CCTGGGGCCC^CGTG 

4 0 CCGAGCCCTGAGTGAGAAACCTCCTTCTTCCTCTTTCTCTGGGCTTCAGACACCCT 

GAGATTTGCCATGTCGTCTGCCTGCCACTCAGCCATTTTTCCCCAGTAC 

GTGT CAAGG C AG GACT CAG CTTGG CTTGCTG CT AGGACTTT CT AG AAAACT CT CTT CCTTCCAAAAG 

AAGGTGGTCTGCTGTGGGGCCAGGACTGGTCTGGATTAAGGCGGATC 

ACTGCAOSTGTGCCrCCTTTGGGArrTGG 

4 5 TGCTGGGCTCAACTTTCCCAGTGTGTGCACAG CTGCTGGG AT AC CCATGTTTCCCIATTAGCCTTAGCCTCAAACTTCCTTTTCTTC 

TGGGGTTGGGTTCCCTTAAGATGAAAACATAAAAAAAAAAAAAGATGTATTTATTT^ 

ATATACACCACATGTGTGCTTTGGTGCCTACCAAAGTCAGAGAAGGACATCTG 

TAAACCATt^TCATGTGAGTGCTGAGAACTGAACCCGGGT 

CAAACTTGAAAGCCCGTTCTTTTCCAGTGTTGTGTTGTTGGCAGTCTTCAGACAGTGC^C 

5 0 ATAACATTGGGAAAACCTGACTTCCCAGATGGCT 

GCCTGGTTACCACCACTCTCCAGATGGGTTCCTGAAGCACCT^ 
GATGTCCCATATCrGCTGGGGACTGTGCCCTGGGCTCTAGTATGGAACCCTTCTA 
GGAGAGAGCATAGATGATATCATCCTGAGGGGGCCTTTTAGGTGATGAACAGGTGO 
GGCAGTGAGTGTGTGAATTTCTGAAGACCTTGGGAGATGCTCAGAAG^ 

5 5 GACCGAGGCATGCTGCGGAGGGTGTAGTATCTCAGGCAGCZACC^AAGCCAGAAATC 

AGGCATTCITCCCGGACTGTGAGATTCCAGCATCAATGAAATCAATTCCTAA 
CTTTTTAT(^TTAGCAATAATGCAGTGAC!ATTTTGGGGTGCCCACTTTC 
ACTTCTTACCCGAAGATGAGCAGATCCCGGCCATATCTTCTGGGCCATGGAACATTGAATGT 
TGGGAAGTGCTAGGCCAGCTAAGGGAGATCrCTCCATCAGTATCrTrCTCrG 

6 0 GATGCATAGAAAACCCTCTTCTCTCCTTTTCTT1TCTCCT 

TTCCITCCTTCCTTCCTTCCTTCCTTCCITCCT^ 

GAGAACAGCAGGGCAGCACAGCTCTGGACATAGCCATCTGTTGTGCTACTCCT 

TCTGTTGGTAACITCACAGAGGAGCCAGACCGTCTGTTGGTCTCACOVACCAGCC 

TTTCGTATCTTCACCAACTGGGTTCTTTAGGATTGGTGTTCTTTTCTCACCATGGACAGT 

6 5 GGCTGCATTTTGGGATACCAGGGGTGGCCTGTGTCTGCTGGACCTGAAGA^ 

TCCAGTGCTTCTAGAAACAAACACTCTGAGCCTAGGGACTCGGCAGAAGTG 
TGTCTCTCCAGAAGTATTGCITTCTCTAGAAAGTGGCTTTAGAGTTATT^ 

AGATGCAGGTTCTTGCTAOSGGTTGCTTCCGATGC 

7 0 CGTCTCTGTGACCCAGGGATTGAGGACTACCGATGGCCTCTTACTGGACACTGGAGT 

GCTGTGTAGCCAGAGAGGGTTTGTGAGCCCATTAATGGTAGAAGCCTGGGAGAGAG 
GACTGGCATGCTCCTTACAAACCATCTTAGGGTTTGACCGCTGCTTCGAGACCCTTTTTCT 
GCCAGCGCTTGTCCTTGACCAOUjTATCACGCAGCTGGGCXSCT 
ATTCAGAGACCTGGAAATGCTTCTCTCCAGACTCGTGTA^ 
7 5 CATGGCTTTGCTGTCGAGTACTAATTATAACAATGATTTTTTA 
GTCTAGGGGTGTCTTAGGAGCCTGTGTTTTGTTTTAGACGCTCTAGAAGATAGATTCCCA
AAGAGCCACACTGWAATCCACATTGTCACCCTTCCTCTGGGTCT 
TGCTTGTTAACACCTATTTAAACACCTGGCTCCCTGCGTCTGAAGCACCT 
CTTCACAGTCTGTCTGTCCATCTTTCCTTCTGCCACTGCCCATCCATCAC 
5 AAGATAAACGTGTCATCTAGAATCTGAAATACCCCTGC^ 

TGTGCCTGGCACCGTCTGAGGCTCTGCAGACAGCCGTGAACCACAG 
GATAGATAAAAATTCCTGTTAAATTGTGATATATACCAGGGTGGTG<5TGGGGGAGGAGG 

AATCACAACTGG CT AGGGAATGTG CATTTTX3AAT AT AGAG AAGACAGT AAG AG CCAGCCATGTGATGTGAGGTGGAGGACATCTTG 
CCTGAGGAATGCTGTTGTGCTGCTATAGAGTAAGCTCTATAGTCTAGCAG^ 
1 0 GAGGCTACA.CTCTGGGCCTGGAGTCCTCAGGGCTGGCCCCAGGTCTTC 
TGGCCTCACTCr GTlTlT ClTCTTTTCTTCC^^ 

GCACACTCTGCCTTGCCAGCAGCATGCCACXSACCTCCAGACGTCC^CCC^ 

CAGTTACGGAGGGG CTGTGGACAG CGGGCCTT CGGGATACTTC CTGTCCT CTGG CAACAC CAG ACC CAACGGGGCCC CGACTCTGG 
AGAGTCCCAGAATCGAGATC^CCTCCTACCrGGGCCTACAC 
1 5 CTTCCTAGCTX5CAAGCGCTCACCGTCTACAGCAACCCTC 

AGCCAGCAGTCTCTCCTCCAGAAGCTGTAACTCrcAGGCCTCCTCCTACG^ 
ACCTCrCCGTGGCAGTCACCCTXSCGTCTCTCCCAAGACCACGGA 
GCTAGGATCGCCCAGGCACTCCCCATCCACCTCTCCTCGGGCAAGCATCACGGA 
CCACGTCCCCCTGCAACAAGCGCAAGTACAGTCTCAATX3GCCGGC^^ 

CAATCCCCTGACCACCGATAGCACTCTGGACCTX^ 
CTGTGGCCCTCAAAGTAGAGCCAGCTGGGGAAGACCTGGGCACCACT 
CAGCACCTTCGGAAGGX3T13CCTTTTGCGAGC 
CCCGACATCATATATGAGGTAAGCAGCCTCCGACTGGAGTTGGG 

2 5 G<KTTCTCTCTCC^GATAATGGTATGCTAGCATT 

GCCTGCTTGTTCCTCTTCTAAGGACACCTTGGACTAGACTAGGGGAC^ 
GCAGCACATGAGTTCCGTCCTGCrCTACAGCTCTGNNNNNNNhW^ 

TGTGTTGGTAATAAGCCGAGGGAGGCAGCTATCCTCCTGTGGATGTTCTGCCTGCTGCTCGC 
TCGCTATCTATCACCATGCTGGGGCAGATGGGCCCCTCAGCTCACAjGTCCCT 

3 0 CACXlACCACCACCTCCCGTCrTGCTGGCCGTCTGCTATCAGGGAGTGTGT^ 

TTAACGGGATGCAGGGGCTGCTTGCGATTTACCAGGTCCATCGAACCAGCTCAGTGCTTT^ 

CAGCTGTACCGCTTGCGACGCCTGGAAGGCTGCTITCTGCTCCAGGG 

GAATGTCaSTCTGAAATCAGTAGGGATGTTTCCATGAGCTGAGGCA 

3 5 ATCACCACCTGGTATTT<X5CCCCCTGACCTCT 

CGTGTCTTGAGGGGTCAC^CTGGGATGTGGGGTGGCCAGCTGGGAGCG^ 

GGCCCAAGCAAGTTGCTAGGTGCTAGGAGCCTGTCTTGGGCAGAGAGACTC 

TGCCGCATAGGCCTCTGTCGTTCTATCTTCTGTCCCTCCATCTCTCCTAGGAGGCCCAA^ 

CAGGATCAAAGTTT AT CT CAAAGACGGG AACCCCAAATC CAT AAAGAACTGTTGTG CCTG CCAT ACTGTCGGT AAG ACTT CT AGTT 

4 0 AACGAAAACCGACTTCTACTTATTTCAGCTCOWTC^ 

GCTCACAAAGACTTGGTGlGAACGCCTGAGGCCTAGGGAAGAGTCATGTACTAAAATATAT 

TATAAGTTAAGTGCAGATACGCACATGCGTGGTGAGATACGCACATGCGTGGTGAG 

GAGGGAACCACTGACTGGTGGCCTTTATGCCAG^ 

TGGGG CT AAG AGTCCAAATT AGAACTTTGGGGACACAG^ 
AATGTCTTCAAACATAAAATTrACCCTTTTTCCTTTT^ 
GTTTCTTGGTCTTGGGTTTTTATTTTTTATTATTATTA 
GCATTCCTGGTGGCCTACAGTTGGCTATCTGCCCAGCCCAACCTGCCAGTG^ 

5 0 CTACTTAGGAGCAGTGACCATGGCCCAGTGGCCTGTGATCCCATGGTCTTTATOT 

TGCTATGACATTCCTATGAGCTTACACAAACAAACACCACCCCATCCT 
CCTCGAAATCTGTTTCCTGGTCCCITTGAATGGGTTTCTGTAAACTCCCT 
AGGTGACGTGTATGGCTGTGACCATCACTGTGAAGAAGCCAGACTCCTTCCTC^ 
AGGAACTTGAAGGTATCCCTGAAAGGCTCTGTTCCCACATGTCCTCCAGATCTGGGCT 


GGACTCCTATTGTTCTGTA 
CTCCAGTCGGCTCTATGGGACAGGACAAAGAATGGCACCGAACTCCTGCTACAACCGAAGAAAT^ 
ACAGTGGAAGGGCTTTTACTGTTTTCGGCGCGGGCITTAACT 

TTGAACAT CACCGACAGTGATGACAATGGCAG ACACCTTCTGTG CAGGT CG AG C CATCTT AT ATG T CC CTGCTGAG CACGG CT AAG 

6 0 GAAATGGCCAGGAGTGACAGTGTCCAACAGCACGCTGGCACCTGTCAGTGTGTGTGGAGGGTGCA 

AG ATGACAGTCT C CAG T AGATTGTTC CTGG ATGT AGTTTTCAG ATGGT AG CTTTT AGGG AGTTGTG TG TGTAT ACACATTCATGTG 

TGCACTTACAC^CACACACACACACACACACACACACACACAC^ 

TACCTTCTTTTTTGAGACAGCATCCCTTATTAAATATGAAGC^ 

CG CTTGTTTTCCC ACC CACATT AC CGT<X5 CT ACAGATGTGT7 C CACAGCACCCAG CTTTT ATGTGACAT CTGGGGATC CAAGCTCA 

6 5 GGTCCTCACGGGAACATACACAGCAGGC!ATTTCACCATCATGCT 

GAACTGAAGTTTAAGTGACTGTCCCHTTAGCCATTCCGT^ 
GGTCAGCTCTGCACAGTAGCATTTTCTTTCCITCTAAAACAGCX3 
GACTTGCCTATAGAAAGGCAACGTTACGTGAGGAATTCGGATAAAGACG 
GTGTTCATXSACACGGAGACACATTTGCTGAGTTAGGTT^ 

7 0 GC4GCCGACTCAGCAAACAGCTTTTCCTCAGAG 

TCAGAACTTGTTTCTTTCAGGCTTTGGCTGTGATGTTXK^ 
TAGCTTCAAAAGGTCTAAGAAGTCCTCTTGCA^ 

ATG CCTG T AG ATCGG TCACAAT AAAT CCAG CTCT CATCTTCT C CACACTT ACTT AC CACG CTCATCAGGGGATTTTTG CGTTGGAC 
AGCTCnTCAGTGAATCCAGTGACCTTGTAGCCTGGGGAAGG(ZACG^ 
7 5 TGATGGCCCACCTCAGGACTCTGTTGGCCAAATTGGCCTGCCCATACTTCA 
GGTGGGCTCATTGGGTCCGTGATGGTCGTAGATTGGTTGCAAACTCGAGTCTAG^

G<^UIGTGAATCGGTGGCCACAGTGTGTCTTGTTCAGAGATAGGTAT 

CCGAGAACCCACATAGACTGCTAGCCCCAGCTAGCAGGTGACAGGCATA^ 

GGAGGTTGCCACAGATGAGTCCATGGTTATGCAGTCTGTGTT 
5 CTGATGTCCCTGCGACAGTTTGGGGAACGCACCATTAGCCT 

CCMATATCCCTGGAGATTTGGAGGAGGGGAGGGAGGGGTAATTTGG^ 

GAGGCTTTGCAGATAGAGAAGCACTGGGCCGGGTCAGGGCTCCAC^ 

GAACCTGCCCAGATGCTAACTGATAAAGCCAGGACAGACACTCAAG^ 

ATTCTTAGTTGTTCTGTG<5AGTCAGCCCTGGAATGTGCT 
1 0 CX3TGGAAAGAAGGACACCGTGGGGAGCTTCGGAGCCAGAGCCTGAGAGCTAACTC 

TTTGGGGTTGTGGX3TGCTGCATATAAGACCTGGGAGGTTTGTGTATGGAATTCT 

C^GGGTGACTGAGATGCTGGTTCATGTCTGTTCACAGTTTGAT^ 

TTAGCCTGGTGTATCAGTTAGGACTTTCAAGGAGGCTGTGA 

TCTGAGTACTCTGTAAAGAAAGCGGAACTAGTGCACTTCCCCCTT^ 
1 5 TCAGGATTTTTAAGTGTAACCATCTAGACTTTAAT^ 

GAGTAGAC^GTGTCTGCTCTCGTAGTTCrrCAGTCAGTTCrrGGAAGTGTTCTGCCT 

TTGCCATCTTGCTAAGAGCCCTGGCTGGGAGGAAGTGTCCAGGGCCACATGCCCCAGTGAC 

CTGACATGCTriTCATCTGAAGAAAGTTGTGA^ 

ACCCAGCTGGGAGACAGTGGTGTGGTGTGGTGGTGCTGGTGGTGTGGATGGGTGTGT 
2 0 CATGTACATGGATGCATAGACTTGAGGACATCATCAACTATTACTACCCAGGCTTCGT 
GTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGTGGGGGTTCT 
GATCGACCTGTTTTCTGCCTTCATGXK3TCTGCCTTTATATTTTGCCCCCT 

GCTTCCAAAGTACTTTGCTAATGGTGCCATCTCCATGGTCCCCTGTCTTTTTTTCATCTACTCTC 

GT CTG AGAGG CAGG CATTGCTGT ACTTGGAAG AAT AG C CAGGGAAG CTGCAG CT AGTGTGGG ATT ATGGGAACACTT ACCACATCT 

2 5 CCTGAAGGCCATGTACACGGGACTATTACAGACCTCTGCCAGGTGAATCACCT^ 

TTAGGCAAGACATGTGTCACCACTGACCGCACCTGGCCATTGGAAGGTTTC 
AGATGCAAACATTGCCAGGAGGTTTGATATAGGAGTGGCCATGCACCGCAG 
CCAGCTCTGCCTCTGATTTATAAGCTGCTGTTTCCAG 

TGTGCAAATGCCTAGCTTAACCCATGCTAGGTGACTGTGGCTATGTGTCCAAAGCTCCAGGCCATCTCCAGG 

3 0 GATGACCTTGTrTTTGCTGGAACCAGACCTCTAGAGCCAAGAAGCC 

TAGGAACAGAGCTGTGGTGCCTTATGTAAGTGTCAGACTGCAGTTC 

ATCTCCAACCCTGGGCCAAGGTGTCCAGACTGTrCCGTGTCTCATCATCATTAAGGAATATCT 
ATGGCTGACGGGAAGGAGCTTCCTTCAGACAGTCATAA^ 
CAACTTTCAGAACTTGACCAGCAACTGTTTGGACAT^ 
35 TTGCGTGTGCTCTGTCCTGCGTTCACTTGTGCGACGTTTGGGTTACCGTGTCAGT T I'G T T T CGGTTTTCATTTTTCAT AAGTAACG 

CATGGCAGGTGTGGCCTACAGCATCCTCATCATTAAAACATCAGTGATACGAATCCTGATCAGCT 
GTGGCTAGAGAAAGGCACAGTGTGGAAGAGGAAGCTGGGTGCACAGGGGJVAA 
ACAGAGAGAGCTGGAGAGGCTAGACAGCTCCTTTCCCACAA 
TTCTTCTTTGTTTTCCAGTCTGAAACAGGTAAACAGGGCAGATACCCTTCTAG 

4 0 TGGAACCTGGAGATTCAGCCTATCTCCCGAGCTCTGTCCTCCACCC^ 

AGATACCCy^GGCTTGGAACGCAGTTCTGTGGCTTCAGATG 
GAAGGGCAGAGTGCCCCAAACCTGGAAGAAGCATTTCCTTGGAGATGT 
TGTATGAGGTCTGTGAATCTGCTGTTTTAACACACAOVTGTAGCATGCACAAACT 
CAGATTTGTGACTTTGACTTGGTCTCCCAAGCCAGTCATGCACTCT^ 

4 5 ATCTTCCTTGGCAAGAGTCCCCAGTGTCCCCTG 

CTGGTAGAGGCTGGACCACAAAAGGAAGAGGCCAGCAGAGGACAGCT 

ACTGGGCCATTTTCCTTCCCTATTCCTGCTTATGCCTGTGGGT^ 

GTGCCACTTTTAGTAACATGCCCAAGATACTAGTCCAACAGAC 

5 0 CAGAGCCCTTAAAATATTCCAAGGTGTCCTGGTrGCT 

GTTAAGGGTCAG CTT ATTGGCX3CCTGTGGTCCCTAAGGTCTTCCCAGGTTG CTTTGGGAAAGGTGGGAACTAAG AGACCTGGACT A 
GG CAACCAGGAATCCTGTGCAGAGATCGCACTCATGTGCTGACCAA 
TGGGCTCTCTGTCCATCTTGGCTGAGG1GGCTACCAGTCCTCT 

TTGGGCGAGGCAAGCATTCCCAGGCTAAGAGTCCTGTGGTCAGGGGGCCACAGCCCAGATGTGCACT 

5 5 GGTCCTGGGCTTGGCCCTGGATGGATGGACAGATGAAGCC 

CCTTCCGCAACCCTTTCTCCTTCCTCATCTCCCTGCCTCCCTCCGTCCCTCCCTC 

TTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTC 

CTCTCTCTCTCTCTCCTTCCCCTGTTCCTTCCTATTTGACT 

CTGCCCCCGTGTTGATGTAGTCGAAGGTCTTGTTTCCCTTTAACTGGCTTCCATTTTTTTCCACA 

6 0 TTAGGTTTGTAGTTCCATGGACCTCAGGGTGACTAAAGCATAGATGGTCr^ 

TTTTAACTATGGATGGTTCAGAATCGAAAGGATTCTACCCAAACTATGTAAGCAAATCTGTCCC 
ATTAATCTTGTACCTGTAGCTGGGATGACTTTCCATGAAAGTTG^ 

AAAGCTAGTTGCCATGTGTACCCATAACCCCTGAAGGAAGAAGAAAAGGAGAGATAGGCTTCTCTTTAGAAAT 
CAATCCCAAATGGGCAGCAGGAACCACCAGAGAGACCCTGTCTTAAGTAG 

6 5 ACACATGAACACATACACACATGCACATGCTGAGTTACAAAGGATAAACACACT 

TACTGTGAAACCTTTTTCTTCCTGACCCCCAAATCTCCATCCCGAGACAG 
CTCTGTAGACCAGGCTGGCCTCAAATTCAGAGATCCACCTGCCTCTGCTTCCTGAGTC 
GCCAGACTGCCATCTGCCATTACTATTATATATATATATATATATATANNNNNNNNN^ 
NNNNNNNNNNNNNNNKNNNNNN^^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

GGCTTTCTGAAGCTCATCTGCACATGTTCTGATGTCTACAAGACGGGAAT^ 
CCTAGGAGCTCACACAAAGCGAGCCCTGATCTCAGGCCGGTGTTTGCGGCT 
JVGGCCTrTGG^TGGATCCTTTGTCAGTCAGCATTCCCTCCCT 
GAAGCTGTGGGGCACTCACACCCCGGTATGGATCTTGGTGAAGGTTGTC 

7 5 GCACGCTGGGTTCAGACTGTGGTAGCCCAGTGGTCTTTGTGTGTCTGTGGCTC 
CGGCTTTAACCCTAGAGGTCAGTAC^CA^

ATTGCCATTCCCTGGCCCCACCCCCACCCCCCAGAGGTAGATTAGAGTCTGGAAGGCAGGGGA 
AGTCCTTAGCTGTCATCTGCCCAGCCTGGCTCCGAGCT 

TGCCTGGGACCCTGTGGT CGTG AAAAGGTGAG AGGCAGAACTCAGAACAAACAGG AAG CAAAG (HTTGT CTCGAAAGACT CTGTTA 
5 CCCATGGGTGTTCTTTGAGTATCACACAGCTGAGATCGGTGCTTGGT^ 

GGATTAGGAGCTGGGCCTGGAGAGACTTGTGGCCCCTGAGCCXK3CTGGCT 

TCTGCTTGGAGGAGAGAGGAAACTGGCAACTGCCAG CGACAGGAACCAGGGAACGTGAGCCCACTGTGGC CTCAGGTTTGGGATCT 
GGTTTGGAGTTGGTCACTGTGCTGTGACCTGTTCGCCACAG 

1 0 ATTATCAGGAGAAGCTGCCCGTCATTCCCACGCAGCCTGTCCTCCCATCCCTC 

CCTTCAGACCCTACAGCAGGAGGCCACCCTGCTTTGTGACCC^AGGACCCT 

CCCACATGGGCACACACACATAGGTAGACAGACCCATGC^CATGTGAGTGCATAC^ 

TCACACACACACATGTACATGTGCGCACACACAGACATAC^ 

ACACACAGACATACACATGCACACACACTCACACACATAGAGACAC^ 
1 5 ACCATTTTCTTGAGTGAATTCAGGAAAAAAAATCTAAATGTGTTTAGAA 

GTGGGTACTTGATATGAAGCTTAATGAGAGAAAAAGAGGGCTGGCT 

GGCCCAACCACAGGAGGCTCACCTCCAGTCTGCACATATGTTITATTCTGCGATGTG'i 

GTCATCAT AGTG CACACGTGGAAG TCAGAGG ACAACTTGCAAGAATGAAAT CTCTT CT AGCGTGTGGGGGGT CCTGGGGA T 
CTTCAGACGGGGCAGCAAGTGTCCATACCTCTGGCTCCATCrCTCTGGTGCTTAGCGA 
2 0 TTATTTTAAGTGTGTGTGTGTGTATGGCCATGTGCCTATGAGTGCTAGCGAATAGCTATAAAGCCT 
GATCTGGAGTCACTGATAGTTAGCATTCTGACAGGATGCTGGGACTCAAATCCAG 
CACCGAACCATCTTTCAAGCCCCTGTAACGTGTCTTAGGATGTTCGTTGTATGTAGT^ 

GAG ATTG CTCTT CAT AGATGG CAGTTTCAAAACAGGTTGGGAGAAGGTG CTAGACACATGATATCTGACAGCGCTGGTGGCAATGA 
CTCTGCTTTGGTGTGGCCAGCCrCrrGCn'GGGGTGGAGGCr 

2 5 ACGGGACCGAAATCAATCAGATGATTAGGGGCCGGATGCTTTCAGAGGCT^ 

GTTCTTTACAGAGCGTGACTCCTCTCTGTTGGAGTCAGGTG 

GGCCTGGGGAGTGAATAGAGTATACAAGGCTGACGTTATGACACGTTGGTGCCGGGGTG 

ACGAGGCTAACACCTCAGGAACCCCAAACTGCTGACCCTGTTGTCAGGACAGGGTAAAGATACA 

CATTAAATCAATGGOTTCTATAATAGCAATGGACTGGCrAATAATCAAAGGCCAGCCATAAA 

3 0 TCCTGTTATTAAACAAAGTTCTTACCAATGAAAGATC 

TTTGCC^GACACTCCCTTTCTCTATAAAATGTGTGCACATCCACAAGT^ 
GTGTGTAGGTAACTTCAGATTACTTCATTTATACAAATCAAAGGG 
GAGTGTAACTCTTGACAAGATGGAACTGAATTGAAGATGTGTTTGAAGTCAAAACCC^ 
AGGCAACCACAGCCCCTGGGTCTGATGTCAGAGACCCTGTGGCAG 

3 5 TCAGCCAGGTTTCCCCCTCTAATCTAGTCCCCATCTCTGATCTAAAGTGAAGTCCTATGGGTT^ 

GAAATTTTTCAATCCXKK3TTGACAATCTTAAATGTGTAAACCACCCTTClTrrr 
CCTATCTGCCTGTCTGTCTGTGCCACTCACTTTTTAAAGATTTTTrCCT 

GCTCC CCCCAGCTCTTGTCCACAGCCAGTCTTTCTGTGCAG ATATAAG AAACTTGTGTG CATATATAAGAAACACTGTG AAGGTGTG 
GGATGTTTTACAGTGCCTCCTCCCACAACACCTGGGTAC^ 

4 0 CACACCCTCTCCTTGAAGTAGGAGTCTGCTCAGCTCCCATGGCGTGCCTTCCCTGGGGATC 

CCTCTCrcATGCCCGCCTTCCGCT»GCAAGCTGCTGGAATCAGGGTATATGA<^ 
TGAGTTTCTAAGTGCACGGCAGCCCTTCCAGAAAGCCTTACCCCTTCTG 

GG CCAAG CCTGTGCCGTCAGCCTGCCT CTGTGCTGGGCAG ATG C AGG CACTTACTATTGTXjTGTTTGTG AGTCAGATTCCTAGAGG 
GATGACCCAGCTTCTACCCGAATGGTAACAAACACAGGTGCATCTTC 

4 5 GGGAGGAGTTCTCTTGAGTCTGCTGTAGCTCGGCACT 

GGGCTTTCTTTAGAGAAGGGTGATCCTTCCreTTCCTGGTGAA 

GCTATGTCTGCACCACTTCCATGGCTCTGGGAAGTAGATTAGATTAGCTTTTAAAGACT 

TGGCAAGCT ATG ACCACACACG ATTG C ACAGT AAC CATGG C CTT ATG AAGACTTT ACTC CCCT AC C AAG ATGCTG CTG G ATGG T AT 
CGTCTGACAACTGTGAGGGCTGATCCCCGGATGGTGCCCCAGGCACAGCATGGGCAACTCTGTCTTGGTGTAC 

5 0 TCCrCTCTTCCTTTGCAGCCCATCCTTGCCTGCCCTTGACT 

TG CAG CC CAAGTCTCACCACAGGGCTCACT ATG AG ACGG AAGG CAG C CGGGGGG CTG TG AAGG CTT CAGCTGG AGG AC AC CCC ATT 
GTGCAGGTATGG CATCTG CTCTG AC CAG C CAAT AG CTCAT ACCT AAAACAGTTTTGGATTCACAGCT AAGG ACTTT ACTCAGACGT 
AAAACACAGTGACCTCAGAGCCTGCTATGTT'CCTTGTCrTCAATAATGTCCCAGTGA 
CAAAGTGTGTTCACCGTACCTCACTGAGGATGCATCGTCTCGCAGGCACTGCAGAGGCAGG^ 

5 5 AGGGAATGGCTGAGCAAATGCCCAAGTCAAAGGATATTTTGTGTGAATGGGTATCACT^ 

ACAGTATGCAGGACCACTGTGAATCAAGGGCTGTCTGTGGCTACCATATAATG 
TGCCCAGAGCAGACTTACTTGTGGAGCAGGCACCATTACACCAACTCTTCTGCAACT 
GGATGTGTGCTCTCCACCCACCCCACACCACACACGTGGGATTGGTGGAAGCACCAGGTCT 
CTTCTCAGTGTGCCTGCAGGTGTTAGGAGGTAGGGTCTGAGGGAGCGGGTTATGAGATGGAGTT 

6 0 GGGCACAGTCACACTTGTGGCATTTTGGGGAAGGAAAATAGAAATCCAACTA^ 

CACGAGACTGGTTGTTTCAGCTGGTCTGCAGTAGAAGTAGGTTCTAACAGATGTGC 

GTGAATCCTCCTTCTTCCAATCTCCTTATGTAGGCCTCCAA 

CAGGCTGACCTTGAACrCAGAGATCTCCriTGTCTCAGTCTCC^ 

CATGGCCACT ATG CCTCAAGATCTCCGTCTTAAGATCCAGGTT AGAATCCTGCGTG CCCTCCIATT^ 

6 5 GATGTAGTCAAGTTGACAGCC^GGAATAGCCATGACACTAAGTTTCCCTGAGCGTT^ 

TGGTACGCTGCTCACCAATGGTGGCCAGGAGGCACACTGGAAAGTCGGAGGCC 
CACCTGGACATCAGTGTGACATCCCAATGGGCGGGGTGAGTTTGTCTTGGCT^ 

GATCCCAGACATTACTGTTTTAAGCGCCGCCGTGAAGTGCTCACTCTCCGGCTGACTCTGGTCACATG 
TACCCCAGTATCCCAACATCATAAACCCGGGTGCCCAACCCTGTTATTTCAGG 

7 0 TACCCTGTACTTTTATTTTATTCCATTTAGGTAAATAAAGAGCTGGCAAGG^ 

CGCAG CAAT CAG CCACATGGAATGAGCCGGGCGAGGTG CAGAAACG AAG C^ 
TTTGTGGCCTGTGTGCTCGATGCCAATTAATACCTACTCTGTO 
CAGCTGG CTTGACATTTTAAGTTTCTCCCCTTCGCAC^TGGACGAGGT 
TATTTACCTCTCCTlTGCCCGCTGTGGTTTTCACrGAATGAAAGTGCATT 
7 5 TAAGTCTGTTAAAGAAAGGCAGGGGAGTAAATGGAGA<^TA* 



CTGGCrTGAGAGCGGGGGGTGGGGTGGGGGTGTGTGT^ 

GTGCCATCTTGGGCITGGTGGTTGGTGGCTGTAAGAATCCCAGCTC 

TGTTTGGGGGAATCTGT G TCTCTCAGCCACAGATCCCQVCT G 

CAGGTCCAGGTGTCTTGAGCTCAGGGCAACAGTGTATATCTCAGTGCAGCT 

CAAAATCTAAAACAGTGTTTAAGAAGCACCCTCTAAATTATTTA 

G C CACTG C CACAGAACAGCCACCGCTTCTACAACACTGTTGAAGGG CAAAGG AAAACACC CCCTAG CT AAAAAAACACCCAGTT CG 
GTGGCGTTTCTCATTGTGGGACATTAGCAACTCAAGGGGTTTCCTGGGAT 
AGGAAAACAGCTAAAACGAGGGAATTTTAGGGGATTTGTTTTCTCT^ 
CACGTAATTATGAAACACACAAAAAATAGAAAACTCTCT^ 



GCATGCCCGGTGTGCGGACACTCCACAAGCCAGTTTGTGCCCAGTAGTCTTTGTAG 
CCCGAGAGGTCTGGACTCACCGTTCAAAATGGCCACAGGCTCTCACTAGCCATTCTATGGTCA 
GTCTGGCCTTACAGTCTGCTAGCATTTGAAAAGCCCTATCAGAAGGCATAGCCATGGGTTATCCTG^ 
AGGGCTTATCTATAGTTTTCCTCG3AGTTCCCAG 

TGGAGATATCCTTGGGGACACGGTGGCTCCATTCAGTTAACCACTGTCriTCT 
TAGAGCATGCTGTGOTCTTGCTTCTGATGTCACATTGTAATGTGAGAAATC 
TAGAGCAGATGTCACAAATACTCTGACTACAACAGGGGCATTTGGTCAGGGCTTCT 
TTTCATCCTTGTCCMCAAGGTCCCTCGTGACCTTGATCra 

CTGACTGAGGATCCAAATTCTCCAGACGCTTGGCAGGGGATGTGATATCTACCTTGTCCT 
GCCTATAGAGGCCCCTACTGTAGCGCATGCAATGCTTTGTGGTTTGAGGA^ 
TGTTTACCAATCACCAAAATGACCTGCCTGATAGCATCTGCGCATCACACACA 
GAAATATGAACACTCAGAAGCTGGTCAGAGCCTAACATTTAGGT^ 

CTACCGCCCGGCCGTGAAGGACTCAGATCAAAGAGACACAGATAGTGTCCCAGACAGGTGACCCTGGACTCTG^ 

GACATG CTCTC AGGGCACAG ACAG CAGGTAC CTG AGGATCTGTTGGAACCAGGG CAG AGT CATAG AGGAACCCATGACAGG CCCTG 

GCTTGTGCTCCTGCACCGGGGCCAGGCAGTGGCCAGTAGGGACTCATGACATTAGGGAC 

ACCCCATATAGATTCATGGCAGGGACTTTTTCTATCGTACAGTCCTITGACAGCTG 

CTCAGGTTGTTCAAATTGTCTTTGAG CTAAGCCCTTGTTATAATTACTTTTCTTI'CTTGGTAGTGTCTTTTCCC CCAGGTCCT ATG 

AGGAGATTTAGCATGTCAAAAATACTGTGAGAAGTCAGTTCCGAGGGGTTTCTGACCTGTGGCCAGAAGG 

TTAGTGGCCTGGAATGCTCTGTCCCGTGCGAGCCTGCATCACCCCGGATTCCAGGTCTCATCT 

GGTAAGGCATGCATACAGCAGGGGCTAACCTGCAGGCCTGACCCCAAAGCAGGAGCATG CTGCCCAGAGTCAGCCAACCTG CATG A 
GGTCTGGATTAATTTTTTATGG ATCAGCCTGGACACTGCATGGAGTCCTCTCTCCTTTCAGGACT CG AGGCAAAC CTCCTTTC 
CCTGCCTGTCATGTACAGGGTAACTCrCCTGTCCACTTGGATGCTGCCCCT 

GTT CAG AAAACGT AAGGGT CTCAGGGAATT C CATG AGGGAAG CTG CAGT CT C C CAAACACAC CACAG CTTTGCTT ACTGT C CATG G 
CAG AGCTAAGG CT CG ACAG ACAATCGGTAGTAAG AGACGAG CCGTCGTTCCAGGAAGAC^TCTGGGGTCAGCTCTCTTAGCGGTGG 
GAGAGTGAAGCAGACAGAGATGGCCGTTCGCTTACACA.GTCTGCAGCAGCATCCCTGAC 

T CAAGGCTGGGGTTTCATGTGGCTGGGCTCTCTGTGAGCTCATTT CAGAGCCTTCCCACTCAGCAGAGTGGGAAGCAGGGAGGGTA 
TTATGTGGACTGTGGGAACAAAGACAAGGCAGGGCTACAAGCAGAAG^ 
CCCCCCCCCCCAGACCTGACTCAGAAGCTATGGCTTCTTTGTTCCAGGCTAGGAAGA^ 
ACCTACTGGTGGTTTTCTAGTCATAGGCCCACAGAGCTCTGACTTAGTTAGGGT^ 

T ACCCTAC CCTCCTTGGGGACAGGTGAGATGAGGGCTGAGTATTGGAGACAAAA C 
GACCCATACGTATCACIAAGTGTCCCrGGCTACCCATGGGCAGCTGTTCAAATGTA^ 



4 5 CTACTGTTGTATACCTTGCCTGGCCTAAGGGGCTTCCTTTCATCTC 
AGTTTGCCCTGTAGCTGCAGGATGGCTGCAGGCCTTC 



GTGGCATCAGTACAACAGGTAGCCTTTGTGAGCTTCAAGTGTTTTTAGTCCTT 
TTTGGGCTCTAGTCAGAC!AGTAAGGGTCAGGTTCTCTTTCATAC^ 
5 0 CACGATGTTAACAGTAAAAACCCACCAGAACGCTGACAGCCGCAGGCCGGACCCZTGCAAGG 
CAGCAGCACTATTCCCTGGGCGTTCCCCTACATTTCTGGGCT^ 
CTGCTCCTGAGTGGCCTCTGCCCTGTAACACTAGCCTAGAGTC 

TACAGGTGGCGGCTTCCTGAGTGGCCTAGTCTATGAGGTACTGAGGGAGGCGTCTGTGTTT^ 
TTTACAGAAGGAGTCTGCTGGCGTTGGTGCCITITTCCTTGGA 

5 5 CAGCCTCAGGGATGCTAAATGTCAGAGATGATCCX3GGTTGACATGCGTG^ 

CCTGAGAGCACTGAGACCCCTGGGGCCTTAAAAGCTCTCTTTGCCATTGCCAGTAGG 

TCATCTTGTGCAG^ 

AAGGGTCCCATTCATTTTTTCAAGTAAGGGGTCGAGGAGGTTCCAGCCCTCTCGAG 
60 GGGAGAAGAGCAGAGGTGATGCTGGGCTGGAGACATATAGAGCCTCCCTGAGGAGCACAGAGCAGGCCTGTC 
GCCGTGGCTGTGAGCAGCGATCCTTAGATGTCCACTCTGACCTTGTGGCACCACAGGTCATAGAACTCACAT 
AGGCTC AGGG ATAAACTGCCIAGG AAACTGGTTTCrrCCTTAGCCTGG CATCTCCC CCTTCCCCACTG CCT ATCTGCCTCAG CCTTCC 
TCAGTCTAGGAGGAGAAGCACAGACITTGTGAGTGAGCTGTTTTCTGG^ 

AGTGAGTCTGGACTCTGACCTCCGGAATGGCACAAGATGGCCCTCGGTGCATCTGCACATGCGCAGAGCCT 

6 5 TTCCCAGATAAAGACAGATCTAGGGTTCTCAACTCTAATAGGTCCGTTCAATTGGCAC^AGAGCCACGGAGCT 

AGAAAGCCACAGAGCGTGGCTTCCTCCTGGAAAACCTGCACCAGGACCTTrTATATGAGAAA 
AGGCTCCAGGAGGTGGGGGTGGCTTGCAAG^CCTTTGTAAGGAAGGAAGCGCATGTT^ 
GCTTCAGGAACATCAGACTTTCTCCACAGGAGACTGTTCGCTGTGTTGGGGATGAAGCTT 
GCTACCACTTAAGCCCCCAAGAGATAATTTTGAGGTAAATTAAAC 

7 0 AAGGTGGAGTGACTTTGCCAGTGGAAATCAAGTCCCCATGAAGGTGCTCTC^ 

TCTCTGGCATTTATGCCCTGGTAATATTTACATGGATACAACCTATTTTGTTT^ 

AAGGTAG AGGAGGTTTGT ACATGAAAGACCCAAAGTTCCTACGTGCAGTCT CTCACAGCTC CCCTGTGCTGCCCTCGGCTTC AGAC 
TTCTGGAAGGCCTCGTGTGTACTCGTGTATCTTAGCTCTTGGGGTCTGAGGGACGTCACTAGACCTAG 
ACCTCAGTTTGATC^.TC^TACAJVTAGTTGTCTTC^TAC^AA 
7 5 TCATGACACATATGTGCACACACAAAGTGCTCACATGCT 
CACAGACTCTAAAATACATGAGTGCTCATACAGTACAAG

ACACACCATACACACATACACACACTCACACACACTCACACACA 

CCATACACACACACACTCACACACACCATACACAC^CACAC^CACACAC^ 

CACACACACACACACGCCCATGGATG TTCATGAG ATAAACCTGT ACACT T CAT ATTTAT ACCAATT CAT AAGT 

5 GCATACATGTATAAACACATCTATAGTTATACATAAATGGTTGTACACATGTCAC^ 

CCTGTGGCACCCTGAAAGCCAAGGATTTTTTTTATT^ 

TGAAGGGCAGTGATAGTAACTCCTTCCTTCTGTCCTAGGGAGCTATC 

TGGGGCCTTATGTCCTGCZAGCCTGTACTCATTTGCrGCAACACAGCCGCT 

TGGTTATCCAGAACTGGACAAGCCCTGTCTGTTTGC 
1 0 GGTTTGGTTTTGCAGTATATGTTAGGCAAGTTCCCTACCACTGACCCACGCT 

TCAGTCTTTTCTTTGAGTTGTCCAAGCTGACCTTGAGCTCACTCT 

GCCTACrcCGGGCTGCXSATTATAGGCCTGCACCACACAGCCTAGTTCTC^ 

GGGTCCTGCCCTCGATGGTCATTTCTTTCCTACCAGACAGGCACACAG 

GTCTTCATGCTGGTTGTTCTGAGGAGGTCACCCTTACTAACAGCAGCrc 
1 5 AGAGCCCCAGTTCACCAGAATGACAGTTGGCTCCTACCCTCTCAGAT^ 

XACAGTGAGGCCTCCAGGGCACAGTGGTGAGTTCCCAAGATGCCCCT 

GAGGGAAATAGAACGTCCAATAACAGAAGACTTCCCGTGTCTGGGAAGAGATGTCTCCAGG 

GGGCTGAAAACAATCTGAGGTAATGAGAGTCCAACACAAACTCTTA^ 

AACCOICCCTGGGTGCCATCCCCAGCCAGGAGGAGGGAAAGTGAGACCAATGTGTGAGCC 
2 0 CACGGGATGCTAATGTTACTCGGGGTCTTAAAAGCAAATTTCCCTTT 

GGATGTCACGTGACAATGTCCTTTGCTTAGTAACCAAGTCAGAATGACT 

CTCTTAATTCAGACCTTCCTTCCACTGCGTATGAAGGTGTGCGTGTTGGCCCAACTCC^ 

TCTGCTGGCCTGTCATCTCCCTGTGTAAGTGCTGAGGTG^ 

ACGCTTAC^CCTTGCAAGTGGAATAATGTGTTTTGTCTCGTT^ 

2 5 ATGTTGTTCTGACAGACACTGTGAGAAGCCAGAGGCAGAAGGTCT 

AAGTGAACTTCGTGCCITAAATAATAGATATTTACCTCCCIAGGGT 

T AATGGAT AAAATT AT AC CGT AT ACAC CGCCTTCCATG CCT AAG AAG C CTG CNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNCAT AATTTGTCT CTGAATTTCT TT AC ATTTCATATTG T AAAACT ATT C CAATCTT A 

3 0 TATAGACTAGTGGTTTCrGAGTAGATTCATCACACTGCAAGAGGCTCCCAGGTGTTG^ 

TAAACCCTGTITGCAGTAACrTGTGTGTTCACACAGCTTTGTAGT CC CTGCAGATTAAGACAGTGTTAG ATGTGATC ACTGGCCAG 

AGAGTAGCAGAGGTGGCTTTCTGCGCTGATGGCCTCAGATGGGGGATTGCAATATC 

GGGAGTTTGAGGGGAAATGCCCTGTGCTTTCTGGCTCCACCCACCAGCCTTA 

CCTGTGGTCAG CAGG CACCCTTT ACACTCACACGTGTG CCCTTGGCACAGAGCACACATAAAAGCTGCACGGGTG AATCTGAAG CG 

3 5 ATCTAGCTCATAGATGCTGTGACTATGGTACAAACAGGGAGCAGGTG CCTGG CCCAAGCCCAGAGAGAAGTCGTTCCATGCTCACT 

TAGTATACAAAACACACGGGGTGCGTGTCTGTGAGTGCGTGTGTGTAGTCACGTACAACAACACT 
GACTGTCTCTCTCTCTCACACAGGGATCCTCCTTGATACCCGTACGTGCT 
CCCTACTCATCCACGGCTCCTCACAGGACACAAACATTAATT^ 
GGGAGGOUlAGACCTCCTGGAACCACCrACTGATGTCAGGGAGTGGGTTGGCTG 

4 0 ATCTTTGCAACACAGATTCTCGTGCGTACIAGTTGTGACACCT 

CCTTCTATGTCCAGAAGTGAACGGGGGTAAGGCTCATCTCCAACCCTTAACTTCCT 

TATTTCCTGGGAATGCAAATTCAGAGATAO^TCCCTTTAAGGGAATCAGATG 

AGAGAGAGATGTTTCCTGTGGATTGAGCTGTGCTGTGGGTTCTGGATAGGCGGGCC 

4 5 GTCTAAACTTCGTGTCTCTACTTGAGGACTCTCTTGGTCTGGGA^ 

GGGCTTTGAATTGGGAAAGGAGAAGGTTTTCGCTGAAGAAAAATGATGGGATCC^ 
CTTCTCAAAGCTCTCAGTGTCTGTCACCTTCTGCCAAGTC 
CAACCTCCCCGTGGAAGCGGTCTGCTTTCATCCCAGCCATCAGAGGTC 
CAC^CCCGGGGCCTCTGCCCCATTAACCTCTGACTAGTGGGGAACACATTTTT 

5 0 GCTGTACCTGTTTGAAGTACAGTCCTGGTCCCCAGCCCTT^ 

CAC ATG CTGGTG^ ATCACAGG AC AT AATG AAAGC CGTG AAGTGCC CATG C CTCT CCT AATG 
GGCGGTGGCTGAGACCTTTGCACGAGGGACATGTGAGAGGCCCCGCTGCCT 
CCAGCAGTAGCTAATTGCACTGTCTGATTTTCTGGTCCCCACTTCTAGGTTTCATTGCCT 
ACAATCCCCTACCATCCCGAAAGGCTGGAATTCCATCGGAGAGACTAGT^ 

5 5 CTTTCCGGOTACCTCCCTTCCCCCCCTAAGGATCTGTGGGGCTATTTCTTGCG^ 

ACCTlC"ri'TGCTGCAGGACrCTrGAAGACTGCTTGGTGL w riTCriATGGTGCC 

AGCAATCCCGCAACTCATCACAGGGCTCTGAGTTCCATGTTCCAGCT 

TGTGGTCGGCATGAGGGK3CTGGGGGCTTATCCATTGGCAAAGTGG 

TCCTC^CAGCCACCTGTGTATCCTGGCATCCTACAGTGTATCTCATCCAACCCCGGCTTCCCCCA^ 

6 0 CATTG CAAGCATGGTG CATGAGGTGGG CGCATCTCAGG ATTCT AAATAGAATGATGGGGTGGGGGTAGGGCTTCT AGGAGTGAGGC 

AGAGGAGGTGGGAAGGGCGGTAGCTACCAGCATGGGGCAGTTTCACTGCGTCTTATCCTGTGGGCTATATA 
ACTCTGTGGTAGGGTAAGCGGTGCAACTtKSCATTGATAGCA 
GCAGTCATGAAAGGATGGCCTAGACACAGTTGTGTGTGTGTGTGTGTGTTGCT^ 
ATG AGTGAG CAACTCT CACTCTGCTAACTTCTGCCATAACTG AATGTAAG 

6 5 ATGTGGGGAAGCCAGAAAAGCCTTACCTCTCCAAATCTGGAGGTGAGGGCGCCATGAGC^ 

CTCACCTGGCTTGGAAGGACATTCCTTTCTCAGAGGCCAGTC^GAAAGCC 
TCAGGCTTGCTAGAGTTTTGTGGTCCTTTCTCAGCA^ 

TTGTAGCATGCAAATGGTGTGAAAGCTGCCTTTGTGTGTGTGCACATGTGTAATATTGTGTGTGTGC^ 
TGTGTGACTGTATACTTTTTGTATGTGATGTGTGTACAATATTTTAGGGGAGTGTC 

7 0 CATTGCCTCTAGGTGTGCCrrGTGAACCCATGTGCACACrACCTGTCT 

TAGGATAATGAGGCTTTATTTATTTATTTATTCATTCCGATAGTGTCTCATTGAGTAGTC 
n GC -CTGGCCTTGAGCTA^AGATCTGCCTGCTTCT 
TTATACTTATAATTTCAAGAAGTACTAATATTATATC^ 
7 5 CTCCTTTTGAGCCAGTCTGTTGACAAACAGTTGGTTCCCAGGATTTTATC 
TTGCCAGCTACACGGTTACTTGGAGAATGAACCTCT

ACGCCTTCTACCAGGTCCACCGGATCACGGGGAAGACTGTCTCCACCACCAGCCACGAGATCATCCT 
GAGATCCCGTTGCTTCCAGAAAATAACATGCGAGra 

CCO^CACCCAGTGCTTAGCCCCAGCTCCTAGGCCCCAGGCCCTGTGGCTCT 
5 TGACAGATTTGCCTCTTTCTGGGTTCTCAGGTGCTGTGTGTTATGGC^ 
CCAGGGCCCAAATTAAGTTCCTAAACACAATGCC^^ 

GAAAAGATAGGGCGGGCAACACCTCTGGA' lTtj ' r i J TCACTCAGAGCTGACTATGCACGCGGCTGACATGGCCAGCATGCTGCGCAA 

AGCCCCTGTGGCACACGATTAACTTAGAAAGGGATTTATC 

GAGTGCCCTGTGTTTTGAGTC^AGAAGCTGAGAAAACAC^ 
1 0 GAGCACAGACA^TAGCACACATAAGCAGGTCTCCAGGCAGAGGA 

ATTAGACTGGAAGGCCTGATGTC^CTCTGCGGTACTCTGTCTCTGAGCT 

AGGGAGCCTGGGCTATGCACGTAAAGACTGCAGTACATGATTGAGTTTG^ 

GAGATGAGGTTGGTCCAGGGGGCTGTGCCAAGCTGCACCTTGTGTGTCCTT^ 

AATGACCAAAGGGAAAGCATTCTGCXSTGTCrCAGCTTCTCACCAGT 
1 5 GGTTTTCTTCCTCAGCTATATAACCCAR.C1TTCCTA 

TTTTGTAATGTATTGAACCTTTTACCAGACTATAGATTTGAATGAAATTATGAAGT 

TTTACAAATCATAAAAGGTATTAAAGTAAATAAGT 

AGGAGGCACAGCCTTGAAGGCTCTTCCTCTCrGAGCTGCTCT 

CTACAGCCTGCAGGGCTACTCAGATGTGTGGGTGCTGTAATCT^ 
2 0 GCAGGAAGGCTGAGTCTTTAGTTCTGGTCCAGCAGACCA 

TGTGCTGGGATCCTGAAGCTCAGAAACTCTGATATT^ 

GCTGGTCTTCCGAGTTCACATCCCACAGCCCAATGGCCGGACGCTGTCTCT 

AGCAGACCATGCTGTAACTGTTGCACGAACTGTCCCCTCGAAGGTCTGTTTCAGTA 

AGACCTGACTTCCACGTCTGGCTGCGTTACGCATTCCCACAC^ 

2 5 GTGGCCTTGGCAACTATATTAGTAAAACGCAAAGCCTGTCATGAAGCAGAGTTATTCTGACT 

CTTTG AGGG AAATTGAGAAGAGGAAGTTTGGATTTGCAGCCACCAGACTGGACA CT CTTT ATC CTTCTC 

TTCCGATTCTCTTTTCCCCTCCTCTT'CT 

TCCCATATTCTCCTTCTCTTTTATTATCTATCCCTCCTCCCCAGTITGTGTG 
ATGTATGTGTGTGCATGCGTTCAGGTGTGTATGTGTGCA.CACATGTGTGCA.TGCATGTG 

3 0 CACGGCTACAGATGGAACTCATCCCTATGCATGTTAAACAAAT 

TCACCACGACTGTTCATCTTCCCTGATGGTGCCAAGTTATGCCITTAATGTGTGAGATGGCAA 

GTCCTAGAAAGAAGCTGGTATGGATGAAGCTACCACCTTCCTCTGTCCTGCTTAGCTGGTTCCTTG 

GCCATCCTCTCAGCCCCCTGCACCCTGACGCCCTCCCCAATCTGGGGACTGCATTCATGTGGGGTCA^ 

AG ACAGTCTTGGCTAGCCCTTGCTCCTC ATAT CCCAGCACTGTCAAACT ATGTCC CTTCTCCAG CT ATTGTGG CTTCATCTCCT AA 

3 5 GAGACTCTCACCATGGGGAGATACACTCACTGGGGCTTTC^TTCCATG 

CCTGGGATGTGCAGCAGGTCCCCTTGCATCCACTAACTGATCCTTGCTTCCTCT 
CACACACACACACACACACCACACACACACAA 

TTCTGAGGCTGAATCCTCTCCCCTGCCCTGGTTTGTCTGGCTGGAAG^ 

TCACCCCTGCTCCTAGCCTCTGTAGTCCCCCTTGCAGGAC CACGCT CATG AAGTTG CTGATACCCTACTGGGTCTCTCCCAGCTGC 

4 0 CTGTCAAGCTTTCTGCTTTGCCTCTGCACCCAGGCTCAAT 

CCCTTCCAGGCCAGGAGCCAGCATTAGAAGTCAGAGTTGCCTATTGCTGGCA 
ATCATGTCCTGATGGGTTTGCCTCAATCCTGACACCT 

CCCAGAGAATCCGAAGAGTTATGTCAAATGGCAGCACCCGGGGCCCAGCACCTGAGCTGC^ 
AGGTTTCACTCACTCTTGCCTCCTCTGAOU^ 

4 5 TACCCAGTCATaSGCGGGAAGAAGATGGTGCTGTCTGGCCATAACTTTCTGC^ 

AGGTATGTTCTTCAGGCTGGCCCGGATGAGGTCCTGTGCTCAGTGCTGGTGGG 

CCCCTGCATGAGTGCGTTTGGTCATAGCATGGGCAATTGTCTGATAGAGGTAGCCG 

ATCTCATCCCCTITGAGTTTTTGTATTGCTGAAACT 

GTGTTAGGAAAGCCTATAGGCTGGTGTTGGAAAACrCCTACTTGCTAGAGTCATCTGCT 

5 0 CCTGTTTTGAGCTCTGCCCAGCCCTTCATTCTGGCTTTTCCTCTTC 

AGCCAGTCCCTTAAACAAGTGCCAGCTAGTACCACAGTGGATATAAGTCTGGACCTG 
CACTGGGACCTTCAGTGTCACAGCCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATG 
TCTGAGTCTGTGACTGTGTGTTTGTATTAATGTGTATGAGTGTGTGTGAGTATGTG 
TATGTGAGTGTGTATGTGTGTGAATGTGCATGAATGTATGAATGTGTGTGTGTC 

TGAGAAGACAGAGCATGGCACTGCCATGCTCAACTGTAGCTTCCTC 

ATG CCAGGGGTGAG CTTAGAAGGGCCGGTGTCCTGAAG AGCTGTACAT TGT CTGCATGCCATT 

GGGGGCCAGGAGTGTGGCCCCAGGTCACTTCAGTTAGACATTGGTCCTGGCTGCTC^ 

TACTGACCAGAGAGGCTTTTGGGCTCCAGAACACAGGGAAGGCTGAGCAACAG 

6 0 CCCGGAAGATGCCTGCCCTCAGCTGCAGTAACTCCTCT^ 

ATGGGGATTTTrTTGGGGGGTGCCATACCCTATCCACAAGGGTATTGGTC 
CCrGCTTTTCGTTCTTACCAGCTGGCCTTTTCTCG<3AGGTGGTCA 

AGTGG CCT CC AT AG TGGGGTCTTGATTTTGT AC C ATT CAC AGG AAACAT AAG CAC TTCAAT CAACAC AGGCTTTG ACATTT AACT C 
GGCCTGCCTCCGTAATC^TCCATGTAATTGGAAAATATGTTGTAAA 

6 5 AGTGCCCACACCCTTCrCCGGGGAGGTAAATTCTGAAGTCATTTGACCTTCTG 

CACTAGACATAACTAAGGCTCCGCACTCTGCACrGTAAGATCCCTGTACTCCAATC 
CCCTATCTGTTTATGGTTTGTTCAAGATGGGAAGACCTACAGAAGTGCC^ 

TAGCTCGCACCTGCCGCAGCTGGGTCTGAGCCTGGGAACAGTGGCAAAATGCATCTTCCGGACTACG 
GACCACAAGCCCCTGCCTTACTCTGCTGGAAGCCCCATCTTGCCTTCAT^ 

7 0 GCAGAATGGGTCCCTGACCTACGTTCACAGGTTACAAGTCATGTCTATACGTAAGGCCrc 

ACTGAGCGCTTrGAAATGTGAGAACTCCArcGTTCTGTGTCTAGACAACAATGT^ 
TAGATTATTTGCATATGTGCTCAGGTGTGTGCftTGCAAATGATATT^ 

TTG C CTTTGGTTTTCT AAG C AAAT AT CGT C C CTG AC CT AC CACACTG TGTGAGT CGGG ACGAG CTTTTTCT AGTTG TTTCTTT CTG 
G CAAG CCAG AAAG CTC AAT CTGAA.TTGCAAG C AC ^ AC CGG ACCCTCAGTTTCTGAT AG CGTG CCTGG T C CAAGTT CCATTTTT AAG 
7 5 CTGTTGGTTACAGAAGCCAAACATTTGGATGAGAGGA 
TGGAAGGGGCCATCCCTATGGCTCTGACTGTTGTGGACA

GTATCTTCCCTGTGTAATGCAGAATATCTGAGTCACAGGTACACTGTGGACC^ 

CTCAAAX3GGTCCTCTGCATCTGTTGTCAAAGAGTTTCCCTGC 

GGTAGGCACGGGTAGCAGAGCCAGAAGTTAGCTCCTACACTCTACACACTG 
5 ACrGCAATGGCTTGCAGAGCTAGCTCCTCCCTGGGTCATCAACTAATGATCT 

AAGC^GCATGGCCCTGGCnTATGCATAATGGTGAGGTTAGGACTTTTCT 

GGACTGGCTGGAATGAGACTGTCTTTCAGGGGATGAAGGTCAAA^ 

AGACATCCTTGGAGAATATTTTTTCCAGGAAGCAGAGGTAGTCAGTGAGG 

TCAGATGCTCTGGCGGCGTTTTGGGCTATCCACTCCAAGTGCCGC^ 
1 0 ACCCCCACTCCCCACCTCAAGTATGGCTTCATCTTCAGCTCTGCAGGCT 

GTCTGAGCAGTGGGATCAGTTATCCTCCTGAGCTCAGCTCCGTCT^ 

CTGGTTTCTAAGGGAACGCCCAGCTCTACATGTTGGCATTGCAGGTAGCTCTC 

GTTACTGCAGTGAAGCCAAAACCGAGGTGCCCCAAACCCCAGTACGGACAGGTG^ 

TTGCTCTGCAGTCACAATGCTCTCTGGTGTCCTGCGTCAAGTCTGCCCT 
1 5 CTGCrTTCCTAGCTGGCCCTTCCTCCCTCCCTTCCTCCCTTCCCTCTCC^ 

GGCATACAAGCAGAAGGTGGCAGAGTTTGTGAAGCAAAGGCGTCACTGG 

CCTCAGGCAGGTGCATAGATGGAGCTCTCACGCCTTTACTCTCTGACTT^ 

CGG AAATGGG CTGCCATGGCAG CGAGAGAGAAGGTCTCAGGGT CAAAGGGCAG C CTCAGT CAGGTCTCAGGG CAG AAG AGCAGCTG 
AGGAGC CTCCTGACTATGACCGTGTGTAGCAGGAAGTG CTCTCTCAACCTAGCTA^ 

TGGAGGAGGGACCTCTGGACAGCAGAAGACAGCAGGGCTGAG^ 

TCC CT AGG CTTT CAG CCTG TTTG TT AATTGTT AGGAGTTT CAAGGG ACCC CTGCATGG T AGCACTTG C C CCTGACAAG AGCAT AG C 

ATAGGATAGCATAGCATAGGATAGCATAGCATAGGATAGCATAGCATAGCATAGCATAGCATAGCATAGCATAGCATAGGATA^ 

TAGCATAGGATAGCATAGCATAGCATAGCATAGAATAGCATAGCATAGCATAGCATAGCATAGCATAGCATAGCATAGCATA 

2 5 ATCCGGGAACCCTGGGTCTCCAGCCAAGTGCAGTATAGCTGACAGGTCT 

TTGTTTTTGAAGTGTGAGCCTATTCATGAGCACAGGCTTCAGGGCATACTGTCCCAGGA 
TAGCTTCAAGCAGCTTGGGTGGGGGCGCAGGTGGAGGTATTGTAC^ 

TGTGTGTGTGTGTGTGTGTGTCTCAGTGTGTTTATGTATAACTGAGCATAGGTGAATGTGTA 
TGCATCTATCAATGTATGGGCGCATGTGCCTGTTTGCATATTAATAT^ 

3 0 GTAGAATGTCCCTGCCCCTTGCAGTGCTGAGCAGGGGCACCrCAAGTT^ 

AACCAACTTTACCTCCCGTGGGGACAGGAGTGAGATCCCCACAGTGTCCTATTGGTGGTCAGTC 
GCCTTGGGCCATGTATACACAGAAGCATCTTCTCAGCTACGTC 

TCCCTGGCCATGGTCTGGGGTGGGCCAGAAACGGACAGTGTCT^ 

3 5 GTTTTCATCACGGGCATAAAAATACCACCGTGCTGACCTGGTCAGGCGAGGTTCCATGTGG 

ATTTGGGGAGAGCACACCCTTGAAGTTCCACTTACAGACGTGCCCAGCTCTGCATGGTTCA^ 

T AAGCTGGGGACCGGACCCATAATCCCTTTTGCAGAAGG CATTTCTTCAAGTCTT AGAAGGAAATAAATTTGG AAAGGCAGTGAGA 

GACCCACCCTGGGCATTCAGAAAGTAGACTTGGCCTGAAGGCCTGCT^AGGTAGGAGAGGCCTGCCT 

CTGACTCTTGCTGTGAGCTTCACGAGCAGCTAAACTCACCATCT 

4 0 TCCTTGCCAGTCTGGTTCCTCTTCTTCCTAGAGTGTCTCAGGATGAC^ 

TCTGCCTATTTCCTGAAGTGTATTCACAGATTTCTCACAGGGTGCAGTTCT 

ACTTTGATCTCCATGAAAGAAGTCCCTTTTCCACAAGCACTGGCACAGTGAAGGTC 

TGGAAGTACTCATCAGAGTGGCCATGTGGGAGATGCTGGGGCTTTCTGGGTGGA 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGGGGGGGGGTGAGGAGGGA 

4 5 AGGAAAGGGGGGCAGGTCACTCTGCAAGAGACCAGAGTG CTCCTTAGCTGAGCG ACCT 

CCTCTGGCATCCTGCCTAACGGTGGAGGCTTTTCCACATACTCCTACCACACGGTC 
GGACAGAGTGGTTGCTGAATCATGCCCTGCTTCGGGCATTCTATTTT 
CTCACCCAGATGCCCACTGTCGCCTCTCCGAACCTTGTGGATTTCCCT 
TGTGCTTTGCTCTCTGGTGCCTTTCACAAAATGGCATTTTAATCCCTGTTA^ 

5 0 AGTACCCCCATGTGGCCAGGGCAGATGGGCATGGGTTGTTCTGTTCTCTGTGACTATCA 

GACAAGGTTCTATGCATG CCCGTGTCTTCCTCCTGGCACGTCCCTGG CTGTG AATCTGTCCTCAGCTATGCAGTG CCACCAGGCTG 
TTTCCCAAAATGCTGGGGGTATGG ATGG CGGGGGGGGGGTCTTGTTTTACCTTGGGGTTCTTCTGAGGGGTGCAAAG CTGCCCTCC 
T AACCTGGG AGAAGCAGGCCTCCAGGGCACCACATGCACAC CAGG CTGGCGACCCTCAGAGCCCCTCTG CAGT CCAG CCCTGCAGA 
GCCTGTGTTCATCTAGCCCCGTAACACAGCCATCCCAGTGCCTCACACTGATGGGGTG 

5 5 TGGGATGCAGACTCAGGGCTAGAAGGAGGAAGCCTCCTGCTCCACCC 

CCTGAATAAACTCCAGCTGCTTCCACGTGTGGAAACAGTGGCTCAGCCCCTACCCTGGCGTGTGACTG 

GCAGCGGCGTCCTGCAGCCAGTGGAGCGGCTTCAGACCTGCAGGCTGGCTGTGTGGAAACCTTGG^ 

TTCTGATGACTCACACTGCAGGGAATTCCCTAATGATTTGGAATA 

TAGGGAGCTCCTGATAAG ATGGGACACAAAGGCCCTTTT AGGATGGCCGG CTGGCTC CCTGCCTGGGAGAGG CTGAGTTCTG AACA 
TCATGGCCGATACCTGATGATTCCCTCAGGAGTATGAAGGACTATCTCCA 

TGCCAGAAGCAACCTGCTAGGTGTACAGTCAGAAGTCCATTCAGCCACCGTCAGAGCGAGAAGAGTG 

TCTATTTGACAACTGCAGCTGGAGGACCTGGGTCTGGATGGGACCAG 

AGACCCCAAGGGACCATATTTTCAATGGGAAAGAGAAGCCGATGGGTGATCT 

6 5 CTGTCTTGTCTGGTAGTGGTTGGCATGGGAATGCTGGATTGGAGCTGGCC1A 

GCCACTCTGCTGGGACAGGGGAACAATCCGTCACAGGCAGCACCTTCTC^ 
ACGGGGGCCACACTACAGTTGGCACACCTATGTACAGGCAAAGAAACTGTAGATC 

CCTTGTGGAGGGCAAATCT AACACAGKATGCAGCCTGACTACAGACCCAGAG CACGGTTAAAG AACCG CATGTGGGT ACTTCCAGT 

7 0 TTCACTGTTTCTGACATTCCGGTGCTCTAGGGGTATCTTGTTGTTTTCCCTGA^ 

CATGGGCTCACCTGATGGCATTTGCTCAAGGCCTGGGGTATCTGGGGAGCTGGTGTGGCCT 
CTCTGGTGACTCACATTTCCACCCCGTGCCTCCACCCTCCGCCCACCAC 

TATCAGTAATGTAGGGTTAGTGACAGACAAATGGGGTGTAAGCAGAAACTGAGCGTGTG CCTAAAGCGG CTCATCTCAGGAACTGA 
CTTCATTTTGGAGGAGATAAAGCCCAGGATG^^ 
7 5 AGGACTGGGGACTCAGAGCCTGTGAGTGGCAGACACCCTTCCTCTAATCACTAGGTCCACACCTCATCGCCAGCTG 
GTTCTTTCACTGTG CC T ATG AG AGCCTTTTAAT A'l"l" rri'r 1 T T AAC ATGCAAACAG CC ACTGGGCT AAGGTT AAAAAT AC CACGTG

TTTTt5CATTTCTAGGTCCTGATGTGGCATC^ 

GGGACnTTGCCCTGGCTTCCCCCAAAACACATAGGAGGCTGGT^ 

TACATGTTGTGCATTGCTGGGTT AT ATTTTGGGGGTG CATG TTG TGGATTGTTGTTTTT AGGG GGTGGGGT CG C CCTGCCTCT CTC 
5 TGCTTTTCTGTCCCTTGGGAGiA.GACTAAGGTGTAGGAA^ 
CACAGGGTGGACGATTAGAAACACTCTTCATTATGTG 
ACACAGGGCCAGTACTCTAGAGCAGGTCAGCACATGCCATCCAGT 
CCTCACCGTTTCTTCCCACTCTCCCCTCCATGACCCT 
AGAGTGACAGATTTCITCCTGCCCTTTGTTCCAGATGGC 
1 0 GCCAGTAAGTACTTAGAATCCTGCAGCAGGGAC^ 

AAAAAAAAGACAGAGGCATTGCCCAAATCCCTTGGCCAAATGAGAGCCACTAGAATAGAA 
AGAGAGGCTGGAGAGAGCTTGAGGCCrC'lTl'ATTCTGGG 

GGTGTAGCCCCAATTGAGTGGCTACGCCCCGGGAGATCTTGGTGTTCTAAGCCTC^ 

GCCACCCAGCCTGTGTGACTTGCCACCACCCTCTCCAGAGAACGCCTCAGCT 
1 5 AAAGGGCTGACCGAACTGGATAGCTTAGCTCAGCTGGTTGGGTTTGCTG 

ACATAATT AGTCTGTCT CCCTCCCX5CGT ACAT ACGCACCTG CT CACCAT CATGTG AGG TCATT ATTGGTTGTG 

ATTGGCTGATCTTGTCTCATATACTTGTGCTGGCTATC 

AAGTGGCTCCTATTGGAGACCTCGTGGAGTCAGGGTCCTCTT^ 

TGCTTGCTCGGCTTTTGCAGTACTGGAAGTTGAACTAAGGACCACGCA 
2 0 CCTCAACAGTGTCTTTTAATTGGACATTAAGCrc 

CTAGTCTCAGAAGAGCATAACAAAATGCAGTGAGTAGGTTTTTGAA 

ACTTCGTCACAATAGTCTGGGCAGCGTCCTTGTGACCACAGGGCATGAGGAGCCTTT 

TTAAATGAATGAGAGAGCTTTTTTTGGTTTTGTTTTGT^ 



2 5 TGGGCATATGAGAAAAGGACTTTTTCTATGAACACTGGGCAGGCAGCAAGAATC^ 

CAGAGTGGACAGGCAGTCATCGCTCCCGTCAGCAGAGCAGTCACCAGGGTGTCT 
TGTCTTCTAGCTGAGTCAATGAGGATGGGGGAAAAGAAATCCTTT 
ACCTTTCCAAGTACTTCTCAGAGGAAAGGTCCTCTCTGGCCAGTGATACTGTTCAT 
TACCCGGCTCCCTCCCTCCCACTGAAGAGATCCTTCTCTGCCT 

3 0 ATTGAAGGCAGTGGCACATACACATGTACATGCGCACGCGC^CGCACACACA 

CCTGCACACACATTGCATTTATAGATATTTTAAAGATGGGTTTCT^ 
GGTTTCCTCAGGCCTGTGCTGGCTGCAGCCTTTACAGTCTCTGACAAC^ 
CCTCCAGCTTCCTTCCTGGTTTTGAGAGCAGAGCTCCT 
TGAAGTGCGTTGACTTGTGTCCGTATCAAGATGTGACCCTC 

3 5 TTTGGTCCTTTCTGCGTTCCTTCCACGGACCTCAGATATAGAGACCACCATGGGGT 

7VACGTCACAGCCAGGCCACTTAAAGCTGGGCACAACCCAGGCCTGAGTGG 
GTAGCATAGGCACCCTTTACCCGCCCCTCCCCCACCCATCTCTTCCAAAGGAAGTGAAGGT 
TGGAGCCCAAGGGGATGGAGCCCAGAGGGGACCCTAGCCAGTCCTGC^ 
CTCTCTGATAAAAAGAGGCTGGGGCTGAAGTTCTTCTGGTTTCTGTTTCTA^ 

4 0 GTTTGCTGCCATCGATTTACTACCATTATAACAAGCTACGTATGGTTAATT 

GACACATTTCCCCTGGCCTGTTAGACTAAAACCACACGAGGAAAGAAACG^ 






TCCCCCCCCAACTTGGGTGTTCCC^CTCAGTTCCAAGTATTCATGGCTTGCA 
AAAACAAAAAAAGAAAAAAAAACTTGCATATTTCTGTTTAAAAA 

ACTCTGGACAATGTCCATAAGGAATTCCACCATGTGTAACCACAAAGTGCCTGCTATAAGTAGGCAGAA 
AGGTCACTGGGCCTTAGATACTGGGGCAGGAGTAGGCCTTGAAGTC^ 

GATGCCCATAAGGGTGTCrCTTCTTGCATTAGTTCAACCACAAGGGTCTCCCAGCCTCTAGCT 

TCCCTGAGGAGCCGGTGCACGCTACCTCCGGCCAATTCACAATCAAAACACGTGAGTAGGTCAC^ 

CCCTCTCAGTGGCCATAGACAGCCGTTTGAGTGGACAGTGATGTCTGG^ 

TCTTTGCTGTTTTACCACCAGAATTCCCTGGTGGTTGAGATACCA 

TTTCTATGTCTGCAACGGGAAACGGAAGAGAAGCCAGTACCAGC^ 

CCTTAAGCTCTGAGAGTGAGCTGAGAGGAGGTTTTTACTGAGCAG 

AACAAAACAAAACAT AC CTG T AG CCT CTT CACACCACGTG ATAG C C CT ATTCACAAGACCAAGTCGC CCAC CCCCTCAAAG AAAAG 
CGAAG C CTGGG TGTGTTTTC CTG TG ACTGGTGC ATGCTGGGGTC AT CACTTGCT CGC CTTTTGCAAAT ACAGCAGC^ CGG CCAACC 



GTCAACGCAGTTTTGTAAGTAGCTTCGAAAATAAGCCTGCCGTGGTCACTG 

TTACTCATTTAGGGAATGGCTTCTTTCATTCAGAAGTGATC^ 
6 0 CGTTTTTAACTGCTCCCCACCCOSAAAGC^ 

CTGGAGCAGGATGTATAAACAGAGAGCAGAGTCC^TTGTTATCCTGAGTT 
CCTTTTTAAATTGCTACrTTTTTTTTCCT 

CATGAAAGCAGCCATATTGCTATCTTAGTAAAATCAAGGTGGTTTTGTTGTT 
CAAGG TTTTTTGT T T T TG AAGTCTrAAAGGCATTTGGAACAGTTT AG ACAGTACGAA 

6 5 TCnTATCAGGAAACCCCTAGAAATGCCCTTTAAAAATGAGGACAATAGCTT^ 

CAGCAACTGTCTGTGCTGTGGGTGACCCCAGAACIAGCGGCCCATCCCCC^ 
TTCCTTTCGGGAAATAATGCACACTCTCTCTTACAAAAAAACCAAAC!AT^ 
TGCAATGATTGTGTCCTACCTATACTTCAAGCATGGTTGATCrAAGATTTT 
TTTCTAAAACAATCTGGAAGCATCGTAAAGATTAAC^ 

7 0 GCTCAGTATTCGTAAAGAGCACCGTGTCAGAGGGGTCTGCGGAGCAGGCGTCCCXjT 

CTCTCCCTCCCATGCTGCCCTCCCTTGGTGATGCA^ 
CTCCCACAGTCCACATAGCCTCCTGCTGGAAACTCCCAGCTGCAAACC^ 
TGGCTTCACCCATTTGCTCCAGACCCOTCTTCAGCCCTITCATCCTGC^ 
CCACCACCATTGTGGTCTCGGGGACCAACCGTATTrCCACACCATTAGACTG 
7 5 GTTGAATGCTAGCATTACTCATTGCCTGATTCCrGGCTGGCTACCXSAC^ 
CAGATGGAGAACCCCAGCTCCTGTGTTCCGT

TCTCTGAGAACACCGATGCT^TGAATGGAT 

CTCCAGGGCACATGAGTGGCTCITAATTAACCTGACTCC^^ 

CCTCCCCTTCSAGC CT TTC TG TATTATTAAGTGAGTTTCATGT 
5 TTATGGTCATCAAATATGCTGTGTTTGAGAAAGTAGGAATA^ 

ATACAAAATCGAGACTCATTTAGTTTTCCTTCAGACTTAGGTGTGAGGATGTCT 

TACCCTTGTGAAAGGGCCCCCACTCTTGCACAAAACCGCCCCT^ 

TCCAAGGTATTTTGCTGAGTATAAGGCAGGGGGATGCCAGCTTTGTGC 

AGCAGATTAGCTCACCTGGTTGACACCCACCCGGGGCCATCCTCCTTTCTTAl^ 
1 0 GAAAATAGAGCAAGTGACTGGGACCCTTGGGTTCTGGGTGGGGTGGGGGGGGTACTGAAGTCT^ 

CTGGAAGGGTAGCTTGGCCTCCTTCCCTCCTACTCGTGTGTGCCTAACAGGGAAAGTCTCCA 

CGAGACTCTTGTCTGCCACTAACTTAGCTTTACTCCCTGAAAGGAGACTGTA 

CACACCTTAAAGCCAGTGCTTCTGTACAGTGAGGGGACAAGTGACCGT^ 

ACTGTTTAGCTAGCTAGTTAGGAGTCCCCCGGCCTTCCCAGCCAGGACAGGACTO 
1 5 AGGAACACGTACTCCCGAGCAGATCTTGCACAACTCATTGGAAACCCT 

ACAGATGCTAAAAGCACCTGCCGACTGAGAAGAGACTTCAGAGTGCCCA^ 

TCTCACTTGACACTAACTTTGCTTTACTCCCTAATAAGAGACTGTAATA^ 

GTCATAAGAAGAGGGCTATGCCAGGAATGTCTGGGTCTGAGCCOTATTC 

TATAGAATGTCTCCGGTTTGGGTTCCAAGATGAGCATCTCTGCCAGTCTGAGACCTCAGACT 
2 0 GCAGGAACTGGTACCTGCTCACAG<3TGCAGG<3GTGCITATAAAA 

TTTCTGTACAATGAGGAAACAAGTCTCCACAAACTATAGG^ 

GAGTCCCCTGGACTTCCCATCCAGGACAGGACTGTGAGTCCAGGAACACCTACTCCCATCCCTGACCCAACCCCTAGCACT 

CTCGTGTGAGAACAGAGACTGACCTGGTGACAGAGACCAGTAC!ATGACTATAGGACAGGAAAAGCATTGCAGrc 

GTCAAGCAACCCCCTGAGCACACAAGACCTATAAGGATAGGTTGCCCCAAGGG 

2 5 CTCCCACA.CACAGGCACACTTCTTTGCATGGACTCTGGGGTGGAGCCAT 

TTGAGCCCTTCCTGATTATTATTAGTAAGTACGTCTAAGATATATTTCCATCTCAGAGGAATAAGAGGAACGCA 

GATTCTTGGTTGTATCACCAGAGTCCGGAATCTTAGAGACTCGTGTAGACT^ 

CCCCCACCTTATAATGGATCCCTGTGCCTCATGGGCTCTGTAGCTTAGTCACTGCCCTGACTAC^ 

CTCAACTTGT AACTG CCTGGGT ACAG AG AC CGTGATGG CATGG CAGTACTGTG CACAGC C CATGGCTG TCTG CCTGG GCACCCACT 

3 0 T CCTGTCCTATGAGAAG CCTGG C CCTGG AG AG AGAGG CT AAGGTCTGCC CT ACT C CAAGGGTTT CTT ATGG AGAAAC CATACAACA 

TAGTGGCTTGTTGGGATTAAAGCCAGGACAGTTTCTTCCACTTACCA 
AAGGCTTCAAATGGCAGAGAACACAATGTGGTGTGTAACGGGGCAAAGTGATGTT 

CTT AAAAGCTAGTGAAGACCTTG CGGAGTGAGGAGAGGTQVGGGTGGGCCAAGGAG CGCTGTCTCTGGCGC CTAAAG CT ACAGGTG 
GTACCTACCTTCAACTGC CAATG ACCTCAGCAACCAACTGGCCATTGTAGCTGTGTG CCTTCATGTTCAAATGACACCGGTCAGTA 

3 5 GACATTTGGGAGAATGGCCAAGGGCTGTCTAGACCTCTGGATCAAGGACT 

GGAGGGAGTCTGTTTCTCATACCGACACTGATGTGCCGAAGAGAAGGAGCTAACACTTGCACCC^ 
AGGAAGGTTAGGCATCAGAAGAGGCCATTTCrrGCACTATGTGATCTCTCAGGTC^ 
GAACACTTGTCTGGGTGCTAATGTGACAGCTCCAAGAGATCCAGGCCTCTCAAGCCTAAGCT 
TGTGCTTCTGTGGACTCTCCATTTTCACACAGGGAAGGAGATGAA 

4 0 AACCATGTGGCACCTGGAGTGCCCGCAGAGGATGGCTGTGACTGGGACATTTGA^ 

AGGGCCCGAGAGGGAGGCCAATGTCTTGTGGAGCGATCACAGAACACAATAGGCCATTAGG 
TTAGAAATCTAAGGGCCATGTGCCACCAAAAACAAACAAAAGCGTGAGTCAGACT 

GGAGACATGCATGGGCGGG AGGGAGAG Cy^ATAGCTCCGCCTGTGATCTGCCCAT AG AATTCATGCCATATGTTTG CCACCTGTGA 
GGTACTCTCAGGTCCCTGGCTGCCCCACCCAGCTCTGTCCTAGGCAGTATGGCAGCCCCAGAGCCCCAGCTGCT 

4 5 ACTGTCAAAGCATAGAGATTGGCTTCAGCAGCTAGTGGATAGAACGCCAC 

ATCCACCGCCATCAAAAGTAAATGGTGTTTATCATTCTCCACCT 

CTGTGCTCACTTTGATACCTCTGGAAAGGGCATATTCCCAGCT 

AAG AAAGGCCGG AGC CAACAAGTCCrTGGTGTGTGCTGTTTCCAGTCCCCGGCGGG CTGAGCTTCCGC CTCACCCTTCAGG ATTTC 

5 0 GTATTCTGCCTGAATGTACCTCCACGGATCCTCCCGCTGCACTGAACACATCCCTCT 

AGACAACCCTTACGCTCTCTCATCCCCAGGGAGCTC^SGAACCACACATCTGCAC 
CTTCCATTCACAGGCTCACAGGTCCTGTAGGATCIXIAGGTCGCTTG^ 
GGCTCTCTGTGCTCTTCACCCCTC^AATCCTCTCTTGCCCCTGACTTACTG 
AATGTCACTGCCATGAAGACAATCTGACTTTTGTTTCATAGGAGTTT^ 

5 5 CAGCTGGTAAAATAATTCAAGCCACGACACTTGAGTTTACTTGAAAATGCAAGTG 

TCACTTGTACTGAGCATACGTCCTGTGAAGGCTGTGGGAAGGGGACAAGTGCCCGCCTCTGCCT 
AAAGGGGACTGATAAAGGTGCTTCCCACCntSCTCCACTAGAAGGACAAGATAGATGCC^ 

TCCCACATTCTCCATATGCACAGCGTAGAGGGTATGGTACAGGCTGTCATTGATGGACTCGCAGAGGTC 

6 0 TC CTGACAGCTCTGCCTCAGAATCTTCACACAGGTCCCAOT CTGG CAAAGCCTAGGCTGTAGG AATGAATTAACCCA 

GTGGTCACACXSTTAGTC^TAC^GGCTCTGCCATGTGTCATGTGTGATGATGCCGTGTCT 
CCCACTCAGCATACAGGAGCACTCITCACACTTAGGACTGAGCT 

ACACAACCGTGTGGTCTCTCTTC^TGCAGCCACACTGCCTTCGTGTGTCAGATGCTGGGCAGTCCT 
CACAGGTGTCAGATGTCGGTGAAACTTGCCTGGCAAGGTTGCTCA 
65 ^XAAGATGGTTGGATGTCCGAGGTGCTGATAGCTGCTGGGACCTTGAGCTGATGCCCCAG 
TGCGGTCCGTCTGCAO^GGTCCATGGGATGTTTGACTGATCCCAGGCAAAGA 
CAGTAGACACATTCAGGCTCCCTCAAGCACAGGCACATGG 

ATTACTCCAGGAGAGAGTCACCTTCTTCCACTTGCCCCACTTAAAGGAAGGTCATTCT 
CCTAGGCTGTCCTGCACCCTAGGCAAGCACTCTTAAGCACTAAGCCATATTCCAGCCTAGCTA^ 
70 TCTCTGTGGCTGAGCCTTACTAGACTTCGCCAGAAGATGTGCACTGTTAGTCTGAAT^ 
TAGGAGATCAGACTTAAGATTCAAAGCCAAACTATGAATCAAGAGCT^ 
CTCCTCTCTAATCCACCGAATGAGAGTTGGTGAAACATAC^^ 

CCAGCAAGCTTGGATGCAGGGCAGCAGAGGC^GCTGGGGTAGCTATACAGAACAAACT 

CAC AAGGG CCTGTGACAGACAO AG AAC AGTOG A ATTCCCCCAGG AT CCTT CTGTGTGCAT AAACTT AT AG C CCAG CCCCAGT ACAG 

7 5 CCTCAGCACAGACTTAATCCTTAGCCACACACTGGAGAGGGCCT 
TGCACAGCAGGCAGCCTTTGGAGAGTGCCTATAAGCCATTTTAGAAGTATAGTTTCCCAG
GACAAGCCCCAGAGATGGAACTCTGTGAGGCCCAGGAGAGGAGCGTCAGCCAAGC^ 

CCTCATTGGGAGG C ATCCT CAC CTGGTGTTGG CATCTAGGAACAGAAGACTATTTGCT AGGCTIT AAGT CTCAAGTCT AGAGT CAAG 
GAATGCTCTTTAAGGCTCCrrcGCCCATAGACTATCATTGCAGGACCTGCAGGAG^ 
5 ACAACCGAGCATGTGCCATTGACGGTGTCAGCCACACTTGGCT 

CTTTGAGGCGACTCCAGGTCAAATCCCTCAGCCGTCCTTAGACTCT 
CCTGCAAGGCGCTGCCGTGTACAAGGTTCCTGGACTACGGTGACTGCTGTGGCT 
GACACACATCTCCCATGGCCXSCACAGACCCAGCATTGTGCrGTGCTGGGTGTGAGGATCAGTCACA 
TTCTTCATGAGGGCTTGCrTTGTCTCTGACAGCTGGCCCAAAAGATTGCC^ 
1 0 CTGTATAATAAACTCACGACTGGGCTCATGTGCTGTTITCAATGTGCTAGCATTGCGG 

N CCCAGTCT AG CACGGAGACATTGCGTCACACCCCCTGCCTCCCTCTGG CGCAGGAGACAGACCAGCGCCTGGGGCTTCTGTGGAG 

CCCAGGTCCCATTTCCTGACTAATAGGAAACAAGAGGCCCCATCCTACAAGGCTCTCAGCTCCAC^ 

AGTCTTGAGTTGTATAGCCCTAGGGTTTCrGTAGACAGACACTGTGAGG 

GAACCTCAGAGAAGGGCCATCAGAATCCGATTCTTGGATCACTCCTCCTCTGCCAGATGGACAAGCTGCTCrc 
1 5 CTCAGCTGTGCTGAGTGACCATCCAGTGATCAGGGGCCTCTGTCTTTGAGTGGAACTGCCT 

GTGTGAGACTGTGACAGGGTCCCCTGCACAGGAAGCAGAGGCTTGCCACCTTGGAGT 

CTTCTTTCCCTTTGCGCTGACCTGTGTGACACTGCCCCCTCCCAAGCACTGAGCT 

GACATTGGGCTGGCAATGTTTAAGTCTCCGTCCCCATCTATCCAATCAAACAGGCT 

TGAGTGGAGCTGCTGCCTCAGTCTCAGATAACCAGACAGAAGGAGCCAGAGGATGGAGTGGA 
2 0 GGCTGCGCAGGGTCTGCCCCAAACCCCAATCCTGGTTCATTTTGTACCTCCTGAGAAAGA^ 

AGGACTCATGCCCTCGTGAATGTGCAGAACTTCAATGCACTG 

GGTTAGCAAGATGGGCCGGGGTACTTTAAACAGGTCCTAACAACAGAATCTACAGGGTTCCC 

CTGTCTGGCCTCTGGATTGCTGGCCCAGGTCCAACCAACGCTCCAAGTCCCTC 

ATTTACTGGGCAGCTGTGTTTGGGACTGTGCCTTTCATGTCATGCTGGGTAGTTA 

2 5 TCGTGGGAGGGTTGCGTTGGCCTTGGGACCTGGCTGGGGACCCAGAGCAGCT 

CTTTTTTTTTTTTTTTTTTTAATCACAAGCTCTGAAAATGACC^ 

GAGCAGAGCGACTCATACCCAGGATTCTTCTCGGTGTAGTTTTGTTCTTAGCAACATTGCT 

TGATCCCAGAAACTGGCTGGACACGGTTCrrGCCCC^GGTGCCTCTCAGGGTCTGACTTCTACCGGGTT 

ACAGTGTGGTTGAGCCAGCTGGACTGATACTGGATGGGGGTCGTTTCTGAAG 

3 0 TCGTGCCTGGAGCTGAGTTAGTACCTGTATAACACATTTCATTGTAAGCTGTAGAACCCAGATGGCCAGGG^ 

CAGCTCCTGGGAGATACAGCCCTGAGGAAGGGCGGGACCGAGAGCT 
TTCCCTTCCTTCCGTTTTTGTTAAATGCCAGGA 

GGGG ACCTG CC CAG AG AGG AG C CTGCT AC CTG CCTG CAC CT ATGATT CCCACGGCTT AGG CAAC AGGACAG AT AGGGTGGTTGGTG 
AACTGAACTAGGTCTTCACTGGGAGGCTTTTCTCAGAGGCGCCGTGTGCA^ 
35 TTC 

MOUSE SEQUENCE - mRNA 

GGCCCGACATGACGGGGCTCGAGCAGGACCCGGAGTTCGACTTCGATTTCCT 

GCCGCAGAACACTACAGTTATCTGTCCCCTAGTGTCACCTCGACCCTGC CC CTTCCCACAGCACACTCTGC CTTGCCAGCAG CATG 

4 0 CCACGACCTCCAGACGTCCACCCCGC^TATCTCAGCTGT^ 

GGCCTTCGGGATACTTCCTGTCCrCTG^CAACACCAGACCCAACGG 

TACCTGGG CCTACACCATGGCAGCGGCCAGTTTTTCCACGACGTGG AGGTGGAAGACGTACTTCCT CAAGCGCTCACCGTC 
TACAGC^CCCTGCACCTXSCCC^CCTGGAAGCCTAO^ 

CTAACTCTGAGGCCTCCTCCTACGAGTCCAACTACTCCn'ACCCATACGCGTCCCCCCAGACCT 
4 5 TCTCCCAAGACCACGGACCCGGAGGAGGGTTTTCCCCGAAGCCTGGGTGCCTGCCACCTGCrA 




GAAGATACCTGGCTCGGTAACACCACCCAGTATACCAGCTCTGCCATTGTGGCAGCCATCAACG 
GGACCTGGGTGATGGGGTCCCTATCAAGTCTCGAAAGACAGCACTGGAGC^ 



5 0 GGGAAGACCTGGGCACCACTCCACCCACTTCTGACITCCCACCCGAGGAGTAC^ 

GAGCAGTrATCTGTCGGTGCCACAGGCCTCGTATCAGTGGGCGAAGCCCAAGTCTCTTTCCCCGA 
GCCTGCCCTTGACTGGC^GCrCCCGTCACATTCrGGTCCATACGAGCTTCGGATCGAGGTG 
ACTATGAGACGGAAGGCAGCCGGGGGGCTGTGAAGGCTTCAGCTGGAGGAC^ 
GAACCrCTCACGCTACAGCTGTTCATrGGGACGGCrGACGACCGCCTGCTGAGGCCCC^ 

5 5 GGGGAAGACTGTCTTCCACCACCAGCCACGAGATCATCCTGTCCAAGACCA 

TGCGAGCCATCATCGACTGTGCTGGGATCCTGAAGCrCAGAAACrCTGATATTGAGCTGAGG 

TATCGAGTGTTCCCAGCGGTCAGCCCAGGAGCTGCCCCTCGTGGAGAAGCAGAGCACAGACAG 
AGATGGTGCTGTCTC^CCATAACTTTCTGCAAGACTCCAAAGTC^TTTTCGTGGAGAAGGC 

6 0 ATGGAAGCAAAGACTGACCGGGACCTGTGCAAGCCAAATTCCCT 

CCCCGCCCAAGTCAGTTTCTATGTCTGCAACGGGAAACGGAA 

ACTCrGTCTTTCT AACCTrAAGCTCTGAGAGTG AG CTGAGAGG AGGTTTTTACTGAG CAGCCCCCCGAGGCT AT AAGAGGATGTTG 

TTGTAAACAAAACAAAACAAAACAAAACATACCTGTAGCCTCTTCACACCACG 

CCCCTCAAAGAAAAGCGAAGCCTGGGTGTGTTTTCCTGTGACTGGTGCATG 

6 5 GCAGCGCGGCC^CCAAGCAGCTCTGCTGCGCTCAGGGGCTCATGCC^TCTGGG 

T AG AAG AAAG T ATTTGT CAACG CAGTTTTGT AAGT AGCTTCGAAAAT AAG CCTGCCGTGGT CACTGGGGAA.CAT^ 
CCTCATGGTGACGCTTCTACACAGCGTG CGGTGTGTCTCCACTGAATAATGCTGTCC CCTGGTGACGTGAGACTTTCAGATGGAAG 
CTCTTCTGCTCGAGTTTACTCATTTAGGGAATGGCTTCTTT 

TATTTACGAAAATACCGTTTTTAACTGCTCCCCGCCCCGCAAGCTTCTAGAAAGGTC 

7 0 GTGCAGGCCATTCTCCTGCAGCAGGATGTATAAACAGAGAGCAGA^ 

AGGCT ATGT C AACAACCTTTTT AAATTG CT AC T TTT T TTTTTCCT CH* AAAAACTTAAGAT AGTCATGT AATTT AAG AGGG AAGTT A 
TACAATAAATACTAGCCATGAAAGCAGCCATATTGCTATC 

TTGTTTTTT AAGGTTTCAAGGTTTTTTG TTTTTGAAGTGT AAAGG CATTTGGAACAG TTT AG ACAGT ACGAAAAGTTGGT ATT AAA 
ATTCTGAA fi /TCAATTGTCTTATC&GGAAAC^ 
7 5 GACATCAGTGAAAGGGCAGCAACTGTCTGTGCTGTGGGTGACCCCAGA^ 
TTATTTCACAGGCCrCTTCCTTTCGGGAAATAATGCACACTCTCTCTTACAAA
TTTTATTTTTTGAAAGTGCAATGATTGTGTCCTACCTATACTTCAAGC^ 

MOUSE SEQUENCE - CODING 
5 ATGACGGGGCTGGAGCAGGACCCGGAGTTCGACTTCGATTTCCTCTTCGAGTTCGATCAGAGC 
ACACTACAGTTATGTGTCCCCTAGTGTCACCTCGACCCTGCCCCTTCCCACAGCACACTCTG 
TCCAGACGTCCACCCCGGGTATCTCAGCTGTTCCTTCAGCCAATCATCCCCCCAGTTACGG 
GGATACTTCCTGTCCTCTGGCAACACCAGACCCAACG^ 
CCTACACCATGGCAGCGGCCAGTTTTTCCACGACGT^ 

GAGGCCTCCTCCTACGAGTCCAACTACTCCTACCCATACGCGTCCCCCCAGACCT 

GACCACGGAC CCGGAGGAGGGTTTT C C CCGAAGCCTGGG TG CCTG CCACCTG CT AGG ATCGCCCAGG CACTCCCCAT CC AC CT CT C 

CTCGGGCAAGCATCACG<5AGGAGAGCTGGCTCGGTGCCCGCGGCT 

AATGGCCGGCAGCCCTCCTGCTC^CCCCACCACTCACCCACACCATCCCC^ 
1 5 CTGGCTCGGTAACACCACCCAGTATACCAGCTCTGCCATTGTG^ 

GTGATGGGGTCCCTATCAAGTCTCGAAAGACAGCACTGGAGCATGCGCCCTCTGTG 

CTGGGCACCACTCCACCCACTTCTGACTTCCCACCCGAGGA^ 

TCTGTCGGTGCCACAGGCCTCCTATCAGTGGGCGAAGCCCAAGTCTCTTTCCCCGA 

TTGACTGGCAGCTCCCGTCACATTCTC^TCCATACGAGCnT 
2 0 ACGGAAGGCAGCCGGGGGG(nX3TGAAGGCTTCAGCTGGAGGAC^ 

CACX3CTACAGCTGTTCATTGGGACGGCTGACGACCGCCTGCTC 

CTGTCTCCACCACCAGCCACGAGATCATCCTGTCCAACACCAAAGTCCTGGAG^^ 

ATCATCGACTGTGCrGGGATCCTGAAGCTCAGAAACTCTGATATTGAGCT 

CAGGGTGAGGCTGGTCTTCCGAGTTCACATCCCACAGCCC^ 

2 5 GTTCCC!AGCGGTCAGCCCAGGAGCTGCCCCrCGTGGAGAAGCAGAGCACAGA^ 

CTGTCTGGCCATAACTTTCTGCAAGACTCGAAA^ 

AAAGACTGACCXI^GACCTGTGCAAGCCAAATTCCCTGGTGGTTGAGATA^ 

AAGTCAGTTTCTATGTCTGCAACGGGAAACGGAAGAGAAGCCAGTACCAGCGTTTCACGTACCTTCCTGC 
TTTCTAACCTTAAGCrrCTGAGAGTGAGCrGAGAGGAGGTTTrrACTGA 


HUMAN SEQUENCE - GENOMIC 

AAGGCCCTGACCGGTTTATAAAGAGCACCCATCTTCTCCGG 
AGCGAGAATTGCAGGATTAAGCTTCIACTTGAAACT 

GCTITATTG7VACCAAAACACTTTAG AAACCGTTGGGGAA CTGTGGTCCAGCC CCACGCTTTGTCTAGGGC 

3 5 AGGTTCCGTTCCCTCAAG CAGGTGATGGG AATCC CGG ACTGGAGCCTGAGGCTGAGCAGCCAG AGGCCACACCGC CAGGGCCAGGC 

CACACCACCAGGGCCGGCC^CACCGCCAGGGCAGGCCACACCGCCGGGG^ 

CCTGCCTCATCGCCCCACAGTCTGGAATGCTCTGCACACGCATCAGAAGGAGAGTCGCTGAGCATC^CAA 

CAGCCTCACTGCCCAAGGCCGGGGCAGAGATGACTTGCCTTGTGTTACTGCCT 

AACCAAGTAGTTTAAAGAAAAATGTTGGAATATTGCCATTTGAA 

4 0 AGGGACAAACCCAAACTCTTGCCAGAAACTAAAAATGTGATTTATTTACCTCTC 

GATCAGGACGCGGCTCTGGGCCCTCCTGCCTCAGCCCrCCGGCACTGTG^ 

CGAAGTGCAGAACT^GAGAAGAGGCTCCGAAGCACAGAGCGCCTCTGCCCTGGGC'r'l 1GCACACATTCCTGACTTCTTAAGACACAG 

CTGTGCAGGACCATTGCATCTCCGTGAAGCTGTCCTC^TTGGCCT 

GGTCATGTCTCCCTGAGACACACATGTGGCATCCTTTCCCCGC^ 

4 5 TTTCTAAGAGAAATCGAGGTTTAAAGGGGACTTTTTTTTGC 

GCTTGCCCGGCCTCCTCACCGGGAGCTTCCCTCTGGCTAGAGTTGCTGTTTC^ 

AGGAAG CGAGGACCCTGTTCAG CCATGGCCTTCCTGTGAATTAGGCCTCCTTCAGGTGCAAGTGAC^ 

TTGAGCAGAAAAGGGAAGTTGCTTCCTCACCAAGGTGAAGACCCCAGGGGAGGCT 

AGGGCATGGAATCGGCCTCTGGAGTCCCTGTCTCCATGGCAGCCCCGGGGTCCACAT 

5 0 GGCTTCAGAGGAAGAGAACGCCATTCCCTGCGGCTGTGGGGCAGCAGCIAGGCACACCT 

CGTGGGCTATGGCGGGAGTGGGAACATTCCCGTGACCCCCCTGGAGTTTAGGCCAGGCAGTGGAGGTCCTCGAGCAAA 
CACAGGG AG TG CT CTG TC CAGT AGG AGCATGG CG CCCATGT CAGGG CACTGGC CTGGAAGGGAGCGTCTT AGTGGAGGCTCTGGGT 
GCTG CCAGGCCACCTGAG AGGGG ACATG CATTGTAGG AAAGGCCACAGCGCACTGTG ACAGGTGTGGTATCCTGGAAGGAATGAAA 
AACACTGCCCGGCCAGGCGCGGTGGCTCATGCCTATAATCCCAGCACTT^ 
5 5 TCGAGACCAGTCTGACCAACATGGTGAAACCCOSTCTCTACT^ 



CAACCTGGGCAACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAGAAAAAAGA 
GGCTGCCATCACAAATGACCACAAAGTGGGTGGCTTGAATGGCAGAAACATGTCCT 
GGTCAGCAAGGCCAAGTGCCCTCTGGAACCGGCCACGGAGGTCCCTTCCCTTCCTCTTCCAGCTCGTGG 
6 0 TGGCTCGTGGAGCCATCATTCCAGTCTCTGCCTCTGTCGGCTCACCACATTCCC 
GTCATCATTGGGGTCCCCTTAACTGACACCATCTrAACTAGTTACAACTGTAGA 

GAGAC CAGGGG ACTT CAATGIAATCGTTTTGGAGJG ACGCAGTTCAATTCACAACAGCATTGTTCTGAAAT CTGTCTT 

GAGCCArrAGC^GGTGCCrCTCATTGATCTACCCTGAGAGGTGAAGGCCGTTACTATTGGCTCCCGTAGGTGTAC 

ATCCAAACTCGGGCTGTGGGCAAGCGCCTTCTCAC^CGAGAGCT^ 

6 5 AAAACCCCCAG^CATTGACAGTCTCTGGAAATGGTCCTAAGGACACGCAGCAAGTGA 

TTTAAAAAACCATCTTTACTAGGAAAGGTGAATGTATTTC 

ACCAGCAGGTCTGCTGCAG CTGAGGACACAGGTGGGTGGGGGAGAGTG ACT CGGTGTTTCTCATCCTGC 

CCCATCTGCCTGTTGCTGGGGCTGCTAGGTCCTGGGGAGTCTGGTGG 

GGGTGGGAGGCTGGGCCnTGGGTGTGGCTCACTGAGAAGACAGGGTGCIAACT 

7 0 GCCAGGGGAGGCCAGTGGGGTCTCAGGATGCTGCCCCCGTCAGCACTCAA 

GGCACTGTCCCCATCCCCAGCTCCAAAGCATTGGCTCAGAGATTTTGCCTGG 

ATCTCCTCCCAAAAG AAACGACTTCATTTTCAG CAG AACATGGGAGAAGGTGAAAG CTTAGGGTCCCCTCGAAAACATGGAGGTTG 
TCATCAATGGCATTTGGGACIATTCATGGATTTAATGAAGATTTAGCCT 
GTCAAAACAAATATCACACATGGACATCAGAAACTCTTGCTTCAAAGAAQ3CACT 
7 5 TGCCTCC^GATATTGTTAAAAACAATAGAGCAATCCAATGGCAG CCGTGG AATTTAACAG CTGGGTATGGTCAGGG AAAGAGAGGA 
AGAGATCTCTATCAATGTGGTCGGGGAAAGAGA^
AGACTCCTTCTCCCnAAGGAGCAAGCCAGAGGCATAACTCT 

CAG AAGTCACAAG ACAAGT AAAT AAG CAAATAACAAGCCC CACAC AGGGGG AC CAGT ATCCAGAG CTGCT AT AAT ACATTGT CT AA 
AATGCCCAGTTTCCAACAAAATAGGAGGCATGCAAAGAAAC 
5 CAGAAACTGCCTCGGGAATGACCAGATATTGGATTT AA CAAAAT ATCCCTT AT ATG CT CAG AGAACTAAAGT AAA 

TCTTGATT AAAGAAAG CAGGACG ACAATGTCACAT CACGT AGAGACTGTTG AT AGAATGTAT AAAAAAT AAC CAGTGGAAATT CAG 
GAGTTGGAAGGCGCAGTCACTGAAGTGAACaVTTCACTATTGGGG 

TTGAAGAT AC AT CAGT AGAGAGT ATG CAAGC C CCGAACAGAGAG CAG AT AGAATGAAGGAGACCAAGGGG AG C CT CG GAGAG CTGA 
GGGGCACCAGGGACGCCCTCCAGGGCACC^CCCACACAGGGGGTTC^CTA 
1 0 TATTAAAAGAAATAATGGTTGAAAATATCGCAGATTAGTTTAAAAAAATA^ 

GGGCAAGCCTAGAGAGGCCCACAGACCAACACATTATGAGTAAAAATCCTGAAA^ 

AATAATAAAATAAGGTATCACTTACAAGGGAACCCCAATAAGGCTAACAAGTGACT^ 

GTGGGATAAC!AGCCAAAGGAAAGGAGAGAGX3AGGAGAGGGGAGGGGAAAGGi^ 

AAACCTATCAATCCCAAATCCTATGCGC^ 
1 5 GTTTGTTGCCAGCTGACCCACCTTACAAAAAATACTAAAGGAAGTTC^^ 

ATCACATGAAAAAAACAGAGTGCCTCTTAAGGTAATTACGTAATTTCATAAGACAGTACGAATA 

TGGTGTGAAAAGTGATTGTATAAAATAATGTATATAAAATATAATGT^ 

TATTGCTAATAACAATACAAAGAAAGGGAATGAGAACAAAGCT 

ATCACAATGTATTCTTAC^TTTGTAACATTACTGTTTACAATGTAGAATGTAT 
2 0 GGATAGTGTTTTTGTGTGTCACTGGAGTCGAAATGTATATGGTACATACTAGAGCAATCACT 

AAACT CAGT AGAAAATAATT CACTT AATG CAAAGAAG TCAAT AAACAAAT AAATGAGGGG AAGACAT AAG AAAAAT AG AAAATG AA 

AAGTAAAGTGGAATAGAAATGAAAAGTAAAGTTCAATAATAACATTAAATC 

TGTCAGACTGAATTAAGAAAACATGACCCAACTTTATGCCGTCTAGAGGAGACA 

CAACGATGGGAAAAGACX3ACACACCAACAGCGACTTCAOTAAAGCTGGAGT 

2 5 AAGGTTTTATTAAAGAGGAATGTTTCGTAATGATAAAAGC^CTAATCrrGT 

AATAAGAGAGCTCCAAAATCTATGAAGCAAAAACTCACAGAATGAGGG^ 
CTCCACTTTCAATAATGGATACAACAACCAGGCAGATAAC^ 
CAGATATCTATAGCTAGCACACTCCACCCAACGACAGCAGAATACACATTCTTCT 
CCATGTTCTAGGCCATCAAACAAACTCAGGTGGTTTGAGG^ 

3 0 CAAGCAGAGAGTTGTCAAAGAGGCCCTC^GGACTGGGTGCAGTGGCTCATGACT 

AGGATCTTTTGAGCTCAGGAGTTC!AAGAAATGAGC!ACTTATCCACT 

GCTTAGGCGGGCGGATCAAGAGGTCAGAAGCTCAAGACCAGCCTGACCAAC^ 

TTAGCCGGGCGTGGTGGCGCACACCTGTAATCCCAGCTACTTC<3G 

TTGCAGTGAGTGGAGATCACACCATTGC^CCCCAGCCTGGGCAACAGAGCGAGACTCCGTCTCAAAAAA 

3 5 AGAAAGAAAAAGAAAAAAAAAGTGAGCATGTATTTTGCCAGAGTCTC^AGATTAGAATTAAA 

AGCTATTTACTTTTAAGTAAACAGCTGAGATTTTTTTTTTTAAGTCAGT^ 
GAAGGATTTCC<mTA3TTATGCACCTGTGTCAGCACCTTCTGACTTTCCTTCTAA^ 

TTGGGGTT CAGGGTTT CT ACAG CATG CTGTT ACTTGTGAAACAT CT CTTT AAC CATGTCCCAG AGTTGC CCAGG AGTTT AAG ACC A 
GCCTGAGCAACATAGCAAGACCTCATCTCAACAACAACAAAAATTAGAAAT 

4 0 CCAGCTACTCAGAAGGCTGAGGCATGAGGATCACTTG 

CAGCCTGCAAGACAGAGCAGAAAAAAAGAATCAGGATCCTGC^CAGAGGGAGGAGAGG 

TTGACTGAATGG CGTTGGGGAGAGATGACTCCAAAGTCCTGG AGTGGGTGAGAATG ACTG CG AGTGGCTTTTAGGTGGGGAGGTT C 

CTCCCTGGCCACTCCG^GGAGGGGACGTGGGGCTGAAGGGTATCAGGTGCCGTGCTGAG 

ACACACGTCTAGGGTAGGAAAGTTGACTGATCCATTGGTGATCTGAGTT^ 

4 5 GCTAAGAGTTTAGACAGGGAAACCTATGAAGCCCrTAGC 

AGCATGGTATAGAAACTCTGAACCCCAAATGTATGGGTCCTCAGGAACACCCCAGACITTAGA 

CGGG TGT ATAACT AAAGG GAAATCCTTCTTTCT CAG CTCCAACCATGG CTG TG AGG TT CATTCACACTGACTT AAAAAAAAAAAT C 

TCAGTTTACTTAAAAGTAAATAGCTTrrTCTATAATTCT 

CCATTTCTCCAGATGGTCTCAAGAGTCTCTGGACATCGTG^ 

5 0 GGATTCTG AG TTG AGTCAAAACCT CAAAG ACCCCTGATGGGAAAAG CT CTC AAGTG ACCACCGCTGTGGGC CAGAATGCAAAACTG 

CAG^AACAGAACATTCGCAGGAACAGAACACAGTCGTATTAAGTGATTTTCCC^^ 
AGGGGG AGGAAAAGGGTGGGCGCACCTGCCCCTGGCTGGCGCACCT 
CTGAGGATGGAAAGCCCCACGCGGGGQ^GGTGGC^CCCAC^ 

CCGGCCCGGCCCGCG CTGG AGACGCCCGCATCTGCCAGGATGGCGTCTCAT AGCCCTGGTGCTCACACATGA CCAGGAAG C CCC 

5 5 AGCAACAGTGACCGCCCAC^CTCTAGAAAATATTGGACGGC^TGGATGAACACCCAAGTGCGC^ 

CCAAGGGGCTTTTAAAACGGTAAGCTTCTAGGGGTGTCT^ 
GTTTTCTGTCCTAAGCTCCTTTTAACTTTGTTAG 

AT AAACAAACCTACGAGGAGGCAGGAGGAAGAGGAAAGGGG CGCAGGG CGCTCGGGG AGCAGAGCCGGGGGC CCGCGGTGGCCGCA 
GAGGCCGGGCCGGGGCGCAGAGGCCGGGCGAGCTGGCCGCGCTCTGGGCCGCCGCCTCCGGAACT 

6 0 ACCGTGGTCCCGGCyUlCGGCATTAAACAGAGGGAAACAGACCCGGGATTCCGTCACCCGGGCG 

GCAGACAGGAAAAGGGAGCTTTTCTGCATGGGGTGAAAAAATTATT^ 
GCGGGAGGAGGAGGAAGAGCCGGCCGCCCCCGCCCCGGCCCCGGCTCCTCAGG^ 

CTCGAGGACCGCGGCTGGGGTCGAGGGGCTCIAGTCrCCCACGTGACCGGCTGGGCGCGCCCCGCCAGACCCGGCCT 

6 5 TCCTGGCCCCAGGCCTGGGGACACrCGCGGCGGGAAGATTTGGAG<3GGA 

AGTCCCCCTCCCCCCCCGACCCCCGCCCACCGGCCTGGGCGTCCTCCCG^ 

NKNNNNNNKNNNNNNNKNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNTTATTGAAGGAGAGGGGCGGGCGCTCGGCGACTCGTCCCCGGGGCCCCXaCGC 

^CGGCAGGGAGC^GGGCG CGGG AG CCGC CGGG CCGGCGGGG AGG CX^GGGAGGTGTTCT CCA 

GCGGCCCTGCGTCAGAGCGAGACTCAGAGGCTCCGAACTCGCCGGCGGAGTCGCCGCGCCAGATCCCAGC^ 

CGGGGCGCGGGCAGGGCTCGGAGCCACCGCGCAG^TCCTAGGGCCGC^GGCCGGGCCCCGCCAC^ 

TTT CCTCCGGGGCGCGCX^CGCTGAGCCCCCCCCGAGGG^^^ 
75 CCTGTGCCTCCGCCCGCCGCTCCACTCCCCGCCGCCGCCX5CGCGGATGCCAAGCACCAGCT^ 
TGGCCCTGCGGCTGCGGTCTTCGGGAGAGGAGAAACTITGGGGCCCGCGCrcCGCGCCGGCGG
GTAAGCCCCGCGCGGCCTCCGCCCCa3GACCCCTGCGCCCCCCACGGCCCGGGCa305CC 
CGGAGGGGTCCGGGCCAGAGAACGAGGTCGGCCGGGTCTGTGCG 
GCTCCCAGCAGCCCX^TTCCGAGCGCGCCCT 
5 CGGGTCCCGCCGAGGJW^CGGCTCGCAGGCCGGCGATTTGGGGACGGGCGA^ 
GGGAGGAAGGGAGGCTTCCTTGGGAGTGAAGGATGCCCCGTGTCT^ 

GCCCGTCACGCCGGCTCCTGGACCTGCAGGACCAAAAGATC 

TCCTATAGGCAGGAGTTCCAGCOVGAGAAGATGTTTCAGGCAAGTTTGAAAAGGAACCG 
1 0 GAAAAAAAGATGGCACAGTGTGGGTCCCTGAAACCCGGTGTG^ 
CCCACCTTAGCCCGTCCCCG<3CACACACCCCTGGCG^ 

TGAG CAT ACG AGTTAGGGATT AGGGATTG CAATGCT AATGGAAATTGGAAGACG CTTTT C CAAATT CCTT ACCAG CT CTG AAATAT 
TTTATATAATTTATTTAAAGAGCGCCAGCACACCTGCCAACATATGTACrTCATAAGG 
CCTGCGCGACCCTGATAACTGCGGGX3CTTGCTTGAGAACCX3GGCT 
1 5 CACAGGGCCGTGAGCTCCACGGAGGGGGGTCCGTGTGGTCGGTGCCCTGACGC^ 
CAATGTT AGAAAATGTTGACATG TTT AAACCATGTC C CGT 
AATTTTITTTrGTTX3GAGTTTCGGTTTTAAAT 
TTTCCAAGTTCAGATCTCTGTACCTGAAGAGCCGGTGG 

ACGGGGCAGTGGCTAGTTTTCGTGTTTCCTACTGAAGAAAATAAGCT ' 
2 0 ACGCCCGACCGACATGGGTGTATTTTATGAGGGCTGCAGAACTGGGCCAGCCGGAA^ 

CGAGGCCATTCCTGGCCCATCTGAGGACCTGGGCGCCT^ 

GCGTCGTGGTTCCTGAAGGCCCCGGGCTTGCITCCT 

GTCGCTCCAGACACAATGCCCCACGGGCTGGGGCCAAACA^ 

2 5 AGCTCAGATTAGAAAATGGCAGGGCGCTGGCACCCCGCTTCAT 

TGGGAGGATGCGTCCAAGCTAAACATTTCCATGGAGACACCCAGTGTCTGTGGAACGG 
GAACCATTTGCAGGAAGGAAGCCCAGCACTGGGGCGGGGGAAGGGGGGGCTCCTAGGAGCAG 
TTCTCTGGGATGGAGACCAGTGCAGTCCTAGGACAGCAGCAGTGAGCCTCCATC 
GGCGGGACAGTTTAGGAAAAAGTTTTAGTCTAAACA 

3 0 TGGCTGGAGTTTTAAAAACAATGTACAGGCCGGGCGCCGCG^ 

ATCACGAGGTCAAGAGATCGAGACCATCCTGGTCAACATGGTGAAACCCCGTCTCTACT 

TGGCGGGTGCCTGTAGTCCCAGCTACTCTGGAGGCTGAGGCAGGAGAATCGCTTGAACT 

GATTGCACCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAACA 

ACGTTGTGGCTTCTCAACAGAAAACAGAACTTCCACTTGCCCA^ 
3 5 CATCCATCCCAAAGTCAGAAGTATTATTAGAAACTAAATCAGGCCACACGGAACGATGGCTTATCA 

AAAGGGTCGCGGGAGAAGCCCGGGGACGACCCTGKSGACTGGAGGGTTTCT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNIWNTOJNNNNNNNNNN^^ 

NNNTOJNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNKNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NlftJNNNNNNNNWNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNKNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNIWNNNNNNNNNNNI^ 

NNNNNNNNNNNNNNNlWNNNNNNNWNr^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

HNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN*^^ 
55 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNIWNNNNNNNNNNNNNNNNNI^^ 

NNNlWlTOmiNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNinWNNNNNNNNNNNNNNNN^ 
g0 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNI^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNIW^ 

NNNNIWNNNNNNNNNNNN^^ 
65 NNHNNNNNNNNIWNNNNNNN^ 

NJJNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNIWNNNNNNNNNNNNNNNNN^ 

NNNNNNNJn^NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNmiNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNinWNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNW 
75 nnnnnnNNNNNNNNNNNNNNNNTO 
NNNNNNNNNNNNNNNNNNNNNNNNNN^
NNNNNNNNNNNNNNNNNNNNNNNNNNNmWNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
mWNNmiNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNIWNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNKNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

10 NNNNNNNNNN>WNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNbWNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

bnWNNNNNNNNNNNNNNNNNNNW 

HNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNITONNNNNNNI^ 

NNNNNNNIWNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNN1WNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNKWNNNNNNNNNNNNNNfNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNIWNNNITONNNNNNNNNNN^ 
25 NjqNNNNNNNNNNNNNNNlWNNNNNNNNNN^ 

NNNNIWNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNHNNNNIWNNNNNNNNNNNI^^ 
30 nnnnNNNNNNNNNNNNNNNNNNNNNI^^ 

NNIWNNNNNNNNNNlWNNNNNNNNNNm 

NNNNNTOINNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNltt^ 
35 NNinJNNNNNNNNNNNNNNNNNNNN^^ 

NNNIWNIWNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNIWNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNKNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNOTINNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNIW^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
45 NNNNNNNNNNNNNNNNNNNNTO^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNKNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
50 NNNNNIWHNNNNNNNNNNNNNNNNNNI^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNAGCCTGGGGGCGGGGGGCAC^GT 

GCTGGGGCAGATCCTGGGGATGGTGTGTGTGTCCTGACCAGAGGCC^ 

GCTCCACCCCAGTCCCTGCCTGTOXrTCTGGGGAAAGAA 

AGCCTCCCAAGTAGCTGGGACTACAGK3CGTGCGCCACCATGCCTGGCTACTTGTATTTTTT 
CCCAGATTGGTCTTGAACTTCTGACCTCAGATGATCCGCCCACCTCGGCCTCCCAAAGTGCTGGGATTATA 

CTTTACGATGGGGAAGGCGGGACTGGGGCCTGGCTAGCACCATCCCCAAGG 

6 0 AGACATCCTGAGCAGGTCTCACCTCCCACCCCTGAGAGCTTCTGGTCGCACCT 

TTCTCTTTGTTATGGCTCTGCCCCCTGAGTGTCrGG 

TCTTACCAGGCTCCCGTGCCGTGTTTCC(^CCTACCTGGGGCGCTAGAACCATTT 

ATTTTGAGACCCTAGATTTTTTTGGCATTTATCCGTGTTGGTTTCTGAACCACGACTTAG 

AAGTATCTTCCAGCCAGTGCAGTTGCACCAGGAGATGAATATAGAATTTCCATGTGCTTGTTTGTATT 
6 5 ACCCACCTCTAGGCATGTAATCATGAGGTTCCAAGATGTCATAAAAAGTCTTTTCAAATCAGAAATCCAGAGGCACG 

TTAAAATTACTGATTTCCCTAATAATGATTTTAAAAAATATTTTATTGAGTGTAATTAA 

GTACCAGAATCTTCTGTTTTGTTCTAAAAGATGGATTCGTAGGATATGAGCCAGTATC^ 

CTGTGTTTTCATCCAATCCTTGAGGAGTTTTCTTTGTAGCTGGTGATTAGGGCACCTTGACA 

TACATATGTAGGTCCTTAGATCTAAAACACATCCACCATCCATCATCATCCATCCATCATCAAACCAl^ 
70 GCATCCATGCATCATCATCTGTCCATT ATT CCTCCATTCCCTATCCATTCATTCATCATCCATCCATCCATCATCCATC CATCCAT 

CATCAAACCATTATTCCTCCATTCCCTATCCATCCATCCATCATCCATCCATCAT 

ATTCATTCATCATCC^TCCATCCATCATCATCCATCCATCATCAAACCATTATTCCT 

TCCATCATCCATCCATTATTCCTCCATTCCCTATCCATCCATCTATCATCCATCCATCATTAT^ 

TCCCTATCCATTCATTCATCCATCCATCCATCCATCATCATTCATCCATCCATCCATCAT 
75 CCATCCATCCATCCATCACC^TCCATTTATCTAT CCATCCATTCATCCACCATTTACCT ATCATG CCTCCATCCCCAATGCATCAT 
TCATCCATCCLATCCGTTTAGCTGTCCATCCATCCATCATCCATCC^

CCTCCCTTGTTTGTCCATCACCATCCATCTACCTACTCTCCATCTACCCATCATCCCTTC^ 
TCAGTTTCTTTACCTGTCTATCCATCATCCATTGTCCACCCAC 
TGCCAGGCACAGTCTTTGGCTCTGAAAACAGCGGTGAGCAGAATAGACTAAGGCGTC 
5 TTCTGTTTGTCAAAGAGATGACAGTGTGGGATGAATGAGTAAAAACAGGC^ 
ACTAGAGAAGGATGGCTCACGTGTTGAGTTTGGGGTTTAAATGAAGAT^ 
AACCCGTGAGGACCCAGTCXSCCTCTCTCrcGGAAGGA 
GCGGAACCCGTGAGGACCCGGCCXSCCTCTCCCTGGGAAGGACGTTCGGGG^ 
AGTGAGGGGCTCACGGGAGCCTTGTTGGCCAGGTGGGACTGGGG 

TTTGTGATGCCCAGCCCCTCATGCTCCCATCTGClTCl'rri iCTCTCTAGAACACTATGGCrATGCATCCTCCAACGTCAGCCCCG 
CCCTGCCGCTCCCCACGGCGCACTCCACCCTGCCGGCCCCGTC 

GATCACCCCTCGGGGTACGGAGCAGCTTTGGACX5GTGGGCCCGCGGGCTACTTCCTCTC 

CCCTGCC CTGGAGAGTCCTCGCATCGAGAT AACCTCG TG CTTGGG CCTGT AC CAC AACAAT AAC CAGTTTTTC CACGATGTGG AGG 
1 5 TGGAAGACGTCCTCCCTAGCTCCAAACGGTCCCCCTCCACGG^ 
TGCCTGAGGCCGGCCAGCAGCCTGTCCTCCCGGAGCTGGAACT 
GTCCCCCCAGACGTCGCCATGGCAGTCTCCCTGCGTGTCTCCCAAGACCACGGAC^ 
CCTGCACACTGCTGGGTTCCCCGCGGCACTCCCCCTCCACCTCGCCC^ 

TCCTCCAGACCCGCGTCCCCGTGCAACJUVGAGGAAGTACJIGCCTCAACGGCCGGCAGCCGCCCTACT 
2 0 GCCGTCCCCGCACGGCTCCCCGCXSG{3TCAG 

TGGCCGCCATCAACGCGCTGACCACCGACAGCAGCCTGGACCT 

CAGCCGCCCTCAGTGGCGCTCAAGGTGGAGCCCGTCGGGGAGGACCTGGGCAGCCCCCCGCCCCCGGCCGA 

CTACTCCTCTTTCC^CACATCAGGAAGGGCGGCTTCTGCGACCAGTACCTGGCG^ 

CCAAGCCCCTGTCCCCTACGTCCTACATGAGGTGAGCCGGCAGCGCGGGGCGGGA 

2 5 GGAACGCGGGGAGCGGGACGGGGGGCGGCGCGGGGCGGCCGTGTCTG^ 

CGTGGAGCTGGGAGGCAGAGTCTGTCCCCATGTCTGCCTCCT 

GCGTITCCCATCTGAAGTCTCCAGATCCrCTCGGGCCCTG CTGATGG CCTCGTGTTCGCTC^GTCTCTGCAAACACCCATTTCAAA 

GAGGGTGGCCCTGCTGGCGGTCAGGGCTTCCACGTGGGAATTCCGGGAGACGCCATCOT 

GTTTCTTCCTCGGGGAATTGCACCTGCCTGGCCCCCCT 

3 0 CCTG CCTX!GGGTGCAGATGGG ACCGCAGCTCAACAGTGCCCCAG AGCACC CACGGCCTCCTGC 

TCCCO^GACGCOjGCTGTGGCATCAGGCTGGGGGAGCTCACGCGCCTTCCTT 
AACCGTGTGAGGGGCTG CTGGAATT CCGCTGGGTCCACTTAACCAGCTTGGTC 
GTGGGCTGTGAGTGGCCCCTGAGAGTGCTTTCTGCTCTGGGATTm 

AGTGCCCCATGGCCCGTGACGTCAGGGCTGCCGCTGTGTCCTCAGGAAGAGGCGTTTGCCA.CGCGCAGCCC 

3 5 GGGTGTTTTCCTGTTTTTGTTTTTTTTTTTTT^ 

GACCCCGCAGAGCXK3AGGGCAGGTGCJU3CCCCCATGCCTGCAGGGTGAGGAAGGCAGGCAGTG^ 

TCTGCTCTGCAGTCTCAGGCCATGCTTCAGCCA 

GGCTGGGAACGAACCCACCCAAACCrCTCTGAAGGCTO 

GGCTGATTCTGCCTTTCTAC^GGCGTGATACTTAGAATCGTCGTCAGCTGAGGTTTCGAGAGAAA 

4 0 TCTTGACTTGCAGTGGGCGCATTTACCTTTCTATCCCAAC^ 

TGATCACAACGAGCCTGTGTCAGGACCCGGATTGTATTTACAAT^ 

TGTTTTTGCCTGCTGTACTATCACTGTTATTGTTTAAGTAAAAGTACAAGCGTAGTGGCA 
CAACGCGCTTCATGGTCTCCAGGCTTACGGACX3GTGAC 

AGAGTGATGTCATCAAGGTCCCAAGACTGGGACTTGGCATGAGGAGTGTCGTA^ 

4 5 ATTTGTTTCCTGGGGCTGCCCTGAGAAGGTCCCGCAGACGGGTGGC 

GAAGCCGAGATCAGGGGCTGTTCTTTTGGAGGCTGTGAGGGAGCATCTGCCTGACCTGTCT 
CCCCATGACTCCTCATCTCTCCATCCACCGTGCATGTCTGTGTCCAGAT^ 
GGCCACCCTGGTGGCCTCATTTGAACTTGATCGCTCTGTAAAGACCCTCT 
ATGTGAGTTTGCGGGGAGCACCGTTCGGCTGCCTCA^ 

5 0 AAGCTCGTCCTGGTTCCTTCTTCGTCTCCTC 

CACCTTGGACCTGTGTTTGGTGTGGAGGTTGGAAAGCATGGGTGGC 
CCGTGCCCACGTGACCCTGTTAGCTTTATCTCCTGGCTGCAGAGCGCCCAGTTAG^ 
CACAAACAAAGCTC^GTAAATATGTCCCAAACTCGCTCCTGGGGTTTGCAGTTAACCTTT^ 
ATATGTTTCCTGGTTCCTTGTGCCCTGTTTCCAAGTGACCT^ 

5 5 AATGTTAGTT r TTT ACCAGCta AGGTGG AGAGGCCAGACTTCCTTCC CCAGG AATGC CTCTCCGTCGGTCACATGCAGCAGCTCTGG 

CAGGCXrTGGAAAATGTCACCAGGATCTCAGGCTGCACJVACTAGAAAGCTGTATTTTTAATGATCTC 
GCACGACATGCCCATCTTCTCCCCACCGTGGTACAGAACTGCTGCTCCATTAATCCTGTCCCT 
GAAAGTGTGCTGTGTGCATCCTTCCCAGGGACCGTCTAGGATTGTTGTC 
CCCCAGTGCTATGCAGAGCCTCTGAGAGGCCCACAATGX3TCAAAGAACG 

6 0 TTTGTGAAGTTAGTAAGAGTAGGTGATAGAACCTTGATAGTTTTAGAGTTTGGAGCC^ 

ATTGTCAAAGCTTGGGGGGACTGGCACAGGAATCACAGGCCACCTGCTATACCCCCTA 

GGCATCAGAAAGGAGTGGT rrr CTTCCCGCCTTGC'lTCT 

CTTTCCGTGTGCTGTTGAGGGTGCAACAGGAGGCGTTGCCGG^ 

GTGCGGGTTATTTATAAATAGTGGTTCTCAAATGGTAGCTTTTAAAGATTTCATATTGA 

6 5 GCTTCCTCATTCCTTCATCGTTCGGCCTCCTC 

GACATCTGCTCTTTGGGCATCAGCCTCTGGAAAGTGTCCCTGTCAGAGTCTCCACGTC 
GGAGTTGCCTTTGTAGAGGCAAGGTTGTAGGGGAATTGGAAAGG 
AATTCTTGAATTCAATTTCTTTTTATTITATTTATTTATTTTTT^ 
GGCACAATCTCGGCTCACTGCAAGCTTTGCCrCCTGGGTTCACGCCA 

7 0 CGCCCACCACCGCGCCCGGCTATTTTTTTGTATTTTTA 

ACCTCGTGATCCACCCGCCTCGG CCTCCCAAAGTGCTGGG ATTACAGGCATGAGCCACCG CX5CCCAGCCCTTGAATTCAATTTCTA 

CACATGAAACATTGCCTCTTAGAACTGAATGTrrAAAAAGAATTTCT 

ATAGGCTAATAAACAATCTTTACTTAGAGTCCGCTTGGTrT 

AAATCTCTTGGCCAGGCGCAGTGGCTCACGCCrGTAATCCCAGCACTTTGGGAGGCC^ 
7 5 TTCAAGACCAGCCTGGCCAAATTGGCGAAACCCCATCT 
TCCCAGCTACTCAGAAGGCTGA£X»CAGGCGAATCATTT^

TCCAGCCTGGGCGAGAGGAGCGAGACTCCGTCTCCAAACAAACAAAAAAACCCTC 

T ATTGAAGATTTTAG CAT ATCGGTCACTTTTCTG CTATG CAT AGT ATT AGTTT CAAAAGGT CTG AG AAAACG TTC ACTCTCTTG TG 
TTTCATAAGTTATCTGAAAGTTTCACAGTTTATCACAATT^ 
5 TCTTCTCTTTCGCAGACACGAGTCCTCTGCCAG^ 

TACCTGAGGGGTTCGGGCCCCGGCAGTGTTGGGGGAACAGTGCA(^ 
GGCTCTGGAGCCAGAGTGTCCGCGGGCGCTGGTGGGCCCTTTGG 

AGCATAGCGAGACGTGTAAGGTGGTAGCCGTGGGGTCTTCCCCGAGCTTCCCGCTGCCA 
GAAGCCGAGGGTGTCGAGTTGCCCCATGGTTTGTAAAACA(TTGGCATGGGCTC 

GCTAACCCTTGCTGGGGAGTCCCCCGCACAACCTGTCCTGCTC 
TGGCTCCCGGCATGGGTGTCAGGCCCTGGGGAGTCTCCAGTCCCT 
GAACGTGGCTGTGTGCCTCCCTGGC7VTTCX5CGGTGGTGGCT 
AGCCAGACGGTCCCCAGGAAGCCCTCAGCTGTGGTGGGAGATGGGAGA 
1 5 CTGTGGCGGGAGATGGGCrGTGGC^GGAGATGG^CTCTGGTGGGAGATGGGAGA^ 
GGTGGGAGATGGGCTGTGGTGGGAGATGGGAGATGGGCTCTGGTGGGAGAT 
NNNNIOJNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 
20 NNNttJNNNNNNNNNNNNN^ 

NNKKNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

hWNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNlTONNNNNNNin^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNW^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNKNNNNNNNNNNNNNNNN^ CTGTG GTGGGAGATGGGAGATGGG 

CTATACCAGGAGATGGGCTGTGG CGGGAG ATGGG CTATGG CTGGG CCTCTCCTCTGTGGTGCTGTGGATGCGTCCTATTCAGGAGC 
3 0 TGCCTACCAAGTTTCATTGCGATGTTGGAATCTGTCCTTG^ 

TTCCTCCCCTTCCTGTGGGTCTGCAGGGCTCXSGGGTGTTGATGGAGACT^ 
GGG^GAAGAATCCGCGTGCTGCCCCACCATGCCTTGCGACGCTCTGTGGTA 
CC^CCCGACTGTGGAOSCAGCTGAGGCCATTTGTCC^ 
ACCCCGGAACCTACACTCC^GACTCCACCGGTGCTGGGACCCCC^CT^ 

3 5 CCTGTACGGOTTGGGGGCACGTGTCACGGGGACCCTCX3GCCCAGACT 

GGTCAGAGCCCAGTCGAGGGGCTTTACGCAGCCTGTGGTGTCCTTGGAGCCA.CAGCT 
TCCGCTGGTGAGGGGCTCAGAGTTGAGAAGATCATGCCTCCTGAT^^ 
GCCGCTTCGGTAGCATTTGGCAGCCCACGGTTTTCTGCCTG^ 
TTGGAGGGCGGGGTTTGCAACACCAGTTTCTGATGGGAAGGACT 

4 0 CGTTCTCCCAGGCGTGTGTGGAAGGGCTGGGGCATCTTCTCCCTGGT 

TGAGGAGGAGAAGGCACTCTGTTCCTGGAAAGGAGAAAACAACGTGGCTTTTTGCT 

T CTGTTGTCCAGGCTGGAGTGC AGGGTGTCATCACAGCTTACTGCAG CCTCGACCTCCTGGGCTCAGGTGACCCTCCCACCTCAGC 

CTGCCGAGTAGCTCGGACCACAGACGTGCGGCACCACACCTGGCTAATTGTTTAAGT 

CCAGGGCTGATCTGGAACTCCrcGCCTGAAGCAGTCCTCCCGACCTGTCCTCTCA 

4 5 TTTACAGTCAGATCATGGAGATCTGAAATGTTGCCCCATCTC 

CTGCCCGGGAGCCCCCT^GGAGGGTCAGAACCTGCCCGGGAGCCCCCCTAGGAG 
GGAGGGCCGGCACCTGCCTGCCCGGGAGCCCCCCTAGGAGGGCCGGCACCTC^ 

TGCCTGGTTAGCTCAGATG CCCCCGTGG AG ACAGCGTGAGGGCGGCAGGGAG CTGGAAACCTTCCCAGGTTCCCAATTTTGAGATT 
TCCTAAGCTGCCTGTGGTCCJ^CTGGGAGGAAGTGTCTGGTCCrGATGTGCGTCGGTGACCACAGA 

5 0 CGCCAACGTTGGCTTTTGAGCAGCGGGATAGGTGGGCCTGGTGGTAGGTGCTT 

CGGGATTAAACCTGAGTCGGGAGTTGGAGGGCAAC^GTTATTC^^ 
TTCTTGAGCAAAATGTTTTCTACATTATGATGGGATr r rCTr 

CACAGGGCTTAGTCGGCTTTGGGGAGAGACAGTAGTTGTAAAGGCCACACCGGACGCCGTC 
AGACCATGTGGCCTGCATGCTGCGTTCCATCAGCACTGG^ 

5 5 AGGGGGACGTGTTTCCAGGCGACGTC<3CTGCAGAGAGGAAGAGG 

GGACACGTTTCCAGGCGACGTCCCTGCAGAGAGGAAGAGGGGGACGCGTTTCCAGGC^ 

ACGTTTCCAGGTGACGTCGCTGCAGAGAGGAAGG AGGGTCG CGTTTCCAGGCG AGGTCGCTGCAG AGGGG AAGAGGGGGACG CCTT 

TCC^GGTGACTTOSCTGCAGAGAGGAAGAGGGGGACGCCTTTCCAGGTGACTTCGCrG 

GGCGAGGTCACTGCAAAGAGGAAGAGGGGGATGCGTTTCCAGGTGACATCGCTACAGACAGGAA 

6 0 ACGTCGCTG CAG AGGGGAAGAGGTGG ATGCGTTTCCAGGCAAGGTCGCTG CAG AG AGGAAGAGGGGG TCGCATTTCCAGGCAGCAT 

GGCTGCAGAGAGGAAGAGGAGGACGTGTTTCCAGGCGACGTTGCTGCAGAGAGGAAGAGGAGGAC 
GCAGAGGGCAAGAGCX3GAACGCGTITCCAGGTGACGTCGCTGCAGAGGGGAAGAGG 
AGAGGAAGAGGAAGATGCTTTTCCAOTCGACGTTGCTGCAGAGAGGAAGAGGGTGGGAC 
GGGAAGAGGGAGGCCCCTCCAGCAGGGCCACCTAGATCTTTGCITT 

6 5 CTGCAGCITGATTCTGCrCCTCAGGGCCAGAGGTTCTAGT^ 

CTTCTACGCTGGAAGCCGTGATGTGTGTCAGGAACGTGTTCTGATGTTGGTGTGCAAAAT 

CCCCCATCACGGCAGTTCTGAGGCTTGTGGGGCACCCCAGCTCCTCTCCCT 

CCTGACACCAAGATGACATGTGGC^CAGACCTGGAATC^^ 

7 0 GGCAAGGGCCACCCC^GGGGTCTGGGAGTGCAGTGTCGGGCCTGTGGAGGTGGAGAGCAGGG^ 

AAATGGAGACACGTGGAAACTTCTGGTAAGAAGCTCTCAAGCCTCC^ 
ACrTTGGGCTCTGAGGTCCACGTGGCCCAGC^ 
G&CAGAGACCAGGAGAGGGCTGTGAAGCAACTCTGTGTTTACT 
TGTCCTGTCCCCGTGGCGACATCACAGCTGAGCGCGAAGCTGGAAGCTTGGG(^ 
7 5 GGCCAAGGGTGCTCTGTAGCTCCAGGCGTCAGAGTCTGAGCATCCTTn^ 
ACACATCCTTATTTTGAAAGAGCCCTTTAGGTGATTT

TAGGCTGCTCTTCAGTGAAOVTCCAGAATGCCGTCAGTTCTTGTGAAACCCCCAAG 

GTGAACTCTGAGAGCTGAGCCGGGCCGTCITGGCACCACTAGAAACGGAA^ 

AACTTGTGCAATATGACATTTCrGGTTTTGGTTT^ 
5 AACTTCCTGTGAGTAATAATGACAGATGGGCCCTGACCCTC 

GCAGTGACTGGAACCTGTCCCTGGTGTGAACTTGGGGATGAGTGGGTGA 

CGGACCACACItnTGCTCTCCAGCCACGAGAAGATGCAGCGGC 

GGAGAGGGAGGGTGTACGGCATGGGGCCTGCTGCCTGGGGC^CTGGGATCCCACAG 

TGAGATCCCACAGGGAGGCCCGGTTCCCACTGAGCTTGGGTTCCCAAC^ 
1 0 GAGCTGCAAGGAATGTTTCAGAGCCATGGAAGGTGCTTTCT^ 

CTTTGTAGAAAGCGTGCTCTGGCCCACACGTTAACATTTGGGAGGAGGCCT 

CTGCCTCGGGGGCGGGGGTTCAGGGTCTTGAGCTCAGCTTCTCACCCATCCG^ 

TGTTGGGGGCTGCTGGGCCTGGGATGGOVGGGCCTGTGTCCAAAGCTG 

GTTGGGGCGCGTACAGAGGGGATGACTTCGATTTTTCCAAAGAACAGGGAAGGGTCT 
1 5 GTGGAGCCACGGGACGCCATCAGTTTTTATGAAATCCCCAAATGTGTCCTTTAAAACAAAACCT 

AGATCCCGCGCGCTCCCGGTCTGTGGCITTGACTTTGGTTTGAGTTCCTGGG 

GCACTCCCTGTCTGGGGACGGAGGCTGCCCATGCACCCGGCTGTCCACCTTCTTCCCAAAGG 

CGGACCCGGGATCCACTTGCCCAGCCTTGGGGAGGA 

GGTCCTGGGAACTGTGGGTAGGCCATGGGCCGCTGGTGCCTGCACCACTGTCCTGGTGGA^ 
2 0 ACAAGAGGCCAAGGCACAGATGGCTGCAGGGGCTGCCCCTGC^TCCTGAGCCTCACC 

CCTCCTCTGGGCCGCCTTCCCTCCCATGGGCACAGAGCTTGGGTCCCACGTGGAGAGGAGGCCCT 

CGG CCTTGGCTTCTUKjGAGGGTTGG ACGGGAG CCAGCCCATTGGACAGTAGCCCGGGGCCTCTGGGTGACCCTCCAGGGTGGGACA 
GCAGTGGTCCCAGGCCCCCCTCIGACTCCTGAAATCCTAAGGTTGCCCGGGCCGCC^ 

2 5 GCAGGATTTTGTGAAAGGCCTTGTCTTGGGTCGTGGAAGAAAACCACGTGGT 

GTGAAGGCTGCGTCC^CGGCCAGGCCCAGCCAGAGGTCGGGAATCATGTCGGGGGTGACGCACAGATGCCGCCG 
AGAAACGCGGCGTGTGCGGGCCCACCTGGCCTCTTTAGGCCGGGCTCCGGGCTGCCCTCCAGG 
CAAAACTTTCTCAGGGCCGAGTTCTGCCGGCACAGGTCRGGGGGCAAAGCTGGC 
TGGTCTTGGGGCAGCTCCAGACACACAAGCCCAGAAGACCCAGAGGCTGCCTCACCT 
30 TCTTTGTCTCTCCATCTCTGTCTCTCTCCGTCTCTGTCTGTCTCTCCATCTCTGTCTC 
TCTGTCTCTGTCTCTCCATCTCTITCTCTGTCTCTCTCTGTCTCTCCGTCT 

TCTGTCTGTGTCTGTCCCTGTCTCTGTTTCTCTGTCTCTCT 
TCrGTCTCTCTCCATCTCTGTCTCTGTTTCTCTGTCTCTCTCTGTCT 

3 5 TCTCTGTCGCTCTGTCTCTCTGTGTCTCTCTCTGTCTCTGCCTCTCrGTG^ 

TCTATCTCTGTGTCTGTCTCTCTCTCTGTOTCCATCTCTCTCCATCTCTCTCT 
TCTCTCCATCTCTGTCrCTCCATCTCTCTGT 

TCTCTCCATCTGTCTCTCTGTCTCTGTCTCTCTGTTTCTCTCCCTGTCT 
TTTCTCTCCCI^TCTCTGTCTCTCTCTCTGTCT 

4 0 ACCAAATGGCCATGAGGCCCGCGGCTGCCGGCTGAGGAGGGCTGCGGGAGGACCCTGAAACCCT 

TGGAGTCCTGTGGGCGGGTGTGGCCCTGCCGAGTTCAGAGTCTGTCGCTTCGTCTGTCTGGAAACTC 

TCACGGTTTTC^CCTAAGCAATGAGCACGTGCGGGCTTTAACCCT 

CAAAGGAACTGCCTCTGCTCTGGGCTTCTGCTGCCGAGGG 

CATGAGGAGAGTGTTTTCAGATGCTGGGG ACGTGGCCTTTCTAAGGGAGAG^ CTCAG CTCTTTGGAA 

4 5 ACTGAAGCCCATTCACCAGGTGGGCGGGGGAGGAGCTGGGCAGAGGTTGAGACACAAGACCTGAG 

TTTTTGGTTGTACAAATAACAAACACCGTTCACTGTAGAAGACTTTTAAAC^ 

GTGGCAGCTCACACCTGGAATCCCAG CGCTTTGGGAGGCCGAGGCACCAGGATCGCTGGAG CC CAGG AGTTTGAGACCGCCCTGGG 
CAACATGGAG AG ACCC CAT CT CT ACACAAAAT AT AACAATT AG CC AGGTGTGGTGGTGCGC CTGTGGTC CCAGCTG CT CGGGAGG C 
TGAGGTGGGAGGATCCCGTGAGGCCACTGCACTCTAGCCTGGGTGACAGAGCGAGACCCTGTCTCAGAA 

5 0 CTGCGATTGAGAATCAGACAGTGATCATTTCAGGATATGTCTTTT1TTTTTTTTCCTCCCCCT 

CTGTCGCGAGCITTCGGCTGAAAAAAAATCCCTGACGCCCT^ 

CCGTGGAAAGGACTGTGTGGCCGGCTCCCCAGGCCACGGGGGCCACTGTCCTCCATGCCACGTC 

TGAGGACAGCAGGACCTGAGCCACGAGCCACACTGCTGTGGGACCCX3 

CCTGTGGGCTCACACTCGGCAGTTCTGGGCCCCCTGG 

GTGTGGAGGTTGCATCTGGCTCAGTGAGAAGCATTTTT^ 

CAATGCCCC^CCCGGCCCCCGGGGCTGCCCCATAGCAGCCACCATGCCCTGGCCCCAGGCCCCAGGAGGAGGATCCTGC 
GCCTTGCATTTTG CCT AGGACCTTCTT AACTCTCCAG ACTCATCAGGGAAGCTCT CAGG AGCTCAGGATAAACAGGTCCTTGGCT 
TGAAG CCTGCCGGCATGGGTTGGGAATGGTGGGGGCTCCATGTCTGCAGC AGCCA CT 

6 0 CTGGGTGCGGCTGTGGCCATTGAGGAGGGAGCATGTCCCGGGT^ 

CCCTTCGTGTACCC^CGTGTCCGATGTCTCTCTGCCATGCCGTGACGTTGTCCTTCACGTAAAGTT^ 
GTCTGTGAAATCACAAGGTTTTACTTTATGGACTGTGTGGAATGGACCCAGGGAAG 

TGTGCAACAGTGAACTTCCCTGGACAG CTGGCCTTGGTGTGG CAGTGGGTGCTGT ACGGCTGGCCTCTGTGCAGTGGGTTGGGGGG 
AGCTGGATGGCTGGCCTCTGTGCGGTGGGTGGGGGGGAGCTGGACGGCTGGCCrCTGTGCGGT 

6 5 TCTGTGCGGTGGGTCGGGGGAAGCTGGACGGCTGGCCTCTGTGCAGTGGGTGGGGGGAGCTGGACGGCTC^ 

GGCCTtrTGTGTGGTGTGGGGGCTGGATGGCTGGCCrCTGTGCAGT^ 

GTGGGTGGGGGGGTGCTC<3ACGGCTGGTCTTTGTGCAGTGA 

ATGGCTGGCCTCTGTGGGGTGGGTGTAGGGG^AGCTGGACGGCTGGCCTCTGTG 

7 0 TGGCCTCTGTGTGGTGGCAGGGCTGGACAGCTGGCCTCTGTGCAGTGGGTGG 

CGGGGAGGCTGGATGACrGGCCTCTGTGCAGTGGXjGGCAGGCGCTGGATGA 
GCTGGCCTCTGTGCGGTGGGTTGGGGGGGCCTGGAOSTCTGGCTrC^ 
GGAAGGTCAGCCAGGCCCTGTGGTGTCTTTAGTGAGAAACAGGAAA 
TTAGAAGT GTT IM 1 11 TTATTATTTGTATATAATGTGTTGTTTTCAGTCrTGCACTG^ 
75 ACAG CAGG CCAGAAAACTG CCTTH AT AT ATP AC ATTGG CAAAGACCAAGCG CACCCCGG AC CCT AGTGGGTGAG CCGG ACCCATTG 
GCAGTGGCCTTTGTGGTGCCTCCAGGGAGCCTCACGATGCTO\CGCCT
GTGGTCTGGGAGTTGGTTGTGTGGATGTTTGCGTTGCCAGCGTTTAC^ 
GAGGATGGTCCTTGTGATAAATGGGATGCTGTGATTAGATTGTTTCTGAATAAG 
AAAAAGGCCAGGTATGGTGATACGTGCCTGTAACCCCAGCACTTTGGGAGGCrGAGGTGGGA 
5 AGAATGCAGTGAG CTGTGATCACACCCTCTAGCCTAGGCAG CAGAGCGAGACCTTGTTCT AAGAAATAAAAAGAGAAAAAGAAAAG 

ACAAAGTACCAAATTATGTGTCTCATTTTCTTAAGATGTATCATTTTC 
CAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAAGCGGGO 
AACATGGTGAAACTGTGTCTCTACTAAAAATACAAAAAATTAGCCG^ 

CTGAGGCAGGAGCATTGCTTGAACCTOGGAGGCGCAGGCTGC AGTGAG CTCCACTG CACTCCAGCCTGGGCCACAGA 

1 0 GCGGGACTCCGTCTCAAAAAAAAAAAAAAAAAAAATGGAGAGTiGAGAGAAACA 

TCACCGTCCAGACCGGGTGGACGTTCGTCGGTCACCATCCAGACCAGGTGGATGTTGGATGC^ 
TA TTrrTT CAGTTGTGTTGAGTCTCA CTTTTT^ 

CCTG AGCTTAGGG CTTCAGCAAG CGAGTTGCCGTTCCGGGGGATGGGCTGGT CAG AGACCCCCGACTGAAGCACGCCTGCCCG AGG 

CTGCCCCAGGCGTCGTCCCCCAGAAGGAAGCACGCGGAGTGGAGTGGGGTGGAGAGCAGGC^ 
1 5 TGCAGGCTCCTTTCCCGCTCATTTGGGCTCTTTTGTCTGAACCAG^ 

TCTAACCAAAACCAATCATCTTAATTGTCTGGATTATGATATTTATTTTGTA 

TCACTAAATCTGCAAATGATTCCAGAGCCAAATGCAGTCCTGCACCGCGG 

TCGTGAGATCGTGAGAGCGTAGTTTTACTGCATCTTTTCCATGTTTAAATCT 
' GTCACCCACX3GTGTTCAGTGCAGCCACATGCCGTGCAGACGTGTACT 
2 0 TGGACACACCGAGTCGGTTCACACGAGTTCCCACGACGACAAAATG^ 

GTGACACGTGGCTGCGTTTACACAAACTTATTCTTATGTAATAGTCAGCT 

TGT CTG CCTTT AACCAG ACC AG CGGTGGGGGGAGGCGG CTTTT CTTTTCTT CTTT C CT CCTCTCT CCT CTCTCT CG CTGT CT CTTC 

TCTGTCTCTTAACITTCCTGGGAATCCAGTGAGCCCAGTTTTTTCACTGTG 

TCTCACTCTGTTGCCCAGGCTGGATTGCAGTGGTGCAATTTCGGCTCACTGC^ 

2 5 CTTCACCCTCCCAAGTGGCTAGGATTACAGGCACTCGCCACCATGCCCA 

CCTGTTGGCCAGGATGGTCTTGGTCTCCTGACCTCAGX3TGATCTGTCTGCCT 

CCACCACGC CCGGCCATGAATTTTGTTTTTCATTTGTT^ CAATTCAT AGCAAAAGAT AGGTG CAAGGTGCCTG AGACGGG 

caccgcctcgcatcatctgcgactttgtttggtgcctgcctctccag 
agccctgcagckk:cccctctcccttccccagaaactcacagctggcc^ 

3 0 tgtccttcctgtctggaggcctgggtccagccgcctttc 

ctc^cgtgggaaacagctctgacagtacagccc^tgaagctgcgtcttgg 

ggggtccagcaggcacccgcagggcccaaaacccc^cagagcggaggtgggtggtga^ 

atttctgagttttcaaattactatgaagcttccaatgtcacaaaatatgtagtcttgttct 

t attttt aaaaatt acgt agttg att c c att caag ctgcct c ct ggg cttttt cct cttt aaacg agt ctt ct attctgc ccat cg 

3 5 TTTTTAGTTTGTTTGTTTTGAGACAGAGTCTTC 

TCCCCTGGCTCAGGCGATCCTCCCCGCTCAGCCTCCT 
GATGTGCACCACCACGTCCTGCTCATGTTTTTATTTTTTGTTTCG 
CTCCTGAGCTCAAGTGATCTGTTCCCCTTGGCCrCCCAAA^ 
TTGTTGTTTGTTGTTTTTTTTTAGATGGAGTCTCGCTCT 

4 0 TCCTCCTCTTCGGTATAAGCAATTCTCCTCCCTCAGCCTCTCGAGTAGCnX^ 

TTTGTATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGATGGTCT 
CCCCGAGTGCTGGGGTTACAGGCGTGAGCCACCATGCCCGGCCCTTC 
AGGTGTGGGGGCCTTCGGGGGGCTCGGGAGGCAGTTTCCTTTAACTTGC^ 
GCCCTTCTCGAGAAGCrGCCCCGCATGTCCCATGTCTCTGGGGCrrGTACIT 

GCCCACGGTGCTCGGGGTGAGCCTGTGTGGTGAGGACAGTCGGCGGGCCCTGGGTCTCT 
GGGCCCTGGGTCTCTGGCCACATAGCGGTGAGGACAGCCGTTGGGCC 

TGATGAGCAGTGCTGGTCCAGGGAGGGTCCCACCTGCATCCTCCGCCTGCCTTGGCCCCTGCAGTAGGC^ 
TGAGCGGGAGAGGAGGCTGGGCCGGGGCrcCAGCGGCCACAGAGCCTTC^CCCTCCCAG AAACG 

5 0 GCCTCTTGCAC^GGGTGGAAGGTGGTGCCTGCGTGGATGGTGGTGCACCGGG^ 

TGCACCGTGTGGCCCCCAGGAGTGTTGAGACAGTCTGCCTGGGCCTCCT 
CAGGACCTCACCTTAGCCTTTCACACCAAACATTCATCTCCTC 
GAGGACAGCCCCAGCACCTGTCACCCXSCCAGGTCATGCAGCTCGTGTCCCTCrGTC 
CACCGTCGTACCCCGCCCraCTGCGTTTGCTCCCACCGCCCT^ 

5 5 CTGTTTGT CTTGCAG CAAT CATGGAATTTGGGGG CTGG CGG CG CTG CG CACAT C AC AACTT CCAAACCCACT ATTTT ATAAAGGGC 

CCAGTCAGTCACCTGTCTCATGG CAGCCAC ACAG CTGGTTCAT AGCGAGGAGAGGTTTGAGCTTCCTGGTTCTTAGGCCACAGCCC 
ATCCCTCTTAGATTTTAAAACAGATTTTCAATGAGTTCTAAAACCTGTCTTACTGGAG 
GAAGGAAGAGAAGCGTGTTAAGGTTGTGCTTGGAAAATGTTTCCATCTTAGAGAACTGGCCGGGCrc 
GTAAGCACTGCATTGACACAGGCTTCTCCATCTCCATGTCAC^A 

6 0 TGAGCACGGGCACGTGTGGCCCGGX5CX3AGGTCTGTGTGGTGCTGAACX3CCTCCT 

GCCCTGGACTGGCAGCTGCCGTCCCACraVGGCCCGTATGA^ 

CGAG ACGGAGGG CAGCCGGGGGGCCGTG AAGGCGTCGGCCGGAGGACACCCCATCGTG CAGGTAGGCACTGCGGCCAGACTCGCAC 
GTCACTTGGTGCTTTTGTGGTCAAAAAGTAACTTTGGAGCCACAGCTAACTGT 

TCTGTCXnTSGTTAGTCCCGGGCGGCTGCTCACTC 

6 5 AGGCCCTCTCTCGTTGTTTTCAGATTTGCCCGAAGCAGCTCTCGTATC 

ATTCTCCCCTCTCTAAAGCCTCTGAACTTGGAjSGCAGGAGTTCTCACCA^ 
TGCAGGTTCTGTGGCCCACCCAGGCTTGCAGGATCCCATACTCTGGGCTTGGGTGC 
GAGGAGAGGAGGAGTX^CTTCCAGAGCXK^CCTGGGCCTGGCAGCCACA 
GCCGCGTCCCTGCTGGGGTGGGCCrC^CTCCCCaX3GCTGTGGTGTGGCTTC 

7 0 GGAGTTAGGCTGTCGAGGGAGGGGTGTAATGACCGCGCCAGGGACGGGCAGCGTCTGCACAGAGCCA 

GTGTTGATGTGGATCCCGGGTAGCCGTGGGTGTTGGCATTTTCTTCGTT 
AGATCTGCTGCTTTGGCAGCCACAGAGCAGCAAGTGTGTCCAGC 
GAGCACACCCCAACAGCAAAAAACCGGGTGCCCCAGCCCCGTTATTTAGGGA^ 
CCTCGTATTTTTATTTTCTTCCATTTAGATAAATAGTGGAGAGAAGGTGG 
7 5 CGGCGGCAGGGAATTAGCGGGCTlGGGCGCrcGGGTGAGGCGTTTT 
TG TGTG CTTG ATG CCAATT AGTACCTC CCTCGTTGGTGCTTGAAG C AAGGAAATGCGGG AG ACACCTGAGCGCACAG TGGC CCGTT

TGACATTTAAAGTCTCCACCCTCCCVIGGTGGGAAAGGTCAGACAGCGCTGCAC 

TCCCTGCCTCGGTTTTTACTGACCGAAAGCTTCTTATTGCTCAG 

GAGGAGTGGAAATAAAGCAATAGAGAAATAATGGCCCAGTCCACCCTATTTGCTGTAAAT^ 
5 CC CTTTTC^CTGTATCTGGATGAGGGG CGGGTGT AGG AATCGAGGCTGTT ACTTGGAGGGT AAAG CAGGAC CATGAGTGTGGAGAG 

AGCTCTGAGGTTTCTGTTTG rrT ' ^ 

GAGTGCAGTGGTGTGATCTCAGCTCACTGCAACCTCTGTCTCCTG^ 

GACTACAGGCAGCCCTGGGTTTTAATGGATGTATTTT 

CACAGCTCAGTGCTTGGGTACTCTCCCTGCGTCTTCTGGAATCT^ 
1 0 CTGAGGGTGCAGAAATCTATCAATACAGATGCCACTGAGAGGATCCAGGTGTTCT 

AAAATATCTTTAAAATGTACCCTCAACAATTATTCCAAATCA^ 

TGCGTTTCCACAGAAGGAGCCCTTTCCATrGAAACGCGGGAGAAAACGGTCCCCT 

CTCATTTGTAGAAATTAGTAATTTAAAGAGTTTTGTGATG 

GCAGCTAAAAAGAGGGGGTTITGGGGGTTAGTTTTCTCTTTCAGTTGG^ 
1 5 TTTAAAAACAAACGAGAATAGAAAGCTCTCAGTTGCCGTGGAAGCT 

TTTAGTACATCCTTTGACGCGGCCCCTTTCCCGCTTTTCAGAAACGAGGCTCATCGCACTGG 

TGGGTAGTGGGCGCTGCGGGCGGCTGXSTCAGAGCCACTGGG^ 

TGCTGTTCTCTTCGTTGTGAATCGTCTGTCCGCTGCATCTCAAAAAGCT^ 

AGCTGAGCCCTGGAGGGTGGCGGGGGGCCGGGGGATGTGGGTGTCAGCTCTCGCG^ 

AGTGAAGCACGTGAGCAGAACGCAGCACCTTCTOTCCAGGAAGCATTCGG^ 



GAGGGGGTCCGGCATCCGGGGCTTCTCCCATTTAGGGCCTTTCCTTCAGAGGGA^ 
GTGCCCCCTGCCGTCTCTGTGGGGATGTCATTGTTCCCACGCCC^GGAGAGACCA 

2 5 TTGCCATGATCACACCGTCCGCACCCTGAGGCCACCCCCATTACGGCTGCCA 

CCGTGGTTTAGCCTGGAAGAGCGGGGCCCTGGAGCCGTCTGCGAGGGGGTCTGGG 
CGTGGTCCCTGCGGGGTCTGGCGGGACCTCGGCCATGGTGATGGGTC 

CCCGCCCCCTGTGCGTGTGAGGGGATAGTGGAGAGAGGGGTCTGGCGTGGGCCTCACTGCTCCTG^ 
CTGCITAGGACTCACCTTGCCCTGGTAGAACAAGTGACTCAGACAGAGC 

3 0 AGCGC^GCCAGCCTCAGCTCCTGCCCACACAGCCCCCTTCTCACCGGCTCCT 

TCGGTGGGCCGGTCTGGATCTGCCGATGCCGGGACCTCGCCAGGCCGCTGGTTGCAGGGGAGGCTGAGGGGGCGTCCGCCT 
CTGCTTCTCACCAGCATTTCCAGTGAGGCATGAAAGTGAT 

GCTCACACTGAGTGTCCTGATGGTGGCACCTCGGGCAGCCGCCACCCTCTGCCCrCCACCCTCT 
GAGGGGGAGGGTCTCTCTGTACZACACAGTCTCCrCACCCTGGAGGGCCTGCTCT 

3 5 AGTGCTCTGGGTGGCTGCTGGCTTTTCTCCTTGTCrrCACCTCCrrC^ 

ATGTAGGCAGCCGGCATCAGGATGGTCTGGGGTGAGACAGGCAGTGTTCTTCCCCTTCTCT 
GGGCACAGCCATTGAGGCCTTTTCGTGCAACTCATTGCCT 

ATCTGGGGGACTGTGGTCCCGGAGGAGACCTGGAGTCCGGCTCCCCGCTCCGTCCGAGGAGGAGCTGAGCGGCGTGAGOW3TGCCC 
GCCCTGGTCACTCTGCCTTGCTTTCCTCCTCTCTGTTTCTGGTTGTGC 
40 AGCCCCCTCGCCAGATGTGTTCCTCACATGAGTCAACCCGTCCTTGGCCTGCGTCC 
GAGGTGGCCGGGAAGGGCCCAGGGCCTTGAGAAGATGTCTTTC 

OXTCCCACGGGGGCACGTGGACCTCTGTGGGATATAGCCTGACCTCTGACCCAGGGGTCAAATA^ 

AC CAAAAT AAC CTGCC CG AT AGC ATCT CACCCAT CACACGGG AGG C CG G G CTCGTGTG AAT AATTAAT AGGTCACTG G AATG TGAA 
ACGAGAAGCCTCGGCCCGTTCTGAAACAGCATTTTAGGTTTGCAGCTTATAATATTGTCAGT 

4 5 CAAAGAGGTGAGGGCGGGTGGTCTGGGATCCTGCCTC 

GGAGGCCAGACTGGCCCCAGGGAGGGCTCCTGGCCATACCCCAAGCTCAGGACGCAG^ 
GGCCTTGGTGGGCCrCTCCTTCCATTGCCCAGCTTGTAGGTCAACTTTGCCTCC^ 
TAAGAAACTTGACAGCTGAAGACCAAGGAGTGGAGCCTCCCGTGAGGCAGCTCTTCT 
AGTGGTTCTGACTAATTCTCACTTTGTGTGGTACCTTCAGCTGGCAGGGGCTCG 

5 0 TGAGAGCCCGTGAATACCGTGGACAGATTACCACAGCACGCACTGTTCCACGCAGGCC 

CACACACAGGAGCTCCTGGTTCTCCATGTCCCCTGCCAGGCAGGGCTGTTAAACTGGAAA 

ATTGTT ATCTG ATCAGAGCTCCGAGGCCCCACCAGGAGCTGTG CTATTGCAGAACCTCAAACAGCAGCCGCTCCGGCCCCG CCCCA 

GGAGCCACGCAGAGGCCCACGCACGGGGAGGACGCCGTGGGGTGGAAACTAAAGTGCAGGTCTGACCACAAAACCAC^ 

CACCG CTG CTG CCAAATG CCGG AGGC CTG CAGGAGG CACGGATT AATTTTTT ATGAGT CTG C CTTGG AACCT AG ACAACAG CCCTC 

5 5 TCCTCTTTCAGGACAATTTCAGGGCGAATGAC 

ACTCTC AGAAGTG CAGTGTTG AAGCC CGAGCCTTCCCCGTGTCAG CTTTTATGATGAACG AG^ ATTCACCATTT ATTTCTCCCCTA 
AAGCAAAGCCCACGGGCTCTGTTTTGAGCGATGGGGACAGATGCT^ 

TCAGGAGACCAACGTGGAATGAGGTCGGCCCAGCCAGTCACTGCTTAAAAACAGTAATGACAATATTTG 

6 0 TTGTGGAGATTTACTGGCACGGGGAAGGCGTGTGGCCTCTGGCAGGTCT 

CTGCCTGGAAAAACTGGCAAAAATGTGGCCTGGGTCrC 

TACGAGGCGCATGAGTGTCCCCAAGCCCAGATCCTCAGCCTCCAAAATCTTCTATTAGTAA 

TTTGGGAAAATTTAAGATGACGCGTGGGCTCTGGCAOSGCCGTTGCTAGTGACCTCTCCT 

GCTTTCCTAGGTAGAGAGGGGTCTGCACGCCCACTGGAACATGCTCTATCCCAGCCTCAGAAAACCCACATT^ 

6 5 CAGGAGG AGTGGACACGT CACCTGGAGACCACATGAGAAG CGG C CAT CTG CAGCAG CAATCAC C CCGTGGGGAGG CGGCGCCCTGG 

ACCCAGCAGGTGTCCAGTGACCKIAGGGTGGTGTTGGCCATGG 

CTGGGTTCTCCCGGGAAGGAGCGGCCACTGTCTGCACTGTCCAGTGCTC 

CCCTGCTGTGTCTGAACAGCTCCGGAGTGCAGGGTCTCTGTGTCAGTGCCGGTTC 

CCCAGGCCAGACGTGGGAGTGGCGGCTGGGAGCACCCATGGGCGTGTGCCCCT 
AGCCA.CACCTCCCTCCTCCAGGCAGGACACCGGGCCTTGTCTC 
GCCCTCCGGAGCCGAGATOCACCATCATACACACTTGGAGCACATAGACTGACC^ 
GTCCTAGGGAAATAGAAGGTTTCACCCGGAGATTGG<K3AGTGCAGGGCT 

7 5 GGGTGCTGCTGX5ATGGTGGCTCTTCCACCCCAGGATTC 
TCGGGCTCT<X3AGGTGCACATGGCTGCrGGGCGGG

CCACTG ACCAG CAGG CTT AGGGGC CATT CTGGAGCTGGGAGGG CTTGTGGGGGAGG CAGGATG AACAGGGGCTGCTTGCCGG T CCT 

CAGGCCACCCTGCCAGCTCCTTGATGTGTCTCCCTGAGATCR^ 

AGGCCCTGCTGGGGTAGACTGGGGAGAAGTGCTGAGCGGAAACCA 
5 TCAGGCGACGCGACTCTCAGTGGCACGGATGTCTCTAGAAGGTTGCCGTTTCT 

CTTCrGGCCCTGGGGTCACGGGGTTGCGGCGTTTCCCACTTGCCT^ 

GGTCGAGGTTTCTCAGCATGGATTAATTGATTAGATTTAATTAAATCTAAACGT 

GTGAGCGTGGCCGTCTCTTCTCTGGGCGCTTCCAGGTGCGTTGGGGCCCCT 

TGTCGGTGCTCTCGTGAGTCTGGGGTTCGTGCCGAGCGCCATT^ 
1 0 TCCCTGCATGGCACAGGTTGGGAGAGCAGCCGGGAGGGCAGCGGGCCGAGGCGGTGGGGGA 

CXKXTCTCTGGGGCTGCTCTTCCGGAGAGACXSTGGAGACCCGCGTGC^ 

GCCGGGCGGGCTTTCTCCTCCCCACCTCTGTCCACCCTCGCCGAGGCGTCGGTCCTGCr^ 

TGACAGCX3CACGTGCCATTCACGGGCGCCATGTCTCCAAGAAGAGGGGCTCCTGCCGCGTGCCTGTTCT 

TCCTTCCAGCACCTTCTCTGGGCTTCCGGTTTGAATGATCTTTCTCCAAAGTCGGCG 
1 5 AGAAAGCCCAGACACCTTGGAGCCGGGGTCCGGGAAACAGGCCCTGGCAGCTCAGC 

GGTTCACAGACGTGCGGTGGCGGCGGCGGGTGGTGGCAGCAGTGTGGGGTTGTTGGCGC 

GGAGCTTCTGCTCCGGCG<^CACGCCTTGCTGTAGCCTGCAGGTCAT 

GAGCTCAGCTCTGAGCATCAAGAAACCCGAGTTTCTCCCCACTCCCACCC^GACftGCTC 

2 0 GGCAGTCCTGCAGCTCCGCCTGCCCCCr'rTTGAAAACCAAAAATCCCAAGAGACT^ 
GAX5CCTGTGGCTGAGTGCAGTTTTTCAGT05CGGCTGGAAGCT 

ACAAATT CAGTTTT ATCTGTGG ATCTGAACACG AT CT ATGAAT CACT AGG AAAG CAACTGTG TT CACGGGAAGG ATT CTCAG AATT 

GAGCTGACACTCCGTGGTGTGAACTCAGTATTGTCAGCTTGTGCACCGCGCGTATGGACACAC^ 

CTTGCCTCGGGCCCCCACACACACCGCCCTGAGAGCTAGGAAGACGAATCGGTCAGTCACACGGA 

2 5 CCAGGCATGGTAGTGAGGG CAGCGCCCACTACTCACCCAGGGCCTCTCGTG CAGGAAACTGCATGGACACGTCXSTCCTGTT'CCTGT 

C^TGGGAAGGGGCCTGGAGCCGTCCCTCGGGGGGTGGGGAGGAGCTAGTTTCCCAGTCGGCGCCCGTGCATTCACCGGT 
TCCACTCAGCAGGCTTGCTATGTAACCTCCGCCGTTCAGGCCTCAGGATGCAGCAAG 

CCTCGGTTCCTGCCGGGCACATGGACGCTTCCCTTGAATCCTTCTGCCGACGGCATCTGTGGGAGGGCAGGTACGATC 

3 0 GGTGCTGGTCCGCCCACCCTCTCTGGGAAGGGCTGTGCCACATGGCGAAAGCTGCTGTGGG 

CCAGGGCTCATGATGCCAACAGTG CCAGGTTTGTGGG ATTTGGGGGT CAGGAGGG CCTGGC C CTG CT C CTGCAAGGC CGAGAT CTG 

TTCCCTTGGAGGGTTGAGGGAGTGTTGGCTGTAGGGTTCCTGGAGGCCTCTCTCATGCACTCT 

TGTGGCCAAACAGGTAGAAATAAAACCTGAAATGAGAAAAAAACTTTTTTCT 

TCCCCGTTCCTCCCGTTCCTTCTCTGCACTGCACACTGGGAACCAGGACAAGAGGTGTGC 

CCACCCCGAGCCCACGCTGCTCTGGAGGGAGACCCCAGCTTCCCCCGCCTACCCCGAGCCCTCGCTGCTCTGGAGGGAGACCCCAG 
GCCCCTCAGGCCCTTTCCCGCATCTGAAG 

GTTTACAGGGAAGCACTCCTCACGTACAACCTGGCAGACCACACCAGC^ 

CGACCCCGCAGACCACACCGGCACTCCACACACAGGCACCCCCGTGGGC^

CACACACAGGCACCCCCGTXX5CCACACACATGC 

ACCACACAGCCAGCAGGTGAGGAAGTGGACGCTGCGTGGCTGTCTCTTGGTGGCTTCACCTGGAATGTC 
CCGAAGCCCCGGCAAGCACAGAAATCACACCGGCAGCCTGTCCCACTTCGGGCAGGTGAGGCCTG^ 

AGGGAGG ATG CGGGGGTGTGT CCTGGTCCTCCACCCGTTTCGG^TTTGGGGTGCTCCCTTCGACTACTGG CTCC^TTTTTGATGTC 

TTTTACTTTTGTTTCGATTTTTCCTGGTATCTTGAG^

CTGTTGACTCATCTGCITTGGTGTGCCGTGGGTCACGTGGGCCTGGGCTGTGGGTAAGGTGGGCCTCGT^ 
GCTCTGCAGCAGTGGCTGCCGTATCATCCTGGCCAGGCCCCTGTACCGAGGGTGGAGCCACGGGG 

TGCTTTGGTGGAGAATGTGG CTGGACGTCCTCCCATGACAGATGCACTCTCCTGCCTGGTTCCCTGG CCCCGGGCCCAGCCCTGAG 
AGACGCACCACGGCCTTCACCTGGAAGTGTCACCrrCTGGCTGCTGTTGTGGGTG 

GACGCTGTTCACACAGCGTGGCAATCTCTCAAATTAATCCAGCATGGGGGGAATCGGGGTTGAAACCGGTTO

CTGTCCCTGTGTACTTCCTTGGGGCAATTTGAAAAATGCAGATTTTACTCXSTAGGCC 

GGCTGCTCCTGTGCCCrGCCTGGAGAGAAGCCTGACTCCAGGTCGCCTTGGCT 

GAGGGTGAGACATTCCCACCCGTGAGCTGCCTCTGTCCACCC 

ATAAATTTGGCCTCAATCATCCACTACCCACCATGTCACCGTGTAAGTGCTGAGGTGAGCACAGGAGCTAAAA^ 

CGTGTAGGAGACTCAGTAGCTCTTATACCTCGGCAGTGGAATAAGAATAATACATTTTGTTTAGTTAA

TCATTGTTGTGACAGGGAATTTTATAAATAGGATTCTGACAGGTTCTGTACTGTGCCAGAGGCAGA 
GTGACGTATTTAAATGACTTTTCTTAAAATGAACTTCATGTTTTAAATAATAGAT^ 
CCATTTTCACTGAAATGGCATGTTTTTTAAAGGATACAGTCATACCCCAAATTCACAGCATTCCA 
CAGGGATCCTGGGGGAATTTTAAATTGTAGTTTTAAGACCACGTGACCCTGGAGCTGCCC 

TGGTTTTGGTCAGCGACTCATTTACATTTTTTTCTATGTAAATTAAATTTTTAGGT

GCRGAATCCGCCTCTTAATCTGCTTTTCTTTTTACATTTCCCATCATAAAGATGTT^ 

GAAGTTCATTAACTTGGCAAACGGCTGGAGTCTCTCTTCCGAGGAGCTGAGGGCTGG^ 

ACTGTTTGTGTGGAGACACTCATGTATTCATATGGGAAGGGGGGCTCTATTGGAAA 

TGTCCTCATCCCTGTCTGTGGCTTCCCCACCGTGTTCCCACGTGATGCCTTCGTTCAGCCTCTGGCTT^
CGCCCCTTCGATGTCTCAGCTGGGACCTGGGTCCCGGCTCAGCCTCACGCGGCGTCCCAGT 
GGTCAGGCCCTCAGTGACAGCAGCCGTCGCTTCACTGTGGCCTCGTCCCTGGAGGGCTCTGAGTTCAG 
CTCCTCCCTGGCGGGGCTCTGAGTTCAGCCACCGGTGAGATTGGTTTGTTCTT^ 



CGCCGTCCCCTGGCCCGGGCTCCCCCGCCTTCTCCCTCGCCTCGCTGGCCCTGAGTGGGCAGCAGGCATCTC
GGCGTCGCATTTTCCTGTGGCTGGAACGTGCCTGCAGGAGCCCAGGCCTGGAOT 
TGTGGTCAAAGATTTTAAAGTTAGTTTAGTGAGAAAATCGACTCCAAACGTG 
TCCCGGGGCTGGGCGTTGCCACGGGCCTTGCATGGATTTTGCTGATGAATCTCAGGGCT 
TTGGGGGATGCAGAAGGGAAGGCACTCTGAACGAGGGGCCCTGTGGGAGGCCCCACTGCGC^ 

CTC^AGGCCTTTGTGCTTGCTTTGAGAATC^AAC
ATTGTAAATGGTAGCGTTAAAGATAAAAACAAACAGTTCTAAAAATGACTCAC^^
GGAGGGTGAX5AGGGAACGGCACCACACTCTGTGTTTACCTCTCAAGACGCTGA 

AACATGTGTGTGCCCTGC CGTG AGG CG CACAC CTT CC CAGAGGCGTGAGTGGGG CC CGG C CACT AGTG TCCTGG CGG AAG CACCGG 
GGCTGCTCCATTTGTCTGGAGTCCTGTCCCAGTGGTGTGGGTTTTGAGTCTCACCCACACGGCG 
CTCTGCAGGGTGGTGCGGACTATAGACACCTCCATGCATGGTTGCCCTAGGCAGTGT
GCTGCTC^GCCACCAGGACGGCGTTGAGCGCTCCCCTGCCTCT^ 

CCTGGGGCCCGTOrrGCTGTGTGTGTGCAAACGCAGTGTGGCGTCCCCTGATCAGGAGTG^ 

TTAATGGAACACAGGAAGGTGGGTC^TGTCCTGCGGGGCCrrCATCTGGATTTATTC 

GGGCTCCGCTTTCCTGCACACGCCCCCTGCCAGCTTCACrGTGGCCT 
AAGGCCCGTTTCAAAGCCTAAAACTCAGTGCCCAGGTCCATTGGG^

ATGACATGGCAGAGAAATAAAAAGGGGGCAGGGCAGGGGGACACAGGCCTCGAACCCGCCG 

TCCCGGCGGTCTGTGCTCTGGGTGCTGAGCAGGTGTTTTCTGTTCTCTCGCCAG 

GCTGCAGCTTTTCATTGGGACGGCGGACGACCGCCTGCTGTC 

TGTCCACCACCAGCCACGAGGCCATCCTCTCCAAC^ 
TAAGCCGCGGGGGACCTCCGGCCTCTOSCAGGGG^

CCCX3CACTTCCAGGCTGCGTGGCCAGGACACTTTTGGTTTAACAG 

CTTCTCTCTAGGGTTTCAGCCCTGTGTATATTGCCACTTAAAATGT 

CTAAACAGGACCCCTATTTCrrTCCGAATACAGCAGTGTAATGAGAGGATGGTTAAAACGTGAGCATCAAAA 
GCACATCTGGATCAC^TTGACCTTTAGGAGGTTCCGACAACACGGATAGGCCT 
ATTTGTTT AAAAGTG AATTTTTT AGTAAGGTGAGGACGAAACATGT A^ C CTT ACT CACTTAATCCTAGCTTGT

GCTGGGCACGCTGG CCCTAACCTGTGAAGGCAGCCGAGGAGGAGGACTG CTGGTTCCCG AGAGATTGATGTGGTACTGTGGGGGGA 
ACCGAAAACAGACACATCTCTGGGTGAATGAGTGCAGGTTGAGCTGGGAC^ 
CGGAGGGCAGGGACGGCGTCCTGCGAGGGTGACCCGGAGGCTGGGAGC^ 

AGGTGGAGCCCCGGGGGTGGGTGCAGAGGTCCTGGGGCTGCAGGAGTGGGGAGGAGTCCCGCACG^ 

ATGGCGGCXJGGGGGCCACGTTGGGGATTTTAATAAGTGATTCAA^

TTTTCAAGCACTGGGAGGAATCTACGCACCTGGGAATTCATTCCAAATCCCT^ 
GCAGACAGGGGACAGTCAGGTGCGTTCCACAGCC^ 
GGGCAGAAAGGGTGGCCGGGAGACGCCCGCAGCCAGGACCX3CGCTCC^ 
TCTGTCTTTATTCTCATTCTCTTAACTTACTGAAAAATACTGCT 

GTCrAGTTTTGAATGAAATTGTTAAGAGGTTATATGAAACCATAG

ATAGTGGAAATAATAGAATAATTTATTCTAATTATGATTTATTATATATTITATATAATTATAAATATATAA 

AT ATAT AAT AT AT AATT ATT GT AAATGATAAT AAT AAATTGAG CTITTTTGGG AACT ACT CCAAG AT AAGT GAGAACAG CAGG TG CC 



TTGGGACAGCTCCGACTGGACCCX5CTTGGTCAGGGGTGGAGGGT<3GAGGCTGGA 

GTGGCTTTTC

AGGGTCTGGGGGGCGAGGCCTTTATGCTCCCGCGTCAGCCAT^ 

GCCGGAATCCTGAAACTCAGAAACTCCGACATTGAACTTC& 

GGTGTTCCX5CGTTCACGTCCCGCAACCCAGCGGCCGCACGCTC 

CGTCCACGCGCCC^CAGGGGAGGAAACCGTCCCGTCACATCCGC^

TGGGACTGAACGGTTCCCACCAAACCCIACCACAGTGTGTGGCCACGAGGGGTCT 

CTCCCC-GGGGCCCCTGGCGCTGGCTCCTCTGATGGCTCAGCATGGCC^^ 

GCTGACCTTTGCGTATTTGTGCGATGACTCTGATTTGAGTGAAAT^ 

CGAGGTCGAGACCATC^AGG ATGAAGTGTGCACGGCTTCTCCCCTAGAAGTAGTGG CATCCG CAGGGGTCGGC CCGCAGGTCGTGGC 

GGTGCTTGCTCTGGGTGGGTCGGCrCACGTGTGACCT^

ACTGCCCCTCTCCTTCTGATGC^GCCCAGCGCTCAGCTCAGGAGCrc 
GTGGGCGGGAAGAAGATGGTCCTGTCTGGCCACAACTTCCTGCAGGACTCCAA 
CTTCACCAGGGGCCATCTGCGGCCTGGGCTITCGGCGCTGGTGGGAACCGGT^ 
TCACCGKSGACGGGGCITATGGGGTGTGGCACGGAGCAGAG^ 

GCCGACAGCTGTGCTGCTGAAGGGAAGAGGTGGATGTTTATTTCTGGGGCCGCC

CAGGGAGCCCACGGGCTGG<3GACGCAGAGGAGGGCAAGACGCATTTGCT 
CTTCGGGGGTCAGTTAGCAGCAGGGCAGGAGGGAGCTCCCCTCCGGGGCCTTCCCCTCCTO 
GGATGGGGTGGGTGCCGCAGACGGCGGGAGCTGGGGTGGTCTTTAATGCCAC^ 
GGCACAGGTCACCCCTGCCTCCAGGTGTTGCGCTGGCCCAGCCTGG-^GGCT 

GTGGCTCCCAGCCCAGGTCAGTTCCATGCAGACGCCGGGCCCCAGCACTCTCATCCC 
AGTGTGTGAAGACAGTGGGCTGCACTGATGGGAAACTGGCATTGCAGGGAGAGT^ 
GCCCAGCCCTCAAGGGGGTTCAGGCAGAGTGGGCCTGGCT^ 
GCCTTGCTGGGCCrCTGCCCAGGGTGCXSCCAGGTGAGGGGCATGCCC 

ACTTACCAATTGGAATTCTTTTGCACTTACTCCAATCCCTGGA

AC AGTTTGCCGG AAACTTAAGTGCTTCAATCAAAAT CGGCTTT AGAGGTCTTTGACT CTTCCT ATAGAATCAGGGCT 

TAATTXKSAAAACATGTTTTAAATTACAGAGAAAACCAACrAAGG 
GAGGAGGCTAATTTTGAAATAATTTTCCCTTCAGAGCTTTOT 
AGGGCCTCGCTT , GGGCCAAGACTGGCCAGAAC<^TGGCTCCCAGCTT 

TTTCTTCTGCCCAGAGAGGCCTGGGCACCTTGGCAGAGCCTC

CAAGT7ACTGTGCTGAGTTGGTGCCACGGCGCCATGGGTAAGGGCAGCCACCA CCAGGAGC CAAGGGGATGCAGG 

ACCAGCCCTGGCCTCATGCGGCCGCCCCTACAGGCAGCCCCTCCTGCC^ 
GGGGTACCTGTTCTGCCCTTAAGCTCGGCTGCCACAGCCACACTTCTGGGCAATG 
CC^ATGAGGT CAGGGTGGGTCTCTGA.CC CTCTGCCAGCACCAGCCGGCGTCTC 

TGTGCTCGAGAGCACX3TGAAGCTGTGAATACTGCAGGGATGAAGCC

TAATTTGCATATTCATACACACACAAGCACATTCCATGTTT^ 

CCGGCTTCTCAGGCGGGAGCCGTTGCTTACAACACGTGCTGATGTGCAGTGACAGGTTGGTGCAGCAGAAAGCCCCA 
CCTGGCAACGAGAGAAACCACTAAACTGAATTTAAATTAAAAACACAGTTGCC 
TAAAGTTCTGTTTTAGACTGCTCTTGCAGATGGAACCAGGGCCGGGTGTGAGCT 
CAGTAGGGCAGTTGAAGTGGCTTCTCCAGGAAGGGCCCCTGCGTCTCT
GCTGTTGGGAATGTTGGCTCTGGACACAGGCCAGGGTCGGCTCC^
GCCAGCTCAGCCTCTGCGTATCAGCCAGGCCACCTTCAGCTGCTATGGGTTCAC^ 
AGCGGCTCCACTCCCCTGGAAGGCTGGGAGAAGCTGGTGTTGGCCT 
GAGCCGGCCGGGCAAGGGCATGGCTCGCCGTAGCCTGGGTGCCGGGTCCCT 
GTTTGAGTCTCAGCCTGCGCTCCAOGGAGCTTCCTCCCAGGTGTCTCAG
CTGAGGACTTAGTTTACAGCTTCCTCGGAAAACGACATTTTC 
GGCTGTGGGGAGCCXy^GGGCCGGGTGTTGGTT^ 
C^GCTGCAGGAAGTGGGAACCACACCCCTCTCCCACCCACACCCCT^ 

AACGAAACGCCCGAAGAACGCCCGCGTGAAACGTGAGCTCGGGCTC

TCTGAGATCTGGATGCATOIAACATGAGCCAGTTCTCGATGCTGGAG 

GTGCCGTTTTTTAAAAACATCAGCGAAGCCTGTTTTGGTGTCGCTGCGCG^ 

CTCCGTTTGGCAGGGATCX5CGCAGCGGGCAGGGCTCAGGGGGTGGGGCTGCACGGGTCT 

TGACTGGC^GGCCAGCTGGGGCCTGGACGGCCGCCGTCTCCTTCTCC^ 
TCTGGGGAGGCAGCCTGGCATCTCACAGGCGGCCGGGGCTCCCXK3GAAGCAGAGACG

TGGACCCAGTGCCGTGCCACCCACCACCCCAGGGGCCCTGCCCTG^ 

CTGCACGGTGGGAGCGAGTAGACAGAGCCCCTCCCGGAGGCCCCCAGCCCAGCACATCCA 

AGCCAGGTTCGCCCCAAACCCAACATGGTGGGACCCCAGCTCGCCCCCCATCTCATG^ 

CTGGCCCTGGATGTTACCACGTGGAGAAGACAGGCCTGTGGGCAGGCACTAGCCACACTC 
TTTCTCCACACAGCAGCAGG CGGAGGG CCIGCCTCCCACCTCACTGGGAGGGCCAGGACTCCAAATCTGGG CCAAATGTATGTTGT

TATTTTTCAAATAAAGGGACCAAAGGCCCGGGCCTTrCCAAGGCTGCCCT^ 

CCACAAGCTCTGACAGGCACCTGGCCGTTGTTTTGGGAGGCTCCACCC^^ 

TGCAAGGACACGGAACTGTCACGGAACCACCGTGGCTTTGGGCTGTCACGGAACCACCGTG^ 

CGGGACACACACATTCACCGCGGCGAGTGTGGGGTGCATGTCCCCAGGCAGAGGCCATGTCATCCAGGTGC^ 

AGGACCCTCCTGTCCGCCCAGAGAGCCGGGTCAGAGGAGTCCCCAGGCCTGAAGCCACTCCCT

CAAGGCCAGGGGGAGGTTCACCCACCCCAGGGTGTCGCGGAAGCCCCGAGGCCCCTGCAGG 
GTGCCCGCTGGCCACCCTGCCCCTCTCCTCCTCCAGCTGCCCCTGCTCCTTGTTT^ 

GAGGAGGTGAGGGTCAG<MCCCCTGTAAGGCAG ACTAGTGGCACTG CC CCTTTCGGCCCTTGCTGT CCCTCCTCTGTCCCTTGTGG 
AAGGTCTCX3GCCTGGGGCAGAGGAGCGGGC03AGGAGGAAGGTGGAGTCCAGGTGGGTGGT 

GCGGGATGTAGGCCGAGGGAGATGGACCCGGGGGATGGCTGCCTCTGTGG

GCAAGGAGGACACAGCTGGGAATTCCCAGCTCCAGCGTCCGCGGCACGGGCAGCCTCCCTG 
CTGAAGATGTTGGATGCTGGCCCCGAAGGCAGCATCGCAATGACAACACCTTCAGAGGTGTCCATC 
TGCAGATCTTTCAGATTTGTGTTTTCAGGGAGAA 
GCTCCGAGAGAGGAGAGTTGGGGCTGCTTTGTCACCACAT 

TCTCTGTTTGCTCCCAGAGACCACACAGGGAAGGCAAGGAGGGCAGCGCCCGGGGAAGAGACACCTGCC

CTGAGGATGGCTCAGTGGCCCAGATGGTCCAGGACCCCTAGAATTGAGTGATGA 

AGCGCTAGGGGCGACTCTCAGACTCCCCCTGCCTGGGCCCCGCTGGTGTGGGGAGGCCACGCGTC^ 

GGAGGGGCCGGGCTTCTGGCTCCCCTCCCTCCTCTCCCTTCH'CAGC^ 

TTGG CT AGTTT CTTTTTT AT ATCAG CTTT AT CGAAAT ATGAT CT AT AC ACCATTTAATT CACTCAAAGTGT AT AATT CGGTGGGTT 

TTGGTGGCTTGACATAATTATGCGGCCACCACCAATTTCAGAACATTTTCT

GGCCTGCTGGGGAACCCCATCCTGCCGGCTGTGGTGCCTGGCAGCTTCTGTGGAGCGTC 

GGGTCAGGGCGTAGGGTCCTCAGGTCCCCTCGTTGGCACAGGTCGGGACGTGGGG 

GGCTGCCCACACCTGTCrcTCGATGGCACTAGGCCACGTCC^CCCTGTGGAGAT 

GCCCTGCAGGCAGATGTGCAGCAGGGCTGGGTGGCTGTTTCCCCTCCAGAA 
GGGAGAGCCGGGGATGGATGAGACGCCCCAGGGACGCTCGCTTCTGGGCATGAGGATGCT 
CGGGGAGCAGGGCACGAGGATGCTTTGGGGAGCAGGGCAGGAGACGCC^ 
ACGCTGCTGCTTCCACCAGCGGCCCTCGTGGCTTTCTTTGTGGAAACAGT^ 

TCCGGGCGGCGGGTCCTGGCCGGTCTGGGAGGCTCCTGGGCCCGCAGGCCTGCCTGTGTGGAAACCTCGCGGGTC

CCTCCCGATGACTCAGACTGCAGGGAATTCCCTAATGATCTGGAATATTTTTACTCCTG 

CX5G AG CTTCTG AT AAGGCTG AACACAAAGGCC CTTT CAGG ACCGG CTG CCTTC CTTC CTG C CAGGG GCTGAG TTTTGAACAG CACGG 

GGGCCGGCGCTGCCCGTCCGGGCCTCCACGGAGCTGGXSAGCCGCCATGGCGATGCTC^ 

TTTCCTGAGATGCGAGGTTATTTCCATCTTTCAGCAGTCTCGGCGCTTTGTCT 

GGCCGTGAATACTATTG CTTTTG AAAG ACAATTACTTTATGATCCTTAAGC AGAAACAG AAAGCCTG CCCT CTCACCCTGGT

CG ACCTGAAACGGCCACAGACGT CGGACACCACG<30K3AGAGGCCCCACCTC CAAG^ 
GTCTTTGAGGAAGGCAGGGATGTTTGTCGGTTCCCCGCACTTTTAGG^ 
GCAGGGTGCGGGTGCTCCCAGGTGTGCGGTGGAGGCTGCTCCATGGCAGCTGCT 
TCCCCITGGGGCCGCTCCGGCAAGGATGGTTCCCCACAGGCCCACGAGGCACGGG 

TTTGCTTTTCATGGACTCACCGCTGCGGGGCAGCCACCTGGAGGAGG

ACATAAAGAGAGX3AGCAGGCTCTGTTGGCCTCGGCTGCAAAAATGG 
GTGTGCGCCTCAGTGTCCCTCCTGCTCAGCACAGTCACTGAACCTCCCTCCCCTTGC 

AAGTTCCTAGGGAAG CTGAAGTTGTGACTCTGAGCTTCCTGGGCTGGG CGGGAAGCAAACT ATAG CCrGTTGCGCCCGGGAG CTTC 
AGAGGACCACAGCCTTTTGTGGGGTGTGGACGGGGCCTCACCCCAAAGATGGACT 

AAAAGGTGCCAAACAACCAGAAC^CTTAGTTACACACAGAGTGATTTGGATTr

CTTTCCAGGGCCTGGAGGACTTGCTGCTTCATGAAACTCA^ 

TTTTTCCTAGAGGAGAGAGGGATGCCAGGCTCTATCGAGTGGGT^ 

GCTGGCATGGCCAGGGCTTGKSCTGGGCGGCTCTGGGGT^ 

GGCACTGTAACAGAGAAAAAGGAGATTTTGTCTGATTTATCAGTAAT^

CAGAGCATAAGGATAAACTTAAGTGTGTATACCTGTAACTGTTCATTTTGGGAA 
AAGAGACTCTGTAGGAGATTGACATCTGTTAATTTGAGATAATTGC^ 
CCCAGGCTGTGGTGGGGCCGCCTTGTCCGTGTGCGCCTGGAGCCCCCACTCTGC^ 
TCACT ACAC CTCTC^GGAGC CTTTTT ATTTTTT AAAAACATG 
TTCTGTCT C CTGATG GG CAAC AAC TTCT CAGAGC AAGG CTCTTCTCATTT ACCC^GTGGTTC CTTTG CACAC AGG CAGGG CC AG CG
GCCAGGCCCTCATCCACCAGAGTAGACCCCAGCACGAGCA

CACACCTCCTGCATCCCTGGGAGGCTCGGCTGGGAACTGCAGAGCCGGGTCCCA 

CrGATGGGGTACCTGCTCCCCCTCACACCTGCTCT^ 

GTGAGAGGCCAGCAGATGCAGGGCCAGGACGGATTCCTGAGGATGCCCCTGCCCTGTAGTCCCT 

ACAAG CCCTGAGCCGCGTGTCCTCAGCTTCTCCAGGCTCCCAGGGCAACTTTX^ CACGTGG 

CTCCTGAGGTGCTGCCCTCGACAAAGCCCTGGGATG 

GGTCTGGATCACGTGGGGGTGTCKK3GGAGGGGGCTCCCCACCACAC^GA 

CACCATGT CTGGGAGATGGAAGCGAAAACTG AC CGGG AC CTGTGCAAGCCGGTGAGTGC CT TTGG CG CAGCTGGAG CT ACTGTGGG 

TCCCCAGGGCnTGGGAGGCTGGGGTCTGCrGGAGCCACTGCGGGTC 

AAACAAATAAAAATACTGCATAGAGTAACAGAACCAGTAACAAACAGGATCACCAAG 

CAACX5TGTCAGGTCACGTCAGAGCCCTGGCGGCCGGGGACGG 

CAGCCCAGGGAAGGTCGT CTCACACX3G ACCCTCCCGCCCTTTGCTG^ 

TGGGAGTCAGTGCTGTGCTGAGTTCCCGGGGGGTGGTCAGCACCTGCCCAGACCC^ 

ATCCCCAGTGGGGACTCCTCACCCCGTCCTCCGAGGCCACTXSATTGCGCCGACCTCCCCrGTGA 

TGGGraTGGCTCCTGGACTCGCGATTGGCC^CAGCCTGTTAGCGGGCGGTGCATGTGTGCGGGGTCGT^ 

TCACACTTCTGCTGTCACCC^GCCGTTTCCAACTCCCCCAGTCAC^ 

CTGAGGATAATGACCGTCCTCGGArTGGAGATGACCT^ 

TGCTGCGTGCATCTCTCCCATCCCTGCGCTTCrrGATCCATGTGCT 

GGTGTCTTGGATCCTCTCTGACGCTGTCTTGTAATTGGCACATGGAGACCATTGAT 

TTTTTTGAAACAGAGTTTCGTTCTTGTTGCCCAGG 

AGCTGGGATTACAGGraTGCACCACCACGCCCGGCT 

TCTGGAACTCCCAACCTCAGGTGATCaSCCCGCCTCGGCCTCCCAAAGTGCTGGG 

ATGTTCACITATTATTGATACAGTTGGATTGATCCCTACGTTGGATTTTTTGAGTG 

TCTCTAGX3CCAACATAAGATGCCACATGGCTCACAA 

TGCXX5GTCAGGGAAGTCTTGTGCACCCACTCAGTGTGTGTCCCTGCGGG 

GGAAGCTGGTCCCTTCAGAAACCTC^GGGAAGAGCCCATCTCGGGC 

AGGTCGTCrCCTGTTGGTGGGACGATCGGTGGCCTCCATGG^ 

GTCAGAAAGCAGCGTCTTCCACCCTGCGACTTCAGCGCCCJTTCCCCCAAGGAC 

ATCAGATGGAAAAGGTTCCTCCCTGGCC CTGGGTGAAGTGAGCTCACCTGCACCTT CAGTGGTGCTCAGGCCTCCCCTGAAGCTCC 
TGGCACTCIAGAGGGTTCCAGTTCAATGCTGTCGCTTTTC 

GGAACGGCCAGCCACAGGCAAAGGCAGCTGGAGCTGCTGTACCGGCCTCT 
CTCCTCCCCACAGCCCGTTCCCGTGTCCATACCGGCCTCTCITCCCC^CGGCC^TTTC 
CTGGCCGGGCGTTTGCAACTGCCCAGGGGGCTGGCAGCCTTCGTCC^TC 
ATTCGTTCCTCCCCACTGGGCTCCCCTTGCTCTGCTGTGCGTCT 

CAGGCAGTGTACCCTGGTGCCTGGCCTCGTGCrGCCTACC CTCCCTAGC CCrGGCTCCGTCGGGGTCTCCACAGGCTTCGCCAGCC 

CCGCACATGCACGCGGGTTGCGGG ATGCAGGATGCCGGAG CTGGGCTGCCCAGCCGCTGGCCTG CCCTGTCCCC CTAGG CAGGGAG 

AGCAGCCAGGAGGAGCCGTGAGCCITTCTGCACTGAAAACAATCCCAGCCAAGCT 

ACTTTTGACACTGGC^TATTTCGAGGGAAGCTTAACAGCT 

TTCAAGTTTACTGGTTGTTTTTTGTTTGTTTTTTGTTTTITTGAGACAGT 

ATCTCAGCTCATTGCAG CCTC CGCCTCCCAGGTTCAAGOGATTCT 

CCACTATGCCTGGCTAATGTTTTTGTATTTTTTTTAGTAGAGACGGGA 

CTCAGGTGATCCGCCCACCGTGGCCTCCCAAAGTGCTGGGATTA 

TCTCTTAAATGGAGTTTGCTTTAATATATATATTTTCTCAT^ 

AAAGGAAATAAAACCTAGTGTAGTCTCACCAACTGAGACATAACCACT 

TGTGTGGATGTTCACGCAGACACACACACACACACAGA^ 

GTATATGTATGTGTATATATATATATTTATTTATTTATTTATTTTTm 

AGTCOU3TGGCTTGATCATGGCTCACTGCAGCCTTGACCTCCCAGGCT 

G ACACAAGTGTGCC CT ATT CTG CCCAGCT AATTTTIT AAATTTTTGT AG AG ATGGGG C CTCG CGGTG TTG CCCAGG CTGGT CCTGAA 

CTCCTGGGCTCAAGCAATCCTCCCACCTGGGCCTCCCAAAGTGCT 

TTGTGAACAAAATGGGATGTGCTCTGTCTACGGTTTTTGTACTTTCT 

AAAGCAGGTTGCGTAGGTCCTGGTGCCATTTGGAACCCGTATTTTTTCCGGTACIAGACGAG 

GTGTTTCTTCTTTATCGCTTCCTTCTGTGTGTATTTAAGTGGCAGTGCTCCC^ 

TTTAGAGAGTTGAAGTGGGTTGGCCTGTGCCCACATCAACATAGGCCCTTCCAGTCAGGTAGGACTTAC^ 
GTCCAGCATGCCCCACMGGTCCCACCTCCACCCATCC^ 

CCCGCCTCCGCCC^GCCAGCCTGGCCCACGGAATCTCCGCCTCTGCCCAGCCAGCCTGGCCCATGTGG 

CAGCCTCTCCCACAGGGTCCCCGCCTCC^CCCCAGCCTGTCCCGTGGGGTC^^ 

CCTCCCGGCCCGTTACTATTTCAGTCTCGGCTTCCATGAGACGTTTCCTTCTTGTCT 

CCACCTGGTGT AG CTGCTG ACTCCATCGCAGCTGCCCTGAGCACTG CTGAATTCAGAAAACAGCCTTGGAGG AAGCGTCTCTTTAT 

CCCCCOTAC!AAATGCCGAGACAGGCTCCGGGTGGCCAGAGCTTCCCGC^T 

GGGCAGCCCAGGCCCCGCCCACTGACAGCACTCATGGCCCCTCCGGCGGCAGCTTCAGCTCC 

AAAGGCAGGAAGACGCCCATGTGCCATGGCTTCTGCTTCITCCCAATGAACTG 

AGAAAGTGTGGCCCCTGGGCACAGGGCCCAGTGGGGTGGGGAGCAGGX3GTGCGGGGAGGGCATC 

GCCTCAGCTGTTTCATTATTrGTAAATGAAAGAGCCTGGAC 

ATTGTTAATGTGATTCAATCAATTTGGCTAAGTTAGGTGCTGAGCTATAATTA^ 

TTTCTCCTGCTTCTTAAACTAAGTCATATGATTATAAAAATGAAACTTTATAGATGCGA 

AAGTGCCGCCCrCGGCAGGGATGCCTGCATC^CCCCAGTCATTTAGGGTCATCAG 

CETTGGGGTCTCCGTGTCCCAGCCCCCCACCC^ 

ATACTTGAAGGAGCATGAGTTTCGTAAACATGCTG CTC^CTGTTACRTTAAAAACATGAGAAGGTAACGGCAG CCTCCACACGCCG 



GAGTGAATTTGGGGTCTGACTGAGGGCCTCGGCCATCGTCAAATGTGACAAACACAGGCCCT 

GCCTTGGGTCGTTTrCTGAAGGTCACCAACCCTTTG^ 
GTGGCCCCGGGCAACCTCCACCCTCAGGCACCCGGTCCCAGTTCTC 
GCCCCTTCCCTGTGAGCGTGCGTGTCCACACGCGT

CCAGGTGGCTGCTTGGGAGTTTACCGTCACGGCAGTCCTGTGCTGCCGT^ 

CGGGGTTGC CGTGTGGC CGC CGTGGAAACGCGGGGTTG CCG TG TGG CCG CCGTGG CGCGG CAG C CAT CG CCTG C CCGGTG CTGATT 
GTGCCTCCTGTGTGCCCTTCTCCTGTAGAATT 
ACGTCAGTTTCT ACGTCTGCAACGGGAAGAGAAAG CGAAGC CAGT ACCAG CGTTTCACCT ACCTTCC CG CCAACGGT AACG C CATC

TTTCTAACCGTAAGCCGTGAACATGAGCXjCGTGGGGTGCT^ 
GTTGCACATTTAACrcTGTGATGTCCCG 
CATTTTCCITGATCCCTGTTGGGGGTGGGGGGCGGGGGT^ 
CAGCCAAGGGGAAAACATGGCTCTTCTGCTCCAAAAAACTGA 
AAAGG CAAAT AGGT AT AG CT ATTTTGCAGGCACCTTT AGGAATAAACTTTG CTIT^ CTGT AGTCCTGATGTGGT CTTT AA.GG

ATGGTGAATGAGCTTTGTGCTGGGCGGACGTCCCC^ 
CrGGGGC^CCHTCTCCTCAACCTGCCrGCCC^ 
TCTTACCrrTTTTAACCTTTAGGGTGTlTTCIT 

GGCGTCCCCCACGCCATCCCCGGCTTCCTTTGTGGGTACAGGCCACTCCCCCCAGCTC 
AATTTTACTGTATTTTGTGGGTACAGGCCACTCCCCACAGCTCGGAGGGGACA^

GGT AAG CTT AGCTAACAT CAAT AATCTTTTT AT ATTGCTTTTTTT CT AAAAACTG CAG AGATTC CTGTCATCTGAG AGGG AAATT A 
CACACTGGGCGCTAATTGCAAAAGCAGCTGCGTTCTCTTAGTAA 

AC AGCATTTGT AAT AG ACCAGATCCT AAGGAAG CC CTTGAATCCCCACGTTCACAC CAAACTG TTGGGACTCXTITGTTTTT ATCAG 
GCAAGCTCrGGAAAGGCCTCTCAAGATGTC<5AAAATGGGGTT^ 
CTCCAGGGGTGATGTGGAGCCCCCAGCCCAGC

CATGCTCrGGGACCCCCGGGATTGTGGGGAAGAGCCrCATGCTCTGGGACCCCa 
CCCCCAGGATTGTGGGGAAGAGCCTCACGCTCTGGGACCCCCGGGATTC 
TGGGGAAGGGCCTCACGCTCTGGACCCCCCTGGGGTITACGTGGTCTGTGCC^ 
CAAAGTACAATATATCTTACAGAAGAAAATATTTTATCTT^ 

TTG AC CAGAAATTG AATGTTTT AAAGAAAACCT ACXS CCTITAT AGAG AAG^ CAGTG CATT AATTTTTTTCTT A

TTTGCTCAGCATGCGGCAGTCCCCACCGTGGGTTTGAGCAGCACro 
TGCGGGGAGCX3TGCCTGCCCTTCACAGGTGGGAGGTGGCXK5CCCCa 
CTCCrGCTACCTCCMTCCACACCCC^CCTGGC^^ 
CCCTCrCAGCATTGTGCCCAGCGTGGACATCTCTCCTGCACCCCCT 

CAGCATGGCTCAACGCACX3CCAGGTTCCCCAC4GCTCACCCrGCX2CA

TCTTCTCTCCTGGACGTAGCACCATAGTCCCTGCOT^ 

GGGCTCCCCGGCTCCCTGTCACCCCACCIGGTCCTGGGGGCTGCGACCGCA^ 

CCC*GGCTGCCGCAGGGCGAGATCCCC»GTGTrGAa3CTGGG 

AATAGAAACCAGCACCCAGGTATCCX5TGTTCCGTCCCGCGTGCCCGCT 

CG CCG AGG CCG CTCCAGGACCnGCGTCCTCCGTGCTGTGCATCTGTGTCTTGG CTTCTGCCACGTGGTCC

GAGAGATAGTTGCCAGCTCTGCTTCTGC03ACCCTGGAGGTG 
GCACCGCCCTGACrCAAAGCTGCrCTTGGCTTTTTCCCT^^ 

TGCCGATGAGCTTCGGGTCTGCATGCCCGGGGGTCACGGCAGTGAGGGGCGGACTCCTAG 
AC^GGTGACAACAGTGAACCCACnTACACTCCAGCTGTGGAATTCATTT 

TTTTCCTTOUUVATTGCCATCCCCACAGGCA^

GTCCGGTGCTTGTGGGGGGCGGGCTGGGGACACGCCTTGGCCACCAGCCGCCCG 

GATGACTCGGCCGAAGATGAGCTCAGTCCGCAGCCCCTGTCCCGGCCCCGCCTCCCGTCTTCAGGGGGCCT 
GTCAGAAGGTCGCAGTGGTCGGCCTCOKSGAATCAGGCATCTGAGTGC^GGGGGCCCCCGGG 
CTGATGGTCTGTGATGGGGCCCAGAGGGAGAGTTCCCTTCACCCTCCCTGGGGCX^^ 
AGC!AGAATGTATG AGCCCC^TGAACGGAGTGGGCATAGCC CTCGGGGAGTCCTGC CGGCCG CCCCTCCTGACATCCCGGGGTGCTG

CCCGGTGAC4GAAAGTGAAAATGTTGTCTGCAGTTAGA 
CACTGTTTTTGTCTAACTGCGCCTGAATTTAATCT 
GGCAGTTCCX^GAGGTTATTCCAGCACGTCAGCATTAAAGGAGAGAOT 
GAAAGGCGGCGTCCGCGTGGCCTGGGGCTTGTGTGAGGGGTGCACITGTGCCCCCAGAT 

T CG TG GGGGG CTG CGGG CACTTTGG AGAAAGAAAACC AAGTT AC AAACT CAAAATT AC CT

GAAAATAAAACATGACAAATGCCAGGTCTTTTTGTCTGAAGTCGTTTTAGTC 
GGGGACACGGAGCTCTTCCCCCATCAGTGCCCAGGAGAAGCTGCTCCCGGCCCTA 
CTCn3TGGGGGTCCCITTCX3TGGCCAGAGCCCCCTTCCT 
CTGCTGTGATGACCCAGGCACAGAAACAGCTTTGACCACAGAACGT 

GGATCTGGGGGTCCTTCACCAGGGGTGGGCGTGGGCCCCGAGCCCAGACAGGAGCC

AAGACGCGCTTCCCTCCCCCGGTACTAACCAGCCCTTCTGCTGGCCCTGCAACC4ATGACCTAC4GCTTGAGAATCTGTTTCTG 

GTTGGTACAGAGGGAGCCGGGGTCCTCTGGCCTTGGGTCCACCCCAGCGATCATCAC 
GGGGGG^CX5GGGGGCGGCTGTGAAACTGAGGTCCCCGTGCACAGATCCCACC^ 

GCACTTCCCAGGCAGCAGAACCGGCCCCTCTAGGAAGCCC

GTGTGAGCTGAGGGGCTCAGAGCACCCCACAAGGGATCCGCCTGATTGCAGACA 
CCTTCTGCTC^GCCCGGCAGCCTCTrCTTCC 

CCACTGCATGTAAAGCTTGATCCTGAAACAGTGAAGTTCTTGAGGTCTCTG 
CCATTGGCC^AAAAGCACAGCCTGGGTGGGCCITGAACACCT 

CACCCTGGGGCTGGGCCTGAGCTCGGCrcraVCrcCAGA

CATCTGATCCTGCCACCGTGGCGGCTGCCTCCAAGCTCCTGACAACGTC^ 

CTT AAGAGACTTTCCATGTGTCGGCTGTGGG CX5CCTTGACG CACCAGGGCCTGGTCACAGGTTCTC AAAGC!AG AG ATTCTCCTGGC 
ACCCCT CGTGCGCCCTCCAAGGGCCTAAAACGCAGACGACCCACACTCCATC CCCC 
TTTCTGCAGCTTGTC<3GGGTCAAGCCGGGACCCCTACACTG 

TOVGACAGAGCGGATGATGCTGAC&TTTCCAGCC

GAGCCCAGCTGCCX^GTGCrTCTGGGGTTGGAGCTGGGCTGGCC 
ACTGAAGGCCCCACAACCACCTGAGGGAGACAGCTTTTATTAAATTTC^ 

agagaagcg tg ttctcacg ctt actg t caacgttgcg aggg aagcgtgtt ctc atg ctc actgtcaacattgt aaacctg agc^ 
gcgtgttctcagctcattgtcx1acgtaaaccccagggaagcgtgttctcacgctca 
ctcgc^ctcactgtcgacgtaaacctgagggaagcgt^t-t
ACACTCACTGTTGACGTTGCAAGGGAAGCG

ACTGTCGACXSTTGCAAGGGAAGCGTGTTCTCACACTCACTGTCGACGTTG^ C 
TGAGGGAAG CX3TGTTCTCACGCTCACTGTTGACGTAAACCTGAGGGAAGTGTGTT CTCACACTCACTGTCAA(XnTX3T AAACCTGA 
GGGAAGCGTGTTCTCACACTCACTGTCGACGTTGTGAGGGAAGCGTGTTCTCA 
CGTGTTCTCGCGCTC&CTGTCGACGTTG

GCTCGCTGTCAACGTAAACCTGAGGGAAGCGTGTTCTCACGCTCACr^ 
CGCTGTCGACGTTGTAAACCTGAGGGAAGCGTGTTCTCACGCTCGCTGTCGA 
TGTCGACGTTGTAAACCTGAGGGAAGCGTGTTCTCGCGCTCACT^ 
CGACATTGTAAACCTGAGGGAAGCGTGTTCTCGCGCTCACTGTCGACGTTGTA^ 
CXSACGTAAACCIXSAGGGAAGCGTGTTCTCrCACrGTCAACTTTG

CTGAGGGAAGCGTGTTCTCACACTCACTGTCGACGTTGCAAGGGAAGCGTGTTCTCACACr 

TATTCTCACGCTGTCXSACATAAACCTGAGGGAAGarrGTTCrCACGCT 

CTCACTGTCAACGTTGTAAACCTCAGGGAAGCGTGTTCTCACACT 

GTCGACATTGT AAACCTGAGGGAAGCGTGTTCTCACG CT CACTG TCGACGTTG CGAGGGAAG CGTTTT CACACTGTCGACG T AAAC 
crGAGGGAAGCGTGTTCTCACGCTCGCTGTCAACGTAAACCTGAGGGAAGCGTGTTCTCA

AGGGAAGCX3TGTTCTCACGCTCGCTGTCAACX3TAAACCTGAGG 

GCGTGTTCTCACX3CTCGCTGTCGACGTAAACCTCAGGGAAGCGTGTC 

TGTTCTCACACTCACTCGACGTGAGGGAAGCGTGTTCTCACGCTCACTGTCGACAT^ 

TCACTGTCGACGTTGTAAACCTGAGGGAAGCGTGTTCTCACGCTCACCGTCGAC^ 
TCACCGTCGACGTTGTAAACCTGAGGGAAGCGTGTTCTCA.CGCTCACT

TCACTGTCAACGTTGTGAGGGAAGGGTGTTTTGACGCT 

TTGTAAACCCGAGGGAAGCGTGTTCTCACGCTCACTGTCAACGTTGTGAGGGAAGGGTGTTTTCACGCT 

TGGAAGCGTGTTCTCATGCTCACCGTCGACACTGTAAACCTGAGT^ 

CGTGTTCTGAIX3CTCACTGTCGACGTTGTGAGGGAAGCGTGTT 

CTCTCTCACATGCTGTGGGGTGCTGGCCCCACCAGGCCTCCCAGGAGACCT

GAGGTCCGAGATGTGCTCGCCAAAACCACCCCCAGCAAGACGCCAGGCATC 

GAGCGTGTTTTAAAGCAGGAGGGATTTGGGCTGGAAGATTGCTCCTGCAGAAA 

GTCAGGATCCAGACTTCTGTGAAAGAGCCACAGGGCTCAGCATCCGACGGGCGCA 

GGGGTGGGG C CGGGAG CCTG AG ATGGG ATG AGAGGG CGGGAAGGGGCTTGTTG AG AAAT CT AAGGG CCACGGGC CAC CAAAAACAA 

ACCAGGAGGTGTGAGTCAGTGGGGAGGCTGGGTGCCAGGAAGC

TTGGCGACGCCAGCCGCGCGCCTTGGGAGGGATCCGCCTGTCCACGTGCAGCCGCCTCCGGGCGGCGTCGG 

CCGTGGCTCTGTGGCTCCAGCCGGAATGGCAAAG CCTGGCTCCACAG CTGCCTGGGAGCGTG AGTCACGCG CCCCACAGAACACGG 
CGGGCAGCGCGGGCGAGXSATTG^CGAAAGCCG ACCGATGTGTTTTGG CTTGTGTGGTCTGTGGG AG CGTCCTTTGTCATCAGCCTA 

TGCCGTGCTGTCCTGGCCCCGTCGCTGCCACCTGAGGCCCCCATCAAGTACCTGAGCTCTGCGTCT
GTAGGAAGCCCCCCACCGATGTTGCTCGGGGTCGACGTGATAAACCTGAGGGAAAGGGAATTCTCT 
ATGGGTGTGGGGGCAAAGTGAGATCATGAACAAAAGCATCTGGGCCCGTCCTGGACACATGGCCAGGGT^ 
CTCCACGGAGCTGCAGTTTCAGCCTGGAGGTCTGTGGCGTGCATGGTTGGAATACTCACCAGACTCAC 

GATCATG C CTT CCTGGAAACCTTGG AACG CG CTT CAGCAGAAACT C C AGTG AAG AGTCGGTG AG AG AG C ACAAATT CC C AAAGG CG 

GATGGAGTTCGGAAGGACCCTTCTCTGTCTCCCACATCAGGC

CCTCACCTGCTCTG<IAAACTGATCTTGGCT CTC CTCAGC CTCG AGGGATGGGTG AT AAGG CAG CTT 

CATCCGCTG TTTTCTTTC ATTGTTTG AG AAAAGT CATGCCC CGG AGGTGTCATTCCTGG TTGTTTATGTTGATGG C AAAACTTGGA 
CTTTCTTGTCATACAGCAAAC^GAAGACAATCTGATCTTTGTTTTCCAGGGGTTT 
CTGTAGTGCATCCCTGGTAGAGTAATTCAAGCTGAACACAAGGTTG 

GATCATTCTTTGCTGTCJCACAGCAGCTGCAGCCTGCAGGCAGCTCAG

ATGTTACTGTTGGQX3AAGAAGGGGGTCCGGGGCACT 

GGTGGAAGAAGGGAGTCCGGGGC^CTGTCTCAGTCTrCGGGGCITCCATAGTGGAGCAC 

GGGGTCCGGGGC ACTGT CT CAGT CTTTGG GGCTTC CAT AG TGGAG CAC CACCGG CTGGG CAC CTTGTAAACAACAG ACATTGATTT 
CTCACAGCTCrGGGGGCTGGAAGTCCGAGGTTGGGACTGGTACGGTCGGGTTCrGG 

AATCCTGCCATATCCTCACATGGCAGGACAGGCGATTT^^

'CCTGATGACATTGTTTAACCCCCAGTTACCTCCTGCAAACACCACrCT 

TGAGGTTGAGGGGACACGAGTCCGTCCATAGCAGTGACCCCACTTGTCTAAAGCCACAGGGCT 

CTGCCCATCCCCAGGCCCCC^TTCTTGCCCCCAGGCCC^^ 

AGGAGCCAGGAGCTGCCGGGCACCAACCACACAGGAAGCCAACAC^n'GGGGCA 

TTTTTATTCTGAGATCTTGTGACCTCTGTCTTC

GAGCCAGGCCACACCAAGGCAGCCAAGGGGACTACGTTGGGAGGAGGATC 

GGTGGGTGAAGCCCACTCATGTCCTGGCTCCTGTTACCGGGTAGGGATGGCAGCCCATTTTCCAGG 
CCAGGATTGTGCaSAATGCTCTTTGGCTGACGATGGCCGTGGCGGGGGOT 

CTCACTGCCTGCGTCTGTGGCCAGAATTGGCACCTGCATGGGGCCTGGAAA

GACTCAGCTCCCAACGTCACTGCTACCGAGGCAGAGACTCTGCAGCGGCATCA 

TGGGACGGGCCCTCCGTGCCTTCCACGGAGGATGCAAGCGTGAGGCCTGGGTCTGCGGGGACGGTTTACCT 

CGCTGACCACTGCGCGCCGCCTCCACAGACAGTCCC CATGGACG CTGGGGCTGC CGTGG ACGATGGGGCAGCCATCATGGACAACG 
GGACTGCCATGGACACCGCAGCCAGCCTGCTCTGTGGGAATATGCrTTGTAAAG 

ACACTCTGCTCTTTGTAACGAGTTTCATAAACTCACCCTCAGGTGAT^

CTGCCCTTTTTCTGAGCCCAfcCTTTCGCCACCrrCTACCCCAGCCCCTCATGGCrr 

GCAGGGTG AGCCCGGAACACGGGCAACAGCTTGAACTGGG CCTGTGCGGCCC CCTCCACGGGCGACAG CGTGAACTGGG CCTGTG C 

GGCCCCCCTCCACGGGTGAGACHrCCAGGCGCTCTTGCAGACGCT 

GATTTACAGAGCAGGCATCCATCTTCTCTGAAACCCACACAAAAAGCACAGT^ 

TTAAGCTTCTTAGATGCTGGGGTGTGGAAGCAGCCGTGTGGCTTTCCTTTG

CTGTATTGGGAAGGCTGCAGCTCTCCAGGACATGACTTAGCAGAGAACATCCC^ 

CAGTGCCCCTTAGGGCAAGGACAGGGAGATGGGGCAGTCGCCTGGGGA 
GGATGGCACAAGACGCCGCTGCCAGGGCTGAGAAACCATCCX3GCCAACCCCAGACT 
AA^CGCGTGGAGACCCCAGCCCAGGGAGGGGCACTGTGGCAGGAACAGGGCCCG
AGCACGGTGACCACGTCGCAGCTCATGCTGCCA^

GGGTGGGGCACGTGCGGTGATCCCAGGTTTCAAQTAGCTGCCCCCAGCTCCCTGCAAATA 

CAGGCACACGAGTTTATTCCTGAGAGGAGCGTGTTGCCACAGCCTCTC 

CTGCCTGTGGGTGGGGGGACAGGAGCTGCCAACCTGGCGGGGTCCCAGGGAAGCG 

GGTGGATGCATCAGTGCTGGGCCTGCTCCGAACCCGCAG<^CAAG 

TGGGGAGGTGGGTCGCGTGTCCOVGAGCCCTCTC 

GCTGGAGAGATGCGGGGGTTAATCCCGCTACCACCGCGGTTT^ 

AGCAGCAGCAGACGCCACCCGGTTGCCCCAGCCCCAGCGGCTCTCGGGCATAGCCGGCCTTGG 
GACAGCTGTGCACGGGGCCTCTAGTGTCCGCCAGAGCTGGTGGGTGAGAGTTTT^ 
CACTGGCCCCCATGCCTGGCCATGTTTCTGGGCCCTGGCTGAAAATCCCGAGCAG 
AACCTTCCAGCCATGTGCTTTTGCTTAAGATCTAAACTCTGAAAATTACC^ 

ACCAGCATCTTCTGAGCGGGAGAG ATGTGGGTG CCCACCGGCGTCTCACTCCCACATCTGGGCCroTTTCCAG TGAC CCCTGCAC C 

TCTCAAAGGCGCAGGCCTGAGTTCACAGAGAGCACTTGGCCAAGCCCTGCCTCTG 

TCATCATTTCTGGATTTACCACATGCTTCAAGCTGGAACAGCAGGAAGGGGCAGTTCCTG^ 

GGGCGCCATGGCCTAAGGCAAGACGCGTGCCCTGGATGAAAATGACrrCCAGCTCTGTAGTCTCCAG^ 

CTCCACGGACACCGCCTCCTGCCACCCCTTCAAAAAAGTGTTGCAGCCCAGGGCCCAGATA 

CGGGTAGGGTTTCCCAATCAACATGAGGGCGCCGTCTCCTCTGCAGCCCCCTGACTTG 

CTGTGAACCCTCAGCTGAAGAGGGCAGCCCCAGAGGCAGCCACATTCAG 

CGGCGCTGCCAAGGCCTGTGCACCTTCTTCAGCCCGAACTAAGGAGAATGTGC 

CTTGAGCGTTGTCTTCAGATGGGGCCCAGTGGGCCTCACTGTTGTGACG 

CTGGTCCTGGGGTGTCACTCCAGCGTGACCTGGTTCCTGAGGTGT AATTC^ CGTGACCTGGTTCCTGGGGTATAATTCCAG CGC 

ATGACCTGGTTCCTGGGGTGTCATTCCAGCGTGACCTGGTCCTGGGGTGTCATTCC 

AGCGTGAGCTGTTTCCTGGGGTGTCATTCC^CATGACCTGGTTCCTGAGGTGTAATT 

TCCAGCATGACCTCGTTCCTGGGGTGTAATTCCAGCGTGACCTGGTCCTGGGGTGTAATTCCAGCATC 

AATTCCAGCGTGACCTGCTCCTGAGGTGTCACTCCAGCGTGACCTCGTTCCTGGGGTGT 

TGTAATTCCAGCGTGACCTGGTTCCTGGGGTGTCATTCCAGCGTGACCTGGTCCTGGGGTCT 

GGTGTCACTCCAGCATGACCTGGTTCCTGGGGTGTCATTCCAGCGTGACCTGGTCCTGGGCT 

TCCTGAGGTGTAATTCCAG 

HUMAN SEQUENCE - mRNA 

GAATTCCGGGCCGGGAGAACCGAACCCCTGGCGGCCGCGACCCCG^ 

GGAGGACCAGGAGTTCGACTTCGAGTTCCTCTTCGAGTTTAACCAGCGCGAC^ 

GCTATGCATCCTCCAACGT(^GCCCCGCCCTGCrc 

TCCACACCGGGCATCATCCCGCCGGCGGATCACCCCTCGGGGTACGGAGCAGCTTTGGACGGTGGGCCT 

CTCCGGCCACACCAGGCCTGATGGGGCCCCTGCCCTGGAGAGTCCTCGCATCGAGATAACCTCGTC 

ATAACCAGTTTTTCQ\CGATGTGGAGGTGGAAGACGTCCT 

AGCCTGGAGGCCTACAGAGACCCCTCGTGCCTGAGCCCGGCCAGCAGCCTGTCCTCCCGGAGCT 

CGAGTCCAACTACTCGTACCCGTACGCGTCCCCCCAGACGTCGCCATGGCAGTCTCCCTGC^ 

AGGAGGGCTTTCCCCGCGGGCTGGGGGCCTGCACACTGCTGAGTTCCCCGCGGCACTCCCCCTCCACCT 




CCACCCAGTACACCAGCTCGGCCATCGTGXSCCGCCATCAACGC^ 
GTCAAGTCCCGCAAGACCACCCTGGAGCAGCCGCCCTCAGTGGCGCTCAA 

GCCCCCGGCCGACTTCG CG CCCGAAGACTACTCCTCTTTCCAGCACATCAGG AAGGGCGGCTTCTGCGACCAGTA 
CGCAGCACCCCTACCAGTGGGCGAAGCCCAAGCCCCTGTCCCCTACGTCCTACATGAGCCCGACCCTGC 
CTGCCGTCCCACTCAGGCCCGTATGAGCTTCGGATTGAGGTGCAGCCCAAGTCCCACC^ 
CCXKXX3GGCCGTGAAGGCGTCGGCCGGAGGACACCCCATCGTGCAGCTGCATGGCTACTTGGAGAATG 

TTTTCATTGGGACGGCGGACGACCGCCTGCTG CGCCCGCACGCCTTCT ACCAGGTG CACCGCATCACAGGGAAGACCGTGTCCACC 

ACCAGCCACGAGGCCATCCTCTCCAACACCAAAGTCCTGGAGATCCCACTCCTGCCGGAGAACA 

TGCCGGAATCCTGAAACTCAGAAACTCCGACATTGAACTTCGGA 

TGGTGTTCCGCGTTCACGTCCCGCAACCCAGCGGCCGCACGCTGTCCCTGCAGGTGGCCTC 
TCAGCTCAGGAGCTGCCTCTGGTGGAGAAGCAGAGCACGGA 

CAACTTCCTGC^GGACTCCAAGGTCATTTTCGTGGAGAAAGCCCCAGATGGCCACCATGT 
GGGACCTGTGCAAGCCGAATTCTCTGGTGGTTGAGATCCCGCCGTTTCGGAATCAGAGGATAACCAG 
TACGTCTGCAACGGGAAGAGAAAGCGAAGCCAGTACCAGCGTTTCACCTACCTTCCCGCC^ 
CACTGATGATTATGAGCCTGCTCCAACCTGTGGACCGGTGAGCCAGGGGTTAAGTCCTCT 

T CG CG ATG CCACCCG AC CC CAG CTC CTG CCTCGTGGCCGG CTT CCCG C C CTG TCCG CAG AG AAG CAC CCTGATG C CAG CGG CCCCT 

GGCGTGAGCCCCAAGCTCCACGACCTTTCTCCCXSCTGCCrACACCAAGGGCGTTGCCAGCCCG 

GCAGCCGGCCGGAGAGGCCCCCGCCGTCCAGGACGTGCCCAGGCC^ 

CCCTGCTGCCACAGGAGTAAATGAAATAATACGAAATGACCT 

AGTTCAGCAGGGGTATGCTGACTTCAGCAGACAAAGACTTTTG 

AACTGACTGAATGCCAG GAG CTG AACATT AAT ATGTG CAAAG ATTGG CT CTCCAACAAG AAGG AAAGCAGGG AGG AAGGG AG ACCA 

CTGTGTCGCCTGGAGGAGAAGTCATCTGATGACAAC^ 

TCATGGGAACGGCCCACACGCAGTTTGACCCCACGCCCAGCCCTTCTGGC^ 

CCAGACCATCA 

HUMAN SEQUENCE - CODING 

ATGACGGGGCrrGGAGGACOVGGAGTTCGACTTCGAGTTC 

AGAACACTATGGCTATGC^TCCTCCAACGTCAGCCCCGCCCTGCCGCTCCCCACGGCGCACT 
ACCTTCAGACCTCCAC1ACCGGGCATCATCCCGCCGGCGGATCACCCCTCGGGGTACGG 
TACTTCCTCTCCTCCGGCCACACCAGGCCTGATGGGGCCCCT^ 
GTACOVCAACAATAACCAGTTTTTCCACGATGTGGAGGTG 
GCCTCCTCCTACGAGTCCAACTACTCGTACCCGTACGCCT

CACGGACCCCGAGGAG<^CriTTCCCCXSCG<^CTGGGG^ 

GCGCCAGCGTCACTGAGGAGAGCTCGCTGGGTGCCCGCT^ 

GGCCGGCAGCCGCCCTACTCACCCCACCACTCGCCC^^ 

GTTGGGCAACACCACCCAGTACACCAGCTCGGCCATCOT 

ATGGCGTCCCTGTGAAGTCCCGGAAGACCACCCTGGAG 



CCTGGCGGTGCCGCAGCACCCCTACCAGTGGGCGAAGCCCAAGCCC^ 

TGGACTGGCAGCTGCCCTCCCACTCAGGCCCGTATGA^ 
ACGGAGGGCAGCCGGGGGGCCGTGAAGGCGTCGGCCGGAGGACJVCCCCATCXSTGCA^

GATG CTGCAGCri'ri'CATTGGGACGGC^XSACGACCG CCTG CTGCG CC CGCACG C CTTCT ACCAGGTG CACCGCAT CACAGGGAAGA 

CCGTGTCCACCACCAGCCACGAGGCCATCCTCTCCAACA^ 

GTCATTGACTGTGCCGGAATCCTGAAACTCAGAA^ 

ACGGGTACGGCTGGTGTTCCGCGTTCACGTCCCGCAACCC^ 
G CT C CCAGCGCTCAGCT CAGG AGCTGCCTCTGGTGGAG AAG CAGAG CACGGACAG CT ATC CGG TCGTGGGCGG GAAG AAGATGGTC

CTGTCTGGCCACAACTTCCTGCAGGACTCCAAGGTCATTTTCGTGGA 

GAAAACTGACCGGGACCTGTGCAAGCCGAATTCTCTGGTGGTO 

ACGTCAGTTTCTACGTCTGCAACGGGAAGAGAAAGC 

AAAACAGAACCCACTGATGATTATGAGCCTGCTCCAACCTGTGGACCGGTGAGCCAGGGG 






CAGCGGCCCCTGGCGTGAGCCCCAAGCTCCACGACCTTTCTCCCG 

CTCX5GACTCCCGCAGCCGGCCGGAGAGGCCCCCGCCGTCCAGGACGTGCCCAGGCCAGTGG 
GCCACCCCCGGCCCTGCTG CCACAGCAGT AA 
Table 51

MOUSE NOMENCLATURE 
ICSGNM Syk 
Celera mCG3409 


HUMAN NOMENCLATURE 
HGNC SYK 
Celera hCG37721 

MOUSE SEQUENCE - GENOMIC

AACTTTTAATAAAATAT AGTTGCAT ACAGGTG AAT ATAGT CTT AG CAT AC71AAT AACITX3TTTT CT CTAAACTGT AGCATTCCTGG 
ATGCTTTGTGAAAGAGATTGGGTCTTTTCATGGAAGTGTGGGGAG^ 

ATTC CTT ATTTAAGGATT CACTGTTT CTrGATCTTGGGGTGAAT AACC CAG T CAATGTGTTG C CTTGCAAGCATGAGG ACCTGGGG 

TCAACCCCATGCAGGCCTACMAACAACACCAGTAGCAGCAGGC^ 
GGGATTCCTGGGGCTTCCTGGCC^GCCTAGCCrGTTCGGAAAG

GG AGGGGG TCTG AAAAAC^UUU^GCT AAGGTTGATCTCTC CACACACACATACACACACACACACAAACAT ATA 

ATCCACACTCGCATCCAC^CACJVCACAOVC^CACA^ 

TCTGTCATTTTTTGTGGACATGCACGGAATGATTTCAITCT^ 

AAGGTTTATTTGGACCCTTTCAGGTCCCAGTCTGTCATTGAGAGGAGTC^ 
TGTAAGGATCCTGCTTGCTGCrC^CGTTCrGACCTGTG

GCCGCCCATGGTGGGCAGGACCTTCCCACATAAACC^TCAATCAAGACAATCT 

GACAATCACTCAATCGAGGTTTTGGATGACTCCTTGCTTC 

TTGTCTCACAACATAAACAGTGCCC A1TT T CTT GATCTATTC 

GTGCCATCCCTACTGAAAATAGTCCTAAGCGTGGGGCTAAAGTGCTGCCTACTGTCCCTAAAC^ 

ATTAAGTAAATTCCCATGTTGGGTGAGATTCTTACAAGTATTGGAGTCCTGCCCAG

GGAGTTG TITC ' l ' lT AAACAGATGCATATCTCCTACAAGGTTGTTCCTTGTCT 
TTAACTTTAAAAGTTCCCTGAGAAAGAGTGGTTGAAATTTTGTAAATT 
GAAGTAACTATATTTAGTGTCATCCATTGGAAAGTAGGACCAAGGTCAGGGGA 
GATGCTTGTC^TGGTAAAATAACCTGAACACTGGGGTTCATCCCTGA 

TGCTGCrCTGAGGGGATCTAGGCAGAGTACTCTGGCCCCTGGA^

AAG TCTT CT C CAGG CTAC C AGAG C CCAGACCACTGGTTTGTTG ACAGT ATG TCGAGAT AG TTGTTAGT CGCTGG T TGTG CCTCTGG 

TTTGCCTTTGGGTATCTGCTTTGGGTACCAGGATCTAAGG 

ATAGCCGGGCAGACIACGGCrGTTGCTCAGTGTCTCTGGAGGT 

AAAGGTCACACAGTCCTCCTTAGTCCTGGGACTGGGGAACAGGCIATC

TGAGTAATGTGCTTGCTCTGCAGCATGGAGACATGGGCTGACAATC^ 

CT AGGGAAGTAAAG CTAC^CAG AGCCCTGGAGCTCCGTAGCCAGTCT ACC CAAGCCAGGGAG 

TTACAGAAAACAATGTTAAAAAACGAGAGGAAGATACCAGATGTTATGTGTGOVTAGATTATGTGTTCT 

TCTTCTGGCTGCCTCTCTCTCCTCTCTCTGTGTCTCrCTGTCT 

ACACATGCTCACATACATACACATGCACACTCGCAC^

ACTGGTTATGACTGCTCACCAATGCTCAGGGCTAGCAGAGGGAG 
GGCTTCCTCCATTAGCTAAACTACAGCTTTGGAGGAACGGCACCTT^ 
TCTAAGCGCAAGCCCCTCTCATGCTCCACTCTGCACACCCATGGAG 
AGCAGAGGGAGGTGAGAATCAACTCTGAGGGCTTCAGTCTGAGCTAGAGG 

TCTTTCCTGGAGTCCTAGGGCTCCCAGGAAGCCAGTCATAGACTCGTGGGCATCTCTGA^

CTGCAGGAAGTGTCCTCTGGTTAGCCCCTGGCATAAACCTAGCAAGGATCTAGCCT 
AC03GTACAGTGTTCTGAGGGTGGGATGACAC(^TTTGGTACCCAACTGT 
TGATGTGTCTGTCCCTGTTTGGTGAGCAGCTCCCT^ 
TCTTCCrATCTGCGCAGAGCTTCCCATCCCAAGGTGCCAGTTGCCATC 

CTCTGAC^CTGCCCTCCATAGTCATAGGAGAGCTGGGGAG^TGCACAGGTGTCAA

AGCTTGCGTTTAGCTTCTGCTCTGCATCCCTGCTGCCTTCTAGGCTT 

GGAGCCAGTCAGTGGACrGCAACCTCAGAGAAGGAAGGTGTGGTTGGGGTCACTG 
TAGTCATCTGTAATTGTGCCCCTACCTATGTGTACATATGGC^ 

TAGCTAGAGAATGGGAGTTCTAGCTATGTAGCCATGGGGAAGCOVTCATCG

GGGTCCCCTACCCATAAACCCCTTGGCCATGTTCTATAACTAAGCCTAAAAAACCCGTTGGTC^ 

TCAAAATTCTITGGACCCTTATC^GGAAGGGGGAGATGTTTATGTGATCTCT^ 

AAAATGGTCTTCAAATAAATGAGGCTACATTGAGTTATTTTAATAGATT 

TTGTGCTCACGTCACTGCAACTAATCTAGTGGTAGTAGAAGAGAAGTCCCCCCCCACCCTAAACACACA 

GTCTCTCIXn , CAAGACTACCCAGGCTGAGCAGCTCCGAGATGCTACTC

GCAATGCAGATGTCCTCGTATGTAAATAGAGGGTGAGGAGGGTGTCAAGACCACATGTGGAA 

GAGCTTAGGGCAGATGCTGGACAGCTGGGACCATCCCCAGCTGTGTTTGTT^ 

GCrTCAGGGTTGGTGATATATATATATATATTTATTTATTTAAAGAAGCTTATA 

ATATGCACATG CTGG AGATACTGACG AATAGGAAGAATGTATTGACTGCCTGTAGG CAGG C CTTTGAGGTACTT AAG AACAAT AT A 

6 5 TGCCAATGATGTCCT AGGGCTTCAGAGAAAAGG CAGGG AAT CCAATGAG ACAGC CTTTCATTACTGAAGCATGGAG CAGGTGACAG 

AGGTCXCTTCCTTCAAAAAACCATGAGTTTGTCTAAGACATGA 
CTGGTGCGCTGCTCTCCAGGATCCTCTGGACTTATACAAAT^ 
AAACTCTCTAAAGAAGAGCATGTGTCCGTTAGGCACATAGTTTC 
CTGTCOUaACOTTTGAAACTTACATTTTCATATAA^ 

7 0 ATGAGGATGTGTTGGGGATCAATTTG AGGGCTAAAGG CACACACTGAAAAAT CAGATCATGAGGCTATCAAGGGGGATGGGAGGAT 

CTTAGACCA(^GGCCTGGGAAGGAGGAAAGTCATTAGCGAAAATGCTGTA^ 

AGGACAATGTGGGAC!ACAGCTGGAGTGGCTGGAGTCAGGCCAAGGAGG^ 

G^CAAAGAAT^GGCCC fc JGAAaaGAGGACCAGGTAA^ 

ACCAGTGG CTTGGGG AG C CATTT ACAGACT CTGTGTTG CT AC-GAAACTCCCTCAAGACCCTTCAG C CATTGGCCACATTGGTGAAG 
ATTGGGACT AG TCT ACTCCATG TGTGTCCATGCTT AGAG AG AGAACAAT CTTTT ATG ACTCT AGGGATG AT CT ATGGAAG ACAGGA 

GAGAGGGAAGGCTCAAAACACrCTTCAGTATCATCTTCACTGGCTATTGCAGTTGCTAT 

AGTGGTCCCCCCIAGATGAGGGTTCAGGGTAAGGAGTGGAGTTGGGAA 

G AACCJ^CACAT AGGGTTGACCT CACT AG AAAGACTGGATGT C CATTT AATCAG CAAATT ACGGT CAGAAAAAAAAATG AATG ACC 
5 CTCCACCAAACATCAAGAGCTGCTCCAGGCATGGAGGAATTATGGTTCC^ 
AAGGGAGGTCACCTTGTTACACIKX1ACCATGGAGAAGACTGAAGCA 

AGAAAAAAAATC CCCTCAGTCTGGGTGGT AG CTT AGTGGGTAGGGTG CTTGCTTTG CGGGG ACAAGGGCTTGAGTTCACT AACCAG 
AACCCATGCAAAAGAGCTAAGCTTGGTGGCTCA 
CAGCAAGGAGGCCAGAGTTTATTTATCTITTGGTAATTCCTCTGCG 
1 0 CAGCTACAAACTCITGCTAGCAGCAGCAGAGAACTGAGAAGAAAAATC 

CTCCTGGCTGGTCAGGGGAGAGGCTGGGAGCTTATCAACTCTTTGTGAACCA^ 

TCACACACACATG CAG ACTTTGG ATGAAAAAAGAAAAGAGG CTTG ATTCACTT CACACACACACACACACACACACACACACACAC 
ACACACACACACACCTTCTGATTGGGCCCGACTAA 

TTATATATTCATGCACAC^CACACATACACACATTTAGTAGAGCAAAAGCTCCCATGAGCAAGT^ 
1 5 AGAAAAAAAATCACGCATTCTGGATGAAAAATAAACTCCAATCTTTATTGCT 

GAAAGTAAGGAAAGGCTCATAAGTAAGAAAATCTCATCCTCTTTATTGCTGAGAAAAATC 

ATGAAAGAAGGCTTGTGTCCTACTGACATCGTTCTTAGTTCTTACACTT^ 

TTACATTCTGAAAAGGGTTGATAGCGAGTTCAAGACACTAATTC^ 

ATGTGTTTCCATTAAGACTAGTATCTAAATAAACATTCACTATCTAGCTTCT 
2 0 ATTCTTATGGTGTAAGTTAGCATGGTTTTGTCAAGAGG CAAATTATTTG CCTTGTGTCTC ATTAGTTTTGAAGTTTGTATAGGTAA 

ATCCAGTCAATATTTTATCCTCTGTCCTTTTACCTGCAGTA^ 

TCTCAGCTTGCCTAGAGGGAATATTTTTTTCCTCATCAT 

ACACATGAACAGTTACAGAACrCTAATTGGAG CTTCC AT ACAAT AG AGG ATT CAAAATT ACTTGT AT AAATTT AGG AATT CT ATAT 
AACCATTATAATCATTATTAGGAATTACAAAATCATCX^TTTGTCTACATAGCATTA 

2 5 CAGAAAATCTGCCCAACAGACTCCAGGGTGCCCAGGAGTCATTAGAATACGCAATAATGAC^ 

TCTGAGCTCACCCATCACAGGCCATGCAAAAAATGGTGGCCCACTCCAGGAAAGAGACAGGAGGATCT^ 

CAG ACT AGAG T ATCTGG CAAG CT CCAGGT C CAAT AAG AGGACTGTTTCATAAAAAAGG AG AGTGATTG AAAACAGC CTCTGAAGTT 

GACCTCTGGCCTCTAAATGTAAACATGCAGAAGCATCCACATC 

TTGCATGCATAATrTACATCCTACAAGGCCTCAA 

3 0 C^GAGCTACCTGTAAGGTGTCTGCAGAGGTAAGTACTGGCCAACT 

TCAGGACTGGGGTCTTACTACTTTCCTCAAGTTAATGCITTTAO 
TAGTGTTGCACTGAGAAGGAAGAGCAAGTGTCTCAGGACTGTTGAGGGTGA 
CAGAAGAGGGGAGCTCAGATGTGAC^TGCAAGCCACAGTGGT^GG 
GACACCATGTTACCTGTGCCCACATGACATCTTTGAGTTGTGCAGCCAC^ 

3 5 CTAACTCCATGTGGCATGAGTTACACAGCATATACACAT^^ 

GTGTGTATGTGAGGTATGTGTGTATGCTGTGrTAAGTGTGTTCAGTGTGAGGTGTGCTTGTGTC 
TATATACTGTATGTGGTATATGTXK3GCTATATGTTATGATGTCTGTGTGCTGTGTGTACCTCTC 
GTGTACCATGTGGTATGTGTGCATGTGATATACGATATCTGTGATATGTGGAATATATTACGATGTGGTGTG 
GCATGTGGTATGTGTGGGATATGTGTrrACAGTATGCTGTGTGCATGAGAGGTAGTTTGAA 

4 0 CCCAGGTGAAAGGATTTCCTTGTCCAGCATATTCCAGACGACCAGCATTTTTT^ 

GCTGTGTGTCTTTTCCATTCACAATTGTGTCCATGCATTCATTT 

GGCAAG CTCCTCACACAC CT CCACCTACTG ACTG ATCATCTGCTATG CAGGGGAGGTGGG C CG AACTG TCCAGGG CT CT GGAT GTT 

TCTCCTAGTGGCACCCAAGGCTGGAGAGGGCACTCTGGGTAGATGGCACT 

CCCCAGGCAC<IAGATGAGAAAGCCTGTGGTAACGAGGCCACA 

4 5 GT AGG CTCCACAG CCTTGGGATGAAG CTCATCTG T AGG AT ACATTTCT AG CATTCGT GAAG TT CTTGG TT AT ACCCCAG TACCAT A 

TAAACCACATGTGGTAGTTCGTGCCTCTGATCTCAAACTGGGGA 

ACATACTGACTGAGTTCAAGGTCATCTTGGGTAACTTAAAACCC^GAGA 

GTGCCTACTGGGGCCCACCGXXjGKjCTTTGAGTTCT 

GAGGACTGTGTCCAAAGACTACTGTGCACTTCTCTTTTTCTCAGTTTCTTCT^ 

5 0 GAATGATAGTGAATTGGGTCACCTTCGTCAAAAGCCGTGKS 

TAGGTAATAATAGAAAGTACCACACACCAAACACACACACCAGTGTAAGCACT 
AGTCATAGCCAGTGTCTCrTGTGXSTTTGGGTTCTGCATTGTC 

CCAGAACTCTGAAGGGGTGCAGACATGGCGGGAAGTGCTGTGGACAGCGCCAACCACCT 
GGAAGAGGCTGAAGACTACCTGGTCCAGGGAGGCATGACCGATGGGCTCTACCTGCT 

5 5 TTGCTTTGTCGGTGGCTCACAACAGGAAGGCACAC CACTACACCATCGAGAGGGAACTTAATGGCACCTACGCCATCTCCGGGGG C 

AGGGCCCATGCCAGCCCAGCAGACCTCTGCCATTACCACTCCCAGGAACCTGATG 

CCC^CCCCCC^GAGTACAGCCCAAGACCGGACCCTTTGAGGACCTGAAGGAGAACCTCATCA^ 

ACCTTCAGGTGAGTCTGCTGGGCTTGAGCCCACAGAGTTCTCTAAACATATTCAGAGCCT 

TAGCAGATGCCTGTGTG CTGCTGCCCCTCGCTG CCTCTCTTCCCACCCCATCTGCTCATGGGACCAAAGCGGAGTCCTCACATCCT 

6 0 CTCTGCCCTCCTCCITTCACGACGAAGGAGAAGGTTGTGTTTTAACTTTATT 

GGCTATTCATTACC&AGATCTGTTGTTCTCTCACTAGGAAGGAAAAG 
TTATTTCCACCTTGTTTCTTCCTGGTCCAGGCTGCTGGCCAGGGCCT 
GATGGGACCGACGACATGGCTGAACCCTGATAGACATGAAGCCCCAGGTGCCCTC 
AGTTGCATGTACTGATTTCGTGTGTTGGAGGCGGGTGCCTACAGGTCT 

6 5 AGTCTCTCTTCCACCTCATGGAGTTCCGGGAGTCAAGCTGAGACGGCCAGGCT 

AGGACCCCCAACCTCACACACATCTTATAGCTCCTTGGCTGTTATGAAA 
GCTTTGGAGTATCTATTCTCITAGCCCATTGGTTTATGATCTATATATT 

AAGATGATCrAGAGGGAAGCAGAGTTTGGTGTTGATAGGACCTGGGACAGATGGGTCT 

7 0 TCTCTCCATTCACTGCAGGGCCAGGCrCrGG 

AATTCCTGTGAGTGACAAGCCATCTCCCTACCTCCTCACCCAGAGCTAACAGCTTCCCC^ 
TGTCACACACCATGATCnACCAACACGGGAGTTGCTGTCACTTG 
TACCGTGCCTAAAGAAAATGGTGTTCAGTCCCACGAGGGTTGGGTATC 
75 7 ATTTT CT G ATGGAGGAAT ATTTGCACAC^TGT AACGAG AAAT CTTAGT ATCTT ATCTAAf; C CTG CCCT AGAAATT CATTT ATG TT 
TT AG AAGCACTTG ATGCTT AT AAT CT CTAGGCAATCAATTTCCTG TGTG TGTAGGTTTTCCTG CTAATG CATG TAAATG T ATGTG C 
ACCrGTGAATGGAGGTCAGAGGTGTACGTTAGGTATCrCTACAGTTGCTTT 
TTATTGTGCGTGTGCATGCTGGTGGGTCGGGAGTACTTGTGCCACA 
CTTCCACCTTTACATGGGTTCCAAGGATTGAATCCAGGTCACCAGGCTTGTG 

CCCCAGGAATCTTCTGTCTGAACAGAAATAGTTCTGTGGTCTTCA^ 

CTTGACCTGGGTATCTGCAGTGGGTCAGTGTGAGCCTAAA 

GTAGTCTGGGGTGCAGAGAGGCTGGGAGCTGTCCTTGTGTGGTTAAGC 

AGGAATGCCTGCATTTGTCACCCCATGTCCCTACACAGAGGCCTTCT^ 

GATTCAGGGGTGTGGCCTTACAGTCAGGCAACATGATCAGTATCA 

AT CTAG CT AAAGAATTTT CT CTT ACTT CTGAT ATT AAAAAAAAAAATTT AAGGTGCTGAAAAG ATGG C CCAGT AGCT AAGAGCACT 

GGCTGTTTTrCCAAAAGAGCCGAGTTCAATTCT^ 

AGTGCCCTCTTCTGGACTCTGCATGAGCTAGGAATTCATATGGTGCA 

AAGAAAAGAG T AAAATGAAGATTATTTTT AGATT ATCTT AATCATTT CT CTTTTTAAAAACCTTTTGTTGG TT CrTTGTGACTTTT 

AAATCATGCTCCCCAATCTCTCTTATCrCCCCCTTCCCTTGTACCCTCC 

AGATCTCATGGAAGCTGTAGTATGTCACAGTGTGTCCCAGAGCATACCCTTTO 

AATCACTTGTTGGTCTTGTACAAAGCCrcr^ 

GGTGTTGCCCTATGTCATGGAGATCCTGTAGCTTTGGATCT^ 

GGCCATTCAAAGTCCCGAATCTGGGCAGGTATCTTGTGTGTGGCTGTTCTG 

GCCATCTTCCAIGCCACGCATGCTAAGACAGACCCGTGTCCTCTCAGTCCTGTC^ 

CTGTACACACCCCTGCTGGGTGTGAACTCGTGCACAGAGCACACCTCATCCA^ 

CAGCAAGGAGAAAGCTGAGATGGCCACACAGGAGATCTTGGTA 

TCCTCCATTTTGGGATGGGATGCACrcrTGAATCCiXTGTT 

TTCTGGAAGCCTTCCCCrcrrCTCTTTCCTTCCTCACCCTCT 

CAGGGAAGACCAAGTCAACGGGAGTAGGCACCAAAGCTCCCTCCT^ 

GAAGTGCCACTTTGACAGGCCAGACrCTTTACCACATCCTTCATC 

AACTITAGGCCTTCAGCCTGAGCCTACAGGGGCTGCTGACAGAAATC 

CAGGCTACCATTGTATTCCTCAGAGCTGTAAATTTCCT 

AC^TTCCTATTTTCTGACTTGTGACATACATTGTGCAAAGAAGAAG 

GAGCCTGTGACACGATAACATGCCACTTCCTGCCATGGTGTCTCATAAAATATGTGTTTATGTAT 



ACTTGTATAGTGTTGCAACGGGA 
GGACCTCTCTGAGGACGTCCAAGGGGCTCTGAGAGAGAAAGAGGA 

3 5 GTGAGAGCCTTAGTGACCCAGTATTGCC^GAACATGTTGCATGTGACCAGCTATGGGAAGGGAGGTCCCTATGATGTC 

TCTGTTAAAACCCITACTTTACTTAGTAATGGCTCCAGACGA 
TTCAGTACCCTCTAGTTTCAAGAATCTGCTGGGAGTCAAGTTAAGTC 
CACAATGTTCCATTGTGAAGATTAACTGGGTCAAGTTCGGAGACAAGCTGO 
ATGTTTCCAATCAACrCTAAAGAACGCCTACCTTCCCCITGTCCT 

4 0 GCTTAATGTCTGTTGCCTCTCCTCCCCCATGTCACGTGGGCCTGTC^ 

CTAATCTGTCCTTTTITTGCTCATTAACTTTTAAAAAATAGTTACATC 
TTACITTTCACAATTGTTCCTCGGCATTTTTAGGAGCATCTCGTTGGTTGGTT^ 

TG CATCAT CG CT AGTTTGT ATC C CATCT CAT AGTTTT AAT ACTG AG CAT ATC CAGG T AT ACAT AT AATTTTCCT CCATG TCTCTCT 
CTTATCCGATGCCTCTTTTATCATAACTCATGTTTACTTACATCTGGGGTGACCTTTTTGA 

4 5 GTGGGGATGGAGAATGCCCTGTGCTCAAGTGCTTTCCTTCAGGATTCAAGTGGCGG CTG CTACCCTAAAGGCTGGCTCCTGCCCAC 

TGTAGTGCTGGTAACTGTGTTTGTCCTCAACTCCACTAACACrrGGTGTGTGTGTGTGTC 
ACATGCAAATGTATGCAGGTGTACATGATAGTGCATACATACAGAGGGTAGAG^ 
CACATTGCTCCTTCAAAAAGTGTCTCTCACTAGGACCTGGGGCT 
CTGTCTCTCTCCACCTCCCTAGTGCTGGGATTTAAAGGATAT 

5 0 CAAGCATTTTACTGACTGGACTATCTCCTCCTCAGCCCCAAGCTCGTGC^ 

GTTCCAGCTTACGGGGTGATCTATGCCTACTTTTCAAAATTA 

TGAGACAGTAACTGTTGTGAGCCTCCTGGGCCACGGCGAAAGCCGAGTTCAGTGAGTC^ 

GGTGATCAAAGCAGGTCATTC^TTTGATCTGTGTGGGTTTG 

AGTTCAAAGACAGCCGTGGACTGATGTCAGTGTCAGTTTACTGATGCTC 

5 5 AAAGAATCTCTACCTAGGGACTGGAGCAGAAAGACAAAGAGAAA 

CAGCTGTCAGCCTCAGGAGATAGTCCCCCrcGGAGCTCTCCACCCATGAGATA^ 
AACCTGGGGCTTGTTGATTATGCTAGGACATCrrACTACCTGCAAGTCCCAGAGATCCAC 
CAGCACAGGGACTGTAAGCAAAATCATGACACTTGGCTGTGTTTGT^ 
TGTTTGTTTTTGTTTTTCTTTGGTGGTGGTGGTGGGGGAGGGTTTCA 

6 0 TCACTCTGTAGACCAGGCTGGCCTTGAACTCAGAGATCCGCCTGCCTCT 

CTGCTTGGCrTGTTTGTTTGCTTTAATAGGGATTCrAAA 

GCACCCTCTCCACAGTGCAGGGAGAGATGATTGTGGAAGATGTTTCAGAAGACGCAT 
AGATCTGGTTTGCTITAGGGACCGATCTTGCTTTCAGGATCT 
ACAGCGTTTCreTACCAAATGTCACTTATTCCAGGATCCCTGTCTCCT 
6 5 TAGCTCTGAGCAATGATCATTTTCCCACCTATTATTCCCCTCTCTCTTGACCTCTCT 



ACAGAGACTC AGTGGTACTCAGGCTGG AGTGAACCTGCTGTCCTGCT CAG ACCGCAT CCCTAGA G T T TT A 

CACACCGAAGCCTTCAGTATCTCATTATCTCCCAGGCCCT 

GGAGGGCACACAAAGGTAGAGAAGAGGGTTAAGCCCTTCCAGCAAAAGTCACACCCT 






CTCTCTCTCTCTCrCTCTCTCTCTCTCTCTCrCTCTCTCT 

CATGCCCATCCATGCAAATGCAGATGCCCAAGGAAGATTTCCAGAATCTT 

TCTCTCTCTGAGCAGAGGCTCACTGCCTCAGCTG^ 

TGGG CTT ACaGG CACATtiTG AAGATGT CC^ CCTGATTT AC CGCAAGTG a a u i CACC CACTG AACCAT AT CC CAG CCCCTT ACACU 
TGTGTGTGTGTGTGTGTGTGTGTGTGTTTTCACACATT^ 
AGGCTCAACCCTTTATTCATATATGGTGTrTTATTGTGTGACT 
TCATCCXAAAGGAGTTGGCACCTCAGGGTGCTCGTGACTGACTGTAGGAGGG 
ACAAATCTACGATGGCGGAGAAGGGTCTTTCACCCTTCACCAGCTT 
5 CTCTGCCAGGCAAGCGCCATTTCAAAAGAAATC^TTCGTAAAGTTGCAACCT 
AAGGGAAGTGAGGTCACTTCCGGTCCCAAGGCCCAGCTCAGCCTGGACTCT 
TGTACAAACCACCCCAGTTTACAGACACCCTTCTC 

GGATGCTCTGAAAATTCATCTTCCTTCCTTTAGCAAATGCAGAGGCCACAGAGGCCT 
GGACTCGCCCTTCCCCAGATCTCATGATTAGAACXSCATGAGTTGCTGGAA^ 
1 0 CTGAACTGGGAG AGT CACACCT C CT ATTGCCATGCCAGC CAG AT CT CACCTTCTCCGGGGCCTTC CTAAGT ACTGATT C CTC CT AT 
GATGTCAGGACTGCCTTGTGCCTGTGTCTCGAGGTTCAAGGGTGCACCT 
ATGTAGCTCCTGGTACCAGTGGAACAATGAAGTCTCATTCACCTACAGACGCCTC^ 

ACGTGGCTATCATTCCCGTCTCAGGTGGGGGATG CTACCTG CT AATGTCAAACAATGTCTTAAACCCGGCCT^GGCACTGAAGAG A 

AAGGGAAGAAGACCATAGGAGTAGAAGAGTTCATAACCTCCTTTGTC 
1 5 CACCTGTCCTTCACCTGAGTCTAGGGGCTTCACAGACACCCCAAAGCCTGTGCCT 

TCACTGATCTCCAAAGCGCCATGGCCCTGCTGGATCTTTGCCATAGTCCATCT 

TGGCTCTGGTACAGAGCCCTCCTGTCATCTCGAGGTCTGGGTGCTGCGTGCCCACCAC 

CTGCTGCCCACCAGGGCCACCAACTGTAGTGACGGTACTGCT^ 

TCTCTCGCAGGATCAGGGCCAGAGACAACAGCGGCTCCTATGCTCTGTGCCTGCT 
2 0 GACAGGGAOIAGACCGGGAAGCTCTCCATTCCTGAGGGGAAGAAGTTTGACACCCT 

TCTCTCTCTGGACTCACTTCTGTGTGTCTCTGTATCTCTCTCT 

TTCrCTGGACTCTCTTCTCTGTCTCTCCCTGTCTGTCTCTGTCTCT 

CACTGGGATGCTAGTTCCAAAACTCTTAGGAACAAGAGGAGGAACGAGGCT 

TTTCTAGAAGACTTGTGTGAGAAATGTTCACACAGACCGTCATCAAG 

2 5 CCAT AAGTTCCTGGTG C CT CAAG AT CATGTATTTT AGGGCAG CATT AAG AATTTGACAG AAGG CCATGC CAT CAGGGACTGGAAT A 

ACATGTGCATGCATGCACACACACACACACACACA^ 
AACCCTAGGGACTGTGACCTAGCTGGTCCTXSGAGCrrCTGAAATGCC^ 
CCAATATGGGAAGCACCCTCCTAGGCJAGCATCCTCATAGCCGAGCCACCACAG 
CTTCCTGGGCTGCAGACAGCTGCCTGTCCAAAGA 

3 0 CAAGGCAACTCAAGCACCCCMCATTATTCACCCAGCTCCCC^ 

AACTACATTAACTGTGGATGCAGAAGGATCAAAAGTAGTAAATTCAAGGCTAGCCTGGGCTACATAGAAA 

AATAAACAAAAAATTAAGTCAATAAAGGACTAAGTGATGGAGATGCTATGGAGACCCCGAACCCCT 

CGTGATGGTTATTTTTTTTAAACTATTTTAAAA 

AGTACACTCTAGCTAACTTCAGACACACCGGAAGAAGGCGTCAGATCTCATTATGGGTC 

3 5 ATTTGAACTCAGTACCriTCAGAAGAGCAGTCMTGCT 

TAATTTGATACCATCTAAATCACCTGAGAAGAGAGCCTC^GTGAAGGATTC 
TTTTGATTACATTAATTGATGTGGGAAGACCCAGGTCATTGTTGGGTG^ 

GCAGAGAGTG CTCACTAAGCAGT AAGCATG CATGCATTCATTCACTCTCCACT CTTGACTATG CATGTG ACCAGCTGCTTCAAGGC 
TAAAAAGTTTGTTGACTCTATCCAAGGACATCTTTGGCCCATCCCCCG^ 

TAGTGGAACATTACTCTTACAAGCCAGATGGG CTACT AAGAGTCCTCACGGTACCATGCCAAAAG ATTGGTG CACAG ATGGGTGAG 
TGATCGGCACAGAAGAGGCTCTGTTTTGGGGGAGCrAGCCATTGTCAACCT 
T^CGGCTTTACCGTGTGATTAAACTAGATGTCACTCAAATT^ 
CAAACAAACAAAAACTAGAAGATTTTGATTGCATGTGTGGCCAGGATGT^ 

4 5 GTAGGGAATCTGTGAGTTCAAGGCCACTGTCAGACCCTGTCTCAAAGAGAAAAGCT^ 

AAAGAGAAAAGTACAGATGCAGAAAGTTGAGTCTCCCCAGTATGTTCCCATCATACT 
AAACACAGAAGTACACAGAACTGTAACCATGACATATTCTCCTTCCAGGCCACCC 
ATTCTCTGAGGCGTCTCTGCTGGGGAAGCAGCTGTGTCTGGA 
GAGCACATCCCGGAAGATCTGGGAATCCAGTGAGTCTGAGAAAGCCTCAGCCT 

5 0 TTTCTCGGCCGTTTTATTTTCTCTTCATTCTTTGGAAG 

TCAGATGTAGAAAAGGGGAATATATCGTCCACGTTCJU3TCTTGTACTGTCTGTGTGAACTCGTGTG 
ACATGCATATCAGTATAACATGGCCAGCGGATAGGTATCTCCTTGAGGTTGGCTACCCA 
TTATTACTAGGGACTTGACTTAAATTGATTTGACTAATGATT^ 
GTCCAGAGAAGGAAGGAGTCTTTGGCCAGAATGCACATTCCACrCCATGCCACT 

5 5 GTTGGGTTAACCCTTTCATTGCCATGCCTTCAAAG 

AGGTTACTTCACAAGAGCATCTCTTCTCATATCTAGGGTCT^ 

TTGACTTCTATCTTGCTAGATGCTAAAAAACCCMTATCTACCAGGCACATGTTCTGTCT 

ACTGAACCTGTTCATTCCAGGGTTAGTACTTCTTACCAGACTC 

TCTGAACCACAGTGCCCCCTGGTGGCCrCACCTGGTGTTTATCATTCT 

6 0 GGTTGCGTATGTCCCAGGATTGTGTAAATCAAAATGTT^ 

GTGGAGTCTGAGACAGAAGATTTC(!ACAGACAACGTGGCT 
TTATTGAAAATTCTTGACCTCTCTGCCAGAATGATAGGCTCTTCT 
CCCTGGGGCAGGTGAAGGGAGCGGGCAGGGCATTTGCAGAATCGGG^ 
TTTTTCTGAAACTAAACGCTCTATTCCrCTCTGCCXSTTGTGGTTTCrr 

6 5 CCTACTCCTTCCCAAAGCCTGGCCACAAAAAGGTGCTAAAGCGTCCCGTGC 

AGTGCACACATGCCCTCCCTGCCTTGTGGCriTCr^ 
ACCTCCAGTCACTTCAGTGAGCCCCACCTGCTGGCTCTCATTC 

7 0 CACAAGGGGCAGCATGAGGGCITCTGCCIACRTAGAGTTCTTCCCAGT 

GACAGCAGGTTGATTTCTGCATTTTGTTACATGGCTGAATTT^ 

GCCAAAGGCAGTTGTCACCAAGCCTGACAGCCTGAGTGTGATTTC CTGCAACTC ACACAGGGTGG AAGGGG AAAG CCAGAACTGAC 
TCCTGCGAGTTGCCTCTGATCCCCATGTATGGGCCTTGACTCCTA 
TTCACCACTGAGCTGTGCCCAAGCTCCTGGTTTCTGTT^ 
7 5 crGTAGCCAAGGTTGTCTTTGAACrrAACTCCACCTCCCAAGCT 
GTTTATAGTGAATTACATACAGAAGCATGCACACACIAGAC^ 
GCATGCACCCCCATATGTGTATATGTATGTATATGC^CAGATATATGTGC^ 
CTGTGTATATAATATGTATAGGTTGTTCTATCATATTTATTTTTATCTTCTTTA 
ATTTTATATCAATGTTTITTCCAAGCGTGTGGGCATATGATTAAGCCAGTGT 
5 CAGACAGCATAAATGCCCACGAGTTCTACTCAG CAA^TT ATTTG CTCTGTTGGC 

AGCACAGCGGAGAAAGGAGCATGATGCTTGAACAGGAAGGCAGCCATGTTGTGCT 

TAGTCTGCGGCTGCCTGGACCTTTCTCAGAGCCCGAC^ 

CCAGGCAACACGGTACCTCACAGGCTAAGGGACTTAAACG 

CCACTCACAGACTCTGTAATCCTGAATGTGACACCACACTTAACAAGCCTCACAGACACTC 
1 0 ACCGACTCTTCAAGAGCTTCTGATGACACCCTGTCATTCCACTTGC 
TGCACAGTCCTGGCAAGGAGTGCTTTCRTCCCAGCATCT 

TGTATGTACATGAGCACACATACACATGTATGACACACTCATATATACACTCATACATGGAAATATCT 
TCGTATACCATATTTGAAATGCATATGCCCTGAATATGCACCATATACACACATATGCATCTGCATAG 

CATCT ATTG CTGCTT ACACATGG CAAATGGATTGCCTG CGG AT AG CTACACAAAG ATGT ATTTCAAGGACTTACAGAGTT AAACAG 
1 5 ACCAGTTAAGTGATTGCAATGTCCATGGGAGAGATTCT 

AGAAAGCTGCIGCATGGTAGGGGTCAGCTTCCCTTCCCCGCCCCCTTT^ 
GCTCACCTGAGTCAGGCCCATGTGAGATCATCCTCATGATGGAAGTCA^ 
CAGCAGCAGGACTGCCTCAGGCTGGGTCTTCTGCGTGACT 

GCTGCAGCTGGAAG AG ATAGCGCAGTAGG CAAAGTGCTCGTCAC CCAAG CACGAGCCCCTCAGTTCAGATGCCCAGCACCCAAGTA 

2 0 AGAAGCCAGGCACAGTGGCACACGGTGCCATCACAGTGCTGAGGAGCTG 

GAGGCAAGCTCCAGGTTCAGTGAGAGACCTTGCCTCAAACAATAAGGTG^ 

GGCCCCCG AATGTCCAC^TAAGCAATCGCACGTACATGCACATACATGTGCAGGCACATAGAAA 

TTTT AG AG AAGCCG ACACCAAC AAAG CCCAG T CAACAAG CACTTGAAATAACTCT AGTGT AG AGAGTT AATG AAAACTG TG CTT 
TGATCTTAGGATTATCAATTTAAAATGAAACCAGGGCGGGGACTTAGCTTAGTGACAGAGT 
25 AG ATTTGGT C CTT AG GACCAGG AGG AG AAAAATACTTGG AAAGT CAAAT ACAGATTTTTGAAATAATGT ATTTCITTTT AT CTT AT 

GTGCATTGGTGTTTTGCATGCATGTGTGCCTGTATGAGGGTATCCGATCTTGGAGTTAC^ 
GATGGCTCAACAGTTAAGAGCCCCGATTGCTCTTCC^GAGGT^ 

TGTAATGGGATCTGATGCCTTCTTCTGGTGTGTCTGAAGAGAGAATGACAGTGTAAATAAATAAA 
AAAGAGGAGGAAGAGAAAGAGGAGGAAGAGGCCAGGTTTTAGGTCTTTGTTGTGAGATCA 

3 0 GATCTTTAGGTGAACCTAACCGCATCCCACCTTCTGACAGAGTCTGCCCACA 

TGTAGCCAGCTGATTGCAGCCAGTGTCCTGGAGCCCTATGGCTTCTGGCCTCTGGGGAGGGTTACCACC 
TGCCTAGTTTGAAAATACTGGCAGGAAGCTGGGAGTGAAAGTCAAGGTCCAGGAAGCAAAAGGCAA 

GGGCAGAGG CTACCTGCTC CACAGGACAGAGGGTGTGCTGCAGAGCTCTGGTG ATCACGGGCCTAGAG AG AGAG ACAGGCA 
CTGGTGCCTACTGACCAGCCATTCAGCCAGTGGAGACCCCGTGTCAAACGCT 

3 5 TT CACAGG CACATG CACACACAGAACTGCTCACACACACACG CACACACAATATC ATG CACG CACG CACACAAG C CTGTCT AATG C 

ATCCCCACTTTTAACACACTGTGACAGAGTAGTTCXjAACTTGAAAATTATCT 
CACTAGAAGTTGTGCATAAGTGGAAAGGGTAAGGGTGAAGCTGTCAGCGCTTCCTTTACA^ 

TGCCAGGGAGGGCCAGACCAC CAAGTACAAAGTGGAGAG ATGGATAACGCATG CTAGCGCTGTCTACCAG CAATGCTTCTCACGAC 
AGCTCACATTTCCCTCCCirGATCTTTCAACTGAAGCTTT 

4 0 AGGAGGGTTGTATCTTCATCTGCACCACAAAGCAGCCAATGACTGCTCTGGCCCT 

GTGCCATGATTGTCCCAGTTCCCTAGGCTGCCACCTAGGTGCAGGAGGCAG 
AGAAGGGTTATGAATCTCTTGGGTCTCAGGGGCrCTGTCTGGATCTTGAAT 
TGGAAATGAAGCACTGAGCTACATCCACTTAAAGAAGAATCCCAGTTTCCAAGGGAGGTGCCT 
GAGACATAGGGGTAGCTCCTGTCTATCTGGGTGTGAACTCCTTCCAGGAGCTGGGATGCTCAGGG 

4 5 TACGTCCTTTTGATATACTGCCATTTTTTAAAACTGATTTCCTAAGACCCTCAGCTC 

TAGGAAATTGTGAGTGCTGGTTTATTATAACAAGTOT 

CCCCAGTTGAGACACTGAGCGGCAGAGGGTAGACGCTCITCTATCCACGGGAAGGA 

CTrTAACTTCATGATGGTCAGATTGTCTTGGGGATCTCAGACATTTCTTGTGTACAGAGG 

AGTCACGCTAACCGGAAAAGACATCTGGAGACTGCAGCTGGATGATGGTAGCTATTATCAAGTATAAACTAGGG 

5 0 GGCAGGGCTTCTAAGCAACAGCCAACTCTGCTGAAGTGATGTGTAAATGTTTAAGAGGGTCGCT 

GTTCCCTTCCTGAGAGCAAAGACTGAAGAGAGAATTCATCTCAAACCATGTGAGTT^ 

CTCTCTGTtCAACCATCCACCATCTATCCACCCACGCATCCTTCTGTCCATCTACCAACT 

CCTATCTATCCACTCAGCCATCCATTTATCCATCCATCCACATATCT 

CATACACACACCCATTCACCCACCTATGTGTCCATCATCCATCCTTTCACACGTCTGTGTTGT 

5 5 CCAGCGTGGCCCAGCTACTGAGCITTGATTTAATGCCrAAGTCrrGCAT 

GTCCTTCGGGAGTGTTGAGAGAGGCCAGGAGAATATGGATCCTGTGACATCT^ 
TCTITCCCACAGGTGCTGGCAGACTCTAACTCATGG1TAACACCACGAAGCCAGCAAG 
TAAACATTGATGGGTGGGGTTGTCACGTCAGTGAGGCGGGTGGGATTTTTGATATC 
CTGAATGTCTCTGTTTCTCTACACTTCCCCTCAAATCT 

6 0 GTAGATATTCAAGACAAGACAGAAATGGGAGGAGAGAGCCGTCAGAGCT 

CCACTTTGCTTTAATTTGTGAGTAAAGATGGGCCAGAGAAAAGTTTTTAAAAATGTA 
CTCTGCCTACTTCATGAAACAAATTGATATTGATGGATCGAATGAGAAATCTCCGTGTATA 
CCACAGGAAAACCAGAAGAGTTCTCTTCCTTCAGGAAGACAGAGAGACTGTCTCTTTGACAG 
TGGTCCTTATCACGATAATGACCGAAGTGCATGGGGTGGGGGTGA 

6 5 GCCTGGGCTACCCTGTGGTAGCAATGCCTTGGTGTCTCAATCTGTATGGAAGGCAAAAC^ 

AACTTCTGCCTGACGTTAGATGAAAGCAGAAATGAACCCATGGGGAGGGAGCCTGCGAGGTCCnXS 
AGGAAGTTCTCITGGGATAGTCAGAACAGATGCTCTTCCCTCCNNNNNTJN^ 
CTTCAGGGGTGGGAGCTATAAGCCATGAGCCTGTCTAGCCTTCTCAGGTCCAAAC^ 
ACCTCTTCCTAATCCTCTGCCTCCCCTGCTTATGTTATCTGTGAACACTTGGAAATG^ 

G^CCTGGGGCCCAGACAGAGGTGAG^ 
CATGTAGAGGCAGCTTACAAATTCCTGGCACCATTCC^ 

AG C CACAGGT AG AAAATATGAC ATCCATAAAAC AAGCAGTCCTG CAAGCA 
AGCAJLACAAAC AAA^CCCTTG AG TTGTT A^ CAT CGG CAG CATCATG CTGTGTGTCTGGCACATTCCGGGG ACCCTGGTTGC CATT C 

7 5 TTGGAAGAAAATGTTGCCAAAGCATAACTGTAACCTCTGGGTGGCTTO 
CAGGGAGCTGTGATGACTGGGAGGCCACAGGGGACACAGTGCAGGAGGGCGCAGGAGACTTC 
ACCTTTCGTAGGTTCATGAGTGTCTGCATCrc 

GAGGCTTAAAGCAACAAGAGTCAGTTATCTCTCATTTCATTTATGCCTTGTGGCT 
GCATGCACGGAGCCCCGGGTTTCATCCCCAGACCCGCAGAAGCCATGCATCGTGTTGTATACACC^ 
5 GTGTTGAGTTCACGGCCTGTCTCACATGAGCCTCAGAGCAGTA^ 

AG CCTTGAGAAG CACACATAT CATTTCT CTTGAGCTCTGTTGGTTCAGTT ATGAAAAG C CAACAAGGACTGTGTCGTTGGTGCTAC 
TTCTCAATAGGAGACTGTCACAGCTGGCTTAAAGTCT 

ACAGGAGTCCAGACTGGAGTCATCTGTTCCCTGACTACGTTGCAGGCCTrCAGAG^ 
1 0 G AGAG CC CTT ATGCTG ACCCTGAAG AGATCCGG C CCAAAG AGGT CT ACCTGG ACAGGAGC CTG CTGAC C CTGGAGGACAATGAACT 
GGGCTCCGGTAACTTCGGGACTGTGAAAAAGGGATACTACCAAATGAAA^ 
TCAGTGTGAGAGTGCATTCTGAAAACTTCTCAGAACGTACCT^ 

TTTATTGCACGCACACACATACACATTGCAGCGCTTATGTGGTCAGAGGACTCATC^ 

GGGGATTAAGCTAATGTCATCAGGTTTGGCAACAAGlTrT'l 1 1 ITTTTTTT T I T IGTrriTTT ACCAACTG AGCCATCTTGTAGTC 
1 5 TTCATTTCTGACTCATTTAGAGTAAAAGCATCCTTTGTGTATGCTAACTCT 

TATAGTATAGAAAACAC^CTAAATAGAAATAAGCAAATTCATAAGAAAATAAAAATATATC 
ACTAACCTTGGTGTAGGCATTTATTGGATCTTTATATGTTTTGTATGTAAATGG 
GTAATGTGTCTATCGTGTAGCATTTTCCATGTCTAGACCTT^ 
AAOVCAACTGAGTTCACTTGTGACCTGTAGCAGTT^^ 

CAGATATGTTCACAAGTATGTGTGGGTTGTGAGTACACATTTGTGCAGCT 
ATAATAGTAATAATAGTAATAATAATAATAGTTCCTCAGGCCCCGTCCACCrTGCTTTTTC 

GAG CTTACCAATTAGGCT ATGGAAACTGGCCAG AGAAGCGCAGGAATTCACCTGTCT CTGCCTCCCCAGTGCTGCGATTATAGCTG 
TGTACCACAAACATCTTTTTATATGCTGGGGTCTTGGGGGTCGAAGCCCAGTCCTCA 

2 5 AGCTGCCTCCCCAGCCCAATAGTGATACTTTTTAAAAGAAAACCATGTCTACTCCT 

' G CATT AACT AAGAATTG T AGAGT AGTGTTIT AGAACTGG CAGG AG TCTTGTG C CTGG TT ACAGAGAG AT CCCAG CATTCCTTTGGG 
TGTGGCTTATGTCTGTATGATGACTGCTGTCTGACAGACTGAAGACATGCT iTAGTCC 
TGCATCTATGCTGNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TGATGAGTTACAGCCATAGATTTCAGACCTTAAACATGTCCCTCCATCTTGGGGTAAACT 
TTCCATAAGCAACTGAATATAACCCATGAGTATTTCTCCTAGAATTTTTCCTATCTGTC 
ATTGCTGTGATGAGACACCGTGACCAAGGTAACTTATAAAAGGAAGCAATTAATTGGGG^ 
CTGAACATCGTCACAGCAGGAAGATATGGCACTGGAGCAGTAGCTGAGTGTATATCTTAT^ 

3 5 AACCTCAAAGCTCACCCCCAGTTACAGATCTCTTCCAACAAGGACACACCTGCTAAAG^ 

GGCCCAGCCrTCGGATATATGAGCCTACCGAGGTTTTTCTCATTAAACCAC 
TCTCATAATGGAGTCTTACTCTCTTACCTGTA 
GTAAACAGTCAGGAATTGGAAGAATACTAGCCTCTTAGAATTA 
ATTTTCTTTTTTAAATGTCTAGAGGAATCAAJSAAACCTTTTCT 

4 0 ATAGACCCAACTATGAAAGCCAAAATAGGTGGCTAGAGATGACAGAATGGAGCCTCCCATGTTCCCGGGAGGGCCCAG 

CTAGGGTGATGATAATATCTAAATCCTCCTGTGAGGCCACAGGCTGTGT^ 

TACATGCATGTGGTlTGGGGATCAAGCTCCAGCCACCMGCrTGGaiGAG 

TCCTG AAT AGTTCTGTTTTGATT CATCG<5TTTGAAGGTTCAACAAAAG AAACAACTTCTCATG CTTGTGGAAATTTTACCACAGGT 
4 5 AATTTTTCAAAGCATACCTAGTTAATXjTrrG'rrCT ITCTT '1 1 AAGCCACATTTTGCTAGAATACTTTCAAGAGTTCAGCATGATCC 

ATGTG ATGTTTTTG TG AGGGAAT AAAAAAAAATC CAT ATTTT CGAAACACATTG AGGCTTG CTT AAAAGT AACCAAG CC CAGTGGG 



CCAGAGCCCCAGAGGTAGAGACAGGGGGATCAGAAGTTTAAGGTCATCTCCT 
GTAAGACCCGGACTCAAAGAAAAAAAAAAAGCAAAGCAATGTATTTTTATTTTTTAA^ 
5 0 TCATAAAGAAACTTCGTATTCCTTTATGCATC^ 

CACAACAGGGG AACTTTGTTGAAG CTGG TTGAATATTTCACAATTCGGGGCTTT C CTCAG C CTGG AC CATTCT CCATTG CTGTCCA 
CT ACAC CGGGGTT CACAGGC AGTGGGGGATGGTTCCT CTGTCTGT AAAAG AAC AG CTGT AG AAAGAATCTCTTGTGT ACTG T ATGG 
AATCATTACCATCATACAGAGCACCAACCACAAACCAGAGAACAATCCCCGAAGT^ 
TGTGGGGGTTACTCAC^GGAGCTCCGTGTGCCGCTGAGAA^^ 

5 5 CCTAGATGCCCCCTCrCGGGCTCCrGTTAGCTTTCAC!AGTCGTATCCrCT 

GACAGGATCTCACACAGATGACTGGGAAGCGCCC^C^GGAGT^ 
CTTATATGGAGGAGCACCAGCATTTGAAGCTGGTCTTTCmrr 
TATATTGAGAGGACGTTAAAGATGATCTTAAACCAACGATCCT 
CCCATC^CCC^TCTGAAGCACACGAGTTGTTCAAGCTCAGAGTTAAAGGGAAAGA 

6 0 TTAAGCAAGCCTC^CATGTTTCGAAAATACACrATCTTTTTCAAT^ 

AGTATTCTAACAAACTGTGGTTTAAATGAAAGAATAAACATCAGCTATGTGTGATTTGAA 
CAGACGCCTTTAATCTCAGCACITGGGAGGCAGAC^ 
GGACAGCCAGGGCTACACAGAGAAACCCTGTCTCGAAAAAACAAAA^ 
AGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAGGAAAATTTGCACCT 

6 5 CT AG CT ATTT AT CTTGATGTG TGTGTGTGTGTGTGTG TGTGTGTGTGTG TG TGTGTGTGCAGGT ACT CCTGACACAG CACAT AT AT 

AGAGGCCAAGGG ACAAC C CTTGG ATG TCACCTCTTACCTTT CAC CTTGTTT CAAACCGC CTCTGG GGATTCACTG CTGG CC CCACG 
AAT CT CC AT AG ATTCT C CC CT AGAAG CCCTG GGGTTGCAG AC ATCTG CT AT C CAA CTTT CTGTGGGTTCTGGGG ATCTGAACT CTT 
AAC CT CACTCTTG C ATG CG CTTTTT ACACTGGGTT ATCTCC CCAG C CCATGG CCG TTTCTTG I'iTGT lTl"!' T IT 1' T T TCGTTTG TT 
TGTTTGTTTGTTTTTGGGTTGCTTTGTTTTGTTTTT 

7 0 CCCTCCACCCCCCTCCCCCCAACACACACACAATGTGACTACAGAGAAGAGGACCT 

CTC^CAGAGGCGAACGTCATGCAGCAGCTGGACAACCCCTACAT^ 

GGTGATGGAGATGGCGGAGCTC-GGGCCGCrTCAACAAGTACCTGCAGCAGAACA 

C^CCCCGCACATGCCCTCTCCCCCGAXSTTTGGG 
CTCTCTCTCTCTCTCTCTCTCTCrGTGTGTGTGTGTGTGTGTGTGTGTGT 

ATGTGGAAGCCAGAGGTCAATATTGGAGATCACCCTCTCTCCTCTCT 

GTTGGTGTGAGTGTGTGCCACACAACGCAGATACTTATGGAGGCT 

TTGTGAGCCrrCCCAACATAAGTGCTGGGAACCAAACrCCGGTCCrCTGCAAGAACAGTACG 
5 CCTTTAATCCTAGCACITAGGAGGCAGAGGCAGGTGGATTTCT 
CCAGGACTACACAGAGAAACCCTGTCrCGAAAAAAA^ 

AAAG AACAGAAGGCGTTCTTAAC CACTGAACT AT CTCGTTAGCTCT C CG CCTT ATTTCATGAGACAAGGAACT ATCT CATT AG CTC 
TCCGCCTTATTTCGTGAGACAAGGAACTATCrTGTTAGCTCTCT 

CACCAATTCAGCCAAACAGAGC CCCAGGG ATT AC^ C C CAACT C 

1 0 AGGCTTCATCCCTATAAGTCAAACAGTTTACCAACTAAGGCACCnTCCTAGCCCTCT 
CAGGGCATTGCCCACCAGCAGGCAGAACATTCTTCTTTGG 

TCTGGCCGACCTCTTGAGTGCCTGCATCTGAAGAGGCAGCAGCAGTTCGTTGAT^ 

GTCAACAT AGAGTAACTCCCCGATTCCTCTGAGCCGGGGTTCCTTCTGTACAG CTTTTCGTCAT CATTTCCAACATGGATGTGTGA 
GTTTGAGCATGTGCAGCAGGTGGAGGCCAAAGGTCAAGCC^VAACCC^ 
1 5 G AGGT CC CT CATGGGCCAGAG CTCATCAGTTT AG CTAGACTTT CTGGGCAGGG AACACCAAG AATCCACC CGTTT CACTT AC C CGG 
CTTTGGGATTACAAGTGCAGGCCX5CCCGCCATGTTTGGCTTT^ 
GCAGGATGAGOSCTTTCCrAAAAAGCTTTCrCCATAGCG 

TCATCATATAGTGTGTGTATGTGTATATGTGTATGCACATGTGTGTATGTGTATATGTGTATGC^ 
CCATGCATACGTGCAGAGGCCAGGAGAGGCCACTGTGTC 
2 0 GTGTGGTAGCACAGACCCTTAATCTCAACACCAAAGAGGCCGAGGTTGGTGGCT 
AGTTTATTCCAGGCTAGTTGGGGCTACAGAGGGAGACCTT^ 
AGAAAGTCCTGCCTGTTTGGTTTGGCGTGGTTTGATTTGG 
GTCCCCATCCCCTCCCAGTGCTGAGATTACAGCACAGGTGACCAC^ 
TCCTCAGGTTGTGCACGCAGCAACTGTCCTCGCCCTGAGTCATCTTCCCAGCCTG 

2 5 CCGACCCTTCTTCCATGTACTCGGGGGACACGGCrcTGTGTT 

TAGGCACATTAAGGATAAGAACATCATAGAGCTGGTTCACCAGGTTTCCATGGG^ 
ACAGAGATCTGGCTGCGCGGAACGTGCTTCTGGTCACACAGCACTATGCCA 
GCTGATGAAAACTACTACAAGGTAGGAGCGCTCGGTGCAAGCTCACAGAACA 
CAGGCTGAGAGCTCAGTACAGGCTCACAGAGCAGCTGAGGGGACAGCTTCCATGCATA^ 

3 0 AAAGATAAATGTGTGCTCTCTGTATTTTACTATGGTAGAAATCTCACTAGTAACCTTGGAGAAAAGTATTCT^ 

TCCATCAAGAAAGACAAACCAAAAATACTGTAATGTTrCT 

TGGTGGTGGTGGTTTTriT'ri'll'TTTXl T TTTCGTTTTACATT^ 

TCTAATAGAGAGCAGAGGACAACTTACAGAAGTTGATTCTCTCCATCTACCATC 

TTCGGCAGCAGGCACCTrACCCATTGAGCCATCTTGCTGGCCT 
3 5 AAATTGATTTTTCTTTTCTTAGGCAGGGTTT 

ACCACGATTGGGTTACTCAGTCTACCTAATACTATGACCCTTTAACACAGTTCCTCGTGTTGTGGTGACCCC 

TCATCGCTACTTCATAACTGTAATTTCGCGACTGATATGCAGGGTGTCTGATATGTGACCCCCAAAAG 

GAACTACTGGGGGCTITATGCGTGTTAAAGGAAACCACT^ 

GATTTTTCCGAGTCCTTTTCACTGAATTGACACTGTGACT 
40 TGTGCATCGGGGTX3TGCACACCTGACTTAACGCATCACAGAGTGTC 

GAAGTGGTACGCCCCCGAATGCATCAACTACTACAAG 

CGTTCTCCTATGGGCAGAAGCCCTACAGAGTGAGCCATGCTTAGCTGTGTGTTTTGTGTCTCAGCCTCT^ 

CCCTCAGTATGGGGTTATTGCCACAAGAACCTGAACCCCTGGAATCTGAGGACACCCTGGGGTCT 

GCGGCTAGTGCAACCTAACITACGCTTGTACTCACTTGAGAATGACATGTGATACTGAGTGAATGCA 
45 ATACGTGCACCAGGCACAGGCCATCCTTCCTGAGAGTTAAGTGGAAATGGTGATACGGATGGAT^ 

AGCGTGTGTGTTCTCATGTGTGTGCACACOTGTGTATGTGCAAACACATTGTGTGTGTGTG 

GTGTGTAGCCCAGAGACCATCCTCAGGTGTCATTCCTCAGGGACTCT 

CTCACCG&ATAGGCCAGGCTGGCTAGCCTGTGAGCCCC^ 

TACGACGNNNNNNNNNNNNNNNNNNNN^ 
50 nhnnnNNNNNNNNNNNNNNNNNNN^ 

NNNKNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNN1TONNNNNNNNNW 

NNNNNNNNNNNNNNNNNN^^ 


NNNNNNNNNNNNNNNNNNN^^ 
NJTOJNNNNNNNNNNNNIWNNNNNN^ 
50 nnnnnnnkNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNl^ 
55 NNNNNNNN NN>mNNNbmN^^ 



NNNNNNNNNNNNNCCAATCATGGTGTTTCATGGCAG^ 

TAAGCAAACTAGGGCAGCTGCAAAGTCGCCAGCTGTTACAATCTTATGAGACT 
GATGTCAACTC^AGCATtrrAGAACACTCTTGCCrTCCACAAACT 
70 TCGAGGCTAGATGCTCCTGCTGACCACAGACAGCTCAGTACCAGAAGTCTCCCG^ 

ACCACTGCTCAGCCCATAGGGTCGGAAGCCCCATCCCTATTCTTGGGTACAGTGTATTTTGCCTT 
ACAGACACCGTGCACGTGGTGAGAGCGTGAATCCACTGTGAGCCACTGGGCATGCTCTGC 
CCGTGATAAAGCTGTTCGGTGCTAGTGAGGCACTCGATGTTAGAAAGCTG^ 
TTTTTAATAAAGGTTCAACCTCCCCTCTAGCCCGCCACTCAGCAGAGGTAGC^ 
AGCAGCTGCAGACCAGTCCGCTAGGCAGGCAGACT^CCAGGCATGAAC^ 

CG CTTGCTTG CCTCAG CATCT AACCTGTAGGCGACAGCACAG^CTTC CACAGGGTT GGG CTCTGAA 

CCAGACCACAGCCACAGAAATCTCTGAGTTCTAGCAAGAGCACAAACCAATACC 
TATGGTATATTACAGCTGTGTCCATATTCTCCTCTTAAGATTTTTAGCCAAGGG^ 
CTGTTGTGTAACTCTAGGGACCTGACTTGAGTCCCTA^ 

GCAC CAG CAAGG AAGATGGAGACAGG AAAATCTC^G ATtKnTTCTGG T CAG C CAGC CTTGC C7VACTTGACAAGTTCTAGG CAATTG 

AAAGACCCTGTCTCCAAAAAAAAAAAAAAAAAAAAAAACCAAGGTAGATGGTGTCTGAGGAAGGAC^^ 

TTCAGACACAGAAGATATTCTGTACCATAGTGCAAAACnnTCTGG 
GAGGCACGGCGACTTAATCATGTTTC 
GCCAGGATTCACAGGTGTGGAACTGAGGCTTCGCAATTACTACTACGACGTGGTTAACT 
CCACGGAITCCCAAGCGATCACAGGAAATTCATTCAGATGA^ 

AGCTAACATAGGACTCACCCTCACAACAGGTCTGGTGCCCAGAGACAGACAAGCAGCAAGACCTTG^ 
CTGGTCTTTGTTTCCATCTGTGTGGTCTTCGCTATCGGTCACGTTTGGGAGCCATTTCCAAATTCT 
CTGGGTCCAGGATTTCAGTGTCCCCTGAAGATCAGAAAAGAAAAGTGCTTTGCCACTCCAGCAAAG^ 
TGGTGCAGTCAGCAAGCTGGAAGGTGGTTAAGGACAATGACAAGCCTGGACACTGGT^ 

GTGTGGATGACATGCTGAGATGTGCTGGCTTGGCTGACACGCCCCTGGACTCCACCTCCCTGCAGTCGTGTGTG 
GGTGTGAATAACGTTCTGTGTGGAGGCTGCCGGGGGAGGTGAGCTCCCATGATCCTCTC 

GGAATGGCTCATTTGAGCTCACAGTTGGAGGGGATAGGCCATCATGGTGGAAAAGACAAC 

AAGGAAGCTGGAAGGTCACATGGCATCCACATTCAGGAAACAAA^ 

CACCTCCGGTGACCCAGTTCCTTC^GATCCATCTATCAAAGATTCC^ 

AATGTTCAAGCACACAAGGTGCAGTGCCAGGCAGGGGCCTTCCTGAATGA 

ATCTCAGGTCTGTCCCCITATGAGCCCCGTTTACCATCCCAGATTCCCCCTTC 

CAGGCCCTATCTCCAAACACCTAGGTACTAAGTGAACCTAGGTACTAAGTCTTCAACACATGAATTCTGAGGG^ 

TGGAGCCGATGGCAGCCTGGTGACCAAGACCATCTC^GGATGAGTTTCTGAATGGAATCTTC^ 

ACCTTCCCCACAAGGAAGCCTTGCTAAGTGCGACATTGAGCTGAGGACACACAAGCTC 

GGCTAACTACTGACCATACACCGAATGTCACCCATTCACGGCACTTTTGCTrGAGGACCTAAAAGCTACA 

CATCTAGGTCTAGGGACATGTACAGATGGGAGCATCTCACCTTCACCTCTG 

GAGTCCCTGCTGGTGGTGGGCTCAATGAACAGAGACGATGGTG AAATGGGTG CCTCT CTTTAG ACCAGTGGTTCTCAACCTTCCTA 
ATGCTGAGATCTTTCAATACAGGTCCTTGCGTTGTGGCGACTCCCAACCATG^ 

TGCTACTATGATGAATCGTAGTGTAGATATCTGATATGCAGGATATCGGATATGTAAATATCTGGTATGCAGGATATC 
GGGCACATCCCACAGGTTGGGAACCACTGCCTTTGATAGGGAGGC^CTATGCTTAGAGCATG^ 
GGTGTGTCTTGGAATTGTGTTAGTAGGTCAACCGTCCTGTGGAGGGAAGGAAGGGGGTGGAGGA^ 
TGAGCAGACCTGCAGAACACAGCCTTGGTGAGGAAGTCCCATGGGTCAGACCACCTGTAGATTTAGTCAGG 

G T AT AGTT ATGG AGACAGG ACTG TCTG CTCTG AG CAAG AAGTTCTCTGCTG AG CTCCT AGC CT AC CTT C CCTTGTC CCCAGGG CT A 

TAGAAAGGCCACCTCGAAGACCAGGGAGCATGTGTGAGAAGTCTGTGGAGGCCTCTGCCTTCTTCCTGGCCTCTCT 

CTCCTTCCCAGGGATTTGGGACTGACCGTTATCTGGCTATGACT 

GAAACCGGTTCCCATGTGCAGAGTCACATCTGGCCATCGACCAGGGCTCT 

TTCTCTCCCCTGGGCCACTGAGGAGGGCAGATGGCAAGAGGATCCrrCCCTAGAGGATTTAGA 

AGGTCAGAAACTAGCCAAGGTCCTTCTCAGGCATCTATCCTTAACTTGGTCTCTCC^CAAGCTGC 

CTGCCTGG CAGCTGGGCTC CCTGTGTGCTTTTACCTATGACCCTTGGTCAG CAGAGTACAAGGAGCACCT AAGCTGCCTCGCCACC 
TCCCACTGACTGCAAACAC CACATACCTCATGCGG AACCG AATGTCTCTCCAGTAAACACCCCAGCCAG CCATTCATAAAAACCTG 
TCTCTGTGTGCGCTGGGACATGTCCTCCCTGTACC CCGGGCCTG CTTTGTGTGCCAGACAGTGACTC CACAGGG ATGCCAGGTCTG 

GTTATTTCAGAAATTATTTTAAATAAAGATCTACTATTAGTCTTGTC 

CACCCTTGACTAGAGCTGGCCAGCATCTTCGTAAGTGCAACCTGCTTGTGAAATACTGAGCCCTCAGTG^ 

TGGGTGGG AACG CAG ACTCCAGAGCTCACCTATTCTGCCG CTTCAGAAGCCTGCA\AGGAGAGCCCGGTCCTGTGAGGGGAAACAAC 
TGTGTCTGAGATTCGTAGGCTGAGAACTGACAGCCGAACrrCAAACC 

AGGGCGGGGTGAGCTTTGCCACTGTCAATGAAGACAACACGAAAGCATCAGAGCAGAGAAGGTG^ 
TGGGCTCCTCTCATTCTATAGGGAGTTATGAGC!AGGCACACATAC!ATAC^ 

TATGTATGTATACACAGGAAC^CAT ACACATGCATGCACAAACGTGCATTAAAAACATG CCTGCAAATG CACACTGCACACACAG A 
CATGAACCTGTGTACAC AAATCCACAATCACATGTACATGTTGAC^CATCCCCACTC CAATGCCACTTTGCCCACAGG CT AATCT 
GATCTTGGAAATCCCTCCATGGAGAGTCCCTTCCC!AAGTGACTCTAGGCTGTGTCAAGTTC^ 
AGAGTGTATGGTAAAGAGAACCTGAGTTTAGCCCCCIAGTGCCTATGTAGAGAG^ 
CTGGGGAGGGAGAAGCAGATCGCTACAGCTCACTGGCCATCCAGCCTCACCCAGCACGGT^ 

GTCTT AGAAAAAT AAAGGAGAAAGCAGCTGAAG AG AACACT C AGTGT CAACCTT CTG CATG CACACAT AC AAAC AT GCAC C CCTG A 

ACACATGCACCCCTGAACAGATGCATGCACACACATGCAT^ 

ATGOVCAC!ATGAACACATGCATGCACACACATGCATGCA<IACA 

CCCTGAACAC^TGCACACATGAACACATGCATGCACACACATGCZATACATACACAT^ 

GAACACATGCACACACACAAACATGCACACATACAAACATGCACACA 

ATGTGAACACATGCACATGCATGCACACACTCATGCACACAC^ 

ACGCAATGCCTCTGAACAACCTCAATTTAAGAG(2ATTGCCTGCAC^ 

AGAGCCACAGACTTTTAAAGCTGGCAGGCTCCCAC^CTGTACCAATTC^ 

TTAAAAGGAGGAAAGGCCTGTCTTGGCAGCAGTCTCTGAG^CCTTGTTCCTGGAGTC 

TACCAAGCCACATCCAGCCAATGCAC^TACAGAGGTGCAAAGCAGAGACTGGAGGTT^ 

GACAAAGATGGTTGCTGACCCTCCTAAGCCCATCTTCTGGGGTCCTGGC^ 

CCCCATGGAGGCATGCTCCCATCTGGCAACACGTTTCAGCT 

CAAAGGGAGATGCGTGGGGCTCTCGCTACAGCAACGTTATTCCTGCCAGGATGGTAAAGCTGTGGGTACCTGAC 
TGTCTGTGGTGGCATAGCTTCTC^CCAGAGGAGCACACCGTGG 

GAG AG CGGTTCC CAACAT CT ACAACAGCATT CCTGACG AACGGCTT CC CTTT CAC AAGGCC C CAG AT CTTTGAGTTTATTATTTTA 

TTTGTTTGTGTGTATGTGCATGTTCACACACATGTGCACCCGCGCATGTC 

GGGTTCCTTCTGAGGACAGAAGAGANNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNHNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN^ 
KNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNh^ 
X0 NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNN 

MOUSE SEQUENCE - mRNA 

GGCACGAGGGAGGAGAAGGCGGCGGCGO^GAAAOVGGAGGAGGGAGGAGGAGAGGG 
GGAGCCACTCTOCCTCGCCGCACCCTAGCAAAGGGCCT 

CGCGGAGTGGGGCCCAGCCGGAGCTCCGCCTCTCGGCAGCT<^^ 
20 GCAAAACAACGTCCCCAAGCATTTGAGATCCGGCACTGTCGTCGTGCGCGCCrCT 
TGXZGGTACTTCTCCATACACrTCCCAGAACTCT 

TACTTTTTTGGCAACATCACCCGGGAAGAGGCTGAAGACTACCTGGTCCA 
GAGCTCGCAATTACCTGGGTGGTTTTGCTTTGTCGGTGGCT 

GCACCTACGCCATCTCCGGGGGCAGGGCCCATGCCAGCCCAGCAGACCTCTGCCATTACCACTCCC^ 

2 5 TG CCTCCTTAAGAAGCCCTTCAACCGGCCCCCGGGAGTACAGCCCAAGACCGGACC CTTTGAGGACCTGAAGGAGAACCTCATCAG 

GGAAT ATGTGAAACAGACCTGGAAC CTTCAGGG C CAGG CTCTGG AG CAAG C CATCAT CAG CCAGAAGCCCCAG CTGGAGAAGCTGA 
TCGCCACCACGGCCCATGAGAAGATGCCCTGGTTCCAT^ 

AAGACCAATGGAAAATTCCTGATCAGGGCCAGAGACAACAGCGGCTCCTATGCTCTGTGCCTGCTG^ 
CTACCGCATTGACAGGGACAAGACCGX3GAAGCTCTCC^ 

3 0 ACTCTTACAAGCCAGATGGGCTACnttAGAG 

AATGCCCATCCCGTGACTTGGTCACCGGGTGGAATAATCTCAAGGATCAAATCCTACTCCTTCCC^ 

TGCCCCACCCCAAGGGAGCCGTCCAGAGAGCACTGTGTCOT 

TTCAGAGAGAAGCCCTGCCCATGGACACAGAGGTGTACGA 

CTGGACAGGAGCCTGCTGACCCTGGAGGACAATGAACTGGGCTCCGGTAACTTCGGG 

3 5 AAAAGTTGTGAAAACCGTGGCTGTGAAAATCCTGAAGAACGAGGCCAACGACCCGGCTTTGAAGGACGAG 

ACGTCATGCAGCAGCTGGACAACCCCTACATTGTGCGCATGATCGGAATCTGCGAGGCG 
GCGGACGTGGGGCCGCTCAACAAGTACCTGCAGCAGAACAGGCACATTAAGGATAAGA 

CAT GGGG ATGAAGT ATTTGG AAG AG AGCAACTTTG TG CACAG AG ATCTGG CTGCG CGG AACGTG CTT CTGGT CACACAG CACT ATG 
CCAAGATCAGCGATTTCGGTCTTTCGAAAGCrc 

4 0 AAGTGGTACGCCCCCGAATGCATCAACrTACTACAAGTTCTCCAGTAAGAGTGACGTCT 

GTTCrCCrATGGGCAGAAGCCCTACAGAGGGATGAAAGGGAGCGAAGTGACCGCCATGCTGGAG 
CTGCAGGATGCCCGAGAGAGATGTACGACCTGATGAACCTGTGCTGGACTTACGATGTGGAXSAAC^^ 

G AACTG AGG CTT CG CAATT ACTACT ACGACGTGGTT AACT AACAG CTCCGGCG CCTGTCCGTG CACAC CACGGATTC CC AAG CG AT 
CACAGGAAATTC!ATTCAGATGAACTGGCTCTCAGAGTTT^ 

4 5 CTCACAACAGGTCTGGTGCCCAGAGACAGACAAGCAGCAAGACCT^ 

GTGTGGTCTTCGCTATCGGTCACX3TITGGGAGCCATTTCCAAATTCT 

GTCCCCTGAAGATCAGAAAAGAAAAGTGCITTGCC^^ 

AGGTGGTTAAGGACAATGACAAGCCTGGAGACTGGTTTTA 

GTGCTGGCTTGGCTGACACGCCCCTGGAGTCCACCTCCCTGCAGTCGTGTGTGTTTGCTGGAG 

5 0 GGAGGCTGCCGGGGGAG<3TGAGCTCCCATGATCCrCTGCGACTGAGC!AGAAGTGTC 

GAGGGAATGCATCCAGGACTCCHTGnTG CTGCCACTAAAAAATTG CCAACCAGCAACTTAAGGG/^GGAATGGCTCATTTG AGCTCA 

CAGTTGGAGGGGATAGGCCATCATGGTGGAAAAGACAACAGGCAGG 

GGCATCCACATTCAGGAAACAAAGAATGAACAGGA 

TTCAGATCCATCTATCAAAGATTCGACAGGCCTrCTCAGAC^ 

GAGCCCCGTTTACCATCCCAGATTCCCCCTTGGGACCCCCCrCATCTGGATCAT^ 
TAGGTACTAAGTGAACCTAGGTACTAAGTCTTCAACACATGAATTCTGAGGG 
GACCAAGACCATCTCAGGATGAGTTTCTGAATGGAATCTTCA 
TGCTAAGTGCGACATTGAGCTGAGGACACACAAGCn'GAGAATTC 

6 0 CGAATGTCACCCATTCACGGCACTTTTGCTGAGGACCTAAAAGCT 

ACAGATGGGAGGATCTCACCTTCACCTCTGAGACACTG 

TCAATGAACAG AG ACGATGGTG AAATGGGTGC CTCTCTTT AGACCAG TGGTTCT CAAC CTTC CT AATGCTG AG AT CTTT CAAT ACA 

GGTCCTTGCGTTGTGGCGACTCCCAACCATGAAGTTATTTTGTTGCAGATT^ 

TAGATATATGATATGCAGGATATCXSGATATGTAAAT^^ 

6 5 CCACTGCCTTTGATAGGGAGGCACTATG CTTAGAGCATGGTTAGGAAGTATTTAGAAG CTATTGGTGTGTCTTGGAATTGTGTT AG 

TAGGTCAACCGTCCTGTGGAGGGAAGGAAGGGGGTGGAGGAAATCCCT 
CTTGGTGAGGAAGTCCCATGGGTCAGACCACCTGTAGATTT^^ 
TCTGCTCTGAGCAAGAAGTTCTCTGCTGAGCTCCTAGGCTACC^ 
GGGAGCATGTGTGAGAAGTCTGTGGAGGCCTCTGCCTTCTTCCTGGCCTCT 

7 0 GACCGTTATCTGGCTATGACTGAGCAGAGTGGAGGGATGGGCTITGCCCCT 

ACATGTGGCCATCGACCAGGGCTCTCATCTGGTGAAAACTGTGTC!TTACT 
GGCACATGGCAAGAGGATCCTCCCTAGAGGATTTAGACTATGAATGCCCACT^ 
CTCAGGCATCrATCCTTAACTTGGTCTCTCCACAAGCrrGCCTCrGATGCTC 
TGCTTTTACCTATGACCCTTGGTC^GCAGAGTACAAGGAGCAC 
7 5 CCTCATGCGGAACCGAATGTCTCTCCAGTAAACACCCCAGCCAGCCAT^ 
TCCCTCTACCCCGGGCCTX3 CTTTGTGTGCCAG ACAGTG ACTC CATTGCTCCTTGCAGAGGG
AAGCCCTCGGTTGTCCTTCTCrGGGAATGCTG CCACCCAGAGCAGCACACAG TG ATTGCCCTGTTATTTCAGAAATTATTTTAAAT 
AAAG ATCT AC CTCGTGCC 

MOUSE SEQUENCE - CODING

ATGGCGGGAAGTG CTGTGGACAG CG CCAACCACCTG ACCT ACTTTTTTGG CAACAT CACCCGGGAAG AGG CTGAAG ACT ACCTGGT 

CCAGGGAGGCATGACCGATGGGCTCTACCTGCTACGCCAGAGCCGCA^ 

GGAAGGCACACCACTACACCATCGAGAGGGAACTTAATGGCACCrACGCCATCT 

CTCTGCCATTACCACTCCCAGGAACCrrGATGGCCTTATCro 
1 0 GACCXK^CCCTTTGAGGACCTGAAGGAGAACCTCATCAGGGAATATGTGAA^ 

AAG CCATCATCAGC CAGAAG CCCCAG CTGGAGAAG CTG ATCGCCACCACGGC C CATGAG AAGATG CC CTGGTTC CATGGCAACAT C 

TCCAGAGATGAATCAGAGCAGACGGTCCTCATAC^TCAAAGACCAATGGAAAATTCCTGA 

CTATGCTCTGTGCCTGCTGCACGAAGGGAAAGTATTGCACT 

GGAAGAAGTTTGACACCCTCTGGCAGCTAGTGGAACATTACTCTTAC^ 
1 5 CAAAAGATTC^TGCACAGATGGC^CACCCAGGAAGCCCAAATGCCCA 

CAAATCCTACTCCTTCCC^^CCTGGCCACAAAAAGCCTGCCCCACCCC^AGC^ 

CCTATGAGCCAACGGGAGGGCCCTGGGGAGACAGAGGCCTTO^GA 

TATGCTGACCCTGAAGAGATCCGGCCCAAAGAGGTCTACCTC^ 

TAACTTCGG^lACTGTGAAAAAGGGATACrACCAAATGAAAAAAGT^ 

2 0 ACGACCCGGCTTTGAAGGACGAGCTGCTGGCAGAGGCGAA 

ATCTGCGAGGCGGAGTC CTGGATG CTGGTGATGGAG ATGGCGGACGTGGGG C CG CTCAACAAGT AC CTG CAGCAG AACAGGCACAT 

TAAGGATAAGAACATCATAGAGCn^TTCACCAGGTTTCCATGGGGATGAA 

TGGCIXSCGCGGAACGTGCTTCTGGTC^CACAGCACrATGC 

AACTACTACAAGGCCCAGACCCACGGGAAGTGGCCCX3TGAAGTGGTACGCCCCCGAATGC^ 
25 GAGTGACXfTCreGAGCTTCGGAGTCCrGATGTGGGAAGCGTTCrCCTATG^ 

TGACCGCCATGCTGGAGAAAGGAGAGCGGATGG^GTGCCCTGCAGGATGCCCGAG^ 
ACTTACXlATGTGGAGAACAGGCCZAGGATTCACAGCrrGT^ 

HUMAN SEQUENCE - GENOMIC 

3 0 CTCTTTAGTACITGTAAATGCCTGATGGGTTCTTCCTGCTTA 

AAAGAGTTTAATTGACACAAGGCCGGCCAC^CCACATGAGAGATGAAGTTACT 

AGGTCAGTGGGX3TCAGCAGGGCCAGCAGGGTCAG 

AAAAC^TATCTCAAAAAGCGAATCTTAGTTTCTGCAATGGTGATGTTATCT 

3 5 CTCTAAATAATAGCTGGTAATCATTTATGTCTACACCTTAGCAGAAT^ 

AAG CTTT ACAAAGGTGG TTT AATTTGGGGAAAGG ACT ATTG T CATTT AAACT AT AAACT AAATGTCH" C CC CAGGAT AACTTAGCTC 
AGACCCAAGAATGGTTAAGGGCAGTTTGGAGGCTAATGGCAAGATGAAATT^ 
CCCACTGTTATAATTTTTGAAAAGGTGGTTTTAAGTTAGAATGGCAATCATTAAAA^ 
TGGAGAAATAGCAACACTTTTACACTGTTGGTGGAACT 

4 0 ATCTAGAACTAGAAATACCATTTGACCCAGCCATCCCATTACTGGGTATATACCCAAAGGACTATAAATC^ 

ACATGTACACGTATGTTTATTGCGGCACTATTCACAATAGCAAAT^ 
TAAGAAAATGTGGCACATATACACCATGGAATACTATGCAACCATAAAAAATGATGA^ 
AATTGGAAATCATCATTCTCAGTAAACTATCGCAAGAACAAAAAACCAAACACCGCATATTCT 
ATGAGAACACATC^ACACAGGAAGGGGAACATCACACTCTGGGGACTCT 

4 5 GATATACCTAATGCTAAATGACGAGTTAATGGGTGCMCACACCAGCATGGCACA 

GCAGATGTACCCTAAAACTTAAAGTATAATAAAATAAAATAAA^ 

AAGGTAGTTTTATAAAGATAAATATACAGATCAATGAAATAGAATTGGAAGTCTGGAA^ 

TCTGTCAACTCAATTAGGAOIGAATAATTGTTTTAA 

AATTGGATCTTTGCCTCATATTGTACATGAAAATTAACTCA 

5 0 CTAGAGGAAATCATAGGAGTAAATCTTCATGACCTTAGATAGTAGTTTCTTAGC^ 

AAAGTCGATAAATTGAACTTCATCAAAATTAAAAACTTTTATTCTG 
ACAGGAGAAAATATTGGGAAATTCTATATCTAATAAGAGACTTGTATTCAAAATATC 
AGATAAGTGACTCAATTGAAAACTGGGCAAATGATCTGAATAGACACT^ 
AGATTCTCAGCATTGTTAGTCATAAATGAAATG^ 

5 5 ACGACAG ACAAT AACAAG AGTTGACAAAGATGTGG AAAAATTGG AAC C CTCCTGTTTTG CTGGTGGAAACCTGAAAATGGTGAAGC 

CACTTTTGAAAACAGTTATCAGTTCCTCAAAATGACAAACATA 
CAAGAAAAAAGGAAAATATACACCCACACAAAAACTTGTATAC^ 

ACCGATGAGTC^ATAAATAAAATGTGGCCTATCCATACAATGGAATATTATTTGGTACTAAA 
TAACATGAATGAAAACATTATGCTAAGTAAATAAGCCAGTCACA 

6 0 AATAGGCATGTCTACAGAGACAAAAAGTAGATGGGAGGTTTTCCTAGGGC 

GGAAGCATGAGAAGTGATATGGGTATGGGGTTTTGTTTTGTGGTTTAAAAAAGTTC^ 
CTAGCACTTAAGGATGCAGTGGCAGGAGTATAGTGTGAGCCCAGGAGTTTGAGACCTGCCT 
XCACAAAAAAGAAAAAAAGTGCTTAAAAAAGTTCTAATATTGATAGTGATGATC 
TTTAATTGTATACTTTCAACAAGTAAACTGGATC 

6 5 AT AT AAAAAAG AAAAAGAATG AAAAG AT CTTGG T ATG AG AAAAAC CAT CT AATTTT CACT ACAGTG AAAAAT AACAG CTG AACT AC 

CTTCTAATTATATAATTATATTTTAAAAGAGTACTCTGTTAAGCAAATGATTAGCT 

ATTTTTGAGCAT AAGTTTTTTCCAT CCTTTTGAATCTCCAGTGGCTTGGGTCATGT CTGGCAAGTAAAGGACCCT AATTCAATTAT 

TGAATTGTTTGCTCCCTG&AATTTCACAAAAAAATAAG^ 

CAAAAAGTAGCAGAGCTGAGACTACTATGGCCCGTGCTTCCTG^ 

7 0 TCACACTTCAC^CACTATCTTTC^TCTTTAAGTAACTT^ 

GTTTACCATTTGTAGGTGAAAAGTCACTTTTATATCTC^TTGAATTGGTTA^ 

ACATCCTGTGT CATTTCAGCAACTGGA1X5AATGTGATCAGAAGGTG AAAAT CACTGAGAG AAGCCACTCACCTCCATCATG CTGGT 
CTATCAAAATTAACTTCCCTTCATAGGACACAATAGC<3TTGTTACr 1 'TT T TTAAAGGTATACCTACATATGAAAATAATTAATT 
TT ATCT AAATTTAATGTAAT ATG C!ATAAAAATACACCTGATGCTGTCCT AACT 
7 5 GTATTATTTCAATACATGGAGTTGGTTATTATTTTACTTGCAACCT 
TTTTTAATTTAGAAGAGACAAAAATGAGGACCT

GCCTGG CAACGT AGT AAAACCCCAT CT CT ACT AAAAAT ACAAAAAGTC CCAGCT ACT CAGGAGG CTGAGGTGGGGGAATCACTTGA 
GCTGGGGAAGCCAAGGCTGCAGTGAGCTGTGATTGCGCCACrGCACT 
AAAAAGTACAGCGTATATTCTATAGTAAAAAACATGTGGGCCATTACAT^ 
5 GGCCTCACCATCTAGAAGAAAAGAATCCTTACTTTCCTC^ 

T AAAGTT ACAAGTGGT AACCAAACAAGAAAAGGAG AACCGCTTAAC^^ CG CTGG CTCATAT AAAAAC CAG 

AAGTTTGAGAGTATCACAATCAGGCCACACTCCAACCTCCCAGAAAC(^ 
TCAGATCTTGAGAGAAGAAGGGGCCATTGTTTATCCrTAGTT^ 
GATTAGTGCAAAAGGAAATAGCAGCAATGTGGCATGATTTTCT 
1 0 AGCTGAAACTCACAGATGCCCCCAACCTCGCTGGACAGAGTAAGCTCAGCCI^ 
TTCCTCACAGTCTCACATTGGAAGGAGGGGCCCAGTTrGTGCAACACAACT 

AGGTT AT AGAAAATCAAGCTCAAAAGGAAT ATT AAG C T AAT CCAGAAAAGGGGT AAG AAAAGG CTCTG ACATG AGG A 
TTTAGGAGCTGCCACACACAAAGGGGCAGAGCAAAGTGAAAGTCTGAAGAAA^ 
TAGCAGCTATGCAAACCTTAAAAGAAAATAACATGCACAAATTAAAAATAAAATCC^ 
1 5 ACAAAATTTG AT AAG AG C CTG CATT CAATTTTIT AAAAAACT CAAG AACCG ATTGAAT AACCAAGAGAAAAT AACT AAAAAG AGTG 
AGAT AGAG AGAACCAG AGACAAAGAAAT AATTTG AGG AC CAAT AAACCT ACCTACAGG T AAATT AG AAAT AACAAG AAATAAAGCA 
GACCCAGCTAAACATTTAGTTCCCGAAAAAAAAGAAAGGGAGAGAGA 
TACTACAATCAGAGAAAAGATAATAGACITAGATGATAAAAAGTTAATTT^TA 

T ATTT AAAATCACACAAG AAAAAATT CTTTGAATGAAG ATCT AAATCTGCGCATT ACAATGGTGGAT AC^ CATTAT AT ATTTG 
2 0 TTAAACACCATACAATATACAACACCAAGAGGGAACCTCAATGCAAACTTTGGGCT^ 
CATTGTGTTT AAG AAAT AAAAGATGGCCXJGGTGTGGTGACTCCTGCCATC^ 
CAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACTCrc 
CCTGTAATCCCAGCTGCTCAGGAGGCTGAGGCAGGAGAATTGCT^ 
ATTGCACACCAGCCTGGGCAACAAGAGTGAAACTCCATCTCAAAAAAAG^ 

2 5 ATATATCCTTtSTCAAGTTTCTTAATATCTCAGTTAAAGAAAGTTTCCT 

GACAAAGGACTGGCTGTTCTCACATTTCTCCAAAACAACAT^ 
AAAGGGAAGTAGGATGGAGACACAATGTGCCCACCTTGATGTCACTGCAATA 

G CATTGAT AGATG CAG T ATGTGGGAT CACAT CTT CACAG AGTCCATCCAG CTCAT CAACAAAG AAAGT AGG AATGAGAT ATGG TGG 
T AGAAAGGT CTGG AGGTG CAT AAGGTATC CTTTTAAATGGAGAACTAAGA GTT ACAG AAATCAAA 

3 0 AATTTTAAACTTAACTATGTAAAACTCATATAACTAGGGAAACGTGAGAGGTGAGCT 

GTTCATGTATCATAGCAGAGTTACTGATACTGTCTGAAATTAAAGCTCATGGTTTAGAAGAAAT^ 
CACTTTTCTTTTCCTTT AAT CAAAGTGGGGTCATTTCAGAAATGAT ACATTTGTGGCAAAGAATCATTTCTAT 
CTTTAGATCCCATTTCAGAATTTTTTATTCTGTTAAATTTAAGTAGAATTAAAT 
ATGTATTATATTTAAAATGATGCCC^TTTTTATATCTATTGCCATGTGTACATA 

3 5 CGGT AATGATGATT ATGT CTGAAATTTGGG ATG TTT CT ATTT C CTTTG T G CTTTT ATTCTTTT AAGGGATTG C CT ATT AGGAATAC 

TTGCCACTGTTACCAAAAAAACTATTTTTTTAAATTAAACACT 

GGTACCGAGGTTTCCCATATACCCACTGACCTCACAC!ATGCACAGCCT 

TTACAATTGATGAACCTACTTTGACACATCATAATCACC 

TATGGTGTTTAACAAATGTATAGTGCTGTGTATCCACCATTGTAATATCATACAG 

4 0 CTCCTTCTATTCATCCTGCCCTCCCCACCAGCCCATGGCAGCCACTGATCT^ 

ATCACCTGTTGAAATTATACAGTATGTAGCCTTTTCAGAGGGC 

TTCATAACTTGCTAGTGCATTTCATTTTAGTGTGGAACAATATTCC^ 

CTGAATGACATCITGTTTGTGTCCAAGTTTTGGCAATAAACTT 

TGAAC AT AAGGTTT G AACTCCTTTGGGT AAAT AC CAAG AAG CACAATTACTGGG TACT ACTTT AGG T AAATAC C AACTACTTTGGG 

4 5 TAAATACCAAGAAGGACAATCCTCTTATAATTACTCATGT^ 

ACAGTGGCTGTACCATCrTGCATTCACAACAGCMTGAA 
GTGTTCCGAATTTGGGCCATACTGATATCTTACTGTCTTCATTTC 
AGGC^TGGTGGCTCACGCCTGTAATCCCAGCATITTGGGAGGCCAAG^ 
GCGAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGCATGGT 

5 0 GAGGCTGAGGCAGGAGAATGGCGTGAGCCCAGGAGGCAGAGCTT^ 

ACACAGCAGTGTAAAGGAAGGAGGTTGTAGATGGATCTAGAAT^ 

TCTTTGTGCAGXjCTGGGTTTGCTTTGTACACACTGGTC 

AGACATTCTGCAGGCTCCATCTCAGGAACAATGGCTATTTTT^ 

5 5 ACCATTATCAAAATCTTCCATTTATGTTTGTTAAAGCIAACCTA^ 

GTGG ATGAACTG TAACCTAG CTT AATAGGCAGACAAGATTG AAAA CCT AATTT AC<3 AGTATGTGCCTTT AACAATAG CTGAGT CTT 
GGCCAATCCCAGTGGCCATACTTCAACCATTCATACACTC3CTGAGTC 
CCAATCCAGTTGTTTCTCTGCCTTACCTCCAATTTCTGTATGTCACT 
ATCCCTGGAGTCTCTX3AATCCGCTGTGATTCTGGAAGCT 

6 0 AATTCTG CTGAAGTTTTCITTTAACAGGTTT AG AAAAAATAAT^ CCTGGAAGCAGAAAAG 

GCTGGGGGCTCCAATAAGTGTAAATAGTCCCATCCCTATATrTTCTCCATGGCAATTA 
TTGCTTCTCGCATTTTGGCTTAGGGTAAAGCTTTTTAAAAC^ 

GTGGG AACATTTT AAAGAGX3 AATGT CCAAAAGGAAAAGGGGG ATGGG TT G GG AGAAGGGT ATC AGG CGGGT AT CT C AAAACCATT C 
TTAGGGCTATAGGTTTAATTTATTTGGTTGTCGACGTCAGAGCCGTCATGGT^ 
65 CCTTCAGAAGCAGCCTGCCCCATTCCCCACT 

CCTCCCGAGTGCTCCAGCACAGTAGCTTGGAGCTTGTTGGTTT^ 
CTCTTCCCCGGCACTGCATAGCAAAAGGAAAGGGCCGCTGGGTGTCTGT^ 

GGAGGAGGGGGACCGATGCX3GTCCATGTCCCGGGCAGCCCCACCTTCTCTGCCTGCGAAGGGCCCTTGTC 
GCGCGCCCC^CCCGGGCTCCTCTACACCTGCCGCCGCC^ 

7 0 AGCTCCGGGCTCATGGGCGCGGTCAGCAGGGCGGGCCAGGGCGGCGGGGCGCGACACTGG^ 

CGCGTTAAGGAAGTTGCCGAAAATGAGGAAGAGCCGCGGGCCO^ 
GCGGGTGGCCCCGCGCTGCt3CCCGCCCTCGCCTCACCTGGCGCAGGTAGGTGTGGCCX3 
TAAGGAGAGGAGGTTACGGGGAACGACGCX3CTGCTTTCATGCCCTTTCTTC 
CCATGTGAATAAAATACAGGTGGGTTCCGCCAGCTTCGCTCCTGAACCTAC 
7 5 ggGGCTCACCCCTGCCCGGAGGGGACGGAGGTCAGAC^^ 
CGCGG AAAGTG CGGTCGCGGCCCGGCCCTCGGGAGACGCGGGATTGGG ATCAGG CACAGCGCG AGG AAGTCGATCTTGGAGCTAG A
AC ATTTT CCTTTGG CCATTTAC ACGAATC CACTGGAAAATGCCGCAGTt^TTT AT CAAAGTT ACTCAAAGT AGAAATGT CCAGACGT 
CTTATGAGCTTAGACAAATCTTTTACTAC^AAAAGAAACAGC^ 
AGCATAATTACTCTTGTGGATACATTTTCATTGTCAAGTAATTTACT^ 
5 AAAATGTAAATTTGAGTTAAGAGTTAAGGGTG'irrC'r i I f TGTCTGTTTGTCTGTTTTTGGCAATGTGGCTCC^AAACCT^ 

CACCTAAAAACTATATAAATGCAATCC ATTCT T f TGTTC 

(J 1"! mT rA CGTTGGAAACAGTGTTGCAAAATA' lTT T T TGAGTTTTGCCTGACrTAGCAAAGATTCAGTCGAACTCAAGTAGAGTT 

CAAAGCATTGCAGCGTGTAGTAAAAAAAAAAAAAAAAAAAAAAAAAAGAGTTG 

AAAAT AAAAAAAGGAACCATTTAAT CCCT AAG AGTTCCT 
1 0 GGACAAATTCATTGCCTTCACACAGGCAGCATTTTAATTAAATG 

CGAGCAGTGCATAGATAGAATTATATTTTCTCAAAAAGGCTTAAATAC^ 

CAGTTtlAAATTATTTTATCAAATCATTTTTG 

AATTGATTTGTATTATCCCCTGTATTTGAAGTGCACrTTAAA^ 

CTTATCCTCTTACTCTTCTACCTTGGATTTGAAAAGGTTG 
1 5 GGAAGATTTTTTTTTAAGCCTCreTATTTGAAGTC 

TAGGATAAAAATGGGGTGTATACTTrAGGTCTTTATCTTTCCTGAGAGATGAAATTGGT 

ACTTrAGAAATGAATCACCCTGATLTTTGTTAGGCCCCTATTTAATATTCT 

GAAGACTTCTTAGTAGAACTTCACnTATTTAGCACATTGTT^ 

GAATTCAAGCTTTTTCAGAATTTGGAATTAAAATTTAGAAT 
2 0 T AAACAT ACATG TTGACCCTG CTGGTT ACATGC CATT CC AAT AT AGTCTG G AG AG AAT C CTGT AATTC CATG CATT AAT ATTT CTG 

TACTTTTACIATTTAAGAACATGAATATTCTCACTAAGTGGAAT 

TTATCACTACGAGAGCTTCTCTAAGACTTACCAAAAAAAAAAAAAGTGT^ 

TGAGTCTGTATTAGACTCCACTGAACTACGGTTTCCTTTTAAAAGGAAAG 

AATT AG AAG GG CAGTT C CTCTCTCTTT AGGG AG TCTT AAAT AG CAAGATTTT ACTG AATTTT CAAT CATAC CG ATCAAAAT AG AGG 

2 5 TACTTCAAGTAAGAGTGGATGTCTCATCTTTGTAATATAATTTCAAACAGCATA^ 

AGCAGAAGTTACTTACAGGAGGGGTTACTGTCACCCTrTACAGCrGCAGTCTG 

GCAACCAGCTGGGTTTGTGTCAGTTATCTAAGCCTAAGTGATGGCTAATT^ 

CAGCTACCCCCTGACTTTTCTACTCTTGCITT^ 

AGAAAACAAC AACAACAACAGG AATITAGG CCITTGGGG AAGAATTTGTTT^ TTTT AG AAGTCT ACACATTGAG CATT ACTTTTT 

3 0 CTT CTAAAGCAACTAAAATCAG AAGG CAG AGTCCTAATG AGAGATTATG ATCAAATTGCCTCATTTGTGTCTTATCACCTTCCT 

TCTACTTAGATTTGAACAGGTTGGATTTGAATACTAGGA 
AGTCTGATTGGATTTTAGGAAAGTTCATTCATTTTGATATGATGTTATCAGG^ 

AATATTAGCTTGTATCAATTTGAGG'l''l"rT , r i r 1'ITCCTTTATGAATCAATTGTGCAAGGTTTGCCCTACATCTCTTAAAAGGTGGTT 
AAACTGGAATCTTAAAGAC^GTTGAATTTAGCCAAGACCAAAGATATGCAAAGAAAGAATATm 

3 5 ATAAATCCAAATGGCTCCCTTGATTCTTTTCTTTCTGGGACIACCACGTATT 

GAGGAAGCGAGGACAGTTAAGAATGTTGATGATTACTGGATTACT 

CTAGCTCTGAAACTGGGATCCTtKSCTCTG AGCTGG AGAGGCG CTTC CAGTT ACTGCTGTCCTGG AG CGCTGGAAGGGAGCATAAGT 

GTGAGGAGACAGTGGGCTTTTAGCCTGGGATGCAGATGAGAGATGTCTTGACCT 

ACCGAGCAGGGTGAGGGAGGCTCTCGACCTGGATACrGTGAACTAGGGAGAGAAAGTAAATTCAACAG 

4 0 AGATTTAGGAATATCCACAGTTTAAATATTGCTTTACTTAAGT 

GACCTTTCATlTCTCCCCrCCATTGGTTrGTCTTAACrGATAAAT 
CATCTATCAGTTCCTGTTGAGTTGACATGAAAATGTTTTGGTGTGACCT 
TGAGTTCCTCTTTTCATTCTGCCCATCCTTTGGTACTGCTTTGAGTTATGTCCT 
TATTTTCAAAGTTTTTTGGTGTACT AATTTT AAACAGTG 

4 5 ACCTTCTACTCCTTCTCACACCATCCCAGGATAATTATTCTCrCATTTGAGATTAAGAAGTTA 

TATAAATATAGTTTACTGCTAAAGCAAGGTATATCATTCCATTTTCTATCTCGTACCCCTTT^ 
GCTTTCTTTTTGTGTTTTCTTAGTTTGCACTGTGCCTATT 
ACACAATTTTCTACAGTGTCAGATATACCAAGTAATCTCACGGTTCCATTT^ 
TTGTCTAACAGTTGCTCTTCAGTCTTGCTATACATACAGTA^^ 

5 0 TTTTTTTTTTCTTTTTCTTTTTTGTAGAGATGGGCrCTCGCT 

CTTGCCTC^GCCTCCCAAAGTGCTGGGATTATAGGCATGATCCACTGTGCCCAGCCCrA 
TTTATGTAAATTGCTTATCATACrCTTTGTTGTAACATCTGTGGCATTAA 

TTGTAAATCCACTGTCATAATTATTGAAAATATTTCTCCAGAGCTTTTTAATTCTATGGTTATTT 
TTATGATTTCTTCCATCAGGGTGCATTCTGAATCATGCTTAGTGAGA 

5 5 TTTACATCAGTGTCAATGTCACTTTTGTATTATATACGACACCAACAT^ 

AATGGAGATCTCATGAGACTTTAAGCAGATTGCTGGCCTTCTCCTCGAT 
AGTGGAATTCCCTGGGAAGCITGCTCAAGTAAGTAAGCATGAGAAATGCTGAATGAAAT 
ATGGGCATCTTCAGAGCCTTTGTGCAGGTAACATGCAC^AGATTCTCTGAGAGAG 
TGACCCTGCAGTGCGTTTGTTCTTAGAGTGGCTTTCAGGGCTC<;CT 

6 0 TGCTCCTTACCAAGCTCCTTAGGCTTTGCTTGGTAGAGGCAGAATGTTGTGGTT^ 

TATGTGGTGGATACACTGCCTCAAAGTGACATTTATCGAATATTTTTGTAGCTACGGGTT 

GGTTC!AAGTC C CCCTTCTTT ATTTGT AAGCTCTTCTTTTGTAA^ CAGT CACAGGGGG CTTCACTCTG AT AC CT AATGGAGA 

CCGTGCCTGCAGTGCTCGGTACCTGGTATGTGGTTAAAACTGACATAACTGTC^ 

CTGCCCTGCAGTCTTGGAGGGAAGGCACGTCCAGATAGGTTTGATGAGCAACTACAATTGCCAAA^ 

6 5 GATGGGAGAACTTTCTAATATTTTCTTTTAGGAAAGTTGTCCATTGCATAACTGATACACGAGTA 

CAGATACAGCGAGCAACAGAAAAGCCCTTCTTAACCrCCACCCTCAGTCCCAGCCC 
GGAGCTCTAACCTCTCTTCTrrCGTCCTGTGCATT'CATGTGTGCATC 
GTCTTCTTAATACCTTTATCACTCTGGAACrTTCT^ 
TAATTTTTATGACAGTCCAGAATAGGCATGTCGA ri ' T TTTTTAATTm 

7 0 TTTTCIATTCTTTCCCTCTCCTTTGCCATTTGCACTG 

CCCTGTCCATCCTGTTCTTCAGTCCATCCACTTAGTTTT^ 
TTACTTCTTTGGGATTTTTTTAAGTGATGTATTAGTTTT 

GGGGTATCAGTTCATCTGCTTGCATGCTTGGTGCCTCTCTCTCCTGCTGTCAGCTTTCCT 
GTTGGTTrTTTTTTTTTTTTAGGTACCCrAGGCATTAGTCTTTTAAGGCT 
7 5 GGGCAAGACCTGGATGTXSCACAGGGCAGGATGGCCCC^^ 




WO 03/008583 



PCT/US01/51291



GGG T ATGGC CATG CCTT CC CTATTCAAACG AAGCCACACAC CTG T CTTTT CCTG CG AGGCGTCTTCTG CTGCTTTCTTGGAG CTGG 
GGAGCATCTACTCTTCCCTTGTCCTCCCX3CTCCTCCCTGTTCCTCCCTCJVGGTCTCTC 
TTCTCCCACCTTGCACCCCCCACACTGCTGCCTGCCTC 
ATGCCTTTCAAGGAGAGCATTCATTGCCAAGCTACTCATGGAACATCTCCT 
5 ATACTGTTTTTCATGTTrCriCCCCITCTCTGCATTTATAGCrGTGCATATG 
ACAACACCAATCAGCTGATGGTCATGGCTACGTGAAGGAACTCACAGTGAG 
GCAGGCCTTGGCGTTAGGTTCCCCAAATCACTGCTGGTGATCTTGTAGCTCT 
GGGTCCAAATTCATAGATTCTTAGAAGCCTCTTTAAATATGCAGA^ 
AAGGGATTTGTCCAACCACTGAGTGATGCTGCTGGGATTAGAACTCTCAAAACT 
1 0 CCACTACTCTAATAAGGCGTAAAAGAGGATCTGCTGTCCTC 

GCCTGGAGAGGCAGAGGGAGGACACAGGCTCAGTGTCAGACATC 

ATCTTTGTGGCTTTTG CATTGAGAAGTT AAG CATAT ACAAT AGG AATT AGACACAATC CCAAAC CAATGAGAGACTCAAGTGAGAA 

GAGCTGAAGCTTTTATGCATGCCTTCCTTCC^TAGAGATAAATTTACTATCCAG 

GAGCCCCTAGCTGTGAGTGTCTCTGTCCT^CCCACATCAGACCrCCATGTCTG 
1 5 TCTG CTTGTGTTG CTGTCG TG AGGGTTGGG AGTGG ACAGGTGTGGACAGATCT CAGAC CCGTTTC C CT CCTGTTATGG AAATG TTT 

GTGAGGCAGGTGGGACTTCAATGATTTGGTCAAGGAAAGGCA^ 

TTGTGAGGGATGATTTCCCCAAAGAAGCCAGCAGTGTGTAAG^ 

TGTCAGTGTCTGTTATGGAGGAATGGAGTATCAGTCCGATCA^ 

CAATGTGTCGCCTCCCAACTCTGAGCCAAAGAGACTGAGGTCAGGATGTCCCAGGG 
2 0 GTGCCCCCAGCATCTCCTTGGCTTGTGGGTGACCATCTCCAGCTCCACATG^ 

CTCCTTTTCATGTAAGGACACCAGTCATGTTGGAGTAGGGCT 

CTGTCTTTAAATAAGGTCACACTCAGAGGGGTTAGGGAGTTAGGACATCAACA 

AAATGTGTGAGTGAAGACTGGCACCTCCCCACGATGGCCTTT 

ATGGACAGATGTCTAAGGCCCCTGAAATGAGCCTGCTGTACAGATGATTCCAGAACTCTGTCAGTGTGA 

2 5 GCACATTAAAAT ACTT ACGGTAAAGTG 1TTCT1T CACTTT AG ATGTGCAAAAA CT^ CAG 

GGAAGAAATGTAAGATAAAGGTGTTAAGGGAGACAAACTCCACAAAATTGTAAAGTGAAGCTGCTG 
CAGATAATATGCAGGCATGGGTCCAAGGGAGGCTTTGCACACCAGGCGGGGGAGGTGCAGGGCCAT^ 
ACGGCAAATCGTTTG<5AAAAAAAAACAAAACTTTGC^ 
TTGAAATAATGAAGCTATTAAAAGAAAACTTAGGTGA 

3 0 AATTTAT ATTCACACT CAT CACAAGATTG AT AAGCATG ATTT C CT AAAATTTT AAG CAAAT CCCCAT AAAATG AAAAAAT AT ACAA 

ATAAGTGAAAAAAATGGTAAGATATATTTAATTTCAATACTTGTGGTAAAACATCAATAGCTTTAA 
ATCAATAAGAAAAAAGACACAGATGGGTGCTGACAGGGCCGTTTGGTAATATCTTTT 
ACAACTTCACTTTTAGGAACCTATCACACATATAAAT^GACAAAA^ 
AAAAAAAATCACTAAGGGAAGTAC^GTTGATTCATATGTTGGAAT^ 

3 5 CCAAATATAACAATGTCCAAGTTCTACTAGTGAAAAAGCAAG&TC 

GGGTGCAATGACTCATGCCTGTAATTCCAGCACTTTGGGAAGCCAAGGTGGGTG 

TGGGCAATATGGCAAAACCCCACCTCTATGACAAATACCAAAAAA^ 

CAGCTACATGGGGGCGCTGAGGTGGGGGGATCACTTGAGTCCAAGAGGTCAAGGCT 

CAGCCTGGGTGACAGAGCAAGACCCTGTCTCAAAAATAAGTGAAAAAAA CCACATG CACATAATAA 

4 0 TTTAAGTGATATTGGATTGTCGTTTACTACTTATATGTGCATTTCTGCCTATA 

AAAT ATG TAG CATGAT CTGG AAGTG T AAG CAC CAAATT AAG AACAGCAG TTTTT AGGGGGAGGGG AAT AGAATTGGTTTAGAGAAA 
AAGG TGAAATT AACATTTTT ATTCTG CCT ACTTCAGTG TGCCTG C AGT ATTTACAATGAT CAGGAGTGTG T ACT1TTTTCAATTT A 
GAAAAATGATAATAATAAGGTTTCCTTGGGGGGAAAG 
CATCATGACIGCTGGCACCATTCTGTCCTCCAATGAAGAGCGAGGCT 

4 5 GGTCTCAAACATGCTTGCACCCTGCTGTCACTCW5GTCCTGAAGCC 

CCTGCCTTCAGATGTTTCTTAAAGCCACTCCTACTCCACCCC^ 
GTTCATAGGATTTCATAGCAC^CAAACACATCTTTAAAGGCT 
AATGGCAGCTGTGTGTGCCTTTGACTGCAAAGACACTGGGCAGGCCAGAGTGGCT 
TGTGTGTGTGTGTGTGTGTGTGTGAGATTGAGAGAGAGAGGGCTCGTGTTTTTCTGT^ 

5 0 ATTCTCCTAAGGTTTGCCACrcrGCAGAACTTTTG^ 

AGACTTGCAGGAGGTGAATGCACATGAAATAATTCTCAGAGGAAGAATATGTGCACTGGT 
TTTCTACACAAGAGGATTTGAATGGTTTTCGTCrrTAAGAT^ 
ACCATTTGTCATCTGCATGCTTCATGAGAAATGTTrTTTTCTCGTGA 
TCCATAGTTTCTTGGTCTCTGTGGGTTGGCTTCTGGGCCT 

5 5 ATTAGGGGATGGGCCGCTGCTGCTACCrCCCCACCCa^GCCACCrCCATCAGCACT 

CTCATCGCCTGCCCCCATGTTTACACTTGTTAATAAAATCT^ 
GCTCTAAAGCCCTCTCTGGGGGCAGAGTGGTTGTCAGAAGCAGA 

GGGGGCTGGTAGTACCCCATCTGCAAAATGAGGTTAAAAATACCTAACCCTTGGGTTGC^ 
AGCZACCAATCAGACTGGCTCAGAAAGAATAGGCCCTCTC 

6 0 GGTTACCAGGGAATTGAAGACAGGGTTCTTTTAGAAGCAA 

CAGTATCATAAGCAAATAAGCTGTGTCTAGATCCTAATCATGTGTTTAGGGCTCTGCTGGGTTTAA 
CGTTGTCCTCCACTCTCACATCCCCAGTCACTAGACAAGGTCACTCC^ 

TTGCATGACTTTGGCCCTTGCT CTTTT AAAATGTCCAGTTAGAGACCATGGCATGGAGAGTTT^ 

CACCCTCTTTGCATATTGTGGG CACTCTTT AAAGTTATTTCTCATATTGTAATAT ATC71G ATATGACTTGGATATAGT CAACT AGAT 

6 5 ATTTAGTGAAAAATGTATCACCCCCATTATCAAAAGAAAACCCAAAGCCTTC 

GGGAAGTACCAGGACCACACAGACTCTCGTCCTGGCGGTGTGTTTATAAGATGCA^ 
AAGGCTGTTC^TTATGGAACCACTGGCCACATGTGGCCACTGACCGCTCAAT^ 
ATTTAAAATTTAAAATTCAGTTCTC^GTCACAC^GGCCAGGTGTCAGGGAC 
CAGAGTAGAACAGGTATTTCCATTGCreTAGCAAGGTCrGCCAGACACCAAGCTGCT 
. 7 0 GGATATGAACTTCCACCCACCTCTCCCCTTTCCCAGTTC 

GACATGAGACCTTGAGTCTTGTCCATGGGGCCCTG^ 

ACACGTCCACACACACACACACACACACACJVCACACAAAATAATGT^ 

TTTCCTTGCAGCAGCCCTCCAAGCGGTAACACIAGCCTTGGAG 

7 5 AGAATAGTTTCTG AAAGAACAAACCAGCTTTTT^ 
TTCGGTTAAAAAAAAAAAATCCAACATAGTT^

AGGTAACGTGTTTCTC ACGAAGG CAAAACAAAAAAAT AAG ATG CCAAGCAAG TGG C CCTTG T AC CTT CAAGACCAAGAAAT AAGGC 
GTTTGTTTCTGGGAATGCCTTCCATTTGCrrCATTTTGTAAGT 
AGCAGAGGTGGGGTCCTGCCACTC^CCCCTCGGCTCCAACCTTGCCT 
5 CTCTGGAGTTCCTGTGCTGCTTAATTTGAGTAGAGATAAAAG^ 

TCCTTTCATGATTCAGCCTGACTCCAGGAAGTGCAGTCACGTGCTGCCCT 
TCCACTTACTGCAAAAGGCGCGATC^^ATTCTTCTTCCTGAATTATTC 
ACTTTCATTCACTAAAAATGrrTTCCTGAATGATGCTTTT 
CTCTGTGCTTGTTCCCCTCTGTTTGGGAGCCTGCGCTGCTC 
1 0 AACCITCCACAT AGAGGTGAT AATTCT AGAT AATTCT AGAG CAACTGCTTCT CT AT CTGGCTGT ATGCT AAGCTCT CT AGAGAG CT 
TTACAAGGGAGAGATTCTGAGTGTTAGGGCTGGGAAAA 
TCTCAGCTGGAGCITTTGATTGTTTTTTGTTTGTTTG 
GGAGCITGTTCTGTTTAGCGATCGTGAGAACCACCTAGGCA^ 

GGGCATTGG CAATTTT AGGACACATCT AGATGTGTCCCAAG CAAT CTGAT ATTTTGATGTTT AT CTT ACAG AGAAG CATGTGTTT A 
1 5 GAAGGCAGATAGCTGTGCTTCTGTTCTTCTGATTTTCGACAGAGA^ 
TCGTTCACTGCCAGCCCTGAGTCTAGCTGGACTATATGCCTCT 
TCTTCTGTCTTCTGGCTCGTTGTGTTrCCAAGGAGAAGCTTGAGGC^ 

CAT AATTATGGGGT ACACAGTGAT ATT AT AT CTGT ACAAGGTGG AAGGGTTGAATCAAG CCAATTAACATACC CATCAT CAT AAAT 
GCTTATCATTAATCCTCCrCTCTATCTGCAACTTTGTAGTC^ 

TTGACTTATTTCACTTTAGATAACTTC 

AGTATTCCATTGTGTGTGTGCCACATTTTCCTTAGCCGTTTATTCCTTGG 

GAATAGTGCTGCAGTGAAGCATGGGAGTACAGATATCTTTTCAACAT 

TTTGTTTTGTTTTGTTTTTTGCA^ 

2 5 AATTCTTCTTCrTCCAGTGTGGCCCAGG AAAG CCAAAAGATTGCACATCCCTGCTCTAGAGGTT CCTGGATAAGATGCTCTTTTAA 

CCCCTGTAGACAAGACTlTTrTTATTTCTGGAAATTTTATAACTTTAAACATGTGTTTTACT 

GGGACATTAATTTTATATATTTTAGATCACCCCTGTC^ 

CATTTTTCTCAAGCTCCACAGATCCTGTCCCCCATGCTGCCT 

CAACAT-rrcrrTTcrGGCCTTTTcrccrGATcrcr^ 

3 0 TTGAGCTGTTATTTTAACCAAGTGATTATTCTGTTAAGTTCTTTGATTCCATATCT 

TGGGACATTTTTCTCATGAAAGTCCTTTGCCATTTGTTTCTC 
CCAAGGTAGGATCCTCATCTTTGAAATAGATCATGGAAGGGCCTTGG^ 

TAATTTGTTGTGTGTGTGTGTGTGTGTGTGGAGAGAGAGAGAGAGAGAGAGAGAGGGTGAAAGAGAGAGAATGAAT^ 
CATATTTCTCCCCAGTGAGGCTGCCAGCCATTCAGTCTCCTCTCTTCAAC^ 

3 5 ATCCTTCCACTCTGCAGGC CTTGTCTGAAGTCTACAG CAGCCAGTCCCCACT AAGGGTCTTCATGATCCCAGGGCAAACATAG CTG 

CATTGGGCTCCGCTTGTGGCATGTTATTGCTCAGGATCC^GGTGCCCT^CT 
GGGTCATTTCAGAGAGGAGGGGGCAGTATCTCTGCCAACACCCAAGCAATCT 
AGTATGTGGCATTGAAACCAATGGTTAAAAATCAACAAATAAGCAGGAAACAACT^ 
CTTAGCTTCGACCAGCCTCTCTGATGGTCAACCAGGCCATTTTGTT^ 

4 0 CCACGTGTC^VGGAGCAGGGTGAGGTATGGGGGCTGTCGCAGCAGCTGAGATGCTT 

TAAGGAAGCATGGGAGCCCAGCATCAGGCn'CTGCTCATGCATAATCTCATTTATTAGTGACAGCAACC^ 
AAAAAAATTATAAAATACAGAACAGTGCAGTACCACCCIAAATTAATAAATGTTGTT^ 
AAAGAACAGTCCCAATAAAGTGGAAGCTATAACCTTTTATGGATGAGGAAGTG 
AGCAGCAGGCCTGGCTCAGGCCCCTCTGGTGCCCCTGAT^ 

4 5 CTGTGCAGGTTGGCGACTGCAAATATCAACCCCTCTAATAGGAAACCAGGC^GATTTTTCTCATGCT 

ATGTAAATAAAACCTATGTTTTTGAGAGTAGCAGGAAACAT^ 

GCTGAGACTGATATTATAATTTTCCAAGTACTAGTATCTGTGTATTGATTTTTACAG 

GTGACCACTGAGGCCCTTC^TGAATATCTTAGAAGTCACTCAAAATGGATAAAGAATGAATGTGAG 

TGAGATAATATATGGTTGATCCrCAGTATTCATGGCTTCTGAATTTGTGAATTTGCCTA 

5 0 TCCCCAAATCAATAACCATGATGCTTTCAAGACCA 

ACATG C CC AAGACTGGGAAG AAAAAG AGGTTT AATTGG AATT ACAGTT C C AC ATGGCTGGGGAGGCCT CAG AATCATGGC AGG AGG 
TGAAAGGCACTTCTT ACATGGTGG CAG CAAG AGAAAATG AGG AAG AAG CAAAAGAGG AAAC CC CTG AT AAATCCATGAGATCTTGT 
AAGACTTAATCACTATTACAAGAATAGCATGGAAAAGACT 
CATGTGGGAATTCTGGGAGGTACAATTCAAGTTGAGATTTGGGTGGG^ 

5 5 AATCTCATGTCCTCAAATTTCAAAACCAATCATGCCTT 

TCCAGAGTCCAAGTTTCATCTGAGACAAGGCAAGTCCCTTC 
ATACAATGGGGGTACAGGTATTGGGTAAATACACCCATTCCAACTGC^ 
AAGCTGAAATCCAGCAGTGCAGTCAATTTTTAAGCTCCAAGATGATCTCCT^ 
C!AAGAGATGGGTTCCCATGGTCTTGGACAGCTTCATCCCTGTGGCTTTGCAGTGTACAGC 

6 0 GGCATTGAGTGTCTGTGGCTTTTCCAGGCACATGGTGCAAGCTGTCAGTGGAT 

CTACTCACGG CTCCACTAGGCAGTGCCCCAGTAGGGACTCTGTGTGGG AGCTGTGACC CTGTCCTAGC 
AGAGGTTATCCATGAGGGCCCTGCCCCTGCAGCGAACTTTTGCCTGGTCATCCAGGCAT^ 
GAGGTTCCCAAACCTCAATTCTTGACTTCTGTGCACTCGCAC^CTCAACACCACGTGGA 
TCTGAAGCCACAGCCTGAGCTCTACGGTGGCCCCTTTCAGCC^ 

6 5 CACACTGCATGAGGACCCTGGGCCCAGGCCACGAAACCACTTTTTCCT 

GAGGTCTCTGACAT^CCTGGAGACATCTTCCCCATGGTCHTGGG 
GCTGAGTTGAATTTCTCCCCAGAAAATGGATATTTCTTTTCrATTA 
TTTTTTTITTAATAAAACTGAATGACTTTAAC^GTCCCCAAGTCACCTCT 
ATACCCTAAATCATCGCTCrCAAGTTCAAAGTTTCACAGATCT 

7 0 AACAAGAGTCACCTTTACTCCAGTrCCTAACAAGTTT 

TTATTAGCATTTTTGTCAAAACCATTCAACAAGTCTCTAGGAGGTTC 
CIAAACTGTTCCAGCCTCTGCCTGTTACCCAGTTCCAA^ 
GGTACCAATTTACTCTATTTGTTTATTTTCACATTGCTGATAAAGACAT 
GCTGGGAGGCCCCAjGAATCATGGAGGGAGGTGAAAGGCACT^ 
75 GAAACCCCTGATAAACCC?.TTAGATCTCATGAGACT 
TTACCTCCCCGTGGGTCCCTCTCACAACATGTGGGAA^
CCATATCACAGAGTGGCACAAAATTTTAGCCACATGCA^ 
GTCITTTTCCCCAGTCTACrrTAGTGCCXSTATTTTTl' G CATTTTTC 
CCCCAGCTGAAGTGCTATCKMTGTCCCTAGGTGTAAGAAGGCTGTGCTGTGCCT^ 
5 CTGTCAG<3CATGAGTCATAGTTAGTGCTGTCGGCCGTGACT 
CACAGAAAGACATAAAAGAAGTGATGCATTGATO^ 
GCGATGGGCTACTACTTGCTAATTCAGTTTTGGGGGTGACT 
CTTLTlTlACAAAGCaAGGAAATAATTT^ 

L'ltjl'Tl'lATAACACAGAGAAAGTCTCTTTGCCTTACATTTTATATTAGT^ 
1 0 GTGACTGGCACTGTTTTGGTACCAAAAGGGCATTGCAAAAAAAGTCCAGAAGTAG 
CTCATCAATGTGATATTTCGTTTAAACTTAGTAGCAAGCCCATTGAATTCTCAAA^ 
TTATGAAAGAATGATTTTTCATAGTCAATATTTCTAAAATATAATGTCTCTTTCT 
ACITGCACCTGCGGATGCTGGAGATAGGTCATGATTTA^ 

CCCACAGGATGAGTCCAGGTGATATTGGGCACACIGCTCACTCAGAGGCTATCAGACAGAAAGCT 
1 5 CAAGCATTGCCTTCAGTGGGAGTCTTACTTGCAGTCC^ 

ctctttgaataacactgtagacatagctacc^gaaaataaaagaaatagaaaggcatga 

tgtgaagatctgagaaagagctcctcatggccctggcatgggagggaagag 

acaatggttactctccaagggaaaacactttgcaagagc 

cctcatcactttatcacttaaaggacaaaaaactcatagtcaacaggggtcagcgcttca 

2 0 ggtcgaatatccaagaagacataacaatcctaaatgtgtatgtgcataacaacagagcatcaaaa 

agactgaaaggag aagt agacaaat cctgtctt at agttag ag atttt aatct ctttct cacaaaag ttgat ag accaaat ag at a 
ggataaaaatgacctaagcagtactgtgactcaacttgatctaattgacatttataaaat^ 
cattcritctcaagctcacttggaatattcaccaagatagagcacattcttgga 
ataccaattatgttctcagaacacagtataattaaactagatatcaacaac 
25 ctggaaatcaaacaacagacttctatataacctatgtttccaagaagaagt 

aaataaaaatataatttatcaaaaattttggaatgtagcaaaagcagtgcttatagag 
ggtaaaaaatatataaagaaaattacttgtgattctacitttaagaa^ 
aagaaaagaaataattaaaatgaaagcagaaatcattgaa^ 
ggttcnttgaaatcatcaaccaaattgataaaccttc 

3 0 tcagaaatgaaggaaaggtcataactgttgatctaatggatataaaaagatgataaaggaagat 

aatgtggtaacrraggtgaaataaactaatgctitgagtgacacaaactactaaaactta 
ctatatctttaaaatttaaattaatataacctttaaaacagaaatcatc^ 

ctt aag aaag aaagaat ac cag ttttc cataat ct cttc cagaaaat ag aaatggag g aaacagttt ct aatt catttt atg agtt 
cagatttcacgtaataccaaaaccagttaaggcattacaagaaaa 

3 5 tgttagaaaatcaaacccaacaacgtatgaaaagagctatgtaccacagccaag 

aacatttgaaaataaattaatgtgatacatcacatcaatgggctagagaagaaaaatc^ 

gg catttg acaaactcaaaacact ctttcatgtcaaaaaaaaaat ctt aaact aggaat ag aggaaaacttttg t aacttg at aaa 

gagcattaaaaaaaaaaacctacaggtaacatcatacitaatggtgagagactg 

agactgtcctctcttaccactctgttcaacatcct^ 

4 0 gaaaagaagaaataagacatcrittttcacagataatatgcatgtccatatggagaacttcaaa 

aaccaataagtgagtctacaggatgctagatgaatatacaaaagtcggttgct 
gaaataagaataattaccatgtataatagcaccaaaaagacaaaaaaaaaa^ 
aatatgtacaggatctgtatggagaaaactacaaaataatgatggaagaa 
gttcgtacattggaagactcaatgttgttaaaatgttcattctgccaagcttc 

4 5 ccagcaagctgttttgtagctaatgatgaaatgattctgaactttacttg 

cactaaaaaggacaaagttggaggactcacactactcaatttouigtcttact 
aacagacattcatcaaaaarrcacatgctagaaagtaaaggtaaattaagtcaaa^ 
ttagttttgactgtggggacagggacatgtttatcctgagctcaaaggg 
gccacagcttagaacatgtaggaagggagcaaggtgggggcattgagagccagag 

5 0 ttatgtggttttggagacaagtaggcatgtgtgggaggctgagacct 

gtagagaagggaatgatctaagagagctrggattttcatcttcataaatc 
ctttaaaatacaaaacctagagcttgaccccagagt^ 
tgtttcctggtgatgctgagattgctggtccagagacgcacatt^ 
caggaaagccccaagcitac^tttggatgggttgact 

attgaggataacaatagtatctaccattagggatgttgtgcacatcagttgataaccittgagtcaagta 

TTTGCATTATCCTCATCACCTTGATCATCATGGCC^ 

TATCTTCTATAAAATGAGAACTTGTTTTTCATGAAGCTGTCATTTTCACAAGTTTGGGGTGCA 
TTTCTGAACACTTCCCTCGCGAGGCTCCTAACCCCACCCC^GTAAACTCATCAA 

6 0 AGGGTTCCTTTTTATCITGCGTTTCCAGGGATGTrTC^ 

TGTGGAGGATGTCACAGACTGCAGTGGGAAATTAAAAAGCTGCTTGCTT 
ACCCTGAGGGAGCTGGTCAGAGGTCCTCTGTAAAGTTGCATCT 
ACA.GGGACAGAGAGAGCTGGCACCCTTCTTTCCTCTGCCCTACACATTGGCCTTTCCG 
TCAGTCTCCTGGCTGGCAATTAGCATCrTGAGGTCTC 

6 5 TTGAATTTG ATACATGTTATGATGT AGGACTTTTTAAAAAAACATAAAACAATCAGGAAAACCT CG CTATACCCATTCCAGCAATT 

CAAGACCAAGTTCATTTTTGGTAAAAACACTTGGGAAAGGGCATTTCCAGGCTGGCT 

C^CTCCCTCCTGAACTGGTGTGTGTTCTGGAGGGATCCATGTCT 

CGTGCTGCTGGGGATCTCACAC1TCTGCTTTCTGGTTTG 

ACATCGTCCAGCAGCAGGAAGCACrrCTGGCTTGTTGATGCCTGTCTTGTG 1 1 11 AAG AT A 

7 0 CTCCTCTCAGATTTGTGTGCTTCTTCCCriTCCCACATTCTATACT 

AAGGGAGGTAGGTCAAATTTGTGGGAATCAAGCATATTTAGAAACCCTAACCTC^ 

GGATACTTAGGTTTCATGCCTGCATCTGCGTTTCCTCTCCTCTGC^ 

GAAGGTTTAGCGGTTTGCCATCTTGTCTTGCCTGCAGTTATGATGCAGCTA 

TTTG CTTTAAG CATTGAGTAT ATTT ATGTCTCAGGTTT ATG AAAGACAATT AT AAAATCT CT C CTGT AA 

7 5 TATCTTGCAGTCCTGAGTCACCCACAATTATTTIT^ 
CTAAAACTCTAGATA TTTC TTAATCCACATTTACT

TCCCCATAAAGATTATGCTCATAAGGTAGTGATGGAAAAGTAAAATAGAGCCTGAAAGCGGGAA 
CAGACAAGAGATGTTAGGTCTGCCGGCTGTTGTCATTGCAGTCTGCTGTGGCCTCAC 
CTTCACAGCTGAGTAGGAAAGGATCTTGGGATGGGTACCAGGGCAGGATCAAGG 
5 C CATATGTGGTTTGCCCAGATAAAAG CAAT AT AGT AAG ATCT ATCAGGCAT ATTCAACGTG AGTTCTT TTTT CT AGCATCTTG TTG 

GTTGTTTGTGTTTTTCC CCAGAGAAGAGTGAGTGATGAGGTGGC C CTGGGACATGGTAGATGG TGCAGTTCACTGGGGAGACATCC 
TTGGCCCCACTCAGGGTCAATATGCG CCCCACACCGCTGGCTGCAACCCCACCCTGTCCTCTTCTGCTCAAAAGAGCATG CAAGGG 
ACATTGCT AAGTGGCAATTTTGGTTG CAACC CAGTGTT CTCTT AATT ATTCACAAAAG CT AGCTCTCTTTTT AATT AC CAATTGTG 

1 0 AGAGATGTCTCCCAGGTTGAGCCTGCAGCCTGGCCCGGTCAC 
CACCCGTGGCAAGTGGCAACTTGCITTCXSCTCACACrc 

AATATT AGATT AT AG CAGAGGTT CTCAGACATTAGAGTGCTTAAAAATCAC CTGGG AAGCAGG CCAGG CGCAGAGGCT CACGC CTG 
TAATCCCIAACACTT/TGGGATGCCGAGGTGGGCXK1ATCATGAGGT 

CTCTACTAAAAATACAAAAAATTAGCTGGGTGTGGTGGTGTGCGCrGTAGTCCCAGCrA 
1 5 TTCAACCCAGGAGGCAGACGTTGCAGTGAGTGGAGATCGTGCCATTGCACT 

AAAAAAAAAAAATCACCTAGGAAGCTGTTrAAAATTTACCCTTCITGTCC^ 

ACCCTGCTTTCCAGAAGGGAGCCTGAGGGTGGGGTGCCCTGAAGACCACACrCAGGAAGAA 

AGAGTCCTGCCAAGAACTAGAATCTGCAGGTGGCTTTGGCCAG 

GAAATCACGCAGTCCCCAGGATGAAAGTGGAACTCTCGGACCAGATCCTTA 
2 0 TTGTC!AGGGAGGCAGGAGGTTCATTTTTCTCTCACA 

GAATGAATCAATCAGTTAGATTGTAAATTAAATGCACCAGATC^GCTGGTT 

ACATAATGTTAACACCAAAGCGTAATATCTGTTTTCTCTGTTAATTATGGC^ 

TTGCAGAGCTTGAAAATGTCTATGTGTTGGTTGAGATA^ 

CAGCAAGAGAATTTGGTGTTGAAGGTTTGGATAAT^ 

2 5 GCCTCATGCrACCATCCTTAGCTTCACITTGTC 

ATATGGGAGAGTCAACCGCTCAGCAGAATGTGCACAATGTAA^ 

GGCAC^CCTCAGGATGGCGCCGTC-GAGGGCAAGTGATACCTACTGAC^ 

CAGCTGGGGTTCTGTTCCTCAGTGTTGAG^CCACAGGTCTGTGTC 

CAGTGCCCCCAGCCCTTTG CAG ATGAGTCAAACATGCTTGTTTGCAGAGCCTGGTGCTGGGTGGGX3TC CTC 

3 0 TTCCTTGGTCATTTCTCTGTAA1TAATCAGCCAGTCCCT 

TTTTTTTITITTTTTTTTTTT^ 

CTCCGCCTCCCAGGTTCATGCCATTCTCCTGCCTCAGCCTCCCA^ 

TTITTGTATTTTTAGT AGAC^CGGGGTTTCAC CCTGTT AG CCAGG ATGGT CT CAAT CT CCIXIACCTTGTGATCCACCCGC CT CAGC 
CTCCCAAAGTGCTGGGATTACAAGCGTGAGCCACCACACCCGGC 

3 5 TTCACCCAAGTCTTGACAATTTTCTCAGCTTCAAG 

GGGCTCTGTGGTTTGAGCGTGATAAGCCTGGATATCTTTCTGCACTATC 
CAATTGITTGCAGGTGATCTTTTCCCAGAGAACCACTTATTTC 

ATGTGAAAGTGAGACACCTTCAATCGXrrAAGGACTGCTGGAAATGTCAACTACAGGAAGT^ 
CACCTGC CGGAG CCATGCTTCAGTCGGTCTCAAGAAAAGCAGAAGCCTGGAATCTC^ 

4 0 ATGGGTTAATAAAAAGGGATACAC^GACATAGGCTACCTCTTCTGGTT 

TATGAGATAGGAGTAATATATTTAAGAACACAGTCTTACATAGAGATAAC^ 
TTTATGGATGAGGT^GGGTCTTTTCAGGGAAGGTTGGGGAGACGGA 
CCATATCTCCGCATTCATCTATTCATTTAAATCTGTTTGCAACCCCAAAATCAATA 
TGCACAAAGCAGTGAAAAATITAAGTCACCCAACACATACGTTC 

4 5 ATAAACACATTTTCTTTTCTCTTGGCCCATTTAGTCCATGGTT^ 

ATC^CCCCAAACACAATGCTAAAGTACrTACTGTCTGGTGTCTCTAAGTGCAAGAAGGTT 
TGTGTTAGGGAAGCTACTTTCAGGCATGAGTTATTGTGCTGTTC 
AAATATCTTTAAATAGAAATACACATAAAACAAGGTTCTGTATTGATT^ 
CTAACACCATATTTCCCCAGGAGCAACTGTTCAGGATTTGCAAATTCAGCTTTC^ 

5 0 TAACAAGAAGCAACTGTATTTTGTTTCATGCATAAGATGGAAAGAGAACAGAACCAG^ 

AG AAAGAAAAGAAAGAAATG ACACT ATG AAGATTT ATTAGTT AAAATG TT ACT AGTG ATAAT AAATTAATCAG AACTTTGG CCT CT 
GCCTTTGATCCTGGTTCCCATAATCTCTAAGTGCCAGGCrCAGATAATGTGGCCCr^ 

ATTTTCTACAGGTAAACX3TXMACATGAAAG AAGCCAGGGTG AGG CATC CCT 
TCCTAACCTCTATTTTAGCTCCACX5TACCTCITCAATACCAGA 

5 5 . GCTCATAC^CTCACC^CITT1"1 M 1TGTCAACJ^^ 

GCCCAACTCCATTGC^CATGCTATATCAGTTATCIGTTGCTACATAACAGAT 

TACTTATTATTGCTCATAATTTCTGTGGGCCAAG CATTTGAGATAG CTGGGTAGG CTCAGGCCCTCTGT AGGGTTGTAGTCAGGTG 

TGGGCCAGGGCTACAGTCTCATCTGAATCCTTGATAGAGGCTGGAAGATACTAT^ 

ACTGCTGGCTATTGGCTAGAGGCCTCAGTTCCITACCACATCAGCCTCCCCAAAGAG 

6 0 TGGTTTCCCCAGAGTGAGCAATCCAAAAGCTGAGGAAAAGTGTTA^ 

TCTGCAGTCAGCCACrCTGTGTGCAAGGGATGGGAGTACC^ 

TATACATTTTAACTCTCAAACAAAATTTGAAGACTGAAGACC 

TAAACGTACTTACACTAATGGAGTGTCTATTCTAGTGG.CTT^ 

CAGGGAGCTGGGAGTGTTAGTGTCAGGTGGG CCTATAGAGCTCAGCCCCCCTTC!ATCATGTTGGCATTCATG CTGGCTGTT 

6 5 GTAGGGGATGAAATTTTCCCCTGCCAGAAGAATG 

ATCTTCCAGTCAGACTCCCCTAGAAACATCATTTCTTAATTAGCT 

GATCTATTGCTCTTCATCTATAAAGTTGCTGCCTGATAAGGTAGAACTACACAAATGAT^ 

GGCAAAGGTTGTATAGTCAGCTrTAGTCCTG<5CATAGGGGAAATGGACT 

AGTAAGTGCAAAAGGATACAGTATAATTTAAATAAAACACATGGAAGACATGJ^^ 

7 0 TCTGCAGGCTCCATCAGGGTGGCTGGCAGGGCTTCTTTC»GTG 

TCTAGATTTAACTCCTCATGGTGTATCACGTGCAA 
TTTGGTTCAGTCCAAGCCCTCACTGACTTTCCACCATCTTCA 
ACATGTGTGCACACACATGCACACACATGCATGCCTTCTTATGAAGTGTC 
TGACTTATCCTTCCCTCTTCTATTCAAAGTTCCCACTTGAGAGG 
7 5 CTGAAATTAGTTACAGTGGCTTTCX3TATACT 
AGCTGGGCCGGGTCGCCAGCGTTCCCGATTCTCACCTCTGTGTCCCTGGGTTTCCTGCTCTC^ 

GGGTACCATTCCITTCCAGCCTTGTCTTTTGTTCATTGGCAATG^ 

ATTGGTAAAAAGGCTAGAACAAGCTTATCCAACCCATGACCC^ 

AAAATTTGTAAGCrrrc: 1' i AGAACATTATGAGAiTl'C ill CCL"riV"ri"lTCiX»TAGTTCATCAGCTATCATTAGTATTAGTATATT 
5 TTATATGTGGCCCAAGACAATCCTTCTTTTATCAGTGTGGCCCAGGGAAGCCAAA 
ATGGGCACATCAGATTTGTGTCCACAACAGCTGACACAGTGGT^ 
TCTGGAGACGATGTTTGGCTGTCATGATTGGGGAATGGGGTGCTACTGGCATCT 
TCTGTACTAGATGAGACAGCCCCCGACCCTACACACATAAC^ 

CCCCGAG CTAATGTGATGAATGGTGCCAGATGGAAATGGGTGGGACCATCATAAGGAGGGGGAACAGGGTA CAACCATCTTA 
1 0 ATTTGTAAATTGAGTGTGGGGGATGTGTTTTGACCACATGCTTTTG 

GTTGGATGGATGGGGCCACCCATAGCTGCATTCATGGACAGCTGGACACGGGCCCAG 

G CACTTTT AAG AGGAG CTTGGGTGAT ATGAAATTGTTTGGACAAAATG AGTG AG AGG TCTTGAAA CAAGGCAGG AAG AAT 

GCATTGACTTGGAGTGGCTGCCAAGAGCAAGCATTTGCAGGT^ 

ACAGTTGGAAGACGGGAAATGGAGGAGGAGAGATGGGGAAGGAGAGATGGGGA^ 
1 5 CGGCACACATTGGAGTGAGTTCCCTTCCCTTAAGA 

GGAGCATCTCGCAGATTAGTTGCCAGGGTTCTCTGATTCTGGCCCTGTT 

ATTTTTATTACAAAAGCAACATAC^TTTTGACCCTTGAACG^^ 

TTCAACAAAAATATTGGAAAAAAGTTTTGAAGGTTTTTGACAATTT^ 

AAATGTTGGCCGGGOSCTGTGGCTCACGCCTCrAATCCCAGCACT^ 
2 0 AGATCATCCIGGCTAACAAGGTGAAACCCCGTCTCTACTAAACATACAAA 

AGCTACTTGGGAGGCTGAGGCAGGAGAACAGCGTG 

GCCTGGGCAACAGAGCGAGACTTCATCTCAAAAAAAAAAAAAAAAAGTTAAGAA 

ATGT AG TCT ATTTTTTCATTT ACT AC CAT AAAATG TGT ACAACT CT ATT ATT AAAATT AAAATTTATCAAAACTT ACACATAAAT A 
CTTATATGCCATTCACAGTCGAGGAAAATGTAAATAAATATAAA^ 

2 5 CAC!ACTGTACTG CTGT AAT AATTTCAT AG C CACTTCCTGTTG CT ATTGTGATG AGCT CGAG AGTTG CAAGTACCTG CCT AAATCAC 

TGTGTGATGCTCATCATCTCTGTGTGAGCAGTTCATCT 
CTCTTGCAGTTCTCAAGTATTTTTC^TAATGTTTAGTG 
CAGTGAGGCTGGAAGTGCTCCCAAGAAGTCIATGACATTATAAGAAAAAGCTG 

AGCCTATGGTTGC CTG CCATTT CAGATGG ATGAGTCATCTTGT AAACAGATGACAGT AAATTT ATGG T ATTG A^ 

ATACATATAACATGC!AAAATATGTGTTAATCAGTTATTTATG1TATAGGTAAGCCT 
TTGGGGGAGTCAAAAGTTACACATGAATTTTTACT 
CACTGTGGAAATCCTAAAAAGCAAAGAGCAACATGTTTCCATCA^ 
CAAGGCAGTGAAATCTCCAGACCTTTGG^GCTCAC^TTTTAGGGAAAGG 

3 5 TTTGAGAAGTTAACTATGTGCGGGGGTGGCCAAGGTAGGCCGAAGCAGCACCTGAGACTGGA 

AAAGGCCTGAGAAGGAGGAAGCCAAGCAGGAGGCAGGAGAGGCCTCA^ 
AAGGTGAGGTTGCAGGTCrCC^GCCTGGTCAAGGAGAGCCTTGTGGAGGAGCAATGA^ 
TTCCTTGTGAGCCTCZACTCCACCTTCAGGGTAGGGAATTGTCAGTAAGGAAA 
ATGGCXMTGGCTTCCACGAGGGCAGCCTCAGGAGCTATGGGGAGACATGATGGCT^ 

4 0 AGGACCTGGAGACCAGCTCATCTAACCTGCTTCAAGATTCTCCATCCCTGTGACT 

TCATGGCTTCCCTGCATCACTGTGAGTGGCCAATGCTTCTGGTAGAAAGCACAl^ 
GAAAGCAGCCAGGGTCTCATCCTCAGCAACTGTGGTGCTCAATGTC^TTTCCT 

GAT AAG CAGACATGCAGAT CAGG AG CT CAGTGGAGAG AAAT AAATAC C CATT AAC C AACAAAT ATTTTTG ATG ACTT ATTCTGCAC 

4 5 TAGGAGCTGTTCCAGGCATGGTGATATGGCAGTGAACAAGGTCATCCTAT^ 

CAAGACAATACTCAAATTCAACAGTGAATCATGATTTAAA 

GAAGGGAGGTATGCGGCCCAGGACATTACAGAAGATTATTTCATGTCAAAAAAC^ 

TGTCACAAAAAAGGTCTCAAAAGAGCAAGGGATTTGCAGTGTTTTCAGGAGCT 

ACTACCTGCCCAGGTGTTCACTGAGGTGCCAGACCCAGCAGGCTGAGGGGTAC^ 

5 0 TTGTTTCTCAGTTTCGTGCTTTGGAATGAAGTAGTGTCACACATGGAAAAAAATA^ 

AGCAAGTTCTAAACACTATTTGGAAGAAAATGATCCCTGGGAATGCTAAAGA^ 
TTAAGCATTGTGAGTTGGCAAAAGCATCAGGCGGGAAGGACTGAGTCTGTGTGTG 

GATTTGCCATACTGACTGAAAGCTGATATCTGAAAAGTGGAGCTGAGAACAGCAGCCT 

5 5 AGATGGCCATTTCCTGCCAAGGTGGCCTGAGAAAGAGGAGGTTGGAG 

TCCTGAGAAGGTATGCACAGGAACACCATCGACATTGTTTATTCCCCCAA 
CAGTGAACCAGTGATTCTGCCCAAACC^TCACCACCAACATACCCAGGT 
CCATCAGCCACATCGGTGCTGACACTGAGTGCAGCACA1TCTTCTGCCAC 
CACACAGAGTGTCCCTGGCGTGTGCCTGGAGGTTGCTC 

6 0 CTTTCCATTCACTTCAATATGGGATTCATGCCCAAGTC 

GTAGCTATGTTAAGATTTATATGAATTAAAAAGTATTTGCTCT 
TTGAGCTGATGGTGCAGTCTGCTGTTCTGCAACTAC!ATAAAAAGTAACCrrGGGCCCT 
TACACAGTGACIAGGGGTATTTGAGCTGGGTCTGTAAGGATGACTGAGATC 
CACGTGGTGAGAACAAGAAGGGAAAGCCACAGCATCCCGGTGCAGTCCAGTGTGG 

6 5 AAGGGCTGATGTGCGCTTTGAGGGAGGACCGGGCAG CTGGAGAGGGAGGCGTCAGGAAG CAGGAGGGCAGAAGGTGAGG AGTG AAG 

GG CAG CAGTGG AAAGG ACAGGGTT ACG C CAAGCAG TGG TCATGGAT ATGGGCAGT AAGGAAC ATG CACCAAGGG CG C CTGGAAAAC 
GAACCTTAGAGAACGCAAGTCCTGCGGAGGACTCCTGGGTTCCrGGCTG^ 
AACCTGGTCTCAGGTGGCATCAGAACCCAGCATCCGCATCACO^ATTGCAG 
AGCCCCCAGGCCACGTGGAAGTATGAGACGTGCTGGGCTGCA 

7 0 GCAGGAAAAGGGTGGAGTGGGGGAAAAGCAGGCTGGGGGCAGTGGCTGG 

ACCGCGCCCGGCCAAGG CGGGCAGATCATGAGGTCAGGAGATCGCGACCATCCTGGCT AATGCGGTGAAACCC 
AAAAAAAAAAAAAAAAAAACTGTAGAAAGGGGAAAGCAGTCCAAAGAGA 

TCCAGAGAT G T CTTTCTGTTAACAG CATTCTT AGACTCAAAAGT AAATGT AG ATAT ACTTT AAAAAAC^ CTTGCTAAGA 
GATGAATTTTTAGAAACTTGTTGGACAAAGAAAAGTCATTGAT 
7 5 rrrrCAA ATTATAATAAAACCCATGCCTCATTAGGGGCTCGCCCAGGAGTACTGACTTGTC 
TGTCGCCTGTGTCAGGCTGACGCTCTTACATTTTATAGCCCCTTGTAGGATAGTGCAT 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TTTG CTTCTXIAAATGT ATTTATTT ACTGCAAG CAGTGTCT CT CCTGTGT CAGATGTCCT CGTGGG CTGGGAAC CGGCACTATCTCT 

TTAGCACGGGCCCTGCAGATGGGAAGTGGCGTTGCCCAGGAAGTGGTCTGG^ 

CTOSGGAGCCAGGGAGGGTGTCACTCTTCATGTCTCAGG 

TATGGTGACCACAAGTCACAGTTCCGTGTGGGCCCTCAACGTGGCCTCACTTCACCT 

TG CTCGAGG CAGCGTCTCGCAGCTCAGGTGGGTGAGGTGGTGGTAGCGTCCT CCTTGGTAATATCACTGTGATGAGG AGCAAGGCT 

TCTCTAACATTGTCTGGACTCAGGAGGAGCAGTTGGTTTGTTCTCATC 

TCAGCCCAAGAGTCCTGTTTTGCAATAGCCCTrCTGTTTGCTTCTC^ 

ACT C CCC C CAAAAAAGAATTATTT AATTTGGT CACTTC CATCAAATG CT CTG ATTTT AG CTT AGTT AG AAGG CATGCTG AAAACTG 

TAATGATGATGTGAAC^VGAAAGACAACTGTGTGTTGGGA 

CCACTTTGAAAGGGTTTGAGTGGTTTTAGCAAACGT^ 

GGTGGACACCTGCGCAGGTGTGTGCCCTCCGGCCCCTGAAGCATGGCCAGCAGCG^ 

CTTTTTCGGCAACATCACCCGGGAGGAGGCA^ 

GCCGCAACTACCTGGGTGGCTTCGCCCTGTCCGTGGCCCACGGG^ 

ACCTACGCCATCGCCGGTGGCAGGACCCATGCCAGCCCCGCCGACCTCTGCCACTACCACTCCCAG 

CCTCCTCAAGAAGCCCTTCAACCGGCCCCAAGGGGTGCAGCCCAAGACTGGGCCCTT^ 

AATATGTGAAGCAGACATGGAACCTGCA3GTGGGCCACAGCT 

CTTCCTGGCTACATGCAACACATGTGCCTATGTG CCCTGTGTGCCCAAAT AACACAAC CATCTCCTTAAG CATCAATTTTGGCCAA 

AAAATGCCTCTCTCTX3ATAAGAGCCTTTCCTCTTTCACAGCCA 

TCCACAGCAGGGAACCACTGTTACCCAGTGCAAGGAGGGCCITTrCAGAGA^ 

GACAAGGTCCTACCATACAC^CTGATAGGCACTTGATTTTATGGCTTCCT 

ATAGTTGCATCACCCGTTCTTTGGCTATCTGGTATCACCTGCTATGCACCACAATTTATT^ 

TTGCTGTTTATGAATAAACGCTGCTAACAACATTGCTCTAGACCTA 

TTGAAAGGGTTGGGATTGCGACAGCAAAGGATATGTGTGTT^ 

ACCCAGGCTGGAGTGCAGTGGCIATGATCTCAGCTCAATGCAACCT 

AAGTAGCTGGGATTACAGGCATGTGCCACCACACCTGGCTAATTTTTGTATTTTTAGTA^ 

GCTGGTCTCGAACTCCTGACCTCAAGTGAACCATCCTCCTTGGCC^^ 

GCCATGTTA'ITTTTATTTATACACrTGAAAGTA 

ATAAGTTTTGATCCTGTTTTACCATGCCAGGACTCGAGATAGATTGGAAATG 

TTTACTCTGCTTTGCTCTCCATGTGGCAGGGT CAGG CTCTGGAGCAGGCCATCATCAGTCAG AAGCCTCAGCTGGAGAAGCTGATC 

GCTACCACAGCCCATGAAAAAATGCCTTGGTTCCATGGAAAAATCTCTCGGGAAGAATCTGAG 

GACAAATGGAAAGTTCCTGTGAGTATCGTGCCTTCCCCCTCACCTCCTGCCACCAGGCCTGTGTGGA 

TTCCTTGTGACTGGCCCCACATGACCCTGGGAAGAGGCCATGCATTGCCATTTGACA 

TTCTAACCAAAGCTATCACTTGTGACCCAGTCTTTATTTTGCCCAGTCTAACTCT 

TGTAATATTTAAAGAAAATGTGATTCAGCCCCATATTCTCCT^ 

CAACTAAAATTAAAATACTGCAGTl-ri'TTTACrCAAACTACCCCCTTGATCT 

CCAACAGGAAAGATGTTAAGGAAAAGTTCCTGGGAGTAGC^TCCATGAGTGG 

ACAGTCATTAGAGGGTCTtSTCTGAATCCTGGGTTCTGTTTGCTATCCTCAGTGG 

AGATACAGGCCAACGTGGTCAGTCAGTTGTCCAGAGAAGGTAAAGAACTGTCTTTGGGA^ 

CTTGGAAACATAACCAGAAACAGCGATGCAGCCTCATGGTTTGGCAATTGCT 
TTTAGAAGCGTTTTCTCTGCCTGTCCATCTATTCAAAGGAATT^ 

TAGCTGAGTGCAGTGGCACGCACCTGTAGTCCCAGCTACTCCTGGCAGGAGGATCGCTTCATCCT 

TGAGGCTAGTTTGAGACCAGCCTGGGCAACATAGTGAGGCCCTGT 

CTCTTAAAAAAAAAAAAAAAAAAGGAAGAAGGAAGAGTGGCCCAGGTT^ 

ATGCAGCCCGA.GCCAGCAGCAGAATGCTGTGGGGAGGGAAGACATTGCCCCTCA 

TTGGCTTCTTACCATCTCAGCAATAATTTCCAACATG^ 

AAATAGCATAATCCACACGATGAGGCTGGCCAACTTCTGCTXjGTTAAGAAGC 

TTTTTTCGGAGTTGTCTATTCCCGAAAACAGCAGTTGC^ 

T ATATAAAC CCCATGCAG AAGG CTGGG AGG TG CAG CT CTTGTTTGATG AACATG C CTTGTTC CCT AT CTGCC CTGTACAGACTG AC 

ATCTCATGTGTCCTCACAGCACATGTCCCTGACACTGAGGCTTATGGTTTATGC^ 

CAGCTGGTGTCAGAGCTAACCAGGCATGGGCCTGTGAGAGAGGGGC^ 

GTCTTCTTCTCCrGGGTTCTTATTCCrGGGTTCTTATTCT 

CCATAATTGCCAACCTCCCAGATAATAGCTTCCAAGGCTTTCTCCC^ 

CCTCCACACCTGCAGGG CTCACCTXaTGGACTGATGGTGCATAG CACCCCATG CCCAGGCCTGG CATCAGGCAGGGTATT AGTGAGA 

GGG AAGAGCAGAGCTAAGCCTGGGCAGCAAGACCCAGCTTCA^ 

AACTTTCCCCAGGCTCrGCrCCrCTGGGTCTG 

GTGGTCATCACTGATGTAGCITTTCTTCTCTCTTCAGTGCT^ 

TTATGGTGTTTTTAACAGAGAAAGTGTGAGGAAACACCTTGGAGGGGTTTGCC^ 

CTCCTACAGAACCTGGCTGGGGTTCrrTGGCrcTTGCTGGGTGTGTA 

AGTGTTTGAATATGGTTGTGTATG CTTGTGGAGAG CAGGGAGAGTTTGCCAGCATACTGGATACAGTCCAAGTCTACCATCCTTCC 

AGCACGGGGCAGGGTCTGCATCCCTGAATACAACGTCTCCCAAGTCCA.GGTT 

AAGTGTTTGTTTGTGTGTTTCTTTCATGGGAAATAGAGCTAAAAGGCT 

TGCCCTCTCAGTCTCAGTGCGTCAGCTCCTGATTCACTC 

GAAGTCCCAGACCTGGTTTTGAATCATCCAAATCTGCCCT^ 

CACAGCACAAGGT CTTTTGG AGTCAGAACT AG ATTC CAGT CACTG CTGG ATGT CCTCACTGAGCCAGCTG CTG TTTTTTTCCCACTG 
TTCTTCTCGGGCCTTGATTCAGATTCTCCTAACGAATGTGCT 
CCAACATTACTGGAGTCTGTAGTTGCATGAAGCACTAAGTGCCAGGCACCAT^ 
GCACCTGCCCTCATGGACCCTGCATTTCAGTGGGAGA 




'GAGGGAGGCCAi x GAGAAGAv^GCv- r 






WO 03/008583 PCT/US01/51291 



GAAGGCAGCGAGGAACCATCCCCACCCAGGGAATGGCAAATTCGAATGCC 
CAGAGGCCAGTGTCACTGGCAGAGAGTGAGTXK3AGAAGGGTC 
CCTGCTTTGGGCAAGTTGGGGGGGCTGTCTTTAGGAAGCAAAGGGCCAGGG 
CATCATCGGTTTGCTGTGGGGAGTTGTGCCTGTACATATACTTC 
5 TCACTTTCTCATGCCTGAGCATCAGTCAGTTTGTCTCCGCrrCTGATCT 
GTACAGGCCTGTGGGCTGGCCCTTAAGAGGTTTTCAA 
TCCCAGCTCGTTTTTGCTTTCCACTTCCTGACATGTAGTAATACCACT 
TGGGGAGGGAATGGGGATTCTTGCACCTCTTTCrACCTCCTCTT 

AT AGAAATAT AAAAACATG ACACTG CGCACTCT ATGGTTTGTAAGATT CACTTTCT CTGGGT AGGAGGCAGTGTGG CCTGTT AACA 
1 0 ACATACCATTCGGK5TTGTCTGGCCTCAAATGCTGGCTCCACCTTAT 
TTTCCTTCATTTCTTTGTCTGTAAAATGGCT 

AAGGCCTGCTGCA.CGGTGCTTGGCACCGAACGAAGCAGCTTCACCCCACT 

AACATACACTGAAGGTCAGGTCCTCCATGCAAACTCTTTCGOT 

CTTCTGCAGGTGAGAAAACCAGGCAGGTCCCCCrTATCCriTCCCTTCrCTTCT'r 
1 5 GTGGCTATGAATAGATAGAGTGAGAGTCACTGGCCATAAAAGGATTCACACTAG^ 

AAAG AAT AATT ATTTCGG C CGGGCATGGTGGCTCATGCCTGT AATC C CAG CACTTTGGGAGG CTGAGGCGGGCAAT CACAAGGTCT 

AGAGATCAAGACCATCCrGGCCAACATGGTGAAACCCCATCTCTACTAACAGTATAAAAA 

GTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCTCTTGAACCCAG 

GCACTCCAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAATATATATATTAT^ 
2 0 GTTTATTTTTGCATTGCAAGCCATAAACTAATCAAGAGGATTT^ 

TACT CTTCACACTTTCATCGTGAGTACCAATATGCTGTCTCCACCATT CAG 

GAAAGTGGCATTGGTAAGAATTGGACTAAAAGGTAAAAAAA 

AAAATGCCTTTGCTCAAACAGACCTTGAAAACACTGAGGGTAAGGCAGCATATTTT 
TAATGCAGTTCAAAATGCAGATGATGGAAGCAGATTACGTCTGGGAGATG 

2 5 CTATATGTGAAATCCACTTACAGTGAGAGTACACCTGAGCT 

ACAAGAACTCGGGTGTGGAGGAGCTTGTCCAAGGCAGGAAGGCAAAACCAGGTCCCTGAT^ 
AGCCAGC^CCAAATCCATCTCCAGAT<^TCCATCTCCACAACCACAGGCCCCTC 
GCATGTCCATGTGAAATCTCAGAACACACAGAGGCGTCCATCAGCTTTCCCCTCT 
CATTCIAGGAGTGACTTTGAGAAGGTAGTTACAGGGCCTTTC 

3 0 GAGGGGCTGGGCAGTGTGGGTCAGTGGTTGTCCCTCACAGCCGCCTGTGGCATTTCT 

TATATAAGCACTGCCCITGTCTGGGGTCTAGCTGGCC^^ 

GGGTCAGAGTGCTGCCTTGTTGGTGACGTCAGGGACCAAGGCAAGGGAG^ 

GGCCTTGTATAGCAGCACAGTTTGCTATTTTGAGAAGCTAGTCA 

AGCTTACTTTATAAAAATTATATATCCCTTCCCCAAAAGGGAGGATCCAGGATTGAGGGGTCCX3AAGC^ 

3 5 CTCTTATTTTTTTAAAAATATAAAATTAGGCATACMTAGAATGTATTT1TTG 

TTTTAAAGGCTGGTCAATACTACAAGCATACAAACTTTAGAAAAGATAATGTAATATTTT 
ACACCTTTTGTTTCCTCCATGTCC1TGACTTCACTGTGTTTGACT 
TATAGAAAAAGAGACTATTCATTCCTrCCTCTAGGTTCACACrrCATTAT^ 
ATTTGGGAGGGGAAGATAAAGTAGAACAGAGTGAGCTTCCCTGGCTGTGA 

4 0 C<XTGCAGT<^CATTGCTGGGCTCCTCTTGGTCT 

ATGTGGTTTATTGCCATTGTGCACAGCCAATTCTCCCA 

TTATAAATGATCTTAGAATGGAAATGCTCTAAAAAGTCCATGCCAGGCAAAATTT^ 
GTGTATTGTTTTTGGCTAAGAGTCCTTGTAAATCTTA 
CTCTCCAAATAGCTCGATTAATGTTAAAAATAAAAGGTAGTAT 
45 GCTATTTTAACTACAGTTTCTACCTTCTGACTTTAGAAAAAAAAG 
CAACCAGACATGCAGCAGGCATTGTGATAGGACTTGGCTACTGTGA 
GACTGCAGGGGGAAGATGACAAACGCAAGC1X3ATGTGCTGCAAAGTCTGCAATC 
GGCCTACGCAGCATGATCTTGATTTTGTTTTCATTTACTTC 
ATGGGGCTGTTGCCITCACGAGTGCAACCCZAGCACCrTAGAACA 

5 0 GCTGCTAACTTGAATGGTGGCCACTACTTGGATAGGAGGAGGGAC1AACATGGTTCATC 

TATC^TCTAGTGTGTCGCTAACTATAGGAACTGGGAAGTOTC 
TCACCAGGCIAGATGCAACAAAAGAGCAGGGCAGACCCAGCCATCCCA 
TATAAAGACACATGC2ACACGTATGTTTATTGCGGCACTATTCACAATAGCAAAGACT 
AGACTGGATTAAGAAAATGTGGCACATATACACCATGGAATACTATGCA^ 
55 CACGGATGAAATTGGAAATCATCATTCTCAGTAAACTATCGCAAGGACAAAAAAACA 
AACTGAACATTCAGAACACATGGACACAGGAAGGGGAACATCACA 
GCTITAGGAGATATACCTAATGCTAAATGACAAGTTAATGGGTGCAGCACACCAGC^ 

GCACATTGTGCACATGTACCCTAAAACTTAAAGTATAATAATAATAAAATAAAATAAAATAAAAAAGAAAAAAAAGA 
AGCAAGAGATTCACGGGGTAGCATGGGTATGGTCGCCAGGGCAGGCCTGGATGCC^ 

6 0 TGCCGAGTGGGTGTGCA.GGAGCCCGGGCATCTACAGAACCCCAGTCCGAGCCCCAGGAGGCAGCACCAT 

CCCTGAGACCATTTTGGTGCCCTGAGATTCTATG^^ 

ATTGGCACGTGGCTGCCCCAGGCCACACCATGAGGTGTTGAGAGAGATCCTGGCTGCT 
AGGGGCGGCTG CTCTGACGGCTCACAGCAGGAG AGGCACCTGGAGAGCTGGG CTGTCATCA 
AGAGAAACTGAGTCAGAGAGGAAGCCCIAGGAGGTTGTGAGCCTCAGCGATCT 

6 5 ATGGGAAGCCTGCAGGCTTGGTGGGTCCTAAAGTTGCAAGAACATCACTGTCCCCTGCAGCTAAGTG 

AGGCCACAGTTGCTGTACATGATCTCTTGGGACCCTAGA^ 

CCCAGGAAGCCCITGTTCCAAGTATGCAGGATCAGATAAAAAGCGCAGAAACCCTCAGA 

CCCTCAGAGGCCTTGGTAGTCACCCCTTCCCAACCCCCATCCCACCACTGAATCCCCATC 

TGAAATTC^GGTGCCCTCCTGACCrTCCTTCCCCTGTCCATTTT 

CCATCCCCTCACTTCTTCCCCCTACAGCCCGCCTCrCTCTCTCTCA 

CACTTGAGAACCGCGGCTGCATATC^TCTCCTGTGGACCTGTGCACA 

GTGGTTTATCCTAATTTAGGTTTGTTrCCTGGTCATATAG 

AATGGGATATACGTGAATAAAATTTTTATGATTCCCTTATACTGAGTCCCACTTGCACT 

7 5 AAGCAAGGCGTCACCTGCCATGC^AAGCCACCTGCTGGGGAGGG 
TTCAAGGTGACAGGTATGTTATCTGGTGTCCCCAGGCrcGGTCCCACCCAGGGCCTGCTCCTG 

TTGCACAC C CCTG ATTCTGGTGGACACAACCCTTTTTCTTAG AAAG CCA^ T ATCTTCT AGT CCTTAGCTG CAT AAAAA 

AATGGGGTGCAAAACTTCTCTGGACTGTCTATTCTTATTGTTCATTGG 

GATGCACAAATGAGCTATTGAGTGAATATISTGTGTGTGTG^ 

GTAAGGACrCTTCCATTTGCATTCCTGCCAAGTTCTCAA^ 
GATACCTGTTACCTACCTX3GGCTGGATTAAOVACTATGATCATTTC 
CAGTTGCAATTCCTGCAAAATGCCCAAATACTGTATTATTGGACTGGT^ 
AAACTTAAGTTAGATAGTAAAGAAATTCAGGCCATGAGTCAGACAGGCCCT 
1 0 TTTGCTCTCTGAGCCTCTGTTTCCTCAGT 

GTGTATAAAGCTAAGCCAATGGTTCTCATGTCTCTGTGGTTCCTACAT 
TTGCTACTTTTTTCTTCAAATATTGCTCTCTGTTGTTCT 

CTCAATTCATCCTTTATATTTTTTATTTTCCCTTGCCTTTT CTATG AGTAACGTCGTTG AATT ATTCTTCCAG C 

GCACTAATTTTTCCTTCATTTGTGTCCAATCTGCrCTTACGCTCATTC^ 
1 5 GCAGTTCTGTTTGTTTGTTTGTTTGTTTC 

GGTTCCGTTTTTTATTTTGTTAAACATTTCATAGTTTATATTCTATATCT 

TCTCTCTCTCTTTCCCTCCCTACCCCTTTrCTCATAACTCATC 

GATATTAATCTGTCGGAATCCTGAGTATTCGATGCTGGGAACACTTTCCT 



2 0 CATTGGCCCTTTGGCCAAACCCACTTTAATTGCTCTCATTTGCCACTC^ 
ATACTTCAATTTCAAAAGTGAATTTGCGGCTTAGGAAAAACCTTTTCCT^ 
AGCCTCCTTTGCCACAACAAAAACTCTGTTCCATCAGTGCTTTCA 
ACATGTGGTTTTGCAGACCTAGGTCATCATAAAGAGTCATGACCTGTCTGAAG 

AAT ATCACTTCATTGATG CTG ATCAAAGT CT CCTTG AAGCTGAAAG ACAAGATG CGTT AAAT AATT CCACCTTCATGAACT AAAGC 

2 5 AGAGGGAGAGAATGAGAGGGCGTTTGTAATAAAATGTTTTAAACGACAGGTGTC 

TTGTGTACTGATCTTGCTCriTCTTTGTGTGGAATGCACATGTGCCTCA^ 
CATTATCTATATCTTTCATGGATCAGTAAAGCTGGCATTTCTCCTGCCTCTTGC 
CCCACATCCTCCTCTTCCCCTCTCCCAGCTCTCITTGAGAAACCACTGTTG^ 
CAGCACAGGCCCCCTCTGTCCATCTGCCCAGCCCACCTTAGCCACAGTG 

3 0 T CC CAGG CATCTTCCTTC CCCTTGTATGAGTCTG ACAACTGACC CGGT GAT ACT CTACTCTG CCATGG AAAAAAAATGGCAGGGGA 

GAATGAGAACACCTAGTGGGCTGGAATGTGCCCC^CITTTCTGTTCCGTCTACA 
GAGGCGACCTTGAGTGCCTCACTACTTCTTACATCCTCACTTTCCCCTTG 
TAGGAGGGTCCAGGAGAGTAGAGGAGTCCAGCCCTTCCAGGCCATGCCCCAGCAGAGTCAAC^ 
TAGTGTCTGTGAGGCACAAGGCCTGCTCACCCC^VGAGGCCAAGG 

3 5 GTCTCAAGACCAGTTAGGACAGCTTCTCAGCATCATCTCAAGCrCAA 

TCTGTTAACTTACTCCTCCCAAGACAATTACTGGGAAAAAAT^ 
ATGACCCTGAAGGCGGAAGGCAGCAAGCCAGACAGGAGCCAGGAGCCI^ 

ATAAGCCTGGGTGGGTGTCCTGATG CCACCACCAGGGCAAAG CT CTCAGT AAGAGGACACAGCCAAGAGGAGGAGCAAGCCTTCCA 
AAT CAGTTTGTCTCCTATGTTT AATT AAAAAT AG CCTGGTCCAG GAT CTG C CTG CAGCCAACAGAT AC CTCAC^ 
40 TAATGTCGCGAGCCTTCTGGTCTTTGC 

CCTTTG CAGAAACTGCTTG 

TTC CAAAG CAAAGGG CACTCT CTG TTTC CT CTGGGTTGCTCTCT ATTTCCT CCTGGT AC CAG CTCT CTTTTT AT ACTG CTTTT ATT 
GTGAAATATAAAACAAACACATAAGAGTGCATAACTAAATAGGTACAACT'TAATGAATC 

4 5 GAAATGAAATGCACCCAGCATTGCAGGAGACCACTCCGCCCTCACACACTCATACCCATCT 

CACCTCCTGAACTCTTGTGTTCATCACXSCCCTTGC^ 

TACATCACTAGATAAAACAGGTGG CT'IT TG T CTG TTTGG GGACTTTG TGT AT ATGGT AG CATGGAGTGAGTTTTATGTTTGTGAGA 
CTCATCTGTG CTTTGT AT AGCT ACCATTTG CT CATTTT ATTG CTTT AGAAT ACAGT ATTGTGTG AATTCCT ACAATTT ACTTGCT 
TGTGGTCACTTACGTTGTGTGCAGTTTTTGATAAGCA.C^ 

5 0 ATACACCTAGGGGTGGAATTGCTCATTCACGAGGTCTGTGAATGAATAAGTTCAGTT^ 

AGTGGGTGAGCCCATTTATGCTCCCACAAGCAACGTAGGAGGGGCCTTCCT 
CTAACTTCCACCAGTCTGAAGAGCGTGGAATGGTACTTCCCTGGGGTCTTCATT 
CATCATCTCTCAGGGAAACCTGTGCTTGTCTGTCTCAGTT^ 
GCAGATTGGACATCGATTCAGAAACCAGAGTTCAGATCCCATTTATTACT 

5 5 GTTGCCCAGGCTGX3AGTGCCATGGTGCAGTCTTGGCTCACTGC^ 

CCCAAAGTGCTAGGATTACAGATGTGTGCCACTCCACCTGGTTCAGATCCCATGTATTATCTCCATGACCT 
AAAAGATTAAATTATAGGAAATACTTTGGAGTGAC^TATTC^ 

CTTACTGCTGTAGTATTTGGATAGCTCTCTATACCCTCAAACTATAAGCCACTGATATGTATAGA^ 
TTATAAGATACTTTCCCTATAAAACAGACTCAGTTCTCTCTTTATATGA 

6 0 GTTCTATTTAACAGTCACCAACTGCTGGCTGTAGTCGGCCATGTGGCTGTGAAAAGG^ 

GTTGGTTTGGGAGGAAGAGGTTGTTCTCACAGTCGGGAG 

TCCTGTACCCTGGCCAGCTGCCCTGAGCTGTGGGGTGCTGTGCIATGGCCAAGCACCGCTAGAAAA 

TCCrCCCATGCCATCGTTCAGCTTTCAAGTCTTC^3GTTAGCTTTTTG 

TGTCCAGTTCTGAAGGCCATCTCAACTCrCCCTCTGGTCGTGTATCTGACT^ 

6 5 AGTACCCZAGACGTCCCAAGGCAGTTGC^CACACTTGAACTCTC 

ACCTCAGCTTCCTCATTTTAGAAAATGCAGGGGCCAAAGAGGCTTCCTGGGACTTCATGACT 

GAACCAAAGGGCTGCA.GCTGTGGAGAGGGGAGAATGGGGCAGTGTGTGTGACTACGAAGCTAAA 

GTGCCITTTCAGCCTCCACTGGGACCTCGAACTCCCCAAGTCCC^ 

G CCTTACGTGGCATG CACrGCAGTGGGGGTGTTGTTCCCTGCCTGG CATCAGACTCCrGTGTTTCTGTTCACCAGGACCCAGGGCA 

7 0 GGGC^TTGTTCTGAGGGTCGCCCAGCATTACCATGAAGACAGAGCCT 

CTGTCTCTTCACCCCACAGTGGATGTCATTTTCTT 
AGCCCTGCCTCCCTTTCCTCCATTGGGAGCTTGGAAGTCCAGCCCT 

G CCTC C CTTTCCTGCAGTGGG AGCATGGG AGGGACTGAGGAAGGC C CC^GGGATG CCTCCATCTCTGGT CCA 

GAAGATGGCTGGTTAAAATGCCCACCAAACACCCCCACCCACACTCCAAGCTAG^ 
75 juvtgCACh^CCtGgtGaa 
AAGCGTCACTCCTACTGAGCACCAGGCTCTCCTGAC 

TCACCAACCTGCTAGCATTTCCTTGTTGCAAGCTTGCCCACTGGCATCATCCCTGA^ 

AAGCCCTTGACCCTGAGGCTCCAAACTCTGCCCTCCAGCTGCCTTGCCAGCT 

GCCAGGGTGCTTCCTCCCAGGGTCCCACCTGGAGACTGGCGGGCCTGGGGATGATGC^ 
5 CAGAGACAACAACGGCTCCTACGCCCTGTGCCTGCTGCACGAAGGGA 

AGCTCTCCATCCCCGAGGGAAAGAAGTTCGACACGCTCTGGCAGGTACCC^ 

CACGTGGGGCTCCTTGTCCCATt3GACTCTTA 

CAGCTCTGGAGCTCATGAGAAACACACACCTGGACCACA 

AGGTGGCTTCTGTGTTTGAGAAGCACTGGTCCAGAGGC 
1 0 GCCCTCAGTGCCACAAGGTGATGTAAGTGCAGGGCTGTGTTCAGAGTT^ 

TTCCTAGGATATTCTAACTAGAGCCCAGAACAGCCTTTCAGACAAAG 

CTAGGAACAGCCAAGCAGGACTrTGTTGGTGGTGTGAGAGGCTG 

AGTCTCTCTCTGTGGAAACTAAGGCAATTTAGGAAGCATACTCCCTGGGCT 

GTCTAAAAAGTGCTTATTCCTGACCCAGCTACTAGAAAACCCTGAAACAACT 
1 5 CT AAAATAATG CTCATCACTG CTTTCCCTGAATCG CAGTGAGCACAGA , l ,, l'GC'I"lXiT CTT AATTTTG CATCACAG CT ACACTGG CTG 

ATGATGAAGGAAATAACCGTAACAGTACTTCGTGCATCATCTTCGCAGAGCATCCT 

AAAGCCAACCGACTCTGCCACCACCCCACCGCCCCAGA^ 

GTTCCAATGTCCTCACCCAAGCCCAGCTCGGGCCACACATGCAGTCAATGTTGAGG 

AAAAACTGAG ACTACAG AGC CTTAGCCCCTTTAAAGCACTTAAACTT ACTACTT CCAAATGTG ATTTGCTTTCTAGTGCTGT AATT 
2 0 GGAAAAAAAAAAT ACTCCI T TTT T G AATG CTGGATATGATCATATTAAAAAATGTT AG TCACAACATTT ACATTGTCTTTTG ATTT 
TGTGGCTCATTGCCATTTGTCATCATTAACCACAGGCTTCTTTCTTT^ 
CTCAAAGCCCCAAAGCACATCAGGGAGGTGAGTAGGGAATGTGGCCT 
TAATGCTCCAAGGTCCCAGGCCCCATCACTTATAAACAGCCATTCATTCACCTTCT 
CTGTC^CCATTTCAAAGGCTGAGCACTCACCATAGTGTTAAGCATCTCTAATTATTG 

2 5 TTGTATTGATTCCCCTGGTAGATCATAGCATTCATCCCTAATTAAATATTCCTCATTGTATATTTT 

ACTACGTCAAAAATGCCAGCAATACGTTCCAATTTCCCATGCA 

TACCCTACCCAACTCCAGGGCCCCAGTGCTATTTCACGGAGAGTTGGTGATTGCCATAATACCCT 

GACCAGGGATCTGCAAACCCACAGACCAACTCTGGCTCTCAGCCTGTT^ 

AAAGX3TGTTGTTTTGGATAAATAAATAAATAAACAAAGGACACAGATACGG 

3 0 GGCCCTTTACAGAAAAAATTTGCTGCCCCrAGCCCAAGA 
GTCTATTTCCTGACCTTGGTATTTACCAC1TTCTTAC 

TGTTAAGAGTTCITACTGTCCCATGTCAAAAAATCGGCACACA^ 

AGTGACAATCAGCCTCATTTTAGACTTAGCTGATTGCAGATTTGGGTTGGATGA 

ATCAGAGCACTCCGATTTTGTATATCGTGAATCAAAAAATAGCCAGTTGCTTTAA 

TTGCATGCGGGAAAAAGTAAAAGAGGGTCGAAAGAAGTACCTGCAGAAAGCGTGCT 

CATTGATTGATCTGCAGTGGGGAGGGGAAGGAAGTGGTTTCCTCAGATAGAGCAGTAATTGTCCATGCT 

GTTAATTTTGGAGGCCGTCCACAACTTCCAGGTTCCCATCCTGCGGTAAGTGTCACTAGGAATA 

ATATTTCACAAATGAAAAGCATGTTTTAGCTAACAGGAGTAGTAGGCAl^ 

GAGCTGGGATTGCTAGTGAGTGTTAGGCAGAATCAGCCTCCrATTAGTGATAAACT 

CTTTTTTTTTTTGGAGATGGAGTGTCTCACTCTGTCACCCTGACT 

TCCCAG CTTCAAGCCATTCTCATG CCTCAG CCTCC CGAG TAG CTGGGATT ACAGGTG CCTGC CATCACACC CGG CT AATTTTTGT A 
TTTTTTAGTAGAGACGAGGTTTrGCCATTTTGGTCAGGCTGGTCT 

AAAGTG CTTGG ATT ACAGG CATG AG C CACCAC ACTCTG CCT AGCCACATTTTCTT AACTT CTTGTCT AG ATGAAATT CTCTT CATT 

TTTGGAGAATATACACAATGCGAATGAAACCCAATATTAAATCAGAAGTAGGCCTCTTG 

TTTAGCTTCATCCTTGAGCCGTCTATGTTAACACACGTAATTCACAGGACCTCAGCTG 

AGGTACAGATTTATGCTCTAAGCAGGAATTATTTTCCTGCTClTl'rCr'l GGGGCTACAT ATGGCe. ri r i i r i mi l i 1 1 iTTTTT 
TTTTITrGAGACXSGAGTCTCGCTCTGTTGCCCAGGCTGGAGTC 

TCACGCCATT CT C CTG CCT CAGCCT CG CGAGT AG CTCGGACT ACAGG CGCC CG CCACAACGCC 

GTAGAGATGGGGTTTCACCCCGTTAGCCAGGATGGTCTCGATCTCCTAACCTCGTGATCTGCCCGCCTCGGCCrCCCAAAGTGCT 

GGATTACAGGCGTGAGCCACCGTGCCCGGCCCATATGGACTTTTTATGCCCTTTAATGCCATCT 

CACTTTTTGCCTGGTAACCCTTGAAAAACAAAATCTGCAGAAAGCCAAATATACAT 

AGTAGCTGAGCCCAGGTTGAAACAGGGGAGGCTGTCCAGGGGCAGCACrTGTTATCAAA 

CCAGTGACCTTTCGGATCAAAATGTAATGCCTTTATGGAAGTGTGA 

CTATCTTAATATTATAGGCTCGGTGAGACAGATCCATACACCCCT 

GT CAAG CCCTGTG CTT AG CCAT C AT C ACC C AATG ACT AC AAG AAGTT AG AT ATTTTGGAATAACTTTTT AATTGACTG^ 

CATACGGGAAAAAAAAGGCATACGCCCTATGTGTACAACTCAATTAATTTCCACAAAATT^ 

TGAAGAAGCAACAC!AACTGAGCACTCAGATCTACITGTAGTTTCTTTC 

TGAAGCATGAGCTAGTGAATGTAACAGCTCTAATTTTGTAGCCTTCAGTG 

CCCAAGTGGCAGGCCCTTCCAGAAGCCTTCCCAATGATCAGGTGTTGCCCGTGGTCGATCTC^ 

GTGGTG CGATTAT AGAAG CAAATTTAAGTAGCATGTCATGGAAGGG ATCCTGT ATTTGTTTTCTG^ 

GCCGTTGTGGTTTCTAGACTTGGTCAGCGGGTGGAATAATCTCAAGAATCAAATCAT 

GTGCTAAAGCAACCC CTG CTTG CTGTCCAACT AAGTGGTAGGACCAACGCGCACTCAGCTCCCGCCCGCCCAGTCTGCCCTGTGTG 
TGCGCTGCCCCCCIATCTCTTGCCACTCTGCTTCCAGGCCT 

TTTA^GTCGTTTCAGTTGGCGTCATCCGT^ 

TGGCCAGACTCACGTGTTGCAGTGTTCTGGTAATTGATTATC 

CGCTGCAO^TGAGGTCAGGCAGTGCATGGTAACTTCATT 

AGAGGTGTAATCTGAGGTCTAGAAAAACCAGTTTTCTTTTTGAGGCAGTAATGTT 
TTATTGGATTTTGAGGGAAAATAAAAATAAGTTCTTTTTCATC 

GTATATATAATGTGTTTGAATTGCTTTTTTTCCATTCTACTTTTATTATCAGTGGGGTA 
TTTATTTTCATATTACTTGGTGTTTTTACCGACATTTTAGCATATAAT^ 
ACTAGAAACAACAGAAATAAAAGACATAAAACCCTTGAGATTTTATTTAGTAATA^ 
CT CAATCGTGTTGACCATAGAACAGAAAGATTTAATGCAGTGTTCGAGAGGAA 

TATGTCTCXTTTTCATGxT^^ a 1 1 
GGTGCATCTTTAGACTACAGGAGAACATGTCAG 

TAGATAACAACCATAACATCATCAAAAGAGAAATGAGAAACTGGGAGTAAGTAGCACTAATGCTGACACA 
GTATACAAAGAGATTCTAAAAATAATTAACAAAAATATTAACTGTCTACTAGGATAATTGGCAGG 
GTAAAGCAAATATAAATAGTTCTTAAACATATGAAATGTTGTTTAACTTCATGAGAA 
5 AACTGTGTCCACCTATGATCCAAATTTTAAAAGCITGATAACACT 

GGGAAGTCTGAAATG ATG CAATCTGT AAGAAAGGCTCTG CAATTTTATTT CTAAGTATGT ACAGAT ATTCTTGCATTTGTGTG AAA 

TCATGTAC^TGTGAAGGTGTTTTTTCCAAAATTGTATAATTGACAAAGT 

TGAAATTCGTTACAGTAAAATTCCTACTATGGGAGGC^ 

AATGTCCTGCAACACACACATACACATACATATATGCAGAATGTGTTGATATCGTA 
10 TGGCAATGCTGGTTGCCTCTGGAGTAAGAGATGGGAATGTTAAGAAATCCCACTTTTCAT^ 
TTTGAACCrGTTAAGAAATTTTAATCAATTTTTTTAAAACA^ 
AATACCTTTCTGGCAAAAAAAAAAGTTTGCTTTCTATTTAAGTTTAGA 
AGAAACTGGAAATAAATATTACTAATGATGACACATATATAW 

AGGTTTAAGCTATAGATTAAAAATATTTTAATTGCTTATCAGAAATATTTTAGAGCTAATATC 
1 5 AACTTTCCATTATGTAATCAGTCATGAGAAAAGAATTGTGAGAATTATGTCAAA 
TTTCTTGAACTTGTTAATTTGCTCAGTGTACATAG1TGTTTT 

AT AGCAT CT AG AT AG TT AATGAAAAGT ATGTTT AGATTTG AAG ATG ATT ATG AATCAG AT AATTT AAATTT^ CAT AAAATT ACT A 
ACAGGCTGATGCATTTTAAACirGTCTTTA 

GGGAGGCCGAGGCAGGTGGATCACTTGAGGTCAGGAGTTCAAGACTAGCCT 
2 0 ACAAAAATTAGCCAGGCGTGGTGGCACACGCCTATAATCCCAGCTACT 

GTGGAGATTGCAGAGATCGTGCCATTGCCCTCCAGCCTGGGAGACAGAGCAAGACT 
GAAAAGAAATTAGTAACATACTTATAAAGTATAAACAAAATTAACTTAAGAATACAACTAG 
CCCAATATTAGTCCTGGAATTTCrTTCTCTGCrGCTI^ 
CAGCAACATCTCTAAAAAGACTTCCAGGGAACTTTTCAGCT^ 

2 5 GTCCCAGAATTTCCAATTAAAACCCATTTTTCATGTITTATTCCAAC^ 

AAGTACAGGGCTCAGTGGGAAGAGAAAAAAGAAAGAAACATCCTCAAGTTGTACCCT 

CC CTTG AGAAAATGTGTTTT CATAT AATT AT AAAGG AACAG ATTTT AG AT CTT CACTG T AAG AG AAGAGT AATTGCAG AGGT CAAG 
TGTTATTTTCATGX^TCTCAGAGCAAAGCTAATAGTAGCCT 

CCAGGGTAATTCCACCAAAAAAAGGAAACCAAACCTAGTTTATTGCAGCCAATAT^ 

3 0 TATTGTGATGACTAATGACTGTATTGTC^TCTGCATTACTGCTGT^ 

TAAGATCTAGTAAATGGATCTTAGGACACTTGGCAATACTCT 
TAATTTTTTCCTCTTTTGTTCACTTTTGGATC^ 

CGCMTTAAACCTTACCAATAGGAAATCAGGGATAATAATTTATAACTGGACTTATTA 
ATAAAATTTAAAGAAAAGTCACCAGTTGTTCITTTGGAAGGTAGAAAG 

3 5 TTGAGCCTGTTCTGAGTATTAGCAAAAATGAAGAAAGTGGAATGGTCAACATTTCIAT^ 

GATGACACITATTGATTGGGACAGAATGGCTCTCTCAAATGCAAA 
TATTTTTTATT AAAATT AAATGTCATTACATTTTTTAAGAATAGT^ 
CAAACTCCTCAAATTTTTCTTTTTAACTGAGCATTTTCT^ 

AGTG CTTTGGG AAAG ATG AAG CATGGATTTTG CATCTTCAT CTTT ATC ATAAAG T AACT ACTTGTT CTTTT AACTG ACTCAAT AGGA 

4 0 AACCTCCCATTTCTTCCACATTCTTCCIAGTTGTGGCATCA 

GAGCCCAGGGATCTGGGTAGGAGAGAGTTGATGTGGGGTTGTTGAGAGACATGAGGGTTCITTGTG^ 
AATCACATAGTGGTCTGTGTTGGGC^CATGGAAATGAAGCCCCAAGCTTTATTGGCT 
AGATCTCCCATCTGTTTCCCTATCCATAAGCCAAGATAGACTAGGGGTGAACAACACACTGCTAGTGAGAGT 
CCOUSGGGTTTAGATTCTCTGTGTCCATCGCTACrrAATrCTTACTCATTCCT 

AGTCCTCCTGGTCTGGAGAACATTCTGTGAGGTAGAGAGGCTG 

ATGTCAAGAAGAGGTGATGGTCCCTCTCCTCACCAGTGATGGCTAGTCCGAGGACATCCTTTAATTCCACATCT 

TCTTTGGATCTCAGAATGTGCCTTGCCTGCAGGGAAAATCAAAGGAGTTGAAATGACA^ 

AAAAAATAAAACTATTTTCAGCCTAGAATTGGTTGCGGGAGAAAACTGGGCAAAAGAG^ 

5 0 GTTTCTGTCAGGACAGTTTTCCAGAGGCAGGA 

AAATGTTTAAAAGAAATAGATTGTCATTTGGAATTATTCATTCT^ CAT AAAGG CTTGAGAGAGAGCTAATT 
TTTAAAGCIACATTAGATTATACTTCACATCTCCTAGTTTTTAGAAGTGATT^ 
GTTCAACACATTTTTATTGAGOSTCCTTGTGGACGTGGCACAGrrCCGAGTGCAGTGAATA 
ATTCTATTTTTATTTTTCATGTGTCATAGTTACT^ 

5 5 CCATCGACATCATTTTTACAAATAAGACCAGATGTCCITTAGCCTTGTCZAACAGGCT 

AAGTGCTCAGCAGACCGTGTGAAAGAAGAAGGTCCGGGTGTTTGGAAACAATGGCT 

TT ATT CTG CTTTTGGTTT CT CG C CTTGTAAT ATG AATAAGG CACATGAGGATTT AGTG TGTGT C AT AAGAT AAGTG ACTGTCACGT 

TAAGCrTAATGTCTCTGTTTCTGTAACITTCrCCATAATCrc 

GTCAATGAATACATAGAATTGTGACTCAACTGACGCTGAAAAAAA 

6 0 TAGTAGCAAGAATCTTGTTTTAATTTGGAAGAAAGGGTGGGTCAGGAAAACTCA^ 

GATAATGAGTAGATTTCCCCCCCCAGTTGTTTCATGATATAAACTGATGTGGACCAAATAAATACCTGTCT 
TTCCAGCATTAGCTCTGGCAGCTGGAGCATTGCTGGCCTAGGTGGCAGCA^ 
TGCCACCCTCTGCCATCCACCCCAATGCTGCTGCATCTCCACCGAAGATAAAGACTTGGGTGG 
TGGAGTTGGGAAGTAGGTAGGAGGAGCTGATGAGTACTAATTTTCCTCTTTAAATAATGTCCAT 

6 5 AAGTTAAAGAAGGAAAATGCTTCCAAGGCGAACATGCAAATCATCAGCGATCTT^ 

AGTGGGGAAGCAGCTCTATCCAGGCCAGTAGAGTTGACrCTCACCACACC 
GCACAAGCTTGCTTTAGCCCTTGCAAGATCCAAATAATTGCC^ 

T CTTT CCTTTGTCTTT C CTG CAAAAGTTAAACrGTGATTATTGGAAAATAAAAAT AATT ATG CAGAT CAACTT AACAG TT TTGTGG 

7 0 CCG^TGAGCCAGAACTTGCACCCTGGGCTGCAGACAAAGGT 

GGTCACTTGTAAGTTTGTAGAGTTGGTTCACGAATCCACACCACG 
CACATCTAGTGGCAGGCAGCAGGCACCAGATGCCCATGAGCAGTGAT^ 
CCAACGACGTTGGAAATGTCCCTTGCCACTCrTTCTVGAATT^ 
AAGATCATGTTCTTGGAAGGATGAATCCTGGTGTGGGGTCCAGCCCCA^ 
7 5 AGAAGCCCTACCCATGGACACAGAGGTGTACGJiGAGCCCCTA 
GAAAG CTGCTGACGCTGGAAGACAAAGAACTGGGCTCTGGTAATTTTGGAACTGTGAAAAAG^ CT ACT ACCAAATG AAAAAGT AA 

GTTGCTATGTTCCACCAGCCTCACCCTCACATGAGCTCAGGTTCTAGACATGACTGAGAA 

AACCCTTTTTTTATTAATGCTACCATTCTTGTTAATITCTG^ 

G CAATT ACTTTTGCACCAAC CTAAT AG T ACATG C ACCITATGGAAAATT AGG AAAT AGAAACAAGCAG AAACAT AAGAAAAT AAAA 
5 ACATCAAATTGGCATGGTGGCTCATGCCTGTAATCT^ 

AGACCAG CCTGGGCAACAT AG CAAGAC C CTGTCTCTACAAAAAAAAT AAAT AAATAAAT AAAT AAAT AAAACATAAAAAATT AGC C 

agacatagtagtgcacacatatacttccagctgctcaag^ 

TCT AGCCTGGGTAACAAAGCAAGGC C CTGTTTCAAAAAACAAAACAAAAAAAAGAAAAGAAGAACAT CAT ATTTGT AT AAT AC CAT 

CCCAACACTACTACTGTTAACTAAACATTACTACTGTTAACTr^ 
10 ATATTATTTAATTAAAACAAAATCATTCCATTATAACAATAGTTTTTAC^ 

TCTTACATGAAAATTATTGATTCCAAATATGATTAGTATAATTATTTATTC^ 

ATACAACAGTTAAAATTGCTAAATTTCGAAAATATCTTTGTATTTCT^ 

AAAACTACIGGTTTCTAAAGTCCTTTGAAATACACCCT 

CTTCATGGTATTTCAAATGTAAGGGATTTTTGTAAAAAAAAAATT 
1 5 GACAAAAATATAC^CAGACATGCXyTGCTTTCCGGCCGGGTC 

GGTGGATCACCTGAGGTCTGGAGTTCAAGACCATCCTGGCCAACATG^ 

GGGTGTGGTGGCATACACCTGTAGTCCCAACTACTTAGGAGGCrG^ 

CCAGCCAAGATCACACCATTGCACTCCAGCCTGGGCAACAAGAGTC^ 

AAAAAAGAAATG CTTG CTTTC CAATCTGTC C CITCGATTTGTTTTTGTG CATGTGTGTGTGTG TAAATT AG AT ACACAT ACATGT A 
2 0 TGTATACACAX^CAAACATACAC^CATATGTATACATTCACAAGTATATACATGC 

TG TGAT ACTTATTTT AAAAT AT ATGTTTGT ATATCACAGTCATTTTGT ACZAATGAG CTT 

TTTCAGATTATGAAAGGTTGTGATCTTTATTCTGTGCTT 

GAGAATTTGGGGAGAGGAGAGGCTCTCTCGGTCAATT^ 

CAGCCCRTCTCTCCATTTACCAGGTCTTCTTCAGCCTCTCTCAGGAG 

2 5 CTTTTAAATTCTGCACCTAGAAATTCACTTTCAACAAT^ 

TGAATCCCAAGATGTTACAGGAATTCGT ACATGT CATCGTTGGTGAAGTT^ 

TTTTTGCTTGGTTGGGACATTCITCAGTACTGTTCITCACC^ 

GTGACCCAGGCTGGAGTGAAGTGGCAAGAAGCTCACTGCAGCCT 

AG CAG CTGGGACT ACAGG CTTG CACTGTCACGC CTGGGTGATTTTTTTT AGAG ATGGGGGT CT CACT ATG TTGCC C^ 

3 0 TGAACTTCTGACCTCAAGCAATCCTCCCCTTTCAACCTCCCAAAG 

CATTCnTTTTTAGTAATGGAGTATTACTCTCATTGTCTATAAAATAAG 
TCTGAAGACACATTGCCACTCTGTGGCAGGTATTTCCGTGGGACTGTTTCCACAG 
AGAAGTGAGGCATTTTGGAAAGTTTCTTGTGTGTTATGATTTCTCT 
ACGAGGCCIAATGACCCCGCTCTTAAAGATGAGTTATTAGCAGAAGCAAATGTC^ 

3 5 ATGATCGGGATATGCGAGGCCIGAGTCCTGGATGCTGGTTATGGAGATGGCAGAACTTG^ 

CAGGTATTGTCAGGTGCCCCCACACATCTGGAAGCTATCCCCTAAGGGAC^ 
CCTTGGGAAGCCITCCCACCCTCCTGTGCCTTCCTTTATTGGTCCATACT 
TTGGATTCAGCTCTGTTAGCTCrAGTTGTTCAGAGAAGAACTCATTT 
CTTGGCCTAGTGATAGACTCAGCCTCAGAATATCTTGACATGGAATCrrCCT 

4 0 CATTTTAAATAAATGCTTTTCTAAGAGGTGACAGTTAAGTCTTATCT 

CTCACITGGAGTAAGACAGAATGATGGCCACCGAGGAGCTTGCCrCATTTGCT 

CTCCATGGAAGGGAGGTGTTCTTACCCTGACTTTGTAGGTGAGAAA 

AACAAGGATTTGAGGTGGTGCCTGGCTATCTGTGAGCCAGGCCTGACTACT 

4 5 GAGXIAAACTCAATCCCATTGAACACGAGGAGAATATGGC^ 

CGTGAGTTTTGTTGTTTAAGAAGAGGGAAGGAGTAGATTCACCCACTG 
CACTGTCTGTTATAGCTGATGAGATCATTATGACTTTCAGACATGTCAAGGATAAGAACAT 

ATGGGCATG AAGTACTTGGAGGAGAG CAATTTTGTGCACAGAGATCTGGCTGCAAGAAATGTGTTGCTAGT^ C 
CAAGATCAGTGATTTCGGACTTTCCAAAGCACTGCGTGCTGATGAAAACTACT 

5 0 CAGCAGGTGGTAAAAAATGCAAGATAAATGTACGTTTTCTTTATTTTATTATTG^ 

AAATAACTATGTAAGATATGTTCTTATAATCTTTGATTTTACTGTTC 
AGTAGTATATTCTAAATTATGTTTC!ATGGAATGATCAGGAATCGAGGT 
TTCCrrCATTAAGTTAATTTGAAAAGTTTGCATTGGAC^ 
GATTACGAATCTCAGTAGAGGGTTTACTGCACTGCATTTCTC^ 

5 5 TAACATGTGGTGGAACACTGTCCCACAGCATGGAGCTATTGGAGCTTGGAGCT 

CTTGGAGCTATTCCAAAATAACTTTCTAATGAACACCTACCT 
TCATCTGTGTATGTCCACTACATATGTAGAAATTGTTCTATTGGCTTAA 
GCAGTGCIGGAAAGAAGAGATCTAGATAGTTATGTGACTGAAGCTAAAAGAAATAT^ 
AAGAATTAAGGGTGGACTTCCAGCAAAACATAAAGTAGAAACTAGTTC^ 

6 0 GTGAC CAAT CCCCTTG CT AAT AT AACTTG AC C CCTCTCTG AACCCT CAAATGAACCAAT AACAACT AAAAGAAGAAAAGAT AAG AA 

AGCrCTATCTTCATATTAACTCATCTGTGGTGTATACCTAGGGTCTATGTTA^ 
GAGTCAAAGAGGGTGCCGAAGGCAGGTTAACAACCCTGCTAGTCTCTGGTGAAGTCCACATGC 
AAAATCTACCTAGTGGGTGGCAGATAGGATCCTGTGTTTCATGCCCTC 
CCACTCACCTTTCCCTAAGAGGCAGTCCATGGACTGGTAGGTGATGGG^ 

6 5 GGTAAGCCACAAGAGCCAAGCTCCCAGGCAGGCAGGCAGACTCCACC^ 

CAGGCTCKSCCCTCCTCACCCCATTCTrCATACAC^ 

CAGCATGGGC CTGGAT AG AAATAGCAGGG AAACAG AG CAAGGAAGACAGCAG CCT AGG ATGACCAG CATGGGACATG CTGGAGAAA 
GAAGCCTGTTGTAATGCTGGATGTACCTCATCATGATOVTTACAGAAAAGAAGACAGCCTGTC 

AGAGAAAATG AAACATG AATG CTGGGCACCTT ATC CACTGCTG T AT AACAAATCAC CCCAAACTT ACTG C CAAAAACAC CAC CAGG 

7 0 TGTTGGTTTCCCTGCCGTCCTGCTGTGTGGGCAGGGCCAGGTAGGCAGGGTTCAGTGCGTCTGCT 

GCCCGAGGGACAGGAGGACCGTTTCCACTCTGCTCACTCACACTACTCAGCCAC^ 
GACCTCTGCXjGGGCTGTCAGCCAGCTCACCTGCACATGGCCTCTCCTTGG 
GGCAGCATGGCAGCTGCGGGCTGAGAGCAGTTACCCAAGAGGCAGGAAG^ 
TACCTCAGGGTCACTTCCACCrTGATCCTATTGGTCAGAGCAGCCAAG^ 
75 TTTGAGGAGAAGTCTCAAAGAGTCAAAGAACATGGGGC 
AACT AAACT AAAAAGT AAACCAAAGATTGT CTTTGGACAATCAATG AAX3 AAAAAAAG C CACT AGGG AAAT AAAAGC CAT CTTGG AA 

G CAG CAAAAAG AGACAAAACT ACCGAAAAG ATAGGG TAG TGGAAGATTT CTGT ATG AAAACAAAGCAAAGTGAAATGGAAAAGAAT 

AAAAAGTGGGCCGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCT 

TCGAGACCAGCCTGGCCAACATGGTGAAATCCCGTCTCTACTAAAAACAC^ 
5 CCCAGCTACTCTGGTGGCTGAGGCAGGAGAATTGCTTG^ 

CTAGCCTGGCCAACAGAGCAAGACTCTGTCTCAAAAAAAAAAAAGGAATAAAAAGTGAAAC 

GGAAGCAAACGAACGCTG^CACTCGGGAAACCATGCGTGACAGGGTAGATT^ 

TTGATCCAGCAGCTCCTTCCGAGAGCAAACACACACATCTGGTAATC 

CTGCCTGACGACTGCTCCGCAGGGTTTGGTCCAATCA^ 
1 0 CTTATTTGCATGTGCTGACCTAGAAATATGTCTACTATACATGGTC 

CCCTAGTTTrATGACCAGAAAAAAGTATAC^CTACCTATTT^ 

ATTTAAGATTTGTCAAAGGTTTACTTTTTTCATCAGTTC^ 

GACTTGTAAGATCAAAATTTTAAAATAGCACACAAGGTGATGGAAACAGAAGTA 

AGGGCACCTTATGGTCTCCAGATTGATGTCCTGAATGTCTCCTTGTGAT 
1 5 ATGTCCCCCTCTCCACCCC^TGAGACCTCACACAGGTCAGATCAGCAG 

AAACCCAGAGAGAGGGTTTTGCTATGCAGTTTTGACCTCTC 

CATTTTTTAATGATGCTAGCAAGTAGTGCATGCTA 

TAGTATATGCTCCTCCGTAGAGACGGACCAAAATAACAAGTACATATTCACACTTCCAATAGATGGTCT 
CAACCAAGGAGCGACAGGAAGCACCAAAAGCAAAGCACAGGGAAACAAGGCAGCCT 
2 0 AAG CTTC CAGAT ATGGGGAAAG CAT AAATGAGAG ATC CCCAGGG CTC CACATTCCAG C CAAGGACTT CT ACAATC CT AGCG ACAG A 

AGAACCCCTCGACCCACGTGGGCCOTAAGACTAACATAGGGAGCT^ 
CACGCTGAGTCCCACAGGCTTCCAAGCTCTGAGTA^ 
GCCCnX5GGACTATTGCTGATGCTGC(^TCACCAGGCCAAGGAC^GAG 
CCCTGKTTGCTACAGGCTGCTGTGGG43CTGAGACACA 

2 5 TGTCCGCAGCCTCCCCAGGAGCAGGCCC^TAGCCAGCGCCATTC^ 

GGGCTGATGCTGCCACTGCCCCGCCACATAGGGAGAAGGGAGGCCAGGCACTTTCACA 
GCTGAGGGCTGCTCTGGGACCAAGGCTCAAGAAGACTGCATATCCC 
ACCCTTTCTGGTGGTAGGCCTTCAGGGCAGCCACTGCTGGCAACCTAGAGACC^ 
TGCACCACCAGGGAGCCTGAGGACAAGTCCATTGGCCTAGTCACATGCCC 

3 0 <^AAC ACCAACATGG ACCACTTGGGTTCCAG TG AGTTGC CCT ACCACC^ C CAT CACCCACA.TC ACACCCACTCC CCAGGAA 

CCCAAGAACCTGCCCACATGCCCAGCCACTGCAGCATCCACACAATCTACCTGAAAGCCCAA 

CCACCTAGCATCCTATTCACAGTACACCTrGACAACAGCCTCCACT 
ACTGAGGCTGTTTATGGCCATAGAAATTATACATAGATTAAATACTGCACT 
35 ATATGTGTGTGTGTATATATACACACATAC^CATACGTGTATATATA 

ATACACATACGTGTATATATACACATAOICATACACATACGTGTATATATACACACATATACA 
ATACACATACAC^TACGTGTATATATACACACATACACATACGTGTATACATAC^ 

ACATATGTGTATATATACATACATATACACATATGTGTGTACATATACATACACATACACATATGTGTGTACAT^ 
ACATACACATATGTGTACATGTACATACATACACATACAC^TATGTGTACATGTACATACATACACA 
40 ACATACATACACATACAC^TATGTGTACATGTACATATATACAC^TATACACATATGTGTACATGTACA 
ATATGTGTACATGTACATATATACACATATACACATATGTGTACATC 
ACACATATACACATATGTGTACATGCACATATACIACATGTC 

GTGTGTATATACATATACACACATATGTGTGTATATACATATATACATATATACATATATATACATATGCACATATGTGTATATAT 
ACATATATATACATATGCACATATGTGTATATATATACACACATATATGTGTATATATATATATACCCAACATATATATACCCAAC 

4 5 AACATATATATATTTTCAC^AAAAAGACCTCCTCTACAAAAGCA^ 

AGATATCAATGTAAGAACACAGGAAACATGAAAAAGCAAGGAAATATGATACCTCCAAA^ 
TCCCAATCAAAAAGAAATTCACAAAACCCCAGATGAACAATTCAAAATATTGATT^^ 
CTGAAAAACAATAC^GAGAATTCAGAAAAACAGTTCAGGATATGAAT 
AAACAGAAATTCTGGAACTAAAGATTCATTGAATGCAATAAAAAATACATCCCAAACT 

5 0 AAGAATCTCAGAACITGAAGACAGGTCTTTTGAAACAACCTATTCAGACAAAAATA 

CCCTTCATGATATATGGGGCAGTGTAAAGCAACTGAATATTCAAATCATCAGTGTCCC^ 
AAATCCTATTTAATGAAATATTAGATCAAAACTTCCCAAATATAACAACAAACCTAGACATCC^ 
CCAAACAGATACAATlSCAAAATCATCTTCTCaVCAGCACGTTATAATCAA^ 
AAGAGCAAAAGAAAATCTTCTAATTACCCATAAAGGAACCACCATTAGACTAACA^ 

5 5 GGAGAGAAGGGGATGACATTTTCAAAGTGCTGAAAGAAAAATACTGCCAC^ 

AATAAAAGAAAAATAAAGCCTTTCCCAGGTAAGCAAACACTGAGGGAATTCATCACCACTAGACT 

AG G AATCTT AAA CG TGG AAGT AAAAGG ACAAT ATTT ACC AT CAT AAAAAC ACACAAAAG TAT AAAACT CACTGGT AAAC CAAAC AT 

ACAAATAATCCTGTTAAATAGTGGGCTAAGAACATAGATAGACAT^ 

AAAATGCTCAATATCACCAAACAGCAGGGAAATGCAAATTACAAACACAATGAGATACC^ 

6 0 TTAAAAAGACTTTTTTTAAAAAAATGCTTTGCAGATGCAGGAAAAAAA 

AT AGTCACTT ACGGAAAAT ATT AG AGAGACTT CT CAAAACAAAAACAC^CTGGGGACGGTGG CT CACACCTGT AATC C CAG CAGTTT 
TGGAAGCCCGAGGCGGGCAGATCACAAGGTCAGGAGATCAAGACCATCCT 
CAAAAAATTGGCCAGGCCTGGTGACAGGTGCCTGTCATCCCAGCT^ 
GCGGAGCTTGCAGTGAGCTGAAATCACACTACTGCACACTCCAGCCTGGGC^ 

6 5 GAAAAAAAAACTAACTACACATAGAATTACCATATGATCCAGCAAGTCT 

TCAAAC^^ACACATACACTCAACGTGCTTATCATAGAACTATTCACAATAG 

GATGAATGGATAAAGAAAATGTGGTGTGTATATATATATATACACCTCTAGAGGTGTT CACTGGT^ 

AAATGAAGAGGCTAAAGTGAAGTTACAAAGTTATTTACrTGGTGTACACCCTATGTA 

TCATTTACTGAGTGTACACCCTGTGTAAATGAAGAGGATATTTCCTGTCATAGCTGAAGTG 

7 0 GTCCTTAGGTTCCCTGCCTCCAGGCCCTATTCTCCTGTCTCAATTTCACTCAG 

GAATTCCCTTCCTTTTGAAGGCTGAATGCTATTCCATTGTCTATATAG 

TTGGGTTGCTTTCATGl"ITrAGCTATTGGGJ^TAATGCTGCTATGAACATGAATG 

TTTTTTTTTTTTTTGAGATGGAGTCTCTCGTTGTGTCACCCAG^ 

CTTCTGGGTTCAAG CAATTCTCCTGCCTCAGCCT CCCAAGT AGCTGGAATT ACAGGCAC CCGCCACCACACCTGGCTAATTTTTGT 

7 5 ATTTTTAATAGAGATG CCOTTTC CTCTTGGCCAGGCTGGTCT CAAG CT CCCGAT CT CAAGTTGATCTGCT CTCCTCAGCClXC CAAA 






CTGCTGGGATTACATCCATGAACCAATC 

CAGACCCATGGAAAGTGGCCTGTCAAGTGGTACGCTCCGGAATGCATCAACTACTACAAG^ 
CTTTGGAGTGTTGATCTGGGAAGCATTCTCCTATGGGCAGAAGCCATATCG 

GCCATT AGAACAGAT AAGC^C CAGATTGTCTTT ACAACAACC^ CTTTT AC CAG T AAG C 
5 AGTTAATGTGCCCTCAGTCTATTCCAAACTGAGAATGAAATATCT 

ACATGT AAAT ATACACAAAAT ATAAC C CAGATCATT ACT CATTAGAGTTGAAAT AGAAAT AATGACTATTT ACAGT AT CTT AATT A 
CA JT'l'T 1 ' 1 AGATCITTTAATT AAAACAT ATAT ACCAAAAAGGG CACAACT CTT AG CATGCAGT ACAGTGGTTTTCCACAAATG AAA 
CATACCCATGAAATCAGCCCCCAGATGAAGAACAGAACCCCAGAAGCCTC 
CAACTGACTCCAACATCACAGATTAAGTGCTCCTGT^ 
1 0 TGGAGTGGCTGTTrrGTTTTGTTTGACTAACACCTTT 

AAGTGAAGTCACCGCTATGTTAGAGAAAGGAGAGCGGATGGGGTGCC 
TGTGCTGGACATACGAjGTGAGTACCTCACACTGACTGTGATG 

AAAAAGGATAAAGTTCAC C TTTT CATGAATCAGGAATGTGCCCGCTrrCT AAACAAACAACGGTG CTTTAGAG 

ACCTTGCrTTCTGGCCCATGCACTTACATGTAGTAGAGCCTAAAGA^ 
15 AAATGTCGATAATGGTGGAGAGATGGGGCAAGGGCAGTCAAAGTAAAAAAGTG 

GCATAGAGGATGGCAGAAGCACTCGGTAGGGGATGGTATACTTTCCTGACTCCCT 

ACTCTGGCCAGCTCACAGATACATTGGCTAAGGTTGCACTTCCCT 

TGACTGCCGTCATACGTCTCGCTTAGTGGATCCCASGTGCTGTTATGCTTTTAAAGTA 

AGGGCAGCTCAGCAAGGAAGGAAAATGCATTGCCIACTGAAAATT^ 
2 0 CATAGCACGGTGACCCCAAGCCGCCTlTCTCCAGTTCCCrrGGTACTGAACTGTG 

GTTTGTGATCAGGAGGACATCACAAATCCTGGCCATGGA 

GCCAGAGCCTCTCAAAGATGAGGGCTATTCCAAATGCTGC^ 

ATCACTCTACCACAATTGTGAGTGACAGGGAAATAATCTCATTCTGTATCT 

GCAAACTGTGGGTATATGCTTCCCTGCCCCTGCTGCTGCCGTGGCT 

2 5 AACCCCAAGTX3TTAGTGCAACAGGGAGAGCTAAGTGTCAGCAAGAGGGAAGTGCT 

AGTCTTGGCTITAATTCTCCCAGTGCCTAAACTAGAATCGT^ 

GGCAGCCCTTGGGCACGTGGCAGCCTATGACCCTTTACTGAAAGTGTTCT 

TTCTTGTAGAAGGGTGA 

HUMAN SEQUENCE - mRNA

GAGGAAGAGCa3aXK3CCCXX305GCTGAGGCCACCCCG^CGGCC^CT 
CCCTCX3CCTCACCTGGCGCAGGTGGACACCTGCGCIAGGTGTGTGCCCT 

AGCGC CAACCACCTGCCCrrCl" X"l ' TTCGGCAACATCACCCGGGAGGAGGCAG AAGATT ACCTGGTCCAGGGGGG CATGAGTGATGG 
GCTTTATTTGCTGCGCCAGAGCCGCAACTACCrGGGTGGC^ 

3 5 TCGAGCXXSGAGCTGAATGGCA.CCTACGCCATCX3CCGGTGGCAGGACCCATG 

GAGTCTGATGGCCTGGTCTGCCTCCrCAAGAAGCCCTT^ 

G AAGGAAAACCTCATCAGGG AATATGTGAAGCAG ACATGGAACCTGCAGGGTCAGGCTTCT CAGG CCATCATC AGTCAGAAGC 
CT CAGCTGGAGAAG CTG ATCGCTACCACAG CC CATGAAAAAATGCCTTGGTT CCATGGAAAAATCT CTCGGGAAGAATCTG AGCAA 
ATTGTCCTGATAGGATCAAAGACAAATGGAAAGTTCCTGATCCGAGCCAGAGACAACAACGGCTCCTACGCCCT 

4 0 CGAAGGGAAGGTGCTGCACTATCGCATCGACAAAGACAAGAC^GGGAAGCT 

GG C AGCT AG T CGAG CATTATTCTTATAAAG CAGATGG TTTG TT AAG AGTT CTT ACTG TCC CATG TCAAAAAATCGG CACACAGGG A 

GTCATTCAATCCGTATGAGCCAGAACTTGCACCCTGGGCTC 

TGTACGAGAGCCCCTACX5CGGACCCCGAGGAGATCAGGCCCAAGGAGGTTTACCTGGACCGAAAGCTGCTGACGCTG 

4 5 GAACTGGGCTCTGGTAATTTTGGAACTGTGAAAAAGGGCTACTACCAAATGA 

GAAAAACGAGGCCAATGACCCCGCTCTTAAAGATGAGTTATTAGCAGAAGCAAATGTCATGCA 
TGCGGATGATCGGGATATGCGAGGCCGAGTCCTGGATGCTGGTTATGGAGATC^CAGAACITX^ 

C AGAACAG ACATGTCAAGG AT AAGAACAT CAT AG AACTGGTT C ATCAGGTTT CCATGGG CATGAAG T ACTTGG AGG AGAG CAATTT 
TGTGCACAGAGATCTGGCTG CAAGAAATGTGTTGCT AGTTACCCAACATTACGCCAAG ATCAGTGAT^ CAAAGCAC 

5 0 TGCGTGCTGATGAAAACTACTACAAGGCCCAGACCCATGGAAAGTGGCCTGTCAAGTGGTACGCTCCGGAATGCAT 

AAGTTCTCCAGCAAAAGCGATGTCTGGAGCTTTGGAGTGTTGATGTGGGA 
GAAAGGAAGTGAAGTCACCGCTATGTTAGAGAAAGGAGAGCGGATGGGGTGC^ 
TGAATCTGTGCTGGA<^TACGATGTGGAAAACttGGCCCGGATC 
GTGAACTAACrcCTCCCGCACCTGTCGGTGGCTGCCTTTGATCAC 

CCCTCCACAAAGCAAAGGCAGTCCCGGGAGAAAAGACGGATGGCAGGATCCAAGG 
TGTGATTTTCATACAC^TTATTTTTACGATCTGTTTCCAAATCCCriT^ 
TTACTCATCGGGCCCAGGGACATTGCAGAGTGGCCTAGAGCACTCTCACCCCAA 
GCATGTGACTCCTGAAGGGAAGGCAAAGGCAGAGGAATTTGGCTGCTTCTTACGGCC^ 
60 TTCCTGACAATAAAAATGTTTTGAGGCTTTAAAAAGAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGCCAGCAGCGGCATGGCTGACAGCGCCAACCACCTGCCCn'TCrTTTTCG 
GGTCCAGGGGGGCATG AGTCATGGGCTTT ATTTGCTGCG CCAG AGCCGCAACT 

6 5 GGAGGAAGGCACACCACTACACCATCGAGCTC-GGAGCTGAATGGCACCTACGCCAT 

GACCTCTGCCACTACCACTCCCAGGAGTCItSATGGCCTGGT 
CAAGACTGGGCCCTTTGAGGATTTGAAGGAAAACCTCATCAG^^ 
AGCAGGCCATCATCAGTCAGAAGCCTCAGCTG^AGAAGCTGATCGCTACCACM 
ATCTCTCGGGAAGAATCTGAGCAAATTGTCCTGATAGGATCAAAGACAAATGGAAAGTTCCT 

7 0 CTCCTACGCCCTGTGCCTGCTGCACGAAGGGAAGGTGCTGC 

AGGGAAAGAAGTTCGACACGCTCTGGC^GCTAGTCGAGCATTATTCTT 
TGTCAAAAAATCX3GCACACAGGGAAATGTTAATTTTGGAGGC 
AGGGAACCX^GCAAGAGAGTACTGTGTCATTCAATCCGTATGAGCCAGAACTTGCA^ 
AAGCCCTACCCATGGACACAGAGGTGTACGAGAGCCCCTACGCGGArc^ 
7 5 AACCTCCTCACCCTCCAACACAAAa\ACrrCGCCTC^ 
GAAAACCGTGGCTGTGAAAATACTGAAAAACGAGGCCAATGAC^
AGCAGCTGGAC^VACCCGTACATCGTGCGGATGATCGGGATATC 

GGTCC CCTCAAT AAGT ATTTG CAGCAGAACAGACATCT G TTCATCAGGTTT CCATGGG CAT 

GAAGTACTTGGAGGAGAGCAATTTTGTGCACAGAG 
5 GTGATTTCGGACTTTCCAAAGCACTGCGTGCTGATGAAAACTACTAC^ 
GCTCCGGAATGCATCAACTACTAC^VAGTTCTCCAGCAAAAGCGATGT 
TGGGCAGAAGCCATATCGAGGGATGAAAJSGAAGTGAAG^ 
GTCCAAGAGAGATGTACGATCTCATGAATCTGTGCTGGACATAC^ 
CTGCGCAATTACTACTATGACGTGGTGAACTAA 
Table 52

MOUSE NOMENCLATURE 
ICSGNM Gnb1
Celera mCG23363

HUMAN NOMENCLATURE 
HGNC GNB1 
Celera hCG23470 



MOUSE SEQUENCE - GENOMIC

AAGAGGGCACTAGATCTCATTATAGATGGTTGTAAGCCACCA^ 

AGTGCTCTTAACCTCTGAGC^TCTCTCCAAGCCCUTl'GTT'l'TTTGTTTTTG 

CTTTTATTTTTTAATTAGATATTTTCTTCAT^ 

CTCCCC^CCCACCCACTCCTGCTTCCTGGCTCTGGCATTCCCCTCCT^ 
1 5 TAGACCAGGCTGGCCTGGAAGAGACAAAGAGATCTCACTGAGATTIT^ 
GGGTGGCIXMTATTAAAGGTGCTGGCCACCACTGTCCAA 

GGGAGAGGGTTGGTTGTTTTATATCAAGGCCCITCCATTC 
GAAAAAAAG lTlTTTTr GTTTTTGTTTGTTCGTT 
20 T ATT AATT AT ATGT AAGT ACACTG TAG CTGTCCTCAG AGACT C CAGAAG AGGG CG C CAGAT CTCATT ACGGATGGTTGTG AG CCAC 
CATGTGTTTGCCGGGATTTGAACTCAGGACCTTTGGAAGAGCAGTCAGTGCT 
TTGTTTTTTTTACCACTTGGACTACTGACCTAGGTGGGATT^ 
GAGTAATAAGTGCTCCTGAGGCAGCAGAGCCACTTTGTTTCTAACGCTTC 
TCACAGATTCAGCTCTTTAAATGGACATGATACTTAGC^ 

2 5 CACTATAGCTCACTCTGCCCCATAGCAGGTACTTCAGTCAG 

TAAGGCATTGCTGAATTTTTAAAAAGAAGCATTACCTTTCTCAGGACT 
GTCTCCACTGCAAAAGGGTAGAAAAACTTCCCATGCTATTTGGTGAAAATTGA^ 

CACAAG TTGTACTT ACAGTTCCTTCCCTTTGGAG AGGG CATG CTGCCACTTTCAT CTCACTTCAGG TGAG TTTCCAGAGGATCATA 
CCTAGAACAGTGTTTAAGAGTTTCGAAGAAGCCAGGTCTGGTGGCGC^ 

3 0 GATTTCAGAGTTTGCGGCCAGCCTGATCTACAAAGTGAGTTCCAGGAC^ 

AAGAGTTACGTTGTTGCTGATAGGTACATATGCCATGTACATAAAA 

CATGGCCTTCATGTAGTGTGAGAAAGTGGAGAGAAAGAGTGGGCAGCAAGCCAGGATGTTAA^ 
TAGTAAAATAAACGGGAAGTTTGACAAGAGAATTAAAAGACAATTC 

AAG C CCCAAGATC CGC CATTT AGATGTTT AAACAAATTTGGAT AAGT AC C G AGGTT GCATCATT ATG AAACT AACGAGAAGTGG CT 

3 5 GTGCGCTCCAGTTGTACCGGAACAAAATGATGCTGGCACAGCAAGCTTCATGTCC 

TTTTAGATACATTGGTATATTTGCATTATAGACTTAATGAATGTATGTAATGCCAAACAA 
AATATTAGTAAGTTAGGAGTGTTGGTGGGTAGTTCAAGGTAATCCT 

ACCTTGTCTTTAAAATATCAAAATAATAATTAAAGAAATGATTTAAACATACAGACTATATTACACTA 

40 GCAAACCCCCAAATCTTGCCCCCCCCCCCCCCGCCCCAATX^TAACAAGTTAGGAGTTCAA^ 
AACAATAGTTTTCTTCTGTTTTCTGTCTGATTTT^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
CCCAGTTCTCACAGTCCTCCrrGCTTAGGTTTGTTTGTTTGTTTC 
TGCACTGTAGCTGTCCTCAGACACTCCAGAAGAjGGGAGTCAGATCTTGTTACG^ 

4 5 TTGAACTTCTGACCTTTOTAAGAGCAGTCGGGTGCTCTTACCCACTCA 

AGATTAGAAGTGTGTCCCACCTCACCGAATTTGAAAATGTTAATTATTTAGGGAACTATT 
AGCTGAACTTCTTATATACGTCCCTATCCATTTCTCTACCTTTTGCTCGTTT^ 
TATATATGTGGGTACACTGTAGCTGTAGATG^TTATGA 
TGGCCCAAAGGTTTATTTATTATTATACATAAGTACACTGTAGT^ 

5 0 TACAGGTAGTTGTGAGCCACTATGTGGTTGCTGGGATTTGAACTCAGG 

TCTCACCAGCCCCCAAGGAGA CTriTC TTAAATA eiTTlTl 'G 

AATGTTAGCAGTTGCTTCTTATATAGCCCCTCAGCACACTTGTTTC 

TCCTTCTTCCTCCCTCCCTCCCTTTCACATTGAGACAGTTTTTCA 

GGCTGGCCTTAGACTCACAGAGATCCAACTATTTTTGCCTCCCGAGTTCTAGGATATAAAGC^ 

5 5 ATCATAACTTTAAATCTTTTAGAAATTCTTGTCTTTGGAAATAGCA 

CCCAGAACTTAGGAGGCAGAGGTAGGCAGATATCTGTGAATCCTCTAGGCTAGCATGG 

AGAG CT AT AC AGTG AG ACC CTGACTCAAAAAACCAAAGGAAAAAAGAAAGAAAAAAG AAAAT AG CAT CT CAAACT AGGACAAT AGT 

GTTCTTTGTCAATGTGACTTCTCAGATGTCAGGCATTAACCTTGGAG 

CAGTAGTATATCIACTTCAGTAGTATATCACTTC^GTAGTATATCACTGAGA 

6 0 TCACTATGTGATATAGATGATGTTTGTTTCTTTGTTTTTCAAGGC^ 

at AAATAAAT aat A'rrrrr'rr CAGAC CAG AGTG AAAAGTGATTT AGTG agg C CCT ATTTG t atgttgg AGAAATTGAAATTATTTT 

TCAAGACATGCACCATTAAATTTTAAATTCTAGTTTATTTTTTGTAT 

TCATTCTCCTGGGCAATCAAGTCAGGCTCTGGGTTTGA 

TTTCATCAGTGGCAGGCTTTGCAGCCCAAGTATCGTCAGT 

ACGTGAACTGTTCTAATCGTGCTGACATTTTATT^ 
CTTTACTCCCATGTTCTTACATCATTTAAATTTCCTGAACAGCAGCTCT 
CGTGAGCCTCAGCAGGTGCTGTCTTAGTGGATTAGGCTGGAAAGTGCTACCT 
GGAGGAGAGCTGCCCAGGCAC^CAGACCC^ATGCTGGGGTTTCCAGAAGGGTCAAA^ 

7 0 G AG ACGAAACGT AAG TAAGACGTGTTTCTAATTG CT AAAGTTGTGT AAACAGTATT ATGG CAG ACTCTTCAGTTG TG TG CTGCACT 

T C ACT CAGACTGTGAGAAGGTAAT ATTGGG ACAGCAG^T AAAGG CACTCG CTGTC CATGGCAGG AGAG AAC CAG CT ATTT 

CAGGTTATCCTCTGTCCTTGGAATATGTGTTGTGCTACATACCCATACACAAT^ 
AAAGATCGTTTTCTGCCCAGCACTGGTAGTACTACACGAGGCC^GCCTGG 
AAAGAAACCCTGTCTTGGAAAAAACCAAAAACAAACAAACAAAAAGTTGT^ 
GTGACTCAGGCGCAGTTTTCTCCTCCCTCCTTTACATGGGTGGGTTCTAGGGGGTGAACT
GACTTAACCC^CAGAACCTGGAAATCRTTCnriTGATTG^ 
ATATTGCTTAAAAGTAGAAAAATGTATTTACCAGAAACCrAGTGA 
TTGGTGGCTGCCATGCCTCAGGAGTTTTTGAAGC^ 
5 GGTCTCATGATGTAAGAGCTTTAGTGTTTATGTCCTGCT 
GTTGCTACTTTTTCTTACGTTAATGCTGGATT^ 
AGGTGGTGATCTTGACTCTGTTAAACTAGTAAGAGGGTT^ 
ACAAAAATGGCAGAGTATCTTTCTGTGTTTTTATTCATTTGO 
AGAf^ riT C ri ' L T l tK^CCCTTTGTAAGGTAAAAACAAGTCAAACT 
1 0 TATTCATTTGGG ATTTT ATT CTG C CAT AT AGGJUVTGAATATTCCT AAATTTT AGATCTT AGGTTTTGTG TT CT AAAG AG AATGGCT 

CAGTCTTCTCCTGCAAAAGTCATGCCAGGTACCAGGGAGGCCCAAGGTGTCTCCCTTGGTGAATC^ 
GATGGTGGATATAACCTGTAATACTAGCCCTGGGGAAGGGGAGGGAGTAAGGTCAGAGTAGGTGCCA^ 
CTGTACTGCTTAGCAAGACACTGTCTTAAGAAAGGCAGGAACX3 

CAATTCCCAGCAACCACACAGTGGCrrCACAACCATCTATACTGGTGGGATCTGATGCT 
1 5 ACAGACACTCAACATACATACCATAAATAAATAAATAAA 

GATAAGGAGCATCAACAGAAAATGAGAGTATC^TTGTGGTCAGCAAT 

TGAGCAGCAGCAGCAGCTGGTCTAGGAACATGGAAGCCCTCTTTCAGTGCACAGC^ 

CCCCCCCCCATGTTAAGTACTTAGTAAGTCGACCCTTCTTT^ 

ACCATGAGCATATAGTGTGGAACTGTAGAATAGAGAAGTAAGGAACGTGAACAGTTCTGGGCT 
2 0 GTGTCTTACTCTAACAGTtlATCTCAGGACACAGCAGGCAGAGCTCGGTGTGAAGC 

TTTTTTAACTTTGACTTTTGACAAAAAGTTGATCTTG^ 

TAGACTCAAAAGGAATCTTTCTGCCTCTGCATATCCCAAATTCTGAGATTATGGGTATGCTATCATA 
TTATAAAATTTGTGTGTGCGTGTGTGTTrTATGTTTCTGTAGGTATGCT 

2 5 GTTTATGCAGGTATATTCATCTATCTATGTATGGAGGCTGATCATTA 

TGAACCAGAGCAGTGGTTCTCAACCCGTGGGTCATGACCCTTTT^ 
GAAAACAGACATTTATATTATGGATCGCACAGAGGCAAAGTTACAATCATGAGTAG 
CCACCACATGAGGAACTATATTAAAGGGTAGCAGCATTAGGAAGGTTAAAAACCACTAGAGGT^ 
GTCATTGAACCCAAGAATCCATGTGTATCTGCCCACACCATTTTGAACCr 

3 0 GGGAGTGCTGGAAGATGATTCAGTAGTTAAATGCACTTGGTGCCCTTCCAGAGGACCCTAG 

GCTAACAAACGATATGACTCAGATTCCAGG^IAATCTGTATG 
ACTAAGATTCTTGCATAAATTATATTGGTATTAAGTTGTTT^ 

ATAACCAACATCAGGTCAGAATTGTAGC CTGTGTGCCCCAGGT ACAGTGGT AGATGAG CTGTTCTAACACTT AAAGAGTCT 

GAGGC AGG AGATTTGCTATTAGTTCX5AGGCCAGCAT AGG CCACAGAAATAAGTT CAGGCAGGCACGCAGGCATTTCT ATGT ACAGC 

3 5 TTCCTCTGGGAAGTGAAGTCTTTTTGTGCTGTTGTTTATTTTC 

GGACTGGTGCATAGGTCTGGCTGGCCITGA.GTTCGGATCCTCCCTTTCCT 

T ACACT CTG AG-GTATTTT ATTTAAGG C ATG CTCT ACACT CTGAG G T ATTTT ATTTT CTGAGGCAGTGT CTCATGG AG CACAG ACTG 
GCCTTAG ACTACTGTGTAGCTGAAGATGACATTGAACTTrTTG CTTCCACAT CCTGAGTGG AATT ATAGGTTAGCAACACCTCATG 
TGGGAATTCATACATTGTAGGTAATCAGTCTACTAGCrn3AACrrATATCTCCAAC 

4 0 ATCTAGTTTTGAAACAGTCATATCCTAGGCTGGCCTCAAGTTATGTAGTCAAAGATGGCC^ 

CAAGTG CTGGG ATT ATAG AT ATG CACAC CACCAC^CCTCATTTG TCT CGGGG CT GG ACTCAAAT CCAG AGCTT CATG CATGTG AGG 
CAAGCACTGTACCAACrCGACnTTTGCATACT 

ATGTTTTATATGACTGTGTATGGCTTTGCAGCTTTGACAGCCTTAT^ 
CTCCTCAGCATAGAGTCTGGGGAGAAATAGAAGACITTTGAATGTCCC^ 

4 5 GTACTGGTTTTCCAGATTACCTCAGTCCTGTTTCTTAAAGAAATTACT 

GTTTGTTGTTGTTCT r C T T CTT CT TGATTTCTTTCTTTTATTATTTTTTAAAGATTTATTTATT^ 
AGCTACCTTCAGACACCCCAGAAGAGGGCATCGGATCTCATTACGGATGGCTGTGA 
AGGACCTCTGGAAGAGCAGTCGGCCTTCTTAACCACTGAGCA^ 
TTCCATGAGAAGGACCTGTGCTGTTTTAATATTGATTGATTGATTGATTG^ 

5 0 TTGGCATTGAAATTAAGATCTTCCTTCTTGGCCACCTAAGTAGTGGAATGCACCACCT 

CACCACAATATTTTCAGCAGTATATTAGTACAGTTAATAAAAGTGAAGATTG 

GTTCTGATGAGTGCTGCAGAAAAGGTCTCTGCATTGTGTGATTGTAGCT 

TTCTTGCCATGTTTCTGTTGCAGAGCTTGGATTCT^ 

GCAGGAGGCCGAGCAACTGAAGAACCAAATTAGAGTATGTAGTCATGCGGGTTTGATATGTTTC^ 

5 5 GTGTGGATAACAGAGTGAGATGAAATTGATCTATTATATGCTAGTTATATATCTGAATAACTTATG 

TCTTATTAGCTACAAGTTGTTGCTTGAATGTACATGGCCCAT^ 
AACTAGGTTTTGGACCCrCTGTGTGGCCACrGTGGCCGCTGTGTCA 
AGCAGAAATAGATACTTTAAATGAAGGGATCCGTAAAGTGGCATTTTAGATGTCT 
TGTGCTCCAGGAAGACCATTGGAATGTATGTAACAGCTCTTAGGCCACCATAGAGAGCAGAC^^ 

6 0 ATGGAAGATAATTCCATGTATTCTGAGTTGTTCTTTCTGTGCTGGGGACTAGTGTTAGAAATGGAGCT 

GGCAAATGTGAGTTTTTTGGTTTTC^GGTTTGTATTTTAGTTTTG 

CTCAGTGT AG AC CAGACTGGCCTTGAATGCAACATT AT ACAG AG AT AAGT ATGTGGGGTT^ TTTGTTTG TTTTTTGTT 

TTT CTTAG ACTTG AT CT AAG AG CATGGAAAAAATG ACTT AAG ACTT AACAAG AAAAT CCCAGTTTTTC^ T 
ATATTTGAAATAGCTTCCTCTGATTTTCATGTTCnTGTCTGTAAGGATATGGCT 

6 5 AATGGTTCACATGTCTTTAACTTGTCAGTTGTGTTCT 

ACTTTTACTTGAATTTAAGAAAACATACCAGAACTGCATAAGTTTTAGAG 

TCACTGGACATTGGTGTCTACGCTGTTCCTCCGTGTGTCAGTTTCTGTAAAGTTAATAATTTC^ 

CTGAACTATACCTAGCCCTTGCCTAAAATCITTTTAGGtf 

TAAGATGATGTCAAAATTTGAGTTCAAATGATCCCATGTGTCAGTCTCCCTTGTAGCT 

7 0 TAAACTTTTGCATGACTTAAAAACTTAAAGCCAACTTAAAGCC^ 

GAGCGATAAGAGTTCAAAGTCATGATCAGCTACATAGTGAGCT 

TGGAGAGGTGTCTCAACAATTAACAGCCCTTACTAC^CTTTC^ 

AATCTTATGTAACTCTTAACTCCAAGGAATTTGACACCITCTT 

CAAAAT AAAAGTTTAAAG T CAG G T ACAAT AGGGCATGTCTGT AATG CTAT CAAG TCTGAAT ACTGAAGG AGT ATTGAAGGTTTGAG 
7 5 GCCTGCGC^TGGGCCATATGCTTTTG AGATTTTTA i ialu iGTCC CAAAG7T CTG AAaAT"T AAOaG i" A'T AGAAATG ACAG AGG AG 
TTTTCAGAGAAGGTCTCACTATGTAGCTGTGGCAGTCC^

CAC CACGC CrTGGCTTTTG TGGTTTGAACTTCCTGGTTTTT CTG TG TG ACC CTGTG ACTGTC CTGGAACTCACT 

TTGTAGACCAGGCTGCCTTCAAACCCATAGAAATACCCTTCCTCT^ 

TTTTTAAAACGGGAGCAGGGTGGAGATGGCTATGCTGTTTTAAAT^ 
5 TTTAGGGAAAGGATAAGAAAGCCCAAGTGTTGAAGAAAGCATACTTAGAACAGA 

TAGCGGATTTACTAAGCCATATGGTTGTAGCTGTGTAGC^ 

AGATTTGTTTGTTTGTTTGTTTATTATATGTGTGAACACTG 

CAGATGGTTGTGAGTCACCArGTGGTTGCTGGGATTTGAACT 

TCTCTCCAGCCCAGATAAAGAGAGATAGTAAATCAAAGATAAATGTCATGCAA 
1 0 CCCTAGTACCCACCCGCCTTTCTGCATCCAAATAGTGCCCCTGTAATTA 

AAGGCCAGCTTCAGTGACATATCAAGTTTGAGGCCAACT 

TATTAGTAATATGAAATCTTAAGTTTCATCTAGTCCCATGTA 

TACTTACATTTCAGTTTTGTTTTGGTTTTAAAGTTCAGCTCATGC 

GACATGCACAGATAGTTGGTGACCCCCCCAAGACCTCTGTTGTGGACA^ 
1 5 CAACCTTC ATGATGAG CATCATCTTCACAAATGGCTC CATGGGTTTCGCCTTGCCTCTCAGCACTG CACT AATTTTAGAGTCG ATT 

GGTGAGGAGATGTGAAGGCCATTGAGTCTCTAAATGGGAAGGGAAGAC^^ 

TTGGCTGC C CTG CAACITGATATGTAG ACT ACTGGCCTCAAACTG CTTGCCT CTCCC CAGTTT ACTTTGG AGACT AAAG CTG 
TGTGCTACCTTACCTGGCAAGGAATACrrCTTTAAAAAAGTAAAATTCT 

CTG CTTATAACTGTAGCTCCAGAGAATCCAAGACTGCCTGTCTTTTGGTTT CAAT AGGT AC CTG CAT ACACATAAATTT AAAAACA 
2 0 AAAACAAAAAACTTTTTTTTCTGATATAAGGGAGGAAA 

AGAACTTAGTACAAGGGCTGGAAGGATGGCATAACTCAGTAGTAGTCTA^ 
AAAATACGAGGATTATCTTCACITTGAAGGCCAGC 
CTGGTGGTGTACGCCTITTAATC CAAG CACT CGGGAGGCAGAGG 
AGTTCCAC^ATTGGGTGAGACGGCGAGCACTTTTAATCTTAG 

2 5 GGCCTGGTCTACTTAACAAGTTCCAAGATAGAATTATATAA 

TTAATCAATGAAGTAAAGGTTCTTGCCACCCAACAGTGTAAGCCTC^ 

AGGAGAG AGT ACTGTCTCTACAAAGTTGCCCCATG AC CT C CAT ACTCTGT AG CAAG CACACTGAT ACAGATCAGTCACACTCTTGG 

GTTTGTTTGTTTGTTTGGATTTTTACCAACAAAGCGTTTGTGGGTGTTTT 

TTTCTCTOTCTATGCCTCGCrATCTTGGAACITGCTCTGTA 

3 0 TGAGTACGTACTGGAATAAAAGCTCTGTGCCACCACCACCTATACAGTCAATATACTTT 

CTAGCTGGTGTGTTTATACACATAC^CAGCrTCTCAAACTCGa 

ATTTGTTTTTTATGTGAACrTCCCCTGGGGGAAAAAAGGAAAAACAA^ 
CAGGCTCTCAAGTAGATGTTTAATTAAAGCACACAC^ 

3 5 CATGGACCAGAATGTAGCGACCCAGAGCTGAGGGTGGGCCCAGACTTTCCAATTTCACTGTC 

GCCAGCCTCTTCTGTACTCAAACCAAACAGGACAGAATGATTGATTAAATGGATCAGT 
AAAGCCGGGGGTGGGGGTGGAG TTTTTTTTTTTTTTA A 

GACACACCAGAAGAGGGCATCAGATCTCATTATGGATGGTTGTGAGCCACCATC 
AAGAGCAGTCAGTGCTCTTAACCACTGAGCCATCTCTCCAGCCCGGGGGTGGAC^TTTT 

4 0 AACAGATCTCTGAGTTTGGGGCCAGCCTGGGCTACACAGAGAAACCCT 

GAAAGGAAGAAAGGAAGGAAAAAAAAAGAAAAGAAAGAAAGAAAAATTATTGTAGCT 
TCACTTGGGGAC^CAGAGACAGGTCAATATGTATAAGGTTGAGGCCATCCTAATCAACATTAAG 
GGACCCTCTCAATCATAAATCAATCAGTCATTCTCAGGCCAGCAGG 
TATAAGTGAAAACACTTTAATTAATATGGTAAAAATTAAAGAAAAATTACT 

TAGAACTTAGATTTAATCATTGTTTGGCTTAAAAATA 
TAGTTGGTTTTCAGCCTGrrCTrrCATGCAACCCCTrGTGCT 

GGGAGTCAGCCTTGGCTTCCAGGCrrcCTCACreCTGTACC^ CAGTGTAGTTTCTCCCGGG CCTG CCCAG AGTTCTATGGGG 

TCTGCATGTCITAGTTAGGTTTAGGTGTTGTGAGTTGTGTCTGTTTTGGCCT 

5 0 TTAGAGAAGAATTAAGGAACAAGACAGTTTGTAGAGGCTTCTT^ 

AAACTAATTTCCATTTGTTATTTCATAGGATGCTCGTAAAGC 

TTCAGCTGTCCGTAATGCGCACTCTGCCGCGGGCTTGTATCATGTGGGGTGAGCCCTTTT 

CAC^CTTTTCATTTGGAGGTTGCTTTGCTGATGG^ 

GTTTGACCCAACAATTTAATCAAACTGATTTTAAAACATGTAC 

5 5 CCATCCCAGAGCTCACAGGGTAGAAGGAGATTCAGAGGTGGTTCTCT^ 

TACAGTTAAACCGTGGGCATCCAGCTCCTGCTGTAGTGTATCTACTG CTGTGAGAAACACTTOWUVCTCT71AGGGAGG AAAGCATT
TATATCGGTTTATACTTCTAGGTAATAATATATCATTGG^ 
CTCAAACAGCCAAAGATAATCTAATCTGTTTAGAAGTTGTCTTGTTCAG^ 
ACAGTGCTCCATACATAGGCCCACAGAGGAATCTGATCTAGGCAGTGTTT^ 

6 0 AGTTAACAATAAAAACTAGCAACACACACACACCTATACTCACACCTACT 

GACATAGATTTATTTTCCCTGTGTATCCATGGCAGTGGTAAAGTGAGTATGTTAGGT^ 
AGCAGTAATAAAATGGTAGCTATTCTATGAAATGGTAGTTACATGTGATAT 

TGCGACAGGGGATGACAGTGCCTATATCCTTGATAAGAAGAAGCGAGGTACTGAGCCATGTAAAAAGCAT 
TGCTCATAAACATGCTGTATACTTAAAGCATC^ACTCACTC^AC^ 

6 5 GTGCTACTTAAGGGAGCAATGTGCAGCCGGGCAGTGGTGGCGCACTCCT 

TCTGAGTTCGAGGCCAGCCTGGTCTATAGAGTGAGTTCCAGGACAGCC^ 
GGGGGGGGGGGAAGCAATGTGCAATTTAAAGTTTATGAATTGGCTG 

T CT AGAGG ACTGGG ATTCTG TTTC CAG CACT CTGTAGTGGTTCACAACTGTCTAT AAAT CTTCTG G T CT CCT AGGG CTTCTG CATG 
CT AAAAAT ATT CTCT C ACAT AAAT AT AGG ATTTT CCATTT AAT AT ATTTTTGG rT (-TTGGTTGAT ATTAT ATTACAG AG C CCACAG 

7 0 AACGCAAAC CAAG AGT CTATTTGCTAAAATT AAT CTT CACT AATTTTATTTTTT AAG ATCACAAACAATATTGATC CAGTGGG 

AATCCAAATG CGGACC AGG AGAACACTG AGGGGG CAT CTGGCAAAG ATTT ATG CCATG CACTGGGG CACAGACTCAAGGT ATG AAT 
GGGGCCTTCCTAAAGCAGTACTTCAAATGTTGCCTATCTGCT 

GTGATAGCACACTTTGAATCTCTGTACTCAGGAGACAGAAGCAGACAAATCTTTGTG^ 
GAGTTCCAGGCTAGCCAAC^CTACAATGAAATCCATCTCAAAACCAGTAGGAT 
75 CTGTATCCACGTCTCATGTTCACAGGTCATACTCAGATTTCCGTTTAGC^ 
TG L1 1 ilC TTCACCCTATACCTTTTAGCTGTTGGGTCTCTGlTCTCTTGGCTACrCTGTGGGCTC

GAAGGTAAAAGGTACTGAGTTTACAACAGCAGACTGTGGTCCCAAAAACT 

GAAGCCTTATGGAAAATGAACGAATGGCTATCCTTATCATCCTTCCACATG 

AG AT ACT ACCATTTGGGGGTTCCTTTCTTTT AT ACTTT GAT AACT AC C CTTT ATTGGACTG ACT CTCT CCTGACTTTTC CACT CAT 
5 AACGAAATAAAGTGCGACATGGTTTGTGTTCTCCTTCCAGACCTAAAGGAAT 
TCTGCTTAGCTTTCTCTTTGTGAAATTAT 
TCCCATACAATTGCTTGTGCAAATGAGCX3CTA 

ATTTGTAGATTTATTGATTGATTGTTTTTTCAGTGCCG^ 
1 0 AGGTGCACCTCTAGTCCAAGAATAGTTACCATATGGTCCTAAAAATGTGAGATAATC 

GT AAT AT AAACATTAAACAACTTACAGTCCAT AGTTACTG TC^ CAT AAAAAAAAAAC CT AAG CAAAG TTAG 

AAAATGGACAAACAATGGTGGCA CTTTTT 111 11111 rTTTTTTTT 1 TTTI 1 1 IIAAACTACAGCCCTTGGAATTAAAGCAATATA 

GOVAGTTCTAGGACAGCTTGGGCTACACGAAACCCrGTCTTAAAAAG 

GAGCACCCGACTGCTCTTCCAAAGGTCCGGAGTTCAAATCCCAGCA^ 
1 5 GCC CT CTT CTGGGGTGTCTG AAGACAG CT ACAGTGT ACTT AC AT AT AAT AAAT AAAT AAAT CTTTAAAAAAAGAAAAACAAATAAA 

CCTTGCAAAGCATTATTTAC^ATTGCTTTAAATC^CAT ACTTT 

TTTAAGGAGTTGTTGTGG<5TATGCATACCCCTCTACCCCCTTACATCATTTT 

GT CT ACAT AGTGCGTTTTAG ACCAAC CACGT AG CACAAACTTGAAGACCT AGGTTT CGAT CT CT AGG ACACATG T AAAACTGATCA 
2 0 TGGCCAGATACTTAGAACCCCACTGCrrCAGTCTGCT 

GCTTCCGGTTCAGTGTGAGACTGCTTGAGAGGAATAAGGCAGAGAGTCTA^ 
TAGGAACACACACACACACACACACACACACACACTAAAATTCAGCT 
G^aAGGTCAGGGCATTATGACCAAGGAATCAGTTCTAC^ 
GGGGGGGGGGGGTCTTTCAGGAAATGTTTCCCTGTGCATCCT 

2 5 AGATCCTCCTGCCTCTACCTCCTGAATGCTGCTGGAATTAAATGGTTGCA^ 

GATAGGCACCTTTACCTGAGCCTTCTCTTTAGCCCTAAAATTTAACT^ 

GTTTTTAATCCCCTCTCTACAGAGCCTGATGAAGTAGTCTCTTTG^ 

GTCTATCAGTGCCTCCTTTTGTTCTAGGCAGCT 

TTTAGACAATGAACCGTGTTGATTCCTCCAAGTTACATGACCTGAGATTAGAAAGGCG 

3 0 CTAACAGCTTAGCATTGTCTTAATTCCTCACTGAGTTAGCAATTATTGAGCTCT 

TACTAAAGATGTCTCTTGATACCCTTGCTCAGAGTCAAA 
CTATAGAAATGGTTGAAAGGACCTTGGGTATAATGACTGAACTGCAAT^ 
TTGGCTACACAGTACCCTATCTCCAAGGAAAGTAAAAGAATCCT 
TTCTCAAGGCTCCAGAGTTTAGTCCCTAGAACCTCATAAAGAAAAAAT^ 

3 5 CAATTrACTGTGAATTCAATCTCTAGAACTGAAAAAGAA^ 

TGATAAATTTTGTGACTAAGGATGGAAAAGGATTTTATTTCT^ 

AGAACATAGCTTGGCAGGGTATGATGTAGTACAG CTTT AATGCAAGCACTTTGGAGG CAGAGACAAGTAGACCTCTGTGAGTTTGA 

GGTCATCCCGGTCTACATAATAAGTTCCAGGACAGTCAAAGCTACATAATAGAGGAACCCTGT 

CATACAAGCATGGTCTGATTAAAAGTGTCAGAGGGCTC^ 

4 0 GTTCAAATATGCCCAAATATACTTGTGGCTGAAATACCTAGAATC 

ACTATC^CATGGTAGACCTGGGAGAGCCTCTCACCCTTCCTTAAGAGCACA 

TCACTCAGTTACTTCCCAGATAACTGAGATTCCTTrGCrrc 

TGCAGAGAACTGCAGGTTCCAAACTAGGACACrAGTCATGAGCAATTCATAAT 

GTG CAATT CTG ATGT CATTT AT CAT CTC ACT ACC ACACAGTTTT AT AG AG ACAG^T ATTCT AATTT ATCTCT 

4 5 AGAGTTCGT1TTTCTTTTAATTTTGAGGGATAGAGTCTGGGCCTTAAACTTGTTA 

TTTATGAGTTACAACGTATGCTACCATTtKSAGCTCATCGTTGTGTTCT 

AG CAACTTGAAG AAG ATTTTGTTTG AGTTT ATAG TTGT AGTCT AT CATGAAAGG AAATCAGC AAAGAC CGCAGAGGAAT AC CG CTT 
ACTAGCTrrCTCTTTAGGGCTTACTTAGCCTGCCATCC^ 

5 0 TCACATCAATTATTAATCAAGGACATACCCATAC^TTTACCTATGTGCCAATAAGATGGA 

TTCCTTAGAGGACTCTAGCTTCTGTCCTATCCCACCCCACCCCCAATATAAATAAAAACAC^ 
TGGTTCCTGAGTGGGTAGTGTTTCAAGC7ITTTCCCATAATATTCAAGCTTTCTTC 
ATTTGACATGTCTCATGTCTXTGTGTATCTAATTTAACITGGAATAAA 
TCTCGGGTGATAAGAGAAGAGCTGGGGAGTCTGGTCATCCAC^TGTCCCCAGGGCTGAGT^ 

5 5 TTGCTCAAGCATTGAGATCCTCTTTCTITATTATTA 

AGATTTTTTGTTTCTGTTTTTTCTTTTCCTT^ 
GGGACAGTTATACCACAAACAAGGTGGGCAGTGC^AAGAC^ 

GAGGGTGATAGAAGAGCTTGTGTTGCTGACCCATTGAGCTTGGCACAC^GTTGTGCT 

G CTGAGT G CTTC CTTG TCCTTGCACAAGAG CTACACT CCACT C CT AG TTTTG ACTGG AAACTTTC CT AATTT ATT ACTAGAAT ACT 

6 0 GAAATGCAGATTCCATTTAGTTAGTAGTTTGTTTTGTAGTAACAAGAGATAATCT 

GATTGGGAATGTAGCTCAGTGGTAGAATACTTGTCTACTGGGC^TAATTCC 

ACCAATGTAACCCAACAGATTGGTAAAAGTCTCTGATTCTA 

CAAACCTGCCATGGCTAGATAGGGGAAAGGAGAAGACAGAGAT^ 

AATTTCCACTTCTCTTATATTGAAATAAATTACACTGGGTATGGTGGTACATGCTCATAATCT 

6 5 GCCTCAAGCATTAAAGATTATGCTTACCCACATAATAAAGTGAAACCAGCCTGAACTACATGAGACCT 

TTCACAAAAAACCAAACAAACGGGGCTCGCGGGATGGCTCAGTGG 

TCCCAGCAACCATATGGTGGCTCACAAC CATCCGTAATGAAATCTGATGGC CTCTTCTtSGAGTATCTGAGGATAGCTACAGTGTAC 

TTAC^TATAAATAAAAAAAAAAAAATTAAAAAAAAATTTAAAAAAACAAACAAA 

TGGCCAGTCTATAATTCTAGCATTTGGAAGACAO&GACAAG^ 

7 0 AGACAGTCCTGAGGAACACTGTCTCACAGAAGAAGGTGGAGAGCAGTCAGGGGA 

AAGCACACATGCGCATAAAGTACC^3GGAGCCGAGCCAGGCGTGGTGATACACGACTT^ 
GTAGATCTCTGTGAGTTAGGGGGCCACCCTGTCTATAAAGTGAGTTCCAAGACAGCCAGAG 
GAACGAAAAGGTGCAGGGCTGGAAAGATCGCTCAGTCACTGCCTGATTTTTCC^ 
TAGTGGTTTGCAGCCTTCTGTGTGTCAGTTAAAGGGGAACC 
7 5 TCAACAGTTAATCAAATGTTCCTACAGATAACATGAAATTAAATCT 
ATTGGACAGTTGTATTTTCCCTCAAAATTAGT^

ACTGTGTT AC C AATGTG CAGTT AGGTTATTG <^TG TGTGTG TCAGAG AGAATG AATATGAATGAGCTCTGC CT GAG CTTCT"TT ATT 
G CTGGG AT CGG CATGAATCACCATG C CTGG CTGTTTTTT ATGTG CTTGTCrTrGTTTTTTTTT AAG CTCAT AG C CATGAT AAATTA 
AAGGGGAAAGT AAATCTTTTCAAAT CCT ATT AACATCACTC CTT AAATCACT AAAATGCAAAT ATAATCTGTGTTCT CTTTG AG AT 
5 AGTTTrCCATTATGTATAGCTOVAAATATCCTCAAACTTC 

TCAAGTCTCAGAACCAAAAAAGACAGACACGACAAAAAGCCCTCTTTT^ 
TATGCCATCACACCTATCTACrCATTAGGGTTTTTTGGTT^ 

GTTTTTGAG ACAGTTTT ACT ATGG AG CCCTGGCTGGC CTCAAATT CAGAG ACTCACCT ACCTTTGC CTC CTG AGTGCTGGGG CT AC 
AAC^TAAACTATACACCCAGCTCTAGTCATATTCTCCTG 
1 0 CATCCCTCTGCGCTCCTCTTGGGTCATGACCTGCGCATACGCTCCCT 
GCTCCATTTACAACCTGAAAACTCGTCAAGGGAATCT 
CAGTGGTGTAAGCTGCTAACTGGGATGTATACTGCTCITGTGATCT^ 

AAATTTTCTG AAAGTG AAGCCAAACCAGGG C CAATATG CCTTCATCTTT AAG AAACATGGAAACTGG TTTTTr AGCCAG TTTTCGA 
ATTCCCAGCTGCITCAGCCATTCTTGTTTATCAACGAATTGATGT^ 
1 5 GCACAGGAATTCTCCTAGTCTAAGCAAGTGTCCATATGCACAGC^ 

GCCGAAGAGAGTCTGTGGTCCATTCACCTTTTGAATCTAACCAGCATTAA 
CATTTAGGACACATTGCCTTTCCAAGGAAGAAAGTCTCATAGTA^ 

T CGCATG T AAG CAAAGG ATTG TT CAT AACTAACTGTCATGAAG AACATG GCAGTTTTCTGTTTT AACC 
TGCATCCTTTGGTGCTTCITAAGTTACATTGCCAGTC 
2 0 GGAAGCATAAGCCACCAGTTCTAGATCTCTTCTCTCTCAAATA 

CACGCCTTTAATCCCAGCACTCGGGAGGAGGTAGAjGGCAGGAGGATTTCT^ 

AGGATAGCCG AGACTATACAGAGAAG CCCrGTCTTGAAGGGGAAAGAAAAAAAAAAGG CAAAATCCATACAGGCATGGCTTCCTGT 

AAATTTTGATC^TAGTATTTAAATTACTCT^ 

TCTAGGTCTATCCCAGCACCATAAAACTGGACATCATGATAAT 

2 5 ATTCAGTGAGTTTAAGGGCAGTCTGGGCTATATGAAACCCTGTCTTAAAAAAGAAGCATGGTATA^ 

AAAGAAAACAATCGGGGAGAAGTCTCGGTGACTAAGAACAGCCACTGCATAAGCACAAGGACCT 

AG CAGGCCTGG OGTGTGAT AATTT CATCATTG TAGG CTGACT C CAG C CT CG TT C AGTGAGAGGCCCAGT CT CAACGGACT AT AG CA 
AACAGTG AT AG AG CAGGAT ATTCAATGT CTGGTGTC CT CTG TT AGT CT CTG G AT AGGCAT AGACCACAT ACACAAAT ACAAGT AAG 
ACGGATATCTAAAAATCCITGGGGGGGGGGGGGTCCTGTTGAATTT 

3 0 AGAGACTATGCCTGTCTTCAGACATAGCAGAAGAC^GCATCAGAT^ 

ATTGAACTCAGGACCTCTGGAAGAGCAGTCATTCCTCTTAACCGCTGAGCCATCT
ATTTGGAGAjGATGGTrAAAAACACTGGCTGCTCrTT 
TATATGTAATTCTAGTCCCAGGGAATTTCATGCCTC 
GCAGTCCTGGTTCTACTGCTGTGATGTGACATCATGACCACAGCAA 

3 5 gctcagaggtttagttcattattgtcatgggaggaagcatggcaacatgcagacaga 

tatttggattcccagacagcaggagagagtgaczactggcctgtcitgagc^ 

aacaaagccgtttcaacaaggaggtgttcctaatagcaccattccct 

aggcagaaaattcatatacattaaaaattttc<;ggctggagagatggcttag 

TGAGTTCAAATCCCAGCAACCACATGGTGGCTCACA^ 

4 0 ACAGTGTACTTACATATAATAAATAAATAAATCTTTAAAAAAATTTTTTTTO 

TTAAATTTGAACTGGT<X5TCTACCAAGACATCATGCTATA 

GTCTTT CCATCAG AAGG AAAGAATCTACATCTG TGT ATTTCAGGTG TTG CATG T ACATTT ATTGCTG CTGTCTG TATCTC CTTG AA 
TGATCGGTTGTTTT ATCIATGGCTTAGTATCC CCTCTCATAT AAT ACTTTACTC 
CAGCCTCAATGCTCACTGTCTGCATGACATTTITTCTTGTTGAT^ 

4 5 TCCTCACCTCTGTCTAAAGTCTCTGCCTCTAGGAGCTGGGAACGTAG 

TGAATGAAGCTCTAGGTTTGATCTTCAGCCCAGCATAAAACCAGATCTGGTGATGTAGCC 
GAAGCTGGGAGTTGAAGGTAATCC1TTACTGTGTAGTGAGTTTGAAACTGCCT 

GTTGGGGCAAGTCCTGTACCTTGTAAATACTTTACATATAGTTTCTTCTTGACITATAATGAGGTTATA 

AT C AAT CAATCAGT CTCT CTGT CT CTCTGTCTCTCTG TCTGTCTGT CT AT CTG T CT ATCT AT CT ATCT ATCT AT CT ATCT ATCT AT 
50 CTATCTATCTATCTATCTATCTATCTATCTATCTAGGGTTTCTCTATATAGCCCTC^CTATCCTAGAACTCACTCT 
CTGGCCTTGAACTCAGAAATCrGCCTCCCAAGTGCTGGGTATCCAGAAATCTCTTTAATGTT 
ITGACTCTGTCTAGCCTGCCATCCCACATCTTCCTCATCTTATATTGACCAGGCrGG 

G TG CTCAG ATTGAAGTGTG TG CT GCTTCG ATGGGTGT ATAAACT CTTTGT AG C CT AGG ACATTTG AGGCTG CTG TGTT ACATTCTT 
ACAG TCCTGACAG AGAG CTTCAG TTCATTGCTGAT AACC AG T AC C CTGGG CAG ATT ATACAT ACCACTT ATTGATAGGTTGAAAGA 

5 5 ATTG CATTT AAAATTTG AAG TTCATTTGCACTAAAAAAAT CCT 

TCTCTCTCCCTCTGCTTTCTrTTCTGCTGTl 
CriTCTTTCTCTCTi^iCT 
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTrCTCTCTCTCr^^ 

TG CTCCAGCCTCTGCATTCTITTCCCTCTGCCTCCCCAGTG CTGTAATT ACAAGTGTGC^CCACCATGCCTGGATTCAACATTGTG 

6 0 TTTGAAGAGCTGTATAGGCTCACATGC^CTCAGACAAACAACGT^ 

CTTTTTCTGGTTTCCTGTATGCCAAAAAGCTACTCCCTCATAACACC 
TTTTTAATGTCTGTATGTTTGTTCAATGAAATTTAATTTACAAATTAAAGAAA^ 

AATATAGCAATACACTATAATTAAG'ri' , n , TG , riTTriTl 1 1" T'l 1' T G A CTCATGGCACTGAG AATTTT ATTT ATTT ATTT ATTTATT 
T ATCT ATT ATTTTTT AAAGIATTT ATTTATTT ATT AT ATGT AAGT ACACTG TAG CTGTCTT CAGACACTC CAG AAGAGGG CAT CAGA 

6 5 TCTCGCTACGGATGGCTGTGAGCCACCATGTGGCTGCTGGGA 

CCG AGCCAT CT CACCAG AC CT ATTTTTTATTTTCT AAGG AACTATCTGCTTCT 

CCAGATGAGGCTGCAGTGGCTAGAGTAAACTGCTTGTGAGAAGTCTTGA , V , i ,, r , I ,, l i'Tl'lTTT'l' ITTCTTTTTTGAAAAGATCTGTTT 

TATACTAGTAGTACTAGTTGCTGTTGTTGCTGACAGGGTCTCTCTGTGTAGCCCTGA 

CTGACCTCAAACTCACAGAGATCCATCTGTCICTGTCACCACCTTTGA 

7 0 GTCAAACATGAGGAGATOIAGGATGATTCTATAGAGTTAGCTCT 

GGAAGCTCCTCTACCCACTGACTCATCATGCCAGTCTCTGACTCT 

GGACAACITGTGGG AAGCTGG CTTCTCTCCTC CACt^TCT CAAATGGCAG CAAG CAC CT 

GT ACACATGGTC CCATCTTG CTGG C CCCTGACCTCT ACTC CTIX3 AAGG AT AGTTTTGCTC 
TACAGCTGGTGTCACCCTCATGTGTATCTGTGTTTCCCCTGCATAAAT^ 
7 5 AGAGTGAGGGTACTGGGTTT CTCCTGGAACTGGAG" 



^GTTAAT AC TT ACAACCCATCG TG TTGC TC CAATG AATTG AATCTGJ* JGTCTTC 
TAGAAAAGTAGCCACTGTTCTTAACCACTGAGCCATCTT
GATTTTTGCAGTTTGATGGTGATATTCCTAGACGTCTAA 
TCTTTTGCTAAATCGTAAGGTTTGAGCTGTGTCTTCCAATGCCCTGGTTTCT 
CCTCACGTGTTTTGTTTCTATTCAAGTCAATTCAGTTGACTTTGGTCACT 
5 GCAG CATTTGAAGGAAGCTCAGCTGG AGT AG ACGG CAAGGGCACA CTGGGG AGG CATGGAG AGTG CTCACATAAGTG TC 

CAC^TTTGTGTCACATGGTCTCTTCAGATTCAGAATTTCTC 
GTCTTCTTATTTATAAACAAGATTATTTCrCATGTTTCTGAATATAAT^ 
TCTGATCTTGAGGTATATCTITGTGGTCCTTTCCTCTTGAGTGTGGAT 
AATATAATTGTCATAGCAACCGTGGGCTCGTTATGTTCCT^ 
1 0 CAGCAAAACTCTTACTTGTCCTCCTGTCACTGGAGCTC 

GTGCCTGTCTGTGTAAGTTAGGACTGTTGGAGGCTGCTTTAGCACTTCT 

TGTAAGACAAAACCATTGAAGTCCGTTGAAGAACAGGCGGTTGTCCATTTACGAGTT^ 

AGAATGTATAGGCTATCTTATAGGATAAGGAAACCCTAGAAATGGATGTAATTGGGTTCACAATAACTGA 

ACTATTTATATTATTATTTATAATTTATAGGTAGAATGTATATTC'riT T T'l III I I AATTCAAAATAATTAGCCAGGTGCTGTGGC 
1 5 ATACTCCTTTAGTTC^CACTCAGAAGGCAGAAGCACAGGACT 

CAGCCAGGGCTACAAAGAGAAACCCTGTCATAAACAAACAAACAAATAAGACAGGCA 

GCGTGTACGTGCA dTl"n"l"rC'l"n"l TT I T T 1' 11 ri 'GAAAATGTGACTGTTGCCTAGATACTAi i 1 l i AAGGCGACTGGAACGTAG 
GCACAGATTGGTCTCAATGCTAATCCCTAGGGACTGATGAAGAAAG^ 
AGTTCTGGTCCCGAGGGCCATGATTTGCCTGCCCTTCTGAT 
2 0 AGCTAGAACATTCTGGTGTATAA'rri T TTTCATCTAATGAATAATTGGGTGTTTTA 
GG ^ T TTTTTTTT QTTTTTTGQGGXX^TGTTTGTTGGTTTO 

GAT AT AG AAAAAAAGATTCTGTCTGT CTG CAAAGT CAG CTGTG AAACA.C AGAGCTGGT CAG TTACTT ACTTCTGTT ACTGT CGT CT 

TGGTGTCCCCTCCCTCCATGTCACTCCAGCAAAAAGGAAGTGCTACTTTCAGGGCT 

TGCACAGAACCCATACATATCAGGGAAGTTCTCAGTGTC 

2 5 TATGTAAAGTACACTGTAGCTGTCTTGAGCTAAGACAGGATCACTGCTACAGGATCACTGTAGCT 

GGGCGGGTGGTCGTGAGCCACCATGTGGTTGCTGGGATTTGAACTTCGG^ 
GCCATCTCACCAGCCCCCATTGTGTGAAATTTTATGTAACCATTTATAATTTC 
CAGAACTATGTGACTTTCAGATATCAGTCnTTCAGTTACT 
CGCTGACCCCTGTTCCATCCACAGCTTCTAAGAAATCTTTCCAGTGGGTGTCAG 

3 0 TCACAGCTCTTGAGAAAGCAAGCGCAGTCACCCGTATTTTTGG 

CAATAAGTAGATGAGCACACAAATGAGGATAAAAGGAAATGGAAAGACACAGGTGGAA 
CTGCTATGTGCTGTGCAGGTGTGGGTCCGTGTGAATCATGAACCCATTGAGTCTGCCrA 

TCTtSGGCATTTTT ATT AACGTGGTTTTCACAGGGAGAGGCTCCT CTAGTCTTGGTT ATTT ATTTGTGT CTGC 

3 5 TCTTTTTCAGGTTATCTGTCCTGTTG CCGGTTC CTGG^ CTGGAG ACACCACATGGTG AG T ACC 

TGACAAGGCAGCAATTGAGGCTTGAGGTCTGTCTACTGTTCTT 

GCTTCTTGATCATAACCTGACAATTAGAAAACAAAGTATCAATGAGTAACTCATTGCATCATCTGT 

TGAACAAAGAACTAAAATGTAGATATTTAAACrATAGTTGTATATCAAGTACCTGGTTTTTTCT 

CTGGTGTTTTTAGCCCTGTGGGGTTCTTTTTCACTCT 

4 0 GCTCTGTCAGATTTGTTTATTTTAGAATGTTTGCTTACTAGTAGAT 

AAATGACCAGCTGGTTATTTTAAGAGAATCATTTTAAACTTAAGCTGCCCTTGTGG 
AGGTCAGTAGCAATATTTTAAATITGATTTAGTAGCATGGCTTACAGGCTGGT 

AGCCAGGCGTGGTGGCGCACGCCTTTAATC CCAGCACTCGGGAGGCAG AGG CAGGCAG ATTTCTGAGTTTGAGGCCAGC CTGGTCT 
ACAGAGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCTAAAAACAAACAAA 
4 5 GTAGTCAGGTGGACAGTGAAGATTAGTCATTACAAATTTCAAATTGCATGT^ 



TGGTGTTGCTGGGTGAGACAAGCTGGTGAGGTTGACAAGAATGGAC^GTGTATCATG 

TAGCCTGACTGCCATTTCCATAGCCAGCTCCCGCTTTATGTAATGCTGATGATATCGTGGCTTTTAGCCACTCTC 
CCCTTGGAAAACCTTGTACATCCATGAAGACCAT 
5 0 GCCACTTATTTACATTCCTAGTGTCTAGGAAGGTGGAAAATGAGGGGATGGAAAGAATAACACAG 
CAGACCTTAGCATTTTCTGCCATCCCACAAACTGCCCATGATTATCTTGTGTGAAGGACA 

AACCTGGAAACAG CAGGC CCAT ATG CT CAT C CAT ATGGTTG TTT ATTG TTTTCATT'lTCri 1' 1'CCTTAGCAAAATCTCATCACAGA 
AGTAAATGTGTCTTCCCCGTGGCACAGAAGTCAT CTGTGATTCTGGGTAGT ATAGAGGATGTTGTTCTAG^TG ATTGGT CCCATGT 
GCGTGCCTGATACCTACTTTTCATTrCTTCCTGCACCAGTGCCCTGTGG 

5 5 GACACACTGGAGATGTCATGAGCCTGTCTCTTGCTCCTGACACCAGACTGTTTGTCTCTGG 

TGGGATGTCCGAGAAGGGATGTGCCGGCAGACCTTTACAGGACACGAGTCT 

GCTTTAGATCTCAACCTTTGTTGTTGTTCTTGAAGATTTGGG'l'T'r 

GTTGCCGGAACAAAAACACCATGATCAAGGCAACCTAAAGAGAAAAAACATTTCATTG^ 

TGT CCATCACTG TGAGG AGCCTGG CTG CAAG C AGG C AAG CATG GTG CT AAAG AAGT AACTG AGAG CTTCATCTTG AG AT AT AAAT A 

6 0 ATCACAAGGCAGAGAGTGAGTGAGCGAAAACTTGGAATGGCATGGCCTTTTGAAA 

ATACCTCCCAAACAGTTCCACGAACTAGGAAATGAAGTGTTCAAACTTGA 

TG C CT GT CTGTG TATCTTGTATCTGGGTGTGTG CATG CGT ATT AGAG CCAAGT AGG CTG TT AGG AG ACACCTC 
AACCAAACTTGG TTG TAG T AAATG G CTTTT AAAG TACCTTT AAAG T AC CACT AC CTTT AAAAACATTTTTTTT CAA^ 
AAAATTTGTGCTCTGGTT'lTTTGTTTGTTTGTTTra 

6 5 TTACCTAGAACTGGAAGTCTCTCTAGCACTAAGATCAC 

ACCAGAGTCAGAGTGTTTTAGAAGCAGCTAAAACCCTGTGCTGTGCATATA 
AACTCTTCCTTTTTTTTAAGTGACCAGTAATTTAGAGGCTGAATATCTTGCC^ 
TTGCTAGCACCTCCTGAAGAGATGAGTCTGGTGACTAGAGCAGTGACT^ 
CCAGAGGACAGAAGCCCTTCAGCAGTGCCrCAGTCTCATTGCTTTT 

7 0 GTACTATGTCAGTACCCCCCTAGTACATCCCCCATTCACT 

TTTC C ATGG CAG C CATG CCTCTGCATTAACAT CATG TCTTGTTGAACATTTACTTTT ATG TT AGT AT AT AT AATT CAG CTTG ACCT 
TGAATTCACTATGTGCCTGAAGGCTGTGTACTCTATTC 

CTGAACTATGGGCCTTACCCACGCCAC^CAAGTGTTGAATCAGCTGGCCTGTGC 
TGTCTTCTCITTTTAGTTCTrTCCCAATGGCA 
7 5 CAGACCAGGAGCTCATGACCTACTCCCATGACAACATTATCTGTGGTATCAC^ 
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CTTG CTGGGT ATGATG ACTTCIAACTGT AATG TCTGGG ATG CACTCAAAG CTGAC AG AG CAGGTCAGTGG CCACATGG C CTTTCTGT 
GTTTTGGGGGCACCAGCTTGCTTCTTTCTATGTGGTAAAATGAGGTT 

GTTATGTCATAGCTCAGCAGAG<3GG ACAGTTAAAGGCCACATAATTTG AG CTCTGGCACTCCAGGAAGT ACCAAAAG CTCTCCTGT 
GAGGAGAATGTCGGAGTTGATAGTATGCCAGATACCACTGAGTTCAATGTTC 
5 TAGCCTAAAACACTCTATTAGACCAGATGTCCTTGAATTTAAAAGTCTTCTG 
TACCATCAATTTTCAATCTTTCTGACTCATTCTCTGTAGTCACT 
TGTCTGAGCCTATCTATACAGCTTTGGCATCTGATGCCATGCCCCTTCGTCTC 

T AATGATTCAATTT AAAG C CTG AATTT AAT ATGT CAGAAT CAAG TGG ATT AAAT ACAAGCTGG ACAAAAGATT AGAAAG TTTTTGT 
CTCAAGAGTCTCCCTTTGAGCrrCAAGCTGGCCTTAAG 
1 0 GGAGGGAGGGTTGTGACCATGAATCTCGTGGGTCCCACTGTGGCT^ 

TGGCTCACACATTTAATCCCAAGTCTCAGAAGCATGCAAATTCTGTGAGTTGTAGGATA^ 

CAACATTTCTGCCTTTGCCATTGTAGCCCAGGAAGTTGACTCTGAGTTTAA 

CAATAGCTACAGGACTAAATGTTTCTGAAGTTTCAAACCCTrCTGTO 

ATGGGTCCATGATTTCAACTGTAGGCTCTGTGTACCCTGTGGTTAGTTGTTCTGAGTTTACTATTCA 
1 5 CATTATGTTTTGTTTTGTTTTTTTTAGAGATTTATTTATTGTAGC^ 
AGGCACAAGCAGGCAGATCTGAGTTTGAGGCCAGCCAGGTTTAC 
TGGTGGATGGGGTGGATTTATTTATTTTATCAATATGAGTGTTTTGC 

TCTGGAACTAGAGTT ACAGATGTTTGTGAG ACGCCAT ATGGGTG CTGGAAACCAAG CTCAG ATC CTGAGACACAGATGG ATGTTT A 
TTCATTATTTACTTGTTTGTTCATTTGTATTTTCT 

Tri'lTGTrGGTGTACAAAAATTATCCCAAGGGGGTCTGTGTTTATGTCCTTGTT 
CTCTCTTAACAGAATTGCCCATACACCTAAGGATCGTTTCTACTTGGATAAGAACCTTCT 
TCTTAAGTGCTTGTTTTACCAAAGTCTTCTAGCGTTC 
GTTGTGCTCTTTTTGTTCTCAGGTGTCTTAGCTGGACACGACAACCGAGTCAGCTC 

2 5 GGCAACAGGGTCCTGGGACAGCTTCCTCAAGATCTGGAACTAAAG 

TTTCCTATATGGGTTATAGATTTACAGAGATCTTGTTGATCTGGC^ 
GAGAACAGTGGGAAGAGCTCGGGTTCTTTCTGTGTGCACTGACTCCAG^ 
ACTGGAAGACCATTCCAACCTTGAATCGTTACAGCGATAGCATATCCTATCCA^ 
AGAACTTCAAAAGGGGAAGATCTTTTTTCCTTCACTTATTGCT 

3 0 TGTAAACTAAACAAATTGTGCATTTTTTCTGGGGCCAT^ 

TCATTTTGTTTTGGGTTGGTCCAATTTTAAAAGGAAATACT 

GAGAGACACGCCTTTTTACTAGTTCATTTGCATTTAAATCAGTGATCCTGTTCTGTGGCA 

GTTTATCTGGAATTGTTTCTAAATATAAGTCATTTGTAGCAAGACTTTGGAAGCATTCTTTC 

TAGTTAAGAGAACCAGGGCCCACGGCGGTCTCTCACn'TGTCACTGATGT l ' l ' ri ' l ' l" 

3 5 TTTTTTTTTGTCTCATGTCACAGATGACTGTTTCGCACAA 

ACATGTTAGGATGGCCACGAATGCTTGGTCAGAGTCCTTCCTTCCCTTGCTCAGCAGCGGACTTACT 

CCTGTCCCTCCTGTCCCCTCACTGTCCACTTTCTGTTCTTTCCTTGGGTTT 

CACTAG CCGTCCTGC CCCTGCTGTCAGAGATTTTTTTCTCCAGAG AGGTGCCTGTCCT 

GTATG AAACAATG T AC AAAT CAATG TTTTG AAAAT AATG ATCT C AG ACTTTCT AAG TT AAATTTT AAAC AATTTG ATTGTTTG C CA 

4 0 TATTGGGTGGGGTTACTCTTAGAATCGCATGCTGTAGAAATGCTCTAAAGTGCATATAGGACTCAGTCCTT 

TTTAAGAAATAACCTCTTCAAGTTGTAACCGTGGCGGCTTTGTCCATTACTGTTG 
GCTTTACACACTTGAGTAGTGGGATAAGTCACTGTTTTCTT^ 

TGAGGCGACCATTTTTACACCATGCAGATCITCGTGGTTATCATCTCAAAG 

4 5 TGTGGGCCCTCCCCTCCCCCATATCCTGCACCCCCTCATAGCAGTAGTGGCTTCGCCATTCTGTTTTCTGCAGTC 

ACTGTGCTGTGACTGTGCGGTAGGCCTGGACATGGCAAAGAGAATACAGCGAAGCCCCTTCTCTTCCTACCA 
GCTCTCTGCACCCTGTACCCTTTTCACCTTTTGTATTTAATTTTAAAGTCAG 
TATATTAAACTGTACTGTTATTTAAGATGTAATAAAGCAGTTTGACATGAGGGACTCT 
CAGTGTGTGGGGGTTTGCCCCAACTGCTGACTCAACAGTTATATAACGTCATG^ 

5 0 ATGTAGTAGTACACACCTTTAATCACAACACTCAGGAGGCAGGGCCAGATGGATCTCT 

CAAATTTTAGGACTGCCAAGGAATATAGACACTGTCTCAAGGGACGTGGGGAG^ 

AGGTCTGCAGAGCCACATGTTTATCACGACAAGTGGCCTTTCACCCTCAGCACCTGGGCCCTGTCTGCCG 

CCAGCTGCTTCGACTTCCTCTTCCTGCACTAAAATGTCT 

TCCTTCTCTGTCGATAAACX3TCACTGCATCTTGAAGGAGCGAGTAT 

5 5 CCCATTCCACTGCAGTGCCTGCTTCTAAGTTTTATGATAAAAATTTTAATCTTA 

AAT C CGTCTG C CT CTG CCTCCCAAGTGCTGGG ATTAAAGG CGTGTG CCACCAGACCCGG CTTAATC TT AAATTTT AAG ATT AT AAA 

CCAGGGTC^GCGGTGGCCCCTGATACCACACATTGTGGTGTGATGGGCAGGTCCAGCAGATGG 

6 0 TCTC CT AGTCTCCTTGAAAT AAATGT ATTTTT CT^CACAC^CATTTTG CAAGTCTTT ACCT AC CTCAACTTC (ZACTGAG AATG CAA 

AACATTTGAGGTCCTTCTGATCTAGAAGGTGAAGCTCTGACCCTAACT 

ACCTACCTGACAACAAGCAGGCCCCAAATGGACTGGGGTAGATCTGTGTCT 

GCTGCCCACrit jT TTAACAAGCAGTCATGGAA 

TTCACAAATGCTCAGGCTTACACATCTACATCCACCATTAAATACCACT 
6 5 GTTGCTGCCCATGGTGGAAGTACACCTGTTGAAGTGCACACAGGCAGAT 

ATCATTTTCCTCCAGCGTATGGGCTGATGCCATTCCATGACCCCAGGATT^ 



TCrTCTCCTGCCGCTCTCACTGTGCrCCTGTATTCTC^ 

CATCCAAGAGCAGCCAGCAGTCTTTGA.CCTCTGCCrrCTTCAC^ 
70 ATAGTCTGGCAGGAATTCTATGAGAACAAAGTGCCTGGTGTGAGATTACAGCT 

GACATTTCT AAG CACC CTAGGCAAG ACT ACTCTG CTTG CTG T ATC C AAAACGTG AG T AC CTGG CAG CTGTCCCACTGATTTGGCAG 

AAGAAGGGATAAGGTTGCCCCTCACCCACCCCAGTGAAGTGACCCTGATGA 

ACTTTCGGAGAATGGACAAGATAACTAGTATCCCATGACAATCAACTCTCCTCT 

TTTAGGTACTGGACAGCCTTCACAAAGCAGTTTATCCCACCCTGGTCACATGAAA 
7 5 CAAAGAGCTACAAAGCCTCCCTGATGCCTGCGCCTGAG^ 
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GCACAGGCCCATCCTGTTTCCTCCTTGTTCTTCCTCCTCCCTTATCATTTAGCCTTTGAAG 
GAAGOlAAAATGTGTGTAATTGCCTTGATTTCGTCCTGCTACCTCACT 
AGCCCAAATTAAACACTCTAGGAGTCTTTTGAGTACCCTCTTCCC^TTCCAAGC 
CA.GTGGTGGGAATGAGCACTTGGTATCTGAGTCTTCAGAGTTTGA 
5 TCCTCAGAAATGGAGAAGCCACCTGCATAATCTAG^CCTTC 
GCTCTTTTAATTTAAGGCTACACAGGTTACTTG 
GACTGCTCCCACTCTGGATGTCTCTCCAGGTCTGCTGGCCATGTGC 

ACAATAGCAG CATAGTTACAGCCTTCGGTGGGTGTGGCTGGCGTTCACTCTGGGGAAG AAG CATGAGGAA 
GTTACAAAATTTAAGAAATCCCCGCAAATTCCCACCATCCTCCCTACCCACTATTT^ 
1 0 TGGAAGGACAGCACACAGGTCTCAGGAGTTCCCACGAGAG 

GATTGGGCTATACAGGAAGGGCAATAGTGTGAACATCTTAAAACTACAGTG 

ACTGTGCACACTGGTGGATATCAATGCACGAG CATGTCCACCAGTTCAG CATATCCCTGGCTCAGCTACAAAGTCGGTGGCCTCCC 
TCTTTCCCTCCTTACAGCTGTGGGTGAGGCACGCTATTTACACAGTCGCTTCT 
AGGGTCTCTAACTATGGTACCCCTGGCTGGCCTCGAATTCCTAGAAATCCGGAGCTCTC^ 
1 5 TGTATCAACCAGCTGTCAGGTTTGTTACTTTCAGGCT 

GAG AG CCAT ATGAGCGCTGTCACC CtlAGTTCTC TCCATG CGCTGGACATT AAGAACAACGTGGGAACA 

GGAGCCTGGCACGAAGCCCACCTTCCCACAAAGCTGGCCCGCGTTGTCCACATGGTCCTGGAGACCT 

GAGTGTG ACGT AGGACAGG AAT AAAG AT CAGACCG CGAGGCAAGGAGG CAAAAAAAG AAAGAAAG AAAGAAAG AAAG AAAGA 
AAG AAAG AAAG AAAGAAAG AAAGAAAG AAAG AAAG AAAGAAAAAGGAGAGAAT CT CT CTGTTG CTGCTGGG ACTG AG 

2 0 CTTTTTGCCCCTCCrrCGGTTACTCITTTTTTTTT^ 
CCCGACTCCTCCACCCTTTGGCCC 




CCCACCCCCCCGGGCTGTCTTCCGCTCCTTGGCCCCATCCCATTCCGCTATGCCNNNNNNNimNNNNNm 
NNNNCACCCCCCAGCCTCTGATCCrTGGCCTAOVAAGGGTTT^ 

2 5 CGCCGCTCTCCCCACGTGCCCTGCAGCGCATG CGTGCGGGCTGCGCTTGCGCACTGACAG CCG CGGCG CAGCGGGGCTTGGTGGTG 

CCG<3TCTCTGGACCGAGCGGCCATGGGGTGGCGGGCGGCT 

TCGGCCCAGCATGTAGACGTCCACGCGGGCGCGCGGCAGCCGACGGTGAGTGCGGGGCCTGGGGTCGCCCTCTGCCCGGCGTCACT 

GGCCGGGACCCGTGAAGACAGGGTCACGGCGCCGGCCCTGCCCACCTGCCCGAGGCCGGGCGCTGGTG 

GCGTCCGAACGTGGGTTGCAGCCAGGATGACGCX3TGGTCTCGGGGAAGGAGCCTAAG 

ACCGCAGCCGAATCGGCTCCATTGGTCTTCCAGTGGGGCCTTAGCAAGCGCCA^ 
AGCGTTTTGTAAGCAATCGAGGTCTGAAACTCTCCAAAACCCCATCGTAAACTTC 
CCTGAAGAGTCCATGGAGATGTTCCTAGAGGCTGCTGCCTCTCAGGATGTCACACTCACAGGCCGCT^ 
TTTCTTCGTTATGGGAACAGGGCGTGCCCAGCGCAGTCCCAGGA 

3 5 TCCCAAGCAATGGGCTTTCTCCTGGCATTGATTCrG 

CAAAACTTATTCGTTTGGCTTGCCTCTTGAGAAGAGTGTTATTCAAACTT 

CGGGCCGGAAAGCTGGCATCCTCGCCTTGGTAATTAAAAGCACGGCACCTTCCCTCTGAACGTATTTGTCCT 

CTGTGACTTACACTGCCCATCTTGGGCTCTGGGACAGCCACGGGTGCCAATATACAGTAT 

TGG CATCTCAGAATG CTTCrG CATTITCTT CriTGT AGATTT ATTT ATT AAAAACG ATGC A 

4 0 ATCTTGTTTGAAGTGCTACAGGTCCAGCCCAGATCCTCACACA 

TTGGGTAG AAGTGGCT AACAGATAAAACAG CAGTACTTAAATGAATAGTTTAG CTCTCTAGGCATCACTTGGAGATGGAG AG AAGC 
AAGTGGTGGTTTTTTCCTTTTTTTTTTAATGGGAGTGGACACTTGCTACTCT 




4 5 GAACTTCTTGTCCTGTATGTTCCTCTGCATGTTGGGATTACAGGCATGAACCAAACCAT^ 

TGTGATGTTGGGATGAAGCCCAGGGCTTTGTGTGTGCAAGCACTGTACCAACTGAGCTCTTTT^ 
rrrrrr rr r cr n . crr TC ^ TCT 

AGACCAGGCTGCCCTTGAACTCAGAGATCCCCTGCCTCTGACTCCTGAGTGCTGGG 
GTCCTACTCTTAATGCrTTTrfTTTTTTTTTTAAAGATTTATTTATTTATTTTATATC 

5 0 CTCATTACAG ATGGTTGTGAG CCACCATGTGGCTGCTGGG AATTGAACTCAGGACCTCTGGAAG AG CAGTCAGTGCTGTTAACCAC 

TGAGCCATCTCTCCAGCCCCCACTCTAAGATTTACTTTATTTTGGATTTATGTGTATGCATGTC 

GCACACGAGTATGTGTGCCTTGGAGGCAGGAAGAGGGTGTCAGATCCCTTAGAGTTGGAGTTACAGATGGTTGA 

GTGGGTGCTGGGAAGGAAGCTCAAGTTCTCTGGGAGCAGCAAGAGCCTT 

TGGATTCATCTCCATCTTTAC CTTGAATCTGTCTGTTTTCTGGTGTCGCAGCAAG AGGGAG CAAAATGCTCAGCACGCG 

5 5 GAGGAGGAGCCTTGGGAGGACAAGAGGGGTGGGCTTTGATCTCAGGCTCTGCTTTCTA 

ACATGGCTCCTGCCCTCCTGAAGCCAGAGGTGTCGCTTTCAACCCCAGGAAACCATCCT 
TCATTGTCCTCTATTCTGATCCATGGCACCTGTCCTCCCAGGGCTACTGCATTGGGAACAAGTTCGTGAAAA 
AAGGCAGAGACGGTACTAGCAATGTAOTCCAGTGCTGTCCACGGTGCAGTGGAAAGGGCAGCAGCGCCTGACC^ 
GACCAGAGTGCCTCTCTTGGGTCTGTGAAACTATTCCACCACrAGATGGGTTCTGTGATCCCGTTCCT 

6 0 GCACACTTCTCTGACCTGTCTGCTAGCTTGGGTATCATTTATTTGTAGAAATGAGGTAGGTATTAGGTAATC^ 

CTGTGG CT CACAG T AGTG AT ACCATCATGTTTG AT ACAG CAACACGTTG TCT CAT AAAACAAATG AACAAG AACTGGAAACTTTT C 
CAACTTGGT ATT AACTAATG AAGTTAAACATTT AC CTGTTG CTTT CACT AATTTTC CTT CT1TTT ATTriTTTT CT CTCCATTT CT 
TCCTTCATTCCCTCTATTTTCAAAGACTCAGACTTTTGTCACACA 
GGTACGCACCAACACACCTGGCTTCGCTITTCATTTACCGCTTCTACATATATTTC 
6 5 CGTlH'CrrGGGCTGACTCTCTGTGTGTTCATGCCTGCCTTTGGTTCCTGTTGTACATC 
GCTACTCCATGATGGTGACTCAGTTCClTrrC 



GAGTTACCTAGGGAAAGGTTGGCGACGAAAGAATTAAATAATGGAATTAAATAGAATTAAAAACAGAGGAGGAG 
AGTCCTAGCAACCACCCGGGATTCACTCCCAGGGACCCTCAGGGTAAGAGGAGGGATT^ 
7 0 AGATTGTGTGATGCGAGTGAGCATGTGTGGACCAACCC^ 

TCTTTTAAAAAAAAATTGGGGGAGGGGGTGTTGGGATGTTCAAATCAGGGTCT 
ATCTGTAGACCAGGCTGGCCTCACTTTAGAGAGCCACCTGCTTCTGCTGGGATTAAAG 

MOUSE SEQUENCE - mRNA

75 G CTTGX5ATTCTGAAGTGTGG AAAGCACTGAG ACGTG AAGATG A CGGCAG GAGGC CGAGCAACTG AAGAA 
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CCAAATTAGAGATGCTCGTAAAGCGTGTGCCGATCCGACTCTTTCTC^ 
TGCGGACCJUKiAGAAGACTGAGGGGGCATCTGGCA 

TCTC^GGATGGAAAACTCATCATCTGGGACAGTTATACCACAAACAAGGTTCATGCCA 
CTGCGCATACGCTCCTTCTGGGAATTATGTGGCCTGTGG1GGCCT 
5 GGAATGTGCGTGTGAGTCGTGAGCTGGCX5GGACACACAGGTTATCTGTCCTGTTG^ 

AGTTCTGGAGACACCACATGTGCCCTGTGGGACATCGAGACTGGCCAGCAGACAACCACATT^ 

TGTGCCGG CAGAC CTTT ACAGG ACACGAGT CTG ACAT CAATGCCAT ATGTTTCTTTCCCAATGG CAATG CCTTTGC CACTGG CTCA 
GACGATGCCACATGCAGGCTGTTTGACCTCCGTGCAGACCAGGAGCT 

ACAGAGCAGGTGTCTTAG CTGGACACG ACAACCGAGTCAG CTG CTTGGGGGTGACTGATGATGGCATGG CTGTGGCAACAGGGTCC 
TGGG ACAG CTT CCTCAAGAT CTGG AACT AAAG C CAGGAG CAGGTGG ATGGATGCCATGGCG ACTGGAAG ACCATTCCAAC CTTG AA 
TCGTTACAGCGATAGCATATCCTATCCAACCATACTAACGTGGACACCC^ 
TTT CCTT CACTT ATTG CTC AACCAAGAG CACAATTCCCATT AAGAGAAGG ATCT CAGTG 
15 TCTGGGCCATTGTTTTTGGTTTATTTTGTTATTTTGTT^ 
AAAAGGAAATACTACAATAAAAATGTTAACCAGA 

MOUSE SEQUENCE - CODING 

ATGAGTGAACTTGACCAGCTGCGGCAGGAGGCCGAGCAACTGAAGA^ 
2 0 TCTTT CT CAG AT C ACAAACAAT ATTG ATCCAG TGGGAAGAATC C AAATG CGGAC CAGGAGAACA CATCTGG CAAAG A 

TTTATGCCATGCACTGGGGC!ACAGACTCAAGGCTCXriT^ 

ACW^CAAGGTTCATGCCATCCC!TCriX5CGCTCCTCITGGGTCATGAC CTGCGCATACGCTCCTTCTGGGAATTATGTGGC CTGTGG 

TGGCCIGGATAACATCTGCTCCATTTACAACCTGAAAACTCGTG 

GTTATCTGTCCTGTTGCCGGTTCCTGGATGACAATCAGATAGTTACCAGT^ 

2 5 ACTGGCGAGCAGACAACCAGATTTACTGGACACACT 

TGGTGCTTGTGATGCTTCAGCCAAGCTCTG«GGATGTCCGAGAAGGGATC 
ATGCCATATGTTTCTTTCCCAATGGCAATGCCrTTGCCACTG^ 
CAGGAGCTCATGACCTACTCCCATGACAAGATTATCTGTGGTATCAC^ 
TGGGTATGATGACTTCAACTGTAATGTCTGGGATGCACTCAAAGCT 

3 0 G CTG CTTGGGGGTGACTGATGATGGCATGG CTGTGG CAACAGGGTC CTGGG ACAG CTTCCTCAAG AT CTGGAACTAA 

HUMAN SEQUENCE - GENOMIC 

AAGAGGGCACTAGATCTCATTATAGATGGTTGTAAGCCACCATGTGGTTGCTGGGAATTGAACT 
AGTGCTCnTAACCTCTGAGCCATCrCTCCAAGCCCTTTGTTTTI^ 

3 5 CTTTTATTTTTTAATTAGATATTTTCTTCATTTAC^ 

CTCCCCAACCCACCCACTCCTGCTTCCTGGCTCTGGCATTCCCCTCCTTTGTTTG 

TAGACCAGGCTGGCCTGGAAGAGACAAAGAGATCTCACTGAGATTTTTATACACACT 

GGGTGGCTGGTATTAAAGGTGCTGGCCACCACTGTCCAACT 

TTGAGGCAGAATTGCTTGAAGATGGGAGATAGACCAGGAAGAAGGCCATGAGGCCT 

4 0 GGGAGAGGGTTGGTTGTTTTATATCAAGGCCCTTCCATTCCTAATTAGCAAGG 

GAAAAAAAG T TTTTTTTGTTTTTGTTTGTC^ 1 1 1 1 AAAGCAACAAGGGCTCT'l"lT'rA"l"XTlT'l TAAAGATT 

TATTAATTATATGTAAGTACACTGTAGCTGTCCTCAGAGACT^ 

CATGTGTTTGCCGGGATTTGAACTCAGGACCTTTGGAAGAGCAGTCAGTGCTCTTACCCACT 
TTGTTTTTTTTACCACTTGGACTACTGACCTAGGTGGGATTTGCCG 

4 5 GAGTAATAAGTGCTCCTGAGGCAGCAGAGCCACTTTGTTTCTAACGCTTGGGGTCT 

TCACAGATTCAGCTCTTTAAATGGACATGATACTTAGCTTTGAATCCT 

CACTATAGCTCACTCTGCCCCATAGCAGGTACTTCAGTCAGTGTGGGGGAGGGAAGGAAAT^ 

TAAGGCATTGCTGAATTTTTAAAAAGAAGCATTACCITTCT 

GTCTCCACTGCAAAAGGGTAGAAAAACTTCCCATGCTAriTGGTGAAAATTGAGTTGTGCCCAGAAAC 

5 0 (^CAAGTTGTACTTACAGTTCCTTCCCTTTGGAGAGGGCATGCTGCCACTTTCATCTGA 

CCTAGAACAGTGTTTAAGAGTTTCG AAGAAGCCAGGTGTCX3 CAGGTG 
GATTTCAGAGTTTGCGGCCAGCCTGATCTACAAAGTGAGTTCCAGGACAGCCAGGGCTATC^ 
AAGAGTTACGTTGTTGCTGATAGGTACATATGCCATGTACATAAAACTTTCT^ 
CATGGCCTTCATGTAGTGTGAGAAAGTGGAGAGAAAGAGTGGGCAGCAAGCCAGGAT^ 

5 5 TAG T AAAAT AAACGGG AAGTTTG ACAAGAGAATT AAAAGACAATT C CTG AAT ATT AG TTTTTTT CACTTCTG AAACATGG AT AT AT 

AAGCCCCAAGATCCGCCATTTAGATGTTTAAACAAATTTGGATAAGTACCGAGGTTGCATCAT^ 

GTG CG CT CCAGTTGT AC CGG AAC AAAATG ATGCTGG CACAG CAAGCTTCATGTC CTTT AG AACTTT ATT AT AAAGTTGGAG ATGT^ 

TTTTAGATACATTGGTATATTTGCIATTATAGACTTAATGAATGT^ 

AATATTAGTAAGTTAGGAGTGTTGGTGGGTAGTTCAAGGTAATCCTAAACTAC 

6 0 AC(TTTGTCTTTAAAATATCAAAATAATAATTAAAGAAATGATTTAAACAT^ 

GTTTAT ATTT ACAGAATTCCGAGTT AT AT CT AATTG C CTTTCTT CT CTTTCCTAATC CTTCTTTACTCTCTTG AAGTGGTTTTAAG 

GCAAACCCCCAAATCTTGCCCCCCCCCCCCCCGCCCCAATGAATAACAAGTTAGGAGTTCAAATCTC^TAGGATATCT 

AACAATAGTTTTCTTCTGTTTTCTGTCTGATTTTTCCCTT 

NNNNNNNNNNNNNNNNNNN^ 

6 5 CCCAGTTCTCACAGTCCTCCTGCTTAGGTTTGTTT^ 

TGCACTCTAGCTCTCCTCAGACACTCCAGAAGAG 

AGATTAGAAGTGTGTCCCACCTCACCGAATTTGAAAATGTTAATTATTTAGGGAACTATTTACAAAT 
AGCTGAACTTCTTATATACGTCCCTATCCATTTCTCTACCTTTTGCT 
70 TATATATGTGGGTAGACTGTAGCTGTAGATGGTTATGAGCCTTCATGTGGTTGTTGGG AATTG

TGGCCCAAAGGTTTATTTATTATTATACATAAGTACACTGTAGTTGACTTCAGT 
TACAGGTAGTTGTGAGCCACTATGTGXrrTGCTGGGATTTGAACrCAG 

TCTCACC AG CC CC CAAGG AG ACITTTCTTAAAT ACTTTTTTGT^ CTTT AAAT AT AAT CGT AAACGGT ACT ATT AT CACACTTT AA 
AATGTTAGCAGTTGCTTCTTATATAGCCCCTCAGCACACTTGTTTGCTT^ 

7 5 TCCrrrCTTCCTCCCTCCCTCCCTTTC^CATTGAGACi^ 
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GG CTGG CCTTAGACTCACAGAG ATCCAACT ATTTTTGCCT CATTGACCT ACT ATT CCCAG C CACG 

ATCATAACTTTAAATCTTTTAGAAATTCTTGTCTT^ 

CCCAGAACTTAGGAGGCAGAGGTAGGCAGATATCTGTGAATCCT 

AGAGCTATACAGTGAGACCCTGACTCAAAAAACCAAAGGAAAAAAGAAAGAAAAAAGAAAAT 
5 GTTCTTTGTCAATGTGACTTCTCAGATGTCAGGCATTAA 

CAGT AGT AT AT CACTT CAGTAGT ATAT CACTTCAGT AGTAT ATCACTG AG AGG CTT ATGGGATGAGGG CGTATCAGACTGATT ATG 

TCACTATGTGATATAGATGATGTTTGTTTCTTTGTTTT^ 

AT AAATAAAT AAT ATTTTTTTCAGACGAG AGTGAAAAGTGATT^ 

TCAAGACATGGACCATTAAATTTTAAATTCTAG 

TTTCATCAGTGGCAGGCTTGCAGCCCAAGTATCGTCAGTAACTGTGTGAG 
GGCTCCTTCACAGGGTCTTGACCAGAGGAGCGCTGTGCTGCTGATGCCTT^ 
ACGTGAACTGTTCTAATCGTGCTGACATTTTATTTAAAGATATAGCATGT^ 
CTTTAGTCCCATGTTCTTACATCATTTAAATTTCCTGAACAGCAGCTCT 
1 5 CGTGAGCCTCAGCAGGTGCTGTCTTAGTGGATTAGGCTGGAAAGTGCTACCTCAGCACA^ 
GGAGGAGAGCTGCCCAGGCAGGCAGACCGGATGGTGGGGTTTCCAGAAGGGTCAAAGG 
GAGACGAAACGTAAGTAAGACGTGTTTCTAATTGCTAAAGTTGTGTAAACAGTATTATGGCA 

TCACTCAGACTGTG AG AAGGTAAT ATTGGGACAGCAGGTAAAGGCACTCGCTGTGATGTTGCA^ CAGGAGAGAACCAG CTATTT 
CAGGTTATCCTCTGTCCTTGGAATATGTGTTGTGGTACATACCCATACACAATTAAATAGGTAAAATGATA 
2 0 AAAGATCGTTTTCTGCCCAGCAGTGGTAGTAGTACACGAGGCC^^ 
AAAGAAACCCTGTCTTGGAAAAAACCAAAAACAAACAAACAAAAA^ 

GACTTAACCCACAGAACCTGGAAATCATTCTTTTGATTGAATGCTTTCT 

AT ATTG CTT AAAAGTAGAAAAATGT ATTT AC CAG AAACCT AG TGAGG AATAGAACTG ATGGG CT ATTG AGATAACTTTGGGT AAGG 

2 5 TTGGTGG CTG CCATG C CTG AGGAG TTTTTG AAGCAG CTCAGAAAG AT C CCCTGTTT AT ACAACA CTGGTT CAG CATAG TTGT ATT C 

GGTCTCATGATGTAAGAGGTTTAGTGTTTATGTCCTGCTCAACT 

GTrGCTACTTTTTCTTACGTTAATGCTGGATTGGATGGAAGGTC^ 

AGGTGGTGATCTTGACTCTGTTAAACrAGTAAGAGGGTTTCT^ 

AC AAAAATGG CAGAG T ATCTTT CTGTGTTTTT ATT CATTTGG GATTTT ATT CTGC CAT AT AGGAATG AAT ATT CCT AAATCTAGTA 

3 0 AGAGGGTTTCTTCTTGGACCCTTTGTAAGGTAAAAACAAGTCAAA 

TATTCATTTGGGATTTTATTCTGCCATATAGGAATGAATATTCCTAAATTTTAGATC 
CAGTCTTCTCCTGCAAAAGTCATGCCAGGTACCAGGGAGGCCCAAGGTGTCT 
GATGGTGGATATAACCTGTAATACTAGCCCTGGGGAAGGGGAGGGAGTAA 
CTGTACTGCTTAGCAAGACACTGTCTTAAGAAAGGCAGGAACGTGACTGGAGAG 

3 5 CAA1TCCCAGCAACCACACAGTGGCTCACAACCATCTATACTGGTGGG 

AC^GACACTCAACATACATACCATAAATAAATAAATAAATAAATAAATAAATAAATAAATA^ 
GATAAGGAGCATCAACAGAAAATGAGAGTATCATTGTGGTCAGCAATCTTTTAGAAGCCGTCT^ 
TGAGCAGCAGCAGCAGCTGGTCTAGGAACATGGAAGCCCTCTTTCAGTGCACAGCTTTACCT 
CCCCCCCCCATGTTAAGTACTTAGTAAGTCGACCCTTCITTTTGTTCT 

4 0 ACCATGAGCATATAGTGTGGAACTGTAGAATAGAGAAGTAAGGAACGTGAACAGTT 

GTGTCTTACTCTAACAGTGATCTCAGGACACAGCAGG CAG AGCTCGGTGTG AAG CCAGCCCAGTCT ACAAAGCAAGTTCCAGAACA 

GCTAGGGCCACACAGTATAACCCrCTCTTGC^TGGTGGTGGGGGGCAAGGAAAAGA 

TTTTTTAACTTTGACTrTTGACAAAAAGTTGATCTTC 

TAG ACrCAAAAGGAATCTTTCTGCCTCTGCAT ATCCCAAATTC TGAGATTATGGGTATG CT ATCAT ACCT AACTTT ATTATTATTA 
45 TTATAAAATTTGTGTGTGCGTGTGTGTTTTATGTTTCTGTAGGTATGCT 

GTTTATGCAGGTATATTCATCTATGTATGTATGGAGGCTGATCATTATTTTAT^ 

TGAACCAGAGCAGTGGTTCTCAACCCGTGGGTCATGACCCTTTTTGGGAGT^ 

GAAAACAGACATTTATATTATGGATCGCACAGAGGCAAAGTTACAATC^ 

CCACCACATGAGGAACTATATTAAAGGGTAGCAGCATTAGGAAGGTTAAAAACCACTAGAGGTTTTC^ 

5 0 GTCATTGAACCCAAGAATCCATGTGTATCTGCCGAGACCATTTTGAACCTCTTTAATTATTA 

GGGAGTGCTGGAAGATGATTCAGTAGTTAAATGCACTTGGTGCCCTTCCAGAGGACCCT^ 

GCTAAOU^CCATATGACTCAGATTCCAGGGAATCTG 

ACTAAGATTCTTGCATAAATTATATTGGTATTAAGTTGTTTCAC^ 

ATAACCJU^C^TCAGGTCAGAATTGTAGCCTGTGTGCCCCAGGTACAGTGGTAGATGAGGCCTGCT 

5 5 GAGGCAC^AGATTTGCTATTAGTTCGAGGCCAGCATAGGCCACAGAAATAAGT^ 

TTCCTCTC^GAAGTGAAGTCTTTTTGTGCTGTTGTTTATTTT^ 
C^ACTGGTGCATAGGTCTGGCTGGCCTTGAGTTCGGATCCTCCCTTTCCTGCTTCC 
TACACTCTGAGGTATTTTATTTAAGGCATGCTCTACACTCTGAGGTATTTTATTTTCT 
GCCTTAGACTACTrGTGTAGCTGAAGATGACATTGAACTTTTTC 

6 0 TGGG AATT CAT ACATTG TAGGT AATCAGT CTACT AGCTG AACT AT AT CT CCAAC C CAGG AGGTC AGGTTTG TTTGTTT GTTT AACA 

ATCTAGTTTTGAAACAGTCATATCCTAGGCTGGarrCAAGTTATGTAGTCAAAGA^ 

CAAGTGCTGGGATTATAGATATGCACACCACCACACCTCATTTGTCT 

C^GCACTGTACCAACTCGACTTTTGCATACTCCATTGAAACT 

ATGTTTTATATGACTGTGTATGGCTTTGCAGCTTTGACAGCCTTATTCAAGCCCATATCT 

6 5 CTCCTCAGCATAGAGTCTGGGGAGAAATAGAAGACTTTTGAATGTCCCTT^ 

GTACTGGTTTTCCAGATTACCTCAGTCCTGTTTCITAAAGAAAT^ 

GTTTGTTGTTGTTCTTCTTCnTCTTGATTI^ I I C I 1 1 I A TT Al ' rri ' TTA AAGATTTATTTATTTATTATTACATGTAAGTACTGT 

AGGACCTCTGGAAGAGCAGTCGGCGTTCTTAACCACTGAGCAATCT 

7 0 TTCCATGAGAAGGACCTGTGCTGTTTTAATATTGATTGATTGATTGAT^ 

CACCACAATATTTTCAGCAGTATATTAGTACAGTTAATAAAAGTGAAGATTGTC 
GTTCTGATGAGTGCTGCAGAAAAGGTCTCTGCATTGTGTGATTGTAGCTGTGAC 
TTCTTGCCATGTTTCTGTTGCAGAGCTTGGATTCTGAAGTGTGGAA^ 
7 5 GCAGGAGGCCGAGCAACTGAAGAACCAAATTAGAGTATGTAGTCATGCG^ 
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GTGTGGATAACAGAGTGAGATGAAATTGATCTATTATATGCTAGTTATATATCTGAA 
TCTTATTAGCTACAAGTTGTTGCTTGAATGTACATGGCCCATGTCTGTA^ 
AACTAGGTTTTGGACCCTCTGTGTGGCCACTGTGGCCGCTC 
AGCAGAAATAGATACTTTAAATGAAGGGATCCX3TAAAGTGGCATTTTAGATG 
5 TGTGCTCCAGGAAGACCATTGGAATGTATGTAACAGCTCTTAGGCCACCATAGAGAGCA^ 
ATGGAAGATAATTCCATGTATTCTGAGTTGTTCTTTCTGT^ 
GGCAAATGTGAGTTTTTTGGTTTTGGGGTTTC 

CTCAGTGTAGACCAGACTGGCCTTGAATGCAACATTATACAGAGATAAGTATGTC 

TTTCTTAGACTTGATCTAAGAGCATGGAAAAAATGACTTAAGAC^ 
1 0 ATATTTGAAATAGCTTCCTCTGATTTT 

AATGGTTCACATGTCITTAACTTGTCAGTTGTGTTCTCAGGCTC 

ACTTTTACTTGAATTTAAGAAAACATACCAGAACTGCATAAGTTT^ 

TCACTGGACATTGGTGTCTACGCTGTTCCTCCGTGTGTCAGTTTCT 

CTGAACTATACCTAGCCCTTGCCTAAAATCTTTTTAGGGTTAGGATTTTTC 
1 5 TAAGATGATGTCAAAATTTGAGTTCAAATGATCCCATGTGTCAGTCTCCCTTGTAGCTC 

TAAACTTTTGCATGACTTAAAAACTTAAAGCCAACTTAAAGCCAGGG^ 

GAGCGATAAGAGTTCAAAGTCATGATCAGCTACATAGTGAGCTCAAGGCTCAC 

TGGAGAGGTGTCTCAACAATTAACAGCCCTTACTACACTTTCGAAGGACT 

AATCTTATGTAACTCTTAACTCCAAGGAATTTGACACCT^ 
2 0 CAAAATAAAAGTTTAAAGTCAGGTACAATAGGGCATGTCTCTAATGCTATCAAG 

GC CTG CGCATGGG CCAT ATG CTTTTG AGATTTTT ATTTCTTTGT CC CAAAG TT CTGAAAATT AAG ACT AT AGAAATGACAGAGGAG 

TTTTCAGACAAGGTCTCACTATGTAGCTGTGGCAGTCCGAGAACTCACTATATAG^ 

CACCACGCCTGGClTTTGTGGTTTGAACrTC CTGGTTTT^ TTTCC CTG TGTGAC C CTGTGACTGTCCTGGAACTCACT 

TTGTAGACCAGGCTGCCrrCAAACCCATAGAAATACCCTTCCTCT^ 

2 5 TTTTTAAAACGGGAGCAGGGTCGAGATGGCTATGCTGTTTTA 

TTTAGGGAAAGGATAAGAAAGCCCAAGTGTTGAAGAAAGCATACTTAGAACAGATAGAAAGG^ 
TAGCGGATTTACTAAGCCATATGGTTGTAGCTGTGTAGCTGCrrc^ 
AGA Tl T G TTTGTTTCTTTGTTTATTATATCTtSTGAACACTGTAG 
CAGATGGTTGTGAGTCACCATGTGGTTCCTGGGATTTGA^ 

3 0 TCTCTCCAGCCCAGATAAAGAGAGATAGTAAATCAAAGATAAATG 

CCCTAGTACCCACCCGCCTTTCTGCATCCAAATAGTGCCCCTGTA^ 
AAGGCCAGCTTCAGTGACATATCAAGTTTGAGGCCAACTGGGCAACATAAAGTC 
TATTAGTAATATGAAATCTTAAGTTTCATCTAGTCCCATGTAAAGTATO 
TACTTACATTTCAGTTTTGTTTTGGTTTTAA 

3 5 GACATGCACAGATAGTTGGTGACCCCCCCAAGACCTCTGTTGTGGACAGCACCAATT^ 

CAACCTTCATGATGAG CATCATCTTCACAAATGG CTC CATGGG TTTGG CCTTG C CT CT CAG CACTGCACTAATTTT AG AGT CG ATT 
GGTGAGGAGATGTGAAGGCCATTGAGTCTCTAAATGGGAAGGGAAGACTTTTGTTTG 

TTGG CTCCCCTGCAACTTGATATGTAGACT ACTGGCCT CAAACTGCTTGCCTCTG CCTCCCCAGTTT ACTTTGG AGACT AAAGCTG 
TGTGCTACCTT ACCTGGCAAGGAAT ACTCTTT AAAAAAGT AAAATT CTGG AG 

4 0 CTGCTTATAACTGTAGCTCCAGAGAATCCAAGACTGCCTGTCTTTTGGTTTC^ 

AAAACAAAAAACTTTTTTTTCTC^TATAAGGGAGGAAAAATGTAAAATGTT 
AGAACTTAGTACAAGGGCTGGAAGGATGGCATAACTCAGTAGTAGTCTAGTGCTT^ 
AAAATACGAGGATTATCTTCACTTTGAAGGCCAGCCTAAACTACCAATCAAGA^ 
CTGGTGGTGTACGCCTTTAATCCAAGCACTCGGGA 

4 5 AGTTCCAGGATTGGGTGAGACGGCGAGCACTTTTAATCTTAGCAGTCAGGAGGCAGAG 

GGCCTGGTCTACTTAACAAGTTCCAAGATAGAATTATATAAAGACCCTATCTCAAAACAAA 
TTAATCAATGAAGTAAAGGTTCTTGCCACCCAACAGTGTAAG 

AGG AGAGAGT ACTGTCTCT AC^AAGTTGC CCCATG AC CTTC CAT ACTCT CAAGCACACTG AT ACACATCAGT CA CACTCTTGG 
GTTTGTTTGTTTGTTTGGATTTTTACCAACAAAGCGTTTGTG 

5 0 TTTCTCTGTGTATGCCTGGCTATCTTGGAACTTG CTCT 

TGAGTACGTACTGGAATAAAAGCTGTGTGCCACCACCACCTATAC^GTCAATATACI^ 
CTAGCTGGTGTCTTTATACACATACACAGCTT 

ATTTGTTTTTTATGTGAACTTCCCCTGGGGGAAAAAAGGAAAAACAA 

5 5 CAGGCTCTCAAGTAGATGTTTAATTAAAGCAC^CACGCCrrTAAATACAGA^ 

CATGGACCAGAATGTAGCGACCCAGAGCTGAGGGTGGGCCCAGACT 

GCCAG CCTCTT CTGT ACT CAAAC CAAA CAGG ACAG AATG ATTG ATTAAATGG AT CAGTCT AC AT G AATCCAT CAAG AAAT CATTGT 
AAAGCCGGGGGT<KXjGGTGGAG lTTrTrr T fTTrr ^ 

GACACACCAGAAGAGGGC^TCAGATCTCATTATC^ATGGTTGTGAGCCACCATGTGGTTGCT^ 

6 0 AAGAGCAGTCAGTGCTCTTAACCACTGAGCCATCTCTCCA 

AACAGATCTCTGAGTTTGGGGCCAGCCrrGGGCTACACAGAGAAACCCTC 
GAAAGGAAG AAAGG AAG GAAAAAAAAAGAAAAGAAAGAAAGAAAAATT ATTG TA^ 
TCACTTGGGGAGGCAGAGACAGGTCAATATGTATAAGGTTGAGGCCATCCTAATCAAC^ 

GG AC CCTCT CAAT CAT AAAT CAATCAGT CATT CTCAGG CCAG CAG C<5TG AGTCATCAACT AAAAGT ATGTCAAT ATTT AT TTT ACT 

6 5 TATAAGTGAAAACACTTTAATTAATATGGTAAAAATTAAAGAAAAATTACTATAGTATATTGTTT^ 

TATGTTTTATAAATAGAAAAGATTGCTGTTTTGTTA 
TAC^CTTAGATTTAATCATTGTTTGGCTTA 
TAGTTGGTTTTCAGCCTGTTCTTTCATGCAACCCCTTX3TC 
GGGAGTCAGCCTrGGCTTCCAGGCTGCTCACTGCrcTACCAGCTCCCA^ 

7 0 TCTCCATCTCTTAGTTAGGTTTAGGTGTTGTGAGTTGTGTCTGTTTTGGC 

TTAGAGAAGAATTAAGGAACAAGACAGTTTGTAGAGGCTTCTTGAAACTT^ 
AAACTAATTTCCATTTGTTATTTCATAGGATGCTCGTA 
TTCAGCTCTCCGTAATGCGCAGTCTC^CCGCGGGCTTGTATCATGTGGGGTC 
CAGC^CTTTTCATTTGGAGGTTGCTTTGCTGATGG^ 
75 GTTTCACCa*ACAATTTAATC^^^ 
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CCATCCC^GAGCTCAC^GGGTAGAAGGAGATTCAGAGGTGGTTCTCTGACCTCTGTGAGCATGTGCCGCTG 

TACAGTTAAACCGTGGGCATCCAGCTCCTGCTGTAGTGTATCTACTGCTGTG 

TATATCGGTTTATACTTCTAGGTAATAATATATCATTGGAAAATCAGGGCAGGAACCATAGAG^ 

5 ACAGTCCTCCATACATAGGCCCACAGAGGAATCTGATCTAGGCAGTGTT^ 

AGTTAACAATAAAAACTAGCAACACACACACACCTATACTCACACCTACTCTTGGCT 
GACATAGAriTATTTTCCCTGTGTATCXATGGCAGTGGTAAAGTGAGTATGTTAGG 

AGCAGT AAT AAAATGGT AG CT ATT CTATG AAATGGT AGTT ACATGTGAT ATCAG ATG ATCTT AAAATAT CTTACTG CATGT ACTGG 

TGCGACAGGGGATGACAGTGCCTATATCCTTGATAAGAAGAAGC^ 
1 0 TGCTGATAAACATGCTGTATACTTAAAGCATGGACTCACTGGACAAG 

GTGCTACITAAGGGAGCAATGTGCAGCCGGGGAGTGGTGG 

TCTGAGTTCX3AGGCCAGCCTGGTCTATAGAGTGAGTTCCAGGACAGC 

GGGGGGGGGGGAAGCAATGTGGAATTTAAAGTTTATGAATTGGCTGCTG^ 

TCTAGAGGACTGGGATTCTGTTTCCAGCACTCTGTAGTGGTTCACAACTGTCT 
1 5 CTAAAAATATTCTCTCACATAAATATAGGATTTTCCATTTAATATATTTTTO 

AACGCAAACCAAGAGTCTATTTGCTAAAATTAATCTTCACTAATTT^ 

AATCCAAATGCGGACCAGGAGAACACTGAGGGGGCATCTGGCAAAGATTTATGCCA 

ggggccttcctaaagcagtacttgaaatgttgcctatctgctc^ 
gtgatagcacactttgaatctctgtactcaggagacagaagcagacaaatc^ 

2 0 GAGTTCCAGGCTAGCCAAGGCTACAATGAAATCCATCTCAAAACCA^ 
CTGTATCCAGGTCTCATGTTCACAGGTCATACTCAGATTTCCGTTT 
TCCTTTTCTTCACCCTATACCITlTAGCrGTTGGGT 

GAAGGTAAAAGGTACTGAGTTTACAACAGCAGACTGTGGTCCCAAAAACTAAATC 
GAAGCCTTATGGAAAATGAACGAATGGCTATCCITATCATC 

2 5 AGATACrACCATTrGGGGGTTCCTrTCTTTTATACTTTGATAACT 

AACGAAATAAAGTGCGACATGGTTTGTGTTCTCCTTCCAGACCTAAAGGAATCTCAG 
TCTGCTTAGCTTTCTCITTGTGAAATTATCK3 

TCCCATACAATTGCTTGTGCAAATGAGCGCrATCTGAGGCTGTTTAATAATC 

GGCATTGTGTT CTTG CTG CAGCTGCAG CTTCAGTT AG AT AGAT CATGG ATGT ATTCT ATTGTTG CTGT CAGAATTG TT AAGTG CAG 

3 0 ATTTGT AG ATTT ATTG ATTG ATTGTTTTTTGAGTGC CGG ACATTG CAAG C AATTGG TGTACCACT A 

AGGTGCACCTCTAGTCCAAGAATAGTTACCATATGGTCCTAAAAATGTGAGATAATGACTGTGTTAAT^ 

GTAATATAAACATTAAACAACTTACAGTCCATAGTTACTGTCAATTAAAAATAAAGATCATAAAA^^ 

AAAATGGACAAACAATGGTGGCA CTri ' TrU ' lTrT TTrT l TTTrr r r 

GCAAGTTCTAGGACAGCTTGGG CTACACGAAACCCTGTCTT AAAAAG CAAATAAACCAGGGGCTGGTGAGATGGCTCAGTGGGT AA 

3 5 GAGCACCCGACTGCTCTTCCAAAGGTCCGGAGTTCAAATCCCAGCAACCACATC 

GCCCTCTTCTGGGGTGTCTGAAGACAGCTACAGTGTACTTACATATAATAAATAA^ 
CCTTGCAAAGCATT ATTT AGGG ATTG CTTTAAATGAGCATACTTTCTG 
TTTAAGGAGTTGTTGTGGGTATGCATACCCCTCTACCCCCTTACATCATTTTCATATTTTT^ 
TGAGCTTGGTGTGGTTGCAAATGTCTTTAATCACAGTACTT 

4 0 GTCTACATAGTGCGTTTTAGACCAACCACGTAGCACAAACTTGAAGACCT 

TGGCC AG AT ACTT AG AACC CCACTG CTTCAG TCTGG TCG AGGT AAAAG AATT ATTGGG ATCACTGGC C CT AG CGG AAAAATGATGA 
GCTTCCGGTTCIAGTGTGAX3ACTGCTTGAGAGGAATAAGGCAGAGAGTCTAGAA 
TAGGAACACACACACACACJVCACACACACACACACrAAAATTCAGC^ 
GGAGGTCAGGGCATTATGACCAAGGAATCAGTTCTACTTCT 

4 5 GGGGGGGGGGGGTCTTTCAGGAAATGTTTCCCTGTGCATCCTTGGCT 

AGATCCTCCTGCCTCTACCTCCTGAATGCTGCTGGAATTAAA 

GATAGGCACCTTTACCTGAGCCTTCTCTTTAGCCCTAAAATTTAACTTGTAATACT 

GTTTTTAATCCCCTCTCTACAGAGCCrGATGAAGTAGTCT 

GTCTATGAGTGCCTCCTTTTGTTCTAGGCAGCTAATTTGGCTAGTTCTCAGTTTCT 

5 0 TTTAGACAATGAACCGTGTTGATTCCTCCAAGTTACATGACCTGAGATTAGAAAGGCGTC 

CTAACAGCIT AG CATTGTCTT AATT CCTCACTG AGTT AGCAATT ATTG AG 

TACTAAAGATGTCTCTTGATACCCrTGCTCAGAGTCAAAAAGACCAAGCAGAAATAGAGCAGCAG^ 

CTATAGAAATGGTTGAAAGGACCTTGGGTATAATGACTGAACTGCAATAGATGGGGAGCTG^ 

TTGGCTACACAGTACCCTATCTCCAAGGAAAGTAAAAGAATCGTCTGAGGAGTTGGAAATACT 

5 5 TTCTCAAGGCrrCCAGAGTTTAGTCCCTAGAACCTCATAAAGAAAAAATTTTA 

GAATTTACIXSTGAATTGAATCTCTAGAACTGAAAAAGAAAT^^ 

TG AT AAATTTTG TG ACTAAGGATGG AAAAGG ATTTTATTTCTTG TCACAT AT AT ATGTTG TTC TT^ C 
AGAAGATAGCTTGGCAGGGTATGATG TAG TACAGCTTTAATG CAAG C^ 
GGTCATCCOSGTCTACATAATAAGTTCCAGGAC^GTCAAAGCrACATAATAGAGGAACCCTG 

6 0 CATACAAGCATGGTCTGATTAAAAGTGTCAGAGGGCTCAAAATGGCGTGGT^ 

GTTCAAATATGCCCAAATATACnTGTGGCTGAAATACCTAGAATGTATATATTTTGA 
ACTATCACATGGTAGACCTGGGAGAGCCTCTCACCCTrCCTTAAGAGCACATT 
TCACTCAGTTACTTCCCAGATAACTGAGATTCCTrTG 
TGCAGAGAACTGCAGGTTCGAAACTAGGACAGAGTCATGAGC 

6 5 GTGCJUVTTCTGATGTCATTTATCATCTCACTACCACACAGTTTT 

AGAGTTCGTTTTTCTTTTAATTTTGAGGGATAGAGTCTGGG<: 

TTTATGAGTTACAACGTATGCTACCATTGGAGCTCATGGTTGTGTTCT 
AGCAACTTGAAGAAGATTTTGTTTGAGTTTATAGTTGTAGTCTA 

7 0 ACTAGCTTTCTCTTTAGGGCTTACTTAGCCK3CCATCCAGGACTA(^TGC 

TCACATGAATTATTAATCAAGGACATACCCATAGGTTTACCTATGTGCCAA 
TTCCTTAGAGGACTCTAGCTTCIGTCCTATCCCACCCC^ 
TGGTTCCTGAGTGGGTAGTGTTTCAAGCTTITCCCATAATATTCAA^ 
ATTTtSACATGTCTCATGTCTCTGTGTATCTAATTTAACTTGGAATAAAAAAGAAACCCCT 
75 TC-TCGGGTGATaAGAGAAGAGCTGG<^AGTCTGGTCAT 
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TTGCTCAAGCATTGAGATCCTCTTTCTTATTATTAAGTGTA 

AGAT l TTTTGTTTCT GT l TT TT Cri ' lT CCTTTAT C T T ' 1 ' T CTCCAGG CTCCrTGTCAGCGCCTCTCAGGATG<»AAAACTCATCATCT 
GGG ACAGTTAT AC CACAAACAAGGTGGG CAGTG CAAGAC CCTCC CTGG AACAGTG ACACrTCTCTTCATGG AG CAATGATG ACCTCG 
GAGGGTGATAGAAGAGCTTGTGTTGCrGACCCATTGAGCT 
5 GCTGAGTGCTTCCTTGTCCTTGCACAAGAGCTACACTCCACrCCT 

GAAATGCAGATTCCATTTAGTTAGTAGTTTGTTTTGTAGTAACAAGAGATAATCT 
GATTGGGAATGTAGCTCAGTGGTAGAATACTTGTCTACTGGGCATAATTCCCTG 

ACCAATGTAACC CAACAG ATTGGT AAAAGT CTGTG ATTCTAGT AACTCAGG AGG CT AAGGT AGG ATGATCGAAT AAACCCAG AGGG 

CAAACCTGCCATGGCTAGATAGGGGAAAGGAGAAGACAGAGATACTAGGTAGAA 
1 0 AATTTCCACTTCTCTTATATTCAAATAAATTAC^ 

GCCTCAAGCATTAAAGATTATGCTTACCCACATAATAAAGTG 

TTCACAAAAAACCAAACAAACGGGGCTGGCGGGATGGCTCAGTGG 

TCCCAGCAACCATATGGTGGCTCACAACCATCC<3TAATGAAATCTGATG^ 

TTACATATAAATAAAAAAAAAAAAATTAAAAAAAAATTTAAAAAAACAAACAA^ 
1 5 TGGCCAGTCTATAATTCTAGCATTTGGAAGACAGAGACAAGGAATCCA 

AGACAGTCCTGAGGAACACTGTCTCACAGAAGAAGGTGGAGAGCAGTCAG 

AAGCACACATGCGCATAAAGTACGGGGAGCCGAGCCAGGCGTGGTGATACACGACTTTAA 

GTAGATCTCTGTGAGTTAGGGGGCCACCCTGTCTATAAA^ 

GAACGAAAAGGTGCAGGGCTGGAAAGATCGCTCAGTCACTGCCT 
2 0 T AGTGGTTTGCAGCCTTCTGTGTGTCAGTTAAAGGGG AACCCAG AG CCCTCCCTCCTTT AG CTTTCCAXMCTCTGTACTCAGTTAA 

TCAACAGTTAATCAAATGTTCCTACAGATAACATGA 

ATTGGACAGTTCTATTTTCCCTCAAAATTAGTAA^ 

ACTGTGTTACCAATGTGCAGTTAGGTTATTGCATGTGTGTGTCAGAGAGA^ 
GCTGGGATCGGCATGAATCACCATGCCTGGCTGlTITrTATGTG 
2 5 AAGGGG AAAGT AAATCTTTTCAAATC CTATT AACZAT CACTC CTT AAATCATT AAAATGCAAATATAATTTGTGTT TT CTTTGAG AT 

AGTTTTCCATTATGTATAGCTCAAAATATCCTCAAACTTGCC^ 
TCAAGTCTCAGAACGAAAAAAGACAGACACGACAAAAAGC^ 
TATGCCATCACACCTATCTACTCATTAGGGTTTTTTGGTTTGGTTTXKSTT^ 
GTTTTTGAGACAGTTTTACTATGGAGCCCTGCCTGGCCTCAAAT 







GCTCCATTT ACAACCTG AAAACT CGTGAAGGG AATGTG CGTGTG AG TCGTG AG CTG G CGGG ACACACAGGTACACATCTCATGTTC 

CAGTGGTGTAAGCTGCTAACTGGGATGTATACTGCTCTTGTGATCTGTTGATTATTG^ 

AAATTTTCTTGAAAGTGAAGCCAAACCAGGGCCAATATGCCTTCATCTTTAAGAAACATGG 

3 5 ATTCCCAGCTGCTTCAGCCATTCTTGTTTATCAACGAATTGATGTTTCAG 

GCACAGGAATTCTCCTAGTCTAAGCAAGTGTCCATATGCACAGCC^ 
GCCGAAGAGAGTCTCTGGTCCATTCACCTTTT^ 

CATTTAGGACACATTGCCTTTCCAAGGAAGAAAGTCTCATAGTATTTTCTATCTATCCT 
TCGCATGTAAGCAAAGGATTGTTCATAATTAACTGTCATGAAGAACATGGCAGT^ 

4 0 TGCATCCTTTGGTGCTTCTTAAGTTACATTGCCAGTGTTTATAACrA 

GGAAGCATAAGCCACCAGTTTTAGATCTCTTCTTTCTCAAATATATGAACTTAAAAGTGG 
CACGCCTITAATCCCAGCACTCGGGAGGAGGTAGAGGCAGGAGGATTTCTGAGATCGAGGCCAGCC^ 
AGGATAGCCGAGACTATAC^GAGAAGCCCTGTCTTGAAGGGGAAAGAAAAAAAA 
AAATTTTGATCAGTAGTATTTAAATTACTCTTCTAGGGGCTGG 
45 TCTAGGTCTATCCCAGCACCATAAAACTGGACATCATGATAATTTGTAATCCCACCTCTCATGAAGTGG^ 
ATTCAGTGAGTTTAAGGGCAGTCTGGGCTATATGAAACCCTX3TCTTAAAA 
AAAGAAAACAATCGGGGAGAAGTCTCGGTGACTAAGAACAGCCACTGCAT^ 

AGCAGG CCTGG CGTGTGATAATTTCATCATTGTAGGCTGACTCCAG CCTCGTTCAGTGAG AGGCCCAGTCTCAACGGACTAT AGCA 
AACAGTGATAGAGCAGGATATTCAATGTCTGGTGTCCTCTGTTAGTCTCTGGATAGGCATACACC^ 

5 0 ACGGATATCTAAAAATCCTTGGGGGGGGGGGGGTCCTGTT 

AGACACTATGG CTG TCTT CAG ACAT AG CAG AAGAGGGC AT CAG AT CT CGTT ACAGATGG TTGTGAG C CACCATGTGGTTG CTGGGA 

ATTGAACTCAGK3ACCTCTGGAAGAGCAGTCATTGCTCTTAACCGCTGAGCCATCT 

ATTTGGAGAGATGGTTAAAAACACTGGCTGCTCTT^ 

TATATGTAATTCTAGTCCCAGGGAATTTCATGCCTTCTTCT 

5 5 GCAGTCCT<K3TTTTATTGCTGTGATGTGACATCATGACCAC^ 

GCTC!AGAGGTTTAGTTCATTATTGTCATGGGAGGAAGCATGGCAA 

TATTTGGATTCCCAGACAGCAGGAGAG AGTGACACTG^CCTGTCTTGAG CATCTG AAACAT CCACCCACCGT AGC CCTCTTCCTCC 

AAGAAAGCCGTTTCAACAAGGAGGTGTTCCTAATAGCACCATTCCCTATGGGCCC^ 

AGGCAGAAAATTCATATACATTAAAAATTTTGGGGCTGGAGAGATGGCTTAGTGGTTAGG^ 

6 0 TGAGTTCAAATCCCAGCAACCACATGGTGGCTCACAAC 

ACAGTGTACTTACATATAATAAATAAATAAATCTTTAAAAAAATTTTTTTT 
TTAAATTTGAACTGGTGGTCTACCAAGACATCATGCTATAAGAAACAGAAACT 
GTCTTTCCATCAGAAGGAAAGAATCTACATCTC 
TGATCXX3TTGTTTTATCAT<3GCTTAGTATCCCCTCT 

6 5 CAGCCTCAATGCTCACTGTCTGCATGACATTTTrTCT^ 

TCTTCACTTCTGTTTAAAjGTCTCTGCCTTT 
TGAATGAAGCTCTAGGTTTGATCTTCAGCCCAGCATAAAA 
GAAGCTGGGAGTTGAAGGTAATCCTTTACTGTGTAGTGAGTTTGAAACTG 
GTTGGGGCAAGTCCTGTACCTTCTAAATACTrTACATATAGTTTC^ 
70 ATCAATCAATCAGTCTCTCTGTCTCTCTGTCTCrrCTGTCTGTCTGTCT 
CTATCTATCTATCTATCTATCTATCTATCTATCTAGGGTTTCTCT 
CTGGCCTTGAACTCAGAAATCTGCCrCCCAAGTGCTGGGTATCCAGAAATCT 
TTGACTCTGTCTAGCCTGCCATCCCACATCTTCCTCATCITATATTCA 
GTGCTCAGATTGAAGTGTGTGCTGCTTCGATGGGTGTATAAACT 

7 5 AC^GTCCTGACAGAGAGCTTCAGTTCATTGCTGATAACCAGTACCCrGGG 
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ATTGCATTTAAAATTTGAAGTTCATTTGCACTAAAAAAATCCT 

TCTCTCTCCCTCTGCTTTCTTTTCTGCTGTCTGTCTTTCTTTC^ 

CTTTCTTTCTCTCTTTCTTTCTTTCTTTCCTCCTTTCCCTCCCT 

CTTTCTCTCTCTCTCTCTCTCTCTCTCTL'rrrC'riTCrri'U 1 1 TC 1 rrCT T 1 L 1 1 L T 1 1 1 GGG TT ACTCTGTTTCT AGCT CT AG CT 
TGCTCCAGCCTCTGCATTCTTTTCCCTCTGCCTCCCCAGTGCT 
TTTGAAGAGTTGTATAGGCTCACATGCACTCAGACAAACAACGTCCTC^ 
CTTTTTCTGGTTTCCTGTATGCCAAAAAGCTACTCCCTCATAACACCT 

TTTTT AATGT CTGTATGTTTG TT CAATGAAATTTAATTT ACAAATT AAAGAAAAG AGATTAACAGT AAGC^ C 
AATAT AGCAATACACTATAATTAAGTTTTTG'lTr'rrr 1 TT ITT T T G ACTCATGG CACTGAGAATTTT ATTTATTT ATTT ATTTATT 
TATTTATTATTTTTTAAAGATTTATTTATTTATTATATGTAAGTACACTGT 
TTTCGTTACGGATGGTTGTGAGCCACCATGTGGTTGCTGGGATTTGAGCT 
CCGAGCCATCTCACCAGACCTATTTTTTATTTTTTAAGGA^ 

CC1AGATGAGG CTGCAGTGG TT AGAGT AAACTGCTTC TTT 






TTGACCTCAAACTCACAGAGATCCATCTGTCTTTGTCACCACCTTTGAGTTAAATTATTT^ 



GGAAGTTCCTTTACCCACTGATTCATCATGCCAGTCTTTGACTTTCAT 
GG ACAACTTGTGGGAAGTTGG CTTTTCT CTT C CAC CATGT AGGTCCTGCAAATTGAACTCAAGT AAT CAAATGG CAGCAAGCAC CT 
GTACACATGGTCCCATCTTGCTGGCCCTTGACCTTTATTCCTGAAGGATAGTTTTGCT 
TACAGTTGGTGTCACCTTCATGTGTATGTGTGTTTCCCCTGCATAAATGTTGATGCACCACATG

AGAGTGAGGGT ACTGGGTTTCT C CTGGAACTGG AGTT AATAG TT ACAAC CCAT CGTGTTGGTGCAATGAATTGAAT CTGAGTCTT C 
TAGAAAAGTAGCCACTGTTCTTAACCACTGAGCCATCTTCTCAGCTTTCA^ 
GATTTTTGCAGTTTGATGGTGATATTCCTAGACGTGTAACTTTCnrrT 
TCTTTTGCTAAATCGTAAGGTTrGAGCTGTGTCTTCCAATGCCCTC 

CCTCACGTGTTTTGTTTCTATTCAAGTCAATTCAGT

GCAGCATTTGAAGGAAGCTCAGCTGGAGTAGACGGCAAGGGCACAGACTGGG 
CACATTTGTGTCACATGGTCTCTTCAGATTCAGAATTTCTGTGTTTTTATTT^ 
GTCTTCTTATTTATAAACAAGATTATTTCTCATGTTTCrrGAATATAATTACAAGAGC 
TCTGATCTTGAGGTATATCTTTGTGGTCCTTTCCTCTTGAGTGTGGA 

AATATAATTGTCATAGCAACCGTGGGCTCGTTATGTTCCTTTCAGATGTCT

CAGCAAAACTCTTACTTGTCCTCCTGTCACTGGAGCTGTCTGTGGGTTTGCTTGCTGC 
GTGCCTGTCTGTGTAAGTTAGGACTGTTGGAGGCTGCTTTAGCACTTCT 
TGTAAGACAAAACCATTGAAGTCCGTTGAAGAACAGGCGGTTGTCGATTTACG 
AGAATGTATAGGCTATCITATAGGATAAGGAAACCCTAGAAATGGATGTAAT^ 

ACTATTTATATTATTATTTATAATTTATAGGTAGAATGTA

ATACTCCTTTAGTTCAGCACTCAGAAGGCAGAAGCACAGGACTCTGACTTCAAG^ 

CAGCCAGGGCTACAAAGAGAAACCCTGTCATAAACAAACAAACAAATAAGACAGGCACATACATATGTACATGTGTGTGTGCGCAT 

GCGTGTACGTGCACTTTTTTTCTTTTTTTTTTTTTTGAAAATG 

GCACAGATTGGTCTCAATGCTAATCCCTAGGGACTCATGAAGAAAGAGTGGAA^ 

AGTTCTGGTCCCGAGGGCCATGATTTGCCTGCCCTTCTGATGTAACATGGCGTTG

AGCTAGAACATTCTGGTGTATAATTTTTTTCATCTAATGAATAATTGGGTGTTTTACATATT^ 

rGGTCTGGGTTTTTTTCTCCCCTCCCTCTGAAGAGTATTGGTGACCTTG 



TGGTGTC CCCTCCCTCCTVTGTCACTCCAGCAAAAAGG AAGTGCTACTTTCAGGG CTAACCACACAGGCAATTGCAGCACGT ACCTG 

4 5 TGCACAGAACCCATACATATCAGGGAAGTTCTCAGTGTCCTGCCTTGTTGAGTGAAATTTTTTTT^ 

TATGTAAAGTACACTGTAGCTGTCTTGAGCTAAGACAGGATCACTGCTACAGGATCACTGTAGCTGTCTTCAGAC 

GGGCGGGTGGTCGTGAGCCACCATGTGGTTGCTGGGATTTGAACTTCGGACCTTCAGAAGAGCAGT 

GCCATCTCACCAGCCCCCATTGTGTGAAATTTTATGTAACCATTTATAATTTGTCCCTTGGCT^ 

CAGAACT ATGTG ACTTTCAGAT AT CAGTCTTTTCAGTTACT CATTTT CTTCT CC CT CAAAATTTTT ATTGTATTTTGAGG CAG TCT 

5 0 CGCTGACCCCTGTTCCATCCACAGCTTCTAAGAAATCTTTCCAGTGGGTGTCAGCAGAA^ 

TCACAGCTCTTGAGAAAGCAAGCGCAGTCACCCGTATTTTTGGAGCCT^ 

GTTGGCCT6TAGCCCTTTCTCCCTTGGTTCCGTCACTATCTCCCTT CTCCTTTGTGTCCAGCTGCTCTACT 

CAATAAGTAGATGAGCACACAAATGAGGATAAAAGGAAATGGAAAGACACAGGTGGAAGGATC 

CTGCTATGTGCTGTGCAGGTGTGGGTCCGTGTGAATCATGAACCCATTGAGTCTGCCTATAGT 

TCTGGGCATTTTTATTAACGTGGTTTTCACAGGGAGAGGCTCCT

TCTTTTrCAGGTTATCrGTCCTGTTGCCGGTTCCTGGATGACAA 
TGACAAGGCAGCAATTGAGGCTTGAGGTCTCTCTACTGTTCTTGTTAATACT 
GCTTCTTGATCATAACCTGACAATTAGAAAACAAAGTATCAATGAGTAACTCATTC 
TGAACAAAGAACTAAAATGTAGATATTTAAACTATAGTTGTATATCAAGTACCT 

6 0 GTGGTGTTTTTAGCCCTGTGGGGTTCTTTTTCACTGTAAAAGTGAACTTAGCATATT^ 

GCTCTGTCAGATTTGTTTATTTTAGAATGTTTCCTTACTACT 
AAATGACCAGCTGGTTATTTTAAGAGAATCATTTTAAACITA 
AGGTCAGTAGCAATATTTTAAATTTGATTTAGTAGC 



ACAGAGTGAGTTCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCTAAAAA

GTAGTGAGGTGGACAGTGAAGATTAGTCATTACAAATTTCAAATTGC^ 

TTGTG CTCTGTGCTTCTTCCTTCCAGTGTCCCTCATGCCT CAAAG AT AACCAG CATTCATTGTAGTGAACTTGGTTGATTGGAT 

TAGCCTGACTGCCATTrCCATAGCCAGCTCCCGCTTTATGTAATGCT 

CCCTTGGAAAACCTTGTACATCCATGAAGACCATCTTTGG

GCCACTTATTTACATTCCTAGTGTCTAGGAAGGTGGAAAATGAGGGGA^ 
CAGACCTTAGCATTTTCTCCCATCCCACAAACTCCCCATGATTATCTrGTC 
AACCTGGAAACAGCAGGCCCATATGCTCATCCATATCGTTGTTTATrcTTTTCA 
AGTAAATGTGTCTTCCCCGTGGCACAGAAGTCATCTCTGATTCT 
75 GCGTGCCTGATACCTACTTITCA1TTCTTCCTG 
GACACACTGGAGATGTCATGAGCCTCTCTCTTG

TGGGATGTCCGAGAAGGGATGTOKrCGGCAGACCTTra^ 

GCTTTAGATCTCAACCTTTGTTGTTGTTGTTGAAGATTTGG^ 

G TTGC CGG AACAAAAACAC CATGATCAAGG CAAC CT AAAGAG AAAAAACATTT CATTGAGG CTT ACAGTTTCAG AGGG CG AAT CCA 
5 TGTCCATCACTGTGAGGAGCCTGGCTGCAAGCAGGCAAG<^TGGTGCTAA 

ATCACAAGGCAGAGAGTGAGTGAGCGAAAACTTGGAATGG CATGGCCTTTTG AAAGTTG/U3GCATCCAACAAGGCCAT ACTTTTT A 
ATACCTCCCAAACIAGTTCCACGAACTAGGAAATGAAGTGTTC^ 
TGCCTGTCTGTGTATCTGTATCTGGGTGTGTGCATGCGTATTAGAGCCAAGTAGGCT 
AACCAAACTTGGTTGTAGTAAATGGCTTTTAAAGTACCT^ 
10 AAAATTTGTGCTCTXK nTrn 'l XOT TG l T l^ ^ 

TTACCTAGAACTGGAAGTCTCTCTAGCACTAAGATCAC^ 
ACCAGAGTCAGAGTGTTTTAGAAGGAGCTAAAACCCrcTGCT 

AACTCTTC CTTTT 1 ' X ' l " ! ' AAGTGACCAGT AATTTAGAGGCTGAAT ATCTTGC CAGCTAAGGGAATGGCAAATTT AGAACTTACTTTC 

TTGCTAGCACCTCCTGAAGAGATGAGTCTGGTGACTAGAGCZAGTGACTTGCT 
1 5 CCAGAGGACAGAAGCCCTTCAGCAGTGCCTCAGTCTCATTGCTTT^ 

GTACTATGTCAGTACCCCCCTAGTACATCCCCCATTCACrCTAGGATGACAGCCCT 

TTTCCATGGCAGCCATGCCTCTGCATTAAC^TCATC 

TGAATTCACTATGTGCCTGAAGGCTGTGTACTCTATTCCATGGTTGAT^ 

CTGAACTATGGGCCTTACCCACGCCAGGCAAGTGTTGAATCAGCTGGCCTGTGCCCCAGGCCT 
2 0 TXSTCTTCTCTTTTTAGTTCTTTCCCAATGGCAATGCCTT^ 

CAGACCAGGAGCTCATGACCTACTCCCATGACAACATTATCT 

CTTGCTGGGTATGATGACTTCAACTGTAATGTCTGG 

GTTTTGGGGGCACCAGCTTGCTTCTTTCTATGTGGTAAAATGAGGT^ 

GTTATGTCATAGCTCAGCAGAGGGGACAGTTAAAGGCCAGAT^ 

GAGGAGAATGTGGGAGTTGATAGTATGCCAGATACCACTGAGTTCAATGTC

TAGCCrAAAACACTCTATTAGACCAGATGTCCTTGAATTTAAAAGTCT 
TACCATCAATTTTCAATCTTTCTGACTCATTCTCT 
TGTCTX3AGCCTATCTATAGVGCTITGGCATCTGATGCCATC 
TAATGATTCAATTTAAAGCCTGAATTTAATATGTCAGrAATCAAGTGGATTA 

3 0 CTCAAGAGTCTCCCTTrGAGCTCAAGCTGGCCTTAAGCT^ 

GGAGGGAGGGTTGTGACCATGAATCTCTGTGGGTCCCACTGTGGCTGCrTATAACCCACCAG 

TGGCTCACACATTTAATCCCAAGTCTCAGAAGCATGCAAATC 

CAACATTTCTGCCTTTGCCATTGTAGCCCAGGAAGTTGACTCTC 

CAATAGCTACAGGACTAAATGTTTCTGAAGTTTCAAACCCTTCTGTGAGAGAAGGGGTGT^ 

ATGGGTCCATGATTTCAACTGTAGGCTCTGTGTACCCTGTGGTTAGTTGTTCTGAGTTTACT

CATTATGTTTTGTl TTU l^L M r i T T TTAGA 

AGGCACAAGCAGGCAGATCTGAGTTTGAGGCCAGCCAGGTTTACAG 

TGGTGGATGGGGTGGATTTArrTATTTTATCAATATGAGTC 

TCTGGAACTAGAGTTACAGATGTTTGTGAGACGCCATATGGGTGCTGGAAACCAA 

4 0 TTCATTATTTACrTGTTTGTTCATTTGTATTTTCTAGACAAG^ 

CCTCTGTCTCCTGAGTGCTGGGATTAATCCAGCATGTGCGATTA 

TTTITGTTGGTGTAOAAAATTATCCCAAGGGGGTCrGTGTTTATGTC 

CTCTCTTAACAGAATTGCCCATACACCTAAGGATCGTTTC™ 

TCTTAAGTG CTTGTTTTACCAAAGTCTTGTAGCGTTGTTCATCCAGATGAGCCTCCTG CTTTGTCACAGCCTCATGTGTGTTTGTG 

GTTGTGCTCTTTTTGTTCTCAGGTGTCTTAGCTGGA

GGCAACy^GGGTCCTGGGACAGCTTCCTCAAGATCTGGAACTAAAGC 
TTTCCTTATATGGGTTATAGATTTACAGAGATCTTGTTGATCTGGCATTGTAGCAGATCTG 
GAGAACAGTGGGAAGAGCTCGGGTTCrTTCTGTGTGGACTGACT 
ACTGGAAGACCATTCCAACCTTGAATCXSTTACAGCGATAGC^TATCCT 

5 0 AG AACTTCAAAAGGG CAAG AT C rrrfTl' CCTTCACTT ATTG CTG AAAC CAAG AG CACAATTC C CATT AAG AG AAGGATCT CTGTG C 

T GT AAACT AAACAAATTGTGCATTTTTT CTGGGG CCATTGTTTTTG G TTT ATTTTG TT ATTTTG'ri''ri'GTT T TTG TTTT T TTGGTT 

TCATTTTGTTTTGGGTTGGTCCAATTTTAAAAG 

GAGAGACZACGCCTTTTTACTAGTTCATTTGCATTTAAATC^ 

GTTTATCTGGAATTGTTTCrrAAATATAAGTCATTTGTAGCAAGACTTTGGAAGCATTCTT^ 

5 5 TAGTTAAGAGAACCAGGGCCCACGGCGGTCTCTGACT^ 

TTTTTTTTTGTCTCATGTCACAGATGACroTTTCGCACA 
ACATGTTAGGATGGCCACGAATGCTTGGTCAGAGTCCTTCCTTCCCT^ 
CCTGTCCCTCCTGTCCCCTCACTGTCCACTTTCTC 
CACTAGCCGTCCTGCCCCTGCTGTCAGAGATTTTT^ 

GTATGAAACAATGTACAAATCAATGTTTTGAAAATAATGAT

TATTGGGTGGGGTTACTCTTAGAATCGCATGCTGTAGAAATGCT 

TTTAAGAAATAACCTCTTCAAGTTGTAACCGTGGCGGCTTTGTCCATTACTGTTGCTA 

GCTTTACACACTTGAGTAGTGGGATAAGTCACTGTTTTCTTTACTTTACA^ 

ATTCTGG AAGGGAGGGG AAAGCTG CTCTGCTGCCTCTTTGCCACTTCCTGGAAG C CTTCCTCCCTGGAAGAACAGCCTTTTGCATC 
6 5 TGAGGCGACCATTTTTACACCATGCAGATCTTCXSTGGTTATC^ 

TGTG^CCCTCCCCTCCCCCATATCCTGCACCCCCTCATAGCAGTAGTGG 



GCTCTCTGCACCCTGTACCCTTTTCACCTTTTGTATTTAATTTTAAAGTCAGTGTACTGCAAGG 
TATATTAAACTGTACTGTTATTTAAGATGTAATAAAGCAGTTTGACATGAGGGACTCTGG 

7 0 CAGTGTGTGGGGGTTTGCCCCAACTGCTGACTCAACAGTTATATAACGTCATGA^ 
ATGTAGTAGTACACACCTTTAATCACAACACTCAG^ 
CAAATTTTAGGACTGCCAAGC!AATATAGACACTGTCT 
AGGTCTGOkGAGCCACATGTTTATCACGACAAGTGGCCTTTCACCCT 
CCAGCTGCTTCGACTTCCTCTTCCTGCACTAAAATGTCTGAGGACCTGCCC 

75 TCCTTCTCTGTCGATAAAGGTCACTGCATCTJrfGA 
CCCATTCCACTGCAGTGCCTGCTTCTAAG

TTGGTTTTTTTCAAGACAGAGTTTCTCTTTGTAGAACTGGCTGTCCT 
AATCCGTCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGCGTGTGCCACCACACCCX^ 
ACATAACTGATCTTCyiGGCTCACATCCCCTAAACATTGACTGGCC 
5 CCAGGGTCAGCGGTGGCCCCTGATACCACACATTGTGGTGTGATGGGCAGGTCCAGC^ 
TCTCCTAGTCTCCTTGAAATAAATGTATTTTTCAACACACACATTTTC 
AACATTTGAGGTCCTrCTGATCTAGAAGGTGAAGCTCTGACCCTA^ 

GCTCCCCACTTGTTTAACAAGCAGTCATGGAAGACATGTCCACTTT 
1 0 TTCACAAATGCTCAGGCTTAC7VCATCTACATCCACCATTAAA 

GTTGCTGCCCATGGTGGAAGTACACCTGTTGAAGTGCACACAGGCAOATAAGGGGC^ 
ATCATTTTCCTCCAGCGTATGGGCTGATGCCATTCCATGACCCCAGGATTCCA^ 

TATTTG CCAAATACTAGAGGCTGGAATCCTGATGAAAGGCCAG CCGGGTCCAGCTTGTGCCGTCCACCGTTTCCTTTGGCGCCTAC 
TCTCTCCTGCCGCTCrCACTGTGCTCCTGTATTCTCAG 
CATCCAAGAGCAGCCAGCAGTCTTTCACCTCTGCCTCTTCACACAGGGTC

ATAGTCTGGCAGGAATTCTATGAGAACAAAGTGCCTGGTGTGAGATTACAGCTGGTGACTA 

GACATTTCTAAGCAC CCT AG GCAAG ACTACTCTGCTTGCTG T AT C C AAAACGTG AG T ACCTGG CAGCTGTCCCACTGATTTGG CAG 
AAGAAGGGATAAGGTTGCCCCTCACCCACCCCAGTGAAGTGACCCTGATGATGACCTCAGTATT^ 
ACTTTCGGAGAATGGACAAGATAACTAGTATCCCATGACAATCAACTCTCCTCTAAAATACACT 
2 0 TTTAGGTACTGGACAGCCTTCACAAAGCAGTTTATCCC^ 

CAAAG AGCT ACAAAGCCTCC CTTG ATCCCIXKTGCCTGAG AGCTT ACCT C CC CCCATGT G ACAAC CT AACTTTGG 

GCACAGGCCCATCCTGTTTCCTCCTTGTTCTTCCTCCTCCCTTATCATTTAGC 

GAAGCAAAAATGTGTGTAATTGCCTTGATTTCGTCCTGCTACCTCACTCATCCCATGTGACCT 

AGCCCAAATTAAACACTCTAGG AGTCTTTTGAGT ACC CTCTTGCC ATTCCAAG CCTTG ACCTTGACCTTGKjCGACTGTCTTTAGCA 

CAGTGGTCGGAATGAGCACTTGGTATCTGAGTCTTCAGAGTTTGATC

TCCTCAGAAATGGAGAAGCCACCTGCATAATCTAGGCCTTGCCCCCACTGTGTAAAACAGATTTCCCTAGC^ 

G CTCTTTT AATTT AAGG CT ACACAGGTT ACTTGGT ATGTGAC CTTG G ACTG CAG ACCAT AG ACTGGGTG C CTTTG CAGAAG AAT AT 

GACTGCTCCCACTCTGGATGTCrCTCCAGGTCTGCTGGCCATGTGCCTGCCATTTACCACT 

ACAATAGCAGCATAGTTACAGCCTTCXX3TGGGTGTGGCT 

GTTACAAAATTTAAGAAATCCCCGCAAATTCCCACCATCCTCCCTACCCACTATTTT

TGGAAGGACAG CACACAGGT CTCAGG AGTTCCC AC CAGAG AACATCCTG ACTG CAG CAGGT ATGGGGGGCTGTGGGG AAG CG TGG A 
GATTGGGCTATACAGGAAGGGCAATAGTGTGAACATCTTAAAACTACAGTGCTG^ 
ACTGTGCACACTGGTGGATATCAATGCACGAGCATGTCCACCAGTTCAGCATATCCC^ 
TCTTTCCCTCCTTACAGCTGTGGGTGAGGCACGCTATTTACACAGTCGCTTCTCAACT 

3 5 AGGGTCTCTAACTATGGTACCCCTGGCn'GGCCrrCGAATTCCTAGAAATCCGGAGCTCT 

TGTATCAACCAGCTGTCAGGTTTGTTACTTTCAGGCTCAAGTC 

GAGAGCCATATGAGCGCTGTCACCCGAGTTCTGCTTCTGAAAGATACCAGGTCCATCCGCT 

GGAGCCTGGCACGAAGCCCACCTTCCCACAAAGCTGGCCCGCGTTGTCCACATGGTCCTGGAGACCT 

GAGTGTGACGTAGGACAGGAATAAAGATCAGACCGCGAGGCAAGGAGGCAAAAAAAGAAA 

4 0 AAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAAAGGAGAGAATCTGGTGG 

CTTTTTCCCCCTCCTCGGTrACrCTTTTTTTTTTO 
CCCGACTCCTCCACCCTTTGGCCCCGCCCACTCTCCCGATCTT^ 



CCOVCCCCCCCGGGCTGTCTTCCGCTCCrTGGCCCCATCCC^^ 

4 5 NNNNCACCCCCCAGCCTCTGATCCTTGGCCTACAAAG^ 

CGCCGCTCTCCCCACGTGCCCTGCAGCGCATGCGTGCGGGCTGCGCTTGCGCACTGA 
CCGGTCTCTGGACCGAGCGGCCATGGGGTGGCGG^ 

TCGGCCCAGCATGTAGACGTCCACGCGGGCGCGCGGCAGCCGACX3GTGAGTGCGGGGCCTCGGGT 
GGCCGGGACCCXSTGAAGACAGGGTCAC^CGCCGGCCCTGCCCACCTGCCC 

5 0 GOTTCCGAACGTG<^TTGCAGCCAGGATGACGCGTGGTCTCGGGGAAGGAGCCTAA 

GGGTTTAGGC^GACGAGTCAGAGGTTCGGGTCCCC^GAGAAGCCATGTAGGCCGCCGTTCTGT 
ACCGCAGCCGAATCGGCTCCATTGGTCTTCCIAGTGGGGCCTTAGCAAG^ 
AGCGTTTTGTAAGCAATCGAGGTGTGAAACTCTCCAAAACCCCATCGTAAACT^ 
CCTGAAGAGTCCATGGAGATtjTTCCTAGAGGCTGCTGCCrCTCAGGATCTC^ 

5 5 TTTCTTCGTTATGGGAACAGGGCGTGCCCAGCGC^ 

TCCCAAGCAATGGGCTTTCTCCTGGCATTGATTC^ 

CAAAACTTATTCGTTTGGCTTGCCTCTTGAGAAGAGTGTTATTCAAACTTATATTCAT^ 

CGGGCCGG AAAGCTGGCATCCTCGCCTTGGTAATT AAAAG CACGG CACCTTCCCTCTG AACGTATTTGTC CTCAGCAGTGACTTCA 
CTGTGACTTACACTGCCCATCTTGGGCTCTC^ACAGCCACGGGTGCCAATATACAGTATGACCAGGTAGCTTCT^ 

6 0 TCGCATCTCIAGAATCCTTCTGCATT TTC TTCTTTGTAG^ 

ATCTTGTTTGAAGTGCrACAGGTCCAGCCC^yGATCCTCACACAAGTGCTGTCC 
TTGGGTAGAAGTGGCTAACAGATAAAACAGCAGTACTTAAATGAATAGTTTAGCTCT 
AAGTGGTGGTTTTTTCCTTTTTTTTTTAATGGGAGTGGACA 
TTGGTTTTCTCCTCCTGCTCCATGGGCTCCATGGATCAAACCCAAGACATC 

TCTCTCTGATCCTTCCTTTGTTTTTAGACATAACTTTATGTATGCCAAGCT

GAACTTCrTGTCCTGTATGTTCCTCTGCATGTTGGGATTACAGGCATGAACC^ 
TGTGATGTTGGGATGAAGCCCAGGG CTTTG TGTGTGCAAGCACTGTACCAACTG AG CT 
ITTr TTTTTCTTTCITTCTTTCTrr'rr T TTCCAGGGTGGAGGATGGAGTT 
AGACCAGGCTGCCCTTGAACTCAGAGATCCCCTGCCTCTGACTCCTGAGTGCT 

7 0 GTCCTACTCTTAATGCTTTTTTTTTTTTTTTTA 

CT CATT ACAGATGGTTGTG AG CCAC CATGTGG CTG CTGGG AATTGAACTCAGGACCT CTGG AAGAGCAGTCAG TG CTGTT AACCAC 
TGAGCCATCTCTCCAGCCCCCACTCTAAGATTTACTTTATTTTGGATTT 
GCACACGAGTATGTGTGCCTTGGAGGCAC^AAGAGGGTGTCAGAT 
GTGGGTGCTC^GAAGGAAGCTCAAGTTCTCTGGGAGCAGCAACJ^ 
7 5 TGG ATTCATCTC CATCTTT ACCTTG AATCTGT CTGTTTTCTGG TGT CGCAGCAAG AGGGAGATC CGTG CAAAATG CT CAG CACG CG 
GAGGAGGAGCCITGGGAGG ACAAG AGGGGTGGG Crr.L'GAT CTCAGGCTCTG CTTTCT AGG ACTTG TGAGAT CTGTTT ACAGTT ACC

ACAT<X5CTCCTGCCCTCCTGAAGGCAGAGGTGTCGCrrTCAACCCCAG 

TCATTGTCCTCTATTCTGATCCATGGCACCTGTCCT 

AAGGCAGAGACGGTACTAGCAATGTACGCCAGTGCTGTCCACGGTGCAGTGGAAAGGGCA^ 
5 GACCAGAGTGCCTCTCTTGGGTCTGTGAAACTATTCCACCACTAG^ 
GCACACTTCTCTGACCTGTCTGCTAGCTTGGGTATCATTTATTTC 
CTGTGGCTCACAGTAGTGATACCATCATGTTTGATACAGCAACACGTTC 
CAACTTC^TATTAACTAATGAAGTTAAACATTT^^ 
TCCTTCATTCCCTCTATTTTCAAAGACrCAGACT^ 
1 0 GGTACGCACCAACACACCTGGCTTCGCTTTTCATTTACCGCTTCTAC^ 
CG'nTCTTGGGCTGACTCTCTGTGTGTTCATGCCTGCCTTTGGTT 

G CT ACTCCATGATGGTG ACT CAGTTC CTTTTC CAAG C CAGCCAAGCT ACTCAACAATGAAAAAAATCC CAACAGGG AT AACT AAGT 
GCACTCACCCCCCCCAGCTACCTCAGCACCTGAT 

GAGTTACCTAGGGAAAGGTTGGCGACGAAAGAATTAAATAATGGAATTAAATAGAATTAAAAA 
1 5 AGTCCTAGCAACCACCCGGGATTCACTCCCAGGGACCCTCAGGGTAAGAGGAGGGATT^ 
AGATTGTGTCATGCGAGTGAGCATGTGTGCACCAACCCATAAGTAAATACGTGTATC^ 
TCTTTTAAAAAAAAATTGGGGGAGGGGGTGTTGGGATGTTCAAATCAGGGT 
ATCTGTAGACCAGGCTGGCCTCACTTTAGAGAGCCACCTGCTTCT 

HUMAN SEQUENCE - mRNA 

ATGGGCGGCGAGTGGGGAGCGGGGCCGGGAGTGGAGCAGCCGCCGCGGCGGACTGGACCGAGCCTC^ 
AGCGCCCGCGGACGCGCAGCGCGGCCCGAGCGCGACGACCTGCCGAGCGGCGGCCGAGGCGGCGGTGTGGGCGCGTC^^ 
CGAGGGCGCTGAGACAAATTTAC!ATGTATTGGAGACCAGACCAGAA 
TGAAGAGCACTAAGATCGGAAGATGAGTGAGCTTGACCACTTACGGCAGGAGG 
GGAAAGCATGTGC^GATGCAACrCrCTCTCAGATCACAAACAA 
CTGCGGGGGCACGTGGCCAAGATCTACGCCATGCACTGGGGCA^ 

TATCATCTGGGACAGCTACACCACCAACAAGGTCCACG CCATCCCTCTGCGCT CCTCCTGGGTCATCACCTGTGCATATG CCCCTT 
CTGGGAACTATGTGGCCTGCGGTGGCCTGGATAACATTTGCTCCATTTA 

CGTG AG CTGGCAGG ACACACAGGTT ACCTGTC CTGCTG CCGATTC CTGG ATGACAAT CAG AT CGT CAC CAG CTCTGG AGACAC CAC 
GTGTGCCCTGTGGGACATCGAGACCGGCCAGCAGACGACCACGTTTACCGGACACACTGGA 
CTGACACCAGACTGTTCGTCTCTGGTGCTTGTGATG 

ACTGGCCACGAGTCTG ACATCAATGC CAT ATG CTTCT'l'l' CCAAATGGCAATGCATTTGCCACTGG CTCAG ACGACGCCAC CTGCAG 
G CT GTTTG AC CTTCG TG CTG ACCAGG AGCT CATG ACHT ACTC CCATG^ TCT CCTTCTC C A 

AGAGCGGGCGCCTCCTCCTTGCTGGGTACGACGACTTCAACTGCAACGTCTGGGATGCA 
GCTGGGCATGACAACaSCGTCAGCTGCCTGGGCGTGACTC^ 

GATCTGGAACTAACGCCAX3TAGCATGTGGATGCCATGGAGACTGGAAGACCATTCCAACTTGGACGCGTT 
GTACTAACGTG ACIAACCCTACACCTCCCCTCAGAACTT CAAAAGGG CAAGATCTTTTTTCCTTCACTT ATTGCTCA 
AACCAAGAGCACAATTCCCATTGAGAGAAAGATCTCTX3TGCTGTAAACT 
TGTTTTCTTTTTTGTCTTGJUVTGAATTTTA 

GACAGACACACTTTTCCACCAGTGTATTTGAATTTTAGACCAGTGACCCTGTTT^ 
TGTTCATCTGGTCATCGTGTCCAAATTTCAGTCATGTTTGTAGCAAGATTTTG 
CTTTGTGTTCAACAGTTAATCAAAACCZAGAGAGTCTAGGGCAGCCTCTCTGATGTTGTCAA 
TAATTTTCTGTCTGATGTCACAGATCATTGTTGCAC 

AATGCX3GGGACGGTCTCAAAATGCGTGATCAGAGAATCCTTCACCTTATGCTC 
CCTGACCCATCCTGTCTATCTTCTCAGTTGAGTTTTTAATCTCACTT^ 
CTAACACTACCCCACCCCGftACTAGGAGGAACCTCT 

GTGTATGAAACAATGTACAAATCAATGTTTTGAAAATAATGATCTCAGACTTTCTAAGTTAAA^ 
CCATATTGGGTGGGTTTACTCTTAGAATCGCATGCTGTAGAAAATC 
TTCTTTTAAGAAATAACCTCTTACAGTTGTAACCATTGCGGCTCTGTC 
GTACAGCGCGCGGCTCTACACGCTTGGGTAGCGGGATAAGTCACTGTTTTCTTT 
AACX»CTGCTGTTGGATTCTGAGGGTGGGGAGGGAGAGAGAGGGAGGGAGAGGGAGTG 
TCACGGGCCCTCCACATCTGAGGTGGCTCATTCCCATCACACA 
GCGTTTGGAAAGCAGGTTCTGTGGGACCCCCCGCCCCGCCCCCCGCACTCCT^ 
TCTGCAACATTCTATACAAAACTGTGCTGTGACCTTGCGGTAGGCCT 
TCITTCCGTCCAACAACTCTGTAGAGCTCTCTGCACCCrTACCC 

AGGAAGCTGGATGCAAGATAGATACTATATTAAACTGTACTGTTATTTAAGATGTAATAA^ 

HUMAN SEQUENCE - CODING

ATGAGTG^GCTTGACCAGTTACGGCAG^AGGCCGAGCAACTTAAGAACCAGATTCGA 
TCTCTCTCAGATCACAAACAACATCGACCCAGTGGGAAGAATCCAAATGCG

TCT ACGCCATGCACTGGGG CACAGACTCCAGG CTTCTCGTCAGTGCCTC^CIAGGATGGT AAACTTATCATCTGGGACAGCTACACC 
ACCAACAAGGTCCACGCCATCCCTCTCCGCTCCTCCT 

TGG C CTGG ATAACATTTG CTCCATTT ACAAT CTG AAAACTCGTG AGGGGAACG TG CG CGTG AG TCG TGAGCT GG CAGGACACACAG 
GTTACCTGTCCTGCTGCCGATTCCTGGATGAC^TCAGATaSTCACCAGCTCTGGAGACACCACG 
65 ACCGG CCAG CAGACGACCACGTTT ACCGG ACACACTCG AGATGTCATG AGCCTTTCTCTTGCTCCT^ CGTCTC 

TGGTGCTTGTGATGCTTCAGCCAAACTCTGGGATGTGCGAGAAGGCATGTGCCGGCAC^ 
ATGCCATATGCTTCTTTCCAAATGGCAATGCATTTGCCACTGGCTCAGAC 
CAGGAGCrCATGACTTACrCCCATGACAACATCATCTGCGGGATCACCTC^ 

TGG^TACGACGACrTCAACTGCAACGTCTGGGATGCACTCAAAG CCGACCGGGCAGGTGTCTTGG CTGGGCATGACAACCGCGTCA 
GCTGCCTGGGCGTGACTGACGATGGCATGGCTGTGGCGACA
Table 53

MOUSE NOMENCLATURE 
ICSGNM Ccnd2 
Celera mCG13070


HUMAN NOMENCLATURE 
HGNC CCND2 
Celera hCG24964 

MOUSE SEQUENCE - GENOMIC 

GGAGCOU^CAGCCCCTTTTCTCTACTTACAAGCCAAGCAGGGG 
ACZAGGCAGATGAGGGGTGGGCACACCTCTCCTGACTGCTGGCCAGC 

AGAGATGTTTAGG AAGTATACCAAATACAATAGTTT ATTG ACCACA CACACTG AAAAGCG CCATGTTT 

GTAATTCTAGGAACAGGTATCTTGCTGGCTTTCCCTGAAGCAGAGAGTACCCA^ 
AATATTGAATGAGAAAAAAAAATGGGACCTAGATATTTCCCCTGCATGTCGCTAGATACT 
AGGGNNNNNNNNNNNNNNNNNNNNAAGGAAGGAAG^ 
CTAAATACTCCAACGTGGTGGAATCTGCGCCATTCTTTO 
AGCCTCCCTTAAAGAGGCTCACTCAGGCTCTAAACJTCCCT 
AAAACAGGGGTATATGCTCTAACAGGATCTGTCCTGAACATGCC^G^ 
CATCCTCATTCTGAATATGCCTTTCAGACTTGACT^ 

CAGX3TGAAAATGAATAAGAAGACTCCAGCTGTCAAATGAGGTTAAGAGTCT 
TTAAGATCITTATATAAAAGAGAGACTACACAGACGCTGCCGATGCTCTGTGGATTGGAGT 

TGGGTG ATGGT AAGGGTGGGTGATGGTAAGGGTGGGTG ATGGT AAGGG TGGTG C CT ACAG AGAG CTG AGGAGGG ATGT AAAG ATC C 
CTTATTACAGCCAGIX^ATTCTGTGGATGAGCAAACTCT 

GGGACACCAGGAAGCAGAATAGGAGACCTCTGACTTCCAAGGTAGCGCTGTCTCACTTACATG 
CreATGACATGCAAGCTACTAAGACITTCTGCCAGGCCAGG 
CTGGGTTCIAAITCCAAGCACCTACGTGATAGCGCCCAACT 

AC ATGTGGGCAAAGCAC GAAGCACATGAAATTAAAATAAATTTT AAAAAATG T AGTCTT AATTTAATCTTT AAACAGGG TTGG CTG 
TGTCTGTATGTTGCAGGATCAGCTATTTTAACCTCAAGTC^ 
CCAGGACAAGCAGACACTCCGAAACACACAGCCACTCACCAAAAAACCAGGAC 

TCTTT AACTAGT CTCAAAGG ACTAG TTTTCAATGTTTC^ C CCAT CCTGG TCT CTG CTG TGCCT CT AACAG ATT ACAAATT A 

AGAACATGTTTGCAGAGTGGAATTTTAAAACCACK5CATAACCCAGGTAATAGTTCCT 
TCTCCCTGGTACGAGTTACTCAGGTCAGCACAGTGTCTCCrTCTAAACCCTTTGACCT 

TTGTG CATGTCAGG AAAAAAAACCACTCTATG AAACAGCT ACT CT CAT AC CCAGACTTT ATTATTTTAAAATACCACT CATT AT CT 
AATATCTTAATTCTTTCCCACATGTAATCACTAGAGAATGCACTACTTGACTCCAGGA 
TCTTCCAACACATTTACTGACTGTCTATCAGCTATATGCCCATTA 

AG CTAT C CCTGAGTTCAG CTTTT AG AAAG AT CCAGTTTT AATCAAAT AAATG AAGTG CAAAAG C CAG ATG CAT AG AAATCTCTG C C 
CTGTCAGCTCATCCATTAACACCACTCAAATTCTACCTTAC^GTCCAAATTAAGCCG 
GACCTGTGATGTTATCAGCCCAGCACATATGCGATCTCTCAGCA 
CTCCTGAACTCGGGGGCATATTTGATGACTGCCCAAAACCT 

AAGGTAAGTATGGTTTTCGTCTCAACTGTAACAAATTTCATGAGTGTTCCT 
AAAATCTCTG^TGATTATACTACAAAGAGTAATTATGTGAACGTTTCTCGTACTGA^ 
CTGCCAAGGAAATTTTAAATTAAACACACTAGTCTATTCTCAAAAGAAAACAGAAGAATGT 

4 5 AAAGGTAGTTCTTCTTCAGTTTACCATGAGACAGTGGGGACCAGGGGCTGCATCTGAC^ 

GTTTTTTTTTAAAGATTAAGATTTCAGGGTCCTGAAACAATTGGT 

CTGGCAAATCAGCCCAAACCCTTTGCCTCACAACACCCTTTCTC 

TTCTACTGACTGGCTGCTCTCAAGCAAACGGAAACATACCTGGGCATGATGAAGG 

5 0 CTCIT ACT CATTTAAAATTCTGTATGGG'rrrr LTTTCTTTC CTTT CT C CT CTTCAT CT AGGTTG TGGGTGACTTAGG TTT CAAAAA 

TTGTCAGAACTGAATGTTCTTGGTGCTATGCCCTCCAACAAAAACCTAACCATATGGTGAT 
AGGCCCCTACATCCAATCACTX3TCAGACCGAGACTCTCTCCTATCCCTTGTCACCATGACTACCATAACTA 
TGAAGAAGCTGCCTCTACATGTCACAAAGTCATCTGACITACAGCCAATAG^ACCACA^ 
AGCAGAGACATATTTGAATTTTTTATTTTTCTGCTCT 

5 5 ACATAGACACTGACTGTCAAGGTCATGCAAAGGCTGATGCTCTGACTTAAAAO^AGCCAAGAGGAC 

CCGAGCTAGTGTGGGCTTAGCTTAGCATACAGTCACAAACTGCAC^ 
TGTACATTTTGAAGAATGTAAGACTTTGAAAACACCCAGCTTCAAAATG 
AAAAAAAAAAAAACAATACCAAGAAAATAATTTTAGAAGTAAATTTCAACCTG 
TTGAAGAAGAAAATCATTTTTCCTTTAATACCACTGA^ 

6 0 GGGTAAACTGTGCTAACAAATGCTGACAGTTCTTATGCTTA^ 

TTTTATACGGCTGCAATTTGGTTGTAGCTGATATATTGATACATAGCTCAATAATTACA^ 
AAAGTGGATGTTTTTTCAGAGGGGAAAATGTCTGCATTTGAACAAAAGTAA 
TTTAAAAAATCTGGTACGGGTAGTGTTAAATGTTTTACAGAGAGA^ 
TTAAAGGAGACCTGTCi tttAGGTTCTTA 

6 5 TCTTTGTATACACAAAATAAATGTTTATCACATTGGTCACCT^ 

AAAGGAATTTATTGACATTTTTATTAATTATAATTCCACCCCT^ CTTGGATCAG ACT AGCA 

ATCATTAAACGAAAAAGAACTAAATAATAAGTTAAAAATAGAGAATGTGTTCAAAGGAAGCAAGAAAG 

TGTCCTCTTCJU^TAC^CACCTTTCCJJIGCCT 1 rX'TGAAACAAAT AATT C CACA 

TCAAGTCCATTTCAAAAGAAGCCCCTGGCTCCTCCCACAAGCCCCTTTACCTOT 

7 0 ATTTTrCTCCCCAGCACCCACTCTGCAGCTCCTTCAATGTGAG 

CATTTTAGCCCCCATGCCTCCAAACTCAAAGATTGAAAAGATTT^ 

TAGTTTTCAACCCCTGCAGTCTCTCTAGAAGGTTCTCCACAAG 

CCTGCACTTTCGCTCATCTCTGGTCTTCTCACAGGA^ 

TTCCCATACATCCCCAGGAGCTTGTGTGTCACCACCATAAGTAACGAA"i~iCAGCGTATGi 
AATAGGTCGCGCGTGAAAAAATCCCGCGTGTGA^

CCGGGTGGAGACItKSCTTAGGAATTCCCGGGCGGCC 

CTGGACCGCAAACTAAAGAAACCAGCCTTTTCTGGAGCAAACGG 

AATAAGTGTCTAi3GGAGGGGGAAACCATTXX3GGNNNN^ 
5 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNHNNNNNNNNNNN^ 

tWNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNNNT^ 

AGGGGGAGATGCGAGGTGGGTGAGCCGCCTGCAGGCCCT^ 

CTGGGTCCGCACCGACACCCACCCGAGGGTCGCTACGATGCACAA.CCTGGGA^ 

TGAGTATGCTTTTTGCACTTTAGGCCGCTTGGTCCCGGTGTGGG^ 

CCCTCCC CTXTCTI t3CAGTTTAAACCGGGTTTTCATGTCTTCCCCCACTAATC^ 
1 5 GGTCTGGGAGGTTTGATTGCCTGACCACCCATTTGTCCATGAAGTAACACGAAC^GTC 

GGCGATGACCCCCAGTTCCTGGACGGAGATGTCCCTGCTCCGAGGTGACGGGATCCGAACCAGCAGC^ 

TCAGGGTCACCTGGCTGGCTGGTATCCGCCAGGTGAGCrTGTT 

T AAACAC CCAACACC CGG CTG CACCTT ACACCT ACAGGCTTTG T ACACITT CCAACCGAAAC^ CG CAAGAGGG AGGG AGA 

CGGGAGGGGGCACGTCCTGGAGGGATGGAAGTTCGGGAAAACTCGTATTTG^ 
2 0 TAAAGGATCTTTCCACCTTCGGGGGTTTAAAAAAAAAAGTAGTGGTGG 
AGCAGAGCACACTCGTGCTTCTGCACACCCGAGTC 
ATGTGCTAGCTGGCG CACTTTTTATTT AAAAAAAGAAG AAAGAAAA 
CAGAGTGGCAACCGACAGAGAAAACACAGTCCCAGTCCCAACCTCTCCAGGGACGTGCT 
ACCCCACGTGCGGACTCGCAGCGACGTTCACCTTCGGGAACCCTCATTAGTAGGAGGGGGAGG^ 

2 5 GGGCCTCTCCTGCAGGACGCGCTTTAACTTTCCCATCTCCAGACAGAGG<^CCT 

AGCCCTCCCAGCCTCAGGCCACCTCTCCCACCCCCAAGATCCCCTCCCCGCTGGAATTTCACT 
AAGAGGCGAGGGGCCCTCCGCTGGCTCT^X3GAGAACCTGCCA 

CCGCCTCTTTTCCGCCCCCGAGCTCCGGATGCTGTCTTC^CTGGAGATGCGGAGGAGACT 

CGTACTTTGTGCCCCTCTGCAGGAGCAGTGTCAGATCT

GTCATAAAGCATAAATTCATATATCTGTGCAGGATAACACCGAGACCTCTATTCGGGAAACCC^ 

ATGGACAGAGTGCACTAGACAAGCGTCACCTTGGGCGCAGGACGGTTCCG 

CCCCCACCCACCTCCGAAAGCTAATGGCTTCCAT^ 

AGTAGATTGGATACTAATTCCCGCTGCC^TTGATAGGGTCCGAGGAGCCCAGGCCCCTGTGCGGGCCTAGCCCXSCCCGGGTC 

CTCTCCCTCCGCCCTTCCCGGGCTTCCTTTTGATCTAGCGGAGAAGGCTTCCTACT

AATCAGCGCGCAGTAGCIAGGAGCCCCAGAGCrAT^ 

TTTTAAAGTGATATCCCCTCCCCCCGCCCACCCCTTCCGCCCCACCCTCGTTTAAAGGGGCrGGC 
TGCAOINNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

AGGAAGGGTGGCCTTGCTGGCGCAGGGAGAGCCAAACCTAAACCCTCCCT 
AAAGCTGTATCAATGTGGCCACGCTCCGCGCAGACACCTAGGGCGGCTTGTCAGCAGATC 

G CGTGG CCG AGGC CCG CGG AGG AAC CGCTGCT AGC CCTG C C CCCGGTCTGGGGC CG CT CTGAG AGGCTGGC CGC GTTGCC CAATGG 

4 5 CGCCCCCAAGAACAGAAAGGTTTCTG CAGGAGGGTCAT ATTCTACTAGGCTT AAGAAG CACCCCTTTCTCCAACATCCACCCCTTC 

CTCCTCTTCCTCTCCACGCACGTGGCTCGGGGCGGACAACAGCTTGAAAG^ 
TCCCTGGTTTGGCGAGGTTGCAATTTTCrCAAATAAGCCITTCCT^ 
ATCCACATGGTAAATCTATCTACCTGAATCTTGAATAATGGGCTGTTT^ 
ACACGTACACACCCTTTATGCCCCCATGGTATGCCTACAGAATG^ 

5 0 CCCCTATCCGAGGCCCCTAGCATGCGGGGCTGGATGGGGAGAG^jGCCT 

GATCGTGTTTGAAGTTTGGTCAGGCCAGCTGCTGTGCTCCTTAATAACAAGAGG^ 
AAGGAGGGGAGGGACGCTAGAGGAGGGGAGGAAAGGGGGAGGAGGAACATGAGAGGGGGAGGAG 
CXCTCACTCCATGAGCGAGAGAGGAGAGCGAGCTGAG^ 
GAGGCTCCGCTCGCCCACCTTCCACTCITCTCTCTCTCrCTCCCTCTCT 

5 5 CAGCCAAAGGAAGGAGGTAAGGGAAGCACTCCCCGACCCCCCCGCACCTCCAAAAAATAATAATAATAAAA 

GAAG ACCG CGTTCTGCAGAACCTGTTGACCATCGAGGAGCGCT ACCTCCCGCAGTGTTCCT ATTT CGTGCAG AAGGACAT 

CCAACCGT ACATGCGCAGG ATGGTGG CCACCTGGATGCT AG AGG T AGGTGTG CGGG CG ACG CTCTCAGGGAGAGGCAAG CCCAG C C 
GCTCCATCCCT ACCTCCAACACGCAGAG ATCCTAAAAACTG CGGGAGAACTACCCCTG CGCTGGCCTCCCC CGCTCGTGCGCGAGG 

6 0 TAGCCCCGCACCCTTGGGCGAGACGCTAATTCTGTCCCTTGCrAGATTAACTAGTGGA 

CCCCCCCCCGGGGGGTTGGCATCCCGCGCGTX3TGCCTGCATGTGGCTT 
AAGCAACCCCGGCTGCCTTCAGTGAGGCCCCAGAGGGTGTTGCGACCTAACAATCCATA 
CCCCCGACTCTCAGGGTTGGGGCAAACCGTCrGAGCCTTAGGACTCCGCGTTGGCACTTTG 
CAGGGAATTCAGGAGAGCCCCGGAAGTCATACTGAAAAAAACXSTGCGTTTAGGGCAATCTGA^ 

6 5 TGCAATCTTTGCCGAGAGAGCAGGGTGCCCACGT^ 

CTTCCCCAGCCCTCGGTTCACCAAGGTTACTTACAGTTTGGCTGCT^ 

GGAGGTGTGTAGAGCAGGCTAGGGGGCACCCCCAAATGAGAGGTCAGAGTCCCGCATGCCTCCTGGAGG 
TCTCACCCAACCCCCAGOVGGCACAGCCTTCCTAGCTACIGCCCTTGTGGGC 

7 0 GAT CT AGAG CC C CTC<5GG TGGAG ATAC CGACCTT ACGG ACACCCGAAACCT CTTTT C CAGGGAG CCTGGGACCT CT CTGGGGTTTT 

CAGGCCCACAGAGCGTTTCTAGGATTTTCCCTCGATTT^^ 

CGGCCGGCCGAGGCGCGCTTCCTCCTCCTCCCTCCCCCTCTCTrCT^ 

CACCCTGGACATTTTTTTQ^TTGTCGGGAATGGTAGAATAGG 

TCCAGAATATTTTTATTGATTTITTGAAAAGATGACGAAATCCAAAAAAGAGTGAGTTGGTG 
7 5 CGGCAGGCGGCXXKrGGCTCTGGGCTTTGCCTCGGGGTTTCTGGAGG 
AACCCCAGGAGCCCTTCGGATTTACCCCACCGTTCTCATCTCTATTCTTACTATCCTTTCT

CATAAAAAATT CTl ' T TT G TTATCTCAGATTATGC^ H IT CCCTCTGTTTCTGATACTA AX I ' l ' l ' lt XCTATCCCCCT 

CCGTTTCTTGGCTGGAGTCCCGACTCCTAAG^ 
5 AGACCATCCCGCTGACTGCGGAAAAGCTGTGCATTTACACCGACA^ 

CTGCTCCTGACTCTGCCCTTCCCCATCACACTAAGGCCGCCACACTAACCAAGACTCT 

GTAGGAGAGAGCTAGTCTTCCCACATCAGGTAGTGAGCAAAGCTATC^TGCCT 

GTGTGTGGGTGGGGGGGGGGGCATAAATAGAATATTCTAGTTAATGATGTTATAATTA 

ACACGGTCTGATTrCTCACATTCTAGTTTGT(j'l''IXj'l 1 T CCCAAGACCCTGGAGGTGTAACAGGTAATGTCTTAAAGACCAATAAAG 
1 0 TCTCATGGACAAACATAGTAGCTGATCACTAAGTACCAACA^ 

TGCCTTTCTATTGGTGGATCTGAAGCATGAGAGTTCAGCTCCAAAAGG 

TGGTTTTCTCCCAACCATCrrACCrrTITTTTTT 

CTCTCTTTTACAGTGGTTTGCACACCTrcCACT 

TGGG CCTG CTT AAT CT CCTGGGCAAAGACTGG C1TCTTG AACTG AGCC CTTTGTGAGG AG CCGTCL'riTCTGGGTCTGTG CTGCCA 
1 5 CCTAGTGGCAGACATGTGOVAGGCTGGGGGAGGAATTGCCT 

GAGGACGAGTTTGTGCT CACATTG C CCTGGAAG AAGGAAGGGTTTGGG ATTCAAGC CATTTCTGATTTTT AC^ TTAAGGAATT 
TTTTTTTTTTTTATCCTAACTTCCTGCAAAA 

ACTGGAGGTATTTTTCAAGACACCCTAATTTTTCCAGGACCGGCAAAGCT
TCTCCTCCCATCCATAAGTTATGCACCCTAGAAGAACCTTCCTATT^ 
GGTCCAGCTGACAGGG<3CTTTATAGCCGGTCCCCAAGTTTAGCAGATGGAGC 
TAGTAGCATCCTGCGCTTTTGAATCCTACTGATTCACCAGGATCATTTCCl'l'T 
AAACAAAAATTGGGACCTTTGCATAATTTCAAGCTAGCCT 

2 5 GGTGTGCATCrCTTTGGGOTATTATTATATTTAGTCCAGTTCATTO 

CAAAGTAGCCTACTAGTGGGGTGCAGTGGAGCCCCCTAAAAGAGACCGTTACAGC 
TCCACATTGCCCCAAATCATAGTGTGCAAAATGGAGTCACAAGTGAC 
AGATTTAGGTGAAAGGGTGAGTOTCATATCCCAAGACITCTTCATGX3TGTGTCAAAG 
TGATACTCTTCCCCC^CAAGGAAAGGCCAGCGAACTGGCTGCCAAC^ 

3 0 GCATCA.TTTTTTTCCATG AGG AGT AAAGAGACTT AGC CAACAAAT^ CAC AGTGG CCTT AG CTAGATCTTACT A 

TAGGCCCACCTTTCTGAGTGGGCAGGCAAGAACATTTTGTGGAGTCAG^ 
TGGTCTTGGCAGTAGTTACCTGGCCTCTCATTATTTTCAGTGGAGCCT 
AAAGCATGAGGGGCACAGTGACAGATCAGGCTGCAGTGGAGCC 
CCGGGCCTGGAGTCAGTCTATGGATTCTTAGAGCTCTGCCCTCTTCCTCCCACC 

3 5 AAGCTGAAGTGGAACCTGGCCGCAGTCACCCCTCACGACTTCATTGAGCACATCCTTCGO 

GTCCCTGATCCGCAAGCATGCGCAGACCTTCATCCCTCTGTGCGCTACCGGT^ 
GGAGGTGGGGTTGGGTGTCTAGGCAAGTCrGAGTAAGCCAGCCrGGAGCCTCGCTCG 
CTCTGCTTCACTCGTAAGTTCTTTATGAGGGCAAGTATGGTAAG 
TAAAGTGCCCTTTGAAGATTATAGATACTGCrrCTCTGGTTGTGTTTAAATCrT^ 

AGCCCTAAAGGGAATTTAAGGAAGGTGGGGGCAAATATAGGTTAAA



CCGTTTATTCTGAGCTCTGGAGAGGATTTTTGATACATAGCTTTATTGGCTTCA 

TCAACCTG TGGGT CGAT ACAT C CCACA.GGG CTT ACGT ATC AGAT ATC CTGCAT ATCAGAT AT C CTG CAT AT CATAT ATCT ACATAA 
GAAATCACAAATAGCAAAATTACAGTTACAAAGTAGCIAACGGAATGGTTTTATGC 

TAACGGGTCACAGCGTTAGGAAGGTTGAGGTTGAGAAGCGCTGCCCTAGAACCTTGGGAATTCG

TCTCCTCAGGGTTAGCAC^TAAAGAAAGAATTGATCATGAGTGG 
TGATAGGGGGTGGGGGGAGGTGGAGTGTAACATTTTAATTCCTTC 
ACAATAGGGCCTGTGCTTTGTTGGAAGTTCAGAGGTCAGGCAGACTTCGTTACT^ 
CCTCTCTGAAGTTCATCTGGGATCTGQ^AAACAGGCCTTTGGATC 

5 0 CCACAGAAAAGCAAGCCTGGAGCTTCCTGCGAAAGTTCACTGATGTTGAGAATG^ 

AGCCTCTCACTGGCGTGCCGGArrCCATCTCCCTGTTTTAGAGGGGTGTGTGTGTGTC 

ACTTGTGGTGT CCCAGCTCCTTCGTGG AGTCCCCTGCTGTTCCTACZAGTCAGGG AAGAGG ATG^ 
CTACGCTTTGGACAGCACTTTCAGCrCCCTGCT 

GACACCTAACCCCCCGGACTCATAAATTCCCACATCACCCAGCnXaCCAGTGGCCAGAGTCCT 
CAGGTCTTGGGGGGTGGGGGGAGGCTTTCACTTCTTCCTCCCTGATCTTGCACTGGGGTGGGGGT^ 
CXXK5CTTGGATCGATTCCrrCC!AATTCTGTATAGACCAC^ 
TTGTCCAAATCCTTGGGGCAGCTGCCAACTTTGGGGCCCAAATATGGAGCCT 

AGGTAGGGGCAGTCAGAAAGATGGGCCTGTTGGGGGCCTTGAC

AAGTACTGCTCCAGAGG<^TGGACTGATGAGAAAGTGGCGGAGGT^ 



TXMTGGAGGGGAGTGTTGGGGCCAGCAGACAGGCAGTAGGCTGGACAGCCTAAAC 

CTTCTGGGG CCAAG ATCT AGCGAAC CCAAGTT CACTAG CAC CCTTG AAT AAT AG T AGGCTCAG AG CAACTCAAGG ACCAAAG ATGG 

6 5 GAGGGTGCTCTTGTGTGCCCCATCACCTCACACCCTCAGTCTCTTAACTCCAGTCCTGAATA^ 

CCAGGGTTTTGAGGGTTCACAAAGGCGTGGGAAGGGGGCAGTCCCre 
GTTTAAGGGAAAAGCCAATTGTGAAAGAGCTGGCAGTCCCTCCGAGG 
GCGTGGGCAGTCOSGGGGCCCAGGGAGGGAAGAGTGTGGCCGACCAGC^GTCTC 
AGGGCyU^GAGGTGTGTCCTCCTrrCCGATCCAGCCTACAGTGGTTTCTGAGAGAGAGGTGGTCT 

CCCCGCTGGCCTTTGTGGTCGAGGAATTTTAGGTGGGGTGAGAGTGT

GCAGAAGTGGACTTCTCTCTCCrGGJyGGGCTCCAGGTCAGCAAAC^ 
G CT ACCATCAACCTT ACCAT ACTGAGCACATG ATTT C AGTGGT ATTGAGTTT AAAC^ 
TTGCATTGGGAGTATTGCTGTGGCTGCTGGACCCAACTCCCTGTC 
TTCTCTTAAAGGTGTTCATTTGATTTCCAGACAGTGGTGGCCCTXIAA 
7 5 AGCCTCTCTGGTGGCCAGGAAGGAAGAATGAGACCCACATTTCC^ 
GGCGGGGGGGGGGGGGGGAAGAGTAGCTTGTCGAGCTGAAGTTTCTGTAACTGATAT

KNNNNNNNNNNNNNNNNNNNNNNNNNNNNNra 

CCCTTCTGGGGGGTGTCAGATGTATGGATGCTAAGTGAC^ 

TT CTTGAAACCTG CAC CT CAC CACAGGTTGAAGGAGAGG AAGAAG CCG AG ACATCAG ACTTC CATCACAA 
5 ACCGATCTCCCTrcCCCTAGTGGACAACAAGAGTAAAGCAGCCTCCGGGAG 

CTCCATATTAATAAGCTCATTGGGCAGGACCACAGCATCCCACTTTTAAATCCCATCAGTC 

CATGGTTTTTTTTTAAAGTGCAGATTCCAAGTTGGTGGTT 

CirCTGTGGAGTCCrGGATATTTGGTACCACGAG 

TTTTATGTTTCCTTTCGTGGCATGTAGGAGAAAACGT^ 
1 0 TCCAGCAC CC CTAGTGAGTG AG CAGTGGCACTGAG CAGTCGTCTG ACACCCCAAG ATTC CCAT CCTCTGT ACTGGGTCCTTTG CCT 

GCTCCCACCAGGAGGGAGAAGGTCAGGCCTGAGGCTCCTCATAGGAGCTTCCAGGCT 

ACTGGTGGCAAGCAGGGTGAGGGTTGGGAGCGTGGGTGACCX3ATGTCTGGCATCT 

AGTTAATAGCATCGGCTCTTGTAACTAATCCTCAGGATAGCACGGTGGGAAAAAAAA^ 

TCAAGATAGGGTTTCTCTGTATAGCCCTGGCTGTCCTGGAACTCACTCTGTAGACCA 
CTTCTGCCCCCTGAGTGCCACTACCACCCAGCAAATTTC

GCCAAGTATGACACACCAAACTGGGTTGTACTG CTGGGT AGTAAACTGGCTACCCAGCCATAAGAAACTG CTACTTCTTGGGAGCA 

CCTGTTTTTCCACTGGGGAAGGAGTAGATGTGCGTACTGGAGCATGGAATAAAGAGCC^ 

TT CAGTACTTAATAAGGGAAGAGGT AAATACCCCCTCCCCGG CCTGACACCTGCACCCCAGGTCAAGCCGC^CACCCCTTCTGGCT - 
CCACATGCTAACCCTGAGTTGGTTTTGTTAGCAGTGTGAC^ 
CAGCACGAGGGAGGGAACTCTTGAGAGCACTTCAGTGCCCTGTCACT

CATGGGCTGGCTACCTCATCTACACTATGAATGCCACACACAGCAGCTAACAAGTTTA 
TATCGGGGAGACTTACAACACATCAAGAGCCAGGGAGGGAGGTT^ 

GTCAGCAGCCAGAAGGAGTTGGGTGAAATCCCATAGTATATGGCCCTCAAAGTGAAAGTGATG 
GGAGACAGACAGATGGCCAGGCCTACCTGGCTCCTGGGATTGA 

GGTGTCTAAGAACAGTCGAATCACGATGCTGAGGTACAATGCTGAGGTACTGGCTCTCT

CTCTGCTGCAAGAGGGCATTGCCCGAAGAGAGGGGAAGAGTCCA 

TGAGCTGATGTGTGACTCATGTGTCCCTCTGTGCTCTGCCTTCAGTCTCCACTGGACTTACTG^ 

ATTCACTGGAAAGTGGACCCGATTTCACAGCTGCCACTCGATGGTGAC^ 

CAAAGACAAGATCAACTTGACAGTTCCC!AGGTTTCATTCCATCCACAACG 

TCACACGCAATGCATGCATGTTAGCAGTTGGCAGGATTCTCCTCTGCATTGAAACC

CCCTTAGAGAATCACTCAGCGGTGAATGAGTTTAAGGTCCCAGGTTGCCCCTACTATTTAAGG^ 
GAAAAATACAAGTGTGGTTAGAATTTGTTAGTCTTTAGTGTTTGAGAAAAGCCCTGTCCTCGG 
AAATAGGGTTAAAAGTTGAGACTCAAGAGAATTAAGCCAACTCACCTTATTCTGCTTCTACATTTAGCTTACATG 
AAAATGCACGTATGTATGCATACCATATGCCTGATGACTGTGGAGACCGGAAGAAGGTATCAGATGCCT^ 

ACAGTTCATGTGAATGCTGAAAATCAAACCTGGGTCCTCTGAGGATGG

GGAAGATAACATGTTTTAATCATCTCACTAAAAGGTTCCAGGCATGTGTTG 

GGACTC^AGCCCCGGATGACTAGCTGTGTACTCAGATAGGCACCCTTTCTCTCCTGCCTCTGTCT 
GACAAGTCAATAAGGGCTTCCTGTGGAAGGAGGGAGTCAAGGGAGAGAGTGGCTGTGGA 

GCGCCTTCCAGCTTCTGCGCATCAG CTGTCTGGCGAGCACTGGGCCC CGTTTTCCTTCTGGG CCTGACTTTAAGTTTCTGGTTCTT 

TGCAGACTTCAAGTTTGCCATGTACCCGCCATCGATGATTGCAACTGG

ATG AAGTGAACAC ACTCACGTGTGATGCCCTG ACTG AGCTGCTGG CCAAGATCAC CCACACTGATGTGGT AGGTACCAGCTCTTTT 

GGCTGAGTCCCCTTGGCTGAAACCCGATGTCTCTCCTCAAGTTCCAGGAAGGATG^ 

TGTTTGCGTTACTGTTTTCTTGGTATCTTGTCTTTAAAGGTCAAACTCCAGCAGA 

TTTGTTTTGTTTTGTTTTTAGATTTATTTATCTTATGTCTGTG

GTTCCCATCACAGATGGTGGTGAGCCACCATGTGGTTGCTGGGATTTGAACTC^ 
TAATCTCTAGCCCTC CTTCTCT AAATTTTATTGAGAGACAATCTCACAAAGATGCCTTGAATTC 
CCTATAGCTGGGATTACAGX3TGTGCACAAAGACTGCCTGCTTCCT 
AGAATCTCAGCAAGCTGATtXXK3GGGGTGGGCAGTCAACTGATTTTC 

5 0 GTCAGGCCTCTGCTCTGCCATGGAGAGAGTCCAGGTCTACTCACTCCCCAATGAGGTGGGAATGC^ 

CTGTGAC^AGGGAGCAGATTTGAGAGGTGGCCTTGTGAGTGATATCT 
CTTTTAACTCAAAGTTTTT^TGGATGGCAAAGCACACAA 
TTTTCCATTCTGTGTGGATGGCCCCGAGGCTGCTGCTGGCTCCCAGAATTCA^ 
GGATCGCACAAGGTCCTGATTTCACTGGCTAGGATGTAACAGTTGAAGAG 
55 TGCATCCCTCCTCCCCGCAGCCCCCCCCTCCCCCCCCAGGCAGCAGAGAAAGGATGCAGCAGAGAGGCAGGACAGGTAACAGCAGA 
GCCAGTACCCAAGC CTCTGAGCTCTGTCTCCAG CCTGGAAACTTTG CTCATCTCTGTGGCTCTTGGGAGCTTTGTATTTTG CAAAA 
GG CATTCAGTGGAAAGGCTAAGCAGAGAGGGGAGTATTGTTGGCTCTTAGGTTCTGGTGTT 
AATGCAGGGGAGATTCACAAGATACCCTCTGGATATGCAGGGGCATTCAGACTGCCCCATGAGGGAGTTTACT 
ACGCCGTGTAATGCACAGGCCTGGGGCTGGTGCCTCGCAGAAGTCTGCAAAGCACAGGCTGTTTAA^ 

TCCAGCAACTCACCTGTATTTGGCCTTGACACTACCTCAGAGAGGTGAAGGCCTTAAAGGC

TTCTTTCTCrcAACCGGGTTTTAAACAG 

TGTGTGTGTGTGTGTAGGGGCTCAACCCAGGCCCTAATCTTCACTTTTCCTAATGTGTCCTGCTAAGGATTACTTCA^ 
TGTTAGCCAGTAACAAACTTTTCTTCAACTTCTGAACATGC 

GGGAAACTAGGAAAGGGGATAACATTTGAAATGTAAATAAAGAAAATATCTGGG 
ACTGCTCTTCCAAAGGTCCTGAGTTCAAATCCCAGCGACCACATGGTG 

TGGTGCATCTGAAGACAGCTACAGTGTACTTATGTATAATAATAAATAAATCTTAAAAAAAGAAATGA^ 
CAAAAAGACACATGTAGAAACACACGTACACACATATATACACATACATACAAACATATATACACACAGACACA 

7 0 GAAGAOU3TAAGGGCTGGAGATAGTTCAGTGATTAAAAGCTCCGACTC 

GG CAGCT CCCAGTG ATCTG T ACCTCACACATG TGGTGCACGCAT ACGTG CAG ACAAAG CAC CAAT ACAC AT AAAAACAGATT AATT 
AAAATTCAATAAGGAAAAGAAAAGTCTGCAGGGGCGGGGGTCCTCACA 
TCTGCAAATGAACT^AGGATGCCTAATGCAAAATTTGAGCAGAGTAAAAAAATTG 
ATGTGGTATGGCCACATGATGGGATGTCAGAAAGCCTTCCAAAAGAAATGTCATCCTATAGACAOT 
7 c pajirwrvarfzrT 
AACGGAAGGAAAGTTGGTCAAAAGGTTGGAAGGAAGG CAGGAAGG AAGAAAGGAAGGAAGGAAGGAAAACAGGTAGGTAGGGAAGT
GGAACATCCTAGCCATAACATACAGTATGAATGTACATTGCACCAAAATGCACCAAAAATGA 
TTTTTTAATTTATTTATATGTACAGTGTTCTGCCTACATGCATC^ 
TATGAGCCACCACGTGGTTGCTGGGAATTGAACTCACGCCTCCrrGGA^ 
5 GCCCCTATGCCATGATATTTTATTCATATAAATATACCACACTCATACAGATCATCCCTC 
GTCATGTGGAGAAGTTAAACCAATCCAAATAGAAGCCAGGAGC 
CCGAGTTCTACTCTCACGGCTGGCCCGAGGTCAAGGTTGAAAGTTGGTC 
AGCrCACTGGCAGCTGATAGCTGGGGCTGCAGGGACCCACGCGGCTCrrGA^ 
AATTGACTAGCTCCATTCCGAAGAGAAGTCTX3AGCGAGAAGCCTGACAGTCTTA 
1 0 ATTTTAGGTCCTTTTCCCTrCCAGCCACAAGTTAAGAGTTACTAAAATCAGACACT 
ACTTCTACATACCCAGACTTCTGTTGCCAAGAGGGAGTTTGAGAGCAGGG 
TTTCCATCCCCTCCACAGTGTAATCTTGCTGTGGTTCAGAGTTTACCT 

TACCTTGAACTTGGCTTTTT ATCCCAAG CCTCCAGCAGTCATTAG AAACT AAGG CTTGGAGCAAGGGAGGGAAGGATGGAAGGACG 
AAAGGCAGACAGGGCTTTCAAATGTGCCTCTCATTCATTCAGGACCTTAGG 
1 5 CTACATCTACAAACTTAAACAGAAGATTTATCTTGGAG 

AGAGCTGGTATGAGGCTCAGTGGTAGAATATTTGCCTAACGCGTGCAAGGTTCAATCCCCAA^ 
GTTGAACTGAGGCGTCCATGCTATAAAGATTTGAGGAAGTTTCTAGAAGATATATAATAGA^ 
TCAGTTCTGCCTCGGTTATAACAGGACCCCTTCCCXSTGTTCTCCAGAACTGC 
TCATCTTGGCATTTTATGATAGAACTTCTCAGCGAG 

2 0 CTTTTCTGGGGAGAGGGGGTCCCCATCATCTCTTATGTCCCCCGCCCCCCTTG 

T AAT ACCACCTT CAGTG TG CTT AAGGGGGGCTT CGGGG AGGTAGGAATT CTGG AAAG CATGCCTGGGGGTGGGGAGG CGGGTAGAG 
CTGGGNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNl^^ 
NNNNNNNNNNNNNNNNmiNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNTOJNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNmiNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNN^ 

GGCGGTTGTAACTGTTGGGATTGATTGATGGATGATTGATCGGTGGATGATTGATTGATTC 
GAGGAATACCATGTAGGTCTCTCCACCCT CAGGAAGG AATATGTTCTACACTTCCTGCAGAT A 

AG GACTG AGGG AC C CTTG TTG AAGGT AG AG CTAATAACTGACCACTT ATG AGG AAG TGG AAGCTGAT AG ACAG CAGTT AAG CAGAC 
CTGATGGGTTTAGCAGTCTCCOIGGTGAATGGCAGATGACTCTGTAGCCATC 

3 5 TCTCCAAATGAGAGGGAGCAAAGATCAGGGATCATCCAGTC^GTAATGATTTG 

AGTTTACATGAAACCTTTGGATCTTCCTCCAGTGCTTA 

ATCATCTreAGCTCTTGGCTCTGAACX5TGCAGTAGAAATGACTGCCAGATGTC^ 

CCnTTAAAATCTTGGGCAGCTAGAAGGGAAGGCTTACTGTAGGAAGAGTAGGGACACTGC 

TTCTGCAGGCTGATTTATGAAGACCATCTGGCATCATTAATACAATAGAGATGTAC^ 

4 0 ACCCAGATGATCTTTTCCGGGAATCAGAAGATAGCGGACTAGTTTAAAAAAAAAAAA^ 

CACAAAACTTCCTACTGTAGAGCAGAGTCTCCCGTGTACATTACATCAGAAAATATTC^ 
CACCGTAAAGATTCCTCATCATTTATAGAAGGTCTCACTCTGCAGCC 

CAGCTACAAATCCTACTGTTGTAAAATACCAGAAGCITCCAGGCCT 

4 5 GGCATGGATGTGGTTGCTGTCTCTGGTATTGGGGTGTTCCCACTCATTCTCTCT 

AAATG CTG CCAGATTGGCCTAGCAGGAAACATGGCAAACAAATGAGAAGCTGTGAGGACC^ 
GCAGGTCCCAAGGATCAGGATGAGAGATGTTAGAGCCTGGGC^ 
CGGTGACCCCCGCCCCCAATGCTGGACCACACrATCTCTAGGATTAACCTTCGACA 
ACCTTTGTTCACAGAAATATAGGATGGGCGTGTTTGTGTACTTAAAGTTTCAAC^^ 

5 0 AATCCAAAGGATTCTTTCAGATTCCTGGACAGATGGAGCCACTAAAACACACACAC^ 

TGCTACTTAGCTCTTCATTTTTATACTTGGAGAAACCGGGGTTTGCAAATGGAGCT 
CATCCCCATCCTGCTTCCCXjCCCTTCCCAGGTTCAGGAGAACGTT^ 
TCTTCAAGCrACAGTTTGAACTGGGGGGCAGATGAGAGATGGGAAAGAATA 
GCAGTAGGGTGGGCGGGGGTAGAGGATGOUlACTTTTGCrCCrTAGT 

5 5 AG AT AATG C CTTT ATGTTTG TGG AAAAAAAAT AT CAATTCCT AGGTGGT G AGGT CT AAG CAGAAAAAT AAACAGAGGG AAGTTT AT 

GGTGAACTGTGAGC^GATACCAGGAGATTCGAGTTAGCACCGTTTCTATACGCACAGGATG 
GAGAAACTAGGGTGAAAATCCCTGGTAGGGGCCCCCCTCCCCCCAANNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNl^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GAAACACTTAAGGCGTCCGCTGATGCGCTGGTTCTGACCAGCTTGATCTGGGATAGGCTC 

GCTGCCTGCTGTGGCCCCCCACTTTGTATACTGAGTAGTTTCCATGGCATT^ 

AGGATGTGAGTTGGATCGCGCGTCTGCCITTGAGAGGCXjTTCTATGGATTTGCATGTC 

TCT AGGATTGTCTCAAAGCCTX5CCAGGAGCAAATCG AAGCTCTG CTG CTGAACAGCCTGCAGCAGTTCCGTCAAGAGCAGC ATAAC 

6 5 GCCGGATCCAAGTCTGTGGAAGAT CCGGACCAAGCCACCACCCCTACAG ACGTG CGGG ATGTTGACCTGTGAGG AAGCCATTCGGG 

CGGCAAGAGAGAGGCGTGTTCGTCATCTGCTAGCCCCTTCTCTCTCT 
AAAAAAGAAAAAAAAAAAACAAAAAACAAAAACAAAATCTGCCCCCACCT 

AAGGCCGTATAGTTGTAACGCATGTGGTGCCTGTTCAAAGTCCTGCATAAGGGGATCCCTGTACACTC 
GATGTAAAAGTC^TAGTGAGATGCTTACAGGAGAACCCGC^GACTAGCTAGAGAGTGTGC 

7 0 AGGAGACTGTCTTTTGTGCTTGTGACCTAGTGCATATGCCCCCrTTAGTAGAATTGCAAGGAACCGTGGGTC 

GTAGTTTGCAAAGCAGTGAACCCAAGGAGGAATCAGAAACGAGAAGGGACATC 
GACTG AACCATTTTGGATGTAAGAAGCGCCrTTGATGCCTCIACCZACGTGTTC 
TCTTGCTGACTAAAGTAGCTGCTACCTAAGGGATATGTCITTTGC 

TGAATTTGGG CTTCT ACTT CCAAACA.TGGTTCATTGTAGATG TATTTTACTGTT AT CTGATC^AGGGAATTCGCTATGGG CCATC A 
7 5 CCAAAACGCATCTAAATGGGGGCAGATGGAGACCATAGACGGAATCTAAAAGA^ 
ATCTTTAATTCCCTTGTTAAGAAAAAGAAAAAAG
TTTGTTATAAACCATTCCATTAGAAAAGCACTITGAAAATTTTG^ 
TGTTGAATACGCCTTCCTCCTTTGCCCCTCCCCTTCCTGACCCTCCT 
CCAGCTCTGGTGTCCCrGCCAGGGCTGAGAAAGCCCTCTGCTGATATCCT 
5 TCTAAAGCATTGTGATGGCTTAGAAGGATTGAATCCCCGTCTATGTTTTTCT 
CACAGGCnaAACATCAGGGTGTGGCATTTTGGTCCCT^ 

ATT CATTGTACTTGAAGACC C CCCT CTCC CACri' crrGGAGACAGTCAGTAAACAGAAGTGGGAGCCACAGTTGGTTTG<j 1 I I I GT 

TTTcrrn ttttgttttti i r rr r m i gcacaattataattg acaggttatgaagctatttgttaaaatatctgct^ t 

I'rri'rriTTGAGAAAAGATCAGAACAGGGCTGCTTTGAAGAGTTAT 
1 0 TGGAGAAACATCTAAGGGTCTCTTATCTTTCA' ri ' ri-V T C 1 1 1 1 GTTCTGTTTATTATTATT ATTTTTCGCTGCTAAGAAGCTAAGA 

TCGTTCACCCCCATTCACATTAACAGTACCTAGCTGTAATGTT^ 
TTTTACTGCTTAAATTTCAAGTCCCGAGGTAGATGGTCGAGAGACGAGTTCT 
CTTCTCGTAGCATCGATGGGCTGCGTTGCGTTTGGTrCCGTTTGGTT 

CT CCTCTG ACCACATT CTTCAAAGAGAGT ATT CTTT AC CT CAGGTTT ACTGG ACAAAAACAAAAC AAAACAAAACCAAT AGTG AT A 
1 5 ACAATGTACTTCTTTTCGTTTTCCTAATACCTCACAGCT 
AGAAGACAGGAACTGGACATCCrCCTTGATAGAAAG 
AAGATTGCCAGGCTCCCCTTTGACGCAGGCTCCCTT^ 

CAT CAAACACAGACAGTGGACT AGGT CAG CCTGATTTGCATGCGTGCAC CAGGG T CTG CAATGAAGAATTCTGG CAG ACAGTT AGG 

AGACCCTCCCTCCATCTTGTAGCCCATTCAGACACAGGACTT^ 
2 0 TTCTTGGCATCTATACCCCAACTTCGGATGGCCTCCGAAGTCTTCCT 

GGATGATATTCCTTTCTCCTTCTCTTGTCCTCrGCAGTCT^ 

CATTGAATGGCTTCAC^GGTTTGGATCCCGTTGAGTG 

AAGGGAGAGGCTGTTTCTGTGGTCrCGGGTGTATTTAGGAAGGATTG 

TTGTGGTAGGACGGTGGGTGATGGTTTGGGAAGTTTGAGAGAGAAAAC^ 
2 5 CACGCACTGCACAGCACACTCTTCTGGTCCTrAAGGGACCCCACAGA^ 

GGTACCrrC^TC^GTGGCCTTTGAGCTTAACTGAAACATTC 

CCCTCCATCCCTCTTACCTGGAACCTAAGACACTCTTCCTGTT^ 

ATTCTGCCTGCTCCTCACTTACACAAAGGAAGGGAGCCrGGATTCCIT 
TTGCAGATTCTGGTCTTTTTCTGCCTTGCT

(TTAAGTCGTTCCCCCAGCCTGCCJUVATTTTGATCCTTCCCCT 

AAAATAATATGCTAAGCATAAGGAGATTCATAATGGTrCCrAAGAACCAGGCAGCC 

AGCTTGTAGAGGCrGCTTTGGTTTGAACTGTCCCCrAGGACIACCACG 

GCATGCCTACTGACACGGGACTCCTCTCTAGCAGCCAGGCTGGGG^ 

GTAAGTGACCCCTCAGAGGGAGCTTAAGAGCriTTCCTATAGTCTAGCTCT 

GAGCAACAGGCTAAGACAGGGTGGCTTTCACau^GAAGGCAGAGGA^ 

GTCCCTGTGTCCTGGACAGTGGTGTGGGAACTGGAGGTAGCAATCAGAAGTCTAAGTAAGCCAGCTGACCTCT 

TCCCCCTTGCTTACCTTAGACAGTCC^CCTTGGGAATAGCrTTA 

CGACAAGCCCTCCATAGGTGCACACCGCACACATAGGCTC 

ATCTATTTGTA03AAATAGTGTAGGGAAGCCCCAGGAGGTGTTATAGCTTCCGTATACCATGTGG 

GAACAGCACAAAAGAGTCCTCGAGCCGGAACATGCGCTACTCC^ 

GTTTTACtaAAGCCATCAATACATACACACCCTTGATGGAGCrc 

GCCTTACTGACrrACAGTAGCCACTCAGTTTCAACAGGTGC 

AAAAAAAACAGAAAAGAAAAAAAACGTTAAAATGTCTCTTAATTTTCA 



^ rr ^ GTTCC r r ^ rrTT T ^ TTT ^ TTTOOTTT ^ G ^ 
CACC<rrGTACrCTCTAACAGGTGTGGGTGGGGTTTATGGTTGGGATACGA^ 
AGAAAAGGTTTTTAGGAAATCCGGGGGAAGTACATTGTATTGACTACAATGCri^ 
TAAAAACAAAAAAACAAAAAAAAAATCAACCCTAATTTGTTCTTCTG 
AAAGGTTTrGAATGCAATTAGGTTATGCTATTTGGACAATAAACTCACCT^ 
AGACrcCTAGCCCTCAGAGXaAGAACATACCCGTCCCCCAGTTTTAGAC 
TGTCTTCATTAGGTGGCTGAGGCAACAGTGTCATG^ 

CTCACTCTCTTGACTGTGAGTAAGTGTCTCCTGGTATGGTTGCCAGAAACCC^ 

AACATGAGCCAAGTGTTCC^AAAGGTTCCCATAAAATGCCTCTCCCCCCCCCCCCGCACCCTTTCCTTGTATCT 

GTCCTAGAATGCATAAGGGTCCCCCnMAACrGGAATCACAGATGCCTGTGAGCCACCAT 

TTGTGTCAACTTGACACAGCTGGAGTTATCACAGAGAAAC^AGCTTC^ 

TTTTCITCAATTAGTGATCAAGAGGGAAAGGCCCCTTGTGGGTGGGACCATCTCTGGG 

ACTGAGCAAGCCAGTAAAGAACATCCCTCCATGGCTTCTGCACCAGCT 

ACAACAGTATGGAAGTGTAAGCCGAATAAACCCTTTCCTTCCCAACTTGC 

AGACAGGGTTrCTCTGTATAGCCCTCGCrcTCCTGGAA 

CCTGCCTCTGCCTCCCAAGTGCTGGATTAAAGGCGTGCGCCACCATGCCTGGCCCC^ 

AAACCCTG ACT AAGACACTGGG CATCAAAC CAG AGAAG CCAG AGGTCTT AACTGCTGAGC CATCTCTT C!AG ACC CAT ACTTTTG CA 
TTTCTTGTATTTCTCTTTTTTGTTCTTGTTTC 



6 5 CTCCCTCCCCAATACTGGGATTAAAGTTCTGTGCTGCCACAAAG 

ATCCATGATAGCAACAAAATCAGACCTTCTGTTGGCITATCAGGGACGTTCCTAACATCCCTCT 

GCCAG ATAGTGTTCGGCCTTGGGCTGGGGAGGAGGAAAGTGG CAGAAATCCCCTTGAG AGAGGTGG AG AGATGGCTCAG CAGCT AA 
GAGTGTTTGCTGCTCTTGCACCCAAATCAGGGGGCTCACAACT^ 

ACGGCITCTGAAGGCACTGCACAC^TGACACACATACATACGTGAGGCACGGATGTATACAT 

CACAATGGAAGCACCACAGCTCTGTGGCITGAAGAAC^ 
TGGATCAGCAACTGGGGTTGAATGTGGOUIGAGCCGGTGTTGAGA^ 
CCGGCACATGAACTGCAGAAAACACrTGTTAGGATGTCCATGTTCTTTGGAAA 
AAAGAGA'i\iAAAAGTGrTTTTTCAGG 

7 5 CT C CACAAAT ACATGT AGG AAGTGGG CTAAACTCGGC CACAG ACACTGAC CCAGGG CGGC CAC CT ACCTG ATTTCTTGGGAAGC CT 
GAAGGTAAGGCTGGAAATTACCTGGTGACTGGTGACTTGTATGTTCTGGGTTTT

TGC CT CTAG'ITCTG T T CACAGACTTGAG CAGCCAG AATTGACTGC CAGAGAAAAAAAGAGTCCGG CAATCi 1 1 i ACu 1 i I CGTC 
CTCCTCTACCTCTCTATTGTCACCTCATCAGCCTTCCAAAGCTTTC 

5 GTCATAGGTGGAACCAATCTGCAAAGCCACTAGCATGGGTCTATCTAGAGACCA 

GCAGCCTGCAGAGACCCATGTCAGGCCTGTCCCACACATGAACAATG^ 

TCCITTTGGATTAGCTGGGACCCTGATTTATTGCCTTGT^ 

CCAGACTGACCGGATCCAGCCTGCACAGAGTGAGAAGGACTCCTACAGTCACT 

AAGATCTTACCAATTCATTGGGAATCTAGGTCGTGGAGACCCOVGGGCr^ 
10 CCCCTTCTCCCTGCTCCCn^CCTCTACCCACTCCCCTGCCCTACCCACrrAC 

TCTTTCCrCTGGCTGTGCAGTGGCATTGCTCCAAGGGCTGTGGAAGCT 

TTGAGGACCAGCTCTTCTCCCAGAAGACAAGAGACCCCTGAATGAAGCTTTTGTCT 

GGGTGATACAATCCAATGTGCATTAAGCATGAGCTTCTCCCATTTTT 

TTTGCCTTCATGTATGTATGTATGTGCGCTACATGTATGCATG 
1 5 GTTACATGTAGAGAGCTTTGGGGGCTGCITGGCTGGAATC^ 

ACAGGAAAGTAAGTTCAGGCAGGAATCTAATTTTAAACTAGAACAAAG 

CAGGGAAGTGGGCTCATTGTGATAAGCCCTGAGAAACAGTGATGGTGGGACT 

CTTGACTGTGTTTAATGCCGTGTTTGTTCCTCAACATAGAACTGGCCTTATCG 

TGGGAAGAAATAAATCTGCOTCAGCCTCAGAACTGGCTGGGGTCATGCrrATAGTCT 
2 0 CCTGCCCTGGAGAACCTGTTGACTGACTGGCTTGGTCAGTTTAAGATGGTT 

GGTCCTCTGCAAGAGCAACAAATCCTCTCCAACTCTGAGCCACCTCTCC^ 

TTTAGTTTTGGAGGGTTTTTGTTTTTGTTGTTTTTTG^ 

AGGTAGATCCCrGTGAGTTCGAGGTTAGCCTGGTCCACATAGGGAGCTGCAGGCCA 

ACGG AGGGGGTTTGC AAAAGCTGTTTGGTTGGAGAG ACATTG CTCAAGGTC AAAAG TATGACTATCC CAAACTC TTTT ACAGAG AT 

2 5 GATTGCAGGGACCGCCCGTTCACTGCTCGGATGAGCTC 

GATGGCTACTTTGCC^CTGACCAGGTCCTGTAAACTTTGATCTTGGTGAC^ 
TGTATTGGTGGAGGATGCAGAAACTCTAGTATGAAGCCTGCTCTGGAATCCCTGCCAGTCCT 
TTCACAGGACTCACAGTAAACTCAGCTCAGTTCCTTTATCTGTGAAGC^ 
ATCAGAATGAAGTGAGTCGATTCATTGAAAAACATAGAGCCAG^ 

3 0 TAAAGCANNNNNNNNNNNNNNNNNNNNT^ 

TGCTGTACACAGTCTCTGGGTTCCAGCCCCACCACTTAAAGGTTGGGCAT^ 

GCAGGAGGATCTCTGTGAGTTCCAGGCCAGCTAGGGCTACATAGAGAAACCCTGTCT 

GTTCACAACCCTCAGTCTCTTCTGGTGGGTGTGACTCCA 

MOUSE SEQUENCE - mRNA

GGCGAGCTGAGGAGAGCCGGGCAGTTCGGAGGGAAGGACCGGTGCGAGTCAGGCGGCCCTTGAGGCTCCG 

TCTTCrCTCTCTCrCTCCCTCTCTCTCITTGCCA 

GCACTCCCCGACTCTCTCTCTCTCTCCAGCTOVGTCCTCCAAAAAATAATAATAA 
GGCCGGCTGGCTATGGAGCTGCTGTGCTGCGAGGTGGACCCGGTCCGCAGGGCCGTGCCGGACC 

4 0 TCTGCAGAACCTGTTGACCATCGAGGAGCGCTACCTCCCG 

TGCG CAGGATGGTGGC CACCTGGATG CT AGAGG T CTGTG AGGAACAAAAGTGTG AAG AAGAGGT CTTTC CTCTG GCCATGAATT AC 
CTGGACCGTTTCTTGGCTGGAGTCCCGACTCCTAAGACCCATCTTCAGCTCCT 
GAAAGAGACCATCCCGCTGACTGCGGAAAAGCTGTGCATTTACACCGACAACTCTC 
TGGTAGTGTTGGGTAAGCTGAAGTGGAACCTGGCCGCAGTCAC^ 
45 CAAAAGGAGAAGCTGTCCCTGATCCGCAAGCATGCGCAGACCTTCATCGCTCTGTGCGCTACCGACTTCAA 
GCCATCGATGATTGCAACTGGAAGCGTGGGAGCAGCCATCTGTGGGCTTCAGCAGGATGATGAAGTGA^ 

CCCTGACTG AG CTGCTGGCCAAGATCACCCACACTGATGTGG AT^ CCTGCCAGG AGC AAATCGAAGCTCTGCTGCTG 

AACAGCCTGCAGCAGTTCCGTCAAGAGCAG CAT AACG CCGG ATCCAAGTCTGTGGAAGATCCGG ACCAAG CCACCACCCCTACAGA 

50 TATGTCTTGTTCTTTGTGTTTTTAGGATGAAACT^ 

MOUSE SEQUENCE - CODING 

GTTGACCATCGAGGAGCGCTACCTCCCGCAGTGTTCCT 

5 5 TGGCCACCTGGATGCTAGAGGTCTGTGAGGAACAAAAGTGTGAAGAAGAGGTCTTTCCTCTGGCCATC 

TTGGCTGGAGTCCCGACTCCTAAGACCCATCTTCAGCTCCrGGGTGCAGTGTGCATGTTCCT 
CCCGCTGACTGCGGAAAAGCTGTGCATTTACACCGAC^^CTCTGTG 
GTAAGCTGAAGTGGAACCTGGCCGC!AGTCACCCCTCACGACTTCATTGAGC^ 
CTCTCCCTGATCCGCAAGCATGCGCAGACCTTCATCGCT 

6 0 TGCAACTGGAAGCGTGGGAGCAGCCATCTGTGGGCTTCAGCAGGATGATGAAGTGAACAC^ 

TGCTGGCCAAGATCACCCACACTGATGTGGATTGTCTCAAAGCCTGCCAGGAGCAAATCGAAGCTCTGCT 

(^GTTCTOTCAAGAGCAGCATAACGCCGGATCCAAGTC!TGTGGAAGATCCGGACCZAAGCC^ 

TGACCTGTGA 

HUMAN SEQUENCE - GENOMIC

CTCCGCAAAATTTGCTTAATTCCCTTTTCA 

AAAGAGG CTC ACCAGGTCC^ATGG ACTGTCACCT AAAGAAGGATCTGCAAGTT A CGCTCT 

GATGGGGGCTCTGTTTGAGTTGTAGGAATAAGCTTTAGGTTAGrTTAAGCTTTAGGT^ 

ACCCTGGAGCAGTGTTATAGTCCATATATTCTTATGCTCTCTATATTTTTGAAGAC 

7 0 CCTTAGGACAAGGAAAACAGC!ATTTTTCCTCAGCTGACATGTACCAAGTC 

ACTCTTGATAGATTAATTCTCCCAGATACCTGGGAAGTGAGAGCATACCrAAACCCACAA 
AGTATATAATAAATTGATTCCGATTTTGATGATGATAATAAAGTCACTGGAAT^ 

ATCATTT ACT AC!AGC CAGTT AATTTTG T AGCTG AGG AAACT CT AT CCCCAAGG TCACAGTGCTGGT 

T AG TG C A .G AGT AG AAC C AAC AAAG AGGG T ACAGTCGGCCCTCTCT ATO CATOCGTTC CGT AT CCACAG ATAC ACCCAACCACGGAT 
7 5 CAAAAATATTCAG AG AAAAAAACACAGTAAAAAAAT AAC AAATG CAACAATGAGAAGTAATACAAATAAAAACACAG CATAACCGC 
TTCCAAAGCATTTACTTTATATTAGGTACTGTAAGT^
AATATTACACCATTTTATATCGGATTTTTACATCCTGGATTTTGGTGGT 
TGAGGGATGGCTGTATTCTGACTTCTAGGCTAGAGCTGT^ 
TCTAGGAGACAGTCAAGCTAATAATCAGAGGTCTATATTCTG 
5 GACCTCAAATTCCCTAGCCAAAGGAGTGAAACACTAATCAATGCCCrGTCAGA 
GCTGTGGGAAATACAGCTACTCAAAGAAAGGAACGCTCTCMTATGATGGAC 
GGACTGACTCCCATATATGTGATATGTGAATCAGACATCCAA 

AGGAACATAATTCTTAAAATCACTATGCAATCCAAGTAACAGTTACTTGACTAGACT 

TCCCTGGTTTAAGGTATGTTGTGGAAGGTGGATAAAATAGCATATCI^ 
1 0 AACATGTACTTATCCGGGGTATTATTTCCTATGCTTCATCACTAGACAATGTGCTTCC 

T C CTTC CTGT CTACCTTCCAAAACATTGTCT ATGTCCACG AT AT CC C CAATG CCT AG CATAGTG TTTGGT ATG TGGT AAGG ACT CA 

ATTTTAATCAAATAAATGAATGGATAAATTGCAAAACCCAAAGGCATTCTCCAAAG 

ATTAAAACCACTTAAATTCTACCTTACAGATTGGAATCATACCCCTGACATC 

TCAGTTTAACACATATGTGATCTCTCAGCATGAATAAC 
1 5 AGGCGTATTCTGTGACTGX3CCAAAATGGCATGAGGTCCTGCAGTCCATTC 

TGAGAATGATGCTTTCCTGAATGGGAAAGGTTTCCACTAAATGAAACATACAG 

AAGGTCAGACTTTGATTTCTTGAGGTAAGTATGGTTITCATCCCAAG 

AAATCATTTGCTAGATGACATAAAATTGCTGGTAATTACACTACAAAGAGTG 

ATTTGAGCCTTATTCAATAl'TTTGTTAGGGAAATTTTAAATTAAATA 
2 0 TATTCAGAAGAAAGTAGGGGGGTAGAAAAGGTTGCAAGTCACTAGTTTACTGGAACTA 

ATACAACTAAATTTCCTGAGTTTAAATGATTTCAGTGAGACCCCCACACAAAACGG 

CGCCTCTCTCAAAATATTTTTTTAAAATAATGGTTCACGATCCTGAAATAAT 

ATCTGAAACCnTATGTAAACTAACCCAAACCATTTGCTTCAGAAT 

CATTTGCCTGATGGACTGACTGGCTGCTCTCAAAAATGGAAACATATGTGGGCACGA 

2 5 GAAATTTT CTTTTCCTTTTT AATCCTGTL T T1 TT1TT T TTG CTTCCCCTAGGCAATAG<?TAT ATTAAATGTTGGTGACTTTTTATT 

TTCAAAAGCACAAAATTAAATGTTTTTTGTGTTATTTCTCAGAAC^ 
TTGATGACAATGTTTAAAGAAATAAAGTGAACATCCCCCCACACACACTCATCAAAAGA 
ATCATCATAATTACTATTATTACCAATGAAGCTTGAAGGAACTC^ 
GATGAGACCACZAACGAAAGTATGATTTGAGATAGCTAAGAGCAGAAATAAATTTGGA 

3 0 TAAACCCCCTTTGCTCTGGATTTGTGCTTTTTGAGTTGTTTGCATCATGCA 

AGATGCCCTTATTTAAGGCTGAGTAAAGAGGAAATGCATTCCTTATAGGCATCTACATC 

TACACGAACTACCTATTTTCAGGTCAATTCTAAAATAGAACATATCTCT 

GT CAACCTGTTT CACATTTCAG AACT CATT CCTTTT AATGC CAACAAAAG AACA 

TGCCTCCTGAAATTTGCCACAAGGAAAACAAACCAAAAAAAGCCTCAAGAAGAGAACT 

3 5 TTTTTAAGTTTGGTAAATATGAATTTAGCATGTT^ 

AAGATTGAAAATTAAAATATGAAATATTTATTAGAAATACTTTATATTGCTTCAAT^ 
CCATGAAATCGAAAGAATATTTAATCACATTGGGAAAGTGGACATTTTTAAGGGGAATC 

TG CTTAGAAAACAAATAAAATTGACTATTCTTCAAAAAAAAATT CTGGTGAGG ATAGTCITGACATTTT ACAAAGAATTTACATCT 
CTAAAGTGCTTAACTTAACCTCCACTACTTAACTGTCTGAAATC 

4 0 TTTAAACACrTTCCTTTGGCTCCTTCCTTT 

CAAATCAGGCACCAAAAAGCAAGCACTCTTAAATACAGTTCACT 

TAGCTCATTTTTTCCCCCAGAAGAAACAAAACCAAGTCTCTTGGGTCAGACCA 

GTTAAAAATAGAGGGAATGTGTTAAAAAGAAGGAAAAAGTACTGTTCATAACCAAACTGTGTCACCT 

CACTT AAT ATCTGATTCAGACAG AAAGCTTCT CT C CT ACTTTTGAAACAAAT AATTCCACTT CAAAT CCACATTTTCAG ACACCCC 

4 5 TAGCTTTCTGCATCrGATTCCTACAACAGCCCCTTTACCTCAAAGAGGTAAAAG^ 

CACTGAAGCTCCTTGTAAATTTCAGCATAAACTCAGAACTGTCACCGTGGGTTTAGA 
CCAGATTGAAAGATTAAAAGGACTTTGCCCAAATTTAGGGGAAATGAAGGTGCGTGG 
TTGCAGTTTAGAAGATTCTGCATGAGCACGGATGCCACGAGGGTATCCAGCGAGTGGGAAGGGAGTCG 
CGATTTGTAAAACrGTTTTGAAAGGGTATTTCTAGATCCATTTGTTCCAAA 

5 0 TGCACGGGTTATCTTACATGTAATAGCCATCGTATTAACCAACCCAGCTGATACTGAGATGGTGAAT 

GAATATGTCG CGCATG AAAAACTCCGGGTGTAAAAGTCAATG CAGAACrCACrCCCCGAGGAACCGAAAAAGACATTCACGACTCT 

CAAGTGGAGAGAGGATAGAAAGTCAAGTGGCGGGGGTCGCGACCAGTGATCGGGATTTCAGTCTGATCT 

GGACCATAACCTAAAGAAATGAGCTTTTTCCGGAGCAAA^ 

AAGATGGCGAGGGGCGGGGGGCGGGGAGGGAGGCGGGTCGCGGCGCTGGCTCCGGGGTCCGCGGCACGGCCTCCT 
55 CCCAGGGCCCGGACCCTGCCCGGCGCTGCGCCC^^ 
CCCCTCCCCCGCCGCTTCCTCTTGCTOT 

GGCTCCCCGAGCGGAGGCGCGATTCCGCCGCCCAGCGGCCCCCT^ 
TTCCTTCTCCTCCCGGGAGGCAGGACCCCCCGACGCC^ 

TAAAGCAAATGTATTTCCAGAGOTATTTTCGGCCGCGTGAGGCGTGTCCTAAGCTGAATCAGA 

6 0 CTTTTAATTTTTTTTITTTCCGAAGGGAGGGGAGT^ 

CTGTCGCATCTGCCGCCTCAGTAATCCAGCCCCGTCCAAGCCGAAATTCGCCGAAGGGAGTGCGGATG 
TGCCCCCCTCCAGAACGCAGCGGCCCAGCGCCCCGGCGGGCGCGGCTGC^ 

CGCACTGCCCGTCCCCACCCACCCTGCTCCCCCGTGCTCTCCGCTTCCCGACGTTTTCCTCTTCTCCr^ 
TTGCCTGCACTCCCT CCTTCCTCTTGATGTGCCTCCTATGTGTCCCCTCG 

6 5 TTCGGTTTTGATTGACCGTCCATCCCTCCACGGAGAAACACAAACACAGCTCCACT 

TCCCATTTCTGAGCCTGAACTCAGTCACTACCCGCTCCCCACCTGGCCT AGGCGCCC CTGCGCGGAGAAGG CGGGACTCGAACTCG 

CGCTGCTCCCGGGCCTTGAGCCGACCGCGGAATCACCTGGCT^ 

CTGCAGGCTCCTGCAGAAAAGTGCCTATAAACACCCAACACCCTGCTGCA 

AACTC CAAAACG CTAGCGTAG AGGGTGGGGAGCCGGCCGGAAAAAGAATAAAAGTCCAATAAAACTGG CGTTCGCTAAAGTTTATC 

7 0 ACCAGTCCTACATGGGATATATATATATGTA' rriT A " r' X T TC CGTGAAGGGTGAAAAGG AGATAAGG AAGAACCAACAATCTACCCC 

CTCCCGCCGCTCCCCACCCCCGCG CCACGGTG ATCAGTTTGGACTTCAAAGCCAGAGCACAGG CTCTTGCX3CTTTTTCTTGAAACC 
GAAGTCTACACTGAAAGAAAGTGTGCACTTTTGCCTAGAAGGCAACATGCGT^ 
GACATTAGGGTTTTAACACATGGCTGGAGTGGCGACCAAAAGGGAAAACT 
CAACCTCCGCACCCTCTCACGCCCCACCC^ 
7 5 CGGGAACAGTCAGTAATCGGAAGGGGAAGTGGACAGGGGAACTTCAAGAGGCGAGCCTGC 
GTCTCCATGAATGO^GAGGGCGCCGGGAAGGGGGGGCATCCGGCCGCGACCCTCT
CTGGAATTTCTCTGTAAAGCAAGACGGAGTAGGGGGAGGGGGAGAGGGAA 

TCGGGAGACCCAGAGCTCTGGATGCCTTTCCCCGGAGAAGGGGGGGTGTGCGGAGTCGGGG 
ATATCAAGTGATGTCCIAGAGGCTGGGAGCCCCGGCGGCCTCT 

CGCCCTCCCTCCTTCTCTCGCCTCTCCCCGCTGCAA C1 T T L 1" T T GATCCGCTCARAGGTCGCTTAGGTGAAATTGGAGTAATTCCC 

TTATGGGGGTCTTAAAATGTAAGTGAATGTCCTTATCCGGGGTGACT 

ATAGGTGATTGTTeGGGGCeGACTGTGTGCGGGT 

GAGTTTGCAACTTGGGTTTTTCAGCGAGCTTTTTTT 

CCCCTTCCCTCCCCTTTCTTTACATATAGGAGATGGGATACTCATTCCCGCrrc 
CXCAGCTTTCGCCCGCCCCAGCGCCCGCTCTCCCT^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTAATCTGCCCGGT AACAATCAGCGCG CAGTAGCAGGAG CCCCAGAGCTATTGGCTA 

TGCAAATAGAGGGAGGGGAGACGGCGCCCCAAACTCTTCCTCACCCTTTTAA^ 

TCCGCCCCACCCrCGTTTAAAGAGGCTGGCTCCGGGGCCTGAGTTAATCGCTTGC^^ 



TCGGTGAGGGAGGCACTCAGTCGGCCTCtMTGTGCCCAGAGAGCTC^ 
GGTCAAGAAACGCCACCAGATCGTATC^^ 

CTGGCGGCGACTTCACCGCAGTCGGCTCCCAGGGAGAAAGCCTGGCG 
CGAAGGACCGAGCGTGGAGGCCTCATGCCTCCGGGCAAAGGAAGGGGTGGTGGTGT^ 
CTAATCCCTCACTCGCCCCCTCCCCCTCCCGGGCCATTTCCTAGAAAGCTGC^ 
GCGGCTTGTCAGCAGATGCAGGGGCGAGGAAGCGGGTTTTTCCTGCGTG^ 

C CGGCCTG CGG CGGC CCT AG ACGCTGCAC CGCGT CG CCCCACGG CG C C CG AAG AGC C CC CAG AAAC ACG ATGGTTT CTGCTCG AGG 

ATCACAT^CTATCCCTCCAGAGAAGCACCCCCCTTCCTTCCTAATACCCACCTCTCCCTCC 

CAGGGGGGGGCAGAAGGGACGTTGTTCTGGTCCCTTTAATCGGGGCTTTCGAA^ 

GGGACATGACCTTTATCTCTGGGTATGCGAGGTTGCTATTTTCTAAAATCA 

CTAAATTGTCTGAGGTCACCCCATCTTCAGATAATCTACCCTACATTCCTGG^ 

TTTTGAAGAAGCCAAAGTIXXaAGGGTCGTATTTTGGCGTGCTACACCTACAGAATG^ 

GT AGTTTTTTGTGGG TTG CCCTGTC CGGGGCCC CTGG CATG CAGGGCTGG ATGGAGGGAGAGGGGTGGGGGG TGG CGGGGGACCG C 

GTTTGAAGTTGGGTCGGGCCAGCTGCTGTTCTCCTTAATAACGAGAGGGG 

GGGAGGACCGGGAGGGGAGGAAAGGGGAGGAGGAACCAGAGCGGTGAGCGCGGG 

TGCGTCAACG CTTCAGAGCGGAGAAGAGCGAGCAGGGG AG AGCGAGACCAGTTTT AAGGGG AGGACC^ CAGCCC 

CGAGGCTCTGCTCGCCCACCACCCAATCATCGCCTCCCTTCTGCTCCACCTTCTCT 

CTATTTAGCCAAAGGAAGGAGGTCAGGGGAACGCTCTCCCCTCCCC 

GG AAAGCAGG AGGG AG AGGGG C CGC CGGGCTGGCGATGGAGCTGCTCTGCCACGAGG TGGAC C CGGT C CG CAGGG CCGTG CGGG AC 
CGCAACCTGCTCCG AGACGACCGCGTCCTGCAGAACCTGCTCACCAT CGAGG AG CG CT AC CTT CCGCAGTGCTCCTACTTCAAGTG 
. CGTGCAGAAGGACATCCAACCCTACATGCGCAGAATGGTGGCCACCT^ 
GCCAGGACCCCrCCGGATGCTCGGGTCCCCGGCCGGAGCCCTAAACCTGGGAGAGGGCAATCCCCGCGCCGGC 
TGCGGGAGTTTACCGCGCGCCTTCTGGCGAGACGCGTGGCT 
GGAAAATCTCGGAGAAGCXjAGGCTGTCCTGGGCGGGGGTAGGGGAGCATCC 
CCCACCCCCCTCGCGACCTGTCTTTTGCGAAGCCXSCCGCGGCTGCTT^ 
ACCCCGAGTAGAAAGGCAACCCCCCCCAAAAGGCCAGAGCAAATTCGTCTTGGCCTCAGCT 
CTGGCACTTCACCGGGGAGGTGGAGGGAGGAGGGAGAAGGAGAGAAACGGGGAATT 
GCGTGTTTCAGGGGAACCCAAAAGAACCGCTTCTTGCCCCTCACTCCAA 
GCTCTGGACCTGCCGTGGTTTCGCCATGTTGCTTTGCAA^ 

CCATTATCCCC^CCACCCCACTTCTAATCGTGCCCTCTCCCCCACCCCCCACCTCACGAAGATTACTTACAGT^ 

TTCCTCGGTGCGGAGATTTGGGTGGGGTAGGAAGNNNNNNNNNNNN^^ 

ACCXK3AGGGCACCGAACX3GAACGGGGCACGGAACGACGGCGAACGAGGAGG 

GCAAAGAAACGCCGCCGACCCAAACCCCCGCACGACAACC^ 

AGAACGCAGCAACCCCACCCGCCAACCCCAGCAAACCGACGCCAAAAGCAAAAC^ 

AGCACCCCCGGCCGCGCCACCCCCCACCCCACGCCGGAAGGCCCGGAGACX5CAAAAOT 

AAAGACAAACGGGCAGCCCCCCCCCCAGCCTCCCGGCCCGC^C^CCC^ 

C^CCGCGTTCCCTAGTTTCTGTCCHTGTGGGCCGC^ 

CCAAAGCCCCAGGGGTCGGAATACAGACCTTTCTGGCACCCGAAACCT 

GCCAAAAGGGCTTTTCrTAGGATATTTCCTCGATTTTTAATTATTTTTTGAAACC 

CTGCGGCGCTCTTCCTCCTCTCCTCCCTCCCCCTCTCTTCCCCACCT 

GGGATCCCCCCAGACATTTTTTCCAATTGTCGGGAATGATAGAGGA 

GAGCCTCCAGAATATTTTTATTGATTTTTTGAAAAGATGACGAAATCCAAAAAAGAG 

CCAAAGGGAGCGTCGCGGAGCTGCCGCGGCTGCTGCGCACTT 

GATCCCTGGGAATACCAAAGCACTGATGGGCTATTCTGATTCACTCCAGTTTCCT 

TGGTCCCCTCCCCCTCCCACAAAAAAAAATTAATTTTTTTTGTT^ 

GCTATGCTCTCCACCCCCGCCCCCCAACCCTTTCCCACrrCCCATTATAC^TCTGTGAGGAACAGAAGTGCGAAG 
CTCTGGCCATGAATTACCTGGACCGTTTCTTGGCrGGGGTCCCGACT 

TTCCTGX3CCTCCAAACTCAAAGAGACCAGCCCGCTGACCGCGGAGAAG CTGTG C ATTT ACACCGACAACTCCATCAAGC CTCAGGA 



GCCAGAGCAAATTCTTGGGATCCAGAATGACCCCACCAATAGAATT^ 

TGTTTATTTTTTGTGTGTTCCTACTATGTGCCCAGGCTCCGTGCCCAGCACTGAGGCTACATCTG 

TTTGGGTTGGTGAGACTCATTTGCAATTGTGTATGCATGTGTGTAGGGGACGC^ 

AATTAAATTCCAAATTTCC^AAAATAGTCTGAGAACGTGTTCACATGATAGGTA 

TCAAAGGTAATGCCCTCTATATCTACAAAGTCTCAAGATACCA^ 

AGCAAAGTCAAAATGTGGCCTTTCACTGAAGGAGTCCXLAGG 

AGGGATCCTGTCTTATCTCTGCTTTTCTACCAGGGAA 

G ACGACCAT ATTGCAGCTGT AC CG CATGGAATAGCGGCT CCAAGGCCCACCTTCGX3TGGTTTGCACACCCCTCTTTGCACTGTTCA 
CTGAGCTCTTTGTGAGGAGCCTTCCrTTCTGGGGCTCTGCTG
AGAATGTGGAGAGGGGCCTGAATGACTTCACCTTTGAACCGCTGCCAGGTT^ 

AGGTG C TTT c rrCTGGGAGATCTGCTTCCTTTAATAACGGTGCAAGACCAGTGGTTCGTACT^ CATT ACCCTGTGAAGAGGAAGG 
GCTGGOVTTTCAAGCCATCrrTGAAGATGATGGTGGTTTA^ 
5 GCAGATTTTAAAGCTTATGAAGAAAACATTGTT^ 

ATATGGTTCATTTGTTTCGAGAGGATGGTGAATTGAGATGTTCT^ 
TGCTACAAACAAACTTOGCTTTTTCCTATTCCCTTTATCCCAGAAGT^ 
GAGGCCTGGGAGGGAAAGGGAATCACCCCCAGCCCCACAGCTGTGGGACTGGTGCT 
TCCAGGTTCTCCAGGTGAATGGGAGAGAGGGTGCATTGGCAAACAGGCCCTGCCT 
1 0 GCATATTTTCTGGTAGCACATACGATATTTTTAATGTTTTATTTT^ 
GTGAAAGAGGGAAACAAGGTTCTTTTTCAGGTGGGTTGCTG^ 
TGAGGGATGCTGTGACACTGAGCAGGCACCCTCACCTTGCCCC^ATCACCTCCA 
GTCCTTTAGTCATGATGACAATGGCCTGTGAAACATT^ 

AAAAGGGG AGTTTGAG CAG CAGAG AG AT AAAGG CACTTTTTGGCACTAAGT AAAAGCAAG CT AATTGGTTGC CAAGG AGTTG GGC C 
1 5 AAGGAACCTTATTTTGAGCTTTATCTGTGTC 

CAATAAATTACCATGAAAGGGCCACAATGACCTTGGCCGTATCTCATTC 

CCTGCAAGTCAGGGTTGGGAGGGGGTAAGGGGAGCTGGAGTGGAAACGAAGTAGAAGAAAA 

GCTCTCATTATTTTCAGTGOAGCCTGACATTTGAAGCATAAGAAGAGCT 

TGGGCCCGCCITACAATCTAGCAGCCACGTCCTAATGAGCGGCCTTAAAACT 
2 0 GTCAGCCTGTGGAATTTAGATTCTCCTCTTCTCTTCCTGCCTTCCCCTCCAGGA^ 

GGAACCTGGCAGCTGTCACTCCTCATGACTTCATTGAGCACATC^ 

CGCAAGCATGCTCAGACCTTCATTGCTCTGTGTCCCACCGGTAAGATGAGGC^ 

GGAGATGT CCX3GGG AG AGGCAAAAGGCCG T AGGAACTTT CATTTT AG TT CAGG AG TTTGGAGGAGGGGTGGT CTTG AGT AGT CT AA 
AGTCTGAAGTTTCATTCCTGGGTTGCrTTTTCTTTATCTGAGCTTTATAT 

2 5 AGGACAGGTGTGGTTAAGGCAAAAAAAATTAGTGCCTGTTAAAATATTCTT^ 

ACTGCTCACTGGTTGTGTTCAAATCTTTAATCT 

TGGGGGCAAATACAGGTTAAAGGAACAACGTCAAGATAAATATGCTGCCTCCCTTG 

TTTTTITTTTnrTTTACTGTAGGGCATTTATGGC 

CCTGGTTGGCTTTCGGTGAGGCITTACrTTCTGAGCGCTGGGT^ 

3 0 CAGCCTTTAACGTTGCAATTCCGCCTACCACGTGGATCCCACAATTGATACATCATTC 

TAATAAACCAAAGX5CAGTGGTTTGCTGTTCTCTTATTA 

TAGTGGCCATGGCAGAGACCCTGGAACCTTACTCATCCAAGGGAGAGATGTCAGG 

TAAAAAGCTGATCACTTGGGAAGACCTTTCGGTGTCCTTCACCTCA 

GGTAAC T rTT T TCAGGATGTTGTTACATGTTACTGGTGAGTTCTAC 

3 5 CCTGTGCTCACAGTCAAAAAGTCAGGCTTTGCTGCATATCAGTGAGTGACCTTTAATT 

TCTTCATCTCCAAAACAAGCCTATTGATACATCATCCTACAGCCTITTCAGAGCAGCTAGGA 
GCACGTGGACGTGCTTTGTAAACCGAAGTGTAGGTTGGACCTTCA.CTAATTGTG 
TTTAACTGTTCCGACTAAAGTGGATATGATGGCTrrCCTTTTCTGGCTGCCCTTACGTGTTTCT 
CTTGCAGGCTTGAGTGTGGAGTCCTCCCGGCCCCCTGCCTGTTCATCTAGGGCACAGGTTTCCCCTTGC^ 

4 0 GTTCTCCTGGATGCAGGCCAGGCCTCATGGCGTGCACATGCGCCCTGTG CCCCTTGGTGGGAGAGTCCGTTATCCTTCCTG CAGCC 

CTGGGGGCGGTTCTTAGGACCCCTGTGGACCCCTGCTGCCTTTTGGATAGTGGTCACT 
CCGGGCCCGAGAGGTCAGTCCTCCAGTGACCTGTCATTTTGTCTCCCAG 

TAGCCAGAGCTCTGCTCAACACTGTCrCTTGACTAACAGATCTTTrG 

4 5 GGGTAGGGGTGCTGGGTGGAGTTACCGGGGTGTGGGCAGATCCCCAGATTCCGTCTAGACCCATrCTCGA 

CCCCCACCCGTGGCCTTCTTGCCCAGATCTTTGGGCTGGCTGCCAGCTCCGGGGCCAGAGTAGGGAGCCT 
GGACAGAGGCCCAGAGGGATACAGAGGCAGTTATAAAGACTGTGTTTTGGGGGCCTTGAACTAGCT^ 

GACACCCCCAAACCACATGAAAATGGTGTGCTAGGCACTAAGCCTCAGGCGTGGACAGGGCAGAGCACAGGTAGGTCCTCGGC 
TTCCAGGGTCTGG AAAGG AGGGGCCTGTATCGAAAGAAGTTGCTTTCCCTCATCTGCACTTGG^ 

5 0 ATAGAATCCTGGCTGCGTGGAGTTTTACAACAGTCGGCCTCACAGTACAA 

CATGGAGAGGCGATTGTGCATTGTGGGGCCAGGGTGTGATGAATGGGCAAGGACCCAGGACTCTGGCCAG 

ACTGAGAGGCAGAGGTAGGAGATGCAGATTCCCCAACCCGAGCTGTTGCCCTCACCTGCTCCTCAAATCGCACACTTCCCTC 

TCTCGCAGTCCTGGATGTTCCGTCCTGGAGCTGATGAAACACCTGCGGCCAGGGTCTCGGAGGTCCACAGTGG^ 

GTCCCG AGAG ATGG CCGCCAACCCTCAGGCCGTGCAGCTATTTG ATGGG AAAAG CTGGTTGTGAAAGAGCTGGCAGTGGTTCTGCA 

5 5 GTGTGTGGGGAGCGTATTTGTGTGTGAAAGGGGAAGGGAGAGCAGGAGAACATAGAGGGGCCGCGTAAGTGTG^ 

CTTGCTrTATGGGGCTCTGGATTAAGATGGTGGCA 

GAAGAGCAAGGGCCGCAAGCTGCTGGTATCCCTCTGCAGCAGGTCCTGGGCAGTGCAGGCATGG^ 

CTGACTACAGCTCCTAGCATTCCTGTTCCTGGAGTTTCCAAAACTGGGGAAGCAGAACTGAGCCATAGCTTCCC^ 

TGCTGATAGAAGACCCAGGTCTGCAAGCGAGGAAAGAAGGCGRTTG^ 

6 0 GAGATGTGATACCATGCTCGGTGGATGATTCCAGTTAGTCTAGCAGAGCCAATGGCATTGAGTTTACACCTG^ 

GAGGTGCTCACCCTCCCTCTCCAGGCACCCACATCCCCTCTTTCTTCACTAGAAGCCCACCCTAAGATG 
TGCCACACACGGTAGCCACCCCCAGGCCTCCCCACGCCAA.GGCCAGAGCCCCATTGTTGACTAGAACCCA 
CCTTCCCTGGCTTTACCTACAGGTGTTCCCTCAGATTCCATCGCCAAGTGAGAGGGAGACAGTGGTGGCCC 
CTTGGCCCTTGATCTAGAGTGGGAGATTTCGCCTTGATGGGTGCTGGGAAGGAAGAATGGAAACCCGTCTTTCCA 
6 5 CTG AGCT AC C CCTG AAGC CG CGAGG TCATGGG ACTG AGGGGG AGTCGTT AG CATG C CCTGTGGGG ACAAG CAG CCAGGCAGTGTGG 



C^GTGGCAGATGAGGTGCGTGCCCAGTGACGTCCCTGGCCTGTC 



7 0 GAACAGAAGCAAAGCCATCACCCCTGCACCCCAAGACCTCGGCCACCA 
CATGTATTCCGTGGTTTTAAACTTACTTAGCTCTGAACTCTTTATTTC^ 
AAATGGAGCACTTCTCAGGGCCCATG^CCAGGTGAGGGAGCCCAGCGGGACCCCCAGAGGCTC 

GC CTTTTGTGCTTGCTGGTG CCG<X^CTGCGGCTGGGACCCAATTCCTTTG ATTCCTG AG CCAG AGCGCCACCTTTGGGGCTCTTC 
TCCCACTTCCGACGTGAAAGGCTGCCCTGTCCTCCTGAGTGGTATATGGGAGGTGACTTTGAGGAG 
7 5 TACGAAGAGCTCTG CIATCCAGAGTCTCAAAGTCTCATCCGGATCCCAGG CCCAACACTCACAG7GAG7GACATCAGCCAGATCACT 
TCMCTGCCAAGftCTTCATTGATTCAGCT

CATCATCACCGCCGTCATAAGCAAGCATGCAJGCCCACAGCAG AGCTGCC CCTGGCGC CTGTGGGTCCAGCTGGGATTCCCCT AAGG 
TGTGTCCTTTGCCTCCACCATTTTGCCAGGAGGGAGGCGGGAATGTCGG^ 
GTGAACTGCATCTCAATGAATTGTGGAATGTGCTGGGTTAGAGTTGCCAAC^ 
5 TTTTCATTCCCCTGTGGGGCTGCACTTTCTCCCAGTTCTGCA 
CACATACACAACTGGATAGTGACAGATTCrrCCAGGAAGCACAG 

CCTGCAGGAAGTACTAGGAGTGAGAGCAGAGGAGGGGCAGGTC 

CACTGTGCTTTTACAAGGATTATTTTAAGTGAATCCTCAGAACA^ 
1 0 AGAAAATGAAGACTCAGAGAGGTCAGGTAACTTGCTCATCTTCACACAC^ 

ATTCCAGAGCTAATACTCTGAACCAGTGAGCTGTGTTCTCCAGAGGTCT 

GACGGCCTCTGTTGCCCATGGCAAGGCGTGTGGXnXSCTCCTGCTCTCTTC^ 

CTGGGTCTGGTGGGGACTGGGGCAGGAGTGGAGGCTGCCCAAGCTCCGTGCACTCC^ 

TCTCCAAATCCTTCCCATCTCCCTGAAGAGTCTTCTGAGA 
1 5 TTTTTCTTTTCTTTTTCTTTTTTTTTTTO 

CAACCTCCAACTCCCGGGTTCAAGTGATTCTCTTGCCTAAGCCTCCAGAATAGCT 

TAATTTTTGTATTTTTAGTACAGACAGGGTTTCACCGT 

TCGGCCTCCCAAAGTGCTAGGATTACACTGTGAGCCACTGCGCCCAGCCrr 

GTTATCCATATAGTAGTACAGACCTAGCCATTATAGCACACCATCCTCTACTATCTAATGCATTGA 
2 0 GAATATTGTACGTTTTGGGGGGAGTCCTCCATGCTGCCTGGAACAGGG 

GATCGGTTCATTGCCTCTCTCTCTGCTGCAGGAGGAAGGATCTCAGGAAGTCACCAG 
GATGGCGAGGGCACACCCTCACCCCTGAGCGTGCCTTCTGCACCAGCATA 
ACTGCTGAGAAATTTGTACCTGGTTTTTATTTGTTAAAATAATATTACGTACAAATTT^ 
AACAGAACAGCTTTTATTTTCCGTCCATCCTTCCAGTTCATCCATAGGC^ 

2 5 CGTCGCATGCATGCATGCAATTTGGTTTTCCGCTTCCrTCTTCTGAGTGGA 

AAGCTGATGGTTTCCAGGAATGGCTGCATTGGGTAGGTGGAGCTTTCCACAGTTGA^ 
GCTAGTGCACAGATGAAATGATGTTTGATTTTTTTGTTTTTCTCT 
AGAGTGGTCCTGGGG CAGC CCTGGTGCCACGGGG AAGTAGCGC^ 

ACACCGTGATCCATTCTATTGTCTCCTCATTGTTCTTCTCCTTGGGAGTCATTCCCATT 

3 0 ACCTGTAAGGCACCTGGGAGGTTATCCTGCTTACTATCTTTAAACTCTTGACCT 

CAACT CAGGAT ACTTGT ACAAGTAT ATCAAACTG ATACAAAAGTTT CT C CG AAC AAT ATTT ACTCTT AACG ACAT ACGG TG AG C CT 

GATACTTTCTATCCTACTCTATTTGGTTAAAGGAGAACAACAACAGAAAGTGCTG^ 

TTAATGAGTTGCAGCTAGAAGTTAGAAAAAC^CTTATCT 

GGAAGTGACTTGCCCACAGTCCCACGTCTGGACCTAGCTGGAGGGCTGATGTTGCCTCAGTATTTG^ 

3 5 GGGCATGCTCAGGGATCCCGAGGAAGATGCTCGTCTTCTGTGTAGCGGGCAACAGAGGGCAGCACGG 

CGGCGTCTCTCCTCTGGCTGTGGGAACACACTGCTCCTTTGGTGCC^ 

TGCTCCCCAATTCTGTGGCATTGGACAGTGTCCCCAAGAGGACTCCCCTGAAAATCTTCTTAGTAA 

CTCCCTGTGCAACCGGTGCACCTGTTCCCCCTATGCCTGCTGGCTTCCAGAAGTTCATTCGTATGCCA^ 

GCTAGCAGTGGGACAGCAGAGATGGGCATGTGTGAGGTCTGCTTCCCGGTCTGA 

4 0 TACTTGCCTGCCTCTTCCCTGGCAGATGCTCCCACCCG 

GAATGTCACAGGGGGGAGCTAACGAGGTTATTGGGCAGAGAAACACTCTCCTGTGACC 

G AGGCAGGGAGGCTGG CTTTGTCACACTGTTCATGGCGACAGAGAAGGCGTAGGAGGGGCGGGTGGACAG CAGGCGTGGGCGCAGC 

ACCATTATATACGTCCCTCAAGAGGTGGAGATGGTGAGAGGGCAATGGAGAAAAAGCCAAGG 

TCrGCGTTCATGATGGGAGGGGAGGGCrGTGCTTGGCTCAGGGATCCACGG^ 

4 5 TGGGCTGACCATTAAAAGAGCTGCTGTAGAAAGTGTGTTTGTAATAGAGTG 

TTTTTGTAGATTTAACCTGATTGTTTTTGCTGTAAGTTGAAGGGTGACCCACGGTTCCCTGTATGCTGTG 

GCCCCTTCCATGTCTAC^CACAGTAAGCTCACTAGAGCAATGCTAAAACGAGAAA 

GGACCGTCACTAGCGTGATGGCTGCAACTATACCCACCCTAATATCTGTCAACACGAGCA 

CAGGGTTCn'CACCCATACCATTAGGGTGACGCTCGTGTGC^TCTTCTGTGGCTGATGGGTAAATTAAACAGGACG 

5 0 GTACTGCATGTGGTGGTCATTGGTAGCTGTCAGCAGCTGTAACAGGAGAAGT^ 

GCTGCTGGATCATGGCCACCCAGCCCCTCTGAAGG AGTG CTCGG AGG AACAC AAGGGGGCTGGTCTTC CAA GAGGAGC CCTCTAGC 
ATGG CAGTGGACGGGGTTCAAGCCC CTCTG ATTTATT ATCCTCATTCTGCACTTC CCTGTAGC^ 
AATAAAATGGGGCCTTCAGTCTCATGAGTCATAACATCGAAACCGAGTCACCCATCTGATTCGGATGCAATG^ 
CTTAGTTAG AG CCCTATGTTAGGGTTAGAATCCTCTGCCCT AATGTTCTG AGGGTTCTTGCTGTTGGG C CAGAGGTTC 

5 5 GTGGAACTCAGCCCCCACTCGCCCACTCCCTGCATTGCTGCTTCTCTCATTTCCTTTTTTAGATCAAAC 

GTGGGGGTCATATTAATCCTCAATGCCTGTCCAGAGACTATAACAAOATCCCATCTGAATTAGCATTATGCATAGATTTTGCCCGT 
TTCTCCCATCTCCITGTACTTTCCCTCCTTC 

GGTCAAGAATGTT AACTCAG ACATG CTTAAT ATGACTGGG AT CTGGGG ATTGGGG ACCTTG AATCCTCTGTGCATTCATTGTGTGC 

6 0 GCTTAGATAAGCTGTCTTCCTTCCCTGCCTCAGTCTCCTTCGTTTAATCrGGAACAGC 

CTTGTGAAAGAACTCGAGGGAGGACATGCCTGTAGGGCACGGGGTCACTCGCGCCGCTGACCTGCTGCCG 

CTGCAGTCTCGGGAGCTCCAGGATCCAAATTCGTTCTGTG CCCTGACCTT CTGCCTCTCATTCCTTGCAGACTTTAAGTTTGCC AT 
GTACCCACCGTCG ATG ATCGCAACTGGAAGTGTGGGAGCAGCCATCTCTGGG CTCCAGCAGG ATGAGG AAGTG A CTCGCTCACTT 
GTGATGCCCTGACTGAGCTGCTGGCTAAGATCACCAACACAGACGTGGTAGGTGGCCACCACCTTCTTGGCT 

6 5 CTTCTCAGCTCAGGGAAGGAGACGACGGATTTGATTATGTTGTGTCAGAG 

AGGG AGTG CAAAGTT CAGCAGGTGACACC CC CTCTT AAGGTTCTGAAAAG CTGTG CAC GGG AAAAT ACGG AAGAAAACTCAGCAAG 
CAGATGAAGGGGGTGGGGGTGGGAGGGCTGTTTACCGATGTTCCTTTATGGCGTGG^ 

TTGCCATATTTGTACACGCTGCCAGGCTG CCTCTGCCCGG CAGCGAAG AGCTGCCGTGTCTGCITTCACACGCTGAGGCTGTAAGG 
CGCACAGAGGAGAAAGGAGAGCCAGGACTCAGCTGCAGTGGATG 

7 0 GTCCCTGGCAAACCTCrCGACCCCTCCCCGCTCTCTCTTCCCTTTTGTTCAATGCAGTAA 

AAAGTGCCATAGAAACTCAAGAAACTATTATTAGCTACGGAGCTTCCCCATTCT 
CTTCCTGTTGCCAGAATTTGGAGCTCCTGAGAAGAAAGGTTTTGAGAGAGACCAGGGTCAAGTGA 

G AAGCTGC AGTC CT AGCCCGTCCGAGACAGG ATGGTGCCAAG AGGTCAGAGAGG CC ATTCCTTCGTTTT CCACGGGCAGG ACAGGG 
ACTX3AAGTAACCAGCCAGGAGCGGGGTGGGCAGTGGCTCAGCT 
7 5 CACrcCACCCTTGTTGCCGCTCCTGGGATCTTTTGGTTTGAAAAAAAACCCT 
C CCTGAGCCCTCATGCATCACCATGGAG AGGCTG AAG AATG CTGGGCACTTCACGATGACACCCTCTGGATGGTCCGGGGGGATTC
AGATGCCCTCTGCGGGCTTTTATTGAGTTAAGGGATCA.TGTGGTTTATAGGCCT 
GGTTAAACCTCAGAGAGCCCA.CGGCTGCAGCTCGCCGGCCrrCCTCGCCCGCCCACCTG^ 
AGAGAAATGGCGCAGGGAGATGCTGACGGAGCAAATCCGGGGTCCTCCTCTC 
5 TCCAAACATACGCTTTTTAAACGAGGACTCCAGAACACAGACTGAAAACCT 
CTCCAGAGGAAAATCCTXXIACGGGACTTAACGTTTAT^^ 
CTTACAAATAGGACTCTCAGCCAGTTTAAGACA 

CCACCCCCCTCCTCCTGCCCCCAGGCATTGTAACCTTTTCCCCCGATGTTGATAGACCT 

CTCCAGCCTTTTCCCACCTCCCTTTGTCCCCCGTTACTGATCTCACA 
1 0 ACGTCTCTTGGCTTTTCGAGACCGCAGGCAAGTCAGTC 

CCAGTCCATTTAACTCAGCTC^CCAACTCTGTTAAATGGATATC 

CTAACTGTTGGTCAAGTTGATATCTCATTTCTATAATCGGCACAC 

ACACCGGTGGACCTGGGTTGAACATGTTCATATTTAAAAACCAAGAGTGAACGCG^ 

AGAAAAATCTGAGCCCTCAGGAAGTTCCAGGCTTACCTCTGGCACCCCGGAGGCT 
1 5 AAAGGTTCAAGCAGCTCTGGGCTAGGCTGTGTCTGTGTGTGTGTGTGTGTG 

G CT AAAT CTC CTC C CATCCATCAG CCTGTGTTCTCTGAAAAGCTGACG CTCAGAGGG AAG AAAAAT AAAAGGACT AGAATTT AGAA 

CACATTAGTGTTTGGAGGGAGGTGAGTTGTGCTTCCTGAGCTCTGTTTACCA 

TCGTTTCTGGAAGTCTTTACTGAAGGCCCAGTTG 

CTGCCITCAAGGAACTTAGCAGTTGTATGATGAAGCAAGAGTAAAGCG 
2 0 TGCTACGTAACX5TGTTACGAACCCTAAATGCAGTAGGAAATCAACCCAAAAGAAGGAG 

CTCCCAGCAGAGAAAACTTGGTGATAGAAACTGTTCTTAGAGTAGAGAAACAA 

CTTAAAAAGAAATGAAATTCCGATCCATGTTATAACATGGGCGAACCTTGAAAATACTATG 

AAG ACAAAT ACT AT ATG ATTC CACTT AT AC CAGGT AC CTGG AAT AGTCAAATT CGT AG AG ACAG AAAGT AG AATAG AGGTTCCCAG 
GAGTCTGGGGAGAGGAGAGGATGGGGAGTTAGTGTTTAATGC^ 

2 5 GAGAGTTCTCATC^CCACACAGCAGTGTGAATCTACTTAATGCCACTGTACATT^ 

TGTGTATTTATGATAGTAAAATGTGTGTGTGTATATATGTATATACACACACATACATAAATGG 
ATGTCTXjAAACATTCTCGGGCC^CGCTATGAAATGCCC^TAGGTTGATGTTGAGCT 
ACATTTOSCCCCCTGGCCCTAGTGCATTGACAAACGG^ 
CAGCTCCTACACACCTTGGTGGACATGATCCCTTCAAGCT 

3 0 GATCGAGCTGTGAAAAGAGGACACCCCTCATTCITGAG 

TTG AT ATG AGG AACG C AT ACTGCT CTNNNNNNNNNNNNNNNNNNNNAG CT C CCAGAGCCCCTGAAGTT AT ATGT CA 

GCTCCTACACACCTTGCTGTTTCTGATTTCTTCAAGCTTGCAATGCTCT 
TCGAGCTCTGGAAAGAGGGTCCCCCTGATTCTT'GTGAGATGGCC 
GATATGAGGGACGCATGCGGCTCTGAGAAAGTCGGCCTTTGTTGCTCCGTCTGGTT 

3 5 AGAGAAACCTGAGTGAGGGGCCTGACAGTTTTATAC^AATAAACGTCTCAGAOft-GTGGTATAT^ 

TCGTAAGAGTTGCTAAAAACAGCCAGAGAGGCAGTTTGGCTGTAGGGGGGA 

GAGAGGGGAGGGTGCCACTGGCCTGGGTCCCCGGTGACCATGGCCCTGCTTCCTTGGGCTGAATCGGGGTCCT 

TGCACACTAGTCC^GCTGCGTGCCAGGATTGATCTCACTGCTTCCTTG 

GCTCTTTATCCAGAGCCCCTGGCAGAGTCATTGGGAGTGATGCTTAAGAGAG^ 

4 0 GCAGGCAGATGGGTTTCCAGTGGGTTTCCGATTCAC 

CTGTGTCAGGGCCTTCCGAGCCTTCTGAGTTTTCCCCAACCTTATCTTAGAGTAGGCT 
ACAGTTGCACAAATTCACTAACTAAAAAATGGAATGCAGGCTCACAGGCC^ 
AGAAGAAAGACTCTAGTCATCTGTTCCTTATATTTATCCTTTTTTGTCTAAGTCACAT 
CAAAAAGC^AAAAACAAAACTTTAGATTCAGGGGATCCATGTGCGGTT^ 
45 TTCAGTCGAACCCGTCACCCAGAC^GTGTACGCAGTACCCAATAGGGAGTGTTTCAACT 
AAATATTGCCCAGAGAAGATTCCAC^ATATCCCTTCCTTAC^GTCATCCT 
AAAGCAGGGAGGTGGAGGCGCCGTGGAAGAACTCTCAGTTTCGACCACATGTGTGTGAro 
GCTCCGTAGCTC^AAGTGCCGCCACCCTCGCCCCACATGTCATTTTCAAGCCTCCACTGTTACAGAT^ 
TCIAGGGCCTGTG^TCAGGAACCTTGTGAAGTGTTGCTGGAAGCT 

5 0 TCTCCTGCCTCATTTTTCTGTGGGCCTCATTTGGTATGCAAGGAAATAAGCCAAACAGCCTGTGAA 

TTCCTAAGTTGTCTCTGCCTCAGTTGGCCGCGGGTCTGGGGCT 

TGACAATGAGGTGGTGCAGTTACACTGAG<KSCTAAT<K3TAGGAGTTCCGTCAGCCCATT 

CGGAGCTCGAGCCAGCAAG(ZAGAGGAAGTCAGAGGCCCTTGTAACGAGCTGAAGATCAC!AAACACT 

GGGGGCGGGGGCAGCCAGGAjGGGAAGGGG<5CAAGGGAGGGGGCCTATCATC^ 

5 5 TAGGGTACCACCTGTCC^GGCTGACAGACGAGCATTATGGCT 

GGTGCTTTGGAGGCTGCAC^AAAGCTGATGTCCTAGAGCCGCCACACTTGACATGCGTGTAACACCT^ 
GTGTTTTTCCAGCCACTTGGGCTCTGTCTCTTTTTCTTC^IAGATTCTCTGGTCTGTATGTCTTTACTTA 
TATATAGGAAAAAGGAGTTCTGAAATTGGAGGCTTGCCC1TTCTAAAGCCAGTCTGAG 
CACXTCAACTCTCCACGCTCCATTCCCTAGCTGGGAAAC 

6 0 TTCTCCAAGTCTGTGATTATTTGTGTATGTCTTCTTATAATCCAGCCCCTTT 

CCTCT AAT AG AAGCAACCTGGTTGGATTT AAGGTGG AGGTCTTCCTCTCT CCAGGTGGCTGCCCTCATCTCT ACCCC CACCCCAAT 

CCCATGCATTCTCCTGCCCTCCTGCTTTGGGCAAGACCCAGGCGATOT 

CACCATCTCCTCTGCCTTCACTTCCCACCAGCACACAGAATTTATATGCCCT 

CAAAGGTATTCATATGCAAGG CATGAGCTCTGTCTCATGG AAGGGAGAGG CCAAGTTCTCTTTCCAC CATCAGAG AGTAGCGTTCA 

6 5 ACAT CATTT AT ACAAG ACAGG CTTTTG C C CTTTT AGTCTTG CTGAACT AG AATGAGGGATTTGTGT AGT CG AAAGTCAAG CTG ATC 

ATTGTCAAGTTTCGTGGAAGTGGAACCnSAAGGAAGGTTAGGTGGAGACT 
AGTCTAGAAGAGGGTGGGGAAGGGAGTTCAGAATT(TTTCATTCATGGAGT^ 

TCATCTGAGGTCJAGGAGTTCX5AGACCAGCCTGACCAATCTGGTGA 

7 0 ATGGCGTGCACCTGTAATCCCAGCTACTTCGGATGCTGAGACAGGAG 

AGATCGOVCCATOSCACTCCAGCCTGAGTGACAGTGAGACTCCGTCTCAAAAAACAAA^ 
ATGATTTGGATTGAAATTAAAATGTGACTAAAGAGAGAGTATGCAGTTTTGTCTATCTATGAAACCT 
CACCTGCGCACCCACTCTTGK3CTGGAACTTACATAAAAC^ 
CCAGTAGAAGTGACAGCCAGACATCCCAGTCCTCCCTGTAGGG 
7 5 GAGGOCTGATAGCAGAAAAAGTGGGGACACTGGGAACT 
GAGCCTGGTCTCCCTATGTGGAGGGTC^TC

TATTATTGCGGTAAAAGATACATTAACCCAGATGACCTTGCCGGGGGCCAGAATAGA^ 
CGGCCGGGAGGCX5AGAGGGATTTTGTTGAGGAGGAAGG 

ATGCGAAACCCTCTTTAAAGGAACCGCATCGCTACGCCTAACGGGCAriTCl TTT 1 1 AATGTAATGGTTCAGAGCTATTGTCTACC 
5 ACGCCTCGCGTGCACACGCACACACACGCAAGTTCCCTCAGTC^ 
GGCCTTATGCTITGGGTAGGATTTGTCITCCATGGACA^ 

GATACTCCGATTTAGTCCCTCTGCCTCTTGGGGAAAAAAAAAAAAAATTGTAAACATC 

CAGTGGAGGAACATTACACGTGAACTTCCTGGATTCAATGGATTC^ 

AAGAAAATATCAATTTAGATTGTTAGGTCCAACGTTTCACTCAGTGAG^ 
1 0 GTTGAAGCrcCTGTTCTGGGTCTCAGCTCCTGCCCATGTACCXrre 

AACTCCTTCACACGGTGCTCAAGTGGAGAAAGGGCTGTCCA 

ATTCCTCTCTCATGTAGAACCTGCCTAAGGGACATTCCAa^GCAGTTC^^ 

TTTGATTCTATATTTAGTATTAAATTTTTTAGATTATTAGGATTT 

ACATTATACTTTCAGGCCGGGCTCAGTGGCTTATGCCTGTAATCCCAGTACTT 
1 5 GGAGTTCAAGACCAGCCTGOTCAATCTGGTGAAATCCCATCTCAACTATAAATACAAAAATCAG 

ATAATCCCAGCTGCTCGGGAGGCTGAGGAAGGAGAATTGCTTGAACCCGAGAGGTGGAG^ 

GCACTCCAGC CTGGGCG ACAG AG CAACACT CTG T CT CAAAAAAAAAAAAAAAAAAAAAAAAAAC CAG AAAAG AAAAGAAACATT AT 
ACCTTAACTTTTCGCACCTAGTGAGTGATAACCTATO 

GAACAGGGATGAGGCAGGTGAGCCCCATGTGATTTATCGTCATTCATTCAGCTCAACAGGTCGGACAG 
2 0 AACCAGGAGATGCCACAGGAACGTCTGAGAGGTGTTAGCAAGTCCCAGAGGCCAGCGCCCGACT 
GGCACGGAGACTCCTGCTGCTGCGTCACGTCTGTCATGTTCAGCACT 
CTTGACACACGCTTCATTAGTCTTGACTAAATGCGAATCT^ 
TTTATTGTTATACATAATCACGATCATTTATTGAGCTCC^ 

TGATTATTTTGTGGGGAAACTGAG AT CAGAAAAGTTAAAT AACTTAGG TGCCAC CTG CAAGT AT AG TGGG TGGG CAG CAGAT ACCA 

2 5 CCCTTCCTCATTTCATTGTTCCAGGTGATATTTCGTCT 

GAAAGCTGTAGTTTGAAATGTGGGATAGATGAGAGATAGGAAAGCAAAATTTAGGGG 
TCCAGCTTTATTTTGTTTATTTATATGTTTGTTTTTAAACrATCGC^ 
TCAG CTTCCCGCCTCAGCCTCCCAGAGTGCTGGGATTACA 
AATAAAGAGAATGCTTTTGGGCTTGGCAAAATAGAAACCTCTG 

3 0 TATTAAACTATGAAAAGACACAGGGAGGT^CAATTTAACATATTTGCCTT 

tttgcagaggagagaggcnagaaaatgcccgttagaagcaagtctacggtt^ 
ctagtgatacccatgacttctcattgcattgcatgtcatctgcctaagagaagctcct 
cctacaatatagactgatactaaagcaaggctgcatattcttgc^ 
acattctgcattatatctttatgagagttaaggaaatggtaaaga^ 

3 5 tgaggcgcggtggctcatgcctgtaatcccagcacittgggag<;cc<^ 

tggccaatatggtgaaaccctgtctgtactaaaaatacaaaaattagc™ 
ggaggctgaggcaggagaatcxscttgaacccggaaggtggaggttgcagtgagccgagat^ 
acagagtgagaaaaagtatttacagtttatgtgac^gtcagtacgttc^ttcat^ 
tttgctaatgtcatagcacttctagtagccttcccctttt 

4 0 ccttgattaccaggcatccataagtgaccatcatttaggccagtgct 

gttctgaatagctcggcctgggttgactgagattctgcggttccaacaagctcgcaggtgatgctgatgctg 
ccacctggagtagtgaggttccttggcactttaagaacatccctcctt^^ 
tatgcaagctaaattacgcatgttttctccgtaggatgctct 
ctaggattgtctcaaagcttcccac^agcagattgaggcggtgctcctcaatagc 
45 gatccaagtcggaggatgaactggaccaagccagcacccctacagacgtg cggg at atcgacctgtg agg atgc cagttgggccg a 
aagagagagacgcgtccataatctggtctcttcttctttctggtt 
attctgcccccacctagatcatatttaaagatcttttagaagtgagag^ 
ggtgcctatttgaagtacagcataagggaatcccttgtatatgcgaacagttattgtttgat^ 
ttacaggaaaacctgcagagtagttagagaatatgtatgcctgcaatatgggaacaaattagaggagac 

5 0 tgagctagcacatacacccccttgtagtataatttcaaggaactgtg 

ctcaagaaggaattgaaataaggagggacatgatggggaaggagtacaaaacaatctctcaacatgat^ 

ctgctacctaaaaaaagatgttttattttgccagttggac^^ 
ttcttcc^aatgcagttcattgcagacaccaccatattgctatctaatgg<x3aa^ 

5 5 gaaacggaggcagatggagaccaagggtcgxiatccagaat 

ATTTCTTCrTAGAAAAAAACrAArrrTIXKn'G 

AAAGCACTTrGAAAAATTGTTCCCGAGOSATAGATGGGATGGTTTATGCAAGTCATGCTG 

CCCCTCCCTTCCTGCCCCCAGTCTGGGTTACTCTTCGCTTCrGGTATCTGGCGTTCTTTGGTAC^ 

GACTCAAGAGACACCCCITCCrGCTGACATTCCCATCACAACATTCCTCAGACAAGCCT 

6 0 TGGCACAGAAGGATCTTAATTCCCATCTCTATACTTCTCCTTTGGACATGGAAAGAAA^ 

TGAACATCAGGGTGTGGCATTTTGTTCCCTTTTCCGTTT^ 

GTACTTGAATTTTTCTTCCTCTCCACTTCTTAGAGGCATTCAGTTAGCAAAGA^ 

ACAATTGTAATTGACAGGTAATGAAGCTATTTGTTAAAATATTTGCCT 

AAG AATTATTTT ATACACAGATTCTG CCTTGTTTCAT AGTATG AGGGTTG AAGACGGAAAACAAT CTAAGGGTCTCTCATTTTTTT 

6 5 AATTTTGTTTTGTT C AGTTTGG TTTTTTTTTT^ CTG CT AAGAAGCT AAAGTCAT CC AT CCTT ATT CACG TTGACAGT AC 

CTAGCrTCTAATGTTTCACAGAGTGTGCTGCTATTTTATAAACAT 

AGTAGATGGTTCAGATATGAGTTCTTCGTACTGGAAAAGCCCTT^ 

TTT r i' l - r L " lT l TT T G<3TTTTTTGGTTTTT T T'l T riTCCrrCT 

ACAAAATCAATAACT ACAAAAGG CAATGATT C^CXJCTTTTGTTTT CAT AAT AC CTCACAACCGT ACAGTT^ CC 

7 0 CATTC<5CAT<3AGGAATACAGAAGCAGTGTGAGCAGGG CTGACTCCCTCTCAGGTGG AAGGCAGGG CGGTCTCACTCCCAGGGACCT 

TTTTGGTCATGGAGGCCATCGGGCTCCCAGTTAGACCCTGGTATCCT 

CCTCCCCTTCAAGG CAGACGTTCAGT ACAAACATTTATGCGGT AGG CTCAG ATGTCGT AATTTGCACTTAGGTACCAGGTGTCAGG 
AAACAGACTAAAAAGAATTCCACCAGGCTGTTTGGAGATCCTCATC^ 
CCITTCATCTTGAAGTTTTTCCCCTCCGTCTTTCCCT^ 
75 AGG TCTGAATCTG CG AG T AGATC AAC CTG CAG CAAGCAG CGTTT A7GG7G CTTCCTT CT C C CT COT C TUTCTCAAACTGCGCAGG C 
AAGCACTATGCAAGCCCAGGCCCTCTGCTAAGCGGTACTAAACGG
AGGTCAGATAAGTATGGGATGATAGTTGAAGGGAGGTGAAGAGGCTGCTTCT 
TCGGAAGTGTGTTTAGAAGGGTTCAGGACTTTGTGAGTTAGC^ 
TGAATTCTGAAGTTTTGGAGAGGGAAGTGGAGCAGCCAGCAACT 
5 ACTCTCCTGGGCCCCAAGGAGTCCCACCXSAATGGGGAAAGCGGGAAC^ 
GAGGTGCAAATTCATTTCATGGTGACTGACCCTTGAGCTT^^ 
AAGCCTACCCGACTCTATTTACAGTCTGTAACTTTCCACTCTTCCTGTAGTCCCG 

C CCATCCTTGGGG C CTTGTGTG ATG ATGGGTGTGGGGCTGC CGATGGG AAAGT Ct3G<^3GTTGTT AGG CTTTTCTGCCTGCTCCTGC 
TT AAACACAAGAAGG AAT C CTGGATTTTG CCCTCTCCTT AG CT CTT AG TCT CTTTGG T AGGAGTTTTGTTC CAGAGGAGCTCTCCC 
1 0 CCTTGGATTTGAACTTGCTCTTTTTGTTGTTGTTGTTCTT^ 
GGTCTTTTTCTACCTTGTTGTGTTTCTATCT^^ 

CCCCAGCCTGCCAAATTTTG ATCCTTCCCCTCTTTTGG CCAAATCCT AGGGGGAAGAAATC CTAGTATGCCAAAAATATATGCTAA 
G CAT AATTAAACTC CATG CGGGTCC^T AACAG CCAA^ C CAAGGG CAGTTC CCT CCGCAGAACACCCCAT 

GCGTGCTGAGAGGCGAGCTCCrrcAAGAAGGGGCTGTTCT^ 
1 5 ACAGCATGCCTTACTACTGGGTGATCCTTGGT 

CACACATGGCTTAAGTGACAAAGCCAGAGGAGAAGACAACCCTGACAGCATCAC^ 
CTrCAGACGGAGCTGCGCTTCCCTGCAGTCTAGCACCTCTAGGGCCT 

AAG AACAT AAGG CAGG ATCAGATGACTCT CTCC AAGAGGGCAGGGG AATTTT CT CT CCATGGG C CACAGGGG ACAGGG CTGGG AG A 
AGAAATAGACITGCACCTTATGTCATGTAAATAATTGATTTTCT 
2 0 GAAGGGGAGGAAGTCTCAGTAAGCCAGTTGGCITCTAAGCCAAAAGG^ 

AAT AG CTTTAAAAGGGAAATT AATGCTG AGATG ATAAAGT C C C CTT AAG CCAACAAAC CCTCTGTAG CTATAGAATGAGTGCAGGT 
TTCTATTGGTGTGGACTCAGAGCAATTTACAAGAGCTGTTC^ 
GATATTTATTACTTCCTCATACCACATGGCTrTTGATGATTCTGGATTCT 
CTACATTCGTGTGTGTCTGCTTTTAAACTTGGCTGGGCTATCAGACCCT 

2 5 TGTACGTGCATGCTGTAGCTGCAGCCTGCATCCCTTCGCCTGCAGCCTACT 

TACAGTATCCAATCTCITTTGACAGGTGCCTGTCCTrGAAAAACAAAG 
CTGGCCAACTCTTACATCCCCAGCAAATCATCGGGCCATTGGATTT^ 

TCTCTCrCTCTCTCCTCTCTCTCAGTTATATAGTrrcrrGTCTTGGAtJ'l 111 T 11111 LI XX TCTTTTTC in 11 r 1 r i i iGCTTT 
AAAACIAAGTGTGATGCCATATCAAGTCCATGTTATTCTCTCACAGTGTACT 

3 0 TTAGAAAGTGCTGATAAGTAGCATGATCAGTGTATGCGAAAAGGTTT^ 

TGACATGATATAGTCAGCTGCCTTTTAAGAGGTCTTATC^ 
AGTTTAAAGATGGATTTGAAAATGGTTTTGAATGC^TTAGGTTATGCT 
GCCGTTTTTGACTTATTTATAAACCAGCAGTCCTCAGAATGGAATACACTTGTCTCATGTCAG 
TTTGGCAGTGGCATAAAAGACACATTCTAACCTGGAATGGAATCTACATGTGCCTCTGTGTTAT 

3 5 ATTGCTAGGGGACGAGGGGTGCCCATGCAGCCAGCTrATAAGCTAGAAGCCTC^ 

ACTATGACTGGTCCTCAGAGGATCGTCCCTGATTGTTGCCAGCT 

TACCAGAAGCCTGCGGTGCCATCTCCAGAGTCTAGGGAGTCTCTGGGAACCCTGCCCCTC 
TCTTCGTGGCTCCCACCTGATTTCTGATCCTTCCTCCA 

AAGCAGAGCCTGGGAATGGGATGTGACAGTCTCCXSTACTGCTGAAATGAATTGCC^ 

4 0 GCTCAGACCAGCTTCTTTCCCGAGAGTGATGGGTGTGCAGAAGACA^ 

TCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGOSCCCACC^CCA 

GTATAGCTTTGTTGCA.CAAGCTATAACAGACATTTTGATGTTAC^ 

CGAAGGCTGTTTTCCATTTTAATTCTGACCTGAGTTGTAAACAAGATTTAAAAC 

4 5 CAGGCTGATCTCAAGTAACTTCACTTAATAGGACrTTTA 

CATCCGCCATTCTTAGCGACAAATGGGGGTAACTTTTTTCT^ 

GGGCCTCTGGTTGCCATTCTTAACCATCCTTGGAATTCTCTTTGGCATGACATAATGTCCAACT 

CTATCTTTTTGCTCTGTAGGATCAATGCTATATCTGTTTATAGCAT^ 

TGCTGGGCTTCCCCCATTCTGAACAATGGATGTTTTAAGGGCGGGAAGAGCAGATGAAGC^ 

5 0 CTGTCCCATCTACGCTCTTTC^GACCCAGATCATCTO 

TAAAAATGCAGTGAAGACCTGGAAATCCCCGATCTTrTGTGCGTGTTCAAGGGAGTGCTTTTGTATCT 
TTGGAGCITTTATTTAGTATGCAGAGX3TATCCCCAAAATTCACCTTCCCAGTTGGCT 
GAAGCAAGTACTCAAAAGCTGAATATTACACCAGATTACTTCCCTGCTCrCCT 
AGAGTAGGGAGATTCAGAGTATCAAGGTAAGCCTAGTACTCACAGCCGATGTGTTATGT 

5 5 AG CAC AG AAAAAAACAGGACACACAGCCCTATG AAAT AGC AG AG AGAAACAACAG CAAATG CAAATGAGTCTGGGGGGAAATCGAT 

GCTAAGGTGTGGGAGGGGGCCCTTGCAGAGGAACAGGTTTCCTCAAGCAAGTGAGTTTTAGGAAAAC^ 
TGTGGGTCAGCATAGTTTGGAAACGGAAACAGAGTGTGACCTTTTCCAAAA^ 
AGGGAAAATATTCCACACAATTTAATAGATTTTAAACAATATAAACACAG 
AACATTGATTTGGGATTACCACCCACCTT^TTTCTCATGAGGCTTAAAGA 
60 CCTTGTCTTTGGGCTCTGGGGGCCTCCCCGTGCTGGAG

GCGGCCTGAGAAAGCAAGACCAGCGCCCCTGCCTCTGTCTCATTGTCCCCCTG 
ACTCATTATTATGGTTTCTTTCATGCTGATTCGCTCGGAGG^ 
ACTGTCCGCATGGTTTGAACCTGCAGTTCTTCCTGCGAAGTCCACTA 
ACCCAGAGCCAGCCGGGTGTAGGTTGCCTCTGCCCTGCACGGAAGGGAACACCTCCCT 

6 5 CGCCTCCGCCATCTCCCCAGGGCAGCCTGGGGAAGGAACGTCAGGCCTTTC^ 

TCTCACTACCGCCACATCGTGACTCCATCTGGGGTGGCrGGTC 

ATGGCACACTGGCAGCTTACAGACCAAATCCAGCCTGCAGACATAATTAGCT 

ACTTGCCAATACAAGATGAGAGCTTTCACATAAAAGTCAGGAT 

CG CXSAGGCCGCGAGGGCTGCAGCTGAGTGTGACTGGCCCCTTTCTCrACCCCCTC 

7 0 TCTITTACGGCGCCrrGCCAGTCXJGCTTAGGGCTTACAGCTCTGCAG 

GAGCCAGTAAGTCCCAAGTGGAGTCATCCTCCTCCAACCCTCTCCACCACAAAC^ 
ATAACCTCTCCTTCTCAGACGAAGAAAGACCGTACTGCCTCAGTAAGCATTAAGATCTCTITATGTCT 
CAGATTACCCCOSGGGCrGGGCCAGATTITTGAAATTCCTTTC 
CAAA.TAAGGTGGGGTGGTAAAGATCTGCTTTTTGTTTTTTG^ 
75 TG AGG CT AG AGGG CAGTGG CX3TGATCACAACT CACTfJ C^GC CT CAACTT CCCA.GGCTCAAG TGATCCTCC CAC CTCAC CCTCC CAA 
GTAGCTOX3ACAACAGGCATGCACCACTACATCT

GCTGCTCTCAAACTTTGGGCCTCAAAAGATCCrrCCCACCTCAGCCTCCCAAAG 
GACAGCTTTTCCTGCTTTTGTTTGTCTGGAGAGTCATTTTTAG 
AGATTTCTATATTTGTTGAAAAAGTGCAAATCTCTGTGGCTATG^ 
5 CCCCTTCCCACACAGGGCTCCTGGGCACTGCTCGTTTATTCAACAAATC 
CTCATTCAACAAATGTGTCTCAGTACTTGCTTTTCCAGGGATGAGA^ 

CGCTTTGTAACCTTGGGCTCATTACTTAGCCA^ 

TTTTGTAGTTTTGAAAGAGCTTTCCACAAGCCTCTCCTATAATCC^ 
1 0 AGTGAATGGAGGGTC^CACGGCTCACTTGGCTTATCACAAAGAGTTAGGCT 

CATGCATCTTCTGCTACACTGGGCTACTTTATTGATCCATATTTTGC^ 

AGTGTGGGCAGGAAGGTTGCAGGGGGTGGGGGGGGTTGGATGATGACATCAC^ 

AGACAAAGTCAATGAAGAAAGTAAAGGGCCTTGTCATCTAAGGAGGAGTTTGTC^ 

TTCCAGCGTCTTTGCCTTGCCCCTTGGCTGTTCTTCATGCTCATTCCAT^ 
1 5 GGAGCCATGGGTTCTG CTTTCCTCCCAAATCCCCAAACCTATGT CAATT AAAACAAGAATCACAGCCAGGCGCGGTGGCTCACACC 

TGTAATCCCAGCACTCTGGGAGGCCGAGGTGGGTGGATCATGAGGTCAGGAGAACAAGAC 

ATCTCTACTAAAAATACAAAAAATTAGCTGGGCGTGGTGGCATGTGCCTGTA 

CACCCGAACCCAGGAGGCGGAGGTTGC^TTG AGCCGAGATTGCGCCACTGCCCTCCA CTA 
AAAAACAAACAAACAAACAAACAAACAAACAAAATCACATATCTAGTTTGTCCCT 
2 0 TTATTATTTCTCCCCCAACGCTCCCTCCTTCCCCCACCAAGCCCAGAGAATCTCAGTCCAGAG^ 
TCAGCCTCCCAAGCCTTTGCTATTTAAAGATTCCTTTTGTTATCCTATA 
CTTCCTTGCCCCGAAAAAAGGGACACAGGAGGAAGGGCCCAAAAACGTAG^ 
TTCCAAAAATATTATTTTACCATGGATGCAATGGCCTGAGGCCCTGCAAGGGTGAATGCCACA 
AAATGAACAATGAAGGACAACTGCTGGCCTCTGCAGGCTTCCATTCATTC 

2 5 CCCTCAGAAAATCTTGAAGCACCACAGGTTTTAAAAGCAGTCACTCAC^ 

GGACTATGGAAGATTTGCTTCCATATGGCACTGCCTAGGTATTAGGGCCAGGACACCT 
TTTCCTTAGCATTCTGTCAGGGCTCTGGaiAGAACAA 
TGGAATATTATGGACTGTATCATACATTTCTGTATTACATACAATCATTGT 
GGTAATAAGATGATCTGACTTTGAATCCCAGGGCTGCCACTTAGCTTTACAAGCT 

3 0 TCTGGATTATTAGAAGGATTAAATGAGGTAGTCTATGTAAAGTGCCTAATACATAGAAAGCTTT 

CAT ATG CAAACACACCAAGG ACCGTGTTCATTGCTTTTG AAGG CATTGT AGATG CAATGTT ATTTT CC CAAAAAG T CAGGGGTG G A 
GGTGGATATGGGTTTTCTGTGGATCCTGATGAGACTACTATCAAGAGCACAGCAGTGAAACAAAA 

AGGCAAGGCATGAAAAGGGATTAGAGGATGCATGTATTCAAATAAAGGGGAGTTCCCCAG^ 

3 5 TTCGACTGTCCATCAGTAGGAAAGCTGTTTGACTGGCAGATATAATTTTTTTGG 

ATGTCAGATGTTGGCAGAGCAGCTGCCAAAACAAGTCTCCCrrCCACCTrCAG^ 
ACTTGGGTCATTGGTCACATTATTATTTCTCAGTTGCTGGACAAAGCAG 
GAAATTCTGTTCTTCCATCAGGGTACAGCATGTAATCAATTATGACAGTGGGTGGTGC^ 
GACTTTCTCTTTTATTAAGTCTGCATAAACCACTACAATTT 

4 0 AAATTVACTTC^GT AAT CACAAAAAC CACAGTT CT CAGTTGGTC C CAAACACTCTCAGT CTGT ACT AT CTGATGTCAGTTTTGTCTG 

TCCATTTTTATTCrGAAACTTAAAATCCTCTTGTCrCTATTCT 
CTCTGTCACCCAGTCTGGAGTGCAGTC^GACGATCTTGGCTCACTGTAACCTCCA 
GCCTCCGAAGTAGCTGGGACTATCATGCACAGCTAATTTTGTATTTTTAGTA 
TTAAACTCCTGACCTCAAGTGATCTGCCCGCCTTGGCCTCCCAAAGTGTG 

4 5 TATTCTGGTAAGTCTTAAAGAGCTTTATCTTAGAATCATAAAGTGACCT 

ACCCCTCAACTAAGGTTTTTTTAAAAAAAAAAATTCTCAGCTGTATTCTATTACATAGTAAA 
ATGTGTGAGTAGCCACATAGAAGAGTTTATATGTTCTCCATCACCCACCCT 
TTTGTATGCATGTGTCTCAGGAAATAAATTCTTTAGTAATAGAGCAGAACAA 
CTCTTGACATAAATTACACATTTTTTCTTGTCTTTTAAAA^ 

5 0 TTTCTACTCTTTTTTCATGTAATCTCCT^^ 

G ACACGGTGGCTCACATCTGT AATCCCAGCGTTTTGGGAGGCC CAGGCAGGAGGATCACCTT AGGT CAGG AGTTTGAG ACCAGCCT 
GGCCAACATGGTGAAACCCGATCTCTACTAAAAATACAAAAATCAG 

GAGCTGAGGCAGGAGAGTT^CTTGAACCCTGGAGGCGGAGGTTGCAGTAAGCTGAGATGGCGC 
AGAGAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAAGGAAACAAACCAAAAAAAAAAAAAGA 

5 5 ATGAATTGTTTGGGCTCCTACAATATATCTAGTACTACTTGAGGTGCTGAAGACACAGAAGG^ 

GGAATATATATCCTGCTGAAGAAAGACAAGCAATGAACCTACAAAAATGCT 
TG^GATAGTGACTGGGCAGCCTACTTTGCTTGAGGGAAGACCTCTTG^ 

AGGCAGCCATGCAAATATCTTC^GAAAGAGCTCTGTGAGCAGAAGGAAC^GAGAATGCAAAGATCCGGAGG 
CGTGGTCTACGAACAGAGAGAAGCCCTGACGGCTGGAGCTGGATGAGGAGGGGCAGCGAAGAGAGAATTG 

6 0 AGGGGC CATG CCACAT AGGC CTTG C AGGAG TTGGG ACTTCATT CT AACTG CAATG AG AATT C AATAG AG AATTCT CCAGC AAGGG A 

GTGACACAATAGGGACATCTCTTCAATGKjCTGAGT^^ 
ACGAGAGGTGATGAAGGCTCAGCCTGGGACACAGTGAGACTCCAG 
AACTTTTTTTAAAGGTTTTGGATAGATAGCTTTTATGATTTATTTATTAT^ 
AGACGTATATATAGAACTCTGTATCCAATAATGAGTGAATCCACATTCTTCACAAGGACAC^ 

6 5 TGGACTAGTCCAC AAAGCAAGTCTCTGTAAATTTCAAAAATGTAT ACCCTTG AAAATCT CTG AAGTCCTGGAG ATTT AAGTCTG CA 

ACTATCTTAGTCTTACTACCAAGAATAGCCCCTAAGGAAAAAAAAAAAC^ 

ATTT CACAG ACT C CAG CT AAC CAG CTTTTGGT CACAATGT AATT AATTTCATTCTT AATGCATTGCT CCTG 

GGGAACTGGCTCCAACCCACAATCTCCTCCCCTACCCTAC^ 

GACAGTAATTAATCTTTATTTGAAAACAAGATTTGGGAGGCA^ 

7 0 GCCAACATC^TGAAACCCCATTTCTACTAAAAATATA 

GGAGCCTGGXSGC^GAAGAATCGCTTGAACCTCGGAGGTGGAGGTTTCAGT 
ACGGAGCAAGACTCTGTCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAGGCAGGAAAACGAGTA 
TGAGCCTACAGAAATCTATGATGAATTTGACAGGGAATATTAGGATAGGTAAAGAATCAAG^ 
GTGTTACCATTTAC 

HUMAN SEQUENCE - mRNA

AGAGCGAGCAGGGGAGAGCGAGACCAGTTTTAAjGGGGAGGACCGGTG 

AATCCTCGCCTCCCTTCnSCTCCACCTTCTCTCTCTGCCCTCACCT 

AGGGGAACGCTCTCCCCTCCCCTTCCAAAAAACAAAAACAGAAAAACCCTTT^ 
5 CCGGGCTGGCCATGGAGCTGCTGTGCCAC^ 

GTCCTGCAGAACCTGCTCACCATCGAGGAGCGCTACCTTCCGCAGTGCT 

CATGCGCAGAATGGTGGCCACCTGGATGCTGGAGGTCTGTGAGGAAC^ 

ACCTGGACCGTTTCTTGGCnK5GGTCCCGACTCCGAAGTCCC^ 

CTCAAAGAGACCAGCCCGCTGACCGCGGAGAAGCTGTGCATTTACACCGACA^ 
1 0 ACTGGTGGTGCTGGGGAAGTTGAAGTGGAACCTGGCAGCTGTCACT 

AGCAGCGGGAGAAGCTGTCTCTGATCCGCAAGCATGCTCAGACCT^ 

CCACCGTCGATGATCGCAACTGGAAGTGTGGG7VGCAGCC 

TGCCCTGACTGAGCTGCTGGCTAAGATCACCAACACAGACGTGGATTGTCT 

TCAATAGCCTGCAGCAGTACCGTCAGGACCAACGTGACGGATCCAAGTCGGAGGATGAACTGGAC 
1 5 GTGCGGGATATCGACCTGTGAGGATGCCAGTTGGGCCGAAAGAGAGAGACGCGTCCATAATCTGGTCT 

TTTTGTTCTTTGTGTTTTAGGGTGAAACTTAAAAAAAAAATTCTGC 

GAAAAAGGTCCTACGAAAACGGAATAATAAAAAGCATTTGGTGCCTATTTGAAGTACAGCATAAG 
CAGTTATTGTTTGATTATGTAAAAGTAATAGTAAAATGCTTACAGGAAAACCTGCAGAGTA^ 

ATGGGAACAAATT AGAGG AGAlTT r iTTITTCATGTT ATG AGCT AG CACAT ACACCC CCTTGT AG TATAATTTCAAGG AACTGTGT 
2 0 ACGCCATTTATGGCATGATTAGATTGCAAAGCAATGAACTCAAGAAGGAATTC 

AAACAATCTCTCAACATG ATTGAAC C ATTTGGGATGG AG AAGCAC CTTTG CT CTC AG C CACCTGTT ACT AAGTC AGGAGTGT AGTT 
GGATCTCTACATTAATGTCCTCTTGCTGTCTAC^GTAGCTGCTACCT 

TTGGCTCCTGGGTTTCATGTTCTGTG AGATCCTG CTi'CTTC TTC CAAATGCAGTT CATTGCAGACACCAC CAT ATTGCTATCTAAT 
GGGGAAATGTAGCTATGGGCCATAACCAAAACTCACATC 

2 5 TTCTGTTATTGT ATTTAAAAGGGTAATGTGGC CTTGGCIATTTCITCTTAGAAAAAAACr AATTTTTGGTG CTGATTGGCATGTCTG 

GTTCACAGTTT AGCATTGTTAT AAACCATTCCATTCGAAAAGCACTTT^^ CCCGAG CGAT AGATGGGATGGTTTATG 

CAAGTCATGCTGAATACTCCTCCCCTCTTCTCTTTTGCCCCCTCCCTTCCTGCCCCCA^ 
TGGCGTTCTTTGGTACACAGTTCTGGTGTTCCTACCAGGACTC^ 
CAGACAAGCCTGTAAACTAAAATCTGTTACCATCTGATGGCAC^ 

3 0 GGAAAGAAAAGTTATTGCTGGTGCAAAGATAGATGGCTGAAC^TCAGGG 

TATTGTTGTTGTTAATTTTATTGCAAAGTTGTATTCAGCXjTACT 

CAAAGAGGTTGGAGCAACAACTrTTTTT TTTT XT 1 1 T l'GCACAATTGTAATTGACAGGTAATGAAGCTATTTGTTAAAATATTTGC 

CTTTTTAAGTAAAAAAGAAAAATCAGAACAGGGCTATTTGAAGA 

GTTGAAGACGGAAAACAATCTAAGGGTCTCTCATTTTTTTAATTTTC 

3 5 AGAAGCTAAAGTCATCCATCCTTATTC!ACGTTGACAGTACCTAGCTGTAATC 

TTATAATATATTATTTTACTGCTTAAATTCCAAGTCCTGAAGTAGATGGTTC 
GTAGTTTGTTTTCTTCTGGTAGCATATTCA^ 

CATTCTTCAAAGACGGAGTATTCITACCTCAGGTTTACTGGACAAAATG^ 

CATAATACCTCACAACCGTACAGTTTCTGCTTGGG AG CCCATTCGCATGAGGAATAC AGJ^GCAGTGTGAGC!AGGGCTG ACT CCCT 

4 0 CTCAGGTGGAAGGCAGGGCGGTCTCACTCCCAGGGACCITTTTC 

CAT CATG ATGG AAAAAATACATTGAACCAAGGGATCCTCC CTCCCCTTCAAGGCAG ACGTTC AGTACAAACATTTATGCGGTAGGC 
TCAGATGTCGTAATTTGCACTTAGGTACCAGGTGTCAGGAAACAGACTAAAAAGAATTCCAC 
TGGAGCTTTTTCAAAAGCGGGGCTTCATCTGCAAAGGGCCCTT^ 
TGGACACCTTGTGTTTAGGATCATCTCTGCAGGTTTCCTAGGTCT 

4 5 GTGCTTCCrreCTCCCTCCTCTGTCTCAAACTGCGCAGGCA^ 

GGGTTTTCAATCACACTGAATTGGCAGGATAAGAAAAATAGGTCAGATAAGTATG 
CTTCTCTACAGAGGTGAAATTCCAGATGAGTGAGTCTCTTGGGAAGTGT 
CCTAAAATTCTAGGGGATTTCTGGTGKK5ACAATGGGTGGTGAATTTTGAAG 
CTAG CCAG AGTTTTCTC AAGAGCC!AG CTTTGCTCAGCACACTCT 

5 0 CCTGGAGTTCTTGGGAATCTTGGAGCCTAAAGAGAAACCGAGGTGCAAATTC^ 

AGCAGCAAATGAAAGAACCGGACAAATAAGGAAGGGCACAAGCCTAC 

TAGTCCCGAGGCCCCTGGGTCCTTCTAGCTTTTCrcrrTTCCCATCCTTGGGGC 

GAAAGTCGGGGGTTGTTAGGCTTrrCTGCCTGCTCCrGCITAAACACAAGAA 

GTCTCTTTGGTAGGAGTTTTGTTCCAG AGGAG CTCTCCCCCITGG ATTTG AACTTGCTCTTTTTGTTGTO CTTC 

5 5 TTTTTCTTACCTCCCACTAAAGGGGTTCCAAATTATCCTGGTCT 

TGTTTGTTTTTTTCTCCATCAGTGGGGGCCGAGTTGTTCCCCCAGCCTGCCAAA 
AGGGGGAAGAAATCCTAGTATGCCAAAAATATATGCTAAGCATAATTAAACT 
GAGAAAGCCAAGGGCAGTTCCCTCCGCIAGAACACCCCATGCGTGCTGAGAGGCGAGCT 
GGCCTTATTTTGAACTGGCTCAGGACCCCACTGGAGAGCACAGCATGCC^ 

6 0 TGGAGGCTCTGTTCTGCCTCTTATCAGCCAGGTCAGGGGCACACATGGCTTAA^ 

GCATCACGCTGCATCCGATTGCTAGCAGGATTGGCAACTCTTGAGACGGAGCTGCGCTTCCCTGCAGTCTA 
CTCCAGACTGTGCCCTGGGAGCTCTGGGACTGAAAGGTTAAGAACATAAGGCAGGATCAGATGACTCT 
AITTTCTCTCCATGGGCCAC^GGGGACAGGGCTGGGAGAAGAAATAGACTTGCACCTTATGTCATGTAAAT 
TCAAGAAGATAATATTGGTAGTGTGGGAATTGGAGGTAGGAAGGGGAGGAAGTCTGAGTA^ 

6 5 ATTCCTCTTTGTTTATCTCTGAGACAGTCCAACCT^ 

AGCCAACAAACCCTCTGTAGCTATAGAATGAGTGCAGGTTTCTATTGGTGTGG 
CCATCCATTTGTGCAAAATAGGGTAAGAAGATTGAAGAGGATATTTATTACTTCCTCATACCAC^ 
TCTAAACAACCCAGAATGGTCATTTCAGGCACAACGATACTACATTCGTC 
CTATTCTCGGCTCAGGTTTTGAGAAGCCATCAGCAAATGTGTACGTGCATGCTGTAGCTGC^GC 

70 TACTTTGGGGAAATAAAGTGCCTTACTGACTGTAGCCATTACAGTATCCAATGTCTTTTGACA 

GTTTCT ATTTTT ATTTTTAATTGGTTT AGTTCTTAACTC CAAATCATCGGG CCATTGGATTTT 

TTCCATTATGTTCATGACCCTTATATCATGTACCrCAGATCTCT 

CTTTTTTTTTCTTTTCTTTTTCTTTT^ 

ACTCrATAAGAGGTGTGGGTGTCnGTTTGGTCAGGATGTTAGAAAGTGCTGAT 
75 TAGG AAGT ATGG C AAAAATGTTG T ATTGG CT ATG ATCCTC ACATCAT ATAG TC AG CTGC CTTTT AAG AGGT CTT AT CTGTTC AGTG 
TTAAGTGATTTAAAAAAATAATAACCTGTTTTCTGACTAGTTTAAAGA
T ATTTGGACAATAAACTCAC CTTGACCT 

HUMAN SEQUENCE - CODING 
5 ATGGAGCTGCTGTGCCACGAGGTGGACCCGGTCCGCAG 

CCTGCTCACC^TCGAGGAGCGCTACCTTCCXjCAGTGCrCCTACTTC^ 
TGGTGGCCACCTGGATGCTGGAGGTCTGTGAGGAACAGAAGTGCGAAG^ 
TTCTTGGCIXaGGGTCCCGACTCCGAAOTCCCATCrGCAACTCCrGGGTGC^ 
CAGCCCGCTGACCXSCGGAGAAGCTGTGCATTTACACCGACAACTCCAT 
1 0 TGGGGAAGTTGAAGTGGAACCTGGCAGCTGTCACTCCTCATGACTC 
AAGCTGTCTCTGATCCGCAAGCATGCTCAGACCrTCATTGCTCT 
GATCGCAACTGGAAGTGTGGGAGCAGCCATCTGTGGGCTCCAGCAGGATGAGO 

AGCTGCTGGCTAAGATCACCAACACAGACGTGGATTGTCTCAAAG CTTGCCAGGAGCAGATTGAGGCGGTG CTCCTCAATAGCCTG 
CAGCAGTACCGTCAGGACCAACGTGACXK3ATCCAAGTCGGAGGATGAACT 
15 CGACCTGTGA 
Table 54

MOUSE NOMENCLATURE 
ICSGNM Tnfrsf6
Celera mCG21463 

HUMAN NOMENCLATURE 
HGNC TNFRSF6 
Celera hCG22983 



MOUSE SEQUENCE - GENOMIC

GTAGAGATTAGGACAGCTATTTTCTGAAGCTTAAAAGAGGAATACAAAAGTAAATTA 

TCTTACGTTCCTGAGCTAGAGAGGGAGATAGCCCAGATAAACACAGTGCACCAATGTTGGTAACTGTATGC^ 
TCGGTGAAGCACACAGTCTTGAGAAAGCCATTTGTGAAATC^ 

TGGAAGGAAGGACCATCCAGAGACTCCCCCACCCGGGGATCCATCCCATAAACAGCCACCAAACGCAGAC^ 
1 5 ACAAGATTTTGCTGACAGGACCCTGAAATAGCTCTCTCTTGTGAGGCTATGCA^ 
GATGGAACACAGGGCCCCCAATGAAGGAACTAGAGAAAGTTCCCAAGG 

TATGAACTAACCAGTACCCCTAGAGCTCATGTCTTTAGCTGCATATGTAGCAGAAGATGGCCT 
GGCCCTTGGTCTTGCGAAGATTATATGCCCCCAGTAC!AGGGGAATGCCAGGGTCAGGAAGC^ 
GCGGGGGAAGGGTATAGGGGACTTTCGGGATAGCATTTGAAATGTAAATCAAGAA 
2 0 ATCAAATCATTAGTATTTAACAGGCTTCCATTCACTGAGGGAGAAAACT 

AATAATTGAATTAGAACACAAGTCCCTTTGGTCTGTCCTCAGAGGAAGAAAACATTCCCATGAAC^ 
AAGGGAACCTTCGAGCITCTTTCTCCCTGCT 

TACAGAAAACTGTCTGTACACCTGTCTCTGCTTTGACTAAAGTGCCCTACACTCT 
CCAACTCCAACTCCATGATACCTTTTTAACTCCACAGATCT^ 

2 5 CCCCTCAGCATCTGCTGACAGGGAAAGGCTCTGCAGGCAGAGG 

AAACTCCCTCAGTCCCCCTCACCCACTCACACCTCACT^ 
GCTTTGGTAGTCTTAGTTTGTCATTTGGGGATAGTCCA.CAGGCT 

AGGGACAGTGTCTTTCTGGTAGGGAGTGATCTGGAAAACAGTTTTAGTGGTATGCAGGCAA^ 
TAGGTATTTTCTATTTTTTGGTAAGGAATGTTGACTITTCCATTTTCG^ 

3 0 GGGGAAACAAGTAAGTTGATTGTTAAAAATACATTTTACTGG 

ACATGTGTGGGCAAAACACTCGTGCACATTAAATAAGTAAATCTGAAAGGAAAG 

TGTCGTCAGGTGGCCAGTTGTCCTTTTTAGCCAGGCCTCCTGATGATTACAGAAAG 

TGGATACTTACCCATATGGATGGCITATGCTGAGTAATTTATAGTC^^ 

CGAT CATGTCTTAAGT AGTG CCTACCTGAG ACT AAACCC CATC AACAG CAGGTG AGCG CTGATG AAG AC AATG CTT CTCATT CCT C 

3 5 TGCATGCCAGTACATGCATGGAGTCCTGGAATGTCTCTGTGTTTAGTGCAGTCTCATATACAGAAATGG 

CCTGGGTGTTGAAGTCTGGGGCTGCAGAAAAGCrATTTCCCCCACCTGTCTGCTGC 
GCTGCATCTCAGTTTCTCAATGTTCTTCTCATTATCATTTCCATGT^ 
AGGAGAATTTGGAAGATTGAATAGCTTCTTTCACATCTCAGGGGCT 
CTTTGGGCCGTGCTGTCCCTTCAGGACATGCTTGTACACAAAGAAGCT 

4 0 TAGTTTGACTTGTAAAGCC^TGGTGGGACTAAC^TGTGTACTTTGGATCTGGCTTTCAGTTATGCT 

AT AGT CT CTTCATCCTGTGGG AAATATAATCTTCCAAAGGTG AAAATGCCTGTATG ATCCT 
AGTTCTTCCTTTTCCCTGTCTATGAACATAAATACATTTTTT^ 

GG CCT CGAACTCAG AAAT CCGCCTGTCT CTGCCT C CCAAGTG CTGGG AT CAAAGTTGTGCAC CACC ACTGGCTGGCT AG ACATTCC 
45 AATTCTTAACACACATACCCAAACTTTGAAGAGATGTATATTTATTATGTTCACGTACCC 
ATGTACACAAATGAAGAGCCACGCCACGTATTGTTTCTCGTTGAACTTATTCCAGTTCCT 
GACCTCACATGACACACACACAACATACACAAAATTTCTGTCTT^ 
GACCACAGACATTACAGTT , CTtyiCAGG<K3AAGGAA 

ATTACTGAAAATATTCCAGACATGAATGCATGACTGAGACTCCAAATTGCCATTTAGTAT 

5 0 GACCCTTCTACAGAATGTTATAGTGACCCTGGAGACAGTCATAGCCTCT 

AAAAAAAAAAATTTACACTTTTAAAATGGGATGAGGTGGGATTCCCTC^ 
CGTTTACAAAATCGTTGATAAAACGATTCCTCCTGATTGTACAAGAAGGGAG^ 
GATCAGTCTTGGCITTCTTTCTCTTCGTCTGAGGCTGGACTATGGTTTTCT 
TGGTCACTTTGCCTCTTGCTCTCCTCrTCCCTTT l ' G TTGAC^ 

5 5 CACCTTGGCAGGGGAAAGCTTC^TCTACAGCCACAGAG 

TCGTGGCGGCGCTGCGTGGAAGATGCGTGCAACGGACAGAGCTACACTGCCTAGCCACCA 
ATACATTVrTTATTGGTCAATAGAAAATCTCrATACGTCCTTTGGTCAG 
TTGTAGAGACTCTCTGTGTCTTAAAGGTACCATCTAGGCAATTTTCCTTATTTAACTCT 
TTACnTTTGGTAAAAAGGGACTITCITACACrrAACC^ 

6 0 TACAAAAACTGAGAATTTACTGGTTAGGCAGTGTGGGGAGGCCAAGTTTGAATCCTAAAGAT^ 



GATGCOVAGGTTAATTCCAATGGAATGTGATATGTTGTTTTTCriTGGCCAAG 
GCTGTTATGGTTAATATTATCTGTCACATTGTCTAGACTC 

6 5 TCTATTTTGTATGAAATTCAAACTAAATCCACAGGCTrTC 

ACTCATGAAAAAGCCTGACTATGATTGAGAAAGGGAGGAGTCCTACCAGCCTTTGGATTT 
ATCTGX3TCAATATGCACTATAGGTTCTAGGCACATCACACTAGGCAATCATGTG 
CACTTCTCCAGCCATCCATTTATCCTCTATTCTAGTGATCCTGTTTCT 
GGGGGCTAGAGAGATGGAGCAGCAGTTAAGAGCATTGACTC 

7 0 CTCACAACCATCTGTAATG<3GATCCGACACCGTCTTCTGGTGTC 

AATAAATAAATAAATAAATAAATAAATAAATAAATCTTTAAAAAAAAACAACACTCTTATCAGATA 

GCAGTTTTTCTAGACCCATCCTGGAAGTCTCATTACTAGGAAGGAGGAGAA 

CTACAGTTGGAGACTATTAAATCAACTTATTGCTATTTAGTTAAAGCAGTTTC 

T AAGG AA AG A AT AG AAAT ATTGTCTG T AGTGAAAAGTT AAAATG TTT C T ACCTATT AAGT r T i"i ACAG T CACTT ATG7 ATGTG AAT 
AGTATGCAATCTATCCAGAAAGCCGTGCTATCTCT^

ATG ACT ATCOIG AGC ACTCCTCACCTC CAGTGGCATG CCTGGAGCT 

AGTGGGAAAGATTGCTCGGTGTGCAAGCATGACTGCT 

TTGTAACTCCAGATCTGTCTGGGGACAGAGATAAGAGGATTTCT 
5 CAAGCTCCAGGTTCAATGACAGAACCTGTCTCAAAGGAATAAGGCAGACAGTAATAG 

CATTTGCAC^CACATACACCATTTG^TTTGGTT^ 

AAAAATAACCTATAGAAATACTTAeTGCCTTAATTTC 

AGAGTTTATTTCATCTTATCTATAACAACrCCCAGGTCACCATCCA 

AGGAAAAACCACTGCTCACCTCCTTGCTCTCCTTGGCTTC 
1 0 GCACCGCTCACCCTCCCATGCCAATCAGCAATCCATAAAATGCCCACAGGCTATCT 

TCTTGCCAGG1TACTGTGTTTGTGTCAAGCTGACAAAAACTAACCAGTGAAAAT 

GTTTACTCACTCAATAGATATTCCTCCAAAGGTTCCATCAGAAAGAACTTCAG^ 

ATACAGTATATGTTAGO^TATTACAGACTATGAGAAATCACAGATTAGGATACTCTAAGCA^ 

TGTGTATGTATAGTGTGTGGCATAGTTAAAAGCTGTGGAGGTCAGAAGACAC^ 
1 5 TCTGGATGATGAATTCAGGCTGTCTGCATTTCCTGTTAAGTGC 

ATGAATTTTATGATTCAGTGAAGTTGC1GACTCAAGTTTATACTCAAGAG 

CCCAACTCCCCCCTCTITCTCCTCCrrcCCCCTTTCTCTTCT 

CTTTCCTTTTGCATGGAACTTTTCATTT 

GGCTTCTTCATGGCAGCAACAACCTAGAGGAAGATGAAGCAAAGGGGG 
2 0 TTT AGTGACAGCTG CCAAACAG AATG AACAG AACGCCTACAGG AAGG AG AGGAAGG AG^ TGACAATCT AG ATTCAG AG CACTTTT 

AAAAACACCTGCAGAAGACAATGCTGCAGGATGTCCCCCGACTCTCT 

TCTCTCCATTCTGTGTGTGTGTGTGTATTTGTATATTCITAATTAGATAGATCCTCCAGGGAGGA^ 
GAGCGAGCGATCAGTGGATTTTGCAGACCTTCTGACCCATAGAGGTGGAGTCCAGA 
CTCACCTTGAACAAACTTACTTACCTTCT 
2 5 GATACCCTGCATAGCAGATATTCACATTACGATTCACAACGGCAGCAAAGTTACAGTTG 

TTGAGCGTTGCCATAGCATGAACTGTATTAGAGGGTCCAGCATTAGCGAGGGTGAGTACC^ 
TAAAGTGCTACCTACTTTTCCAAAACAAAAGTCACTTTGAGATGCA 
ACGAAAATTGGTCAGGAAATAATCCGTAACAAAAGGAGGAAGTAATAATGTCTAACA 
AAATAGACTTCTCTTTTTGATAGATTTATATCCTGACTATGGTCCCCT 

TTGGATGAAACAAACAGAAGAAAAAG AAGC CCAAGAG CTGG CACACAAAACAG AGACAAG C T CATGTGCACACTCAGGG AT C C CTT 
AAAAACACTAAATTTGAAGGGATAAATGCATATGCAAAGGACCTGTTGCA 
GTGAGGTCATGTGAGCTTTGATCTTGTTGACTTGGAAGGCCTTGTTT^ 
CTGCCTCCTCCTCTTGGGGTTCCCTGAGCACTGAGGGGAGGGATTTGATGGAGACATCTC^ 
35 TTCTCTCTCTGAATAATATTTG^CTGTGGATCTCCGTATTAGGAGTTATTTTATTATAA
ACATACATATTTGTTTTTTTCCCTAGGTCCCTGGGCTATCT 

GAGTGGGGCTAAAGTCAAATCCAGTATTGATGGGTTGCTCCCACAAACTTTGTGCC^ 
GACACCATCATAGATCAAAGGCTTTGGGGTAGAC^TGAGGTTTACCTTTCTC 
AGATGTTGGCATGTAGGGATGAAGGCTCTATGTAGGCACTTTCTTCTTGCTCT 
4 0 TCAGTTTGTGGGGAGCAATCTATAGTCTTGGCAACAGGCTGCATTGTTTGGGGACTCC 
ATTAGATGCAACCCCATCCAGGAAAGGAAAACTTCATTTGATC 

GCTATAGATCACCTTCATGTATGCTTATACTTTAGGAAGACTCTGCTAGGTTTCCATACCT 

TCTCTCCCTGTATTCCCATTCATCGTCTTCCTTTCCCCTCCCCCACTTGATCTTCCCATCATAACTAC 

TTAATAAGATCTTTCTGTCCCCTCCCCTAGTCCCTTACTCrATATCTAACCCCCGTGGTT 

4 5 TTAACTGCTGTGTTATTTGTCATAGTCATTTACCT 

CAAAGCCTATTATCTCTACCTCrGCTGTGGTGATTGTCACTTCATGTATCATATATGAGCrCT 
ATCCTAGGGTGTCTACCTCCTCrGTGGTAATTACCATTCAGAGGCCCCAGAAGCCCCAGCAGACCTGTCACT 
AGTGAAGGCAAAATGGTCTTTGTCTCCAGCAATCAGCCAAAGTCCAATTCA^ 
AGGCAGAAAAAAAAATCTCACTTGACCTGAGGGTGTGACTGTGGAAGC^ 

5 0 TGCTITGGGGCCCCTTCTTTCCCCCGCCCCCCACCCCCACCCCGCTCAGGCrc 

AGACCAGCAGAACTTTCTCGCAACCCTTGGGGCTTTACGAAGATTGTGTTTGGTCCCT 

TCAAAGCGATTTCTGGGAAGACCTG AATAGGAGCG AAGCGGTTTGTG CGTGCCA CCTAGGCAAAGT ACAT ACCCAGA 

GGCAGTCTAGAGCTGGTGGGTGGGGCCGCAGGCTGCCCACACAGGCCGCCCGCTGTTTTCCCTTGCT 

5 5 GCTGGCATTGGGTGCCACCGGTCCAGATGTTTAGA 

AAGTCCAGCGTTTITCTGGAAAAAGTrrcGGTCGGGGCGGGGGGCGGG^ 

CGATCX3CTTTAGTATGGCTGGAGAGGGACTCTGGCTTATAGACTTGCCCAGAGACT 

GTTCGCTCCGAGGCTCCCTGGCATTTGACACrrCTGACAGTTCGCT 

AAGCATT AG ATCATAAAATTTCTCAAAGT AACTTTGT CTG CGAAG ATTTG TTT ACT CTCCTGT AACAACACG GGAAACAAAAAAAA 

6 0 CCTTCCCTTCCGGGAATTGAGCTGTCAAATTGTGGATGGCTGAACTGGTGGATATTTTTATAT^ 

CCCTCTTTCTGCTGGCACTCTCACAGCAGATTTGTCCCAGTAACCCTAGGGCT 
ACAGAAGTAAGAACTCTCCAGCATTCGGTCTCTTCAGAAATGCCTGTTAATAAATGCCA 
CAATCCAGTTCTGAATTTAGAAAGATTGCTITATCAGGTCCCTAAAAATTTGCCGGGC 
AGTGGTAGAAGGGAGGAAGACAACCTCTCCTTACITTTACAAGGTTAAAAGAGGTTCCGAG 

6 5 ACAGCTAGCAAATGAC^GTGCCGGGCTTCAATTGAGATCTAACT 

CACACACACACACACACACACACACTACTGCCCCAT 

CAGAGGCATCTGGAAGAGCCCAGAACATAGAGAGAAAGAAGTAAACTGAACATG^ 

GAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGATGGAGCAGAGGCT^ 

GAG CGAGAGAAG CAGGCGCAGAGGTGAGAGAACAGAGTG AG AAGAGCAG AG AG AGCAGGTTTT^ T CG CTAGTGG 

7 0 ATGGGAAGCC CAGG AG CTGGAGGGAGCTTGAGT AGGGGTGG CAGTG AG CAGG ACTAGCATGCACTTTGAAATG TGCAACAGGCACT 

TGTGGTCCAAGTATGTGCTTTGATGTGCTGGTTAATAGGCACTACAGGTAGTCTCCTATGTCCCA 

GACTTTCCAATGGATCAGAGGAGGCACATTTAGTCAAATCCTTAATAC 

GTTTGCTTGGCAGGGCATGTACAAACATGTCATCTTGAAGGGTT 
AGGGAGACATCCTTTAGAAGCACAGTCATGGGGAGTGGAAGAG^
GGCCCAGGGGTGCATACAGAAATGCCTGATGTACCTGCAGGT 

AGCAAGCATCTCCTAGGGGAAGGAGAAGAGAGACAAGCAGTTAGACAGCCTTAGAG^ 
AGAGACAGACCTAAGGAGGCAGCTGGAAGCAGAGAGAGGAAATGGACAAGCTA^ 
5 TGGGAGGGAGAACAGGGTGAGAGAGGAAATCAGCAAGTCTCTTGGTGACAAGA^ 
TGTCCCCAAGTGGGCCAAGAC^CnTGTCAGAGAGAAATC 
AGTAGGGAGAGAGAAGCTGTCTGATAAACCAGCATGACTTCTGTAGCAACA 
AAGTAAAAAGCCGCTGAAGAAGCGGGGCTCTAAAATTTGGAGTTGGACCTGGTAAG 
TACCTTAGAAGAGTCAAGTCAGAAGL M riX KJVri X3CTTAGGAGAAGCTGAGCCCCT^ 
1 0 GACTTAACCACTCTACGTGTGGTGAAGGGGGAGGTCTTTATTGTGGATATGAGAGA 

AAACCAGAGAGAAAGAAATAGACTTAACATGGCCAGGAGTCTGGGCCTGGTCATAAATA^ 

TGGGG AC AG AGAAAT CAGTGAGAGAAGCAGGGGTGGGG TGGGGCT CAGTGAGAAG AGCTGGG TT AGGAGCC CTATGAGTG ACTGAT 
GGGGAGGGAAGCCCATGAGCTGAGGGCAGTTTAGGAGAGGGGAGGTGGTGAGAAGGGCTAG 
ATGTGTAACAAGTACTTGTGATACCAGCATACACTTTGACATACTGACCCTT 
1 5 GGCTTTTATCTAATTTCCAGAAATCCTATAGGCCAGGT 

GGGGGGGAATTGGAGTTTCTTTTGGGCACAGCAGTTTGCTGATTTGCA 

AGG CT CTT AT ACTTGGT AC CTT ATG AAGGGGTGG AAAAAAATTCCTTTTGGTTCTIT^ 

GAAATGTCTTTAGGAAATTTTGAGACTCTCCTCAGTGCAAA^ 

2 0 TACCACAGGATGTGTGCCTCAAGCACCCAGGAAACAGCCTITCTAAAAGAAGAC^ 
C^CACATACTAGTACATAAATGTTAAAATTAAGTATAATTTGTTATATTTA 

AGTATTATAAAGTCAG ACAAGTAAAATTGTAAT AATGATCCTC CTAT AGTACCTTGCATCC 

CTCCCATGAGACGCAGACGTGTTCTTACTCAGAACGTGGACT 

AGTCTCTCCTAAAATATGACTCC!ACATGTAGTATGGCT^ 

2 5 TGTGTAGTGTGTGTGTGTGGGTATCTGAGTGTGCAGTGTATGTGTATATGTTTCTGTG 

AGTGGTCACTAGAAGGTTTACCCTGCAGTTTTTAGGGATG^ 

GCAGATGTTAAAAATGGCTGAAGAAGATGAGAGGTGATTTAGAAAGCAAGGAGTAA 

TTTTGGTTCATTTTGAAAGGTGGGCAAAGACAGCTGTTGATTTT 

ATGTTTTTTTGATTTTTGAAAGGGCTTTATTTATCCCA 

3 0 TTAAAGCACTAACAAGTCTGGAAAATCCAAGTTTTGTGGGTTCTGA 

CAAAACCCAAGTTTTAAAATCATAGCAAGACTGAGAAGCAAGAAAT^ 
CTACTCCTGTTTACAATAGACAGAATTATTGCCTTTAGAGCCAATGGA^ 
AGATTACAGGAAGGTGAAAGGTGATAAAGAACTGACAACCAGAGCCn'GA 
TCGGGGTGAGAGCTGCCAGTGTATCAGAGGCCGGGGGCTCCAGGATG 

3 5 TGGAAAGAATCACTGTTTTTCAGAG AT AATTAG AAACTTGAAG AAGG AAAT^ CTTCT AACTTATGTG CTGT 

GGGTGAATTCTCTTGGTTCCAAAAGTCCCGTACTAAGCACATGTTTTATACGAG^ 
CAGCATGGGATTTAAAGATGCAATGAAATAATGCAATAAAAAT^ 
TTTTTGAGTAAGTAAATATTTAAAAATTCCCCAAACAGTGTC 
TCAGCCTTATGGGAACATCTAAAAATTTTACTTACAGCTTCTGAGCCTTATCCAC^ 

4 0 GATGTATGCTCTGTTCTCTCACACTCAATATATATGGTA 

TGATATTTTATAGAGAAGGTATTTATTAAATGACCTTGAATTAGCATAATTTC^ 
TTGGGTGTAAGAAGTACrTTGATGATAGTGTTTACCTAGCATGTACCAGGCCrc 

AAATCAGTCAATTC CACAAAGAAG AAAGCAGTGAAT AGAAGTG CTTG CTGGCAAGTCTTCCAACCAGAGCTAGGTCCCTGGC CCCC 
ACGTGGTGTAAAAGAGAGCTGTCTTGCAGCTTGGACCTCCACTAATGCAGATACACTCAC^ 

4 5 ACAGGCACACATGACACATGCACATTCATGTAAGTACCACTGCCTGTAGTCTTGTTT^ 

TTTGTATAACTCATAAAAGCAGAAAGAAAACCCACTTAGTTACCTACCAGTACCCC^ 
GAGGGGG CTTGGGAGCTGGCTCCATCrCAAAAACTGCCGCTATGCAAGTACGGAGTTC 
TTTTAGGAAGGAAATGACTGGCCATCTTCTCTAACCACTG 
GAAGCAGCAGCCACAATTTGTTCAAGCJUVTTGTCCAAGCCCCACT 

5 0 GGCTAGC^GAGTATGGTCACCAGGAGGTAAGAAGGCAATTCAGTTCAACACAGCTAAGTAGCCAGTC 

TCTG ACC CTG AGAT Gl ' UT ATTTT AGGCACATGGCJ^AATTGGGTG AAAACTTTC CTGGTGATAATTT ACTCAGGACTGTGTACACA 
ACAAAGCrTAAGACATGACTACACAGAAAGTTCCTTAT^ 

CATAAAGTACATAAGGCAATGCAGATGTGTGTGTGTGTGTCTGTGTGCAA 
55 AACTT ATTTTAACACACGTAGT ATGCTATGTAAAAT ACAT AGAT ACG AATTTATTTAT ACACTGAT CTTT ATTGTTAGTTGG AAC CA 

TTTAAATGGTAGATGTTTTTTTTATCTATGTTGGAA 

CCGATCTGGCCCCTGGTTCCAGGTGTGGCAGCCCAGCTCAACAAAGTCCACT 

TAGAGCTCAAGCATTTTTACCAAACTCTACCATGTATGATTCTGGTATTTGT^ 

TCTATCGGGGTTTCTGAGAGCAGTTTTAGGAAGTrTGCTTTTAAAGTGTC^ 

6 0 AGTTCACCrTATTATGTAAGTGTTCTTGCAGCCATAATTTGCGAGTGTAGTAAGATAAA 

GCATGCTGATGGCATGATCTGGAAGACAGACAACCCAATTTGAAAGGGACACAATTTACCC 
TTTGAACTTTTGGAGTTAAAAATGTTTACTTTCAAGTTC 
AATCTTCTCCAATACTTTCCCAGGAGGTAACTAAGAACATTGCATCACT^ 
CACACACACTCAAACAAAACAAAGAAACAAACAAAAACCTGAT^ 

6 5 CAAACATATTTCTTTATTCC CAGTTCTAGAGAAGTACTGAGTGTC AACTAAGTCACTTT ATTTTGCTGT CT CATTGATAACCGG AA 

GGACTTGTCAGGACATGGGTAAGAAGGCACAGCATATTTTTATAGCAAGTATTGTGTAGAAA 

GACCACTCACCATGAACTGCAGTCACAAGTTTGTAAAGTTTGTAGTGAA^ 

TCCCGTATAACTTrAAACGGACCCTGTACrTTTTGTCCAGTTTGCTTA^ 

TAGGTCCAGAGATC>GTAGAGCTC^ACrCAGAGCACCAC^GGAAATAGATAGCTCTGACC^CACAAAGT 

7 0 TAAAAAGGGGTTGTATTTATGATCCTTAATTCAGCAAGCTCa^ 

CAAAAATGTTCATGAAAGACCAAGTGGTACTCACTCAAACACAAAC^ 
TTTACTACTGTTATCTCAAAGCTCCTAATCrrTATC^ 
GCAGCCCCCGAAGTTGCCTCTTTTCATTTGTTGTAGAGTGA 
TTTTAAAAAGAATTTCTTGTCTTTATTTTGTTTTGTCIT^ 
7 5 TCTGCATGTAGGAACCAGTCTAGCATCAAACTGGCAATCCTCCTGCTTCAAACT 
TATGGCTGGGTCAGACATATTCTCCTCTCCTCCTTTCCT

CTTCCTCCTCCTCCTCCCCTTCCTTCTCCTCCTTCTTTCCAAATTCCTCGACTTCT 

GT AACTGTG CACT ATCGT CTT ACTGAAGAAGATG CCCTTGTGG u i~r lt3 ACT AGG AGT AAG A'l'ri T ACTG ATTACAATGG AGACAAG 
GCTT<^CTGTGTCACTACAGAGACACTGTGTCACCG 
5 CTGAGCCAGGCATCTAGCAACTCATTCTT 

AAAAATAAAGTCTTATCTTGTTTGTTTTGAACCTGAA^ 

TCTCTCTCTCTCTCTCrCTCTCTCTCrCAAATCTAGTTTA^ 

TTACAGGAGGTTCAGGTCTTCAAGTAGGTAGCTTATTTCAGTCACCT 

GTGCTGACATGAAGATGTAGTGAGCTTGTCCTAGAGGGTACCATGGCACAGGGGCCAGTCCT 
1 0 ATGTClTCrGTTTGATAACTTGAGTGGCTCTGAAAAGC^ 

TCTCATAATCrGTTAACAGGTACCCACGATTGTGGGATTCCCT^ 

GTTCCAGATCCACAACTCTACCAGACACACTGCAATGTAGCCATGGCTC 

AAACTGAGGTTGGCAGGCAAGCCTCCTATCCCTCCACATCCGAGTC^ 

TAGACTAGGTTCTTGAGCCCAACTTCTGGACCTGAGAGAAATCTTCT 
1 5 AGGTTCCTCATAAAACTAACATGCTGACATTTGTCCAGAATTGT 

GAAGGAAATCCTTACAAGCATGCATCAGAGCCTAGTGCAGGCTTGGAA 

AAATCTGGGCTTTTGGGAAAGTCATTTCTCAAATGTTGTGTTCAC^ 

TTTGCTGT AACTGAAG AGGTGGTGCATGTGAATG AACTG AAAGGG C CAGGCAAGTGT CACAGG CAAG ATG CAAAT AAGG ATGG AT A 
CT CT CACATCACATGGG CG C AAG AG ATT C AGAAAAGGAG AG AG AAAAAACAAAT CTT CTG ATTTGCAGG AGTGAAG ATT AG ACT AC 
2 0 AGATTAAAATCTGTAGGTAATGAAAGATTCAAAGAGATACTCrrTATAATC 

TATTTGTGAGTGTGGGTGTC CATGGAGGCCAGAGAGGGTTACAGATCTCCTGGT ACTG AAGTTAC^ CTGCCTGG 
CTTAGCTCCTAGAAACCACACTCTGGTCCTCrcTGAAAATGGTGCATGTACT^ 
CCATTTTTAAAATCTGGCTTGAAGCTGAGCAGATGTCACGATTCCT^ 
AGGAGATATAGACGTCCTCTGTGACTGTGAGGGCCAAAGCGCTCATTCC 

2 5 CATGACTCTCATGCTTCTCCCTTATATAGCATAGGCTCTGTAAC'rriT'r 

TATTAGGTATT/TATTTCATTTACATTTCCAATGTTATCCCAAAAGTCCCCCACATG 
CCCACTTCTTGGCCCTGGCATTCCCCTGTACTGAC<X1ATA 

GGTTGCAGATCCCCCCAGCTCCTTGGGCACTTTCTCAAGCTCCTCCATTG 

3 0 ATCCACJTTCTGTGTTTGCTAGGACCCGGCATAGCCTAGCACATAG 

GTCTCTCACITC^GACACTGTCCAGTTATGGGACTCTGGGTTACTGTACA 
AGATACTCCAAGCCATGGGTATAGCAACATGTCATTGGGTCATTTGATTGCTAT^ 
CTAGCTGAGACCACTTTTT CTT AGTGAAAAACCCCT CACT C^ 

G ATTT CAC CCTG ACAAACC CATT AC CHTTCTT CTTTT ATTTTT ATT^ CTAG AG ACAGG CTT C C 

3 5 TTTGTGTAGCTCAGCTGTCCTGGAACTAGCTCTATAGACCAGGTTGGC 

GCTGGGATTAAAGGCCTATGCCACCATTGCTTGGCTACATTTCTTTCT^ 
ATGTCACCCTGCACACAGAGTTTTGGTCAGACACTCTGCAAGTATCCTGAGATGAA 

GAT AATC CTCT AT ATT ATG T AT AG ATGCACCC CATTCAG TCTG CTG AATGTGTT AAT AAAC!AAACA C T AG CCTG CT CATTT CTTCA 
GTAAGAAAGATTTCCATCCTCAGACTGTCATAGGACTTGAGATTTTGATG 

4 0 GAGGGCTGACACAGCTAATTCTTAAAACACCCTCCACACACAAACACCCGTCT 

ACACACACACACACACACACACACACIACACACACACACACACACACAC^ 
CTCACTTATATATAGACAATGCTACAAGAGAAAATGTATCTTTCCACTGTT^ 
AGTAGGTACTCAACAGTACATTTGCTrGTTTGATACTTCGTTTAGTAGACrGTGCACT 
ACATTTAACTAACTCTTCTATTCCCCTCTTTTCCCTCCTATCCTTGGTCAT 

4 5 GCTTATTCTGCTTCTCAAACAAAAC!AAATCAACAACCACCCACT 

TGAAATGCTCCTGTTTCTGCCTGAATGGTTTCAGGGAAATCACTAAATTCTGCAAAGATGA 
AATAATAATAAAAATTCITTAAATACAAACrGTGTGGAGCATGGCTTACT 
TGTCGTGTTAAGTAAGGAAC^TTTTCAGAAGGACTCTGAAGTGTGTGTGAGACCA 
CAGCTGGCTGTGTCTCrrCACrCCrCITGACCTCAGAATTCAT^ 

5 0 TCAGTGCTGGGGAACTAC^ACCCCATGCACTTTTCX5CAGTATGACACCCTTGAGCT 

TTATTTTGAGATGTCCCCCCTATGTCACTAGGCrGGCATGGAACTTGCG 
GAGTAACTAGGGCTGTAACACTGCACAACCAGGCTTCCATGGGTGAGGTTTC 
TGAAGTTGAGGAGAGAGGTATATTTCTTTTAGAAAGCTTCAAAAGATA 
TTTTCTTTATTTACATTTCAAATTTTATCCTCTTTCCTCATTTCCT 

5 5 ACCTCCATTCACTAACCTACCCACTCCAGCTTCCTTGTCCTGGCATTCCCCTACACTG 

CCTCTTCTCTCATGTTCCACAAGGCCATCCTCTGCTACATATGCGGCTGAAAAGCTTGAATT 
GGCTTAGTCCCTGGGGATACTGGTTGGTTCATATTATTGTTCCTCCT 

CTACCrCCTTCATTG<3GCACCCTGTGCTCAGTCCAATGGATGG CTGTGAGCCTCCACGTCTGTATTTGTCAGGCACTGG CAAAGCC 
TCTCAGGAG ACAG CT ACATT AGG CT C CTG T AAG AAAG CACTTG TTGG CATCCACAAT AATGT CTGGG TTTGGG AACTGCTT ATGGG 

6 0 AT AGATCCC CAGGTGGGAC AGTCACTGG ATGG CCGTTCCTTCAGTTT CTACTCCAAACTTTGTCTCTTTATGTCCTCCCATAGTT A 

TATAGATCCCTCTTCTAAGAAGGACCGGAAGTATCCATACTGAGAAAGCTACTAGTA^ 

CTGCTCTGTTAACTTGAATATTTATGCACTTGTGAATCCCTCAGCTGTTGCTTAAGAT^ 

ACAGGGCACATACAATTGC<;CAAAAGCTTCTTAGCTTTGTCATTTTTTC 

TT C AATGTG TC!ATTTTCTTGT TTCTATG^AAAAC CAGAATTG ACAGG AAAGAG AAG CAATTTTTG CTG ACACCAG CCTT ACTC CTT 

6 5 ACCrCAACCAACGTGGCTGTTGCTAC^AATCGTTACTCCTCCTCCACTCAATGGGTTAGACT 

AATATTGATCGAATGCTCCTAGCCTTTGAGGTATTTCTCTGACTTGAGATTTLrr 

CCACGTGTGTATTTGTGTGTGCATGCCTGTGTGTGTATATGTATAAGTGTGTGTGTGTATGTGTGTGTGTGTGTGTGTGTGTC 

GTGTGTGTGTGTGTGTATGTGTGGTGTGAGTGTGTATCTGTCTGTCTGTCTC^ 

CCTCAATCATTCTTCACCTTATTTTCTTGAAAGAAGTTTTTCACT 

7 0 CTCCCAGGAATTTCCCTGTCTCCCTCTTTTTCCCCAGTGCT 

TTAATGCTTGCCTCCCAAACACTTCACCAACTGGGCCACCTCCCAGATCTAGTTTATTTCCATGT^ 

AAATGGCTTAGTTCATCTCTCATTGGCCAGCAGGACTTTGACACAGCC^ nil GT 

GCAAAACCTCTCTTATTTGAACAATTCATGTTGGGCAT^ 

TAGCTGATGGCTATCCAATCTGAAAGAAGACCCAACATCCACT^ 
ACAGATTTGAGT(rrcrriTAAAAGCA^^

CAATTCCCAGCACCACAAAAGGAAAACTAAAAGGACATCCAAATATAAAGTGTC 

ACACATTTATGTGCAAGTCTGCCTTTGGGAACTTACCTTTTTCCCATCAGTAAT^ 

AACAAATGTCTATTGGGTTCAGCTGACAATACT^ 
5 TCrrCTTGTCATTTACACATAATTGCTCAAGTGCAGAAAGTCAGTCT^ CTCATT CATCTCTGAGTACAGGGAA 

AAAACACTATACTGAGAGCrGGAAAACTTTAGAGACATTCTAG 

TCCTGAAATTTTAGTCTCACTITTCCCTATTTGAAACCTGGATG 

TGGCTACCACAGAGGATCTAAAATGTACTTAGAGAAATCCTAACAAAGGGACCA 

TTTATTTTTTTTGCTTATTTTCCTAAGCAAAGATA 
1 0 GAATTAGACATAGATTTAAAGATTTGTTCTCXATAAAGA 

AGTGGGAAGTTTATTGTATCTTTGAAACAjGTGAGACATTGATTCT 

AAGAATCAAGTCTGATCTGAGTGACTCCTTATAGCTTCATGATTAACCTGTAAAGAO 

GAAATCAAGGCACAGAAAAGTCAGTCACTCTTGGTAGAAACAAGGTTTCAAACTCT 

TGTTGTGGGGACTTAATGTCAGTAGTAACAGTAGAAATACCACTGCTCACAGGAAACATCCT 
1 5 AGAGCTTTGAATGCATGGTTCTATTTAATTATCACTACAGACCTGTGAGACAGTG 

GCCTGGGAGGCAGTATGGATTCTCAGACCTCCCCTCCA 

GTGGGCTAGAAAAACCTGCGGTTGAACCTGCCAAGCCACAGGGT^ 

TACACAGATCTAGTCACTTCCAGGTGTGTGACACGTATTTCCCACGTTGCTTGG 

TCCTTTACAGGTGCTTGCTGGCTCACAGTTAAGAGTTCATACTCAAGGTACT 
2 0 GGGTTCGTGAAACTGATAAAAACTGCTCAGAAGGATTATATCAAGGAGGCCCATTT^ 

CATGAAGCATTCAGAGATGAGCGTGAGAAGCAGGGTGAGAGATTGAGGTGCCCGATATGGAAGAATGT 

T ATTGGCTGTGCT ATAGGGT AG AG CATT ACAAAGAAGGGGGAG CAGG ACT AT AAAATTG TTTT CC AAG CACAATT CTTT CAT CTTT 

ATCATATTTGTCTGGGCTGCTTCTAAAAGAACATCATTT 

TTTCrCACAACCTAAGGAAAATTCATTTGTGAGGATTAAAAAGCCTAA 

2 5 GGGGGCTCCCTTAACAAGAGATTTGTAACCTrCIAGGGGTATGTG 

GGCIACTGAAGCCTGCAAAAACTACATTTCCATTTTTTCAG 
TTTCTCAAAGATCAGTGTTTTGAAATGCAAATCA^ 
ATAATTCTGTAAACTTAGTTTCCACCTTTTGGGTTGACCACAGT 
GTTGGCTCAGTGGCATATAGGTTTGTTTGATAGGTCTCAGGCAAT^ 

3 0 CAGGCCTGTGGTACTCAAAGTGCAAAGCAGGTTAAGGTCACGCCTATACAAAGACAAT^^ 

TTATAATACAACCCACGTATACCTAAGCTCCTAAACACTTACTCTTTTCCA 
TTCTTTTAAGTTGTC^TAGA(ZAAGCAATTTATGAGTTAAAATCTG 
GCTTAAAATTTAAAAGAACAAACTAAAAC!AAAGGCAATGCGT^ 
TTACTCTAATTTTGTCTTGTTTGCATTTAAAAGGGGAATCTTATGCCTGGTGTTT 

3 5 GTGTAAATGGTGCAAACATTTAAGAAGGAAAAGGGAACATGGGTGGAG 

TAATTGATCTGGAAATGCCTTAAAGATTACAAGTATTGTGTATATGGCATTTCA 

T AAATT CTGGAGT AAT AG AAG C CT GGGAT AGTTTGT ATT AAGCAG ATT CTAAGTGCCTCAG C CTTTG TTTT ATTTTT C T AACTTAG 
TC CTTAGTTG T ACGATTTCTTTT AGCAATTG CAAC CTC^ CATTCTGG CTGT GTT AACAG CTTTTCTCTG TCT CTT CACT AAT ACAT 
CCCAACTGCCATTCCACACGCACTCACAGCTCTCTCCACCTGCCT 

4 0 TTTCAAGTCTGTTTAGAGCATTGATGATAGTAAC1ATAGA 

AATGACLAAGTTTTATTTTTTTTTTTAAAGAAAACTAGGCTGAGCA 
ACTTTCTTTTCATCIAAAAAGAAGAGAATTGGGAGAAGGG 

GGTGGATGCAACTTTAATGAAGAGACTATAACTCATCTCTATCTCAAACGATTGACATA 
ACIACAATATAGACTTTTCCAGTGATTCCATCTGTGAACTGGTGCTTT^ 
45 AAAATTGGAAATCG ATTCTTCTTCTTT ACAATCAT CC CG ACCAC AGTGTCCTCTCCTTC CATGCTTCCTGT CTTCCCTCCCCAC C A 

CCAGCTG CACCT ATTC C CTTCAGAAAAGAG CAGG C CTCCAAG AG ACAACAACC^ ATT AT GACAACCCAGCAQ3 AGG AAAGC AG AG T 
AGGCAAGAGAGTCAGAGATA<^CCCGCTCCCACTGTTAGGAATC 
CrGGGGCAGACCCAGGAGGGCTCTGTGCrTGCrrCCTCTGTCTCTGTGAGCT 

TGrrCCCCTGGTATCCTACrCTGCCTCTGCCTCCTACAGTCTTCCTCCCTTCCCCCTTCCATGAGGTT 

5 0 GCAACACAATGGAGACCTCCAATTTAGACTCTTTCTCTGCATAGTATATGG 

GGGAGAAGCCTCTCTGATGACAACTGGACAAAGGACCGATCTATGAGTATAGTAGAAT^ 
TTCTTTCTTTTTNNNNNNNNNNNN^ 

AGCAATGGTGGCCTCCCCACCACCACCACCCCACACACA(^CCCGTCAGCTTTTGGAGAGCAACCCTCT 
GGTTGTTTAGGAATTCGCACZAGGACCCCCTCAGGCAACAATTCAACCAGAT^ 
55 TAGAAAAGATGGCCAGTTCTGACTCTGTATCCTCCATTACTTGGA 

ATCACACTAGGITTCCACATCGCCCCAAAATGTCCACCAATTCCAGCTCTCT 
TCCAACCTGATACCTCGCACTCCCATCCCCACCGATCCCAGTCCACCTGCAAAAT^ 
TGAATCrCCCCCAAGAGTCCTCCTCTTTAACTAATTTCrrCrGGGTCTGTGAATTGT 
ATTCAATTATAAGTGAATACATAATTGTGTTCGCACAAATACCATATTTGTCT^ 

6 0 TTTTTCTAGTTCCATCCArrTGCTCACAAATTTCAAGCCAGAAATTTTTAA 

TCTTTCTTTTTTCATTTCTCTATTGATAGACATCTAGGTTGTTTACTGGTTATTACAAATAAAGCTGCAGT 

AGTGTCCTTGTGATAGCATGGAGCTTTGTTGGGGTAGAGCAAGAGTGGTATAACTGGACCT 

TGAAGACCTGCCATATTGATTTCTGTATGGCTGTACAAGTTTGCA 

6 5 CCTGATGGCTAAGGATGGTAAACATTTACTTAAGTGTTTCTCAACCAT 

TGTCCCATTTGAAATTGGATTACTTGGGATATATATATATATATGGTTTCTTAACTTCT^ 
TCTGATGTGGAGTTG£TAAAAATATTTTCCTATTCTGTAAGTTGCTC 

TTAATACTTGAAAAGCCTTTATCACATTTTGATATCCAGTAGGATGAATCCAA 
70 TACATACrTATTTGTT'GCTTCTATATGTGTATGTATTGTGTATGTAAGACATACACA 
TAT ATACAC A(TTT ATCTGTT ACTT CT ATAT ATGT ATG T ATC 
ATGTATATCTGTGTGTGTGTATATATATATATATATATATATATATC 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

7 5 NNNNNNNNNNNNNNNNNNNNNAAATATATC 
CAATAAATATGTGTGTGTTACTATCAGAAAATGATACAT

TTGTAAATTT AGATTGTG CACTTTAG AGGAAAGAAT CTC AGGCT AATTGT ATC CAT AT AAAGCT AAG CATGCTCT ACT ACCTAG CT 
CATGAGATATTTCAGTTCCCTTAAACAGCAGTGCTGTGCCACCTC 
GCACTACTGTGGGGCTCTCTGGCTCTGAGGraCTATTTTAAT^ 
5 TCTGTTTACTAAAAGGTTACAAAAGGTCACCCATATTTTCACTTTACT 

GG AAT CCT ATGAGGT AATTCACCAAGC CGTGC C CTAGGAAACACAG CAT AGJVITCCATTTGCTG CTG TGT CCATGTGGAGCAGATT 
TGACTGCGTTTGTCATTTGTATTCCTGGCTAGAGTTAATTGT^ 
CTACATTTCCCTCTACATTTGCTGCACrGAGCCCATTA 
GCCTGAGAAGGTGAAGGCCCTGGAGGCTAATCTCTTGTCTTACT^ 
1 0 AAACAGGCATTTAGTAAACGCITGTGAAATATGTGACCriTTCCAGGAGGC^ 
CTTACTTTAATAAATCAAAGCAAAAGATGAATGTGTATTTCAAAT^ 

AGAGGTTTGTTTTTG TCT ACAAAAGAAGT CCTGTG TTCCTGGAAAAGAAACTG AAAGGGG AGAGAA 
TCTGTGGCCTGGGCZATGGCTCTGACCATAAAGACAGAGAAGTCCTCAT^ 
TGCTCTATCAAAGATTTCACAACrACTTCAATTTTG^ 
1 5 TATGAGCAGGTTTTGTGGACTTTGAAAGACAAAATGA 

CCACCTCCATGATCTGGCCTCTCCTTCCTTTCAC^GAAGTGCATCTGOT 
TAACTATCCTACT A TI T T I 'TTTTCTTTTGAGACAGGGTTTATTTGGGTAGCACTGGCT 
TGACTTGATTAAAGGCATGAGCCACCACCACCCAGCTrAACTATCrrACT^ 
TCCTGGCTTTTTTTTTTCTTTTAACTI^ 

CTTCTrGTGGAAGCTGTAGTGTCAATGTATCCCACAGTATTCCCTCTT^ 
CATATTGGTGTGGCACGAGGCCTITGGCTTCrrCrTACrCrGT 

CCCTGTATCATGGAGATACTGTAGTTTGGAGTCTGTAAGATTAGCCCCTTCATGTATTCTAAC^ 
GAGGTGGGCCAATGCAAAGGCCCTGATCTGGGTCCGAGAGGTATCTGAGCTGGTC^^ 

2 5 AGGCCTGGCTCAGCTATACTCTGCTAGCCAGGTGAGGTGCGTGGCCTGCT 

TTCTGCTCTCACAATCCTGGGCGAGGTCTC^TGTCrGCCTTAGATGGTGATGGACAA 
CCGTGCCACCACTCAGCAGACAGGAGGTAGAACTGGCTCTCCTCCACTCATGCCCTCAGGGCC^ 
CACCCAGGGCCAGCTCTCTTGTGTTGCCCAGGC^AAGGT 
ATCTTCTCTTGTGATCCTGTTGGGGTCCAGCTCrTCAGCCT 

3 0 GTTGAAGCCACCACACAGCAGACAAGAGACAGAACnXKMTCTCCCACATC 

TCAG CTCTACTGTG CTGCCCAGGAGAG ATGCAGTGAGTGCACTGCTG AGTGCCACAGGTGGTGAGGGGC^^ 
TGCTGGCTCTGGATGTGATTCTTTCCCCAAATAGCTGTCTGTCTTrTAC^ 
CrGATGTTAAGATTCTTTATTATCATCCTTACATACTTTGCCTAATGT^ 
TTCCTTCCTTCCTTCTTCCCTTCCITCCTTCCTTCTTCCCr 

3 5 CCTTCCTTCCTTCCTTCTTCCCTCCATCTCTCCTTCCTCTCTTTCTC 

CTCrr rc TTTA Cn . crri . A . rrA - i .. rfr CATGACAGG 

CC CATGCACAG AAGGGAAGGAGTACATGGACAAG AACCATTATGCTGA CACACTCTGCGATGAAG AGCATG 

GTATGTATCTACAATGCAATTTAGAGAGGCCAATCCTGAGCCTCCATATAGAACCATGCAG1TTGA 
CATGCCAGCATACAGGAAGGAGCGGAATG AGGCT AGCTGGGTAATGTGGAGCT ATC 

4 0 TCACAGTGATTCTTCCAAATGGAAGCAAAGGACTAAATAACTCGAT^ 

AATCTTTCCCCTCTATTTCTATTAAAACTTGTCTGTAGTTCTTGTTCATGGC 

CCTTCATCTTTGGGGATCCCATCTATCATCATTCTATTTGGGGACCTGACTTTTTG 

ATGGATTTTAAGAAATGACAGAAGCAGTTGTACTTAGGTTTGACATCAAGAAAACTCT 

CT CTTCTCTCTCTTTTTTTTT AAAGGCAT AGGG CTTTT ACATT AG T CTAT AAAG AAGCG ATG CAGAAC C CTGG CT AT ACATT AAAC 

4 5 CTGCATCAGGAGCTGTTGAAATACTAATGCTTAGGTCTCAGCTCAC^ 

TG AT AAAGCCT AGTTT AGG CTT ATCAT G AAG CAG CATGATGGTGAT AATGATG ATG ATG ATGG C GG CAATGGTTT CCAGG CC ATGT 

ATTTAAGTTGCAAGGCTTTCTATTAAAGGCATGGCGTTGCCAT 

CACCAAAGGCTCTTTTTCATGTTGGTATTTCCTTTTGGTT 

AACAAAAACCAAAAAACTATGCCATCCCTGGCCATTTTTC 

5 0 TATTATACATAAGTACACTGCAGCTGTCTTCAGATGTGCCAGAAGAGGGCATCAG^ 

TGGTTGCTGGGATTTGAAATCAGGACCTTCG<JAAGAGCAGTCAAGTGCTTCTATCCGCTGAGCCATCTCA 
ATTGTTAACTAGCTACTACAATTAAAACTTTTTAAACTGGCTTTCCAGGTT^ 

ACCAAGTGCAAGTGCAAACCAGACTTCTACTGCGATTCTCCTGGCTGTGAACACTGTGTTCGCTGCGCCTCG 
CAGTGCITAAGTCACTAGCTATCAGATGGAACATCCT^ 

5 5 GTTCTTGTCTGTCCTTCTTTTGCTCAGTTGG(^CCAAACGCCACTGAGCTCTTTTACCAAAGCCATCCGG 

AGCCACTTACGGTTGTGATGC^IAGACACGTGACTGTCATCAAGGTTTTGGTTCTAGGGCAAAG 
GTGGGTGAAGTAATCAATGCTTATTGCATGTCAGCTCCTTATGGTGTCTGGCA 

TTATGCAGGAAGTCCTTTTGACTGTCTGAAAGGTGAGAGTATGGAAGAGGCAGGAAAATGCACCAGC^ 

GTGTGTTCAGGCAGAGAACCX3CAGCTTTAAAAAGGCATAAG AAACTCCCAT AT CTG CTGGAGGGT AG AAAAGCCTTGGAGCGAGCT 

6 0 GGATAAGGACTTTGCCTTGATGCACAGGGGTTCGCTTTTGACTCCAATCTGGAAAGTCTG^ 

TGAGCTAGTTTGAGCAGATCACTTACTCTGTCTGGACTTCAGT^ 
CACCTGCCACATTTCTCTGGAGACCCACACTAGGTAATATATTTTGCAAACACAGCT^ 
TAGATCACTTGTCCGGCATGCTCCAGGAGTCTCGGGTTCGATTCCCAGCACTGGAAGAAAAGTA^ 
ACATTGTACAAAATGTTTATAACGGCTATTTCCTTGGCATTCCT^ 

6 5 AATACTAAATCTATTAATG CTAGCTGT ATT AG AGACATG CTGCTTT ATATTTTACT ATT AGGAAAAG CATTTGTGTTGCCTCCCTC 

TCCCGCCCCAAGTGTCTGAGAGTGATGAGAGGCAAGGTATACATTGTGGGGAAGTAGCTA^ 
AGCTCCATATACTTTATAATCACGATTTAATTTTCTTCTCACTGGCACrCTT^ 

GTGAGATTTCAGATTCAGGCTCCTG CAGGTCAACTGCTTTC CCACCCnTGCCTGTTTAGGTTTC AGG AAAAGGTAGGCTGTGAACC 
TGCAGAGGTTAGCTAGCTGAAGAAAGTCCCCCAAGTGCCTCTTCACTGTCCTGAGCACTGCCT 

7 0 AGGTGGTGGCTACCAGCAGGCAGTTTTAAGGGTTTTTTAAACIATTTTTTTG 

CTTATTTTTAGCTCTTATATCTTAAGAACTC^TGTGTCACCAATTTAGTTTTTAA^ 
TGGTGCTGAGGAGGGCCAGAAGATGCTGCTGGATGGGACTTACAGAAGGCTGTGG^ 

TTCTTCTTCATGGG CAGCCAACX3 CCCTAACCATTGAGCCAGTTCT CTAGCCCCrATGTCTTAAACTAGATTTAAGATTTTCAACCA 
CTCACTCCCACTCCITAGTTTGTGTGAATTTTAGATGGCGAT 
7 5 GATAAAAGTATCCCTAATCTGTAATGTACATTGGAAAAATGGAGAAGGAATCTACGCAGGCCT^ 
CCTAGGTGTGAACATGGAACCCTTGAGCCGTGCACAG
TTCCTCACATGTCAGAATAAAAAGATCATTGGCT^ 

TCATACAAAATATCCAATATTCGCAJGGTCCCAGAAATCGCCTATGGTTGT^ 

CCTAAGTTTGTTCAAAATXn'AGATTAAAGTGACCTGAGAAGTAGCCACTGTG 
5 TTATAATGACACCCAGTAAACCTGGCCTTCTCTGTGCCTGTCCT 

CACTCCTCCTTATTTTGTGTCTTTAATTTAAAATGTATGACA^ 

GAAAGCTGGGTATCATCGGGGTTATTCTGAAAAACCATCT 

AGAATCTTCTAGTCTCTCCTGTGTTGCTGGAACAAGAAGCTGT 

GCATTTGAGTCTGACAGAAATCTCTTCGATCTTTCAGTATTTA 
1 0 GGATGACCCTGAATCTAGiUVCCTCCAGTCGTGTGAGTATGCAAATGCATACAACA^ 

ATTAGCACTTTCTTTAAAACAAATTCAACTCCCTCCATC 

ACTAAGTTTTGTGGGGTGCTACACAAAAAAGGGTCTGCCTCCrrATAG 

TTCTTTCTCAAATAGAAGACCATCCCCGCTCTCTACTGAGCTC 

AGACTCTTACCAGAGTTTAATAAGGGAAGCAGTTGTTTTCTGACCCCTGCT 
1 5 AATCAACCAGTATTCATATGTTGCCGTCAGCATTTACAATGACTTATTTACCAAGG 

ATGATGTTAGACTGAATTATCATAGCACCCATGGACACATGTGTCATCACGGGTGGGGGACA 

TAAAGCCGTGCATTAAAGTTTATGATTGTGCGTGAATCC CAGGTAT AGTTGGGTTGTTT G CAT TGTTTTCTCAAAACAGGGGGGTG 
GGCTTCTGAAGGCAAGCTTACAAGAGATACTTACrAAAATGAAATAAATCCTTATA^ 
GAAGGAACATGTGCCTACCAGTGTGAGCTGGGTAGGGATGTGTAATGTTCCCTGTTT^ 
2 0 ACTCAGATTTCCAAACCCCCGCAAAGATTTTGCTCTTAAAATCCAGCACGTGGGAGACZAGAGAC^ 
CCAGCCTGGTCTATCTAAACCAGGAGGGTGGAACCTTTCAGAATCAAAACTGAATGAC^ 

TAGTTTCTTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGTCA TAATCTCT 

TATATTTCCTTCTGAATTTAGAAAAAGTTAAATGACATGCATGTATGTTTTTTCT 

AACCATACCAATGAATGCCTCAAGTAAGTCTTTCTATTATTTTATTTCACAGAAGTAGCT 

2 5 GAACAGGCTCTAAGATTTTAGAGTGAAATAATGTTTGAAT^ 

GCCAAAGCTTAAATTGATAGCAGTCTCAATTTAAAATATACGAAGAAGCGATAAAACA^ 
TTTTCATAATCAGTTAGCTGTATCACGGGACCCTTTGGAATCCGATCAAATGGAACCATGAACTCT 

GCTCATCAAGTCTCTAGGGGGCATTAGAGGTGATC ^ ^ 

3 0 ACATTTTTTATTTAATTAAGGAAAGCCTGTTCCAAACATCAGAAGTCTCTCTATAATCCT 

GTGTTCTCCACATTGAAAGTATATAGATAGTTCCACAATTAAGAGCATTTGCT^ 
ATCCTCATAGTGTCTCACTGCCATTTGTAATTTCAGTTCT 
TATGGCACATGCATACATTCAGACAAAACACAAGCATGCACACACATACACA 
GGTGATGAAAGGAGGGGAGGAAGGAGAGAGAGGGAGAGAACTAAAATAAAAATCCTC 

3 5 GACGTAATAGAAGCAAAAGAACAAC^TTCCAATCCATGGTACAATGTGTTAGTCTG 

CCTTATTTAAAATAGAAAATACATGTTCAGTATACAAACTTCAAAAGATGCACTAGTATAATGTGACA 

TATAATTAGTGACTGTGCACACTCTCAGTACTAAACTCT 

CCTGGACCTATGTTATGCTTGCATTTGAATTCAGATTTGGGGAAAGTT^ 

ATATAATGGAAACTATTTAAATGTGCAATATAATGGCTTTTTAAGTCATCAAAGTAATTTAAAATAGTTAAATG 

4 0 ACATGATAACTTCTGAAACTAAAAGTAATATGTACTGAACCAATTTTCAAAATCT 

AAAAATTCTTATGAAGTCCTCTATGAAAGGAAACTGTGACAAGATCCTAATTATACAGAAACCAT^ 
GCAGTATAACTGTGGTTTCATAGAATGAATTGGGTAGTGTTCCTT 
AGGTCTTCTTTGAAGTTCTGATACAATTCTATACTA 
CTTTTGGGTTTGTTTAAAGAAGATTACTTCTTGCTTTTT 

4 5 CITTGTAGGGCTGGATTTGTGGAAAGATATTGTGTGAATTTGGTTGTGTCATGGAATA 

GAGTTTTGCTAGGTATAGTAGCCTGGGCTGGCATTTGTGTTCTCTTAGGGTCTATATGACATCTC 
TACTCTCTGGTGAGAAATCTGGTGTAATTCTGATAGGTCTGCCTTTATATGTTAATGAC 
TTTCTTTGTTTTATACATTTGGTGTTTTGACTATTATGTGATGGGAGGAATATCTTTTCTGGTCCAA 
GTCTCTTGCATGTTTATGTGCATCCTTTTCTTTTTTTATTAGATATTTTCTTTATTTACATTCAAAAT 

5 0 TATACCCTCCCACCGCCCTGCTCCCCTACCCACCCACTCCCAT 

GCAAGACCAAGGGGCCTCTCTTCCCAATGATGGCCAACTAGGCCATCTTCTGCTACATATGCAGCTAG 

T ACTGGTT ATTTC AT ATTG TTGTTGCAC CT AT AGGGTTGCAG ACC C CTT CAG CT C CTTGGG T ACT CT CTCTAGCTC CTC CATTGGG 

GGCCCTGTGTTGCATCCTATAGATGACTGTGAGCATCCACTTCTGTATTTGCCAGGCACCGGCATAGCATCACT 

CTCAGGGTCCTTCAGCAAAATCTTGCTGGCAAATGCAATAGTGTCTGGGTTTGGTGGCT 

5 5 T AGTCTC CGG ATGGTC CAT C CTTTTG TCTT ATCG C CAAACTTT TT CT CTG T AACTC CTTC CATGGGTGTTTTGTTC C CTCTTCTAA 

GGAGGAGTGAAGTATCCACACTTTGGTCTTCATTCTTCTTC 
TGGGCTAATATCCACTTATCAGTGAGTGCATATCAAGTGACT^ 
CTTAAACAGATTGGTTGCCTTCACACAGGAATTGNNNNNNNN^ 
CAATTCTATTCCATTGATCTACTTGTCTTTCACTATACCAGTACC!ATGC 

6 0 TCAGG CATGGTG ATTCG ACCAGAGGTTCTTTTATTG CTGAGAAT AGTTTTGCT ATCCTAGGTTTTTT ATTATTCCAGATGAATTTX3 

CAAATTGCCCTTTCTATCCCAGTGAAGAATTGAGTTGGAGTTTTGAT 
AGCCATTTTGACTATATTAATTCTGTCAATCCATGAGCATGGGAAATCT 
GAGACTTGAAGTTCTTATCATACATATCTTTCACTC 
AAGC^TGTTGTTTCCCTAATTTCTITCTCAGCCTGTTTATCCTT^ 

6 5 TCTTTGAATA T M ri Tr T CTTTTTTATTTT Tr r r CCATTTrATTAGGTATTTAGCT 

CCCCATACCCACCCCCCCCCCACTCCCCTACCCACCGACrccCCCTTTTTGGTCCrGGTGTTCCCC^ 
TTGCAAGTCCAATGGGCCTCTCTTTCAAGTGATGGCCTATTAGGCC^ 

GTACTGGTTAGTTCATATTGTTGTTCCACCTATAGGGTTGCAGTTCCCTTTAGTTCCTTGG^ 
GGGCCC!TGTGATCCIATTCAATAGCTGACTGTGATCATCCACTTCTGTGTTTGCTAGGC 

7 0 TATCTGGGTCCTTTCAGCAAAATCTTGCTAGTGTATGCAATGGTGTCAGCXST^ 

GGCAATCACTAAATGGTCCATCCTTTCGCCACACTTCCAAATTTTGTCCCTGTAACTCCT 

AAGAAGGGGT AAAGTTT C C ACACTTTC^TCTTCGTTCT CTTGAG TTT AATGTG TTT AG CAAATTGT AT CTT AT ATCTTGGG T ATGC 
TAAGTTTCTGGGCTAATATCCIACTTATCAGTGAGTACATATTGGAGTTAA 
TTTTCTTCTATAAATTTGTTGACGATATTTACTGGCCCTTTAATCTGG^ 
7 5 TGGTCTTCTAATTGTGTC CTGG ATTTCCTGGATTTTGGGGGTT ACTAGCTTTTTGTTTTTTC A'l T T I C'iTTTACTGTTGTGTCAAT 
GTTTT CT ATGGT AT CTTCTGCAGCTGAGATT CT CTCTTCT ATCTTTTGT ATT CTGTTGG TG ATGCTTG CATCT ATG ACTCCT

TTTTCCTAGGTATTTTT ATCTCCAGGGTT ATCTCCCATTTGG^ 1V1TT CCATTGTT AG AT CCTG CATGGT 

TTT ATTC AATTC CTTCACCTGTTTGG TT CTGTTTTC CTGT AGC ACTTT AAAGG ATTTTT ATG TTTCCT CTTT AAGGGCIT 

GTTTTTAAAGGAGTTATTTATGTCCTTCTGAAAGTCCTCTATTATCCTC^ 
5 CTTCTGTGTTGGGATATCCAGGCCTTGCTGTGGTCGGAG 

TTCTTGCTGTTTGCCTCTCTCCATCTGGTTATCTCT 

CCTGTGTGTCAGTACTCCTTGGAGACCAATTCITTTTGGGAGGA^ 

ACAGACAGAAACCAAAAGGATCCCCAGCTATTCCTTTATTCCTGTGTCCTGAT^ 

TGAGAAGAAATGGTGGTCTTACCTGTGCTCACAGGTGTGTCAGCACTGAGA 
1 0 GTGGCACAGGATCAGCTCCTGGAACAGACCAAGCTTCC^ 

TGGCTCAGAGAGTAAATTATAAAATACTTTAGCAAGTTTGGGAGGACAACTCA^ 

TGAACCCGGCTTCTGTAAGGGGGTTTCTCCTAGACAGACATTTTCAGAAACT 

ACATCCCGAGAATTGCTGAAGACATGACAATCCAGGAAGCTAAAAAATTTGCTCGAG 

GAGATCATCCATGACAGCATCCAAGACACAGCTGAGCAGAAAGTCCAGCT 
1 5 TGCATATCAAGATTTAATCAAGGGTCTCAAAAAAGCCGAATGTC^ 

TTGGAAAAT CAAC CCCAGACACTGGAAATGAAAATG AAGGACAATGT CTGG AGTG AAAA C C ATGAAGAGAG 

GAGAGAGCCTGCCACCCATGATGGAAACAAAATCAATGCCAACTGTATTGACATTG^ 

AATGGT AGTTG AT ACTCAG TG AGGGT CAAATGACT AGCAGGTT C CAGGGACTGCTTCTGTTATTCTCTG CAGTTGCTG AG ATG AAC 
CATTTTCTCTGTCTACTGCAATTTTTACATTCAAATGTCCATG 
2 0 T ATATTCATATACTATGAACTGAGGAG^^TTATAAACTG AAACAAATACT CG CAGTTAATTGAAGACCTTCCATTG ATGG ACAGTT 

CTTTTCCTCTCTATGTGGAAATGTATAATAGAAGAAATAATTTTTAAATTAAAGTATCT 
CTTGCTTGTTATTCCATTTTTCTTTACTCTAG^ 

CCTCTTAGGAGGAAATTCCTTTCCAAAGCCTTITAATAATTAAATGGTT 
GGCAGATGTCTGATATTCTTGAGGAATCTGATGACCGTGGTCTCAAAAGTGCATT^ 

2 5 TGCCTATAGTTTTCTCTGAGGTATGACAGTTTTCTCTGAGGTATGTCCT 

CCAGTAGAGTTCAGATATGAAACTCCCACGTTTTAAGGTCAGATTAAGCATAAGAACATCCTG 
TTGAACCAAGAACTCTGAGTCCCTTTGAAAAGGCTTTGTATTGGA^ 

GGAATGACCACTCCCACATCAATAACCATCGCTGTGTTCTGGACTTAAAGTGACCTTTTGCTC 
TGAATCTTTTTTTTTGTGGTACACATCATGAGAAGGAATACTTATG 

3 0 AGGTCAGGGACCCCCATGGGCCTTGCATACTTAAACTTCTAAGCATTCTTCAT^ 

GCCCTATGCAGTACTACATTGTTAGTTTCCATATAGTC 

CTCTCATCTGAATGGAAGTTTTCCTGCTTGGATTTTATACCAGGGGCTG^ 

TAAGGTTCTTGAATCAATCCTTACTACCCANNNKNNNNK^ 

CCCTGTGTTGCATCCTATAGATGACTGTGAGCATCCACTTCTGTATTTGCCAGGCACCG^ 

3 5 CAGGGTCCTTCAGCAAAATCTTGCTGGCAAATGCAATAGTGTCTGGGTT^ 

GTCTCGGGATGGTCCATCCITTTGTCTTATCGCCAAACTTTTTCTCTGTAACTC 
AGGAGTGAAGTATCCACACTTTGGCCTTCATTCTTCrTGATTTTC 
GGCTAATAACCACTTATCAGTCAGTGCATATCTAATGACTTCTTTTGTGATTG 
CATCCATTTGCCCAAGAATTTCATAAATTCTTTGTAATACGTGAGTAGTACTCCATTGTGTA 

4 0 TTCCTCTGTTGAGGTACRTCTGGGTTCTTTCCAGCTTCTAGCTATTATAAATAAGG 

TATTACTAGTTGGAACATAGTCTCAGTATAAGCCCAGAAAAGGTATTTCTGGATCTTCGGTAGTACT 

CAGCATCTGCTGTCACCTGAATTTTTGATCTTAGCCATTCTGACTGGAGTGAGG 
CCTGATGATTAAGGATGTTGAACATTTTTTAGATGCTTCTCAGCCCT 

4 5 ACCCCATTTTTAATGGGGTTATTTGAATTTCTGGAGTCC^CTTCl^ 

TTTAGGATTGGTAAAAAATCCITTTCAAATCrTTTGGTGGC 

AATTTCTTGAGGTCTCATTTGTCAATTCTTGATCTTACAGCACAAGCCATTGCTATTC 
ATCTTCAAGGCTTTTCCCCACTTTNNNNNNNNNNN^ 

ATGTACAAAAGGAGGTCAGGGACCCCCATGGGCCTTGCATACTTAAACTTCTAAGCATTCTTCATTAGTAACC^ 

5 0 GTATTTGACCAGGCCCTATGCAGTACTACATTGTTAGTTTCCATATAGTGATCTCGATATGG 

TTATTTCCCGTGCTCTCATCTGAATGGAAGTTTTCCrrGCTTGGATT^ 
TTCTTAGCACCCTTAAGGTTCTTGAATCAATCCTTACTACCCAG^ 

AGATAGATGGATAGGCAGGCAGACAGATAATTAGGAACGGAAAGGGGAAGTCTACATAATGC^ 
CTTTTAGAACATCAGTCCACTGTATTCTTGTGTTCATTGTTAATGATATG 

5 5 CCTCTCTTTTCCTCTAATAGACTCTGAAGGTTCCTTTACCACACTTTCTTC 

GGATTTATTAATCTCTTTCTCCTCCCCCATTTCCCCTCCCrCTTCT 
GTTTTCTTGAGATCTTCCTCTGTGGTCTAGGCTGGCCTGAAATTTGTAGTAATCCTCCTG 
TAGTAAACTTAGTTTTAACTCATTTGCTAAGCAGTATGACTTTTATTTTATrTTT 
GAGATAGGGCTTCTCTGTATAGCCCTGGCTGCCCTGGAACTCACTTTGTAGACCAGAC^ 
60 TCTGCCTCCCAAGTGCTGGGATTAAAGGCATGCTCCACCACTGCCCAGCTATATTTACCTTTATCCCTTGAGAT^ 
ATCTCGCCrrCCCTTTCCTCTCTCTAACACCTTCATTTCC 

GTATATGTTTATGTATATGTATACGTATATGTATATATGAGTGTACACGCACACACACGCACACACACGCATGCATATACCA 
CTTG CTGTTCTGAAGACTTG<^TCTTTTCTCTTTGAATAACTCCTGTATGAGTCC CTC AATTCTTTCCTCCCAGAGTAGAGCATCA 
CCTCCTGTAGCTTTTTGAACTTCTCTCTGCTCTGATTGTTTGAAATTATAGTTT 

6 5 TTACTCTCCTCAAATTAACCATTCCCATTTTTTATCGTTTTGTCCACT 

' TACATATGTTTCTGCTTCCTATTCATGGATATTTTAAGGTTACGTGGAAAGCTG 
ATATCTGGCAAGAAGCAACCAAAGGAAGAAGGCTTGCCTTGTG 

TGGGGGACCATGGCAACTGGTCAAGCTGTGTCTGTAGCAGGAAGGAGAGACCTTGGAGGGC^ 
TCCATTTTATTCACACTCXSGATCCTGGTTCATGGGATGC^ATGGTGTGCAGACTCTCAGTG 

7 0 TATCTGGAGCATACCCAC!AGGCCTACCTAGAGGTGGGTCTCTTAGGTGATTTTAAATCCAGCCAAGGCT 

CCATCATCCTACTTTAAGGAGTCTACAGCTCTTCATGAGTCCTTCCT 

GCTGCCAATCATAAACTCACTGTCCTCACATGGCTCCCAAGAACTTATGCATGTGCACT 

TGTCTATGTGCACGTACTGAGATGCCATAGAATAAGAGGTATCCTGATTTAGCATCCTCCACCTT 

ACTG Afi C CTfW A CCT AG ACTGGCAG CCAGTC CAC CATCTCTGC CT ACTT CCCATT ACCAAG CTT AC AG AT ACAT ATG TG C CAT AAC 
7 5 TGCCTTTTACAGGAGTTCTAGGAATTTGTACTAAGGTTACCTGT 
ATTTTTGAATTTGAATTGTAGATCTGAATGACAAGGTNN^

AGG CAAT AGTGCATATGC CAG CAAG ATTTTG CTGAAAGG ACCCTG AG AT AG CT AT CT CG AGTGATG CT ATGC CGGTGC CTGG CAAA 
T ACAGAAGTGGATGCTCACAGT CAT CT ATAGCATGGAACACAGGGCTCCCAATGG AGGAGCT AGAGAAAGT AC CCAAGGAGCTGAA 
GGGGTCTGCAACCCTATAGGTGGAACAACAATATGAACTAACCAGTACCCCCAGAGCT 
5 GATGGCCTAGTTGGCCATCATTGGGAAGAGAGGTTCCrrTC 

AGAAGTGGGAGTGAGTGGGTAGGGGAGC^VGGGAGGGGGAGGGTATAGGGTA 

AT AT CT AAT AAAAATTTTT AAAAAT ATGTCTTAAT AT AT C CTGAATGTGCTATGGT AAATGATTT CTTT ACATTT ATTTT AAAAGC 
CATGC^TTACTAGTCCTTTATTCCTAAAAGACAGATAAAAATGGAAATCCCTG 

AAAAAAAAAAAAAAAAAAAGGTCT CAG CTAT AAAAT AAAAGG CCCAAGGATCTCTCJUIT AAAATAGAACAGTGGGCTCCAAACAAA 
1 0 TGGGAAGGAAAGCGGGGCCGTGGTTCCACACGGCCACAAGATGGCACTGTTT CT 

CTCCAAAGTATGTCACAGTTTAATTCrrCATATTCAACTTC 

AAATAAGGCMTGTCT^TTCTAAGTCAACCCCTTTrTATGGCCT 

CCATATCrcTAAATTAAAGTCACAGTCCATCAGTTGTTCCTATAACT 

AAACTCTAATAGAGTATGACATTGGTTCTTAAGATTAACGTTCATAGAAAC 
1 5 ATGTGGCTTTGGGAAGGGGGGGGGAAGCACTTGTCTCATACXrC^ 

CCCAGGTCCAGACATGCTTCAGAAGGTTTGAGACrrGGAAATATTC^ 

GTCTCGGGTCACAGCCTGAAAGGCTTCAGTATGCCAACACCCATGGAAAGAAATAT 

CTAAAAGTTGAACAAGCACTCCACC^TTTTTAGACTAACTAAAGGTATO 

GTGCCCATTCTCTTAAGTTAATAGAATGGACTTGAGAAGGAAGT 
2 0 ATCTGGAAGTCAGCTGATAGACAGGTATGCTCATGTTC 

CCCCTATACACTCTTATTTGTCTATAAAC^GTGACTTGGTGTCTT 

GTGAGAAAATTTAGCAACAAAGTTCCATGAACCTCTTAGAATTCTTTC 

TCTAGCTCT^GGCCAGAGGCCAGTGTC 

2 5 ACAACAAAGTCTTT ACAAATTGCAGCAAAACCGG CTT CATTTGG AAATAT ATT CAAT ACTCATCAATAAAGAGAGACAGCAGTTT A 

TGTTGAACCACTAGGACTCTTGATCrTTCrACAACTATCT 

CATGCTAATAAC!ATGCTAATAACATACAAAAAGACCACCAATCTGGAGCT 

ATGATTGTGCAGAATACTATGTTATGAGATTATAAATGCTCCGATCCGTTTATCAGGA 

AGAGAGAAAAAAAAAGCAAGGTGTGTT ACTACTAT AT AGAG TGG TCAGGAGAGGC CT CAGTG AG ATGG AG CAANNNNNNNNNNNNN 

3 0 NNNNNNNTCCCTACCCACCCACTCTCATTTCTTGACCCTGGTGTTCCCCTGTACT 

ATTGTTGTTCCACCTATAGGGTTGCAGACCCCTTCAGCTCCTTGG^ 

CCTATAGATGACTGTTAGCATCCACTTCTGTGTTTGCCAGGTACTTGCATAGCCTCATATGAGAC^^ 
GCAAAATCTTACTGGCATATGCAATAGTGTCAGGGTrra 

3 5 TCCATCTTTTTGTCTTAGCTCOVAACTTTGTCTCTGTAACT 

CCACACGTTGGTCTTCCCTCrTCTTGATTTTCriTATC 

TTATCAGTGAGTGCATATCTAATGACTTCTTTTGTGTTTGAGTTTCCTCACTAAGGATGATAT 

AAGAATTTCATAAATTCATTGTTTTTAATAGCTGAGTAGTACTCCATTGTGTAAATGTAT 

TCAGGGACATCTGGGTTCrTTCCAGATTCTGGCrATC!ATAAATAAGGCTAG 

4 0 CTCTTAGCAGTTCAGTGCTTGGATGAATACCTTGGTTCTAGGCAGAAAGAAGATAA 

TATAAAGTGCTGATCCTATGAAGTAGTTCTGTTGGGGTATGACAAACCCCTTCCAGA^ 

AAACATACTGACGGAGGTTGTGCTGTCACCTGAAACACCTTGGAGATCTGTGATGTAGATGT^ 

AGAGAAGGGTTTTCCTTGAGATATAGTCAACITTGTA 

AAGCATTTGGTGCGTGTGCTTTCCAGCCTGAATTTGCATCCCCAGCCCTAAAAGAAACCAGTCACTGTAA 

4 5 TAACCCCATTGTTGGAAAGCAGAGACAGGTAATGCCAGAAGCTCCT 

CTCAAACCAAAGCX3GTGGAAAGCGATAGAGGGGGACATCCTAAGTA 
CATCACAGACACGATAATTAAATATGTTAGAAGGGAAGATGTTAGTTAACCAAGTC 
GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAATATGATGTATATGATG^ 
ACTTTCTAGTCAGAAAGACAAGATCTTCATTCTAGACACTTCACCAT 

5 0 TAATACTGCITTAGGGGGAAGATTTTTTAGTGAGAGGAGACACGTCCTTGATACTTTGAGCT^ 

AGGGTAGAAGGATCTACCTGAGCCACAAGTCCAGAATCIATACAAGTGGACAAAATGGTCCTA 
AAATATGAATCACTTGACTGTTGGTTACTATGACATGACTTAA^ 
AATGTCTC^GCTCCAGATATATCCATTTGCCTAAGAATTTCATAAATTCATT 
ATGTACCACATTTTCTGTATCAATTTCTCTGTTGAGGGACATC^ 
55 GAACATAGTGGAGCATGTGTCCTGAGTGAGGTAACCCAATCACAAAAAAAGTCACTTGA 
GCCCAGAAACCTAGAATACCCAAGATACAATTTGCAAAACACAAGAAAATCAAGA^ 
CCTCTCTAGAATAGGGAATAAAATACCCATGGAAGGAGTTAC^^ 
NNNNNNNNNNNNCAAGCTCTACTGTACTGAAAGAGAGTGA 
ACACAACTGCCAAGGTTTGTTGGCTAAAGCCAATGGAACTTTTTCT 

6 0 CTGGTTCCCAGAAAACATGGTGGGAAAGATTGAAGGTCTAGCTCTAGGCCAGAGGCCAGTGTGTTATCCT 

ATGCTCAAATCC CAT CTT AGG TAT CAAG AGTGCACAACAACAAAGTCTTTACAAA 

ATTCAATACTCATCAATAAAGAGAGAAAGCAGTTTATGTTGAACCACTAGGACTCTTGATCTT^ 

TCTTACTTTT 

MOUSE SEQUENCE - mRNA

GCCGCAGGCTGCCCACACAC^CCGCCCGCTGTTTTCCC^ 
CTG^CTCAC7U3TTAAGAGTTCATACTCAAGGTACTAATAGCATCT 

AAAAACTGCTCAGAAGGATTATATCAAGGAGGCCCATTTTGCTGTCAACCATGCCAACCTGGTAA 
AATGAATGGGGGTACACCAACCTGTGCCCCATGCACAGAAGGGAAGGAGTACATGGACAAGAACCATTATGCTGATAA^ 

7 0 GATGCACACTCTGCGATGAAGAGCATGGTTTAGAAGTGGAAACAAACTGCACCCT 

GACTTCTACTGCGATTCTCCTC^CTGTGAACACrGTGTTCGCT^ 
CAGCAATACAAACTGCAGGAAACAAAGTCCCAGAAATCGCCTATGGTTGT^ 
TATATCGAAAGTACCGGAAAAGAAAGTGCTGG AAAAGG AGACAC^ATGACCCTGARTCTAGAAC 
ATGAATGCCTCAAATCTTAGCTTGAGTAAATACATCCCGAGAATTGCTGAAGACATGACAATCC^ 
7 5 AGAAAATAACATCAAGGAGGGCAAGATAGATGAGATCATGCATGACAGCATCCAAGACACAGCTG 
TGTGCTGGTACCAATCTCATG«3AAGAGTGATGCATAT

GATAAATTTCAGGACATGGTCCAGAAGGACCTTGGAAAATCAACCCCAGACACTGG 
AAAACTACCTCAGTTCCAGCCATGAAGAGAGGAGAGAGCCTGCC 
GGCAACTCCTGGTGTGTTCTCTTTGCCAGCAAATGGTAGTTGATACTCAGTG 
5 T CTGTTATTCTCTGCAGTTGCTGAGATGAACCATTTTCTCTGTCTA 
AATGTGARGTGGAATCTGCAGTGTTTGTGTTTATATrCATATACT 

TT AATTGAAG ACCTTC CATTGATGGACA G'l T C '1'l T l'CCTCT CTATATGGAAATGT AT AATAGAAGAAATAAl ri iTAAATTAAAGT 
ATCTCTTTTTGCATTTCA 

MOUSE SEQUENCE - CODING

ATGCIGTGGATCTGGGCTGTCCTGCCrrCTGGTGCTTGCrrGGCT 

GAGTTT AAAG CTG AGGAGGCGGGTT CATGAAACTG AT AAAAACTGCTC AG AAGGATT AT AT CAAGG AGG CCCATTTTG CTGTCAAC 

CATGCCAAC CTGGT AAAAAAAAAGTTGAGG ACTG CAAAATG AATGGGGGT ACACCAACCTGTG CCC CATGC AC AGAAGGGAAGG AG 

TACATGGACAAGAACCATTATGCTGATAAATGCAGAAGATGCACACTCTGC 
1 5 CACCCTGACCCAGAATACCAAGTGCAAGTGCAAACCAGACTTCTACTGCGATTCTCCT 

CGTGTGAACATGGAACCCTTGAGCCATGCACAGCAACCAGCAATACAAACT 

TTGACCATCCTTGTTTTGTTAATTCCACTTGTATTTATATATCGAAAGTACCGGAAA^ 

CCCTCAATCTAGAACCTCCAGTCGTGAAACCATACCAATGAATGCCT^ 

AAGACATGACAATCCAGGAAGCTAAAAAAITTGCTCGAGAAAATAA 
2 0 ATCCAAGACACAGCTGAGCAGAAAGTCCAGCTGCTCCTGTGCTGGTACCAATCTCATGG^ 

CAAGGGTCTO^AAAAAGCCX5AATGTCGCAGAACCTTAGATAAATTTCAGGACATGGT 

ACACTGGAAATGAAAATGAAGGACAATGTCTGGAGTGA 

HUMAN SEQUENCE - GENOMIC 

2 5 CTTAGACTTAAGCTCAAAGCCTCCATTCAAAGAAAGTGGCCAAAATGTCCCCCCT 

GACTCAAGCCATATAAAGTCAAGTACAGTTACAAGGCAATGATAAGAAAGAATGAGATAGCTCAATTC 
TCTGATATGTAAATTTAAAAAGAGAGAGATAATTTTCCCAGACT 
GAAAGTTCTAAATTTGGAAAGGTCACAAAGGGATAAAGTGAGTCAGAGG 
ACTATGGAGGAATAGGGTCCTTCCAGAGGGGATGTTTGAATGACAGAGTAGGAACAGGGCA 

3 0 AGCACAAGCAAAGGGCAGGCCCAGCAATATCCAAGAGAGAGCGGGGCCAGTCCA 

G T CCT AACTCGAGCTCTGCCACAT ACT AGCTGTGTGCAAATT AGTT AAA CAC CT AAC CT ATGTTTCTT AAT CTGT AAAATGAGG 
ATAATATTAGATGTTAAGATTAGGTGAGGATCAAATGAGACATTGTGTGCATAAAGCACrTTTC 
AATATTAGCCACTTATTAACCATCGGGTTAAGAGAGAAATGACTTTTTTTT^ 
TCGTGATTTATACTTTTTAACTATATTCACACCAGGAAAAGTTGATAGTAAATT^ 

3 5 GGAAAGGAAGG CAAATTG AAG AGG T ATG AT CCT AGG ACIAG AGG AAG AACCAAGG TCAAAACIAGGG ACATTT CC CAAT 

CAGCTGCTTTGTTAACAGTGCCACAAGCAGTGGGGTGGGCAGAGTGGGAGA 

GGAGAGAGAGGCTGGATGGGGGAAACGGTAGATAAAAGATTGGGCAAAGTAATTTAGTCCCAGT 

CTGTGCTCGTGGGTATGTAAAGGTTTCAGATGGGCTTTCCATGAAAA^ 

GGCATTATC^TGATACTGTGCAAAGAGCCCAGGCTTTAGAGTCAAGCAAATCTTGATACCTTTACT^ 

4 0 GTTACCTCTATGTGAATC!AGK5AAAGTTATTCTTTCCCTGTAGGTGTGGGTAT^ 

TAACACATAGTGAGTGCTTGATTAACTGGTATGTGTTAGTTCAAGAAGG 

TGCACACAATAAGAAGACCTACCTCCCAAGATTTTTATGAGCATTGAATGAGATC^ 
CTCATACTAAGTGATCTAAATGAAGAGGTATTATCATTAAGAAGCAAT^ 

4 5 TAAAAGACAATGGATTATTTGGTAAGAGATCTCACCGAGCTTAAAT 

CATAAATGGCAGAATATGTGGTACAGAACTGTAAGATACATGTCCCACGACAGAAATAAAAAAGGC^^ 

ATTCATGG ACATTT ATGGTCACATTATCCTGATTGGCTGAGGGAGTC AG CCAGTTTTCTTGCTGTTCACAG AAAT 

AAAAGCTCAGTGTCTTTGAAAATGAACTAAGTTTTTTTT^ 

TTTCTGCTAAGTAAAAATTCAGCCTAAGCAGTCTAATATGAA 

5 0 GAATTTGTTGATTTCCCTACATTTCCCATCTACTACCACTAAAGAAATGCTG 

GGGGTTAATTGGTCCTTATCTGTTTGAGGTAAACAACCGAACTGATGAGTGGTTTCCCTGAGCAAAG^ 
TCCTTTCCAGAACGTGGCATCAACATCACATGGGAGCTGGTGAGAAATTCATAACCTCA 
AATCITAATAATATTGACAGGTAGTTTATATCCATATTGAGGTrTGAAATGCACTGAG 
CTTAATTAGATTATCTTTATTTCAATCCAATTTGCATGGCTATGTGAGCTATC 

5 5 TCTTACTTAGCAG ACTT ACCCAGTG CAAGAAAG AGTTCCCTTAT CTCACTGT ATT ATG ATCTT AGTCAATATG ACCCGG AATTTAA 

GAAAATAATATGACAAGAGTATGGTTTTAATCAACCGGAAACCATGTGTTCAACAGAAC 
TTAGGGAGTTTACATCTTGATGATAAGGCCACTGAGGTAACTAAAGAGGAAATGGATTCAGCATAGGCT 
TTACCTAGTAACrGTGTGACTTTC^GCAAGTCTTTCCAAGTTTC 
AGTTGTAAGGTTTAGATAAGTTAAC^GGTTATGCAAAAAAAAAAAAAAAAAA 

6 0 TGTTCTGGACATTTTATAAGCATTAGATATTACTGAT 

TCAACATATTTGTGTGGTTTAAATAAAAATAAGTTTGAAGAAATTTCA 
TCCAAGGGGATTTCAAAGCCAGGTGGAGGAATGCCTGCTTAGTGTCTTTCCTCAG 

CTCAGTTT ACTGTG ATCAG AG ACTTAGGCTGCCTAGTTATCCTTGTTG CCT CATG CAGGTTTGTG 

TAGAGTTTGTGGGGAGGTTTCAGATGGTAAAATGCCrCTTCCTACCTCAGGTGAGGGGAGGGTTA 

6 5 GCTATTCTAGTTCTGTTTATAAGTAGTGTTCTTTC 

GGGAGAATTTCTAACAACTGCAAAGTGTTCTGATATGTAATAAATTGCCTTGA 
GCTAAGGCTGGTGTAGTCTCCTAGGGCTGTTGTAGAGAAGAGTCTAAAGTG 
CACACTACCTTTCTGCCTGTATAGGTCAGAATCTATTAAGAGTGAGTAGGAATGK3ATT 
TGATAGTAGCACACTAGTGTCATATGTCAAGTGGCIAAATGACACTAGCACCAAATGTGATTCTGCA 

7 0 TGCATCTATTTGCTATATGAAAACATAAGATATCCCAGTGTTTGCAAACTGGTTCAA 

ACACTAAGGGAACTTCAAAAG^CTGATGTGGTCCATGCTCTCTGAGCCAGGCCTCTCTTC 

ACCACTGGGCCTGTCTAAATGGCCAAGTCATATAGGAAGAGACAGAGTGAAAGGGAGAGCAAGAGCTAGAGTGAG 

GAGACATCTTCTCACCIT'CAGACCTGATCAGAGGCAATAATGGGAGACCATGGGAG 

GTT A'lTTTC TGGAAAGTTTAAA^AATATCGTATT CTTCATGGAGAAGCCAAGATGTTAGGCTGCAGGCAGTGCTTATG CTCTCT AG 
7 5 CCTAGTGAGGAGGAAAGCAGTATAGGTAAATAAAATGACATTTAGTTGGTAATATAGGCAGGGCCTATATC^ 






WO 03/008583 



PCT/US01/51291



AAGGCXnTGGTGATGGTCTTTAGGGGGCCATTGGCAAATTG CTACAACAAAATAAT AAAAT AAATAAAATAACAAAATCCAAGTTT 

ATCTCCCAGAATCTATCTGGGCATATCATTACTGTTGGAAAGACTTATGTTTACCAG 

ATATTATGCACAGCTTCTCAC^TCTAGTGAAGTAACTGAACTAGGATTTATATCT 

5 CCTTTCTCAGTTGAACTAGATGTTAATACCACTATTGTATTAGTCGACT 

CTGTAGTCCTCXCTGCCCATCCATTATCTCCCTCAAGCTGTCCCCTCA^ 
AGTCCCCCACCCCCATCCCACTCCCTGGCACTAGATGGG 

TGTTCCTTCCAGTCACCTCCCTTCCAGGCCCATGAGGAGGGCCTGAGGCCCACACTAGCT 
1 0 GGCAGGGAGGAAGTCTAAAGTAGAGAGAAAGCGCTGCCATGCCTTAAGCCAACCTT^ 
GTTTGGGAAACAATTTTTATTGGC^TGCAGGCAAATATTTTG 
GGCAAGGAATGCTGATTTTCCATTTATGGTTATAATGTAATGTTTTA 

CTTAAAAATAAATTATTAGTTAAATATATATTTAGTATATACAAGTAGACTACAGAAATGGCAAAA 
ATTGCCTTAAGCTAAGCCCACTCAGGATGAAATTTTA^ 
1 5 TGTCAGTTCTTAACACCCAGGGGGTAGGCATGCTTCT 

AAAAATCTCAGGGGAGGGGTACTCAATTGGATTAAGAAGCCAGGGCCCACAGAACAGATC 
TCTGCTTTTTAGTTGCCACAGTCTAAACATTCTTTAATGCATTC 

TTT CAT AG TGGGAAAAATT AGGAG ACTT AAT AATTTGC TG CAT ATCTTGGTGGCCAAGTTTT CCAGC C CTGGTCAGTT AGGCTTT A 
GGACACTTTTCTTAGGCTTCAGGAATGTCAGCA^ 
2 0 GTAGTGCAGGAAGGAAGTAGTGCAGGAAGGAAGTGGTACAGGAAGGAAGTG^^ 

TAGGAAGGAAGTAGTGGAGGAAGGAAGTAGTACAGGAAAAGTGGTTGTAAGCCATTTAACTTA 



GGTGCTTTCCATATGTCA1 TT CTTTAATCCTGTAGGA 

GATTACCTATCCAAATCTAAGATCrCCGTCCTCTCTCTTCCTTTCCCAAGAACCC^ 

2 5 TATTAATAGACTTTGTCACTTGGCCACTGTGGGCT 

T AG AT AGC AGTCTACAG AATTGTTTAAGCAATT ATCTTT ACTTC CT AATTTG TAACAATT AACTTTTG 

TAACAAAAGAATTATTATACTGATCAAAGTTCrGATAATTTATCAGTTAAGATT^ 

ACTAAGAATTTGTAGAGTTGGGTAACTTTGGGTGGTCCAGGGTTCA^ 

CAACCTTGATCTTTCAGCATGGTTTCCTCTCAGTC 

3 0 GCTCCCTTCACAACCAACCAC^TCAGTTTCACATCTCACATATTT 

TGTTTCCAGTAAAGATCCCAAAATCTACTCCAACTGAATGTGATGTGCTGATTGGCTTAAGCCA^ 
CTCCCTCCACCTACACATATACCCAAGCCCTGGGATTGGT 
CCATGGCACTTGGCTATTTAGTCAAACACTATCTTAGATGTTTCTGTAAAGCA^ 
CACTTTGAGTAAAACATTTGCTTTCTATAATG^ 

3 5 TGACTTCCACTGAGTGAGTGAGAATTATGCCAGCAGACG^ 

GCCMTGCTAAAGTTTTGGACTTGCCAGCCTCTACAATCATGTGAGCCAACT 
TCTATTTCTCTAGAGAACCCTGACTAAAAGAACCACTCACATCAAATAGTTAAGAAG 
ACTTGGAAGTACTATTACCAGAAGGAAGAAGAAGGAGAAAGGCAGGATAAACAGT 
AAATCAACTTATTGCAATTTAGTAAAAACAATTTGGTTGACAACATTTT^ 

4 0 TTAGTGGTAAAAAGATGTAGAAGTTTTTTTTTTTTTTTTTTTTT^ 

TGTTTAAGTGTTATTCAGAATGTTCAGAAACCAGGATATCrrTTTGAGT 

GGTTGTCCAAAACATCTCCAACATCTGTTGGAGTGCCTGAAATTCTATTATTAAAATTTTATT^ 

ACTCTCAGGAATATGCTGGTAAAATAAAAATAACCTTTAGAGATGCCCAAAC 

TCATTCAATAGATTCTTCAAAGGATTCCAAAGGCAAAGAAGTTTGGGGAACAG 

4 5 TACTAAGGGCCCTGAGAAGTTTTGGA1TAAGAAAGTTTTCAAATTAAAGTAACCCAG 

ATATGTCTCCCCACAAAGCACATATTCCTATCTCCTTGAACTTGA 

TATGGCATAGAAAGAGCAGGACCTTGGGAGCAAGAATATCTAAGTT^AATTCCT 

CAATGTTGCTTAAGCTTTTTTGGCTACATTTTTTTATT^ 

G T ATT AAG T AAGG AAG ATCCAC AT ATGTG AGTTGCTGG CTT AT AATT CACACT CAAG AG AT ACTG ATTTTG TCAATTG T C CTTTC C 

5 0 CCTTTTTTTCTCTCTTCCCTCCTTCCATTCCTTCTTCC 

ACATTTTTTTATTTAAATGAACTTTTCATTTTGGAATAGT^ 

ATGGCACTAACAGTCTACTGAAAGGTGGAACAGAGACAAGCCT 

AAACACAGGGTCATGGAAAGCCCTCAGGAGGGTAACCTAACCTAGATTTGAGGGCCCAAA 

5 5 AGAGGAAGCCTGAAGGATGAACAGTGGGCTAAGCAAAGGGTTATTAATGTGTTATTAATGGG 

AGGTTGCAGAGTGAGGTGCAGAGCTTGGTGGACGATGCCAAAGGAATACTGAAACCTTTAGTGTG 
AATTCAGGTTCAGTAATGATGTCATTATCCAAACATACCTTCTGTAAAATTCATGCTAAACTACCTAAGAGCT 
AAGCAATAGTGACTTTGAACAGTGTTCACCAGAGCACGAAAGAATTACAAGATTTTTTT^ 
AGTAACGAAGGACAGGAAGTAATTGTGAATGTTTAATATAGCTGG 

6 0 TTGAGAAATAAAAACTAAGGGGCCCTCCCTTrrCAGAGCCCTATGGCGCAACATC^ 

CAGAAACGTCTGTGAGCCTCTCATGTTGCAGCCACAACATGGACAGCCCAGTCAAATGCCCC 
AGCAATTAGCCAAGGCTCCTGTACCCAGGCAGGACCTCTGCGCTCTGAGCTCCAT^ 

T GAACT ACAG CAG AAG CCTTT AG AAAG G G CAGGAGG CCGG CTCT CGAGGT C CTCAC CTG AAG TG AG CATG C CAG C CACTG CAGG AA 
CGCCCCGGGACAGGAATGCCCATTTGTGCAACGAACCCTGACTCCTTCCT 

6 5 CCAAGTTGCTGAATCAATGGAGCCCTCCCCAACCCGGGCGTTCCCCAGCGAGGCrrCCTTCCCATCCTCCT 

TCGTGAGCTCGTCTCTGATCTCGCGCAAGAGTGACACACAGGTGTTCAAAGACGCTTCTGGGGAGTGAG 
GACTTGGCTGGAGCCTCAGGGGCGGGCACTGGCACGGAACACACCCT^ 
CCCGCGGGTTGGTGGACCCGCTCAGTACGGAGTTGGGGAAGCTCTTTCACTTC 
GACCCTCCTACCTCTGGTGAGCCCTCTCCTGCCCGGGTGGAGGCTrACCCCGTCTC 

7 0 GCGGG ACGCGTGCGGGATTGCGG CGGCAGCX^CGCACGCX5GGCACCTGGGAGCGGCG<K5CTGCTGCGGGAGG CGTTGG AG ACTGG C 

TC C CGGG GG CTGTT AGG AC CTTC C CT C AGG C CCGGG TG CTCAG AACGCTGG AGG ACTTG CriTT CTTGGG CCTTGATGCG AAGTG C 
TGATCCCGCTGGGCAGGCGGGGCAGCTCCGGCGCTCCTCGGAGACCACTGCGCTCCACGTTGAGGTGGGCG 
AATTGAAGCGGAAGTCTGGGAAGCTTTAGGGTCGCTGGAGGGGGACCCCGGTTGGAGAGAGGAGCGGAACTCCT 
ACAAGCCAAGCCAAAGGTCCGCTCCGGCGCGGGTGOGTG^^^ 
7 5 AACAGATATTGCTCATTTTCTGGCAGTTCTCAGACGTAGG 
AACGGCACCTTTTCrTTCTCGAAAAAGTTATATGGGGGCTGAATGAGCTTCT

AAAAGGAAACTGCCTTGTCTCCCTTCCGGGAATTCTCTCTTTAAGACTG 

TTCIX3CCCTTCTCrrTTCTTCTTTTGCCCI^ 

ACTCGTTCCCACCGCACAGAACCCGGCGCCTAITATTGGCCAAGAAACTTGAGC^ 
5 GCCAGCTTGCAGATGGCTAATCAAAGAGACGTGGATCCAGGAGGCTCATTTGAGTACCGGA 
TTTTATTGGGCCCCAACAAAGTGCCJVGGCAT^ 
GTGGGGAAGATAACCACATCCCCATTTTACAGATGGCGAA^ 

GTAAATGGGAGATATGGGATTGAACTCAGATCTAATTCCAAAACTCAGGCTGTTTC 

AGAGGCTGAGATGTCTGGGTCCATTTTATGATTAGAAAAAAAAATCC^ 
1 0 TAGAAACTTCCTTGGAGAAAATTAATGATTCAAGATTGAGAAAACA^ 

T AACCATTG CAC CACTG CT ATGTTGGCTG CTTTGAAAATCT CAT AGTTTTTGGACAGCAGT ATTACT^ 

AAAGCTTGGAGCTATGCTTGTTGAACTTTTGTACCAATAGCACC^ 

TTTTCTGAAAGAATACACACACACACGCATATGTAAATATTCATACAT^ 

TATAACITATATTTGTATATGATATATGGCCTAGGAAATTAAGGCTTATTAAATAAAATT^ 
1 5 AGATCAGAC^TAACTCTATCACCTAAGTAATCATTGTTTACGTTTTG 

TAAATTTAGTTACACAGAAATTGGTTTTGCGCTCACGACATGCCT 

TGTCACCTCATGTGTCTACAGAAAATGrriX X nnTTATGCCTTCACGGTTATGlTri^ 

CAAAGTGCTAATTACTTGGAATACAGAAGCAAATTTTAATTCT 

TTTCCCCTCTTCCCCTAGTCCTCTGACTCTGGTGATAACTATAGTCAACAA^ 
2 0 TCTTTCCTTCTCTCAACCTTCCTGGTGTTAGTTTTCT 

TGAAATCCCAAAAAGTTGACAAATTTCTAAAGGAGACAGAAAAAGAG 

GGTTGCATTTTTTTTTTTTATTGTGCATGAC 

AATCCCAGCTTGCTAATTGAAGGGATGTGGATCCATGAGGCTCATTTGA^ 

ATTGGGCCCCAACAGGTTTTGTCTTG CATTATTTAGGACATTGG CCAGGCAATCACTGGACAGCCAGTTTGCrCTCCrTTACAT^ 

2 5 TGTAGAATGGTTATCTTATGGTCAAAGGAAAGGGAGAGGGCCTTTG£ 

GTTTTGTGGGGAAAATAAAGAACAGTTATTTAATTTGTAATTTAATCATl" 

TTTATTTATTrrTGTAGCTTAATTTGTAATTTAATAATrrrrcr 

TTAATGGGCCATATTTACATCACAAAGGACTTGAGAAGTCT^ 

C CAGG ATGGAAAAACACLAAATTTGTAAATG ATGG CCAGGTTTG CAAGCAAAAATT AAATTG AT AAATGG CAG CACCC C CACT ACCA 

3 0 G CT ATAGTTAGACTG ATGAAAAAT CCT CTAATGTTCATGC CTG AAAT ACAGTT ACCACCCl^ AGCAGCAAATT AG AGG CAAAGAAG 

TTCTGGAATTTCGACGTTGTCTCTGAGTAAGCTCm 

AGGAG AAGTGTCAGTGTTTCAAAGGAATCTCITTGAAGGCrACT CTTT AGGGGTG CCAGATT ACCAGAGTAAACTGCCAG 

TTTGAGAGAATAACTGTTTCTTAAGAGATGGTCTGAAACCTAAGTAAGGA 

CTITTGGCIAAATTATCTTGGGCCCCAAAAGTCCCTTGATG 

3 5 AAAATAAAATAACACAATTTAAAACTTGAAATTGATAAAAAGCAACTTGAAACCAAA 

TTTTTAAATAGGTAAATATAGAAAAAATGACCACATGATGTGAAACAAAAAGCAAATATT^ 

ACCACIACTAATGTAAAGAATTTTGATTT'TACTTATAAATTCITAGCCr^ 

TATATATAGTCTGTTTTTTGTTCCTTTGCATGTT^ 

ATACA.TTATAGAGAGAGCATTTATTAAATGAGCTGGGATTACrrATAATTTTT 

4 0 AATTTACrATTGAAAAAAAGCATTTCTTAAATGTACrrGGGATAC^^ 

TAGAACTTACATAT^AAAATTTTAGGACTTGTGATGAAAAATAATAAAATAATT^ 
TTGTACCTTGTACTTCCTTATAAGGTACAACACTACAATAGCCTTAGTGTGTAGTTT^ 
CATAAAAGTGGAAAAATAACATTTAATAAATCCTCTTTATrrTCTGGAAAATCCT^ 
AGTTCTACTTCTAACACCAATTAAATATCTrCAATTCAGTCTCAAGGAGGA^ 

4 5 TTTGAAGTGACAGTGTGAGTAGTAAAGCCCAAATCAGAGAGAAATTGIACTTr 

AATGAGGTGGTTCTTCCCITTGATCACAACTGCTTTCTTGA 
TTTAGAAATGGCAATCATAGTTATATAGGACAACTAGCCTGAGTCCC^ 
CCTITAGGCTGGGCAGGTGACTCACACCTCTCACCCAGCAC^ 
AGTTTGAGACGAGCCTGGCCAA(IATGGCAAAAACCCAACTCT 

5 0 TCTCAGCTACTCGGGAGGTCGCGGCAGGAGAATCACTGGAACCTGGX3AGGTGGAAGTTGCAG 

TCC^GGCTGGGCGACAGAGTGAGACTCCGTCTACAAAGAAAACAAAGAAACAAAAAATCATACCTTTAGA^ 
GAATTAGCAAATCTGACTACAAATTCAATCAGGAGATGTACTC^ 
ATGGACTGGGATAAACTGGCAAAATTAGAGTAAATTTGACAATTTGACATTGGATTC^^ 
CAGGCAGCATTGTGTTTCTATTCACCTCTGTGTTTTGTGAGGCTATAAAAATATAAGATATGAGAA^ 

5 5 TGAGGGCAGATATCITGAAAAAGAGAGCGAGAGGAAGTCAGAGTATGCXXjGTGAAGACT 

TGGTTTTTAGCCATTCTCACAGAGATCATTGGTCCTAATTAACTGGTTGTTGTTGTTTTTT^ 
AGGCTTGATTTTAAAAATTGGGCCTATACTCACAGAAAGACAAATACTGCATGATCTCGTTTATATGTGGAATCT 
ATCCATAAAAGCAGAGAGTAGAATAGTGGTTACCAGAGGCTAGTTGGGGG 
ATACAAAATTTCAGTTAGGAGGAATAAGTTCAAGAGATCTAGTGTACAACI^ 

6 0 TGAAAATTACTAAGAGTAGATTTTAAGTGTTCTCGCCACACACAC^ 

ATT AG CT CG ATTT AG CT AATT ATCTTC CAAAAT ATGT AC AT AATTAT C!TTCCAAAAT^ 
TACCATAAATATATATAATTTTTTTATTTGTCGAATAAAAATAATACAAAAAAACCCTGGGC 
ACAGGCAGAGCTCGTCCCAAGCCCCTCCCTCTCrCTGTAGTCCCTTATCCCAGGT^ 
TCTGGGCTTGTAAGGGCTCn'GGGGCTGTATTTTAGGACTCTATAGCCCTTACATTACCCATT^ 

6 5 T AT CTTCAAG CTT ATTTTTGG AAG GG T CATT ATTT CTT CATGAGAAG T CAAATTCTG ATTTG CAAGTGCTG AACACAG ATG C CATT 

AGTCTATAAAACTCTTATATTCTCACATGTAAATGTAGTAACCTGCTAACCTCAGATTCTGC 
ACTGTGATGAACCTCAAAATCACCAAGGGTACTTCCCCATTCTCAGAGATCCTGATTCCT^ 

TTTT ATT AAAT AATG CAT CACT ACAGG ATTCTG ATG TCTCT CGGGG TCTTG AG AACT ATTTTTCTGACT CT AG CACT AAACTCAAA 
ATGTAATTrCAGATGGAATTTCCTAGGCTCATTCATTTTATTTTAT^ 

7 0 ACTGTTGCTTCAACTACCTCTGTCCTGGACTTTCCAGAGATTGGCTTTAAGAAAGAAAGAGGACATCCCT 

GTGTATTTTTATGTACACATTTTGAAAAACATACAAATGAATGGGATCTTTGCTTATCT 
TCTATTTCAGGTTTTCAATAATGTAGTAGGACAAGATGACATCA 
TCTTATTGGAAATTTTAAACTATTGAATAATAAACTATTAGTCTT 
GTCTACAAGCAGACAAGCAGAATTAGTCGGTGGAACTTCATTTC^ 
7 5 CAACCCAGCCTAGCAGGACCAATAAGTCTGGAAAGAAGTAGCTCTAGTTC^ 
GGGAAATCTCrrcCCTATTTAGCTCAGTGATC^^
ACCACTAATGTCTCACAAATCAAGAATCCAAGAGAATC^ 

GGGTGG CTACAGTTTCGGGGGCT AGAAGTCTGATATCAAGGTG CCAACTGGACTGGTTCCTTCTG AGGACCACGAGGGCAGGATCT 

5 TTTTTTAATTTAG AAATTT ATACTGTCT ACAAGCTTTATAT AG CTIT AGTTAAAAAAAAGCAAAAAAAAAAAAAAATCAAAACAAA 

AACAGTGAAGCCAAGACACTCTTCXAAAGTCTGG^ 

TC CTCHTCTT AAAATGT CACCAGTCACATTGG ATT AAGGCTT ACC CACCTCTGT AAAGACC C 

TATCTCCTX^TACAGTCACATTCGGATGCTCAGGAAGTTAGGGTATCAGTATCACT^ 

GTTTAGCCCATGACAGTCCTGAATCTCCrrTTGCTAGATGCATGACTGTG 
1 0 ATCAGAAAAGGGGTTTGGGCTAGGTCATCTATAAAGTCCTTTCTGCTCT 

ATTGTTTAAAATTTTAGAAGCTAACTGGATGCTTTG 

CTCACGCCTGTAATCCCAACACTTTGGGAGGCCGAGGCAGGCGGATCACG 

AAACCCCGTCTCTACTAAAAACACAAAAATTAGCTGGGCCTGGTGGTGCAC^ 

GAGAGTCACTTCAACCCG«3AGGCGGAGCTTGCAGTGAGCCGAGATCGCGCCA 
1 5 CGTCTCAAAAAAAAAAAAAAATAAATAAAAAAAAATAAAATAAAATAAAATAAAAATCTCA 

GT ACC CCTCTGCCTTCTGTT AC"ri'L "l"l"l 'G CTGCATT CTGAG AGG AATGGGGTGGGG AGCACT CTTT CCTGTAGAAC CCCAT AAC CC 

ATCTATTITCATCAATTTATTTGAAATTCAAAGCCAGGGGTTAAAATGT^ 

CTATTAACATGCATTTCTAAGTGACTTTTAAGCrAAGCATTTTAT^ 

AATAAAGTX3CACTTTCAAAAATACTCGGTTCAAATGTAAAATAATATT 
2 0 GTCTTTCTTCCCAGAGCAGTCTCTATGGGTGCCTAAACAATACTGCAT^ 

TTAACCCATGTCATCTCTAGAGGCATAAAAATAGACAGTTGTrCTGAGAAACATATCTC 

TGAGATCCTAAATTATGAGGTACTTTTTGTTTATCTTATTGTCT 

TCTATCAGGGTGTGGTTAATAAAGTACGTTTCCATGGTAAGTATTCTTTAAGAAAAAATAAA 

TTACTGCAAACTGTACTTACCAGTTTCTGGTG ATTTGT ATAACTG AG CTTTCAATTT CAAAAATT AAAAAGTTCTCTTT AGTGTAA 

2 5 CATATTTTTGTGTCTAGTAATTTTACTTCAAAAATAAAAGAGAATTTTCTT^ 

ATATTTAGTCCTAGAGGACCAAAGAGAAATAAAATGGCTTTGATTATGCT 
TATTTGTGATGTTTGATTTAGTAAAATTATAAGACTACCAAAGAAAACTAGAAGAGCA 
AATGTTGATAATTGAAAGGTTTAGAATTTTTTCCACCTT^ 
TTTTCCATTTTACTAAAAAATAATAACACAGTTCXaTTATGATAAATC^ 

3 0 TAAACTTTTTTATTATACAACAAACACATATGCATTATCAAGA 

AGAAATAAGCACCATTAAAATTTCTGTGTGTTAAAAGATAATATACTC 
TCACTTTTCATTTTCTTAATCCCTAACCTTCTACTGTTGAAT^ 

GTTGGCTCATGCCTGTAATCCCAGCACTTTGGGAAGCTGAGGCGGGAGCATC^ 

3 5 ACATGGTGAAATGCCGTCTCrACTAAAAATACAAAAAATTAGCCATCATAGTGGTGCATTCCTC 

TGAGGCAGGAGAATTGCTTGAATGCCGGAGGCAGAGGTTC 
ACAGAGCAAGACTCCGTCTAAAACAAAACAAAACAAAACAACAACAACAAA^ 

AAGCTTGCTTTTGTCTCTCTAAATCACAACAAACCGTTTGCAGTGTGATGCTGTGCTCAAAATTGAGAATTCT 
AGGCA.TCTCTCTACTCATTCTTGCCATGCTGAGGGCTGTCATTTCTCA 

4 0 AAAGTCTTTTCCTGTTTGCTTTGAACITCAATAAAAACAG 

CTTCCTTCTTTTGGTTCCTTTTCAGATCCTAGCTCATCAA 

TATTTCTTCCAGTAGTTAGTTTAATTCGGTCATTGAGCTGTTGCTCTATTTGTAACCGTCATGCTAGAT 
CTGAGAAAGTGTGGAGATCCTTTCAGACTGCCATGGTGCTAGGGCAGA 

T CATTT CTTCT ACTCTGT AACCTGGTTGG CT ATGAAAAG TG ATT CTTCCT C ACTG ATG CTG CAGGCAATGGGTG AAT ATCCAATTG 

4 5 TTCTCTCAGGTTCTTGTCATCGGATACAAAGAACAAATTTCATGTGTCCCTGTTTCTG^ 

TTCTAGCITAGGATGACTTCCAGCTTCTAACTATTTGTCTGTGCATTCCATTTCCTCACCCCTGCGAGAGGTTGC^ 
CGGCTGTTCTGAGAAATTCAGTGCCCAAACTGGAGGGAGGGGAACACAAACTCAGCTTCT^ 
AGTTTTGCCTTCCCCGCCAGCAAGAGCCCTGGCTTTAGGGGTCTTGCTACAGTTTTCCC^ 
GGTTCTTAGCGAATTCTTGCCAAACACCTCCCCITTCCCCAGTATCCATTTG 

5 0 TTGTCTTAAAACAAATGGAGTTGTTTGCATACCTAGATGGAGTAAAATAT 

TTTGGGCCAGAAGTCCTGGTCCTGAGTCTCTTGTTAGCCACTCTTAACTGT^ 

AAT CTGTCACCTGGGT ACTTGATGATTT AAACC AGG AATTGTGG C C CTT CT C AT AAT ACTGTCTCCT AAATATTGGGTGAATTT AG 

TATATCAAGGAGGGCAGGGCACTATACCCGTAGGGGACTCTCTCTCTCCCTCT 

TGGCAGACCTGGGGTTCAAACCACAGTTGTTTTGAAATTT 

5 5 AAAAGTAGCACACTTATTGGGCATGTATTTGTTATGTTCCAGGTTCTGTGCTAACAGCTTTAAG 

CAACAAGCCTGTGAGGAAGAGGTTATTTCCACTTATAGATAAAGAAACTGAGACTTGGAGAATGTAAG 

CAGGGCAG TAAGTGGG AGGCC CTGTT CAC CTT AG AACAC CTGGCTT CT ATCATT CATGGTGCT GTTTCT AGTG TGGTTTG AAG AAC 
CTGAGATCCAAACTGCTATACAAGTGACCTGCTGCITrCTTG 

CCTGTGCACA.GCAGATACTGCCAATTTTGGGTGGGTTACACITGTTTACCACGTTGCTTACTTCAG 

6 0 ATGCTTTTATTTTACAGGTTCTTACGTCTGTTGCTAGATTATCGTCCAAA^ 

GGATTGGAATTGAGGAAGACTGTTACTACAGTTGAGACTCAGAACTTGGAAGGCCTGCA 
CTGTCCTCC^GGTATGTTACACAAAACATCCAGAGATTACAGTGAAA^^ 
TAATTTTACAGTTTTTC^TTCCCCTATATTATATAACATAACTGAGAGAAAAACAACTATC 
ATTT AT ATTTATCACG CTT ATTTGATGTGGTTATGGAT AAATTT AATTTACAAGTGAGATC 

6 5 TATTTGGCTCCATTTTTTTCTAAGCAAAAATGACTCATTTGTGAATATC 

AACAATAGTGGAACTCTGAAAGCAAGAGACTTCTCCTCTTAAGGGATATGTCT 
AAGCTGTCCTGGCACn'GAAGCCTGGAAAATATATTTGCATTTTGCAGCT 
CCAACAGCTTTTTTTTTTAAJVTCAATACTTTGAAGGGCAAACTAATTTCA 
TAAGCAAAATGCAATTGTGCAAACATTTCTTCAACTTTGAGTAG 

7 0 TAATAGAACAAAAAAGGACTGTAGTATCTTTCATGGTTGTCCITCCCAGGCTTC 

TTTGTCAAAAGATTCTACCAGTCTCCCTGTGCTTCATGTCCAGATTTTAATAAC 
C^TTTATACTTAAACCCATTTACACTTAAATCTTCTCCAACACGAATAAAGAATGGCGTC 
CAATCAATTTGCTCTATGAATTGAAATCTTTGTTCATAATTTTAAAACCTAAAATGT 
(^TCAAACTGTCAAGGATTTTTGTCTATTGCTTTTTCGGTTTTGCTTTGTTTTA^ 
7 5 AAACTTTT AAG AAAT AC ATTTGG ATA.T ATGAGT C^AAAATTTG AAAAAAAA 1 1 1 1 1 rri AACITTTTTTTCTTATTATTATACTTT 
AAGTTTTAGGGTACACATGCACAATCTC

CATCTAGCATTAGGTATATCTCCCAATGCTATCCCTCCCCCCTCCCCCCAACCCACTGTTTATACTTC 

TATACGAAAATACCCTGAGAAAATAATCAAAAATAAAGGAAAATATTGTACACATGATATT^ 

CTTAGGTAAATTCTGGAATATTTAAAGACTGAAATAGTATGTATCAAAGTCATGTTCT 
5 TATATAAACCTTCTTAAGTAATTTCCAACTGATCTGGCTGGATTAACTGATTCTC^ 

ATCCAACCATGTT AAAT ACTCTTCCCACTT CAATTC C CTCITGTTG AATTTT AGGCTT AT CAAT ATCT AAGTTGCTTCCAAAT CTG 

TTTATGCTCATTAGTGTTGTGATATTCATATGTAATTTGATTGAATATTC 

GATTTTATTTCTAAGAAAACTAGGTTGAGTGCTITGGAGAAT^ 

GGGAAAAATAAAAGTTAAGTAGTAGGAAAAATCACAAAAAT 
10 TTCATCAAAAACGAAGGGAGGGAGGGAGGGAGGGAGGGAGGAG 

AGGAAATTATAGGTGATGCATTATGGATGTGACTTAAGAAAGAACCTGGGATT 

ATTATATAGTTATATTTTATAAATTAAAACAAAATATTTACAGTATTTTTAAGTG 

ATGATCAAAGAAGTCCTCCCCACATCATCATATTCAATAAGAAGGATTCTATGGTATA 

GTACATATATATATATACACACCTGTTATATATATGCT^ 
1 5 ATAACGGCAATATCTCCAGATTGTTGATATATAGTTTTTTGTTTGTTTC 

TCCAGGCTGAAGTGCAGTGGCATGATCTCAGCTCACTGCAACCTCTGCCTTC^ 

AGTAGCTGGGACTACAGGAGCAC^CCACAACACCGGGATA 

TCTGGTCTCAAACTCCTGACCTCAGGTGATCTGCCTGCCTTGGCCTCCCAATCT 

GCCTGATACATAAGTTCTTATTATACTTTCCTGCCTTTTCCAACATT^ 
2 0 TTTTAAAATCTGATTTTCAATAAAACAACTACTTATTTTCT^ 

CTGTGGTTTACTGTAT CTCCATT AT ACCAAGG AAGTTGTG CTAGGCACATTCTGTAAGATAATTCATTAAAGAG AC CAATTC^ 

AG AG ACC AG T AT AAT ACCACT AG CAAAAAAT AAAATTAT ACTTGG ACTTGGGTT CTG'rrri' L*TTT AAAT ATTGCT ACAATG TT AT A 

TAGCCCTATGCACTATTTTTGAATTTTGTTCTGGGAATCT^ 

AATAGCATATATATTTACATGTCATAATTTGCTCATGTAGTAAATATTTTGTTTTATC 

2 5 TTCATGrrCCTAGGAAACACAGCAGAAATTAATTTTGCTGTATAGGTGAGCAGAGCAGATTA 

TTATACCATTAACAATTAGAGTCACTAGTAATGTTTTrCTCACACTTCCCT^ 
TCCCTTACCTTCATCTAGCTTTCCCCATAGCATTTTTTTAGCACT^ 

CAAGAAGGT<1AAGTCTTTGGTGATG AAGGGTTT ATCTT AT^ CCTG CAT AAT AT CAAACTT CT AAAAAG TTC CT AGGACT CAAT AG 
GCATTTAATAAACACCTATCAGATAAATGAATTTTTTAAGAGAACACATTACA 

3 0 TT ATTTTT AAAGTAAATT AATG CAAATATGTAT ATT ATTT AAAATTG TTTG TTTTT AAGAT AACAAG CCT^ 

TGGCIATAAATATGTTAATGCTTATCAAAACTCATCAGTACATATCCTGTCTTTTG 
AGAACTCTCATGCTCACTAGGCAGGCCGAGTGCAGCTCTGACAGCAATGCAAGA^ 
AAATTTAGGCTTGGAGAAAACAACACTT^TTAAAGATTTTATAGCT 

CrTTTCCTTATTACACTGTTTClTATCAGG ATTT^ GAG AG CAATGTGAGTTAGTTTGAT 

3 5 AATCTCAX3TGGCCAGTAACAGC!TATCCCCACTCTGAATATATACCAACCAACCTCT 

AAAGGCATTTCATGCAGTGTACTTTGTCATTCTAACCT 

CTrCAGTCACTCTTTTAGCTTTTCTCTTGGGAGACT 

ATATCTCATTAGCCTACCCCCCCTCCCCTTGTGTTTTAGAA^ 

TGCCCGTGTCCTGTTCAAACACTrGCrCCriTTTTCCTTGGGCAGGTGAAAGGAAAGCT 

4 0 CCAGACTGCGTGCCCTGCCAAGAAGGGAAGGAGTACACAGACAAAGCCCATTTTTCTTCCAA^ 

TGAAGGACATGGTAAGAGTCTTAAAATGO^ITGAAAGAGGCCAATCr^ 
ATTGGGGCCTACTGTGGTGCTATGTTGACACACAGGAAAGGGAAGGACAGGTGGCT 
CTACTGGTCTAGACCTTTATGAGTAAGTCTAG&CAATTCTTCCAGA 
CATCAGCTCAAATCAAAAGCrCAGAAATTATTTTTTT^ 

4 5 AATTCAGAAGCCTGCAGATACAAGGACACAGAAATCCAAAAGGTAGGTAGGAATGCTCCCCCT 

TGATAATTCCATCTGGGGACCTGAATTTTCATGGATATCTCAATGTATTCTAAGGACCCAGATTGAAGT 
AGTTTTC^TGACATGAAGAAACCTCTGGACATAAAGCTTTTCCCATCCCTATTCAGCCTATAAA 

AGTGCATT AGAATTGCTT AGGGAG CTACT AAAAATGTTTCATGTTTAGGTCCC ACCT CAGGGCAATTACATCAGAGTCTCTGAGTG 
AGGACCTTGGTATTGGCCTGATGGAGTAAGTCTGGGGTAGAGCCTAACATTCTACATTTT^ 

5 0 CTGCCAGT CCC CAGGT C AC CCTTTGAGTG TT AAC^ CT AG AG ACT AG AGGAACAGGGGAGACX5TCTAAGT ATTGGTGAGG 

TGGTTGCTGCTAAATGATTGCTGGCCATTTCAACCTGGGGTTTCCTGTTCTGTGTTTA 

TGTG ACTCTTGATAT AAGCAGTGGATCTCAAAAATCCATGCAG CTCCT CTT ATAATT AG CCGCTA 

TAACTAATAGTTTCCAAACTGATTTTCTAGGCTTAGAAGTGGAAATAAACT 

CC^AAClTri'l M rTGTAACTCTACTGTATGTG AACACTGTGACCCTTGCAC CAAGT AAGTTTT AGTCTTT CTCTGATT AAAACACT A 

5 5 GATATAACATGAGAGTTATCATTTTCCTAGGGAAGTAACACTGACTGAGAGTTAAGAATTAGGGTTCTAGTCCTC 

AGCAACTAGATGACTGTTTGCTCATTTAAACACTGGTO 

GAGCCAGACATATGCCTGCCATCAAG<STATCAGTCCAGAAGCACAGACAGTCAAGG^ 
GAGAATAGCATCAGAAGAGACTTATGTTCCAGGTGTTACAAATTATA 

AAGACATGAATGTGTCTCATGACTTGAATCTTGAAGAAAGAAAAGAAAGAGCATTAACAGGTGC 

6 0 AGGCAGGGAAACAGCAGCTGCAAAGAGACAAAGGAAGCTCAGGAAAACAG 

TTCAGGGCTGAATCATGAAGGGCCTTGTCTTTTAGGCTAAAGAGGTTGGTT^ 

TCTTCGGAAGCTTTTAGGAGTTCATCTAATAGGCTTAAGAAATAAGCACATGATCCCT^ 

GTACAGTTGTCCCAGGCTCTCCCCTGGCCACAAGTGTCTGAGAGTGCCCCT^ 

TG ACTTAG T AAAAT AAT AACAAT AAT AGTAATTC CAAAT ATTT C AT AATCTTG ATTT AATTT T CTTGT C AT CAT AACCTTTTGAAA 

6 5 TAGATAATAITATTTCCATGTTATTGATGAGATAAAAGGCTCATGGC^ 

GAGGCTGTGAACTCAGACAACCTGATTGTGAATGTTTGTCTGTCT 

ACCGGTTGCTAAAAGTGGCAGCCTCTAAGGGCOAGCTGAGTACC 

TTCTTGGTCrATAGGAATTTTTTGAAATTACTTTTG^ 

TATTACTAGGTTTAAGTTTATTTTTGTATCCACTrCATCTCTCTTGTGTGTCA 

7 0 TTAAGACAGTCATTCCTTATGATATTTTTCATCCAGCCATCCAAATT 

ATTTGAAGGAATACGTTTGCCAGAGATGCAAAGATGAATAAAATGGCCCCTA^ 
TATrCTGCCAGGCrTTTGAATTTCTCXTG 

CACCAAGTGCAAAGAGGAAGGTAATTATTTTTTTACGGTTATATTCn'CCTTTCCCCCAACC 
AATCACTCTTGATTAGTAGAAAGTCCTTTATTTAATCTTAAAGATTGCTT 
7 5 GATCCAGATCTAACTTGGGGTGGCTTTGTCTTCTTCTTTTGCCAATTCCA 
TGCAGATTGAAATAACTTGGGAAGTAGTTCACAAAGATTTGCCTCATTCTTACCTATAA
ATTGTTAATTTCTTGCCCCTGAATGCAGCCTTGAGAGCTGACTG^ 
CAGTTTTGGGGCA.TTGAGTGGTATTCTCATCCTTCCT 
ATTCTTCACTATTCATTGAGATCTCCCrrATGCyWVAAGAGAACACAAG 
5 CTTTAAAAACirGTACTTCACTGCCTCCTCTTCCTCGG CCT AATTGCTTGTTTTT AATT A i i T uT C CT TCTl' AACTT AAAAT ACT A 

TGGGGACACATGTTATACAAAGGTGACTTAGTAGAGTCAGTAGAAAAGCC^ 
CCTTTAAGTCATTCATCAACTACAGGGTCACACQ^^ 
GTTTCTTCAGTTCAGTTGAAGAAGAAAGTTTGCCTTGCCT 
TAATTGTGCTCAGAGACATACAGATTCTTCTATCTCACATTGACT^ 
1 0 GTGTGTGTATTTCATTTCn'GGGCATCCATAGCAAGTTGATGTTGA 
CGTCCACATCTTTGCTGGACAGAGAGTGGTGCTTGC 

AGATGACTAATTAAGATATGTCCTTTCACTAGAACACTTGACTTAGTAGTACGAAAGTTCW 
TGGCCACTTTTAAGTTTCACltlAATTTCTCCTT^ 
AAGGCTGAGACCTGAGTTGATAAAATTTCTTTGTTCITT 
1 5 GAAAACCAAGGTTCTCATGAATCTCCAACCTTAAATC 
AAATTAGTGATTTGGCTrTTTGTTACrTC L " !"^ 

TAGCCCAGGCTGGAGTCCAGGGGTGCAATCATGGCTCACTGCAGCCT 
GACCCTGGGACTACAGGCATGCACCACCATGCCAGGTTAATTTT^ 
AGGCTGGTCTCAAACTrTTGACCTCAAGOVATC 
2 0 CGGCCTGTTACTTTCTTTATTTAAACATTGCTTGTGTTT^ 
ACCCAAATTAAAAGTAAACTTGTACAOVAAAACCGT^ 
AAGAACrTGGCCAACCTCCATGTTCCGTGTTGTTGTTATCCCT^ 
CTGAGCAAAGAGATTGCTCTTAAAGTTTGTCATAAGGGCACAGCT^ 
TCTTI^CCCAACATGGTATCAATATATTTCAGATAGCAAGAAATACA 

2 5 CTTATTTGGGGCCAGGGCTCAGTGGGAGTTAGGAT^ 

TTAAGAATAAGAATAGTTACAGGGAAGAAAATTTTAGAGTAGTTATATTT 
TCAACCTATTTTATGAATTCTTTAGGTTTCTTGCCTTTA 
AGAATAAAAATTGAATGGTAAAAGTAACCTTCTTAATCACTT 
CTTTCTGAATTAAGGAAAAATTAGAAGTTCACATTTAGAATATTCTAAAGA 

3 0 TGCTTTCTAGGAAACAGTGGCAATAAATTTATCTGGTAAGGCTTT^ 

GGCCAATTTCAGAGTAAAATAATGTTACTAATTTCAGTGACATATTATGGGATCTTGTTATT^ 
CATAAAGCATTTATCAGGCAGTTTGTTTAAATTTATAATGAATACTC^^ 

CTGAGGCTTAAGAAACTTTTCCTTCATAATCAGCTAGATGTATTACAGAACTCCTGCCTAAAAAGATCT 
GTAGACTCAGGTATTATCAGTGTACCCAACTCTATAACAACATACATGATTCCATTCAGTTGT^ 

3 5 AAGATGATCATAAACTCTTTGCTTATACTTTTAGATTTGTGGGTCATTGATCATT 

TCCAATCCTCTTAATTTATAAATGAGAAAAGAAGATACAGAA 
TATGGCAGGATTTGGAGTTAGAACTCAAGGTTGTCITGTTTCTGTATTCCCCT 
ATATTCTGAAGTACTATAAAAAGAGAAATAAACATGGTTTTCACTAATGGGAATTTCATTTAGA 
TCTATTTTTCAGATGTTGACTTGAGTAAATATATCACCACTATTGCTGGAGTCATGACACTAAGTCA^ 

4 0 AAGAATGGTGTCAATGAAGCCAAAATAGATGAGATCAAGAATGACAATGTCCAAGACACAGCAGA^ 

TAATTGGCATCAACTTCATGG AAAG AAAG AAG CGTATG AC AGATTGATTAAAGATCTCIAAAAAAG CCAATCTTTGTA 
AGAAAA1TCAGACTATCATCCTCAAGGACATTACTAGTGACTCAGA 
AGTGAAAAACAAC!AAATTCAGTTCTGAGTATATGCAATTAGTGTTTGAAAAGA 
TTTTACTGGGTACATTTTATCATTTATTAGCGCTGAAGA^ 

4 5 GATGTTT AAAAT CTAGTTGGGAAAACAAACTT CATC AAG AGTAAATGCAGTGCCATGCTAAGT AC CCAAAT^ 

GGATGAAAGATTAAGATTATGCTCTGGCATCTAACATATGATTCTGTAGTATGAATGTAATCA^ 

TCCACAC^CTAACCCCACTCTATGAATCAATAGAAGAAGCTATGACCTTTTGCT 

TGCCTCTAAATTACCTCTGATAATTCTAGAGATTTTACCATATTTCT^ 

AAGTATATGT ATTTX3AGTG CAGAATTTAAATAAGGCTCTACCTC AAAGACCTTTG C ACAGTTT ATTGGTGTCATATTATACAATAT 

5 0 TTCAATTGTGAATTCACATAGAAAACATTAAATTATAATGTTTGACTAT^ 

TACTTCTTTCTCAGGCATCAAAAGCATTTTGAGCAGGAGAGTA 
TCTTAATGGCCTAATCCACCCCCAAACATGGAAATATCACCAAAAAATACT^ 
ATTCTAGCCTC^TTTC^AGATACTAACTGCTCTCAGAGAAAGTAGCTTTGTGACATGTCATGAACC 
ATAAAATAGATTCTTATTTTTCCCCGACCCCCGAAAATGTTCAATA^ 

5 5 TAGCAATGGTAAAATCATCATCTGGATTTAGGAATTGCTCTTGT 

ATCCTACGTTTAAATATCTITGAAAGT^GTATTAAATGTGAATTTT 
AGATAGTTATAAACTGAAGCAC^TACCTGGAACCACCTAAAGAACTTCCATTTATGGAI^ 
TAT AAAAT ATAGGTAAAAGTACGTAATTAAATAATGTTTTTGGTATTTCTGGTTTT 
TTTGTCTTCCTTTTCTCTAACTGATGCTAAATATAACTTGTCTTT^ 

6 0 TTAACCCTTITAGTAGTTAAATAATTATTTCCATAGGTTGCTATTGC 

AACAGTTTCGTATTCCAGATACTGGAATGTGGATAAGAAAGTATACATTTC^ 

GG ATTTTG AAATATTCTTTC CTGCATATTATCCATTCT AGCTACATGCTGG CCAGTGGGCCACCTTTCTTTTCTGCAATTTAATGC 
TAGTAATATATTCTATTTAACCCATGAGTCCCAAAGTATTAGCATTTCAACATGTAAGCATGTCGGT 
TAGGGTTCCCTCCTGTGTTATGGTCTGGAAAGTGTCnTTAGGCAGAAAGTCTC 
65 TTCTGAGCCATCATAGTCTCTGCTGTCTGCTCTCCAGTTTTCTATTTCTAGACAG 
CATGGCCAGAAGTGCAAGTTCTACTTTGCAAGACAA 
CTTTGAGTTCCTTTGGGAGGAAGACAGTGGAGAAGTCTTTGTACT^ 
CCAAGCATGACTTCTCCCATGTCAATGAGCACAGCCAC^TTCCCGA^ 
TGAACCTGGTTCTCTTTGTGGTGGGCATACTGGGTAGGAGAATCIACCCA 

7 0 CAGAAGGAGCTCACAGATCACATTGAAAGCATTGCATATTCAAACATCTTGGTCT 

CCTCTGATTAGATCAGACACTTTTTAGATATT 

AGAA T l' l 'Trri'CrATGCrTACTCCTGATTGGTAATTTGTTTGGGTTTAG 
TTTAAAAATATTAAACCATGTTTTCTTAACATCTGTTGCTGTTGATTATGTCT 
ATCTGTAAT/TTTTTTTCCTCTGGTAATTrTTAAAGCTTCCCCT 
75 AATGTGTATTTATTTTTACTTAATC™^ 
ATTTTCTCTTTAAATAGT GTTrCrTlA CCATTCT

TTTTCTAGGTCTCTTAA TTTTTTT TAAAACAATAATO 

AACTGTTAGTTTTTACTAAGCCCGGCTAGAATAGT^ 

TCTrTATTGTTCCTTTTACATTTTTACTTGTATATC 1 1 1 1 1 C 1 11 CTCTTTTCTCTCTTT 

5 TTTTTTTTCTTCTTGCAGTGCAGACAGACm 

TGTCCCTTCCTCCTTCTCCCCTCGCTGAGCCCTAGTGTGTCCGGAATTG^ 
GCGGACCCTCACGGTGAGTGTTACAGTTCrTAAAGGCAGCGTGTCCGGAGT^ 

CCTCTGGAGTTGtS^ 

1 0 AGCTCATGAAGGCAGTGTAGACCTAAAGAGTGAGCAGCAGCAAAATTTATTGAAAAGA 
AAGGGGACCCGGGCCAGTTGCCACTGCTGGCTCTXSGCAG 
TCCATTTTACAGAGAGCCGATTGGTCCATTTTACAGAGAGCTGAT^ 

CCTGAGCTAGACACAAAAGTTCTCAACGTCCCCACTAGATCAGCTAGATACAGAGTATCAACAO 

AG AGT AG CT AG ATACAG AGTG TCG ATTGGTGCATTC ACAAAC C CTG AG CT AG ACACAGGGTGATGATTGGTG TGTTT ACAACC CTT 
1 5 GAGCTAGATACAGAGTTGCCGATTGGTGTATTTACAATCCCTTAGCTAGAGATAAAGGTT 

CCCAGCTGGCTTCACCCCGTGGATCCTGCACCGGGGCTGCAGGTGGAGCTGCCTGCCAGTC 

GCCCTTGGATGGTCAATGCGACTGGGAGCCTTAGAGCAGGGGGCGGTGATCGTTGGGGAGGCTTGG^ 

CGAGCCCTGCCCCATGGGGAGGCAGCTAAGGCCTGGTGAGAAGTCAAGCACAGCAGCTGC 

CCCCAGGGCCGGCGGGGCTGGCTGGCCGCTCCCAGTGCAGGGCCTGCTGAGCCCACGCCCACCCGGAACTC^ 
2 0 CACCATGTGCAGCCCCGGTTCCCGCCCATGCCTCTCCCTCCACACCTCCCTGCAAGCT 

CCAGAAAGGGGCTCCCACAGTGCAGCAGCGGGCTGAAGGGCTCCTCAAGCACAGCCAGAGTGGGTGCCAA^ 

AGAGCGAGCAAGGGCTGTGAGGGCTGCCA6CACACTGTCACCTCTCACTAGGAGGCTG 

T C CACCTGAAACACATG AG AATGG AG AGCTT AGGG CTGG C CTTG CTT CTGG CT C CT ATTGTCAT AACTCT AATGTT AT C ACTTCAT 
TGAAAAGATTAGACCTCTCCTTCCCCAGTCCCTAGGCAGGGAACCCCAGCCACTGGGGAG^ 

2 5 GGTATATCCTCTAGACACTTCCGACCCCACCeCACACCCTTTAGTC 

TATCATAGTTTAATATAACTTATTCCATTATTTACCTTCTTGAAGA 
ATTATTTTAATTTTAACTGCAGTGATTTCATGCTGACTTTGTAGG 

TTCTAGGCTTATCTTGAATAGCCCA' lTr ' ri '' l ' 1 'CTCTCTGCACTCATCTCTCCATATTTAATAGTTCTGTGTGTTCTTTTGCCAGTr 
CTTCCAGAGTCCCCAGTTCTGAGCCAAGATTTACTGAGATAATCTTGAGGCTCATA 

3 0 TCAGTCATGCATTTGGGGGCAGCTTGGCTGAGCCACTGG^ 

TAGAAGCCACAGTCCCAGGGAGTGTTTGGCAGTAGGTCTCrrCTTAGTAGCTCTGTC^^ 
GAGAGAGAAGAAGGAAAAGAAACTGGTTAGGCAGACAGTTAGGAGGGGTCCTTGGTAA 
AAAAATCAAAGCTACAGGCACAGATAGAGCAGCCTGGGGAAAAATTTAG 
GAAGCCTTTGTTCTTTGGATAATCAGCGGGCTCCCAGGAAAA 

3 5 AACTTGCACAGGGAGAGGGGCTTACCTAAAACATACCCACAGTTTCATAAACAAGAGAAGCT 

ACCTGCAGCTGCACAGGTAAGGAGAGTTACACAGACAGCTACT^ 

GAGrrTTTTCTTATAAAAGCTTTTGGATTCAACTGTAAAAGTGGCAACCCTCT 

CTTTGAATTATTAAACTTTTACTCCAACCTCACCCTTGTC 

ACTACCTCAGACAACAAG ACTGAAACATTGTGGTG CACTGGCAAG ACTGC AACAGTTCT ACATCAG CCCGTGGTTTTAAGAAGGG A 

4 0 GCCTGGTTCCIGTTCTG CCTCAGGTGGGGCACATTT ATTCCG CAAGCATTGAT 

CCAACT AAGCTTTGGCTTGTTGTGC ACAAGTCATCAT CTTGTAGTTGTTT CCATTCTTATACCATGAAAGGATTG AT CTTATTTTT 
G AGCTTGGCT ATGCCTTTTT ATTTTCCTTTAGAGACGTTCT ATGTATCATTG CT ATTTG CTCAGATTGGTAGGAGGTGGCCCTCAA 
GAAATGAACTTACAATTrGGTATTAACCCCAGGGCCCATTAGATAACTAGAAACAATC 
AAACTAAGGATCAAGTTGTTTTTCAGTTATATAGGGCCCTTTGACAGG^ 

4 5 CATGTCGAATTGTAATCCCTAATGTTGGAGGTAGGGCCTGATTGGAGGTGA 

GCACAGTGACTCACACCTGTAATGCCAGCACTTTAGGA 
GCCAACATGGTGAAACCCCGTCTCTACTAAAGATACAAAAAATTAGC 

GAGGCTGAAGCAGGGAGAATCACTTGAACCCAGGAGG TGG AGGTTGCATTGAGCTGAGATCGTGCCATTG CACT CCAGCCTGGGCA 

5 0 ACAGAGCGAGACTCCATCCCAAAAAGAAAAAGAGAAGTGTGTACCACCTCCCCTG 

TGCATGCTTTCCCTTCACCT CCCACCATGTTTGTAAGTTTCCTGAGGCCTTC CCAG AAGCAGAAGAAGCTACATTTCCTGTACAGC 
TTCCAGAACCATGAGCCAATTAAACATCTTTTATTTATAAATTACCCAGTCTCGGCT 
GCACTTTGGGAGGCOVACGCAGGCAGATCACAAGGTCAGGTGTATG 
AAAATAOVAAAAATTAGCCGGGTGTGGTGGCGGGCACCTGTAATC^ 

5 5 CAGGAGGCAG AGGTTAAAGTGATCCAAGATTGTG CCATTACACTCCAGCCTGGGCIa^ CAAAAAACAAA 

CAAACAAAAAATTACCCAGTCTCAGTTATTTATACCAATGCAAGAACAGACAAATACAGGACT 
AGGGTTTCCCACATAGGTCTGCATCTTATGATAGTGGTGAATT^ 
AAGGTAGAAACAAATGGCTTAATCTCTTCGAGTCTTAATTTCCTTATTTTTTGTGA^ 
TGTTTAGCTCTGTACCTTGATAGATGGTAACAACAACAATAAAAAACCrC^ 

6 0 ttactagatgctcagatatggggtgggtttgactaacctaaaccagtgcctccagtcaagatt 

cctcttcatttatctacttcttgtccatctcttcatatatttgcttgccaaact 
cctatctaaatatcacagtccagaagaaatgacacttatgccccccgtcctccaagcacctcccct 
cttg ctg ctgagagccattcttattcaacn'gtttagatcataa 
tcgattggcttaatccacaggccccattcrtggagcctggcataggatcagcttcc 

6 5 gcaggaggacttctgttaagaggatgatagagggttctgggtaggagtgtcaggatgtc 

tagaaaaaaatgtattacctcatatgacaggcagtccaaaggtgtg<k:tcactgaagccta 
tactaaattcttaggaattttgaaagccagttgttaaatcrgttgatagc^ 
attgtcaaaggctacagatcaggaatcrcccccgacccccacaaagacagttactggcta 
gattcagtgtctcaatgatactatcagggtcacaggattcttttcta^ 

7 0 agttatttgtttttgcatggtcatccaatggctttatc^ 

agagagaaaaacatgaaaaaattctcccctcagatatggatcatataccrtggcctttta 
gcctacctgtgtggtctgccatatactgattgactcaggcaactc^ 
aagctttatgagttatgggaaaaagtagatcagaatggaaatactactagaaaaacaaaagggtga 
ctctgtagaaaagaacaaacaatatcaacracacatattcrccatc 

75 ATCCT AGGGCAAAAGG AAG TT CATTGTT ATTGGCCTT ACTGTTT CAG AACC CC C AAAA r: A G & C & C a TG GACTTTTTTATGAAGTAA 
AGAG ATCTTCAAG AATCTTTACAC CTTTT AAATGACCTCAACTG AGTCT AGGTCAAGTCATGG CTGCTGG AG AATG"r i"i u ri'GATTG

GATTTCTTTGAAAGAAG CCTTGAGGCAAGGTTTGGGT A TGATTCCAGGAAACATGGTGATGATGTT 

GTAAAGTAAGACAGGGAATAAGGAAAAGCCCATAGTGCAGGCCCTCAT^ 

ACTGGGGTCCCTCTGAAAGGCTATGTAGGACAAACCTCCACATTGTACCAA 
5 CAflTClTTCATTGGTTGAGGGCTCTlTCrAGGGCATTAACTCCCTAATAT^ 

TTGG T ATC CTGAJGG ACAGACAC CTTT AAG CAGAGAGATGG AGGAGG CCATG TGTCACATTTGTTAGCAGGTGATCTCTTGATG ATC 

CAGGGTCCAATAGTGTCTATGGCAAACAGCAGGCTGCTGAGATGTCT 

TCTGCATX3AAAAAGGTAATGGGCATGGTGAATTTTGTTTATTAA 

GTCTATTGCTTAAATCTCTTGCCCATGGATGAAAGCAAAGAAAAATCAATAAAGAAA 
1 0 AAAACACTGGTCTCTCATCCCTGTGTCAAAATATAGTGCAGATTTATC 

CTCTCCTCCTTAACACGTGCATATTGTCCAAACCACCATCTTGCTTTGGTATAAGG 

CACTCCTGTGGAATTCTTACCCTTATGCCATCTAAACCATGCACCTCTGATA 

AATTATCTCCCCTTTGCTAGGCC^TAAGCCATCTGAGGGCAGAAATCATGATT 

AATGAATCAGTTAATGAAGGAACATTATGTATGCAGTAGT 
1 5 GTCTTAGGGAAGCTGAACTTTTTTTTTTTTTTAAACATTCT 

CCCCCATCCCTGTGTCATTCCAGCCATATCCAGCTACTCCTTCCA^ 

CAAATCTTGCTCTTCACTGTCTGCACCrrTGTGCTCTCCTCTCTCTCAA 

GGGAAATCACTAGTTCCCGCAGGAGGCTGTAGGCTGTAACCAACTCTCTGAAAGC1TTGAG 

2 0 GAAGGCAGAAAAGGAAGCATAAAGAAGTGGGTAGGAGTGTAAGCTCTGAAGACTGTG 

GGACCTTGTCACCTTGGGCAACrTATTTACCTCCTCrGGGCCT^ 

ATGTAGTGCTGCTCTGAGGATTAAATGAGATACTCCACTrAGAGCCCATGGAATGGTATCTC 

CTTTCCTATAATTACAGTAATGCCCAGGGCTCCACAGGCCCAGCCCAGCACCAGTA 

GG CAG ATTGAGGAGAACACACAAAGGG CT AATGAT GTT CAAAT ACACATCAGATTTC 


HUMAN SEQUENCE - mRNA

GCAAGAGTGACACACAGGTGTTCAAAGACGCTTCTGGGGAGTGAGGGAAGCGGTTTA 

GG CACTGGCACGGAACACACCCTGAGGCCAGCCCTGGCTG CCCAGGCGG AGCTGCCTCTTCTCCCGCGGGTTGGTGGACCCGCTCA 

3 0 GTCTGTTGCTAGATTATCGTCO^AAAGTGTTAATGCCCAAGTGACTGACATCAA 

CTACAGTTGAGACTCAGAACTTGGAAGGCCTGCATCATGATGGCCAATTCTG 
AGGGACTGCACAGTCAATGGGGATGAACCAGACTC 

CAAATGCAGAAGATGTAGArTGTGTGATGAAGGACATGGCTTAGAAGTGGAAATAAACTC 
GATGTAAACCAAACTTTTTTTGTAACTCTACTGTATGTGAACACTGTGACCCrTGCACCA 

3 5 TGCACACTCACCAGCAACACCAAGTGCAAAGAGGAAGGATCCAGATCTAACTTGGG^ 

ACTAATTGTTTCGGTGAAGAGAAAGGAAGTACAGAAAACATC 

CCTTAAATCCTGAAACAGTGGCAATAAATTTATCTGATGTTGACTTGAGTAAATATATCACCA 

AGTCAAGTTAAAGGCTTTGTTCGAAAGAATGGTGTCAATGAAGCCAAAATAGATGAGATC^GAATG 

AGAACAGAAAGTTCAACTGCTTCGTAATTGGCATCAACTTCATGGAAAGAAAGAAGCGTATGACACATTC 

4 0 AAGCCAATCTTTGTACTCTTGCAGAGAAAATTCAGACT 

AATGAAATCCAAAGCTTGGTCTAGAGTGAAAAACAACAAATTCAGTTCTGAGTATATGCAAT^ 
GCTGGCTGTAAATACTGCTTGGTTTTTTACTGGGTACATTTTATCATTTATTAGCGCTGAAG 
ATATCTCATGATTCTGCCTCCAAGGATGTTTAAAATCTAGTTGGGAAAACAAACTTCATCAAG 
TACCCAAATAGGAGTGTATGCAGAGGATGAAAGATTAAGATTATGCTCTGGCATCTAACATATGATTCTGTAGT^ 

4 5 AGTGT ATGTT AGT ACAAATG T CT ATCCACAGG CT AACCC CACT CTATGAATCAAT AG AAG AAG CT ATG AC CTTTTG CTG AAAT ATC 

AGTTACTGAAGAGGCAGGCCACTTTGCCTCTAAATTACCTCTGATAAT^ 

CTCTGAGAAGATCATATTTATGTAAAGTATATGTATTTGAGTGCAGAATTTAAATAAGGCTCTACCTCAA^ 

TATTGGTCTCATATTATACAATATTTCAATTGTC 

GCATTTTACTGGCTCAAAACTACCTACTTCTTTCTCAGGC^ 

5 0 T CT C CATTTTTGC CTTGGTGCTCAT CTT AATGGCCT AAT GCAC CCC CAAACATGG AAATATCACCAAAAAAT ACTT AAT AGTCCAC 

CAAAAGGCAAGACTGCCCTTAGAAATTCTAGCCTGGTTTGGAGATACTAACT 

AACCCATGTTTGCAATCAAAGATGATAAAATAGATTCTTATTTTTCCCCCACCCCCGAAAATGTTCAAT 

CTG CT ACAAATGG CAG CTT AT A CAT AG CAATGGT AAAAT CAT CATCTGG ATTT AGG AATTGCT CTTGTCAT AC C CC CAAG TTTCT A 
AGATTTAAGATTCTCCTTACTACTATCCTACGTTTAAATATCTTTGAAAGTTTC 

5 5 TATTTCTGTAAATGTAAACTGTGAAGATAGTTATAAACTGAAGCAGATACCT 

TTTTTTGCCCCTTGTGTTTGGAATTATAAAATATAGGTAAAAGTACGTAATTAAATA 

HUMAN SEQUENCE - CODING 

ATGCTGGGCATCTGGACCCTCCTACCTCTGGTTCTTACGTCTGTTGCTAGATTATCGTCCAAAAGTGTTAAT^ 

6 0 CATCAACTCCAAGGGATTGGAATTGAGGAAGACTGTTACTACAGT^ 

TCTGCCATAAGCCCTGTCCTCCAGGTGAAAGGAAAGCTAGGGACTGCAC^ 

GAAGGGAAGGAGTACACAGACAAAGCCCATTTTTCTTCCAAATGCAGAAGATGTAGATTGTGTGATGAAGGACATGGCTT 

GGAAATAAACTGCACCCGGACCCAGAATACCAAGTGCAGATGTAAACCAAACTTTTTTTGTAACTCT 

ACCCTTGCACCAAATGTGAACATGGAATCATCAAGGAATGCACACTCACCAGCAACACCAAGTGCAA^ 

6 5 AACTTGGGGTGGCTTTGTCTTCTTCTTTTGCCAATTCCACTAATTGTTTGGGTC 

GCACAGAAAGGAAAACCAAGGTTCTCATGAATCTCCAACCTTAAATCCTGAAACAGTG^ 
GTAAATATATCACCACTATTGCTGGAGTCATGACACTAAGTCAAGTTAAAGGCTTTGTTCGAAAGAATGGTGT 
ATAGATGAGATCAAGAATGACAATGTCCAAGACACAGCAGAACAGAAAGTTCAACTGC 
GAAAGAAGCGTATGACACATTGATTAAAGATCTCAAAAAAGCCAATCTTTGTACTCTTGC^ 

70 AGGACATTACTAGTGACTCAGAAAATTCAAACrTCAGAAATGAAATCCAAAGCTTGGTCTAG 
Table 55

MOUSE NOMENCLATURE
ICSGNM Irf2 
Celera mCG1847

HUMAN NOMENCLATURE 
HGNC IRF2 
Celera hCG16244 



MOUSE SEQUENCE - GENOMIC

CCTGAAAAGAGTCTTTAATTTTCCCTCTGAAATT^ 

TCCTACAGAACATCTGACAGAGCACTTAACAGGGAATGGATCT^ 

CATGGTCAGGTTCTCACAGGAATACCCCACTATGCn'GGTACCAATTTGT 

GTTGGAGAGGAAAGC^TTTATTCAGCTTACACTTCCACATTGCTGT 
1 5 GGAAGCAGTAGCTGATGCRGAAGCCATGGAGAGATGTTACTTACT^ 

ACCCAAGACTGCCAGCCCAGAGATG<3TCCCACCCACAJ^G^ 

GATCTCATGAAGGCATTTCCCCAACTGAAGCTCCTTTCTCTGTGATAACT 

CAATCATCTTAGGGCCA.GGAGTGGGATAGAAGGAGGTGCA.CACAGAACCACGCA 

GTCTGTGGCTGCTTAGATTCGCITGTCGCACGTCACACCACTGAA 
2 0 ATTCATGGTGGGCTTTCAGAGCTGTGCCGAGCAGA 

TG CCTGTTTGTTTCCTTTCCACAACTATACTGTCTGAGCAG AAGAACGCCACG ATGG CTTGGGCATTTTT 

GCCTGCCTGCATGTCTGAGTACC^TTTATACACCCATGGAGTTTATGTTACAACT 

CTATCTGCAAAAAGTATCCCAAAGTTTACGCTGATGAAAATC 

CTTCCACTAAATATGGAAATCATCAGGCTGCATAGTAGGTGGTATAAGGACCAAACAGGAAA 

2 5 ATGTATTCAAATCGAAAGTGTTTGACZATTTCAAAGTATCTCCTTTGG^ 

TGTATAAAATTTTAATTTTCTCTTACCTTCAATC 

TTTATGTGATCAGAGGTAAATGCTTGGGAGAGATAACAGAGTGAGTGGCAAGAGTCCTCCAGGCTCT 

TTATAAGGTCGTCTCATGGAAGGGTGCCAGCATTGGTCGTATGTCCTGTGAAGCIACA 

TGAGAAGTCAAGGGAAGACGTGACGACAAATAAAATGGAGATAAGTAGAGAATATCCTGTCTAATGATGGG 

3 0 ACCCTGACATGGCTGGGCACACAGTGTGAATTTAATCAGGCCCAC 

TTGGGCTGCTTGGTCCAAACACrTCCITCrCTCTTCTATCGTATGGT^ 
TGATCTGAACTTGGAGTCTGACTGGCAGATGGTAATAGCAACCGTTATCTTCT 
GCCCAGAGCAGGATTAAACAGGTAAGAAGTAAACCTCTTTGTTAAGAAGAGGGA 
ACCCCAGAACTCTGGAGGTGGAGGCAGGAAGATAAAAAGTTGGAGATGAACCTCAGCTACATA^ 

3 5 TGCTCTAGACCTAATTTCAAAACCAAACATACAGAGAGGAGCTTAAAGTCAGGATCC^ 

ACAACAACAACAACAACAACAACAACAACAACAACAAGCA.TGACTGACTGTTTA 
ACTTAAAATTTTAGGAACTATTACAAAAGAAGTTGTTCGTGGACAAAACAAAC 

ACCTAAAAGAATG TT AGAAATCAAACT AT AAAAC ACAT CCCAG AAC CATT ATC ATG ACACAAT CT CAAAAAAAAAAATCATT ATTT 
TTTTTTCACTCAAAAACCTGAGATAAGTAGAGTATGCCTAAAATTTGTGTGTCCC 

4 0 TGCCTCAAGAGTCATTAATTAGATTCACATTATTTACTTGTG 

GCTTAAGAAGTAAACACACTGACCGGAGTTCAGCATGTTTCT 

GTCAC!AAATGCCTCCAGGGGGCCAGCAAGTTAAGAAACCTTTGTGCTTCTCTGATGTAGAAG 

GATTTTTGAAACCGTACGTGGATTCCATAATTCCCACTGCTCA 

CACGTTATCTCCTCCCAACTCTTTCTGCCAACTCTCAGAAGATGCCAGGTCT 

4 5 CTGTTCAGGTTCCTTATCCACACTGGTATGTGATGTCACGGCTTCCTCCCTGTTTACATGCT 

ATGTGACCTTATATTTACATACTTATGTGTTTGTTTAGAGACA 
AGACCAGGCTAGCCTCACATTCAGATCTCCCTGCTGGAATCCAGGCATGCACC^ 
TTGAACTCTGGTCCTCGTNNNNNNNNNNNNNNNNNNNNAA 
TGTCTCCAGCTCTACTTTGCTGAGTTTAGGGGTTACACATCTGAAAT^ 

5 0 CGGATGAACATCAAGATACAGCTGTGGAGCCGTCCTCACTTCGCTGCTAA 

ATGTCIAGGGTTTGGAAGTGGCAAGCCTTCCCTTTGGACAAGAACACTCATCAC^ 
GAGGGAGGAAAGAAGGGAGGGGGGGAGAGAGAGAGAGAGACAGAGAGAGAGAGGCAGAGAGA 

GAGACAGAGATACACAGAGACAGAGTTGGGGGGCAGGGACAGACACAGAGACAGAGTTAGGGGAAGGGAGAGCTATAGAGCTATCC 
TGGAAATCGTAGAGTTATGTTTCCTTTCTATTTATCAGAATACTTATTACTAAGCATTGGGTAATCTC 

5 5 TCTCATGATGTTGTAACAAACTAAGGGTTGTTTCTTTTTCCAGTGAAATAGAAGCAAG 

TGTTTCCAGATTCACAGAACTCCCTTTAAGGCGTGCACATCACCGAAGTCCTGCTGCC 

GT AGTCTTT ACTCACTCTGT ACTT CAC CGGGATG ATCT CTGCAG TG ATCTT CTCTGCACGGACTT AAACTTGT ATTGAGCATG CGG A 

CITTGCCTTGTATTGAGCATGTGCTCTGGCAGGCATGGAACCCT 

CACAGAGGAGTGCCCCAGTGGCTGGCACAGGAGTGGTTCCAGTGACGGGGGTGGGG^ 

6 0 GGTCTCTGGTTTCATCTCAAGGGTACTTAGACTTTGGTTAACAAGCTAGACAGTACACCTTCTTAC^ 

AGGCCTGTGATAATAGCTAGAGGCCATGTCCCTGCCATTCCGGGGATGCTATATTGGAAATCGACATGGT 
TAAAAAAAAAAAAAGTGTTCCAGATGACTGTGGTTTTGGTTCCTTTGGTCC^TTGC 
GACGCCCAGCACGTATTTGTGAATTGAGTAATGATGTTATTAAAAGTATCT 
AAAAGG^GTTCACTCTGAGTGTGTCTTCAGTTTCTCTGGTTTGCCAAGAT 
65 TACCTATTGTCCTAACTCTCCTGAAGAGATCCCTATCCACACACCCAGGGAGATCCT 
AAAATGCAATCCCTGAGAATATATTCTAATTTTACTATAACGCAAATGGTACTC^ 
CTATATTGATTCATTTTCCACAGAGATGTGTGTGTGTGCATCCGTATC^ 
CATCATaUUVGAGTCAAGAAATATTTTGTGACTATGTGGATCAGTACCATGTGGATCTTCTG 
CAGTGAC!ATTTCATCAAGGGCX3TAAGAACACGGTCCTATTCATCCTACGTTTTATG^ 






TGGCTCCTGATGACATGACCAATATGCGTTTGTAGGGAAAGATGTTATGAGTACTT^ 

AAGCGAGGAGGATCCTGACTTCAAGGCAGCCTGGGCTTGAGATGGTCTGTGTTTTCCTAGTGTGAGTA 

ATATATGAATCTCACTGGCATCTAGGAAGTGATATATGTCAGTAGCTCAGAGTTTATT^^ 

CTCAACTTGT AAG GTG CCTGACC CCCAT AAACACATTT AC CAGTftfi A AGC ACTOT ATCCTAAAATG CT ATTC ACAT AC CT AG TTT A 
GAGAAAGAGTGTAGCAACTGGGAGATTGCAGTGT 

GCCATTGCCCAACATGGCAACAAAGTATTATACTGCTTAGTGCTAGTCCAGG 
CTACTGAGTCCTTGTTGCTTTTCTACCATTGCATAAT^ 

TATCTCAGGCTCTGGGCGTGGATGAGAGCG CCCAAAGAGAAAAAAAAAAAAAAACCCAAACATATAAAATGAGAAAATAAGAGGGT 
5 TGCAGACGGGGTGCCTGGAAACACCATAATTTAGGAGGCTGGCAGCAA 

AACGATGTCACAGAAGCCCAGCGGGGGATGGGAAGTGCAGACTGCACCAG^ 
TCGAAAGArTACGAATGGGAAGCACATAGAAATATCCATAGGTCTATTTCTGAGGATAAAATTC 
TGTGTGTGTGTGTGTGTGTATGTGTGTGTGTGTGTGTATACAGTTATGTACATGCTGGATGTAAA 
CTTACrTTAAGAAAATACTACCAATTGAGCTAC^TCTTCAAC 
1 0 CCTTGAACTCAGACTTCCTCAGCCTG&CAAATGCTGG 

TTCTCTATGTTTTCTGACTTCCATCTAATGTTACATTTGTATTAAAATGTTCl^ 

GAGGACGTACAACTCTTGCAGAGGACCCAGGTTCAGGTTCCAGTACCC^ 

AGATCTGCGTCTTCTTTTGACCTCCAGGCACACACAGAATC 

AATAACGTTTTTTTCCTTT ITTT 1 1 ' 1' I AAAGGATTT ATTT ATTT ATTTTT ACAT ATGTGACT ACACTC 
1 5 CCAGCAGAGGGCAGCAGATCCCACTATAGATGGTTGTX3AGCCACCAGGTGGTTTGCT 
CAGCCAGGGCTCTTAACCACTGAGCCATCTCTCCAGCCCCTCA^ 
GAGAAAGTCTGGACTATTCCGTACGAACTCAGAGATAAGAGCAA 

GGG ATATAGG CACTTGAG CTGGTAACACGTGAGGCTAGCTGATAGTCTGCTACCACATTAAACTC 
2 0 AGATGGCCTCTATTCTGTAAGCTCCAGATGAGCTTCTGAGGGGCAGAGAGGACA 

TTGATTTGGATAACCATGTGTCTATTGACGTGGACAAACCCAAGGGAGGTGACrGGGCAT 
CTCATCrK^AGACCCTGGGGGGTTGCATGGACCTCT 

GATGGATTGCTAGACCGTGTGGGAAAGAAGGAATCAAGAACrrGAAAAAAAAAATTAACAGTCC^ 

T AAAGACCACACAT AAGT AGGG CATTG CTCTTTCTGAAACAGTG AGATTGG GGTG CA CTT AG AGCGGGCAGG ACTGTTTCCTG ATG 

2 5 GAGAATCTAGTGAAGACACAAATGGGAAGCCAATCGTCAGGAAAGCAOT 

CAGGACTAGCCTCTGTTCACGGATACAGGGCCTTAG 

TTCATCTCTGAGGAAGCTCCTATCATAGATAGCAGGGAAGAGACATCGGTTGGCCCCACCA 

CGGCCTGCACATGAAGGATGCTGAACCGAGATAAGAAGCTCAGAAAGAAGG 

GCCTTTCCTCTCCCrGTTCTAGGGAAGAGAAGACAATGGGTTGCACACCGATTA 

3 0 CAGG ATTCAAG AGC CAAGG CAGG AACG TTGTT AGTT CCCGGACAGCTTG GAAAACAATTCTGG TTCACCTAG ACTT C CT AAG ATGA 

T C CTGT CTAGAAAAAAAAC AAAAAACAAAACAAAC AAACAAACAAAAA^ CA 

GCAACAGCAAAGAAAACAACAACAAAAG<^GCCGCAGAGATGGGGGAAGGG^ 

ATCCCGAAGLACrCTTGAAAAGCCAGGTCTC 

AGTGGCTGGCTACCAGCCTAGTTCTAGGTTCAGTGAGAAA^ 

3 5 TGGCCTCCGTGAGTACACATGCACACATGCACACATGTACCAC^ 

TTATTAGGCTTTTAACTTAGGTGCCTGGTGGGTAAAGACATACAACTACCATG<ZAGTGTGTATATCCCT 
AAATATAAATG CAAGG AGTGTGCAGTGTTATTATGGGAGAAATAAAGGACACTGTGCAGGAAGG 
AGGGTCCTTAATCTCACTGCTCTCCTTCTACAAGGGCATGCATACTTTCCTTTATCCCT 
AAAGAGGAGGTAGAATTATTTTTCAGATGTTTACACATAGATAGTGACAGTCTCCTAGGGTGGGATGCT 

4 0 AAGGCAGGTGGTGGGCTGAGGAAACTGTAAGCCAGGCAAGAAGCAGGTGTAACTA 

CCAAACAGCAAGAGGTTCCAAGCG AG CTGG CCT AGTAG AGCCCAT AGGCCGCTG CACCTGTCATAGGTACTGGGCCCTCTTCTTGC 
ATTCAGTATTGTCTCCACGGAACAGCACAGAGGACAGCAGCCAAGGATGTTCAACCAATAA 
GGCAGCGTTCCCTCCAGGTTCAGTCCCAATCAAGACAGTTTAAGTTCATGTAACT 
GTTATGGCAAAGACATGAAGTAACTTTGCAGAGTGACTTTCTAGGGGA^ 

4 5 NNNNHNNNNNNNNIWCCCAGTAGGA^ 

CCTTTGCAGCGGGCGACTTTCCAAGATGAGTTTGTATATGCAGC 
GGGTCCCCAGGCTTCC<K3GGCTCTCTCrGGTCCATCTCCACCCTTCAG<3TA 

GGCCACCACCAGCTGGTGGCCAGCTCCCAAGCCAGCGCCCCGTCCACCACTTCTGGTGCCCCACACT 
AAAGCGCGCGCCCTTCGGCGrcATCCCCACGCTCACACGC^ 

5 0 TGAAATTGACTTTTCCGAGAAATGATGAAAGCCGTGCCGGC 

GCTGGAGCCCTCCCTCTGCCTCCACCTCCGGGTTGCACAAGCTTGAAACAAACACT 

GAGGGGGAGCGAGGCAGGGAGCGGGGCCGGGAGAGGGCGC^KK^TGGG^GGGATTTCCAGCC^ 

TGTTTTGCTTTCGATCTGGACTGTTCTCAGGCAAGCCGOGGACTAACTTTTAGT 

TCATTTCCATTTTACACACCCTAACAACACTCACACCTTGCGGGATTGTATTGGTAGCGTGGAAAAAAAAAAGCA 
5 5 AACATTTTTTTTTTTCAAATAATAGTTGTGGCTGTGCGTGTGTTTTCTCT 

TGT ATG TG CGTCCTG AT GCTTTGACACG ATGTTTTG CAGACGG AAAAATG C CGGGGTT CATAAAT ATT AAT ACCGATTTTTGG AGG 
AAGAGTCGGCGCAGTTATGGATCATCTGATTTTAGGGGAAGCAGGGGTCTGCTGCTGTTG 

G CAAAG AGGAGGAGGTAGAGAGGGAGG AGG AGGAGGTT AGTG AAGGTTT AATTT CACTTTTGGATTTGCAG ATGAGAAGAGGTGGC 
TCCATGGACACAGATCTGCTCTGTGTGATTCTGCGCTCCAGGTGTCGTGTGTTGTGGGTACAGG 
60 TAAAATAAAATTGGTGTGTGTGTGTGTGTGTGTGTGTTGTATGTGTATGAGTGTTTACAGACACT 
GGCAACCGGGTTCTTCTCGGTGCCCAGCGGCCCTTCTTTCCCGGCGGTCTCTGGTGT 
GCCCGTGCCCGCGCAGTGACCACTGTAAACTTGAAGGCTGCCCT^ 
GGTGACCGGCGTCTGGTGCCAGTGGACACTCG^CCGGGC^ 

TCCTCGGGACTGGGCCGCCGCCACACGCCAGOAACGTCGCGTCCTCCrCCCCGCCGGCCAGCGT 

CCTAGCTCCGAGTGCCTCCGGGTCCCCGCGGG^^ 
GGGCGAGGTGTGGGGGCGGGGAACCTCCGGGGACCGAGCGGCGCCC^ 
TTG<3GGTCTCCGGGACGTGGTGG<3AATGAGTGCCCGCCCTGTGCCCT 
CCGAACATGGTTG<jTGCAGAGCG<3CAGCC^C<^OT 
70 CC^TCGCGGTGCCCAGCTAGTCCCTGTCGCTCGGCCTTGGGGGC 
GGGCCCCGGCTCCCCGGACGCTGTCCGCTGGGCCCCGGCTCT 

AAGGGAAGGAGGCTAGTAGCCTCCGCCGCGAGCCAGGACTGTGCTCXSGGCCCCAGGTGAGACGCCCGGATC^ 
GATCCTTGGTGGTCGGATGTAAATAAGACCCCTGGGATTGGCA^ 

AGCCCAGTGTGTTTGAAACTTGGGGGACATGi i iv,i iGTCCCCTCCTCCCTGCCGCGTGGGCACTCCTTGTCCTTC^- x iGGTTGTG 
75 TCTGTACCAGGGAGGCAAAGTAGTCCTGGGGTTAGCTTTC^ 
ATGGCTCCTGAGGGTGGAGGACAGTTACATTTCCTCTTGTGCATTTT^ 

AAGTCATTTATTAGTTTTGTTTGTTGGTTGTTTT^ 

GTTAGCAGCCTGCAC!TTAAGGTGCTTAAGGGCAGGTG 

TGGTAGACAGATGCACACTGCTTGGTTAGGCCAGAGCCCTAGAGGTTGCCC^ 
5 TCTGATTTITCTCAACTCTCATTCrrTTTTG<^ 

GCTTTTCTAAGXjCAGCTTCAAGCAAATGGAATATT^ 

GATGGGGCAAAGTCCGGAACGTCACTGTGATTGGTGAATGATTATA^ 

CTTGTGCTTAGATCCAAGAGATCITGCCTGTTATGATTTGTT'A 

GTGGTCTTTTTTCCCC CCCTTGACCTGCAAAAGTTAGGAAGAGTTG CACGTGTTTCAG AATGACTTAACCTGTAAAAATGGCATCT 
1 0 TCATGAAACAAAAAAAGAG<3ACTTGTAATCAAATTGGA 

GGAACTATGGGCTCCTGAAATACTTTATTAAGTAAATATTrAATATACCAGAG 

ACTTGAAACTTTGGATAGTCCCTTGACGCACATTGATTTCCTCACATATC 

TAAAATCTGGATCCTCTTTTCCAGCTTTCCCCCCACCCCACCCCCTTTTTTAATGC 

TTATTGCAGATCCCTTATCTGCCTCCTGCCAAAC^ 
1 5 GGCCATGAACTT ACCAGACACAGTGGCAGGCTCTGTGAACCCAACT AGAACCGTG AAGG ACAGAT AG CCT CTTCAAGTTCTACAGAA 

GTTGCATTCTAGCTGTGAGCTGACAACAACCAAAAATAAATCCAGTAA 

TTCTACTTACCGGGCTGCTTTGGGGGAACAATTGTGATACTGTGTG 

TCATGTATTGACTTTCTAAGTTCTGTTAGATTACTGTTAAAGCGAATATTAGAGAGGTGA^ 
ATTGAGGCTGAGGCAGGAGGATTGTGAGTTCAAGTTTAGATACAGGCTGAG^ 
2 0 AGGACAGGACAGTGCCTCTGAGGGTACACAAATCATGGCAAAGAATTACACATTTGGATAACTATTATA 
GTTTAGCAGTGAAATATTATGTGAA TTTTTTT CTCTT TTT T TT GG^ 

TG ATGTGGGG ATAACTGACTTGGATG CATGTTTTTCCAATTT ATTT AT ACATT AAATT CTTT ATTG T AATGAACTTACC CATAAAG 

ACATCATAAAGGTAGAGAGAGGCGGCCATTCAGACAAACACTTGCCCAATCCCCGTCTCTGTGATTTCTCTGA^ 

TGGAAAGGGCCCCTCTGAAGTCCACAAACATCACCTG<3GGGOCTTGCATTTCGTT 

2 5 TTAAAACCTAGAAAAACATTAGCTTTATGTTCTC 

TACTCTTTCTTGGTCTCTTTTCTAAATAGCATAGGA 

GG AC CTG AGT ATG AGGG ACAAT ATGTTTCC CCTTTTT AT AATGCAACAC ACAGGTG TC ACT AT^ CTGAATTTTCTrCAC 

CGTCGTGTCAGACrCATTAAGTTATTGCTGTTATGGGTATGGCTTAGTTCTATTCT 

TAGITCTATTCrcCAGTCCATTGCGCrGTCATTAAAGGGACTTAGTTC^ 

3 0 GTTGACTTGGATTCACTGACGAGTGCGCAACACTATCTGAGTAA 

GATTTGTAATATATGATATGTGAACAAATACATTGCTGTGAAGTAACAGGAAGTATTAACAGTAAAGCCAATGCTATTCATATGTA 
TGGCACTCCATACTCCTTCCTGGACAGGGGGAGAGGTGAGCAGGACTTTTCT 
TTTGCATG^ATTAATATTTAGTTAAGAATATCAGCTTAATGTATAATC 
TAGCTAAGGAGGTTTGATCAAGCTTTGACTTAGCCCCAAAGGAATAATAA^ 

3 5 TATTTTCAAATAAAATGCCAAGGAAAACAATGACCCAGAAAAATGCAAGCCCCAGGCAG 

GAAGATTTCCCAGGCTCIGTGTGAATTCAGAGCGAGCAGTAGGCCT^ 
GATTTTACACCTTACAGGATTCATCTTAGGACAGTCCCT^ 

ACTTGG ATACCTCTGGCITCCAGTCAGAACTCCTGGTCA^ CG AG ATGGCT CAGTG AG 

TAAGAGCACTGACTGCTCTTCTGAAGGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCACAACCATCT^ 

4 0 ATGCCCTCTTCTGGTGTGTCTGAAGACAGCTACAGTATACTTATGTATAATAATAAATAAA 

TCAAATCTAGTAAAATCTAGTAAGCCTTATTCTACCCAAACAGAGG^ 

TGGGACCAGCTGAGGCCITCCCTCAGAACTTCGAGTTCCTCTGGGGACCCAGGAATCCTATACC^ 

TAAATGTCGTGATCCACAAACATATTCATTTATCAAGGATTTGTAGACAACTAGCCATGTACAGGG 

ATGGGCTCAGAAGCCTAGAATTCACTACATCCATGCACAAAACACACACACAC^AAA 

4 5 ACACAC^CACACAC^C^CACACACACACTTGCACCC^CAAGC^ 

TAAGCAAGTGCrCTACCATTCAGCTATACCCCGAACCTTGGCATTGACTTT^ 

TTGGAACCTTAATTCCACAAACTGTGGTTTGATAGTTCTGCAATCCTC!A 

CCAAGGGAAAGGAACATTTAGAAACCATTCAGGArrCITTAGATTGGAGACTGGCrC 

AATATAGTATAGTATAGTATAGTATAGTATAGTATATT'ACAGTATAGTATATTACAGTATATCATCrATCTCTTCCT 

5 0 AGTATGAGCCTCTTGTCCCCTATGGTX3TGCCGGGCCCATCATTCCAGTCTTCCTCCATTGTG 

ACTCAGTGTTC^AGGGTCrCTTCCC^GGACrAATATGGATAACCGTGGATG 
GCAAACAGCAAGCJUUAGCTGCTTGCTTTTACACTGTCCATACAGCTT 

CCTGAGCCCTCACAGCCCAGATGATATTCACATCATGGCTCTCGTACTCTGACTTTTCCATTTCCAGGAAGAC 
GCAGCCTTCGTGTGCAGGAAGGCCGTAGAGAGTGCGGGAGGGTTATTTTTCCTGACTTGTTTTGTTrCTG 

5 5 CTTCCATCAG CTCT AAT AAGCCAGAAAGAACTAG AGTTCAAC AAAT AGATGTGTGGGG ACCCAT^ 

AAGTGTTTTGTTTTAATTCTGAGAGAACAACCAATGTAGTCGAAC 

AGTTTTCATTAAACGATTTTCTTTTTTTTGAATAATAAAATACCTCCCATGTT 

TATCAATATACTGCTATCTTTGTACCAGCTTTGGATGAACCTTCACTTTAACT 

G AT AG AGG AT G CTCTTTTGG CCTG TG AG CTT CACATCT CTG ACT GATTCCACATCTTG TTTGG TTG AT ACTTTG AACGAGAAGG CA 
60 CTGAArCCAAATGGCCTTATTTCAGCCGATATC^CTATGC^ 

AAG CTATC^ AAG AGAGTGTCCTGATACCTAAGGAGGGTGTTGGTAGGGCGGTGGGTAG AAAAGTGG CCAAGGGGGAATTGGAGAG 
AGAATAACGAGAAAGCTAGATGGTGGGTTCTTGGTACTGGAAGAGAGGGAAGGGGCAGTTA 

AC71GCTGG CATGG ATGGTT AAT ATTGATTTTT ATTT CATGTG T ACG AG TGTTTTG C CTG AATGT ATG TCTGTGT ACCATTTGCAG A 
CCTGGTTCTTTTAGAGGCa^CAAGGGTGGCAGArrCCCTGGAGCTGGAGTTACAGATC 

6 5 GAACAGAACCTGGTGACTCrGGAAAGCATCAAGCACTC 

TTTTTAACATTTTAAGCGTTATATGCCACTAAGTTAAATTCCAGGAAATTCCTTAC^ 

AAGGAAGGAAAGGGGAACAAGT ACTTAGTCAAGGGTCAATTGAG ATGCTGAAG AGTTTCTG CCGTTG CCTTTGGGGTTAAACCTTT 
T AACCC CGTGG AAGATGG ATTGTGACAGG AAACTGCAG^GC^^ G CTGG ACTT ACAGG AGACTGT CACCT 

GGGCATGGTTGGGGTCACCTTGAAGGTATTGATGTTGAAGTATTGGGAACTGAAGG 

7 0 TTTCTCTAAAAAATAACC<^AAACAAAACAAAACACCCCCAAACCTCCCCACA 

TCTCATAGGCCCGCCTCCTCTCACCCATGCTTrTCTTTGATTGGATGTTACTC 
GAAATATAGAATGGGCAGTAGTGTGTTCTCTTCAAGAATAGGTGATGGTTC 
TGGGAAACTGTGGGGACCATAATGTAGCTGAGGCTTTACACAGAGGAATTTTCTGGAA 
TTAAATGCTGAATTTAACATGCTTGAACATTTAACATGCTACAGTGGTGCATGCTTGCCACCCTA^ 
7 5 AGGAGGATTGTTACAAAGTTGAGGTTGTCTTGGGGTTACACAGTGAGTGTG AGGCTAATTTGGATACACAGTGAGACTTTOTTTCA 
AAACAAACCTCAACCAAACAAGAGTTAAACTTTCTGT^ 
TAGAACAAAGGTTTATGTATTAGGGTGTTTATAGAATG^ 

TAGTTGGCTGGGAGAAAAGGATATTAGATTGTATGCCACACAATTT^ 
5 TGGTGAAAGAAAAGAGTTTGATGGTAGGATTATATTTGATTTTTT^ 

GGGACAAGAAAAAGGAAATTtSGCCCGGGCAGTGGCGCACATCTTTAATCC 
GGCTCAAGGCTAGCITGGTCrrACAGAGCGAATTCCAGGACAGC^ 

AT AAACAAACAAACAAACAAAAGG AG ATTGAACGTTG CTGAGG ATG CT ACCTGT AG ACT AAG CT TTT CTTCTGCTCACTGTGCTC C 
1 0 CITGTGCAGATTCCAGGTCTAGAACTGTGGCCTCTTCTA 

TGGTCTAAGATCTTACCCCATTTGACAGTTCTGCCTT 

GAGCTACTGATATTTCCAAGCAAGCAAACAAGAGGATCATGCAGTC 

CATTTCACTGTGTGGCGGAACTGTCCCTCTCCCCTTCCTATAGGATCAGGGT^ 

GGACTGTTCCCTATCCTGTCTGCAGTAGAAATGCTTGGGTAGATAGGGCTCTTCTTAGur 
1 5 AAGGGATTTGGACTGGGTGGTATTCrTTGACCCTGTTCAGGW 

ACAAATGGCCACAGATCTAATTCTCAGGGTGTATGCAAAGTAAGAGGCCTGGCGAAGCATCCCTTCCC 

CCCCGTTGACTTCATGTCCTTTGTCCCAGGAGGGCAGATATTCCCCCCATACCC^ 

GGAAGCCGCAGTCACACTGTGATGTCTGATTTTTTGCAGTGGAAGACGACT 

TAGAACAGTTTCACTCCTATTATGAATACAAACTTATTITATCATTC 
2 0 AGGCAGAAGGATGAAACCAGAGAGAGGCAGGATTAATGAAGAGGACAGTCTCCAGTACAAAATAC^ 

TCTTATTTTCTTITAGTATCTATATTAGAAAAGTAAAAGGAAGAGGTGACA 

G AT ATT ATT ATTT CAATAT AATC AGTG CAAAAACAACGTT ATTTTG CATTCC CCTTT AAAGCC CAGT ATGTTG CTTTTTAAG CTGC 
AGCTTGTCTTAATTCAACCTATCTACATTCTGCGGGCTCAGCAGATACAGGTC 

GGAG ATGCACAGCGGGGAGCTGG CG AG ATGCTCGGTTGCTATAGT ACCTGTTGCACAAGCCTG AGGAC CAG AGGCTGATCCCCAGG 

2 5 ACCCATGTAAAAGCCAGGCCTGGTAATCCACATTTGTAATCCCAGCACTCAG 

GCCTTTTATCAAAACAAAGGTAGTCTCTACCTAAGGAGAGATATTGTAGTTATCCCCTGATACCCT 
ACACAGACACACACACACACACACACACACACACAC^ 

AGTGCCCTGGGGACAGTTTTAATACAAGGAATGGAGTTTAAAAAGACCGAAGGGAGGCAG^ 
CTCATGGCTCrCCTTTGCCCTCTCATCATAAGCTTTA 

3 0 CAAAAGC AGCG CAT AGCACGT ACTG TT C CCATGG CC7VCGAG CTGTGTGT ACACCTGAGTGT CTG TGGG CCTCTCGGTGTG CTGTG A 

AAGGCTCTGGTTATTTGGA C1TTT1TTTT TATTTCCCTCT 
CTCAAGGACACTTGATACGCAGTAAGAACTAGATAAATATTTGTTGCTCTGATT 
TGAGCTCCAACTTGGAAGGGATGGTACAGGTCTGCTTTGTGTTAGAAACCT 
TCGTCACTTAGOVAGGAGACTTGTCACTATGC^GAaSAGGTGCAACACT 

3 5 CCAAGACATTTGAAAGAGCCI^TATTTCATTCCCAACAGTGA 

TGGAGACAAGCTTCCGAGAATTACTGTTTTCn'GTATGGATTAAAGTAGTGATGCCTTTGT^ 

CCTTAACCTGTGCAGTGGAACTTCCrrCTG AAATCAAATGCAGTGTTTT AT AGCTG CCTC CATTTGTTCCCCTTG ACTTTAATTGTCG 
GGCTGAAGCACCAGGCCTGTTCACAGAGCTAAGGCAATTTATTTTTACTGAA^ 

GCAAGATTAAATGTGACCTTCATGGAATCGTGTTAGCAGGGCCAGCAGCTAGAAAAATAAAGACTAAACCAAACTGTT 

ATGGTTTAGGGATTTGCAACCGACAGAAGAACAGAAAAATATAATCTCGTTT^ 

GAGTCITTCTTGGGGGGAGGGGCTTGGTGTTTTCCTCATATGGGAGA^ 

AAGGACTTCTTAATAAGGAAAGAAAAGCTGTAATGGAGAAGTCAGGCTT 

GGGCAC ATTGGTACCTGGCG CCATATTTGACACTCCCCTTTCTT CTCTCCACTATAGAAGGAAGGCAGACG CTTGAAAACCCATAA 

4 5 GGGAGGCCCAAGGGGACTGTAGATGGACTGGTTCTTACCATGAGATATGTCGCACT 

ACATAGGAGAGAAATTGCTAGGGACAGGGGGCTTTAGGGTATCCAAAATATTGATTACT 
GTACAGCTACAGTXTGGTAGGTGAGATCAAGGAAGGTACTCXK3CACAAAGGATTGACTC 
TGGGGGTTATGTGGTGAAAACTGAAGAGGGGGCAGGTTTTCrTAGCTAGGTATTCTCCACAC 
GGGATCAAACAGCTAGGCAACTAAAGAAAGAGAATGGTTTTAGCAGCCACTAGAGGT^ 

5 0 TTCTCTCATCITGTCrGTTTGGATGTTACTGGTGTTTGCCCGTGA 

GG GC CTG ATT CCATTGAGGG AAACTGATGTG ACT C AGG C AG ATGG CTGT CT C CACT TG AT A CTTC CCTTTC C CCAG ATG CAGTCAC 
TGGTCCTGAATAAGTGCTGCCCGACTCTCTCCCrGGAGGCTTGGGCTGAGAG 
TCrCACCTGGCCTGAGAGGACATCACCATGAAGGAAAAATTGGCTTTCAGTTAATAGAAAA 
TTCTCCCCATCCCTCrCCCACTCTCTCCCCTCCCACTCCCCCTCTCCCTCTCT 

5 5 CCTCTCTCITTCTCTTTCTCTGTGTATGTCTATAGTGAATAC^ 

CTGAAGAAAACAGAGGCAAAGAGTGGCAAACTTATAAATAAATGCACT^ 

AGTCATAGAATTTGCCTTTCCAGTCATCTTTAAGTGGAGCCCn'CACCACITAAC 

AAACTCGATTCATATTTAAAAGTCACTTCGTTTTTCTCGGTCCCTGGAAACCA 

TGTCIAGTAGAAGCATCTCACGTTTGCCATTCTGTCTGACACATGTCACCCGGTGGCATGTTTTCTTTC 

6 0 TC1GTCAGAATCTAGCTCCACTTTCGAATGAAAGATGTTC!ACGCGTATTTTG 

GCTATTATCTTTGGGCTCTC^TGAGCAATGCAGCTTTGAATACTGCTGTAAAAAAAATATATCGAT 
CrCCTCTCCATTTCrCTGCTCCACTAAGCCCCTGTGGGTATCrCTCT 
AAAAGGCTCTCCCAGGCCACCTCTTCAGCTGGTTCCTCCTC^ 
CATGTGAAAAGGGGGTCCCTCTCCTCTACCATTTGGG ACATGTCT 
65 GAACIT'CTTGTCATTACAACITTTACCCCCTCTGCTACTTCCA 

GAGGGGCTGTGGTCCCACATCTTCCITAAGCCCTTCTGGGCCTCTGTGATATTGGAGTCCTCCCrCGGATG 
CCCTGTCCCTTCCGTCTCTCAGCTCCCTCCTCCCGCTGCTTCTGTGTGCC 
CCnSTATGGCCATTTATCTTTTCCTCCTTGGACCCAGAGATTITGTTAAAACCTCCCT 
ACAGTACAGTTCAATCAATGAGCAAAAATTANNIWNNNNNNN^^ 

7 0 CATTGCACATGATTTAAAAATCTGTAACAAAATTTAGCTGAAGATTTATAACAAAGGG 

ATTTGCTCTAACTTACTCTGTCTGAGATTCAACTITTAGGATATATCTTAGGATATATTGCAT^ 
TTTAAAAAATAGATCTACrAGGAAAATAGATGTTTTTAAATAGCAACCATGTGGCT 

GAGGG AAG CTACC ATGTGCCTGG<^GG AATCTTTACTAAATTTAAGAGAGTGTGTGCAATG TAACTTT ACC ACCGCCACCATTAAG 
ATGACCGTGTGGTATAACCCAACTAAGGCAACACTCATAAAACT 

7 5 CAGACTAAGGAGGCACCGGCAGGTCtC 
AAGGTTATAAAGATTTTCTTAGAGATATGTGC 

GATAAGGAAAAGAGGTTTTGTTGAATAAACTATGCTTTATTCTAAGGCTATAAA 
GGTGTCTCTTGATCTACACCAGACTTCTAAAACATGTAGGTAAAGAACA^ 
GTCTACTCGX1AGTAATGGTAATTGTCTTAAAGGTCTATGTCTAGCCTCTGT 
5 AAACTGAATTATACCAGAAACTATGGTACTTTGTAATGGTGTGCTATGAAAGAGTCAAG 
AACTTTATTTTTGGTCTAAGAAAGCTCCCAGTTTCCCTGTGAACTG^ 
AATCITCAACTCAAATTTTAAGGCTCAGACTTAACAGGGATAAW 
AAAGCTTAAAGATAATTAAGATATGCAAGTCGGTGCTCACCTTATAAATGTTCC^ 
CCAAATGGCATTCATTTACAGTCACAAG L T n 'C l ' l ' T AGCTTCTTCT 
1 0 TGAAGTrTGTTTAAAATCAGCTAGACTTAATAGATAACAGTCTTCAAACTCTTC^ 
TAATGTAAAAAACTTCCCATGATTGATAGACAGAAAGGCCAGCGCTTAGTGTC^ 

TCAACTCT ACTTGGGTTGTGG TGACATCAGC CACTGCTTCATGT CT CACC CG CTG C CAGGGT CCTG C CCAACCTGTGGACCCTGTT 
GCGTTGACTGCTCTGTTTTGCCTCATCAATATAGGTCACTGTTTTCAAGTTCC^ 
TGGCAGCAGCAGGCCTGTGCTGTCCTGTGTGGCAGCTGTAGACCAACCCCTGCCACCCTCCT 
1 5 AAAAAAGCAAAACAAAAACCAAACAAACCAAAAAACCAA 

TCTCTGTCATTTTAATTGATACATAGATCATTTGGATTATACTTCCTGCTACA^ 
GGCTCACTAGCTACATCAGTTTAACAGCATCAAAGCAAGACAACT 

CATATTACCTCAGGTCTCACTITCCTAGAGGGTCTAGGTCTCCTGCTGAAAAAGACAGGC^ 
CCTTCCCACAGATAAAGGTAAACTCACAAAGCAGGTTTCAATGTAACtS 
2 0 CATGGCTGCTCCTGGTAATCAATCGCC<^TGCCTCATGGTAAATCAAAGTCTTATTGGTGTTACTC^ 
GGGCATGGGGTACTCAAAGGCAGGCACTGAAGTCCCATCCTAGCCTGGGAATG^ 

TCTAT ACTG CAGT AC CAT ACTGT ACC CT AT CCT AT AG CTGGTGT ACTC CAGG C AG AATT CCCTCAG CTGTCCI^ ACT 

AGGAAGGGGTCTTGTGAATCCGTTACATTTCCGGGACITCCnXSAGACCCGGGAGTTGTTT 

CCATAGAGCAGGATGTTTCACrTCAGATGTGCTCrTTATACACTAGGCCCAGCATCTGCATGCT 

2 5 CTCAGCCTTAGGACGGTCCTCTTTGTAATGGTTCAGTACTATACTATC 

TAAGTAAATCGAGACAATTGAATGTTT13TCATGAATCATCAATAGGCCCTGGAACCTCTT 
AGCTGAGATCGATCTCAAAGTTGTGATCTTA<TTGCCTC^ 
GACATCITACTGATTCTTACAAGAGCTATAGTGGATGAAGACGT 
TTTACTTCATTCTTTTTGGAAAATTGACTTC^ 

3 0 ATGGGAACGATGCATAAGACTGCMCTAATACAAGCTGGGCTACTAGAAG^ 

AAGGAC^TTAATTTATTTTTATTCATGAGCCTGCATGTATGTCTCT 

GCATO^TCTCCAGGAAGTGGAGTTATAGATGGTTGTGGGCTACMTGTGGG 

CAGCCG&TCCTCTGAATAGCTATTTCTTCAGCCCTGGGGAAAA 

CTACTGTGCAAATGAACATGACAGGGAAGAAATTAAGCTGAAGATGGCTCTGCAAACAA 
3 5 TAATTCCTTCAGCAGATGCTTGCGAGCAGCnX3TGTGTGAAACACTGTACT^ 

CTCCAAGATCTTATCGTTTAGTCATGGAGGAAGCAGGCGACZATAACTGTCTGCGGTGACAAAGAAAAGAA 

TCAGAGAAGCATAACAAGTATGAAACTACTCAAATAAAGAAAATATCTAATAAAATAAAATAAAAAT 

TACTCACGGGACGAGGGAGGAGGTCACACCATGGAGTGGGCTTTATTC^CAG 

ATTGTGGTTGGAAGAACAGAAGAGAAAAGGAAGACAGGGAAGGCTGT^ 
40 GTAGCTGAGGAGATGAGTCGGCAGGGGCTATGACAGTCTCTACCTGACAGC^ 

TGTGAGTGATGCCATGC CAG CACATATGTGTG AGTG ATGCCATGCCAGCACAT ACGTGTGAGTG ATGCCATG CC AGCACAT ATGTG 

AGTGATGCCATGCCAGCACATACGTGAGAGTGATGCCATGCTAGCAATACATGTGCCAGCA 

GCACGT ATG T ATGCC AG CACATATGTGTG AGTGATCCCATGCCAG CACAT ACTTGAGAATGATGG 

NNNNNNNNNNNNNNNNNNNNNW 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNN1W 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GGCAGTGAAAGAGGACTAGATCCTGGCCGTGTTAATAACAAGGCTCATCCTTTAGACT 
CTCCCCCTCTTCCTCCTTCCCATAATGTTGGGGATAAAA^^ 
5 5 CCTTAGCTTGACTGCTTTGTTTGTTTCTAAGGGGGGAGTAATACAAAGATGGAGAATAT 

GATTCATTATGAAGTGAAGTGATTCTGAGTOKSTTrATAATACTCCACGTGAAGAACAGGTT 

AGATTAAGGAGGGTGAAGAACGGCAGTGAAGGCCTTAGCAGGGTGGGACATG^ 

ACTTGGCCTTTCCTCCTAGCAGTTGCTCTGCCTCCTCCCGAAGACTACACCATGCT 

60 TGGTGGTCAGAGCACCCAGCTTGATGGGCreCAACGAGGGCATTTATCAGGTATTTGCTGAACTCT 
GAAGATCCAACTTAGAGACCITACCTGTTTCCGGGGCTTATT^ 
GAAACCAGTTATGCATTCTGGAGGGCAGTGAAGTGCITTTCACAGGGCAG 
GGGTCGAGTTTAACCGTATGGTTGATCTGTCTTTCTCATTACCT 

CCTGGAAAGAGACAGAGCAGACAGTGGCTAGGACAGCATCACAGTCACGTCTGTACATC 
65 CCCAGCTCACAAGGACCCCTAGAAGGCACAAGGATCACATGCTCTCTrG 

CAACATCTGTCTGTTGCCCAGTGGCAATACATCTCTTACTAAGGATGGTTCAAATGC^ 
TCCCTrCTTATCr i TTTVrCTGGATTATTT rT TTTTCm 

TCTTCTTTGAAATCTTCrrrGGGGACCTTCATTTC T TCTGAGCTTTGATGTAAGAAGGTAGAGCTCTC 
ATCCACCTATGGACAACACAGGGTGGCAGTCTTGGGAGATGCCTCTGCA^ 
70 CTGTTGTGTTGGATGCCCACATTGGGAATGCGATATAGCAGGAGAGGTCTCAG 
TAGTGCCTTTAGCACATCATGACTTGTCTTTGCITAGCTCITTCACT 
CTTTGCTCTTCCTCATCCTTGAACTTACTCTTCATCGTG 
AATGGGTTCCCTCGGTAAGGTGGGAATAGTAGATTGGTTCACTTTTTAGCCT 

ACCCATCCGTGGTAAACTT ATCACAGCACAGAGTTAATATGTTC ATG CGCCHTCCCCTC CCTGCTCCATCTGT^CTCCAAAAGGGA 
75 TTC ATCT ACTTC CTT ATTaATT ACT AATT AAT AG ATAT CG TT AT AC AC AC CCTG TT AGG G AAGACACTTG C CACTI AAGGACTTTGC 
ACAGTGATAAGCGTGCCTTCAAGTCCATGGTGTGGGTGTTGGTGTTGTTAGGTGGGAG 
TGGAATGOAGGTGGATTACAGTGGCCGGGGTGAGACTCTTAGTCCACATTGTGGAACT 
GGGGACATCTATCTGGCCATCCCTGTTCAGGGCAGGGGATAGACTTCTC 
CATTTGTCITCTCTCATTAGGTAGAGCTCACCCTGTCCTTTGATGTCT 
5 TG CAGGATT CGATTGAGATTTTGGTTGATGGGTGAGGACGGT AGACATG CTTACAGGCTCTACGTGGGCAACATTCAAATCTAGGT 

GTGCTTGACACCCAACCTGAGCTATTTCTACCGTGTTAGGTTTTAGTAAGGTA 
TGAAGCTTTACTTACTTATCGTGGGCACGCTCCACACTGCATGTGCAGAGX3TCA 
CCGCTATCTGGGTTTCAGGGACTGACCTGGGGTCGTCAGGCTTGGC^ 
TTGAGGTTITGAAAGGAGTTTGCTTCAGAGGAGAATTTGGC^ 
1 0 AATTGAGAGGCAGTGAGTATCGTTTAAAAGTATGCTAAGGCACTCCTCCTGCT 
ATTCTTGTACCTAGAAGAGGGAGCCATGTTACTTCTTTGCTGTTGG 

TCAAAG CT CTG AAAACT CAGTG CCT AAGTTTAT AGTT CACATTTGT ATCTTGG AAC CAAG CTTT ATCTGAATG T AAGGCACTGTGT 

TAGAGATGATGGTGATGACGATGATGACGACGATGATGATGATGACGATGATGATGATGATGAAATTAACA^ 

CCTTTATTGAGTGATTACTAGGACAACTGGATTTTGGCAAGAAGG 
1 5 TTTATTCTCACAGAGAGGGCAGTGCAGCAGCCTTrCCrACAGCAGGTGTT^ 

TCTGTGAGCAATAGGCATGAAACAAGAGCAGTGAACAAATTGAAGCTTT^ 

TATCCTGTTCCTTGGCTTTTCTCACCAATGTACTGAACATATTAGTGTCGAAGATTGCT 

TTAT1TCTTTCTAATGTGGGCAGGTGTTTGGCCTGCCTGCATGTCTCTGCACCACA 

GGTGTGTTGCATCCTCTGGCACTGGAATTACAAGTGGTGGTGAGTAGCCAAGTGGGT^ 
2 0 G AG CAG ACAGTG CTCTTAACAGCTGAGCCATCTCTCT AGCCCCTGTTAGTCTTGTTTT A 

AGGCCTCACGGATGCTAAGCAATTGTTCTACACTGAGCTATAGCTGGATCCCT 

AGAAGGACATGGATCCATGAACGTCTCAGATCATCTAGATTCTGCCT 

TCTGCCCATCCTCCTTGTGTAGCCCCACCCTTTTCTTTAAAACCACATCTCTGAGGACT 

ATT1TATATGTACAGGTGTTTTGTATGCATGTGTGTCTTTGTACAACGCGCATGCCT 

2 5 CTCCTAGAGCTGGGGTT ACAG ATGGCTCTG AGCTGCT ACCTGG ATGTCG AG AACAG^ CTGGG T CTT CTGGATGAG CAGTCATTG 

CTTTTGACTTCTGAGCCATCTGTCCAGGCCCGTTATGTTCCATCTT 
TGGTTTGGTTTTTGGTTTGTTGACTTAAAAGTGATATTCATCAGTATATTTT 
CCTATTTAGCTAGCTACAGGACCACAAGAAGGTCTACCAGCATGCTGCAGTATGCTC 
CATTACGCTCTGTGGCTGAACTGTGGTGCATAAATTA 

3 0 GCCTCCCTGAGAGCTGAAGAGCATCTTGCTATTTTTCGGGTGTGCATGAAGAC^ 

ACACCrrCATCTCAGAGAGGAOU^GGGATGTGTTGAGTGGTC 

TTTCGTTCTCCTGTTCTGTGAGACTCGGAGTGTTCAGAGGAGATGCTGATTCTGGG^ 

ACGTAGAGAGAAGACTCGAGAAGGCAGCATGAGCTGGAATCCATATGTTTGGAAATCTGGCTGTGGCAATTC 
CCAAACTGTTGTCATCTAGTGCTGGCTCTGGGGCCATGCACATGTAGTTCCCCCA^ 

3 5 AGAATTGTTCTAGAAAACGATTATTCTAAAAACTGATGGAATAAAATAAATATGTTATAATTTAGTT 

TTTGTCCAACCCCACACAGTri"rrAAAAAAATCri'l"riAGGTTT 
GCATAGTTTTTTTTTTTTAAAGAAAACAGGTTTCACACATGACT 
CATTTAAAAATTTTGATGCTTATTGCTAAGCTTCTATCAGGAAAATTTGAACCT 
GTATTCTAATATTTTTCTTTGTGTGTGTGTGTGTGTGTGTAGACTGGAG5GACAACTTTGAATGTCATT 

4 0 ATAATTACAAAGACTTGTTTATTTTTATTTTATGTCTATGAGTGTTTAACT 

CATGG AG ATCAGAGATGGCACCAGATG CCCTAGAATTAGAGTTAT AG ACAGTTGTG ACTTG CCATGTGTGTTCT AGGGAACAAATC 

TGAGTTCAGAGGTGTCCTTAATCTTGGAGTCATTAGTCCAGAAGAGAGGCTTCCCTGG 

GGTCTCTCCnTGGCTGGCTCTTTCATTAGGCTAGACTGGCTGGCCAGCAAGCCCT 

CAAAT AG TG CACACGTGCGCG CG CRGACACACACACACACAAT ACAC ACACTCAT AAAAAT AC AT C C CAAACAAAACAAAGTAAG C 

4 5 CAAAGCCAAAG C CAAAAC CAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAGG AAAG T AAT C CAAC AAACCTCTGGTTGT AGGG AAA 

TTGTAACTTGGAGAGCAAGTCCCTGGTCAGTGTCTCAACCTITCCCTCTTTTCTC 
TGCTATGGCrrTCCTTCCAGGTCTTGTCTTCCTGCCTGTTTTGCCACAAAAGCTCAAGTGGACAGGATG 
GCTC^CCTCTTGGCTCCCTGTGTTCTCTAGATGGACGATG CAG AGGG CACAGTGGGATTTCAGGCT^ 
CTGCTGGATTTCTTGTGATGGATTGCAAAAAAACAAATTGTTTTrATCrGCC 

5 0 TCTTTTAGGAAAAGTCCCAGACAGTTCGCTTATCTTCITGTT^ 

AATCT AAACAG AGTTAAATGTGTTGG ACCCAATGAGT ACTGTGG CCTT ATGCGTTC CTGTGTG AT CCAT AAGCTTGGGAAG ATGAC 
ACATTTCAGGTTCCAGACCCTGGGAGACTCTTGCCAGAGCGA 

AGGGTCCAAAGGCAGRTGGAGCGAAAACTTGTCCCTGTGGGGATTTCCTATTAGCAACGTTACCTCTGAGGCTGAAGA 
GTTACCCAGCGCCTGTGCTTATGGAACTTGGTGTCCATGCAGTGGGTGAAACATAGACTGAGACACAGTCAGAACCTT 
55 AAAGGTGGCAATGGTAGCGACCTCACAGGGTGACATCAGTGAGAAGCCATGATGAATTGAATGGCAGATATGCAGTGCCTGGC 
TGCTGTTGTGATCGTCCCCTACCCCAGGGTCACCTGAGGAATATAGAAAGCATGTTTT^ 

TAATATATAACATTATATAATATATAACATAATATATAATATATATACACACGTATACATATATACGCATATATATATGTATTTGT 

GTGTATATATATACATATACATAAATATACACACATATATACATATATACACATATATGTATGTGTGTGTGTATATATAC^ 

ATACATATATACACATACACACACACACACACACACA^ 

6 0 TTCAGTTCAGGCATAGCTAGCTTAAGCACTTGAGCCCTTTCAGGAGTCTTG^ 

TTCTCCCACCCCGTGCGGTGTGTGTTCAGGGAGGCCACAGAACGTAAGTGCTTGCATGGGTCCCATTCAT^ 
ATCCTTCTTTCCATCAGAACTGAATGATAAAAGCCATGTGTCCAGAAAGCTCTCCAGGC 
CATCCTGCTACCCATGTTTAGTTTTAGCGTCACTCCACGGCACTTGCATGOTGGTGAGACTTAC 
GTTTCCAGACGTGATATAGATCTCCTCCAAATGTATACAGAAACAAA 

6 5 T ATATT ATAG ATTG C CTTT ATGATAACTTITGGAATAAAACATTTACGTTT ACT TGTTTGTTTGCCTGGTG TGTGTGTG TGTG TGT 

GTGAGTGTGGCTGTGTCTGGTCCTTTG AGTAAGTATGGCCTCCACAGACT TGTGTGAATGCTTGGCC C ATGGGGAGTGGCAC 
TATTAGGAGGTGTGGATTTXSTTGGAGTATGTGTGGCCTTArTAGAGGAAGTGTGTCACTG 
ACTTTAAGCTCCGCCCAATGTGGAATTCAGTCn'CCrCCrGGCTTT 
TGTCTGCCTGAACACTGCCATGCTTCCTGCCTTGAGGATAATGAACTA 

7 0 GTTTTATAAGAGTTGCTGTGGTCAC^GTCTCTCTrCACAGCA^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

7 5 NNNNNNNNNNNNNNNNNNNNNNNACACAC^ 
CCCATTTGCCTGCACAGCAGGGAAACATCAATATAra 

AAGGGTACAGAATCCTGGAAGTTACTATCGCAAGTCTTCTGGCAGTACAGCTTC 

TTTGGCCCTAGTCAGTTCAGACGTCTAATTTTGCTGCAGAATAT^ 

TGTCTGTGTGTGTATGTGTGTGGGGGTCT^ 
5 ATTCCTAGCCCAAAGATCTTGAATGAAAGCTTGTTCCAGCXjCAGT 

GAACGGCCCCAClTrriVrTTCCTTAGACAAAATCT 

CCTCTTAATATCTGTTTCCAGGATAGCCACTGACCT^ 

AATCACACGGTCACCATGCTGACCATCACAAACTCTTGGTTATAGGC^^ 

TATATGGAATTTAAAACAGGCAATATCCTGATATCTTGTAAGTAATATAACATTATTT 
1 0 T ACTIGCCTTCTT CAGCAGAATTTTCT AT AGCACATAT AG G TAAT AGAGAAAAG AT AITTT ATTGCCT ATGAATTT CT(nTTGTTCA 

CCATGGTGCTTGATGTrCAAGACACACCCACTGGGGGTACTTTGCAAT^^ 

TGTGTATCTATATTATGTGTATATAT^^ 

NNNNNNNWINNNNNNNNNNNN^ 

NNNNNNNIWNNNNIWNNNNNNNNNNNN^ 
1 5 NNNNNNNNNNNNNNNNNAC^CACACACACACAC^ 

CCCAACATCAATAATAAAATAGAACAATTCTAACAATATAGCATAAGTTACTT^ 

ATTTAATATTTTCAGACAGTGTGTGACCTTCAGTAATTAAAACCACGGAAAC^ 

GTTATTTATTTAAAAAATATTGATTAACCTTTGGCTTTTGCCACAGGCT 

TTGGGGAGACTGAGAAATCTTGCTGAGGGAGTAATTTTGAAGCTAAATCAGA 
2 0 ACT AGACACTATGTTGCT CTAG CTGG CCTGG AACT AATTTTG T AG ACCAGG CTGACCTTGAACTCACAAAG ATC CACTTG CCTCTG 

TCTGAGGGGTTAGAGGCATGCCCCACCACACCTGGTCT^AGTTACTTTTTTGTCT 

AAGGGAAGAAAGATTTGTTTTGACAGACATTTGGATACAATGTCGTATGGATACACTGAATTT 

GGCCAAAGGGCATCGTGGTGGAAATGACTGTGGCAAC^TAGGAGTAAGAGGCTGCT 

TGAGGGGTCAGGCGGCAAGGCTGGCCTAGAAACCCCAAGGTCTGCAGATAGCTGACTTCCT 

2 5 TTCCAAAGAGCAATCTTCTAAGCCTCCCAAAACTGCCCAGGGAACTGGTAACTAACT 

CGGAGGGCGCTTTCTCAGACAGTAGCAGCAAGTGGAAGAGCACATACATGAGGCT 

GACACTCCCTGCCCCCGCCCAAGCAGTTTGGGGTGCAAAGACAGCACAGGGGC^ 

CAGCAGACGGACAGAATCTTAACTGGTGTrTTTAGAGAGAGTGTTCTGGCTGCTCTC 

ACT CGGGATGGCTG CTTGGTCCAGTCTTTGTAGAAGG AAG CAATTGTT CTTGAG CCACTGG CTTCACTTTCCATGGGCACAGAGT C 

3 0 TTAGCTGTTCCTCCCTGGACAATGGTrATATCTACTCTGCTTGGTTCTCT 

GGAACAAGTAGGACTGAAGGTTCATTCAGACTTCCTTCTCCAAATGA^ 
CAGGGTGGGCCATCGTGGAGCCAAGTACCCGAAAGGCTGTGGGAGTGAGGATT^ 
GGACAATTTTAGGGGCTGGTGGAAAATAGAGAGCTTTTGGGCAATTGGCTACTGGGG 
AACCTGTGATGCCTGCCCAGCAAAAGATGGCCGAGATGTTTTTCTT^ 

3 5 GTGAAATCAATGGTGTTTTAAAATGTTAAAAGCCGCCCTTTTTCTTTATTTTAGTAT 

GTCTACATTTGAGAGACTTCTCTAAGCAAGCAGGCCACCTCTTCAG 

GACTGATG AAGGGGGACTGCCCAT CTTT CCCAGGCAATGAAGCTCAGAGGCTCATGG ACCAG ATCGTTGTTGGGTCATCC CTTAA G 

CCAGGCAGAATAGATGGTAATGGTGTTCAGTGCCTGGAATAGAAAGAAATAGAAAGTTAACTGAGGCACCACAGAA^ 

AGAATTTGTTGTATTTGC^GAAATTTGTTTTCTTCCTTCCTTG 

4 0 TCC CCTTTTGAACT AATTTCAGACTTAGACTGG CGTTAT AAG AATACTACAAAG AATCCCTTTTTTCTCTCCAACTGCTCCCCCCA 

CATCACCACAGCATGACCACCAGGACCAGGAAACGCACTGGTGCACTGGTACCATTAA 

GTCTCAATAATGGCCTTTGTGTTGCGTCTGATTCAGGGCCAGGCACTGC^ 

GCATGAACTTGACATTTTTTTTTTTTTTGACAATTAGAGGCCATTATCCTATAGAAT 

4 5 CCTCGTGATGTTATTCAGGTCAGGTACTTTCAAATCACATCCTCACAAGTGTGA 

CGACTCTGATCTGACCCTTTCCCGATGATGTTCATTTCCTCGCCTGATTAA 
TTCCCCCCTTTGTTGTAAAAATATTTTGTGAGAGATACTTTTAGGCCT 
TTAGTATTTACCAATCTTTCTCCACCTGCTCCTCCTATGTTT^ 
AGAGGGCAACTTGCTGTGATCCGTTCTGTTTTTCTACCGTGCA 
50 CTCTACCTACTGAGCCATCTCACCAGCCCTAGTTATTGTTCCTTGAGACAGGATCTGGTGACA 
AATATGGAGCCCTTGTAGGCCTTAAACITGTGATGCTGCTCCTGCACCAGCCTTCCAGGAG 
GCATGCCCCACTTTTCATGGGTGCTCCTTACCTGAAGAATTATCGCTTTGCTAGGTACCAAA 
CTGTTTGTTGATTGACATTCTTCTTTATGGAAAAGCTTACTCTCTGTTTACTTATT 
TTTGGTGCTCATATTCTGTCTGATTTGGCTACCATG^ 

5 5 TCTCTTCTCTTCTCTTCTCTTTTCTTTTCTTTTCTTTTCTTT^ 

CTTTTCCITTTTTCCCCCAAATAAGTAAGCAAACACTTTGG 

CATTGTTCTAAGAATCTCTGGTTCTGGTTAGTG^TGTATTCAGAAACrAAGCTTTGAAC^ 

CTTGTTCCCAGGCTCTCTGAGAGGACAAGGATAGAGAACACATATACACATGGACATACATGT^ 

TACATACTTACACGCACATGCTTTTTCAC^TCTACATTTAC^ 

6 0 AGTAGACCATTGCTCTCTTCAGACACACCAGAAGAGGCCATCAGAT^ 

AATTACAGGACCTCTGGAAGAGCAGCTGTGCTCTTAACTGCTGAGCCATCTCTCC^ 

ACCATGAATCCAAACCAATTCCCTCTC CACTGCT AGCCCAGGGCCCTCCATG AAGCTAGCCTT CTGCCGGT ATAACTTCCTTCCCT 
GTATTCTGACTGTATTTATTCTGTACTTGCCTAGCTGTGTGCAT^ 

65 GT CCTCCCTGCACCTCCCCCCCACAATGGCCTTCCTCCTT^CATAT ACAAGG ATC CCG CTGCCTTCTCTCCCCACTG AAG CTCACT 

CTATGCCAGTCCCCATGCTCACCACGTTGACCTTCTCTCTTCATCCACGGCTATTCTACTT 
CCGTTGAAAGCCACAGAGATGGATCCTCTTAGTTCTGGAAGCCAGAAGCCTGAAAGCAAGGTGTTGGCCAAGT 
AGGCTGAGGGAGAAGACTGGTGGAANNNNNNNNNNNNNNNNNN^ 

7 0 GA^TTGATCATATGGGACTTAAAGAATTATAT 

ATTTTTTCAAGCTGATTACTCTGGGAAACTC^GGCACAGGAGTAGGTCTGGGAAGCT 
GTAAAGAGAGAGACAATGTATTAGAGAGAGTGTTAGCATCAGAGAGGGTTGCTATGACCACTCTTATTAC 

CC AAC AGTG CTTT AGG CCACAT ATG CACTTCACATG CATGCAACCAT AACCCCACAT AAGC CTCAAATGCTT CTG T AG CTGAAG AT 
GTTG CTT AT ACTC CAATC ACCTGC CT CCG TTTGGGTCTTGCATTCAGTGG GATG CCTGTGTGT ATGGTTT AG AT AGGGTTTCAGAT 
7 5 AGGACTTTCCTCTTGTTAATCTGCCTTATGCCATCTO 
CATTTCCTTCCCTACAGGAGGTAGGCCTACTTTAATTCA 

AACAAAGGAGCTGCATTOICAGTTTTTGGTAGATGAGAATGTCGTGAGGCTGGGGTG 



TCTTTCCAGTGGGTTCAGGAGGGAATCCTCTATAGATAGATTAGACCCATCGTCTC^TCT 
5 GAACAGTCTAGGAAACTCTGGATTCAGAGTGAGCACAGGCTAGCACCCATTTTC 
AGCCTTCACTTTGTTa^GGAAACATTTCCACCTTCTG 
TTTGAAGCCAAACAACTATGCCATTGTTCAAGATGCCAGCCAAGTGTGTC 
GGGAGGGGTCAGCCATGACTGTCACTAATX3CAGAAGGAGAAGA 

AGAGAAAGCCTGCTGTGTAACCTGTGGCACAGGGCATCCAGTAGTTTGTGCGTGAGTCTTAATT 
10 CTTCTTCGCCCTGATTTCTTCTCATTCCAATGTCTTAAACATTCC 

TTTACTGCTGTGAGCAGACGCCATGACCAAGGCAACTCTTATAAAGACAAC^ 

AGTTCATCATCATCAAGTCAGGAACATGGCAGTGTGCAGGCAGGCGTGGTGCA 

TAGCAGAATACTAGCTTCCAGGCAGCTAGGATGGGAGTCTTAAAGCCCCCACACACGGTGAC^ 

CCTCTAAAAGTGCCACTCCCTGGGCCAAACATATTCAAACCAGTATACATTCCCAGTATCTCA 
1 5 TTT AAACAT CTTC CAGG AGTCT AG AT A'rrrrrriTT CAATCAACTG AAATGCTTT AAAAGT AT AAAC AGTGTT C CTT AAGCATCTT 

TGGTTTTTCTTTTACAGAGTTGAGGGCTGGACCCAAGGCCTT 

CTTCCCTTAAATATCITTAACCATACATTITGTATCTGCAGAAAT^ 

ATTTAATCATCTCAAACTCCGAGGAAGTTTCATTTTTC^ 

TCTGCATTTGGTTCTGGAAGTAGTGCC CTGGG AAGTGGATGCAGTTGACAGTGGCCACCTAACAG CTTTTAAGTTGTGGAGAGAGT 
2 0 ACT AG CAG AG AGTTTT AT AATGGCAGG CTG ATTTCT C ATTT AT AAAATCC CAATT CTT AT CTCAT ATAAGGTGTGG CAATAAAG AG 
ATGCCCAGGCTACAGGTTCACCCTGCACACTTAAAAGACACATAGGAACCAAGGAGGTTT 
TACTTAGAATATCCTGGAGGCATAGTTGCTAAGACACCAAAAAACCATCTTACATCCT 
TAAGGAAAGACAATGCTGGGCCCTATGGAGATCATGACTCTTTAAGACATCCCATATTGGGGCAAGAGAAAT 
GAGTGCTTGTAGAACAATTTTGAGGACAAGAATTCCTATCCCAGCCCCTACATAACAAACCT 

2 5 AAGTGGGAAGGAGACAAGAGGCTCTTGCG ATCCTGCTGG CTTCTAG CCTAGCAGG AAGGTGTG AGCCCCGGGTCG AGGAGAGACTT 

TATTTTAAAATGAATAGGCAAAGTTGGAAGACACCCGACTATTTTTTCCTGGACT 

AT ACTTG GACACAGATGGGT AACHTAG AT AAT AAT AGTCTTTTT AAAAATCT CGGGTCAGAA 

GCTGGTCGTGTGCTGCATCTGCAAGGCTACTGGGCATGGTCGCACACTTTTAT 

TCAATTTAAAGTCCGTTTTGACTACATAATATGAAACTTTTCTTCA^ 

3 0 TGTTGAGTGATCCCCTGGAGACTGAGGCAGGAGGATCAGGTGTTTGGGACTAA 

TTCTAAGCAAAAACACAAGTTGTCAAATTAATCAGCTATGAACCAGGCAAGATGACT 
TTATCAAATTAGTAAGGCCCATGGTAACAGTTGGAACTCCACAGGCCTTAAATCTCCTr 
CTTGGCCCAGGGAGTGGCACCATTAGAAGATATGGCCCTATTAGAGTAGGTGTGACCTTGT^ 
TGGGCTTTAAGACGCTCATCCTTGCTGTCTGGAAGCCACTATTCTGCTAGTAGC 

3 5 CCTGTACCATGCCTGTCTGGATGCTGCCAAGTTCCCGCCTTGATGATAATGGACTGAA 

AATGTTGTCCTTATAAGAGTTCCCTTGGTCA^ 

CCAAGATATAAGATACAATTTGCTAAACGCATGAAACTCAAGAAGAACGAAGACCAAAGTGTGGACACTGTGCCCCT^ 

TGGGAACAAAACACCCATGGAAGGAATTACAGAGACAAAGTTTGGAGCTGTGACGAAAGG 

CAGGGATCCACCCCATAATCAGCTTCCAAACGCnSACACCATTGCATACACTAGCAAGATTTTGCT 

4 0 GTCTCAAGTGAGACTATGCCTAGCAAACACAGAGGTGGATGCTCACAGTCAGCTATTGG 

GCTAGAGAAAGTACCTAAGGAGCTAAAGGGGTCTCCAACCCAATAGGTGGAACAACATTATGAACTAAC 

GG AC TCT AG CTGCAT ATGT AT CAAAAG ATGGCCT AGT CGG CCATCACTGG AAAGAG AGG C CC ATTGG ACACACAAACTTT AT ATG C 

CCCAGTACAGGGGAACGCCAGGGCOUIAAAAATGGGAATGGGTGGGTAGGGAAGTGGGGGGG 

TCGCACTGGAAATGTAATCGAGGAAAATACGTAATAAAATATTAAAAAAAGGAAAATTAAAGATATATAA^ 

4 5 TCACCT AAAAAAAAAAAAAGAGTTG CCTTGGTCATGGTGT CTGTTCACAGCAGTAAAACCCrAACT AAG ACAGG AGGGG AAGTTCA 

AGTCTTCCTCAGAAAGCCTTAGGAAGTCTCTTGGTATTGAGAAATACAGGTCAA^ 

AGTTCTCTGG CTTGGGAATCTGG CACTCTTGTGAAGGAGTCGG AAG CAACATGGCCTCTT AGCTTCrTGTGCTTCTCCCAG CAACT 

GGCAGGAGGTAGAGATGGGCAGATAGGAAGGGCGTGTTAGTGGCTTCCAATACCTTCAGTAAGTGTTAATGTC 

GGTGGTGGGAGATTAGCTTTCTGTAGGACTTAGGCTCCTGAGATAGCTCTCCAGGTTAAT 

5 0 GCTTCTCCTCCTTTGCTTTCGGTTCCTTTC 

GGGTGGAAAGAGGGAAGGCGCCTGCTTCITAGCACACAGAGAGTGGGGAAATACTGTTTTCCTTGGAAAACT 
GTGGCTGGACAGGGTGTGTG^TTATTTATGTTAATCCCT^ 
AATGGCTTCCTCTTCAGTACCCTC^TCCCreGAGCrGACACTGAGTTTGTT 
55 CAGCACAGGCTCCCAGTGTGGAGAAGGGTCAGATGGTGACATCCAGGCCTT'GTGGATACTCCATG 
TTTCTTACCCAGAGGGGAAATGATTTGGATTTAAGATGGCTC 

ACCCTATTTAACATCAAGGGCCTGCTTCAGCTGTAGAGTAGGAATCACAGCAGCACTTTCAT^ 
CTCTTTATGCCTATAACTTGCACAATGCCATTAAGC^ 
GCTGGACTTGACAGCCTGTGCTGTTTGCAGGAGCTTGCAGGTCT 
60 TAGCAGTAGGTTGTGTTCTAAGAGGTGACTAAGCTTGCAGAATATGAGCrAGGCTGTTTTCAAACCAAGCT 
CGCTCAATGGCTAGTGGCTAGCCTCCTCTGGCACCTCCACAGGGGTGTCCAGGGAA 
TGACCCTGACTTGGAAACrGTCTTTTACTCTTAGGTGAGGTGTTAG^ 
AGATAAAGGCGATTGTTGCAAAAGTCTGCCGGGTAGAACTGATCTATACTTCTT 
TTTACACATATAGCAACCTCAACGCIAGGCTTCCGGCTCCAGAGAACCTTTGTCTTGATGGAT^ 

6 5 CACTGGAAGGCAGTGTGATTATATGTTAAATAATTGTTAGAGGTTGTGT^ 

AGGGTAGACATTGTAGAAGAGATAGCATTGGAAAATATGCAGAGGGTTGGTTGACTATTT 
AGCCTCATGCATCTAATTGACCCTCCAACATTGTTTGGACTGTCCCAAGGAGATCT 
TGTCCTCAGACCACATTGGGTTGCAGCAGCTGAACGCCAATGGCTCCCIACCrCATCG^ 
GCACAGATGTTTGAAGAGTTTGAAGGAGAGTTAGAAAGTTAGAAGGAAGAGCC 

7 0 CACGATCATCTGCTGCC^TTAAAGATGGGGTGAGTCCATGAAGCTCGAAGG 

AGGAGGCACTGGCIAAGGAAAGCAGCAGAGACCOTAGGATCGGTGGAGGCGGAGTCAC^ 
AAGAGCGAATGAGGCAGGTTGGTGAGATCC^TTCGTAGTGAGGAAATGGAGCTACT 
TTCCAGGGAGCGGAAGCAAGAGAGCCAGGTCGAGAC^CAGAAAAGCACCrrGGGGAA 
AACTAAAATGATTCTGGAGAATTTTGGAGACCTAGGAAGGACCTTTCTT^ 
75 ATATC?JGATGGAAftGCTCTGAAC^ 
GTCAGAATCGGCCACTGCAGGACTGATGAAGA
GGGCAGCCCGGTGGAAGCAGATGGCTAAGCGTGGCAGT^^ 
GGCCAATCAAACTTTAAGAGGTCTAGCGGTTTTCTGA 
GAAACCCAATGGGGCCTCTGAGCACAACACAGGGGCAAGAGCAl^ 
5 AACCATTCTTGTGAGGCTGTAATGAGAACCGCTTTCTGGGATC 
CTTTTGAGCATACATATGGAGACCCTAAGGCAGTTTTGCTGTC 
TAAGTCAAACTGCGTTAGGTACCACGGTAGTCTTGTGCC 
TCTCCCAAAAGGACCCAGAACTCACCTAATCAGAAAATTCTGTGTAG 

CGTGGGAAG CCTGCTTTGTGAT AAAGG CATTTTGGAATTT T AATTT CATGTGCATTTGTTTCGC ATT AGTCATGAACTT CATT CAC 
1 0 CAGTAGCCTCCTCCTGTCCCATGACTCATTCGGAJGATXr^ 

AGTGGCAGTGCTTCCCCCCACTTAGTGGCATATCGAAATTACTCT 

TTATTAGGCTGGGGTTAGGGAATTTTCCTTCAAAGTTCTCCAG^ 

CAAACCAGAAAAGGTGGCTGTGGCTTGTGCTGCCTCTGCCTCTGCCT 

CCTCTCTGCCTCTCTGCCTCTCTGCCTCTGCCTCTGCCTCTC 
1 5 TCTCTGCCTCTGCCTCTGCCTCTGCCTCTCroTGAGTGTGTCT 

GTGTATGTGCATGTGGAGACCAGAGTTTGCCCCTtKSCTTrCT^ 



TATAATGTCTGGCTTTTCACACAGATGCTGGGGACTGGGTCCT 
TGGTCTTTGTATTTATTTTCTGTGCATTTGTGTG 
2 0 AGGGGGACAT AG ATCTCCTGG AACT AGAGTT AT AGATGGCTGTT ACACACTG CTCGTTGCGGCT CTT CACTTCCAAG C C ATTT CTC 
CAGCCCCCCTGGAGTGTATTCTTTGATGGAACTGACATATATCTTTA^ 

AGTTTAACAAAACTTTG AAAGGAT CAG CACT CT ATTGGTTT AG AT AGCAAT AG CCATTTTTTTCT ATGT C CAT CCAAATGT CAAAC 

CATCTTrCTGTTTTCATGACTGGCAGTGTTGACTTACT^ 

AGAGACACAGGAAACAGCCAAATGTGAAATCTAACACCTCCTCCTCAAC^ 

2 5 CCCTGTCCATGCAAATGTCAGCAAGCTCTTGTACAT 

TTTGTCTTCTTTTATCCCTTATACTTGCCAATATCCTCTTTCCCTAC 
CACACACACACACACACACACACACACACACT^ 

GTATTAGTATCAACGATGCTACAGCACCATATATGATGAGGTCTGJU^TAGGAG 
TAGCACCTTCTTATATTTTCTATTCTATACTCCAGTCCATTAAAATTAAATTAAAC^ 

3 0 T TT TT T CT T CT rCTTCI rCTTI 1 GGTTTTCCAGGACAGGGTTTCrCTCTGTAGCCCTGACTC 

GGCTGGCCTCGAACTCAGAAATTTGCCTGTCTCTGCCTCCCGAGTGCTGGGATTAAAGGC 

GCGGTTTGAGAAACACTGGGTTTTGCTGTGGCCCTT^ 
TCAGGCTAAGTAAGTGGCTCCATTTCTTCCAGTACAAGATGGAGATGAGATO 

3 5 CGCTATGCACGGGAAGCTCTGAGCTCCTCGTAAGCTCCGGAATGAATGTGGT^ 

GC^CTCCCCGTCTCTGTGCTtCTGCATGGACGTCCTTACACACATCAGTCAGGGTGAG^ 

ATGTGGATTAGGGGGAGAATTATTGAGTGCTAACCAAGAATTACTACTGGAAGT^ 

TCCCCCTTTCTGTTTTCAGGGGGAGCATCGCCCCTCTTTCCTAGTCTT 

TTTATTTTTAACTTATTTATCTCATCTTTTGGGATGGGTTT I ITAGCTATTTTTACTCATCTCTAA 

4 0 AATGGAAATAACAATATCTGATGGCTAAGGCGGTTGTGTGTGCTACT 

AGTGGTGGGTGACTCCGGAGCATCATCTGTGATGCTGCCTGAGGCC^TTCCT 
CTCTTTTATCTCA.CCCTGCTTGTCTGTTGAACTCTCTTTGGCCAGGCA 
GCTTTCCTTGGAAATCCAATTAATAGTGATTTATGGATTCCTTATGAACTAGTTAACTTAGT 
CTGGTCTCCACTGCATCATTTTATTATGATAATTACAGAATTATTAGTTAGCGTTATATGATTTATC 

4 5 TTATGAGTTCCCTGAGGGTAGCAACCATAGCTTACATATCTCCAAGTACACCTACCCCC^ 

CTGTAATCCTGTGTGTTAAGAGCTATCTGTTGAGGGTAACTGGGGACCCAGACAGCATGTGCCTGAA 

CTGGGGACATGGCTGTGTGACATGGTAAGTGGCAAGAACACX3C 

ATGACCACTGTCAGCTTCTAGGACAGAGATTCAAACCCAGCTTTGTCCTG 

ACTTGTGCACCTGTCTAATATGACTCCTACTAAACTGTGTATAGAAAATTCTGGAAGGTTAATAGTTGAGG^ 

5 0 CTCTCTCTGTCTCTGTCrCTGTCTCTGCCTCTCTCAACTCTCCr 

TGTTTAGGCTATGGTGGGAATCTGTAAGTATTTCrCCAAGTAAAAAAATGACTTGATT^ 

CTG AACAT AAAAATC CACAAGGTGGTCITTTT CCTTT AATGC^ CT T T AAAATTATTTTTGACATCG TGTG CGTGCATG CG TG 
TTTGTGCACATATGCAGCCACAGTGCAGTGTGGGGGTCAGAGAACAAC^ 






CTGG CTT C C ATTGTTT CTGTGG AGT ACTG AATTTGAGCTT AAAGGTT TGGG AT A 

CATGTCTTTTGCCCTCTGTTTTTTTTCTTTCTCTTTCTCTTTTTT^ 

TGGGTATGGTTTTTATCTGTGTTTGGCCATTTGTGGTTTT 

TCAAGAATTTGCCTCACACTTTGTCTTGTCTTTGTAGACTTTCAAATTAGCC 

CTATGCTCGTCTTACACTGTCTCATCTTTGTGACTTTCAGCCTCACTCTGGAATTTCCTTTGACCT 

CCCCCACCCCCATCCAGGTATCAAAGCTATTTTTAGGCACACCCCATAAGTTCTTAA 

T AAAATTTTT AATGGTTT ATCTG ACAAAATTTTT C CAC CTTTG C CTTTTGTTTTC CTAT AG CTGGTTTT AG AACT ACC CAAGGG CT 

AGGGGGTGGTTCTGGCTTTCAGTCAGAATTTGGGAGGCAG 

TGAGACCCTGTCTAAAATAAATGCCTGAAGATGTTTGGTCrGTGTGCAT^ 

CTGTGGCCTGTGGGTCTGTTTCCACTGATGTTTGTCGCCATTGATCTTTA 

TCAGAAGAACTCCAGACAGAGTGX3TTAACCTTTGAGAGTC 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNI^^ 

NNNNNNNNNNNNNNNNNNNNI^^ 



NNNNNNNNNNNNNNNNCGCTCCCTCACTCCCTCGCT 

TCAC^GATCCAACTGTGCTCCTCATGTTTACAACTCAAGCACTCTACTG 

TTACTCTCATTTATTTTATTTTATTTTTCAGTTATC 

CCCTTGGCAGCTACAGGCATTGTCTTGCCCTCTGAGCTGGAGTTACAGGTGGTTt^ 

TACAAACAAGCCCTGGTCirCTGAAAAAGCAGCAAG^CTCTCAACTGCTCAGCT 

CAATGCTTC*GGAAAGTTCCAGGGCnTCCTT^ 
ATTCTGGTCAGCTTCTGAGCCCTGCAACCTCCTCCCAGAGGTTGGCACATGCACT^
AAGTCAAAATGTTTGAAATGGTATTTTAGGAGGATGCACGTCCTGGTGGTC 
GAACAGCAGAGAGGCCATGCTAACTGAGGGCGATCTGGAGTAACAAGTAGAGCCCCTGGCT^ 
ATCTCAGTTTCCauVC C T T TC TG CrrT T CTCT 
5 GGGCCGAGATATTTAAATAACAGAACTCAACAAAACTTCAGTACTAATAGT^ 

AATATGGTAAAATTGTATTGCTATCCCAATAACCAAATCTCAGTTATTTAGCCAGAGA^ 
AGGACCGTGGCGTTGTTAACTTCGTTGGGAACCACTT^ 

TCCGCAACACTGATT ACCGAACAGT AGTTT CAC CATCTCATTGGTTTTGCTTC CTGTGGTTTCTCTTGCC CTGGGTCAGCTGTGTT 

CTGAAAACATTAAGTGTAAAATTCCCTAAAATTCCATAAATAAACACGAAGTACT 
1 0 CCTAACTCACCTGGGCAGTAAGTCACCCACTTGTCCTGTGTGTCCATGTCC^ 

AGCTGTCACAGTCTTGGATGGTCCGAGCTCAAGTAACATTTATCTTAGTTAAGCA^ 

TACCAGTGAGGTGGCCAAAAAGAAATCACAAGGTTTCTTCATATTCAAAGAT^ 

GAGGACAACTCTATGGTTGTCCTGAGTCTGATCTTTCTACATTTT^ 

CACATTTTTTTGGTAGATAATTATAAAGGCAAGAGTAAGGTCAACACA 
1 5 AATAGAGACTCCTGTTAGAGTGTAGAGCTAACTGAATCCTCAGCAGAAAAATCAAGAGGGGCAAGTTGCT 

GCACATGAGAGCCTTAGTGTGTATGTTGTATGCAGCGGGAAAAATAGTGAAAATAA^ 

rrnTrrn L L x x TACATTTTTATTCACATGTCTTGGTTTATTTTGTGTC 

AG AAAGGTTTTTAG T CTT AGAGGTT CAGGAACATGAAAC CTCAT AT AGC AGACATG ACAGGG AC AGAG ACTTTTGGAATT AG ATTT 

2 0 CTGTCTGCATGCCATGCCTCGAGAGGCCAGAAGGAGGCATCAGCTTCCTTGGAG 
GGGTGCTAGGAACCAAGCTCGGGTCTTCTGCAAGAGTAACT 
TCATTCCGGAACAAATCCAGCCCCCTGGGACCTAT 
TTCTTGAAGTTCCACCAGGTCACCTTGTACCTCACTAGGTAAAAGGTA 

ACCAAACAATCAGCTACAGTATTGTGGCTATCGACGAGATAAACGTTGAGTCTGGGTTGCGTT 

2 5 ACAGACAGATGGAGCAGTAGGGCCTGGAAATTAGATGAATGGAAAAGATCTGTCAGCTGGAAAGACCAGACG 

AG AACAC CCAAG CAAT AAAT ATGT AAAAATT AAAGGGT AT CTGG ATCCTGG CTTCT CAGAGCAT ACAAG CTTAG TTTTCT CT CTGG 
CAATTGAGCAGTTGCACAATGGCCTTCCAGGACTTGTAGGGTC^ 

TTG CTGGTCrrTl'C I' T CTTCAAGG TCGTTGTGTTTTCTCCTTC CTGT ATC C CG T AACAAAGT ACCAG ACTGT 

GTGGCTTATCACCATGGTTCTCTATTTGCCACAGTTCT 

CCTTTGTGTAGCTGGCTTCTCTCCATATCTATATCCCTAGTTCCT 
TTCTGGTGAAATGCTGCTCAGGACATTGCGATCAGTAGATAAATAGATAGC 
GTTGACTAACACTGAATTTATGCTGAGCAAAGGAATATTTACCTTTAATCT 
GAGTTCAAGGCTGGTTTGGTGTACATAGCAAGTTCCAGGGAAGCCAGAGCTAC 

3 5 CAATAAAATGATCTTACAGCCAAACACATATGACACTGATATGACACATGCCTGAGTCT 

ATTTTCTGCATGAGGCGCATCACAGCCTCTGTGTGTATGTGTGTGTGTGATCTTGAA 

TG AAT AATTTGTAAAAGACAGAAATCCATTCTTCCGTAAGTCTC AC CTGGAAAGTCCAAGACCAAAGCTGAAG CATCT AGT ATAT A 

CTTGGTCTATCITTCACATGACAGAAGGCAGAAGGGCAAAGGTGA^ 

GAGAACCCACCITGAAACCACTTTTACTGAAATTTAAATGTGCTTATATC 

4 0 GACCTCAATACCCAACACTGATATATTAGGGTCTGGGTCTTACTATGAAAGT^^ 

GGGTGAGTGTGCATATGTGCATGTTTGAGTATGTTGGTGTGAATAACTGTAAGCATGGAAGTATGAATGTC 
TACATTAGAGTGTGGTATGTGTGTGTTTGAATTGTATGACTTT^ 
TGAGCGTGTGGGTGC^TTTTAGCTTATCTGAGTGTATAGGTGAGGGGCrrc 
CTATCAACTGCTTGATGTACTCACAGCTCGGTACCTTATACTl"r'r 

4 5 AAAAAACACAGGGATGTGAAAAATGTCATCCCTAAGTAGACCACAGA 

ATAGAGAATTGCTCTGTTTGACCTCAGCTGGGTGTGTTTGCCCCAGGCATCC^ 
TATGGAAACIAATGTAAGGATGTGTTTCGCA.GTTACAAATACRTCTTTAGTAAGCTATT 
ACACCAGCTGTATCTCTCCTTATGAACTGTTGCTTAAAGGAGGGAAAGCT^ 
TTAAATAGGTTATCAGTGGAAAAAGCATGGGGGGGGGAATATATACTGAGATGATCT 
50 CCCTCACCTGCCCCCTCACCTGCCCCCTCACATACCCCTCATCCrrGCCCTGCCCCTGAGTTTAG 
ACGTCACCTCAGAAGTAGCGTCCATTGCTTTCTCTTCATGACTCCTCAAACTGTCCCC^ 
TGGTCAGGTCTTCCTCTAAGGGAAGATTGGAGGAC^TTGTGGTCACTACT 
CTAGCTTCTCGCAGAGTTCCCAGCATGCTTTGGCAACTCCTTACAC^ 

AAGAGTCCTGTAAGAATCAAG ATGTTTT AT CCTGTC CAATTCTCAGT^CCTCCTTTTCAGTCCAAGATCTTTGAATGG AT AG ATG^ 

5 5 TGAATGACTAAGAGATTGAATGAATGTGAGAATGGAACAAATTTGAGA 

TTCCCATGTCCTTTGTAACGCCTCCAAGGCTTTTCCTACTGC 

CACCCTTCCATGGCGGCTTTGCCTTGCTAGCTGAGAACTTACC^ 

GGTTTTTTTTTITTTTTTTTACAAGATTAC^ 

GGAAAGTGAGGCTGCACACAGCTGAAAACITTATAGGCATAACAGGAGATGAGATGAAAGC 

6 0 CAATGTCTGCTTCTTGAGGAAGATATGTGAGTGGTGTAGGAGTTGGGAGAGGAAGGGTT^ 

CCITTGTGAGAAGTGGAGAGGCTAGGTCTTGGAGCCTGGTGGAAG^^ 
AGACACAAAGTGGACTTGAGCTTTGTGTGGGGGCAGTGTACGGAGAAAAGGGGGGATGGCT 
TTGATTCATGTC^TGGTGCTTGTTTTCrrGGCACC 
GCAAGATG<»AAGAGAGAATTATGTGGCATGAATCATCTCGA 

6 5 AAAAGAGTTAAGGTTGTAAGGTCACATTGCCATGGATGCCTGAGTCAACATCACCTTG 

GTACAAACATGAGTAGGAAGATGTCAGGCAGAGAGTGGGTAGCAGCTTT^ 

AAAAGAGCCGACTGTATAGTCTGGTAGGTAGGAGTGTCTGTGTGTGTGTGTGTGTGTGTGTCTGTC 

TGGTGGAGTGACAATATTCC AGCCATGCAAATGATTGTC!AAAGGAAAAATGAG CAAAGGTG AAAGG CATG CAAGTG AGGCCAAGGC 
TGGGAGGCTGGACAGGAAGGGAAATGCTGGATGCAGTGACTAAGAGATTGTACTAGG 

7 0 GAAGTCACTGCIGAGCTCTGAACATAGAGTACGTCTAGAAGGAATGACTAGACAGTi^ 

TAATTAGCATTTTAGTGCCAGTATTGAGAGGTCAAAGTATTGATACATCCGAGGTCCTCAAGGGATA 
TTGTCTCAGGACTGTTTACCCGGCATTTGATTTCCTTTTGGATGGT 

TTCTATAAG CACAGCACAG AG AGG AAGG ATAAAG^TTTATTCG AG AC AAAGTCACTGAG AGGCATCTCGG CCTACAGCTAGTTGGG 
TGCATGGCATACCA7GCCCA7AAAAAC77C7TTCAGC7G 
7 5 GGGAGAACAGGATTGTGGAAAGAGCAGATGGCAGGGGAACTCAATATACTCAACT 
CCGTGCCAGGGGGTAGAACTTTGTGGTGCAGATAAGACCGTCAGTCGCAGAAATACCAAACGCTGTC

TACOVTGCCGGTGGAACGGATGCGAATGCGCCCGTGGCTGGAGGAGCAGATAAATTC CAATACGATACCAGGG CTAAAGTGGCTGA 
ACAAGGTAGGTGaSCAGTGTOTGTGTGTGCATGTGTGTGTGOj 
ATGTGTGCATGTGTGTATATGTGTGCATATATGTGTATGTGTGAATGTGTGTGTA 
5 ATGTGTATGTGTGGATGTGTGTTCGGGGACATGAGAGACAGAGAGACAGAA 
AGGAGAGAAGAGAAAAGAGAAGAGAAGGTAGGTGTCTCTTGATC^ 

T CTT ATG CTCATTCCT AT AGGAAACAGC CAGTGATTAT AAAATGCAAATTCATCAT AT CAAAAAGGGTTTGGC CACATTT ACTG AA 
CGACAGTGGGGCACAGGGCAAGGGTtXSATGTTAAGTGATG^^ 
CCTGTAGGCACTGCCTGTGTTTTCTTATAGAC^ 
1 0 TCATGAAAGGCGCTAGGTATCGGT C ri > G , XT T r C 'I' l 1 I 1 1 CTGTATGTGGTCAGTTTCCCAGCACCAAGTTGTTGAAAGAAGATGTG 

TTCTCCATTCTGTATCCTGCCACAT^CTGAATCTATCATTTCTA 

ATCATCTTATCTGCAAATAGAGGTAGTGTGACTT C TTl'CTCT ATTTGTATCCCTTT AGTTTCCATCT CCTACCCTGTTGCTCTAGC 
TGGTGCTTTGAGCACAGTACTGAAAGTGAATAGGGATAGTGGACATCT 
TGTCTrGAlTr r C TGA CATACAGCCTTTACT^^ 
1 5 TCGACTTATGTAGTTTATTATATTTATTG 

TTGATGTGTGTCTGTGTTTGGTTTGCAAGTTTCCTTTT^ 

GGTTGGAGAATCCAGTCTTGTTCTGAATTTATCTGTATGGTTG 

CTCTACTGCATGGTTCATCATTTCTCTTGTAGCTCTGTC 

CCAAGG CACTG CAGCTG CT CCTTGGACCAATG AC CTTTCTLT'l'l^TrCTCACrTT CAAACAGG CTTGTGTT.TrTGTAAAT ATCAG AC 
2 0 TATCTCTGCAGAGGGTAAAAAGTCCAAACAGAAGGAACAAACTAGTTAGAACATC^ 

TTCACCATCCTCAAGACAGTCTCAGTAGTTCTCCCrCAGT^ 

CTTTCCATTGGGAGTGTGTACAGCCACTATGTGTGCIAGGACCTTGAAGGAT^ 

TGTAGACATGTTGCAACACGGTTCCTATCCCTrGGAACATAAGAATCTT^ 

2 5 GAATCTTGGATCrcC^GAGATTGGTTTAGGAG 

GTCAGAGTCCCTCACAACCAACTGTAACCCCAGCTTCTTGGGGACC 
CATNNNNNNNNNNNNNNNNNNNNATATGTATC 

GTAGTTGTTAATCATTTA^TTTCCCCAAATGCACATGTGTGTGCATTTCT 
GTCCTGACCCCCAACCATAAAATTATTTCATTGCTACTTCATAACT 

3 0 T ATG CAGGATATCTG ATATG TGATG CTCAG AAAGG TCTTGAT AT AC AGGTT AGG AACT ACTG<3TTT AG^G^ AACAG CT AAG T ATG C 

CTCTACTCTCTCTCTTCATTTTTTAATCTAAAGGACACG^ 

GCTTAAAACCAGAAAGCAGATTGTGGCCAGCTTTTAGTGAGGGAAAAAGATATTC 

TCTGG TTTCATTTT AT CTT CT CAGGTT C CT CTT AAACTTTTTTCTTTC CCTT ACCT ATTT ATTTTG 

3 5 G AAT ATCT CTGCTTGAGT CCTTT CTTCCAG AAAGG ATGGG ACAC CG AAAAATAG AAAT AAAT AAACT ACAT ACTTG CTTT ACATG T 

CCTGTCATG TACTGG AAGGTTTTGTGG CAAGCTTGG AAGCTG AGG CTCAGGT AT ATG CTC AGT CTCTG CT AAAAACACAAGCAAG A 
GCTCAGGTCACCCTTGATCGCTCAGTGAGCAACAATCAAATAGGGGAATAAG^ 

AGATATAGGGTAACAGGTAAAAGCAATGGCAGTTGGTATTTGTTAAGTAGTTTTTGCTACTTACTCATCT 
ACATACATCAAGTCACTTAATCCCCACTCTGGGAAAGGGCTGTTATTTTCCATGT^ 

4 0 AAGGAAGGCTCCAGTGTGAGATTGATTTTGAAGAAAAGCAAATGGTTTGTT^ 

CTCTCTTTTGCTCCTCCTGAACCATATCTGTATGTGGCGAACAC^ 
CTGTAGTTGGGTGGTTTTTACAGGAGGTTGACCAGTAGTCTCTAGCTCTAAAGACCT 
CCCTATATGGGGGTTTCAGATGCAGT^AAGGACrrCAAGATGGGAGCATCTGGATTAATC 
GTGTCTTTAGAAGAGACTGAAGACTGAAGAAGACACACACAGAGAAGCTT^ 

4 5 TCCTGGAGCTATCAGAGGCTGAGAGAATCTCTTCTAGAAGCTTTTAAAACACCGAAACCCTGCCA^ 

GACTTTCAGAACTGTGATAGGATGTATTTCACCTTTAAGCCACTTAGGGTATGGTCATCTTTTATA^ 

TCCCCTCTAGGTTCTTCTCATTCATGGGGGCTCCTTGTATGCTATCCACTACT 

TTGGGCACATCCTTTGCCACTGAGATACATTCCCAGCCTTG 

TATAACTAAATCCTTAGTGTGCTCAGGGGGGCArrCCCTGCTGGGTTTTTCTGCTCTGG 

5 0 AGAAGACTGGAGAAAGGGACAACCTGGTCATCTCTTGAACACTTCT 

TCGTTGGCTCTAACTTCCTCCACTTCTCAGGACAATGTTGACACAGGAATGC 
CAGGTAGCCCACAGGGCTAAGCTACAGTAGGTGTGTGTCTGTGCACATAAACAGTAGGGGCCTCAG 
TCCATACCTAAACTTCTACG<X3CATCTTTTTATAGGTTTCCT 
ACCAAGGGCATGTTGCCTTTTCTTTACTATTCACAAGTACTCTAAAGAACCC 

5 5 ACCTCATTTCTTGTTGAAGACTGTTGGGATTCTAGTAAAACAGA 

GGTTTCTCAGTTCACCTTGTGTTTTCATGGTTGCAGAATGTGTTAATCTGCT 
TGGCAGGAGCTTCTGTACAGCATGGAGCCCAAGTCCTGGTGAGGGCTGCTTTACAACATG^ 
AAGCTAAGAGAAAATTTCTGGGCAGGGTCCGAGTTGCTGCTAAAAAAAAAATAGCAGC^ 
ATCAGACTCAATCCACTTCTCTAAACAGCACCTACCCATTCATAAGGGCAGAGTCTGTGATCTAATTACTCT 

6 0 TCACTT AACCCATC CGTG CTGCAGACAAAGGAAGCCTTGCAAGACACACT CAGGCTGCATCCAG ACCCTAGCAGTGG ATGCTCAGG 

ATACACTTTTATCCACGCCTATGAGCTGTCAGTACAGTACAGTATAGCTGT^ 

TATCTGATCAGATAGGAGGCTTTTCATATASCGTGCTTATGAATCTGTGAATAAAGGCA^ 

TTGTTGTTGTTTTAGACTAAGATAACCAAGCTTG^ 

AATGGGCTATCTATGGAAACCCTGCCAGCAGCAGCAGAAGAAGAAGAGGAGG 

6 5 GAAGTCAGGCAGGTAGGCAAGTAGGCAGGCAGGAAGGAAGGAAGGAAGAAAAAAAAGA^ 

GGATAATGGCTTGACCACTATCATGTCTGTCTCCAAGGAAGAGTCCAGTCAGTAATTATTTAA 
ACTTCTAATAAAATTGAAGGAGACCTGTCATGTAGCAGGGCCCAGCCACAGTGGAAGAG 
GAGCAGCATCCGTACC^CTTGTTTGTGAAAGACACCGTGGAAGGGCCATCTACTCTTTACGACAGTATT^ 
GGCTGAGTCATACATTTAGAATGAGTAGAAGACCCAATCTACCCACATCGAGGTl^ 

7 0 TG G CT AG AAGGGGG TTTT ATCTTAAAG CCATTGGATG ATGTTG CATAT CTGT AAGGT TT AACGGT ATCT C CTT ATT AT CTC CT AT A 

TCTTTGTCATTTGG AC AAAGTCAATAATTTTTTTTTTTT^ 

ATACCTTCTAAAAATAATTTATTTTTGAGATTATAAATATTAACCTCACTTCCTCC 
CATCCCCTGCCCCCCACACTCTTGTAAATTCATGGCCTCTTTTTCTTCCATTGATATT^ 
CAGCCTGTATAATATTACTTCTATGTGTGTTCTCAGGTTGACCATT^ 
7 5 TTT ATAGACTGGTCGGACTAGTTTGCAAG CCTG CCZAGCAGTGAGTG AGAGTTCTX3ATCTTACCACATCCCCTCTCCAGC CTTGGCT 






WO 03/008583 



PCT/US01/51291 



CTCTATTGTTTATTAATCTTTGACATTCTGACrc 

GGATTGTGAACAATTTTTTTTTCTGATATTTCTTAGACT 

TTTTAAATG<K3TCTTTTGTTTCriTTGTTTGACT 

AGCTGGCAATGATTCTCACCCATTCTCTGAGCTTACXSCTAATCCTTGTTGACT 
5 GTTTCATGAAATCTCCCTTGTCAGTTGTTGACTTTAATGTGCAA^ 
TCCTGTAGGCACTGCCTGTGTTTTCrTATAGA 

TTCATGAAAGGCACTAGGTATCGGTCTTGTTTTCTTTCTTCTGTATGTGG 
GTTCTCCATTCTGTATCCTGCCACATGGCTGAATTTATCATTTCT 
TATCATGTTATTTGCAAATAGAGGTAGTCTGACTTCITTCTCT 
1 0 CTGGTGCTTTGAGCACATACTGAAAGTGAGTAGGGATAGTGGACATCTCT 
T r r ClT CATTTCGGATAATATTGTCTTGACriTCTGACATACAGCCTTTA 

NNNNNNNNNNNNNIWNNNNNNNNN^^ 
NNNNHNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNITONNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNHNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNKNNNNNNNN^ 

ATTGGTTTCCATGGTCACCAAACrcTGTGTGTGTGTG 
ACCATGCTCAAGACAGTCTCAGTAGTTCTCCCTCAGTTGGGCAGGTCT 

TCCATTGGGAGTGTGTAC^CCACTATGTGTGCAGGACCTTGAAGGATTGGGTCACTGGACTCTGAA 
AGACATGTTGCAACACGGTTCCTATCCCTTGGAACATAAGAATCTTTGACAA^ 

2 5 TCTTGGATCTGGAGAGATTGGCTTAGCAGTTAAGGGAATATACTGATCI^ 

AGAGTGCCTCACAACCAACTGTAACCCCAGCTTCTTG 

ATAAGTACAAAATAAAGTGCAGGCCTACAATGAAACTTATACTTTTCCATGAAAGTATTT 

CAGTGCTTTTTATGCAATATGAAAAACTGGTTAAAAGCTATGTGTGGTT^^ 

ATTTCAGACAC^TTAGGCACCTGCAGGGCATACCACAGCCACCTACCAACT 

3 0 CACCTCACCCCTTAAGCAGGGACTGCCl'T'l'nLl rCTTTCCTCAGTTCCTGGCAACTGCTGAGCCACCTTTAGCCTCTACCAAAGC 

TTCTACAGGAAGCTTTGGGCTTCCTGGGCATTCGGTATAAACGGGGCCATTGGATGTTTGT^ 

CTGGGTTATTTTTGTTAGAACCATCTTTAAAAAGTCCATCCATATGGTTGTATGTC 

AAATGGAATGATATCCAATGAGATGCAAGAAGTTTAG 

ATCAGCTGGTGGGCGTTTGGACTGCTTTCATCATCGGGCTGTATAAGTATGTGTATATCC^ 

3 5 ATAATTCTTTGAGTACACATCCAGACATTAAGCAAAACTGTGGCATTGCCAGC^ 

TTGTTTCCCACAGTGGCTTTGCTAGTCCACGTGTC^ 
NNNNNNNNNNNNNNNNNNN^^ 

ACCAACACATTTTCATGTATGTATACTTTTTTAAATTTTATTTTATT^ 

CCCGCAAACATGCAATG CTTG AAAAG ATGTTGCCTGGCX3AATTCAGGTTGTGTTCCGT CGGAATTTGTTTCATCT AGAAAACCTTC 

4 0 TGTACTTATTCTTTGATGGTTTCATACATGTCAACAGTATATTTTGA 

CCCACTTCATGTTCTTCATGTTCTTCATGATCTTCATGTTCACTCTTTTTGTT^ 
ATGAGTACACAGGTATGGGGCCGTCAGCTAGAGCATGAGTGATATCTTAGCAGCCATACCT 
GAG CAGCAGCCAACTGTCAAATTGCrCCTCAGCCCGGGTGAGACTTTGTGAGCCTGTCTC CC 
TGGTTTTGCATGGCAGACATAGCTGGGGCTAGTTCACAGGTCTAACAGTCATGACGTCTCTCGA 

4 5 GTCACAGTCTTTCTGCCCCTTTGTCCGTGCCACTGCCTGAGTTTTGAGG 

AATAGTCACTTATTCTTAGTGGCTCTGAATAGTTACATTGGATGCTGCCCACTGCACA 

C AGC CAGGGCTTG CAACTTTTG C CCAGGTTT ACAATAGCAG ACAC AAAT CTGCACCTGTGG AG C AGGT CT CAAATG CAAGCAGAGA 
GAAGCTGGTT ACAGACATTGCTGTCACACTGCACC AGTGG ACACGTTCC CTGG CTGGTCTGCATTATAGCATACAGAGTCCAGTGC 

5 0 TGGACCACTGACAGTTTAAGACACTGATGTGTTTCCTCCCTGGCATCTCTGTAGCACCTTG^ 

GG AAACCTTCCCATCAGGTCATGTCCTTTTCATTCACATTCATTTTC^ 
GGATTTTTCATGCACATTAAAAATTTAAAATTCCACTCAGCCCCCTTCATTGGC 
GAATATGAAAATAGCTGTAACrrTCAGTTCTTCTTTATTAACTGCCTGTTAT 
ATGGAAACTCTCCAGCATTTTGAAATATCGTCAGTTCAGTCTCTGTGAG7GGTTTTGTGTAGAGTT 

GG CAT CCCTGGT ATTT AGAAATT AT CTGGAGT CCCCAAAAACTTG AAGGAAGG AAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGA 

AGGAAGGAAGGAAAAGAAAAAAAAAACTVCCTCTCCCCCAACCTCCATCACATTCCTGGGCT^ 

CAGGACAGAGAATCCAAATCTAAAAAGATGAGAGACTGTGTGTGGGGTAATAGGGGGTCACATGA 

CTT CG ACC C C CAAGAATT CCTTGG T AAAATAATTG ATTC CTTTCTTCAGTT C CT CACTTG T AAG ATTGAAATCATAAGTTTAACTC 

6 0 CTTAGGCAATAACAAGAAACAAAAATGAGTCTCATGTGCACGTGCACAGTCA 

CATGGTCAGAGGGCAGGCATGGTCAGAGGGCAJGGCATGGT 
ATGGTCAGAGGGCAGGCATGATCAGTGTGCAGGCATGGTCAGAGGCCAGGCCT 
TGAGTAGGACCTTTTTCCTGGAACATTAGTTTCT 
AGAGTCTAGTATAAAGTTTATATTGAAJVTTATTGCCIAAGCT 

6 5 ACCTGAGAGGGATTAGAACAGTATATGCACGTGTGCAGACACA^ 

ACTGCAAAACTtjAAGTTACTTGTTTCCrGAAGC^ 
TAAAATGCTTACTAGAAGTTCO^CCASAGGCT^ 
GAGAAAATCTTCACCTlTGGCCrTATAATATACGTTTTTGTTTTGTTT^ 
CTGGTCAGATCATTAGAATTAGGGACCCAGTGGGGCAGTGGGAT^ 

7 0 GCTGCATATGTACAGCATGCTCACGGGGTTCCrAGTGTGCAT^ 

GACTAGGATTACAGGCTGTGCTGGATAGACTrATGTCA^ 
AAAATGCCTCTGGCrCTCTGGCrATAGACAAGGCTGTAGGAATTTTrCT 
GGGTGGTGTCATGGGTGGGCTTGTGGTCTTGC^CTCAATAAGAAATTAGGCTGAACAAA 
7 5 TGGCCTCTOTATCAGCTCTTGCCTACAGGTTCTGCCCTGTTTGAGTTCCT 
TGTGTAAGCCAAATAAACTTrr 1 ITT CCrACTTTGGTCGTGGTGTTTTATCACAGCA^

GCACCTAG C*l"riTT ATGTGGGTGTTT ACACTTGGGTGTT CATGTTTGCATGGTG AG CACTT AG CCAACTG AG CCAACTTCCCAGCC 

CAGGGGCTTGTCTGACAGTCX1A.CTTCCCTGCAGACA 

GATGCAjGCCCATGTGTCTTCAGTCAAAGCCCATGTCTTGCTTCAGTG 

5 gaagattttccagatcccctggatgcatgcggctcggcacggatggg^ 
tccatacaggtgttaaaccttgccttgtttattcacacagttnnn^ 
nnnnnnnnnnnnnnnnnnnnnnnnnn1w 
tac^aaaattctgcaaggagatatctttagatgtggacctt^ 
ttgtgtatagttggcatctc^ttctaccattrtattttccccctcaaa 
1 0 gtggcttccattgtctaggcttttaaaaataaaatgccagcaccttagaaatc 

TTAGAACAGAAAGGCTTTTTGCACCTCCCCTCCITCCCAGCA 
TCCTTATCTTTGGCTATTTTGCTAGGAAAGCATCAACCAGGAAT 
TGCCATGAATTCCCTGCCCGACATTGAGGAAGTGAAGGACAGAAGCATAAAGAAAG 
TGCCCTTATCCGAACGACCTTCCAAGAAAGGTAAAGATGTTTACCGACTAAATAAACGAACC 
1 5 TTGGTTTTCTCATTTCTGCCTCTGTCATCACTTTCCAGGGTAGAATACTT 
CTTGGAAAAATGACTTTCAAGGCACCAAAC^GCAGAAA^ 

AAAGAATGTCATGTAAGAACGTCCATGTTTCTCTGTCCTTCCGTGAATTGATCCAGAACT 
CCAAAGACAGAAAAAGAAGAGAGAGTTAAGCACATCAAGGTAATCTTTGGTTG 
GGTACCTGTTCTGTCTCATCAGGACCTTGGCGGAGGCTTTGGCCTTTCTCCTC 
2 0 TGCATGGACCCACCTGCAGACACAAGTCCACAGATACTTTGCCTAGTCCA^ 

TCCGCAGAGCTTGGCAGTCCCATGTCTCAGCTCCAGACACCAGCTTGTATCTAAGG 

AAAATCAGAGAAAAAAATTTTTATCTCTAATAGAGATGTATTAGCTC^ 

TCGTATACTCCATTAGCAAGAGTTTCAGTCGTTCATTCTCTC 

TGTGGG CTT AAGGTTTT CAGTTGACT AGGTTGGT CT AAC CACACCC CT AT ATTTCAATGAT CTTAG TGCATAGAAGGTGT AGTAG C 

2 5 CCTTGTTAAATACGGAGGAAGGTGTCAGATAATCCC 

AGCTAGAGACCCAAAGACCACCCAAGAGATXIATATAAGTGCCTGGCTTCTGTGTACAGAGTAAG 
GCCTTTCTGTAAGATCCCATGGCTCCTTAGCACTGAAATTGAAGTGGATGTAGGA^ 
CGAAGGGACATAATTTAGGTATCTCATTAGACTTCAGAAGGGACAGCATCCTGTGTC 
GATTCTCCGCCCCTGCCAG<3TACCTTCATGGAAAATGTAGTAGTGTCTGTGTAGAACCTATCCA 

3 0 ACCCCTTGGTTACTTTTTATACCTAATATGATATGAAAGTTGTTGTGTTTTATCAT^ 

GACTTTTTTTTTTTTAATGTGTCTAGGCTGCCTACATGTATGTCTG 

CCCTTGGAGCTGGAGTTAAGGACAGTTGTGAGCCTCCAGGTAGATGCTCAGA 

CTAGTAACTCTAAAGCCATTTCTCCAGCTCCTGTAGTTAGA 

3 5 ATCTCTGTGATGGAAAATAAATATTTTAGGTAGACTAGCAGTGACACTTTGTTTCA 

TTAAAAATAGCACCGGACTGGGGACTGGAGAGGCAGCTCAGTGGTTAAGACTGTGTCTAGGGGA 
TATGTCAGGTGGCTCCCAACrACCTTTAACTTCAGCTTCAAGAGATCCAAAGCC 
CTCTCrrCTCTTTCTCTCTCAATCTCTCTCTCTCTTTCCCCCCCCCATATCCCTCTCT 
TTCAATAAATAGCATAGGACTAATGATTAATTTTGTTGTAATTTTTATTGTGGA 

4 0 TACTGTAGATCACCCTCTAATTTTAATATCAATCAATT 

AATATCATCATAGATATTTGACCCATTTTGACTACAGAAACAATTTGTGATAGTTGTCAAAATA 

G ATG TTTCTTCTGT AAGCATTT CACT AT AT ATTTTT AAAAGT ACATG CC CTTT AAAAAT ACCATG CTG T CATT ATTTT AGCTGATG 
AATTTGACAGGAATCCCTCTCTATCGTTAAATATCTAGCCAATGCTCAG 

ATACATATATATCTGTAATATAGTTATACATATATACGTGTAATATATATTTTATATATGTCTATTTGTATCT 
4 5 AT AT ATT ACCTT AACTAAT AC CCACAC CATCAG CTGGG TG TGGT AG AG AGCACAT ATCTGT AAT C CCAGCAT C CAAGAGG CT AAGT 
TAGGGAGATCTTGAGGGTGATTGTCCTGCACCATATAATTTGAAGAAAACT 
CCCCATAGGAACTTTTTGCTCTATCTCAAGAATTTTrCT^ 

TTGGTGTTTGGCX5TAGGTCTTGTCCGTGTGAGGCAGTCAGATCTGGAACTGGAGTTG 
TGGGAATTGAGCCOVGGTCCTCTGGAACCCAGAATTTTCTTTCTTCCTTCC^ 
50 CTTTCTTTGTTTCTTTCTTTGTT^CTTTCTTTCTTTCTTT 

TTTITCTTTGAGACACAGTGTCACTGTGTAGCCTAGGCTAGCCCCAGACTTC 

GATCATGGACCAGAGCCACCACAGCCACCACGAGTGACCITGCTT^ 

AACAAAACACAGCATTATTTTCTCCCTATGATTATAAAAAAACT 

TGCATTTTG AGTGTCCCAAAT AT ATCTCTACATCTCAGT AAGAGATG TATAACTTTTTTTTCCCAC CACAAATAC CTAGTTTGAAG 
55 AATTCACTGTACTCTTAGCTGTGATTGTCTCATATTTAGTATGTTCA 

TGATAACTGAGTGGGAGC^TGAACATTTATTAAAAACGCAGATATATTTATTCTCTAT^ 

ATTTAATTTCATAATTGCJVGCGAGTAGAAAAACA^ 

TATTCCATTCAGTATCAAATACCAAAGGTTATATGGCTGGCC^ 

G CAGT CAAGGCTG AG ACATGCAG CATC C CGGGG CTGT AGG AATG T AAGGAAATGTGCTTCCT AGGGTGGGTGG CATGTGGGGGT C C 
6 0 TCGTAAGAAAAGGGTAGCTTTGGTTACATCAGCCTC 

TTCATCCAGGAAGTGAGCCGCCTTAAATGAAGCCACGTGGGTGGAGCTTCCTGCTCAGGAGCATCTGTTAT 

AAAGCT ATGATT AACACTG TGTTCCT CAAGGGACATTG CGG AAGGCTG ATG AATCAT ACTT AGTGCAG CCT AAC CTT 

TGGATGTAGCTATCITTTGCATTTGTCTTGC^TGCAGTGTTATAGTAGAACGTG 

GTTTATAGTTGCTAAAGGCATTGATCGATCTGCAGAATGATTCCAAAGTTAATTCC^ 

6 5 GATTGTCTTTAATAAAGGATTAATAGTCTACTTTTTTGTTAAAAGTCTAAGTA 

TGTAATCCTAGCAACTGGGAAGTGGAGGTAGAGGGTCAGGA^ 
GTGCTACAATAAATCCTGGTTCATAAAATAAAACAACCACCAGCCATACTGGGCC^ 
TTCAGAAAATAGTTTCTCCAAGGATCCGTGGAGGGAGGACGTTGTCAGAGT^ 
GTCAGTCTGGCCrTGAO^GGCTGGAGCAGGGGGATATTAAGTT^ 

7 0 AAGATGAAGTTGGTTXSTGCCAGCTCITCCCAAAGTCAAGGG^ 

CTTGGCAGAAATGAAGGTAT IU 'TTGAAGAGAAGTGAACAGACAATTCAAACA 
ATCTCTTGCAGGGAGAAGGAAAAGCCCCCAACCAATGGGGAAACTTGTCTCCCAGTGA 

G AC CACX5CTTTCATCTCG AAGC CTGTTCTGCAAGC CATGTTTGGTCG CCCAT C CTC AAT A (^C ACTT AAAAGAG CC AAATT AAT AG 
7 5 TAGAAAACTGAAAATTGAGATTTTTTTTCAATATGACAACATTGTG 
TT AAGAAC CTGAAAGGCCACCAT CTT CAGGGTCC CGAAAGG CAAT AAAAG TTT AAAT AG AAGACAG AGAACATGAACATCT AT CT A

CACAGTGCTX^TCAAGGTGTGGTCCTGCCCACCACTGACCCATCATC 

GAAGAGCTACACCTCTAAAATATCTTCATTTTTATCACTGTCTATCTGAGAGTA 

AAAATAAACTACCTCrGTGGCGCTACCTGTCTGACCTAAGAAAAT^ 
5 GTGAATGT ATGC G TTTCAGAAAAT ACCATGGAACAG ATTGC CCTGG T ACAACAGATTGCTCTG TT CCATAT AGG CAAG AC CAAGGA 

CGATAAGGCTAACAGTAGCCAGGCT^TAGGTGTTGACAA TITIT A 

GGGATGGTCAGJWGTTTTCCCCTTTTCCCTATCT 

GGAGTTTGAGAACrCTTCTGGGAGCTGCAGAACrrTGAGCCTGACCC 

CTTAC^AAGGGATGCTTCATGAGTGAAGGCCTAGAGGTCA^ 
1 0 AATOVGGGGCATTGTGTGGAGAACAATGGAAAATAAAGATGCCCAGCT 

GTGGGGGGGGGTTGGTTGCAGATGAAAACACAAGCATTTCCAAATCAAre 

GTAATGTCATTTGGTCGAAGTCACTGGTCTGTGTTCGTTCCGAGGG 

TCTGCAGACCTTTCTCACAAGACTGTTTCCTGCAA.CAGTACATTC 

TGCTTGGGAAAACTCTTCAGGTTTAAC4ACTATG^ 
1 5 TGGATAAAGCCAGCTAGGAAGATGAAGCAGGTTAGGAGCTATCCATCACTGGCCATTCCCACCCCCC 

GTC^CITCTCTGGACACAGAAAGGGATAGCTTTCATCTTAGAGGAAAGAA 

TG<3CTCTGGAACACCGATTTAGGTCACTCCAGATTCCATGTTCCA^ 

AGGCCTCATATCTGATGGCATTCTTACTTTGGGAGTTGTTAGA^ 

TTAGTCCTACACCA.TTTCAGTCTCCTTATCTTACTGAAGTTCTACT 
GTCTTGTTAACACTTCTTTAGGAAATCAAGTTGAGCrrACTTGGGGCAAA^ 
ATGGCAGACTGCACAGTAACAAATATAGAOXAAGAT<X3TGACCGTT 
TGAAGACAGGAGTAGTTAAAAGAGGAAGATACAGAGAAGGAATACTAGGT 

2 5 GCAGATGTCATGCTGAGTGTCTGAGGCCCAAAGGTGTGACCCTTTACTCC^ 

C CAAGACAGAAATT CTCCGAGATGT AG<^AAA.GAAAAGGG AGTG AGG AATT AG C CATGGGAAG CTGGTCAGTT AGAGATGAGT AAA 
GATGGCCTTGCTTCAGGGAAGGAAAGGT<X1AGGCAGGCTGGAGGCT 
GGTGACCTCTGCACCCTCTAGAGTTCTGTCTCTGCCTTTCTTCATGAAGCT 
CTTAGATCTGAGGTCCTCAGATTGACTTTGAGAGAGT 

3 0 GGCCCTCGGAGGATGGATCTAGACACAATGCTTTAGCACAGGGATGAGGC^ 

TCACTGGTTCCTCTGTTGCTTTCTCTCACAAGCCCAGCTAGATGGCAAGCCCATCT 
GACCCTGTGTCGGGTGTTAAGGATAGCCATCAGTCACCCTTAA 

TT AAG CT ACAC CAGGT CT AGATTTTGT AATTACAGAAG ATGCCTTTGAG CCT AAAT AT^ CAGTGCAGAG 
GGAATCTACCrGAGAGTGGGTTACTTGTTTGTGTTTGTTTGTTTGTTTCrT 

3 5 AAACCAATCTAGCCCCAGGTAAGCTGATTTAGAAGAGAACCTCTA 

GGTAAAGTGAGGGAGTATTGAGCCAGGCAGAGGTGGGTCCCCAGAGATAACCAC^ 
ATGTTCATAGAGAACCATCACCTTTTCCC^C^GTTTGGAACCCTTGGGTTTTTAAT 
ATTGCCrTATTTAGTATCTGTTTGTATGTCTGTGCATGTGCCCAAAC^ 
AACCCOSTGCCTCTTTTATTCnTATTTTTCAACCTCTTAGC 

4 0 TCTTGAGGAAGCTTCCACACCCTGGAAGGAGACTTGTCAGGGAGCTAGCACGGC^ 

TGGCTACTTCATAGCTTCAAATTTGCTACTTTTGTGAATCATAATG 
GAAGGGATCATTAACTTCCCCAAGCAGGTCATGACCCACAGGTTGAGAACG^ 
CCAGAGTGGTTGTAGGAGGAGTAAGACATTAGTTGAGTGTTCTACCTC^ 
45 ACTCCATCCATCCTGTTCTCC^TCTGGATGGGTGCAGAAGATGAGGCTCCTGCATCA^ 
AAATGAACTTAGTGAGTCCTAATTTTGAGAAGTGACAAAAGTGTCGTTTGTCA 

T AG AGGTCTCTGTTGCAAAGTCGGAAT AGTTCTATTTT CAG ATAAACGCAAAC CTGGCTGTTCTGTT ACTTTGATAAGAATTCTAG 

AAATAGAATCACGGGTAAGAAAGGAAACAAAAC^AGAACGCACAAAAGCAGTTCTTCCAGCC 

TTTTCTTGGGGAGCAATTGAGGAAAGGGCGTGCCATAATGGA^ 

5 0 GGGAGCCCCAAAGAAAAGATGGCGCCC^CGGTGAGGTGCGCATGTGCAGGGTCTGCAGGTGATTCAGGCTC 

AGTCCTCGCTGTCCTGTGGATGGAGCTTAGGGGAAGGGGCTTGAGAAAGGACCCTTG^ 
CTTCGATGGTAGCGAGTCTTCTTTAAGAATAAATGTCT 

CCt^TTACCATTCTCAGCCGCCATGTGGCTAAAATGACTCAAGAGGAATTAAAGGGTTAA^ 
CAGGCTTGGTCACCTGGTCTGATTCCAGAGTCATCT^ 

5 5 CCAAGG^TGTGCTTGAGXSAGGTGGAGGGAGCCGAGGCAGGCATTTGTCT 

CCCTCCCCCTCITCCTGAAAGAACAAGCCTTCTCTTGGAACTTA 

ATTTCTGTTCAGTGG CTCTGCCCTTG^GTGTTCTGGGTATTTGACCT ATGGT ATAGTGGGT AAACCCTC CAGTTAGGGCTTTTTGT 
TTGTTTGTTTGT TT TGTTTTGTTTTCCTTAAGCTGA 

C AG AGGATC GTG AGG ACT AT ACAG AG TTTGATCTCAATTC CCC AGTG TG ACC AATTGATAAGAATG C CCTGG ATG C CTT AG AATG C 

6 0 CCAGCACTCCTGACrTTACTGCTTTACCCACAACGCTCCTCTGCT 

CATTTTGGATTC CTCGGG AGTTGT C CG ACT ATGTGG CTTTTGT CTGG CTT ATTG CTGTCT CTG AAACAAATG T AGT CTT ACCGGAA 
GG AAATACCCTCATCAAAGTTCTGAGTCTCAATCATACTTTAGAACT 
GAC^TTCAAGTACACTTGAATGAAGAACAGATACAGAGTTCAAACA^ 
TATGTAGTAGCCTCTTGGGCTGGGCGGGAGTTITAGGCACAGGAAGGATGTTGATC 

6 5 GTGATAGGCCAGTGAGAGGAGACTTCAGCTAAGGTTGTGGCTCATGTTACCATCCAGC 

TAGCACATGATGAAGTGTGCTCAAGTT(TTCACTTCCATTTT 
TTAAAGAAGTGATTGCAAAAGTTAGAAGCATGCCTTCTCTGGAGGA^ 
AGGAGGCTCAGGACCCAGGCAAGGGACG(!ACAGAACAGTGCrACX^ 
TGCAGGTATTTCCCGAGGGCAAGCCAGATGTGCTGAGAACATTTCCTTC^ 

7 0 AATG TCAGG CTG ATTTAAAG AG ACAGGGGATGCATTTGGGACATGACACG AATG ACT 

GGAGAATTGAAGCTAAGAAAGGGGAGAGGAAGGAGGTGTTCTCAATTCTTTT^ 
CATCTTTGGGGCTTAGTAATGGAGTAAGTGGCTTTTCTCCTGAGTATC 
ACGGTGAACATCATAC<3TAAACTGTTTTGACATCTTCTTTTTA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAGTAATCTCAAAATAACA 
75 AT CC CAGTGCT AG CC CCT AGGTTT CAGAG CTT C CAC AATG AGAT ATG AAGGTTTGT AG AG G AGTG AG AA CCAAG AAJLAGAAGG CCA 
GGCATACAAGCTTCCTCTCTGGTGGCTAGCGGTTi^CAGCACTTCATCAATCA

AACTTGGG AGCCAAG«GAGAGGAAAGAC CATCT CTG TGGGTCCCCTT AG CATTG AC CTT ATG CACAT CTGC CCAGAGGTGGCACACA 

TTTCATAGGCCAAGAAACTGAGGCACAGAAACCCTCTGTCA 

GAGATGCAGTTCTGGGAATGAAGTAAACAGGGAGCGATCTGGGCTCTATTCCT 
5 ACAGGTAAGCAAAGGATCCTCATTTGTAGTCTGTGATGCAAAGGAAG 

GCATGGGTTGGGAGGGTGTCCTCAGAAGAGTGACCTTCCGCCATAGTC 

AGGATGCTGACAGAAA.CCCTACAAAAAAGCATCTGAAGGAACA 

TCATGAC^TCCCCTCAGAGACAGCGGTTCTTTGATGGCTGTTGTC^ 

GGTCCACCTGTGAGGG ATlTlUTT CCTrA 
10 AGTCCACCTTTAATCTGGGCCACACCTTCTGCTGGCAGCCTATAAA 

TTGCCCTCACCCTCATTGGCAATTCCATTCCITCACT^ 

GAGATATCCATGCTTATGGACTGAACAAATGCTAGATTCTTGGACTTTTC^ 

AGTCTATAAGCCACTCTAAAATCCTTTATATGTGTGTATATACAAATATATGTCTGCA 

TACATGTATATGTGTX3TATATATATATAATTTTTTTCTGTTGGTTCTGTTATTCT 
1 5 CAGAAATCGCTCTAGAACAACAGAAGTATAATATGCATTT^ 

ATTGAGGCCIXrrCCTGGGAATTCAAAGAATATTAAAAAGATGTGAACTG 

CTAATTCACCAGTTGTGAGAGGCAACAGATGTGGTGGCCCAGGATAGAAA^ 

GATTGTTGAGTACCCCTAGTCTCTCTAGATCATTTAACAGTGGAAAATAATGAGCT 

TCTGAATAGTGTGCAGAACAACAGTGTAAAGCTGACCAGCTC 

GAAAATTCAAGTCCCAGTCTTGGAGGATGTTGGAGTAAGGACATTAATT 
TGTGGGGAGACTCAATTGAAGTTGAGAACTTTGAGCCCTCAGGTTCrrTC 
CTCAGCAGATAGACTCACACTACCCCCTCTGAAATTTTGCCCTIT^ 
ACCAGCAGTAACTTTCTCTGAAAGAGATGCCAGCCATGACAATACTGTTGTCCCT 

2 5 TAATCAGACTTAAGGCTAAGCAGGCTCCTAGAAGGGAGCCGGAATGTGTGGATCACAAGGAAGTGC^ 

GTTTAACAAGTTTGCTGATCGTTTATGCAGAACTCTAGGAAATATGTGTG^G 

AAGACTGAGTTT ATTGAT ACGGT CC CTCTGAAGGG AGGTC CT AGGTTT AAT ACGGAAGTTTGCTCAGTT AAAAG G ACTGTG AAAAT 

TTTGTTTGAATGGTTGGTGAAGCACTTGTCAAAAGATGGCCTATCAAGGAG 

TAATGAAGGGCTTTTACGACTCAGGGAAATTGCAGTGCTAGAGTGGAC^^ 

3 0 CTTCACTAGTCTTATGTAAGACAATTAAATTGGTGAGAG 

ATCTTAGGGTTGGAAAC1ACTACTGCT<^GTTGGATGAATTAAATGCAGTA 
GAGGTGAGGTGATTGTAGTTATCACCAGCATAGACCAAGCAATATTCATAGTGTCCTC 
AATTTTATAATTAGATGACTCATGTGGACrTTAGGTGCTGGCTAATCAA 
CTGCATTTGTAGTTACTATGTGTCAGTGGAAAAATTCTCAAACAAATGAAGGAAAGG 

3 5 CAGCCTATAAACCAGTTTTCAGGCTTCAGCCCGTTTGCAGACCCAGAAC 

GATAAAAGGAGTTAGTCTTTCTCCACTCCITCCCCCCAGAAGGATATATGGCCT^ 
AACAGTCAGAATTTCCAGGGTCTATTGGATACTGGTTCTGAATTGAC^ 

TGTTAAAGAAGGGGCTTATGGAAGTCAGGTGATTAACAGAGTTrTGGATGAAGTACAACTAAC^ 
ATCCTGTAGTTCATTTCCCAGATCCAGAATGTGTAATTGGGATAGACATCTAAATAGACGTTGGTATGATCCT 

4 0 GTGGAGTGAGGGATATTATGGTTGAAAAGGCTAAATGGAAGCCATTG^ 

ATTTCCCATTCCAGAAGGAATTGAAGAAATGAGTGCCACCCTC 

TCCCTTTAACTCTCCTATATGGCTAGTGCAAAAG AC ACTGT ATCATGGAG CATGACTATTGAAAACTTAATCAAGTAGTGACT CCC 
AATTTGTAGCATCTOTACTAGATGCGGTGTCCTTCCTTGA 

GCAAATGCCTTTTTCTTGGTACCTGTCCGTGACX5ACCATCAGAAGCAGTTTGCCTT 

4 5 GGTTTTACTCAAGGACATAAGAACTCTACAGCCCTGTGTCACAACTT^ 

ATATC^aVTTGGTGCACTATATTGATGATGATATGCTGACTC^ACT 
ACACATTCTCATCAAAGGATGAGAAATAAATCCAACCAAAACTTCAA 

T AGGTGAGG AATAAGTT ATTG CACTCGGTCCCTGTT ACCAACAAGAAAGAAGGG CTGAATGTTATGACAGTTAGTGGGTCTGTTTG 
GATTCTTAAGACAGCCTATTCCTCATGTGGGTATGTTATTCCAGCCTGTAT^ 

5 0 GGTCTGAAACAGGAGGGGGCTCTTCAACAGGTCCAGGCTGCTTTACCCCC^ 

AGGTGTCAATACTGTTTGGAG CCTTTGGCAGG CCCCTAGAGGTG AATC^CAGAAG AG ACCTTTGGGATTTTGG AGCAAGGCTCTAC 
CATCATCGGCAGACAACTCTTCTTTCTCTGAGAGACAGTTCTTTGCCTGCT 
GGCCCCAAAGTTACCATGCAGTCTGAACITCCCGTCATGAGCTGAGTGTC 
GXIAGAAATCTGTCGTCAAATGGAAGTGGTATATATGTGACTAGACCTAGGAAGTC 
55 TGCTGAAACACCTATGGTTTCTACTCCTGTTAGAGTGT 

ATCATCCGAGTGAAGGAAAGAAGACTAGGGCCTGATTCTACACGCTATGCAGACGTTGC^ 
ACTCCCTTTCTGAGACAACCCTGAATGACATCAGT^ 

CAGTTTGCTTTGGAAGGAGAAATGACCAGATGTGAATTGTCACTGATTCATGGGCTG 
CTTGGAAAAGGCAAGGTTGGAAAATTGGTGAGAAAGACATTTGGGGAAGATGTGGCTAGATCT 

6 0 AGATACTTGTGCCCCATGTAAATGATCTTGAAAGGTGACITTC^CAGAGGAAGCTTT 

CTGTGGACAGTCAGCCTCTTTCCCTAGCTATCCCTATCATTGCCCAGAGAACCCATG 
GGTTATACACGGGCnTGACAAGGACITCTACTCTCTAAGTTTCACCT 

ACCAACACAGAGTCCCAG AT ACTACACCATCCCCTGGAGTGAC CAG CTAATGACCTGGTGGCAGGTTGACT ACATTGGGCCCCTTC 
CTCrGTGGAAAAGACAATGCTCrGCCCTTACrAGAGTAGCTACTTATTATGGTAAT 

6 5 GTCAAAACTACCATCCATGX5ATTTACAGAATACATCACTTTTCTACCIACCA 

CTAGCCTGATAGTTAAAATGGCCTTTTTAAAAACACAGTTGTAGCATTAATTAGGTGGCAGAAGTCTGA^ 

CCCAAAGG TG CT ATATG CTTTG AATT AATGT CTAAG AT ATGGT ACAATTTT CC CAT AG C C AGG ATT CAG AAAT CAGAGGG T AG ACA 
AGGTAGTTCCACTCACTGTCAACCCTAGTGACCCATAAGGAAAATCTTTACTTTCT 

7 0 GAAATTTGTGTTGCAGATGGGAGAGAGCTCCrcCCAGGAGCCACAACAAACATT^ 

CTACTTAGGGCTTCTGATGCCCTTAAACCAAGX3CTAAGAAAGG 
GAAATTGGATTGCTCCTCCACAATGGAGGCAAGAAGGATTATGTCTGG 
ACATCCTGTTATTAAAGTCAATAGGAAGCTACAACAACTTAATCCAGGCAGGATGAC^ 
TATGGATCACTCCTCCAGGAAAAAAGCCAAGACTTGGTGCAGGTCTTGCCA^ 
CCTACTTATAAATACCAGCCAACGTCATGTGCCCAGTTGCAGACAT^^
T AAGAAGG CATTTGTGC AT ATATCTGAGTTTT CTGT CCTTGATTT CTTT ATC ATGT AATTT AAC ACCAATT AAG AGCAT AT CAGTG
GTCATAGTATTTAAGTTCTGAGATCCCAAAAGAATGTTCGTTAAGAGACA^ 
TATGTAAATACTCACTTTTCTAACAATACrTAACATTTTATT^ 
AATTTTTCCTTAGTCGGAATTATTTTACTATATTTTATTATTAAAA 
TTCCCAAGTCTTTTTACATCCCAAGTTTGTGGTTATATAAGGGATAATTATA^
TTGTCATTATATGTATGATAGTCATATCATGTTATGTATAATTATG^^ 

AGAT ACGATTT CATGT CAAGTT AACTAGGGGTG AACTTTTGGTGGTT ATTC ATGG CTTT CAACTTG ACT AT ATCTGT AATG AACCA 
AAACCCAAGTGTCTGGGTACACTGTGAGGGTTTTTTTCTTAATCAA^ 
TGATATGGAATGATGCACCrAATCTAGCTCTTTTGAGGTGGGTATGT^ 
TATCAAGGATTGGAAAAAGGA

MOUSE SEQUENCE - mRNA 

TCTCAGGCAAGCCGGGGACrAACrTTTAGTTTTGCTCCTGCT 

CAACACTCACACCITGCGGGATTGTATTGGTAGCGTGGAAAAAAAAAAAGCACAT^^ 
GAATGCGCCCGTGGCTGGAGGAGCAGATAAATTCCAATAC^

ATCCCCTGGATGCATGCGGCTCGGCACGGATGGGACGTGGAAAAGGATGCT 

GCAT CAAC CAGG AAT AG AT AAACCAG AT CCAAAAACATGG AAAG CAAATTTTCG ATGTG CC ATG AATT CCCTGC CCG ACATTG AGG 

AAGTGAAGGACAGAAGCATAAAGAAAGGAAACAACXSCCITCAGAGTCTACCG^ 

GGAAAGAAACCAAAGACAGAAAAAGAAGAGAGAGTTAAGCACATCAAGCAAGAAC 

GACA^TCCCATCTCGACAGCRACATTGAAGATCAAGA 

CGACAGTGTGGCCAGTGATGAAGAGAACX3CAGAGGGGAGACCACACTGGAGGAAG 
ACATGGGGACACGGAACACCTATCTGCTGCCCAGCATGGCGACCTTTGTCAC 

GAC^ATAGCTGTCCGATGCCITACAACAGCTCCTGGCCCCCATTTACAGACCT^

CAGCAGTCGGCCAGACCGGGAGLACCCJGGGCCAGTGTCATCAAGAAGA 
CCTTTGACTCrCCCTGGTGGTTGTTGGGATTTCTTAGCnTGTGTT^ 
CCTATCTTAGACACATCTAAGGGAGAAAGCCTTGACGATAGATT 
ACTCAGGACTCCAGCCCCCCCCCCCCCTCGGTAGATGCGTATCTCTAGAACCTGCT^ 

AGGACCAACAGCCACACGGGCAGTGGAGGTGCTGCGTTGCCTACGGTCAAGGCC

AGAAAATGTGAACrAGCTGGAATTTTTTTATTCriTGTGAATATGTACATAGG 

CACTACCCACCCCTCCCTCCrCAACCCCTCCATCCaXSTCCTCTATGCGCCCCTTCCCCCCAA 
CTATCCTTTCCCTCCCAACCCCTTCTATCCC^CCCACCACCT^^ 

GCCTCTCTCCrC^GCCCAGCCTACCCCATCCCACCCTGTTCCTTTCCT

GAGGAGGAGGAGGAGAAGAAAATGAGTTTCTTCACCGCTGTCCCATTTTAAGACT 
TGCTATGCTTGAATGGCACGCGGTACAAAGGAAAACTGTCATGGAAATA 
TAATTCTAACCAGAGCAAGCl w rri'l , TAlTTTl"lTATACAAGGGGAATATTTTATTCAAGG 
TGTTTTTTATCTTTTCTAC^GCAAATTTATAATTTTAAGATTCCTTTTCCTGTTC^ 

CATTTTTTTTTTAATTTTGTAAAGGTGAAAAAAAAACTTTTATGAGCTCATGT

TGAATATGGTATATAGTTAAAGATTTT 



MOUSE SEQUENCE - CODING 

ATGCCGGTGGAACGGATGCGAATGCX3CCCGTGGCTGGAGGAGCAGATAAATTCCAATACGATACCAGGGCT 
GGAGAAGAAGATTTTCCAGATCCCCTGGATGCATGCGG

GGGCGATCCATACAGGAAAGCATCAACCAGGAATAGATAAACCAGATCCAAAAACATGGA^ 

TCCCTGCCCGACATTGAGGAAGTGAAGGACAGAAGCATAAAGAAAGGAAACAACGCC^ 

CGAACGACCTTCCAAGAAAGGAAAGAAACCAAAGACAGAAAAAGAAGAGAGAGTTAAGCACAT 

CTTTGGGGCTTAGTAATGGAGTAAGTGGCTTTTCTCCTGAGTATGCGGTCCTGACT 
GTGAACATCATAGTTGTAGGACAGTCCCATCTGGACAGCAACATTGAAGAT^

GGTTGTAGAAGTG AC C ACTGAGAGTGATGACCAGCCAGTCAG CATG AGTG AGCT CT ACCCTCTACAGATTTCTTCCTGTGTCrTC CT 

ACGCJUSAAAGCGAAACTACCGACAGTGTGGCCAGTGATGAAGAGAACGCAGAGGGGAGACCACACTGGAGGAAGAGGAGC^ 

GGCAAGCAGTACCTCAGCAACATGGGGACACGGAAGACCTATCTGCTGCC 

TCTGCAGGTCACCATCAAAGAGGATAGCTGTCCGATGCCTTACAACAGCTCCTC 
CAGTGACCCCCACGCCCAGCAGCAGTCGGCGAGACCGGGAGACCC^

CGTGTCAAGAG CTGTTAA 

HUMAN SEQUENCE - GENOMIC 
AGTTGAATATACTTCTr CT TCriTTTl T rGAGACAA 
CCCAATAGTTCCATAGACAGTTTTTTTCrrGGAAAACCGTATCAATT

TATCTGAGCTTCITCCTCAGGAAAGGACCTCCAGGTCTGGCAAAAAGTACCTAAGAACT 

CAATGAGATGCCAGGCCCTCATTCATCATGATTGCTTCCTTGCCCCTCCCTAGTTCCT 

CCTGCTACATAAACCCCTAATTTTAGTGGCCAGGGAGATGGATTTAAGACT 

AGCCTTCTTCCTTGG C AATAATTGTTGCTTAGTG ATTGGTGATTGGCTTTCTGTG AGGTG AGCAG CGGG ACCTAG ACTGAACCCCT 

GGCATTTTGGGGACATGGTCRCGGCTCACrcCAGCTTTGAACTCCTGGG

GAGACCACAGGCTCGAGCCACCACGCCCAGCTGATTTTTGTAATTTTTTTGTAG 

CTCXSAACTCCTGGTCrCAAGCAATCrTCTTGCClTGGCCTCCCAAGTGCT 

TGGCCTAGCTTGTTTlTT n 'l T n'(HTTTTAAAGT^ 

TTTATTTGTGGAAATGGAAATTGGAGAGAAAGTTAAAGGGGAATAAATATTTTGACTTAGGAATT 

TGAATTGTTTTAGAGTGTTACAAATGTGTCATGCAATAGACTGAGAGGG

(XACTTGGCAGGTGCAGTGGCTCACACCTGTAATCCCAACATCITGGGAGGCTG 
AAACGAGCCTGGACAACATAGTGAGACTCTGTCCCTACAAAAAATAAGAAAATTATC 

AT CrACTCAGGTGGTTGAGGCAGGAGGATTGCrrGAACCCAGGAGTT CTGCAGTG AGCTGTGATTGTGCCATTGCACTCCA 

GCCTGGGTGACAGGGAAAGACCCTGACTCrAAAAAACTTTTAAAAAAATCAA 
CAGAGACAACTGGGAAATGTATa_GTGTC B -GTTTTTAAAACCACAAGT
TTAAAATAAAATGCAAACCCCTTATTCTGGTGTCTCCCACCTGTTCACCACATCCCTGTCCT

CAGTCATTTCCTGTCTCTTCCCAGAATATTGCCAGACCCTCTTGCTTCCCAGAC 

AGTTGGTACCTOTGGAGGCTCGAATTG 

TCTTATTTGGACATAGGTTAGTTGGAGATATAATTAGTTAAAATGAGGTCCT 
CAATATAAGTGTCCTTATCAAAAAAGGAAATGTGGGCCGGGCGCGGTGGCTCACG^
TGGAGGGATGATGAGGTCAGGGGATCGAGACCATC^ 

GGACGTGGTGGCGGGCTCCTGTAGTCCCAGCTACTTGGGAGG CTGAGGCAGGAG AATGGCGTG AACCTGGGAGGCAGAGCTTGCAG 
TGAGCCAAGATCGGGCCACTGCATTCCAGCCTGGGCGACAGAG 

AAATGTGGACACAGACACATGTGCAGGGAGAACACCACATAACCATCAAGACAGAGATTGGG^ 
TGCCCAAGCATGCTGGCAAACCACCAGAAGCCAGGAGGTGTCGAAGT

CCTTGTTCTCAGACTTCCAGCCTCCATAACTATAAGACAATAAATTTTGG 

CAGCCTTGACAAACTAATACAGGACTACATCATTCTTCACGTCTTGG 

TAGTTGACACAGCATTTTCTACCAAAGOGCCTGTTGATTTCCTTCAGAGC^ 

TGTTTATTTGTCTGTTGCCTTCACCTTCACTA 
CAGCTCTCAGAGTGCCTGG AGC ATCCCAGTGCTCAATAAACATGGAGGGT AAATGACGCCAAGATGCATCAAAGGTTAAGG AGAAT

GCG C CAAG AG CTGTGT ACGTGAATTG C CATGCACATTTGGT ATTTAG CAGAG AT AAAGTTTTTT AAGGAAGTGAG AACATTTCTAA 

ATGTTCTCAATCCTTTGTTTTATGACACTCTGCTGTCTTTCT 

TATTCAGAATGCTATGAAGTTTGGGAAGGAAGAGAACACACTGAACATCATC^ 

AAATGGTGAAAGTTGCAGCAAATGAGCAAAGAACCTGAA^ 
AAAGTGCTCGACATCATGAATCATCAGAGAAATGCAAATCAAAATCACAATGAGATATATCTCCCT

AGGGAAC CTTTGCACACrreGTGGG AAT AT AGACTGTTG CAG C CATT ATGGAAAACAGTATGG AGGTGC CTC CAAAAATT AAAATG 

GAATTACCATTAGAATGAGACTACCAAATGGAACCACCATGCGATCCAGCAATCCCACT^ 

ATC21ATATCCTGAGGAGATGTCTGCTCCCTTATATTCACTGC 

CACCAATAGATGAACAGATAAAGAAACTGTGGTTCATATATACAATGGAATATTATTCAACCATAAAAA^ 

TCATTGGCAACATGGATGAACTTGGGAGACATTATGCTAAGTAAAATAAGCCAGACACAG

TACATGTGGAATCTATAAATGTTAAACTCAGAGAAGCAGAAAGTAAAATGGTGGTTGCCAGTGACTGG 

TT AGT CAAAAG AT ACAAAACG TTTAGTT ATGT ATAAT ATCAATATGTTG TGGG AG CTGATCAT AG CAGGG CGAGTGGCGGATGTTT 
TAATTCATTTGTGGTAATTGTTACACAATATATGTGTATCAAATAATCATGTTGTT^ 1 ATTTGAC 

AATTACATATTTTAITTTTACrrACTTTA 
GAGTTCAGTGGCATGATTATAGCTCACTGCAGACITGAACACCTGGTCTCAAGGG
GACTATAGGTGCACGCTACCACACCCrCTTAATTTTTTTTT^ 

GG CTGGTTTTG AAATCCTGTCCT CAAGTGATCCTCCTGCCTTGACCTITCCAAAGCTCrGAGATTACAGG AATGAG 

GGCTGACAATTAAATATTTTAAAATAAGAAAACAAATATAACAATATTAAAAATT^ 

GTAGTCTCCTAAATTTAGGTTACAAATTTGAAATTTATTCTTTAAAATATCTTCAA 

AGAACX3TCAAGGCAAAGCTATAGACACCATGTCCCTCTGTCTTT

TAAGGAATATTTCACATATTCCACAGCCATTTGGGAAATTATGCCAAAAAACACTTC^ 
TTGGGAAAGGAAAAACACACGCACTGCACGCAGACACACACGCAC^ 
ACATAGAGACACAGACACACACACACACACACAGACAG^ 
GGGCAGAATGAGTATCACCCAAAACTGAGCAATGTATTACTCAATC 

CCCCACTTCCTAGTTTCTTGTAAAATGGAAGGAACCATTACATAAAGTAACAG

CAATGACCITTGAACTTTGACTGATGGTTTTTrCAATCCCATTG 
TTCAACTTGGACTTCACAATAATGTTCTTTGCAACTCAGGACTGTTAATCTGTGGC^ 
GGAACCCTCGTTTTCCATGACrrTCTAACCrCCTGAATTGCAGCACCTTGGA 
AGTGATTCTGGGGAGAAAACTGAGCATCTGCTTGC^GACCTGGGG 
TATATGCCTTCATGTTACACATTCCTAAGTTGCATGTACAGTTCT
CTGGGATGCCACAGTGGAAATCAACATGTCAACCTACAAGACT^ 
TCACTGCTGTATCCCTAGTGCTGGGAAATGTTCCTGAGCACTCAA 
ATTTTAGGAAACTAAGTTAAAAACCCATTTGAAAGGGGTCAACTTTGAATGTG 

AAGGAGAGATCCGGCACTATTGTGCTCAAGTAATATTGTCTGCACCCGACTCCCCTGAAAAGAAATGAGTCCT 

AGATCTTTTAGGATATTCCATCCCACAGGTGAAATACAAACATGCTTCATCCAGGAGATGAC

TCAACTTTTTTCACGTTAGACTGAAGTGTTTGGTCAGATTTCCTCAGTATGACAAAGGT 
AGATGGAGTCTCGCTCTGGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGTGATCTCGGCTC^ 

AGACGGGTTTTCACCGTGTTAGCCAGGATGTTCrCXSATCTCCTGACCTCATGATCCACCCGCTTCGGCCT 

T ACAGGCGTG AGCCACTG AGCCTACTTTCTT ACCCAGG CTTCCAAAAAGG CTGG AGTTTCCACAGAG ATTT ATT ACTAT ATGCATC

TCATCAAAGGGCTAAGATACrTCTTGACTGTGTATATCAGTACCATGTGGCTGTTTTGGTT 

GACATGTCATCAAGGGCCTAAGAACAGAAATTCATTCTGTCTATGTTTTATGAGTTT 

ATGCTTCAGGGAGGCCCCAGGACTCAGGAAAGGGGGGAGTCTGTGAGTCCT 

CAAGGCTGTGGAAAAGATGTTCTCTCTGACTTGGGACATGTATAATTTGTTGCACCTGAGACATGTGTAGGA 

AGCTCAGAAGTTACAGATTTTGGAGTCATCAGCAA

ATGTAGAATGAGAAAATAAGAGGGTTGAGGATAGAATGCCTGGAAACACCATAATTTAGGGGGCT 
CAAGAACTAGAATAGGACAGCAGGCTGTGACATGGGCATCGTGGAGAGCAGCCT 

ACAGAGGAGATGCGGATTGGAAACGGTCCCTTAGAAAGATCACTAGAAGGAAATACATAAAAATATCCACAGGT 
AGTGAG ATG T CAGGTGGCTTATGTT CTTCTCT C AC CATTCTTG CAT CTTC CAAATTCTGT ACAAG AAACATT AATT ACATTT AT AT 
TAAAAAGCTATTAAAATATAAAACAAGGCTCTGATCAAGTCTAAAATATTTTATGAGAG
ATTCACAAGTATTTCAGTGCCTGTGTGCTGGGCACCATGCTAAAGATAAAACAATG 
AAGGAGCGGAGGCTGGTTTGGCAAGGGAGGGATGTAGGGCTTGTGGGACACAATGGCAGGAG 

GCCAGATTACAATGGACTTTATGTGTCTTGTCAAATAATATGGTCTTTATTCTGAAGGCTACAGAGAGGCTTTTGAGG 
GTTAGGTTTGGAATTGTGCTTTTACCTGAAGTTAATTTTCGACTGG 

CAGGAGACCAGG AG CACAGGCAGCAGGCAAGTGCAGTAACTGAATGAAGG CAGTGC CCTG AGC ACGGG CCAGG CCCAGAGATATTT

AGGGGTAGCGTCAATAGGATATGACTGATTGGATGTATGTTGC^CGAAGTAGGGCATTAAGAAAAAAAAGGAACAA 

CCTGCTTATCAAAGACCATAATGAGGGCAATGGCCTTTCCTGAAACACGTTCCTTGGGAAGTGA^ 

GGAAGCTATCTGCACCCIAGCAAGTGAGTTrCCAGTTAGCCrCACG 

CCCCGGCCCTGTCTGGCTCCCCTGAAAT CACACCACAG AGGCAGCTG CCTTTCGTAAAC AC A AGGCAGGCAGTGTTAGCTGGTCCA 
ACTAGACCTCATTCTTTCTTTCACCTGGGGCCTGTGCACGGAGGCTGCTGACATGAGTGGACTCCC^
GTTCAAAC-TGACACTTTGCATTCTTCCCTTCTTCT
TACAGGAAAAAAAAAATGAATACTAGGTTTTTAACTTGTC^ 
CTTTCCCTTAGTCGTCATTGCTATATGACATAAGAAAGT^ 
GGGAGGTTGAAGCAGGAGGATCGCTTAAGGCCAGGAGTTCAAGGTTAC^ 
ACCCTCT CTTTTT I 1 1 1 j/lT l" J, TT Tl'lT'lX^G ACAGAGTTTCG CTCTTG TTGC CCAGG CTGG AGTG CAATGGCACAATCTCGG CTC

ACCACAACCTGCX3CCTCCTGGGTTCAAGCAATTCTCCTGCCT 
TGGCTAATTTTGTATTTTTAGTAGAGATGGAGTTTCT^ 
GCCrrGGCCTCCCAAAATGCTGGGATTACAGGOVTGAGCCACrG 

TTAT AATAG AGGGGATAAGG AAT AATGTGT ATGTG AGGTCT ATTTCTGAGAC CAG AG CCACAAAACAGAATCCTGCACCTT AGC CA 
TTTT CTT AC CACACGCGTTTTCTTGGATACTTCCATTTTTC CAGTCT CAAT ACTT CTG TG ACAGAG AGGTGTTGGAGT

tttttutti i gtttttg TTTTTTCG CAACTCTTT ACAT ACAAACAGCT ATGGT cc cagg aagg aacttcctctgttgagt acag 

AAGTCTGGGAAAAGACTGGTGAAATTGCATGCGAGGAAGAAACAGGTCTAACTAAATGTGAATCT 
ACAAAGGGTAGGTCCCAAGGAAGTTGCATTGCrrcCTGCACTTGTCAC^ 
AGCGTTGAAAACTGAGGCCTTAGGCTTTGGTTGAAAACCGAGGCCAAGGGTG^ 
TTTGCTGGAAGTGCTCCAGCATAAAGCCTCTCTCCAGAAC

TCCACCTCGCCCTGCGGTGAAGAAACTTCGGTCGCACAGGCTCCAGAACTCATCTT^ 
CCTCCTTTTCGTTCTAGCTCCTAAATGGAGCC^AGCGCCAGGTGGTGTCAGAG 

GTGTCACAAAACGAATTTTCATTTACTCTCAACAGTCACAC 

CCCCGGACCCCG ACTCGGACGTTCTAG CCGCACGTGCGGGCGCCCCTGGGTGCGAGCCG CGG ACTCACGAGCCGCTGCTTGTTGCT 

CCCGCCGCCGCGACCTGGGGAAGCGAAAATGAAATTGACTTTTC03AGAAATGATGA 

CGGCGAACITCCGCACCTCCCGGCCGGGCCGCTC^ 

GGGGAGGAAGGGCGGAGGGAGGAGGGCGGAGAGGGGAGGCGAGGCAGGGAGCGCGGCCC^ 

CAGCCGCGGCTCTTTOCAGTTTCCTCTCCTTGTTTTGCTTTCGAT

TGCTCCTGCGATTATTCAACTGACGGGCTTTCATTTCCATTTC^ 
GCGTGAAAAAAGCACACTGAGAGGTAACATTTTTCAAATAATAGTTGTGACT 
CTGTGGGGGGCCGTGTGTGTATGCGTGTCCTGATGCCTTGACACGATGTTTTGCAGACGGAAAA 
TACTGATTTTTGGAGGAACAGTCGGOSCAATTATGGATCATCTG 

GGGGCTGCTGAGGCTGCGAGGAGGAGGAGGAGGAGGAGGTAGAGAGGGAGGAGGAGG

TTGGATTTGCAGATGCGGAGAGGTGGCTCCATGGACACAGACCTGCTC 

CAGGT CAGCTGGAGTCGGGCCTCATTT AAAACAAAATTGTGTGTATGTGCGTGTTTGTCTC CAGCC 

ACCGCCGCCACCGGATCCGAAGGAGGGGAGCGGGTCCTTCTCGGTGCCCGCXKfCGGCGCCTCT 

AGCAGGCACCGCTCCCCGGCCGGGAGGAGCGCCTCCAGCCACGGCCAGAGGAAACTTGCT 

TCGCCTGCCTGCCGGGCTCGGGGTCGGGTCGCGGG CGTCGGGTG CC^US CGGCCGGGGCGGGAGGGCGGGCTCGGAGCCGGGCCGGG

CGCCGGGCCTGGCCGACGCX5CCCTGCTTCCTGGGGGCTGGGCCGCCGCCGCCGCCGCTCAGAAG 
CCGGCCAGCGCCCGGTCGGGCCCTGGGGCTTTGTCATTGCACTTGTCCCAGCGCGCCA 
CACCGTGCOSAGGGGTCAGCCGCCCGGGCCGCGGC^
CGCTGGGCCTGACCCGCCTCGGCTACGGAGCGGGCGAGGTGGGGGCGGGGAGCCTCCGGGGACCGAG 
GTCCCCTCGGGGTCCCGGGGCGCGGGCCGCGGCGGGGTGGGGACGCGAGTGCCCGCCCCGTGCCCTCCGACCCGGC^
GGGTGACATTCGCCCGCTGGCCGAACATGGTTGGTGCAAAGCCGCGG^ 
CCACCGAGTGAAGCCGGCCGCGGCCAATCGCGGCGCCCGCCTGCTCCCAGTGGCACGGCCT 
GGAAGCCGTTCCAAGTTCCAAGGGTCGCGGGTGTAACTCATGCCCCTCCCAGCACGC^ 
GGTGCGTTTCCCACCCTCTCCTCCTTGTTCCCTAAGAAGCGGGGGA 

TGGTCGGGCACCAGGTGAAACGCCCG^TCAGTCTGTGTAGGCCGGACCCTTGGCGCTCGGCTGCGGATGTGATCCCGGGGACTGG

CGGCCGAGGTAGCTCAGAAGAGCGAGGTCGCTTTCGCTCGTTTCTGACCCGCTACT 
CCTCCCTCTTCAACCTCCTCCCTCCGGTGGCTCCTTGTCCTTGGTTATGGAAGG 
GGTGCKK3CATTCGTCTTCAGCCACCGGCCTCCTTATGGCTGTGACT 
AGTTAGATTTCCTGTTGTGGATTTTCAGGCATCACCGAGACTGCTGTCAAG^ 

GGTTCTCTTTGAGCAGTTCAGTTTTTGGTGCTTAAGGGCA

ATGCTAGGCAGATGCAGTTATCCCTACCAGGCCCAGAGC^TGACTCTGTTTAACTTCACCACGCA 

TTTCCCCAACTTTCCTTTATTCTTCTTTTAGTGGGAGAAGGGGCAAAAGTAGAAACCT 

TGAGGAAACTTCAGCAGATGAAATATTTAAGA 

AAGTCCGGAATTTCACTGTAATTAGTGAACGATTATAACTTGTAACTTTGTCTGTTATGTAAGAA^ 

TC CAAGAGTT CTTGTCTGTT ACG ATTTGTAACTG CTTG TT AAAATT AAAC ATTT ACATATTTCATT CTCAGTG AGTTGT CATTTTT

TTGTT AG AGAAAG TT C CAGTT ATT TCAG ATTT ACTT AC CTGT AAAAAT GT CAG CAT AATCAAAAG T AAGG ATCTGT AG TCAAACTG 
G AAG CAAATTATGGCAAACACATGCGTGCCATGAAGTTCTGACIAG CCTTATGAAC!AATGGTCCCACITT^ ATCCCACITTCTGAAA 
C!ACTTTATTAGGCAAATAGTGCACCAGAGCAAAAGATGTTTAA^ 
TGCATCAATTTTCTCATGTTACCAACGATGTTATATTTTTATTCTTAACATCATCT 

CACATAATATCCTTTTGATAGTGAAAAACCACTGTACTCTATCTAAATGGAGTATCT

CAAAAAGGGCTTAACGCAGACAAGAGAATTTATGTACACACTCA^ 

TAGCCTCTGTATACTCAACTGACAGCGAGAAAGACAGCATCTTTACACTAGGTTGTGTTCT 

AAAAAATATAGTTATTACCTTTTGGGGTTAGCTGTAAGGAGGATAATAATA^ 

AACAATTGAGATACTGTATATGAAAGAACATTGCATAAATTATGGCAGTCTA 

CAGATTATTCTTAGTGTAAAAAGGCCAGATAATCATGACTGGATTAGACACTATCT^

TTACrrCTGAGCATATATAAATAATGATAAAAGATTAAACATGGAAT 
TCCGATATTGTCAGTGAATTTTTTCCTGTCAGAAC^AAAGTTAGGTGGTATCCATGG 
TGGACCAATGTATTTTCCCTCTTTATGTCAGATAAGAAAATCTTCCCAAGCCT 
GTGTCAACrrCTTCATTATCTGTCATTGCAGGTGTGCACTGGGCTGATTGT^ 

TTTTTTTTTTTTG AGGCAGAGTCTTGCTCTCTTC

AGGTTCAAGCAATTCCTGTG CTTCAGCTTCCCGAGTAGTTGGGACTACAGGTGTGCGCAGGCGTGCG CTACCATGCCTGGCTAATT 
TTTGTATTTTTAGTAGAGACGGGGTTTCAGCATX5 CTGG CCAGG CTTGT 
CTTCTGAAGTTCTGXK5ATTAC^GGCATGAGCCACTGCGCC^ 
TGTGTGCCCAACACTGTATATGTGACAACATACATGAAAGTAAATATTA^ 
TAAT ATATGC AAAT AAATG AAACTGCGGTG AAATGT CAGTfATT ATT AGT AAAAATT AGTT CTT CTTTTCATCTG T ATG AGACATG
ATGTTCCTTCCTGCACAGGGGTAGGAGGTCAGGACCTCCGGTCTCTGCCT
CAGTGGTCGGTGCACGATGCCAGCTTCATGCACAGTGTGCTGAATTGAGGAGTGGCT 
GGGTGAGGAAGTTCCATCAAACTTTAATTTATCAACAAA 
GACAGTATTTTTTATAGAGTACCAAGGAAAACACACAAGCAGGAA^ 
AGGCTCGTTGGATTCAGAGCAAGCAGCAGGCCTGAAGTCrTGGCATTGA

aagctccatcttatgacaaactgtctctgaaacaaattgtgtt^ 
aaagtctcactctgtccaccaggctggagtgcx^tggctcx3atcatagct 
cctgcctcagtctcctgagtagctgggacaacaggcatgtgccaccccaaagcccgacata 
gctatgttgcccagactggtctcaaacccttggcctcaaatgatcctccctcct 
agccactgcacccggccctagtgcccatttttattcctca

actggtatcatcaaatctcactgatctcattatagccaaagct 
gagaaactcaccatcccgttaggctttctgaggcactgaaactagccgggacccrrccatcaa 

ggg acc ccatgcaatt atttt aaatgaat aattgattttgttaaatgt ag t aacacatg caatggtttt cttaaatgcacacattg 
tgttattatttcatttgtttattaaatatttcttgacagccagctgtgtac^ 

cctagagtggccttttagttctccaatcacttcctcctttggaagttttcact

ggagacagaacctgtgtgtgttgtgaaaatcacctggggaggggactttta 
tgtagccccctttcccitccttccccttcctgccttgatagcgagcctct 
aggtgtgtggccagagtagaaatggaaagccactgaatcgaggatctctt^ 
atgaaacctctttccacattttaaaaagcacttcggacaggggcctgtggtcac^ 

catacagaaatgggactcatgttaagtttgtgatacattgaa

ctcctgggctggagggcagtggtgtgatctcggctcactgcaaccrccgcctcctgggttaaagtgat 
cgagtagctgggattacaggcatatgccaccacgccgggctaatttttgtatttttagtag^ 
gctggtctcgaactcctgacctcag^tgatctgcccgcctcagcctcccaaagtattcggattaca 
gcccagttttccattttgaagaggcaagaaggactttattttattttatgta^ 

aggcgtttggttttgttttgtttctattttggtttataact

agttgacaaagaatggctgcagatgagaagcaggtgtgtgaggttccttctgatgatggattct 

gaattaactacttgctgtgtgattttcataaataattt 

ctcgcacacctaatgccatcaaaggagagatgagtatacactttaatttgttcttattgagtc 

tgtcagattttgggaagatccaaagaagtactctgtagacagaatacgctgtcctggccccctaagctr

cccatttcatgagattgacctttcagatgaggagggact 
aaagaattctittgtcctttttaccctagatctarrggaaaagatgagaaaaatt 
atatgatgggtgtgtcgggtgaaaggatagcaatgggaggctggtggaaagtgagagttgg<k;gatggggtgatc 
aggtgactagtgactcctggattcttgaatgggagagagtgaagagacaatcaggcagg^ 

actgaaagagttttagagatcggaggggtgaggtgaa

gaccctccactgctgactgtgggatcagatccttttatcctggggatgagaaaggctgatg 

attataccaccaggaaacatgaagttgtgagtagttgtcacttggacatggctgaagtcacctragggt 

tcacaa.cgtttaggtcatcaatgggccatacacatgactgtggccccataagatgat 

t ag at acacag atacttgccattgtgctacagtggcctacggtc ctcagtac agt aacctg cagtacatgtttgcagcctggaagc 

CATAGGCCATACCATAAACCTGGGTGTGTAGGAGGTTATACCATGTAGGTTTGTGTAAGTACACTCTC
AAAATTGCTTAATGGTGCATTTCTCAGAATGTATAAGTGTGCTGATGTTGATGCA 
ATGTGAGGCCCCAAACTCAATCTAAAATTGCCTTTGGAGGCAACCACGGAAATACGGTAATAGCT 
TACTGATCTTCCTTGGTACTGTTTrATTTTATTTT^ 

AATAATTTAATGCATTCATATTTGTAATTTGTAATGATAAATCAATGTAATTGTGCTATTGAGA 

CAGGGAAATAGAATGGGCTGATAATGTGTTTTCCTAAAGAACAGATGGGATTGGCCGTGCGCGGTGGCT

CACTTTGGGAGG C CG AGGCGGG CGG ATTT CCTG AGGT CAGG AGTT CAAG ATCAG C CTGG C CAACAT GGTG AAAC CCCGTCTCT ACT 
AAAAAT ACAAAAATTAGCTGGG CATGGTGGCGGG CACCTGTAGTCCCAGXTTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACC 
CAGGCGGCGGAGGTTGCAGTGAGCCGAG ATCGCACCAG CTTGGGTGACAAG CGTG AGACTTCATCTCAAAAAAAAACAAAAAACAA 
AAAACAACCAGATGGGACAAGTTAAAACTGAACTATGAATTCCATAA 

CACAGTGACTGTAGTCAGGATTCTACATTGAAATAAACTTTCTGGAATTTACCTTGGTGCTATA

AAGCTCTCTGTTCCAGAAATTCTGCTTCTTGGAACCTACTTACGTGAAGTAACCAGAGATGGGAA 
GTTTATTGAATTAGCTTTTGTTATGTAAAAACTTGGAAACAAGCT 

CATCCAGACAGACTATAGGTAGCTTATAAAGCTCGTGTTTTCGAGCAGCGTTTAGTGATGGGAA 
TGGGTGGGGAAAAAAGGATATGAAATTGTGCTTCTAGTTTTATTTTTAAAACATATGTGTAT 

AAGAAAACATATCTTTTTGATGGTGAAATTGTGGGTGTTTGGCTTTTTTTTTTCCTr

AAAACAGCAAGCACAATTACCAATTACCCTTTTAACCAGAAAAATATTCTCAAGAGAGTGCTAATAT 

CTGGTCCACGTAAGGTTAGGAAAAACACACTGGGTGCTGCCGAGGAGGAGGCTGTGCATGGACGG 

CCTCCATGTGTGGAGCAGGCTCTAGAGCCAGGGCTTTCCTGGCCAGGCTCTTGGAAGCCTT 

GGAGACACTT AG ATTGGTGG CTTTTTGGTGAGGTTGGG ACATAATGTTTTGTCAGTAACTGTGAAGGG CTTGAGATATG ATTTCAC 

TTCACAGTT AACATT CTTT AT ACAT ATGAATACTG CAG AAACACAAAACC CCTCGGT AGGA^ CTTGTTG AT ATTT A

CAACTGGGG CAACAGCCTG AGTGACACTATTGTGTCAGGCCCCTAAACCTTG AC CCTT AGGTGACACTGAAAACTAATGG AATGTA 
CATGTGGGCTACACTTTGTTGACAGGGACAGAGCAGCnGTCC 

GAAAAAGGAAAACCAAACAGACAAACACCTTAGCTCCCCTOACATGGGATGAAAAGGTTC 
ATGTGAGACACTTGTAACCTGGGGATGTCTCCAAGGTCIGTTTTT^ 

CCAGCACCTTGGGAACCAAGGTGAAATTATTTGGTCCTCTTGAGGAGAT^

TGT ACAC AAATGG CT ACAG ATCT AATTCT CATGGT AAGTGAAGT ATAAAATGTTTGT AG 
TTCATGTCCTGTATAGTCCCTATCTTCATGTCCTGTACATTCCCTATCTTCAT^ 
GGCTCCTATATTCATATCCCATACATTCCCTATCCTATACATTCC 
TGGTCCCCATCTTCATACCCTATACATTCCTTATCTTCATCTCCTGA^ 
ATTTCTTAACAGTCCCCATCnTCATATCCTATATAGTCCCCATCTTCATGT

ATACATTCCCTATCATCATGTCCTAAATGGTCGCTCTCTTCATATX5CTGTACATTGTCT 

TTCATGTCCTATACGGTCCCTATCTTCIATGTCATAAACGGTCCCTATCTTCGCATCATATGCATTCCCTATCTTCGTGTCCT 
ATTTTTTCCCTTTATATTCrATTCATTTCTGTC 

TATTTTCCCCrCACACCrGCCTTTACCAGACCCCTGTTGGAGAATGACAGTAGAGCTTAAT 

ACTGAAGACCTCAAJITCTCCAATTTTTATCTGAAAAAAT^
CAGTGGTCGGTG CACGATG CCAGCTTCATGCACAGTGTGCTGAATTGAGGAGTGGCrCTGGTGG CGGGAGTGACTG ATGGAAAAGT

GGGTGAGGAAGTTCCATO^CTTTAATTTATCAACAAAGAACAA 

GACAGTATTTTTTATAGAGTACCAAGGAAAACACACAAGCAGGA 
AGGCTCGTTGGATTCAGAGCAAGCAGCAGGCCTGAAG

AAGCTCCATCTTATGACAAACTGTCTCTGAAACAAATTGTGTT^ 

AAAGTCTCACTCTGTCCACCAGGCTGGAGTGCGGTGGCTCGATCATAGCT 

CCTGCCTCAGTCTCCTGAGTAGCTGGGACAACAGGC^TGTGCCACCC 

GCTATGTTGCCCAGACTGGTCTCAAACCCTTGGCCTCAAATG^ 
AGCCACTGCACCCGGCCCTAGTGCCCATTTTTATTCCTCAGAAACACA

ACTGGTATCATCAAATCTC^CTGATCTCATTATAGCCAAAGGTAGAGACCCG 

GAGAAACTCACCATCCCGTTAGGCTTTCrcAGGCACTGAAACTAGCCGGGACCCT 

GGGACCCCATGCAATTATTTTAAATGAATAATTGATTTTGTTAAATG 

TGTTATTATTTCATTTGTTTATTAAATATTTCTTGACAGCC^ 
CCTAGAGTGGCCTTTTAGTTCTCCAATCACTTCCTCCrTnKSA

GGAGACAGAACCTGTGTGTGTTGTGAAAATCACCTGGGGAGGGGACTTTTAGAAATT 

AGGTGTGTGGCCAGAGTAGAAATGGAAAGCCACTGAATCGAGGATCTC^ 
ATGAAACCTCTTTCCACATTTTAAAAAGCACTTCG 
CATACAGAAATGGGACTCATGTTAAGTTTGTGATACATTGAATTTTCCn^
CTCCTGGGCTGGAGGGCAGTGGTGTGATCrrCGGCTCACTGCAACCTCCGCCT 
CGAGTAGCTGGGATTACAGGCATATGCCACCACGCCGGGCTAATTTTTGTATTT^ 
GCTGGTCTCGAACTCCTGACCrCAGGTGATCTGCCCGCCT^ 

GC CCAG TTTTCCATTTTGAAG AGGCAAGAAGGACTTT ATTTT ATTI^ CAGG AAGG C CATGG AG AATT 

AGGCGTTTGGTTTTGTTTTGTTTCTATTTTGGTTTATAACTCC^

AGTTGACAAAGAATGG CTGCAG ATGAGAAGCAGGTGTGTG AGG TT C CTT CTGATG ATGG ATT CT AAGT ATTTTT ATCC CAAAGG CA 

AGACTAATTTAGTTCAGCAAAATACCTTAATTTAGGATTTTGTGTGATAAAAGCCT 

GAATTAACTACTTGCTGTGTGATTTTCATAAATAATTTCATT^ 

CTCGCACACCTAATGCCATCAAAGGAGAGATGAGTATAC^CTTTAATTTGTTCTTAITG 

TGTCAGATTTTGGGAAGATCCAAAGAAGTACTCTGTAGACAGAATACGCTGTCCT

CCCATTTCATGAGATTGACCTTTCAGATGAGGAGGGACTGCATACAAAT^ 
AAAGAATTCTTTTGTCCTTTTTACCCTAGATCTATTGGAAAAG^ 

AT ATGATGGGTGTGTCGGGTGAAAGGAT AGCAATGGG AGGCTGGTGG AAAG TG AG AG TTGGGG GATGG GG TG ATG AGAAGG AAAAT 
AGGTGACTAGTGACTCCTGGATTCTTGAATGGGAGAGAGTGAAGAGACAATCAGGCAGGGAGA 

ACTGAAAGAGTTTTAGAGATCGGAGGGGTGAGGTGAAGTTGGTTAGGCAATTAGTGATAACTCA^

GACCCTCCACTGCTCACTGTGGGATCAGATCCTTTTATCCTGGGGATGA 

ATT AT AC GAC CAGG AAACATGAAGTTGTGAGT AGTTGT CACTTGG ACATGG CTGAAGTC^ 

T CACAACGTTT A G GTCATCAATGGG C CAT ACACATGACTGTGG C C CCAT AAG ATG AT AAT AT CAT ACTTTT ACCTTTTGCT ATGTT 
TAGATACACAGATACTTGCCATTGTGCTACAGTGGCCTACGGTCCTCAGTACAG 
CATAGGCCAT AC CATAAACCTGG GTG TGT AGG AGGTT ATACC AT G T AGGTTTGTGT AAGT AGACTCTGTG ATGTT CACACAATGAC

AAAATTGCTTAATGGTGCATTTCTCAGAATGTATAAGTGTGCTGATGTTGATGCATTC 
ATGTGAGGCCCCAAACTCAATCTAAAATTGCCTTTGGAGGCAACCACGGAAATACGGTAATAG 

TACTG AT CTTCCTTGGTACTGTTTTATTTT ATTTTT AAACT T 
AATAATTTAATGCATTCATATTTGTAATTTGTAATGATAAATCAATGTAATT 

CAGGG AAATAGAATGGG CTG ATAATGTGTTTTCCT AAAGAACAGATGGGATTGGCCGTG CGCGGTGGCTCACGCCTGT AATCCCAG

CACTTTGGGAGGCCGAGGCGGGCGGATTTCCTGAGGTCA^ 

AAAAATACAAAAATTAGCTGGGCATGGTGGCGGGCACCTGTAGTCCCA 

CAGGCGGCGGAGGTTGCAGTGAGCCGAGATCGCACCAGCTTGGGTGACAAGCGTC 

AAAACAAC CAG ATGGGACAAGTT AAAACTG AACT ATG AATTC CAT AAGTGG G T AGTGTGGG T CACTGAGTGGGG CACTG AGT ACT C 

CACAGTGACTGTAGTCAGGATTCTAGATTGAAATAAACTTTCTC^AATTTACCTTGGTG

AAGCTCTCTGTTCCAGAAATTCTGCTTCTTGGAACCTACTTACGTGAAGTAACCAGAGAT 
GTTTATTGAATTAGCTTTTGTTATGTAAAAACTTGGAAACAAGCTAAATATCCAA<2AA 
CATCCAGACAGACTATAGGTAGCTTATAAAGCTCGTGTTTTCGAGCAGCGTTTAGTGATGG 
TGGGTGGGGAAAAAAGGATATGAAATTGTGCTTCTAGTTTTATTTTT 

AAGAAAACATATCTTTTTGATGGTGAAATTGTGGGTGTTTGGCTTTTTT

AAAACAGCAAGCACAATTACCAATTACCCTTTTAACCAGAAAAATATTCTCAAG 
CTGGTCCACGTAAGGTTAGGAAAAACACACTGGGTGCTGCCGAGGAGGAG^ 
CCTCCATGTGTGGAGCAGGCTCTAGAGCCAGGGCTTTCCTGGCCAGGCTCTTGGAAG 
GGAGACACTTAGATTGGTGGCTTTTTGGTGAGGTTGGGACATAATGTTTTGTGAGTAACTGTG 

TTCACAGTTAACATTCTTTATACATATGAATACTGCAGAAACACAA

CAACTC^GGCAAGAGCCTGAGTGACACTATTGTGTCAGGCCCCTAAACCCT 
CATGTGGGCTACACTTTGTTGACAGGGACAGA^ 

G AAAAAGG AAAACCAAACAGACAAACACCTT AG CT C CC CTGACATGGG ATG AAAAGGTT CCAG GGGT CTT AT C CTAACCTTT CAG A 
ATGTGAGACACTTGTAACCTGGGGATGTCTCCAAGGTCTGTTTTTCATGCT 
CCAGCACCTTGGGAACCAAGGTGAAATTATTTGGTCCTC

TGTACACAAATGGCTACAGATCTAATTCTCATGGTAAGTGAAGTATAAAATGTTTGT 

TTCATGTCCTGTATAGTCCCT ATCTT CATGTCCTGTACATTCCCTATCTT C ATATGCTATACAGT CG CTATTTTCATATCCTAT AT 
GGCTCCTATATTCATATCCCATACATTCCCTATCCTATACATTCCCT 
TGGTCCCCATCTTCATACCCTATACATTCCTTATCTTCATGTCCTGAACAGTCT 
ATTTCTTAACAGTCCCCATCTTCATATCCTATATAGTCCCCATCrrTCATGTCCTGTACATTCCCTATCT
ATACATTCCCTATCATCATGTCCTAAATGGTCGCTGTCTTCATATCCTGTACATTGTCT 

TTCATGTCCTATACGGTCCCTATCTTCATGTCATAAACGGTCCCTATCTTCGCATCATATGCATTCCCTATCTTCGTGTCCT 
ATrTT ^ cccl ^ ATA ^CT ATT CATTT CTC 
TATTTTCCCCTCACACCTGCCTTTACCAGACCCCrGTTGGAGAATC 
ACTG AAG AC CT CAAAT GTCCAATTTTT ATCTG AAAAAATATTAG AAT AG TTT CACT CTCTTT ATGAJ^.CATGG AGACGTTTTTCGAT
TGCTACGGAAGTAGGATTGGGCATGACCATTA

TATGCACGATATTGTCCAATACAAATATAATGTGAGCCACATGTATAGTTTTAATTTT^ 

AGT AGGT AAAATTAATTGTAAT AAT AT AT CT CATTT AACT AATTTT AT CTAAAAT ATTTCAGT ATGT C GTT AATGT AAAAATACT A 
ATGAGATATTTTGTATCCATTTTTCATACTGAGTATTTAAAATCCAATC 

AGACAATACAGGGGAAAGTTGTAGTACAGTCCTGAAAGGAOTGAAGGAAA^ 

GAATCTCCCTGTGG CTCTCC CCACAAACCATACCTGATCT CTT CAGCCT AAAACAATGAAATGCTCTG AAT1TCT AT AGAAGCAG C 
ATACTCTACATCCCATTCCTAGGGTAGTTAGCTATCTGCATCTTGAT^ 

"aaatcttagattatttgaatttttccatttgtatcacctatgtg^ 
ttcctcaccggtgtttgacacctttaagacctagataaatattagttt 
ggctgaattccagctcaggaagggtggcacagatctagtttgta 
gatttgtcactttgcattaggatttgttactttgtattaggggt^ 

GTATCCAGACCTTTAAAAAATATTATaWGTTTCT T CTTTO 

aataagctatatttggtgatgacctggagataaacatttggaagctgctacttttt 
tattaacaaacacagcccacaacatctaggcgatgaaattcttttgaaato^ 

CTTTGACTGTAATTCTTOKSTTGAAGCAACAGACrr^ 

cttaggtagcttttctctgcaagattaaatgcgaccttcatggaatcatgtta^ 

CCAAACCTTCAGCAGCOAAATTCCCAGCTGCrCAGTGGCTCAGAGCTTT 
GTTTTAAGTGAATTCTAGTAGGTTTGGGGATTTGCAACTGACAGAAGAA^ 
AAGGTAAATCTCTATTGGCTCTTTCTTAGGATGCTGGGATTTT^ 
CCTTTGAAGGTAAGGCCCrAGAAGGOVGACGGAGGCTGG 

ATTGTAGGCGGGTGGAAACTTGGGAAGCCTGCGAGGAAGGTCCCTGTAGGGAATCTAGCT 
GGTCATCCTGACTATTGGTGAAGTCAGGGTGAATTGAAAATGTTACAGGAGAGGATT^ 
TGCGATTTTTGTTTTTCTTGAAAGTCCTCGCAGTTTGGG 
AGTGCTCAGCTGAAACCGTTGACTATTTTGATCATTCATCCATTCAGTC^ 

GTAGAGGGAGGTATAAAACGAAAGAAAATTATCTTCGATTTGAGGTAGCCAGGTAGGTAGATGGTCTC 

GCCTAATTCTTCCAGATAGAGAAAAC^CAATTAGGCAAGTAAGATGACAGTTCCAG^ 

TATAGTTCTTCCATGGGCCCTGTGGGGTTGTTTCAGATTGTAGGGGAAG 

AT ACATTTT A T T1 TT' T T TAAAACAG ACTCTT AAT AGATTGT AAAAT AGG CAT AG AACAACAT ATTGTTTTGTTGTT CTTTT CTGTTT 
TTCCACAATTCTCCCACATTTAGTTTGGAACATGGAACTATGGTCTGGT 

TTATCTCCCCTCTCT ACTTC CCCTTCCCTTGTGGGGAG ACACTGTAACCAGCAGAG CTGCAGAAT CAGTACTCCTGCCCCCCGCGG 
TAAAGTGAAGGAAGCTGATGAAAGATATCCAGGAACTCTTG 

TAAAAAGG1TT T TT T AAGGCTTCCCATTAGTGCTCTAGAGTTACTGTTGCTTATTTAAAATAACAGCCT 
AGTTCOTTGTTTGCTTCCCCACGTCGATCT 

AGTACCCTCTCCTCACAGGGCTTGAGAAGACATTGCCTTGAAGGGGAAACTGGTCC^ 
CTCTTCGCGTGTGTGTGTGTGCGTGTGCATGTGTGTGTGTGCGTGCATGTGTGTGTGTGTGCGCGTC 
GTGCATGCGTGTGTGTGTGTTTGGCTATAGTGAATACTTTAAGTTGCCTGAAATTAAGGAATA^ 
AGGAAACAGTGGCAGCAAGTAGGGAATTCATAAATGCATGTTGAAGCATTCTTTTTAAAA 

TTACCATT ACAAT CATCTTAAAGTGTATAATTTGGTGACATTAATTACATTCACyUlGGTTGCCTAAG CATC ACCACCATCCATCTC 

CAGAACTTITTTCAGTATCrGGATCAGAATCTCTATACCCATTAAACAATAACT 

TTTCrGTCTGTGAATCTGACCACrCGAGATGTCTCATGTAAGTG^AATCATACIAGTCTTT 

TTAG CAGAATGTTTTCC AGGTCCACCCATGTTGTAGCATGCAT CAG AG CTTCATTC CTCTGTAAGACTGAGTAATATTCCTAGATT 
GTGTTCG TC CATT CATCTGTTGGT AG AT ACTGGG ATT CTT ATT AATTTGG CT ATTG TG AAT AAGG CTG CTGTG ATCATTGGT ATAA 
AAGTATCTGTTTGAGTCCCTGCITTCAGTTATTTCAATTCrc 
GAACCACCAAATTGTTTTCCATAGCTGTACCATTTTGCATCCC 

ClAACATTTGTTATTATTATCTTTTTGTTTTGATAGGGCCACCCrrAATAGGTATGAAGT 

TTCCCTGGTAATTAATGATGCTGAGCATCACTTCATGTGCTCCTAGCCATTTGTATATCT 

CTTTCrrTCATTTTTGATTTTGTTGTTGTTGAGCTGTAGGTTTGAGTCA 

TGAAAAACTAAGCTTCTCCCCATTACroTATATGTAGCATCT 

TGAAGCACACTTTGAGGAATTAACTTAGGCTGGAGTGAAGTAAATCAGM 

CATGCTGGAATTGGGAGTTTCTGTAGCAACGATTGCCTGCCCCTITCCCITCAm 

GTGTATTGTAGCCAGGTTAAGTACTTAGTATGTTTTGTTGAACTGT^ 

AGAAATGTCTTGTGAGAGG AG AAAT AGCAGTG CTTGAATATGGTAGTGGATTT ATCCTCG CGGCTGCCCTTGTGGT AGCCTCTCTG 
CTTGGGACTCGGATATCGT AGTGAATTTATCATTGCGGCTGCTCTGTCTG CTTGGGACTCATG AAAAAAG CCTTAGAGTTCTTCAC 
TTCTGTGGTATTTCAGGATAAAGATTAGTAAAAACZAGCCTCTVGAGTAAC^ 

TCATGGTTCAATAGTATTACTGAGTGGATGAAAAAAAAATATTGCCGTAATAACAGATCTGTTAAGT 
GTGTATTT ATTG TGAATTGTTGACAGCAGG AG AAGC CATT CAT^ 
TATCACAACTGTATGGAAGCAAATTCATTGTTGGTTTAGTTTTATGTCCTG 
GTTGAACTTTATTGAAAACTATAGCCTGGCAAGGTCATGTGTGGATTATAACTC 

TATGTGAGAAACAGAGAAATAATATATGTGTTGTGAAGCAGAGTGGTAAGAGGAGTTGCAAAAATTTT^ 
GGCAACGATTTTAAAAATCCTTAAAATTAGTTTTAAGTACTGTGAA^ 
TCTACAAACAAATATCTTAAATAAAACCCTCAGAGGAAGTC^ 
GCTGCGAAGATGTAAGGTTGAAAATTTCAGATCCATGCTCTCAAAGAGCTTATC^ 

ATTCCTGGT CATG AAAAG AACG AAAG ATGGG CTT CATGG AAG C ACAG CAAG CCAAAGG CT AAGGGAG AGAAG AGGAAG AAG CAGT A 
TCTTCTTTTTATTCTTTTTAATTTACTGGAGCTACCCATCATGAACT 
C^CCACTGACTTCTGACTGGCCC^AAGGAGGGAGGCAATGTCATCTGGTGG 
CAGCAAGAGCAAAGACATGAAGGGTTGACAGTTTCTTCCGT^ 

GAAGGAGGGAGTGTCAGAGCGAGTTCCTGGTGGCCACAAATATCTCACTCAGTGCATGTC 

TTGTTCCGAGGATACAGCCAGCTTTTTTCT1TTTTTTTTGGAAGGTT 

AATGATAGCTAATGACTGTTAGCAGCAGGGCX5TGTCGCAGCTGTGCACCT 

GTAGTCTGCGGTCCATTGCGGGAAGGACTGGX3TTGTGCCITTGATAAT^ 

TAAGAGGAAATGATX3AATAGGTACAAAAATATTTTTTG 

GTAAGTTTATAATCCTCAAGTGAGGAGGAATTAAAGACAGTTGGCTCAAATTCAGGTAAGAAACCCAAGl^ 

ATGGCAAGACCCCCAGAAACAATGCCTTCATAGGGCAGAAGTTAGGACAGGGCT^ 

GAGTGACGTCACGCCGGGCCACAfiAAGGGCTGGGCGA^^^ 
CCCCAGAGCTGCCCGTGCCATATTGAGGCTTCTTCCCTC

CAACAGACAACCTTTAAATCACCTGCCTCCnTAATTCCCATTTGTC^ 

TGGGATACTTGGGAGGCATTATTCCACCTGTATCATGTGTTAACCCAACCACCT 

CTCTTCACTTCCCTGGGGCGATAAArTGGAAGATGTTATTCAAAGGCCTAGATA
CCACTGTATTATCACTAACTGCCGTCTTTGAACCGGTTGTATGTrGTGGA^ 
TCAGTACGTGTTGGCTGAATTCAGTTGCAGTGACTGACTTGG 

GACATTGGTCCCCCTGGGCTCAGGCACCCGGTTTCCCGGTAGGGAGGGCGGACCGGGCCGCA 

GCCCTGGCTGGACCGAGGGGTTTCCTCCGCTCAGTCGGCAGGGA^ 
C^GCTTCAGGGAAACCTTCCAGCAGTTTCTCCACCGCGGCTCTCACGTTCCC

GAC^CGCGGGCGCCACCCCTCTCTTGCCCTTGTCGTTGTTCTT 

CCTCGGCTTCCAGGCCCAGTTCAGATGCCCCCTCTTCTAGGAAGCCITCCAGAGTCCT 

AAACTTCCCCATAGGCCTTTGAAAAAAATTCATCTATTTGAAGTGCAGA^ 

AGTCTGATGCAGAGAGGGAACTCGAACCCAGTGTGTCAGACGCCCGTGATCAGTGTGATCAGTC 
GCAGGCACCCACTCGGGTGTGCTCTTCCTGG CAGGCTTCGAGTCTCACCTCCGGGACACCACGTCTTGCTTGAAAGCi. TGCTT T AA

CACTGACGTGCCCACCTTCGTCTTTGCAGGG^ 

CTTTAGATAATGCTTGGCACACAGCGATTTTCATGCGGTTGGCTCGGGGTAAG^ 
CAAAAGTGTATTAACATCCTTAGCCGGATTCACCCGGAAAAT^ 

T AGTC ATTTT ATTTC CTTTG CTT ATCATTTG TTCAT AAGTTT ATC CAT CACAACGT AT AG TT AAT ATCATCCGCCTT ATCCT CCT C 
AGCCCTTGCCCTCAGTGAGTTTTCCAGGAGGGATTCATCCACTTTCTTAT
ACTCTGCTCGGTATTTTGGGATGAGTGAGATGGGACTCCTC 
AGCrTGTAAGCTTTGGTTGCATCAGAAGGTGGAAGGAGGCTGAGTGCGGTGGCT 
GATGGGAGGATCACTTGAGGCCAGGAGGTTGAGACCAGCCTGGGCAACATGG 
AAAAAAAGAAGAAGAAGGAAGGGAGGGGGAGGGGAGGGAGAGGGAGGGAAGGGGGAGG^ 

GGGATAAGAGGTTAGAAGAAGAAGGTGGAGGGAATCAGTGGGAACAAGAGGTTAGAAAGGTGGXaATGG^

AGGAGTCCAGATTTTATTCAATGGGCGTTGTGGAACCATGGAAGGATTTTCTG 
GCCAAGAATGTGCAGGGTGGAGAATATCCTTCTATTACAGAAGCTGGACCCTGGACA^ 
AGTCACTGC^IGTTAGGTGGAGCACGGGCCCTACTGTTTGGAA 
ATACTTTTCCTCCTAACATAAACTAGCCTTTATCTGTTTCTTCTCACGGTAGCT 

GATGAGGAAACTATGGCTTGGCGAAGTTGAAGGTTGTGAGTGGTGAGGTTCCAG

CACTACCATATTATGCTTTAACAAGGAGAGGGAAAGAAAAGAAATCA 
GACCATCCAATGGATGCCAGATCCGACCCTCCCACCTCT 
TATTTGAAGACAGTTCTGCAGCTTGCAGAGGAGACATG^aATTCTTG 
GCTCTTTTTCCAGTTGGAATTCATTGGTAATTGGGATGGTTTATC^ 

TTTATAGTTTGTGTGCTGGAGCTTGACCAGATTTGAACATAAAGAGTTAAATO

CAAGAATAGCTGGTAACCTTTATGGAGAGCTCCCTGGCAGAGGGGACAGCTTAAAGGAA^ 
AGCAGATCCTGAGGGCCAGCCTGTGGATGTGGAGTGTGGGAGGAAATCATTTACTCT 
ATCAACCTTTCAGAACAGGTTATTTTCAGGCAACTGGTTTTTGG^ 
AAGTCAGAAAGCAGCAATTTTGTTGCTCTATGCAAATAGGTTTTCTTAGGTT 

TArTTGACTTTTCACAAATACTGACCACATTGACATCTCCT

AC^CCTTTTGAGGTCTGTGGAGCTCAAAAATGAAGTCAAAGGTGACTGGGTGACACAA^ 
TTCTGCCACGCTCCTCCTCCTCAAGCCAGCATXK3CTCAGTTATCCACTCTGATGTCTA 
AACTTGTTCTCTATACTGTTACGTCTCCAGTGTCTGCATCAG 
AGTCTTAAAGGAACITCCTACTGCACTCTGAGGAGGAGGAGCTGTGCCTTCTCGA 

AGTG CTCAG AGGACGTGCTGATTAAGGG AAATAT AGGATCACTTCTTTTCCCATrrrC CAT ATGG AAAG AAGCTTTGAAAATG CAG

TATGTGCTCGAGTCCATATGCT1TGAAATCTGTGCATGATTCTTGCTITGCT 
ACCCAGGGCCAAACACACCTAAGGGTCCCAGCATGAGTTTCAATTTATAAAATAC^ 
AATGGCCTTGCATAAATTTTTCTAGAAAATGCTCITTAAAAAAATC 
TATCTGGAAGGTAATTTATCTGCAATTCACAGTTTTCTCATGTTGTCTGTTTTT 

CAGATTTCAAGATCATTGTAGTTTTTTCITTTTTTAATTTTAAGACA

CTAGAAGTGACATGGCTGGGTCCAAGGGTGTAGTTTTAAAATTTTTAATAC 
TTTTATACTCCCATAGTGCATGTAAGATGCAAAACACTATTCTAGATATT^ 
TCirCTrTCTTTCTTTCCCTITTTTTTTT^ 

ATCAT AGCTCACTGCAG CCTCT ACCTCCTGGGCTCAAATGAACCTCCTC CTTCGGCCTCCCAAAGTGCTGGG ATT ACAGGTGTG AG 

CCACCGCATTGCCTT ATTTT AC ATG CTT CTTCATTT AATCCTCACAAAG CCTGATG AGAACAGAG AAGG T CAGT AAC CTG C CCAAA

GTCACAGCTAAATGATAGAGCTGGGGGCGAAAGCAGGCAGTGCCGCAGAGCTC^ 

CTTCCATGGCAGCCGGCATGAGAACATGAGCAAAGAGAGGCTGTGATGAGACTCCTTCCTTCT 

GCCTCGTCTTTCACTCTGCTT(nTGATTCATCAGGTAGGAC^ 

TCTTTGTGCAGCGGT AT AATTTGGAAAG AAGCAATGGT CACTTTGG AGAGCCAG CCCCTTTGCTC AG CTCTCCTGTGCCCCTGTGC 
TGCTGTCGTTGCCCCrGTCACCTCTCCCAGCATCATTTtt

ATCACCTGAGGAGGGTCTCCAAATAGAGGTGTGAAGGTTTTACAGAGATTGTGTAGGGAT^ 
TTGCTGCTGCATTTGGTGTGAGAGATTGCAAAAAATGTGTATGTTTTTATCGGTGCAACCCT 
TCCTCTTTTAGGAAGAGTCCCAGACAGCTGCTAACTTATCTTCTTGTTCCATCCCT 
TTTCACTCTGAACAAAGACTGAGCAAGCAGAAAGTGTTGGGAAGGGCCC 

TGGGCCTGGGGGACTTC^TGTTCACTTTCCTGAa*GGAGGAGA<^

GGAAACCAGTTTCCTAGCATTTATGTTTCAGG^ 

AT ACCCCTG AAG ATG AAGCCCATTTTTGGATCCTCAGTTCTCATGATAATGG AGTT^ C 
TAAGCCAGAGCACCCTTTTAAAATGAGGATAGTAATGAGAA 

AGTTTAATGCTGAGTGCATCATGGGTGCCCAGTGAATAGTCZATTGCTCTCACAATCCCCA 

AGGAGAACAATTTTGTGAAGCAACTTTGGACTTTTC

TAAGTACCTTGGCCCTTATGGTAGGAAGGTCTCTGGGGCCCCITC^ 
ACTTTCATGAGGAAGTATGTGCCTCrrTGAACTGAATTTCTACCCAAGTACCA 
TCACGTTGCATAATATTTCCCATTTGATTCAAAATTACGTGATAAAAGAAATGGAGGAGGT 
TGTTTTTGCATTTGTGCCTCTGGGTATTTAGTCTTTrCGGTTTGCACC^ 
AGTTGCTTACGTACTGGTCTCAGTGGACGGTGTGGATGTC^
ATTTAACGTGATCCTTGGAAAGCTATTTTTATAATAACT

AT ATT ACAAATGCCAT (^T AAAGGAACCTTAJGGG AGACT ATTTT AAAGAAATTTTGAAT CT AAAT ATTT AAAGCTTTTCTTATTTT 
ATAAGTATTTTATGTATTTGATTGGTAAGGTAAAGCACATACGTATTGTTAGGTAA^ 
TGGCTACTTAAGAT^CAAATTTTATAAAAATATTTAGGAAAAGTA 
TTTTTTCATTTCAGAACTAAACAGTGCrATTATTCACT

TAAGGAAAATGATGCTGATTTTTCGTGTAGCGTGATAACAAT 

CCAAATCCCTTATTTGCAATCCAAAATGCTCTG^ 

ACATGAAGCTATTCATAGCTTTTATTTATCCCAATCAGTGTG^ 

GGGGTTCTGCCCrAATTACTTTCCGGG G 'ltJ IT'l 1U I I CCTrTGTGGT ATATGOT ATGTGTACCGCATTATCITGCTAACATCCACAC 
ACTTTTGAATTCTGAAGACTTTATCTGGCCCCAAGATTITGAA

CAGAGGCTTCTCACATCCTGGTGTTTCCAGGAACAGTCGCTCTT^ 
ATTCAGAAGTGCTCCATTTTGGCTTGGTGAC 

TCAGT ACATTTCTAGTGAATTC CCATG T ATAAAATAAACAAG ATCTACAT AAACAACATT C CAG ATTG T AT CTTG AAAACCT AAAG 

TCATTTAAAGCATTATTGGTAAGGTATCCCCGACTCCCTGTCTAGATGGCAT^ 
AGTTCATCCTTCTCCTCACTGTGATTGCCTGTGTCCCrGTCTTCT

TGCTGGGCGTATATAGCAAGTTCATTATTTAGTCAACTAATATTTATTGAGCGTTGGCCT 

GCATGGTGAACAAAACAGAATTGACTTTGGTCCTCATGGAGATTCrrTGTCT 

GGTTTGCTCTCAAGTGACTTTCAAGCCAAATCT^ 

TGACAGACGCAGAGACTTGGGAI^ACCCCTCGCTGGAGGCTTGT^ 
GTTTGTGCTTTTGGAAGGGTGTTCTGGCTGTAGGGCAGAGCTG^

GGTCCAGGCTTTGTGGAAGGAAGGATCGGTTCITCACCCCTGGTTTCGGT^ 

CTGTGTGCTCTTCTCCGGAGTCCGAGGCCAGGGGAGTAACTGCAGTGTT 

GG CTTC CGTCTTG CLATACGTCG AGG AAACGTTGT AAAC TGGGT CTT C C CTG AG CC 

AAGTGCTTGAAAAGCTGTGGGAGTGGGAACTGTTGTCATGATCGAAAATG 

aaccctgaagtgtcctgcagagaact<x;tggtggaggggcctctggaggacaaacatct

gcgatggcacttctggcggtttttcttccaccttaactttc 
taaaacataaataacttctattgaattagcac^gaagtttggctgga 

AAATACGCACGAGAGCTCTTTGGCCGTCITATTTCrCCT 
CATCTCCCCCAGGAAATGAAGCTTAGAGCCTCCTTGTTCAGCTAATTGTTG 

AGTGATTGATGCATGTGCTTATTAATACCATCCTGGTGCATGGAGGACGAATAAACC^

GGGGTTTCCTAGAGTTTGTTGTATTTGCAGAAAGGCTGTCGTCGTCTT^ 
TCTTTCTGGCCAAGTCGTTTTCTTTTGAGCTAATTTCAG 

CTTTGGAG AGT AACTTG C CAATACTGAT AT CC CATCACTCCTG AG T ACTTG AATGTGT CTTTGCTGT AACG AAGATAGTCT CCATT 
GAGACCCTGGCACGGCCATCAGAATCAC^AAACTAGCACCGC^ 

GTCTC^GTCATGTCCCTTGCAGCAGGAGAATCTGATGCAGGGCTAGG^

ACTTCCATGAACTTGACACTTTTGACAATGACAA 

TATGCATTTCTTACAAGAATATCACGGCGGTGAGACT^ 

TAATTCGTAACGTTCATTTC^TCTTTTAATTAAGGAGCCAT^ 

T AAAAAAT ATTTTG TGGG AGG T ATTTTTG AGG CT ATGT AAGT AT CC CATTTTTCAC CAGC CTTTT ACTCACTGGTT AG CATCCATT 

GGTATTTCTrGGCTGAGTACTACTTTGATAGCTACCAAATGTCGATC^

TTCTACATTATGGGAAAGCTTTTTCATTCCCCTATTTATTTATTCGTAGATTC 

CATGTATTTGGGTGCTCAGAT^TGTCTTAGATGTGGCCAGTGGGAGCTCCATCAGGTT 

CTATTATCTTTTGAGCCCTTCCTTATTTCCTTTCCCAGGCT^ 

CAAGGAG CC CTG ATT C CTTTT GG TGGAGGATGGTGTTTGGAAAC CAAGATTTGGG CACTG CCTG TG CTCATTG CTGTTGGGGTCTT 

GCTGTTCCCAGGCCCTTTTAGTGTCAAGGTTAGAGAATATGTGTATTAGTTGCATATATACT^

TTTTTTTGAGACGGAGTCrTGCTCTGTCACCCAGGCTGGAGT^ 
TCAAGCAATTCTCCCTGCCTCAGCCTCCCGAGTAGCKKMACTACAGGT 
TAGTAGAGACGGGGTTTCACCGTGTTAGCCGGGGTGGTCTCGAT 
TACCTCTCTATTTATTTCTGTGTTTATCGAGATATATC 

GGGTTATTGTAGTTTTCTTTCTTTCTGTATCTGTAACTTCCTTCTC

TATAATTTACTCAGTTGCCTGTGTGTATGTGGTCTTATCTCCCCACGGTTGC 

CCAGGTGGCTGCTGCCCCAGCCACCCTAACAGGTGCTCTCTTTCTGCCAATACCTACCCTGGCT 



GTTTTGAAGGCCAAAAGTTGGCAATTAAGGTATGGGCAGGGTTGGT^

TCTCCTACCTTCTGGTGGCTGCTAGCAATCCTTGGCTTTCTT^ 

CAACATTCTCCTTTCTTCTATGTGTCTCTGTGTCCAAGTTTCCCTOT 

TCTAATCTGGTATGACCTCGTTTAACTTCATTATA^ 

ACACGAATGTTGGGGGGC CACTGTTCAACCCAGTACATCTTCTTTGCTG CGTTGCATC!AAATGGGCITTTGGAGTAAATCATTCAT 

AAAGGAAGGGTGGAGGAAGGAAGGGAAGAAGGAAACTCAGTTGTTTTGAGAAG

CT T C TTT TTTTTTTTTr TG TGATGXSA^ 

G CCTCCAGGGTTCAAGCG ATTCTCCTG ACTCAGCCTCTTGAGTAG CTX5AGATTACAGGCACACACCACCATGCCCAGCT AATTTAT 

ATAAATTTTAGTAGAGATGGGCTTTCACCATGTTGACCAGGCTGGTGTTGATCTCITGAGCT 

CCCAAAGTGCTGGGATTACAGGTCTGAGCCACTGTGCCTGGCCT 

CATCAGGA.CAGTCTTGGAAAACTCAGGACTTAAGAGTGAGCA.CAGGAT

CGTTCIACCTTCTGTTCTGTGCCAACCCGGGGTTGGGGCACACCC^ 
TGGTGCATGTTGTCAGCCCGGCGCATGCTCTGAGTTCTGTAAGGGTGAGTGAAC^ 

GAAGAGAGCAGCGTGTCAGCACGTGGACTCAGCTGGGGACTCATGGGGAAGC^
TGGTCCTTGATTGCCCrTGATGGTTAAAACCACCTGCTCCCC^ 

ATTCTAGACAGTAGTTCrCCCTTATCTGTGGTTTTGCTTTTAGTC

ATGGGAAATTCCAGAAATAAACAATTCATAAGTTTTAAATTGTGCACTGTTGCAA 

TC CTG CCCAGG ACAT AAAT CAT CC CITTGTTG AG ATG AGGG AATCTGCTGT AG TC CCAAT ACT CT ATT ATTTGACTCT ATT ACATT 
TTCTTTAAACATCTTCCC^CTGTCTGGTATT^^ 
GTGTCTGCAGAAACTATTATTTGTTAAATATCAACTATTGA^ 
TT ACCG CAAACCATG CGG AAGTTA7CCCATT TTACa